Palm: Scaling language modeling with pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
arXiv preprint arXiv:2204.02311, 2022
2112 2022 Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
302 2023 Unifying language learning paradigms Y Tay, M Dehghani, VQ Tran, X Garcia, D Bahri, T Schuster, HS Zheng, ...
arXiv preprint arXiv:2205.05131, 2022
96 2022 Scaling Up Models and Data with and A Roberts, HW Chung, A Levskaya, G Mishra, J Bradbury, D Andor, ...
arXiv preprint arXiv:2203.17189, 2022
82 2022 Ul2: Unifying language learning paradigms Y Tay, M Dehghani, VQ Tran, X Garcia, J Wei, X Wang, HW Chung, ...
The Eleventh International Conference on Learning Representations, 2022
71 2022 Scaling laws for neural machine translation B Ghorbani, O Firat, M Freitag, A Bapna, M Krikun, X Garcia, C Chelba, ...
arXiv preprint arXiv:2109.07740, 2021
53 2021 Palm: Scaling language modeling with pathways. arXiv 2022 A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
arXiv preprint arXiv:2204.02311, 0
46 Building machine translation systems for the next thousand languages A Bapna, I Caswell, J Kreutzer, O Firat, D van Esch, A Siddhant, M Niu, ...
arXiv preprint arXiv:2205.03983, 2022
39 2022 A multilingual view of unsupervised machine translation X Garcia, P Foret, T Sellam, AP Parikh
arXiv preprint arXiv:2002.02955, 2020
35 2020 Transcending scaling laws with 0.1% extra compute Y Tay, J Wei, HW Chung, VQ Tran, DR So, S Shakeri, X Garcia, HS Zheng, ...
arXiv preprint arXiv:2210.11399, 2022
32 2022 Towards continual learning for multilingual machine translation via vocabulary substitution X Garcia, N Constant, AP Parikh, O Firat
arXiv preprint arXiv:2103.06799, 2021
29 2021 Harnessing multilinguality in unsupervised machine translation for rare languages X Garcia, A Siddhant, O Firat, AP Parikh
arXiv preprint arXiv:2009.11201, 2020
28 2020 Towards the next 1000 languages in multilingual machine translation: Exploring the synergy between supervised and self-supervised learning A Siddhant, A Bapna, O Firat, Y Cao, MX Chen, I Caswell, X Garcia
arXiv preprint arXiv:2201.03110, 2022
24 2022 On rationally ergodic and rationally weakly mixing rank-one transformations I Dai, X Garcia, T Pădurariu, CE Silva
Ergodic Theory and Dynamical Systems 35 (4), 1141-1164, 2015
19 2015 The unreasonable effectiveness of few-shot learning for machine translation X Garcia, Y Bansal, C Cherry, G Foster, M Krikun, M Johnson, O Firat
International Conference on Machine Learning, 10867-10878, 2023
18 2023 Few-shot controllable style transfer for low-resource multilingual settings K Krishna, D Nathani, X Garcia, B Samanta, P Talukdar
arXiv preprint arXiv:2110.07385, 2021
15 2021 PaLM: Scaling Language Modeling with Pathways (2022), doi: 10.48550 A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
arXiv preprint arXiv.2204.02311, 0
13 Using natural language prompts for machine translation X Garcia, O Firat
arXiv preprint arXiv:2202.11822, 2022
12 2022 PaLM: Scaling Language Modeling with Pathways (No. arXiv: 2204.02311). arXiv A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
10 2022 Unimax: Fairer and more effective language sampling for large-scale multilingual pretraining HW Chung, N Constant, X Garcia, A Roberts, Y Tay, S Narang, O Firat
arXiv preprint arXiv:2304.09151, 2023
8 2023