BLOOM: A 176B-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1748 | 2023 |
Large-scale contrastive language-audio pretraining with feature fusion and keyword-to-caption augmentation Y Wu, K Chen, T Zhang, Y Hui, T Berg-Kirkpatrick, S Dubnov ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 520 | 2023 |
MusicLDM: Enhancing novelty in text-to-music generation using beat-synchronous mixup strategies K Chen, Y Wu, H Liu, M Nezhurina, T Berg-Kirkpatrick, S Dubnov ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 69 | 2024 |
BigBIO: A framework for data-centric biomedical natural language processing J Fries, L Weber, N Seelam, G Altay, D Datta, S Garda, S Kang, R Su, ... Advances in Neural Information Processing Systems 35, 25792-25806, 2022 | 50 | 2022 |
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models M Nezhurina, L Cipolina-Kun, M Cherti, J Jitsev arXiv preprint arXiv:2406.02061, 2024 | 42 | 2024 |
DataComp-LM: In search of the next generation of training sets for language models J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, S Gadre, H Bansal, E Guha, ... arXiv preprint arXiv:2406.11794, 2024 | 41 | 2024 |
Alaaeldin El-Nouby, Hadi Pouransari, Alexander Toshev, Stephanie Wang, Dirk Groeneveld, Luca Soldaini, Pang Wei Koh, Jenia Jitsev, Thomas Kollar, Alexandros G Dimakis, Yair Carmon, Achal Dave, Ludwig Schmidt, and Vaishaal Shankar J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, S Gadre, H Bansal, E Guha, ... …, 2024 | 29 | 2024 |
Language models scale reliably with over-training and on downstream tasks SY Gadre, G Smyrnis, V Shankar, S Gururangan, M Wortsman, R Shao, ... arXiv preprint arXiv:2403.08540, 2024 | 24 | 2024 |
Language models scale reliably with over-training and on downstream tasks S Yitzhak Gadre, G Smyrnis, V Shankar, S Gururangan, M Wortsman, ... arXiv e-prints, arXiv: 2403.08540, 2024 | | 2024 |
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model V Danchev, V Nikoulina, V Laippala, V Lepercq, V Prabhu, Z Alyafeai, ... | | 2023 |
Alice in Wonderland: Simple Tasks Reveal Severe Generalization and Basic Reasoning Deficits in State-Of-the-Art Large Language Models M Nezhurina, L Cipolina-Kun, M Cherti, J Jitsev NeurIPS 2024 Workshop on Scientific Methods for Understanding Deep Learning | | |
Composing and Validating Large-Scale Datasets for Training Open Foundation Models for Audio M Nezhurina, K Chen, Y Wu, T Zhang, Y Hui, H Liu, T Berg-Kirkpatrick, ... | | |