Yonatan Belinkov
Title
Cited by
Year
BLOOM: A 176B-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
Cited by: 1565, Year: 2023
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
Cited by: 1090, Year: 2022
Locating and editing factual associations in GPT
K Meng, D Bau, A Andonian, Y Belinkov
Advances in Neural Information Processing Systems 35, 17359-17372, 2022
Cited by: 901, Year: 2022
Synthetic and natural noise both break neural machine translation
Y Belinkov, Y Bisk
arXiv preprint arXiv:1711.02173, 2017
Cited by: 835, Year: 2017
Linguistic knowledge and transferability of contextual representations
NF Liu, M Gardner, Y Belinkov, ME Peters, NA Smith
arXiv preprint arXiv:1903.08855, 2019
Cited by: 818, Year: 2019
Fine-grained analysis of sentence embeddings using auxiliary prediction tasks
Y Adi, E Kermany, Y Belinkov, O Lavi, Y Goldberg
arXiv preprint arXiv:1608.04207, 2016
Cited by: 635, Year: 2016
Analysis methods in neural language processing: A survey
Y Belinkov, J Glass
Transactions of the Association for Computational Linguistics 7, 49-72, 2019
Cited by: 596, Year: 2019
What do neural machine translation models learn about morphology?
Y Belinkov, N Durrani, F Dalvi, H Sajjad, J Glass
arXiv preprint arXiv:1704.03471, 2017
Cited by: 455, Year: 2017
Probing classifiers: Promises, shortcomings, and advances
Y Belinkov
Computational Linguistics 48 (1), 207-219, 2022
Cited by: 383, Year: 2022
Investigating gender bias in language models using causal mediation analysis
J Vig, S Gehrmann, Y Belinkov, S Qian, D Nevo, Y Singer, S Shieber
Advances in Neural Information Processing Systems 33, 12388-12401, 2020
Cited by: 379, Year: 2020
Analyzing the structure of attention in a transformer language model
J Vig, Y Belinkov
arXiv preprint arXiv:1906.04284, 2019
Cited by: 371, Year: 2019
Mass-editing memory in a transformer
K Meng, AS Sharma, A Andonian, Y Belinkov, D Bau
arXiv preprint arXiv:2210.07229, 2022
Cited by: 368, Year: 2022
Identifying and controlling important neurons in neural machine translation
A Bau, Y Belinkov, H Sajjad, N Durrani, F Dalvi, J Glass
International Conference on Learning Representations, 2019
Cited by: 202, Year: 2019
What is one grain of sand in the desert? Analyzing individual neurons in deep NLP models
F Dalvi, N Durrani, H Sajjad, Y Belinkov, A Bau, J Glass
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 6309-6317, 2019
Cited by: 194, Year: 2019
Evaluating layers of representation in neural machine translation on part-of-speech and semantic tagging tasks
Y Belinkov, L Màrquez, H Sajjad, N Durrani, F Dalvi, J Glass
arXiv preprint arXiv:1801.07772, 2018
Cited by: 181*, Year: 2018
A constructive prediction of the generalization error across scales
JS Rosenfeld, A Rosenfeld, Y Belinkov, N Shavit
arXiv preprint arXiv:1909.12673, 2019
Cited by: 179, Year: 2019
End-to-end bias mitigation by modelling biases in corpora
RK Mahabadi, Y Belinkov, J Henderson
arXiv preprint arXiv:1909.06321, 2019
Cited by: 168, Year: 2019
Probing the probing paradigm: Does probing accuracy entail task relevance?
A Ravichander, Y Belinkov, E Hovy
arXiv preprint arXiv:2005.00719, 2020
Cited by: 118, Year: 2020
Jamba: A hybrid Transformer-Mamba language model
O Lieber, B Lenz, H Bata, G Cohen, J Osin, I Dalmedigos, E Safahi, ...
arXiv preprint arXiv:2403.19887, 2024
Cited by: 111, Year: 2024
Analyzing individual neurons in pre-trained language models
N Durrani, H Sajjad, F Dalvi, Y Belinkov
arXiv preprint arXiv:2010.02695, 2020
Cited by: 100, Year: 2020