Brian Yan

Cited by

	All	Since 2019
Citations	414	414
h-index	12	12
i10-index	16	16

240

120

180

20212022202320249 65 237 102

Public access

View all

5 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Siddharth DalmiaResearch Scientist, Google DeepMindVerified email at google.com
Siddhant AroraGraduate Student, Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Yifan PengCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Jiatong Shi (史嘉彤)Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Xuankai ChangCarnegie Mellon University, StudentVerified email at andrew.cmu.edu
Dan BerrebbiApple - Carnegie Mellon University - Ecole PolytechniqueVerified email at andrew.cmu.edu
Alan W BlackProfessor, Language Technologies Institute, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Soumi MaitiCarnegie Mellon UniversityVerified email at andrew.cmu.edu
William ChenCarnegie Mellon UniversityVerified email at cmu.edu
Florian MetzeCarnegie Mellon University; Meta AIVerified email at andrew.cmu.edu
Jinchuan TianLanguage Technologies Institute, Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Hirofumi InagumaFundamental AI Research (FAIR) at MetaVerified email at meta.com
Yosuke HiguchiWaseda UniversityVerified email at pcl.cs.waseda.ac.jp
Graham NeubigCarnegie Mellon UniversityVerified email at cs.cmu.edu
Yuekai ZhangNvidiaVerified email at nvidia.com
Pengcheng GuoNorthwestern Polytechnical UniversityVerified email at nwpu-aslp.org
Chunlei ZhangTencent AI Lab, Bellevue.Verified email at global.tencent.com
Patrick FernandesCarnegie Mellon University & Instituto Superior TécnicoVerified email at cs.cmu.edu
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com

Brian Yan

Carnegie Mellon University

Verified email at cs.cmu.edu - Homepage

Speech Recognition Speech Translation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ... ICASSP 2022, 2022	66	2022
Searchable hidden intermediates for end-to-end models of decomposable sequence tasks S Dalmia, B Yan, V Raunak, F Metze, S Watanabe NAACL 2021, 2021	30	2021
CTC Alignments Improve Autoregressive Translation B Yan, S Dalmia, Y Higuchi, G Neubig, F Metze, AW Black, S Watanabe EACL 2023, 2022	25	2022
Improving massively multilingual ASR with auxiliary CTC objectives W Chen, B Yan, J Shi, Y Peng, S Maiti, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	22	2023
BERT meets CTC: New formulation of end-to-end speech recognition with pre-trained masked language model Y Higuchi, B Yan, S Arora, T Ogawa, T Kobayashi, S Watanabe EMNLP 2022, 2022	20	2022
ESPnet-ST IWSLT 2021 Offline Speech Translation System H Inaguma, B Yan, S Dalmia, P Gu, J Shi, K Duh, S Watanabe IWSLT 2021, 2021	19	2021
Prompting the hidden talent of web-scale speech models for zero-shot task generalization P Peng, B Yan, S Watanabe, D Harwath arXiv preprint arXiv:2305.11095, 2023	18	2023
Combining spectral and self-supervised features for low resource speech recognition and translation D Berrebbi, J Shi, B Yan, O López-Francisco, JD Amith, S Watanabe arXiv preprint arXiv:2204.02470, 2022	18	2022
Exploration of efficient end-to-end asr using discretized input from self-supervised learning X Chang, B Yan, Y Fujita, T Maekaku, S Watanabe arXiv preprint arXiv:2305.18108, 2023	17	2023
ESPnet-SE++: Speech enhancement for robust speech recognition, translation, and understanding YJ Lu, X Chang, C Li, W Zhang, S Cornell, Z Ni, Y Masuyama, B Yan, ... arXiv preprint arXiv:2207.09514, 2022	17	2022
Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization B Yan, C Zhang, M Yu, SX Zhang, S Dalmia, D Berrebbi, C Weng, ... ICASSP 2022, 2022	17	2022
Two-pass low latency end-to-end spoken language understanding S Arora, S Dalmia, X Chang, B Yan, A Black, S Watanabe arXiv preprint arXiv:2207.06670, 2022	14	2022
Reproducing whisper-style training using an open-source toolkit and publicly available data Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	11	2023
CMU’s IWSLT 2022 dialect speech translation system B Yan, P Fernandes, S Dalmia, J Shi, Y Peng, D Berrebbi, X Wang, ... Proceedings of the 19th International Conference on Spoken Language …, 2022	11	2022
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates H Inaguma, S Dalmia, B Yan, S Watanabe ASRU 2021, 2021	11	2021
Differentiable Allophone Graphs for Language-Universal Speech Recognition B Yan, S Dalmia, DR Mortensen, F Metze, S Watanabe INTERSPEECH 2021, 2021	10	2021
Highland Puebla Nahuatl speech translation corpus for endangered language documentation J Shi, JD Amith, X Chang, S Dalmia, B Yan, S Watanabe Proceedings of the First Workshop on Natural Language Processing for …, 2021	9	2021
Towards zero-shot code-switched speech recognition B Yan, M Wiesner, O Klejch, P Jyothi, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	8	2023
A comparative study on E-branchformer vs conformer in speech recognition, translation, and understanding tasks Y Peng, K Kim, F Wu, B Yan, S Arora, W Chen, J Tang, S Shon, P Sridhar, ... arXiv preprint arXiv:2305.11073, 2023	8	2023
Token-level sequence labeling for spoken language understanding using compositional end-to-end models S Arora, S Dalmia, B Yan, F Metze, AW Black, S Watanabe arXiv preprint arXiv:2210.15734, 2022	8	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors