Follow
Jay Mahadeokar
Jay Mahadeokar
Facebook AI
Verified email at fb.com
Title
Cited by
Cited by
Year
Transformer-based acoustic modeling for hybrid speech recognition
Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2642020
The llama 3 herd of models
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
arXiv preprint arXiv:2407.21783, 2024
2202024
Torchaudio: Building blocks for audio and speech processing
YY Yang, M Hira, Z Ni, A Astafurov, C Chen, C Puhrsch, D Pollack, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
1912022
Transformer-transducer: End-to-end speech recognition with self-attention
CF Yeh, J Mahadeokar, K Kalgaonkar, Y Wang, D Le, M Jain, K Schubert, ...
arXiv preprint arXiv:1910.12977, 2019
1722019
Voicebox: Text-guided multilingual universal speech generation at scale
M Le, A Vyas, B Shi, B Karrer, L Sari, R Moritz, M Williamson, V Manohar, ...
Advances in neural information processing systems 36, 2024
1682024
Contextual RNN-T for open domain ASR
M Jain, G Keren, J Mahadeokar, G Zweig, F Metze, Y Saraf
arXiv preprint arXiv:2006.03411, 2020
972020
Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion
D Le, M Jain, G Keren, S Kim, Y Shi, J Mahadeokar, J Chan, ...
arXiv preprint arXiv:2104.02194, 2021
822021
Deep shallow fusion for RNN-T personalization
D Le, G Keren, J Chan, J Mahadeokar, C Fuegen, ML Seltzer
2021 IEEE Spoken Language Technology Workshop (SLT), 251-257, 2021
802021
Prompting large language models with speech recognition abilities
Y Fathullah, C Wu, E Lakomkin, J Jia, Y Shangguan, K Li, J Guo, W Xiong, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
792024
Alignment restricted streaming recurrent neural network transducer
J Mahadeokar, Y Shangguan, D Le, G Keren, H Su, T Le, CF Yeh, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 52-59, 2021
692021
RNN-T for latency controlled ASR with improved beam search
M Jain, K Schubert, J Mahadeokar, CF Yeh, K Kalgaonkar, A Sriram, ...
arXiv preprint arXiv:1911.01629, 2019
442019
Improved neural language model fusion for streaming recurrent neural network transducer
S Kim, Y Shangguan, J Mahadeokar, A Bruguier, C Fuegen, ML Seltzer, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
262021
Dissecting user-perceived latency of on-device E2E speech recognition
Y Shangguan, R Prabhavalkar, H Su, J Mahadeokar, Y Shi, J Zhou, C Wu, ...
arXiv preprint arXiv:2104.02207, 2021
252021
Computerized system and method for automatically identifying and providing digital content based on physical geographic location data
V Mahadevan, SS Farfade, JK Mahadeokar, A Arasu, VKR Barakam, ...
US Patent 11,194,856, 2021
172021
Dynamic encoder transducer: A flexible solution for trading off accuracy for latency
Y Shi, V Nagaraja, C Wu, J Mahadeokar, D Le, R Prabhavalkar, A Xiao, ...
arXiv preprint arXiv:2104.02176, 2021
162021
Streaming transformer transducer based speech recognition using non-causal convolution
Y Shi, C Wu, D Wang, A Xiao, J Mahadeokar, X Zhang, C Liu, K Li, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
132022
Streaming parallel transducer beam search with fast-slow cascaded encoders
J Mahadeokar, Y Shi, K Li, D Le, J Zhu, V Chandra, O Kalinli, ML Seltzer
arXiv preprint arXiv:2203.15773, 2022
132022
Memory-efficient speech recognition on smart devices
G Venkatesh, A Valliappan, J Mahadeokar, Y Shangguan, C Fuegen, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
132021
Procter: Pronunciation-aware contextual adapter for personalized speech recognition in neural transducers
R Pandey, R Ren, Q Luo, J Liu, A Rastrow, A Gandhe, D Filimonov, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
122023
Federated domain adaptation for asr with full self-supervision
J Jia, J Mahadeokar, W Zheng, Y Shangguan, O Kalinli, F Seide
arXiv preprint arXiv:2203.15966, 2022
122022
The system can't perform the operation now. Try again later.
Articles 1–20