TorchAudio: Building blocks for audio and speech processing YY Yang, M Hira, Z Ni, A Astafurov, C Chen, C Puhrsch, D Pollack, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 183 | 2022 |
Transformer-transducer: End-to-end speech recognition with self-attention CF Yeh, J Mahadeokar, K Kalgaonkar, Y Wang, D Le, M Jain, K Schubert, ... arXiv preprint arXiv:1910.12977, 2019 | 170 | 2019 |
Emformer: Efficient memory transformer based acoustic model for low latency streaming speech recognition Y Shi, Y Wang, C Wu, CF Yeh, J Chan, F Zhang, D Le, M Seltzer ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 168 | 2021 |
Domain Adversarial Training for Accented Speech Recognition S Sun, CF Yeh, MY Hwang, M Ostendorf, L Xie Acoustics, Speech and Signal Processing (ICASSP), 2018 IEEE International …, 2018 | 139 | 2018 |
Training Augmentation with Adversarial Examples for Robust Speech Recognition S Sun, CF Yeh, M Ostendorf, MY Hwang, L Xie INTERSPEECH, 2018 | 77 | 2018 |
Alignment restricted streaming recurrent neural network transducer J Mahadeokar, Y Shangguan, D Le, G Keren, H Su, T Le, CF Yeh, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 52-59, 2021 | 68 | 2021 |
Streaming transformer-based acoustic models using self-attention with augmented memory C Wu, Y Wang, Y Shi, CF Yeh, F Zhang arXiv preprint arXiv:2005.08042, 2020 | 68 | 2020 |
RNN-T for latency controlled ASR with improved beam search M Jain, K Schubert, J Mahadeokar, CF Yeh, K Kalgaonkar, A Sriram, ... arXiv preprint arXiv:1911.01629, 2019 | 44 | 2019 |
Spoken Lecture Summarization by Random Walk over a Graph Constructed with Automatically Extracted Key Terms. YN Chen, Y Huang, CF Yeh, LS Lee Interspeech, 933-936, 2011 | 39 | 2011 |
An integrated framework for transcribing Mandarin-English code-mixed lectures with improved acoustic and language modeling CF Yeh, CY Huang, LC Sun, LS Lee 2010 7th International Symposium on Chinese Spoken Language Processing, 214-219, 2010 | 39 | 2010 |
AIPNet: Generative adversarial pre-training of accent-invariant networks for end-to-end speech recognition YC Chen, Z Yang, CF Yeh, M Jain, ML Seltzer ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 36 | 2020 |
SUPERB @ SLT 2022: Challenge on generalization and efficiency of self-supervised speech representation learning T Feng, A Dong, CF Yeh, S Yang, TQ Lin, J Shi, KW Chang, Z Huang, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 1096-1103, 2023 | 32 | 2023 |
Weak-attention suppression for transformer based speech recognition Y Shi, Y Wang, C Wu, C Fuegen, F Zhang, D Le, CF Yeh, ML Seltzer arXiv preprint arXiv:2005.09137, 2020 | 30 | 2020 |
Spoken knowledge organization by semantic structuring and a prototype course lecture system for personalized learning H Lee, SR Shiang, C Yeh, YN Chen, Y Huang, SY Kong, L Lee IEEE/ACM Transactions on Audio, Speech, and Language Processing 22 (5), 883-898, 2014 | 28 | 2014 |
Semantic distance: A new metric for ASR performance analysis towards spoken language understanding S Kim, A Arora, D Le, CF Yeh, C Fuegen, O Kalinli, ML Seltzer arXiv preprint arXiv:2104.02138, 2021 | 25 | 2021 |
Improved spoken term detection by feature space pseudo-relevance feedback C Chen, H Lee, C Yeh, L Lee Eleventh Annual Conference of the International Speech Communication Association, 2010 | 23 | 2010 |
Benchmarking LF-MMI, CTC and RNN-T criteria for streaming ASR X Zhang, F Zhang, C Liu, K Schubert, J Chan, P Prakash, J Liu, CF Yeh, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 46-51, 2021 | 22 | 2021 |
Bilingual acoustic modeling with state mapping and three-stage adaptation for transcribing unbalanced code-mixed lectures CF Yeh, LC Sun, CY Huang, LS Lee 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 20 | 2011 |
Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications Y Wang, Y Shi, F Zhang, C Wu, J Chan, CF Yeh, A Xiao ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 19 | 2021 |
An improved framework for recognizing highly imbalanced bilingual code-switched lectures with cross-language acoustic modeling and frame-level language identification CF Yeh, LS Lee IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (7), 1144 …, 2015 | 18 | 2015 |