Recent developments on ESPnet toolkit boosted by Conformer P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 299 | 2021 |
ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 785-792, 2021 | 90 | 2021 |
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ... 2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021 | 56 | 2021 |
Deep audio-visual speech separation with attention mechanism C Li, Y Qian ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 31 | 2020 |
Listen, Watch and Understand at the Cocktail Party: Audio-Visual-Contextual Speech Separation C Li, Y Qian Proc. Interspeech 2020, 1426-1430, 2020 | 29 | 2020 |
ESPnet-SE++: Speech enhancement for robust speech recognition, translation, and understanding YJ Lu, X Chang, C Li, W Zhang, S Cornell, Z Ni, Y Masuyama, B Yan, ... arXiv preprint arXiv:2207.09514, 2022 | 28 | 2022 |
Towards low-distortion multi-channel speech enhancement: The ESPNet-SE submission to the L3DAS22 challenge YJ Lu, S Cornell, X Chang, W Zhang, C Li, Z Ni, ZQ Wang, S Watanabe ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 28 | 2022 |
Dual-path RNN for long recording speech separation C Li, Y Luo, C Han, J Li, T Yoshioka, T Zhou, M Delcroix, K Kinoshita, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 865-872, 2021 | 26 | 2021 |
Closing the gap between time-domain multi-channel speech enhancement on real and simulation conditions W Zhang, J Shi, C Li, S Watanabe, Y Qian 2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021 | 25 | 2021 |
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation C Li, L Yang, W Wang, Y Qian ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 23 | 2022 |
Continuous Speech Separation Using Speaker Inventory for Long Recording C Han, Y Luo, C Li, T Zhou, K Kinoshita, S Watanabe, M Delcroix, ... Interspeech, 3036-3040, 2021 | 14 | 2021 |
Time-domain audio-visual speech separation on low quality videos Y Wu, C Li, J Bai, Z Wu, Y Qian ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 13 | 2022 |
Target sound extraction with variable cross-modality clues C Li, Y Qian, Z Chen, D Wang, T Yoshioka, S Liu, Y Qian, M Zeng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 11 | 2023 |
Dual-Path Modeling for Long Recording Speech Separation in Meetings C Li, Z Chen, Y Luo, C Han, T Zhou, K Kinoshita, M Delcroix, S Watanabe, ... ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 11 | 2021 |
Rethinking the separation layers in speech separation networks Y Luo, Z Chen, C Han, C Li, T Zhou, N Mesgarani ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 11 | 2021 |
Audio-Visual Multi-Talker Speech Recognition in a Cocktail Party Y Wu, C Li, S Yang, Z Wu, Y Qian Proc. Interspeech 2021, 3021-3025, 2021 | 11 | 2021 |
Prosody Usage Optimization for Children Speech Recognition with Zero Resource Children Speech C Li, Y Qian Proc. Interspeech 2019, 3446-3450, 2019 | 11 | 2019 |
Adapting multi-lingual ASR models for handling multiple talkers C Li, Y Qian, Z Chen, N Kanda, D Wang, T Yoshioka, Y Qian, M Zeng arXiv preprint arXiv:2305.18747, 2023 | 10 | 2023 |
Continuous speech separation using speaker inventory for long multi-talker recording C Han, Y Luo, C Li, T Zhou, K Kinoshita, S Watanabe, M Delcroix, ... arXiv preprint arXiv:2012.09727, 2020 | 10 | 2020 |
Dual-path modeling with memory embedding model for continuous speech separation C Li, Z Chen, Y Qian IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1508-1520, 2022 | 8 | 2022 |