Wenet: Production oriented streaming and non-streaming end-to-end speech recognition toolkit Z Yao, D Wu, X Wang, B Zhang, F Yu, C Yang, Z Peng, X Chen, L Xie, ... arXiv preprint arXiv:2102.01547, 2021 | 262* | 2021 |
An unsupervised deep domain adaptation approach for robust speech recognition S Sun, B Zhang, L Xie, Y Zhang Neurocomputing 257, 79-87, 2017 | 181 | 2017 |
Wenetspeech: A 10000+ hours multi-domain mandarin corpus for speech recognition B Zhang, H Lv, P Guo, Q Shao, C Yang, L Xie, X Xu, H Bu, X Chen, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 117 | 2022 |
Wenet 2.0: More productive end-to-end speech recognition toolkit B Zhang, D Wu, Z Peng, X Song, Z Yao, H Lv, L Xie, C Yang, F Pan, J Niu arXiv preprint arXiv:2203.15455, 2022 | 86* | 2022 |
Wespeaker: A research and production oriented speaker embedding learning toolkit H Wang, C Liang, S Wang, Z Chen, B Zhang, X Xiang, Y Deng, Y Qian ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 37 | 2023 |
Fast-u2++: Fast and accurate end-to-end speech recognition in joint ctc/attention frames C Liang, XL Zhang, BB Zhang, D Wu, S Li, X Song, Z Peng, F Pan ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 5 | 2023 |
Trimtail: Low-latency streaming asr with simple but effective spectrogram-level length penalty X Song, D Wu, Z Wu, B Zhang, Y Zhang, Z Peng, W Li, F Pan, C Zhu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 5 | 2023 |
Zeroprompt: Streaming acoustic encoders are zero-shot masked lms X Song, D Wu, B Zhang, Z Peng, B Dang, F Pan, Z Wu arXiv preprint arXiv:2305.10649, 2023 | 5 | 2023 |
Empirical evaluation of parallel training algorithms on acoustic modeling W Li, B Zhang, L Xie, D Yu arXiv preprint arXiv:1703.05880, 2017 | 5 | 2017 |
Lightgrad: Lightweight diffusion probabilistic model for text-to-speech J Chen, X Song, Z Peng, B Zhang, F Pan, Z Wu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 4 | 2023 |
WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit J Wang, M Xu, J Hou, B Zhang, XL Zhang, L Xie, F Pan ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 4 | 2023 |
Branch-ECAPA-TDNN: A parallel branch architecture to capture local and global features for speaker verification J Yao, C Liang, Z Peng, B Zhang, XL Zhang Proc. Interspeech, 1943-1947, 2023 | 3 | 2023 |
The iscslp 2022 intelligent cockpit speech recognition challenge (icsrc): Dataset, tracks, baseline and results A Zhang, F Yu, K Huang, L Xie, L Wang, ES Chng, H Bu, B Zhang, ... 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 3 | 2022 |
U2++ MoE: Scaling 4.7 x parameters with minimal impact on RTF X Song, D Wu, B Zhang, D Zhou, Z Peng, B Dang, F Pan, C Yang arXiv preprint arXiv:2404.16407, 2024 | | 2024 |
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge H Wang, P Guo, Y Li, A Zhang, J Sun, L Xie, W Chen, P Zhou, H Bu, X Xu, ... arXiv preprint arXiv:2401.03473, 2024 | | 2024 |
Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition K Huang, A Zhang, B Zhang, T Xu, X Song, L Xie 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | | 2023 |
The GUA-Speech System Description for CNVSRC Challenge 2023 S Li, C Lei, B Ma, B Zhang, F Pan arXiv preprint arXiv:2312.07254, 2023 | | 2023 |
FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition X Song, D Wu, B Zhang, Z Wu, W Li, D Li, P Zhang, Z Peng, F Pan, C Zhu, ... arXiv preprint arXiv:2210.17079, 2022 | | 2022 |
Advancing Speaker Embedding Learning: Wespeaker Toolkit for Research and Production S Wang, Z Chen, B Han, H Wang, C Liang, B Zhang, X Xiang, W Ding, ... Available at SSRN 4748855, 0 | | |
ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs B Zhang, Z Peng, B Dang, F Pan, Z Wu | | |