Binbin Zhang

Cited by

	All	Since 2019
Citations	717	685
h-index	5	5
i10-index	5	5

300

150

225

2017201820192020202120222023202410 22 33 34 52 165 281 120

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Lei XieNorthwestern Polytechnical UniversityVerified email at nwpu.edu.cn
xingchen songTsinghua UniversityVerified email at mails.tsinghua.edu.cn
Shuai WangSRIBDVerified email at sribd.cn
Pengcheng ZhuFuxi AI Lab, NetEase Inc.Verified email at corp.netease.com
Sining SunDuxiaoman, Beijing

Binbin Zhang

WeNet Community

No verified email - Homepage

speech recognition


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Wenet: Production oriented streaming and non-streaming end-to-end speech recognition toolkit Z Yao, D Wu, X Wang, B Zhang, F Yu, C Yang, Z Peng, X Chen, L Xie, ... arXiv preprint arXiv:2102.01547, 2021	262*	2021
An unsupervised deep domain adaptation approach for robust speech recognition S Sun, B Zhang, L Xie, Y Zhang Neurocomputing 257, 79-87, 2017	181	2017
Wenetspeech: A 10000+ hours multi-domain mandarin corpus for speech recognition B Zhang, H Lv, P Guo, Q Shao, C Yang, L Xie, X Xu, H Bu, X Chen, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	117	2022
Wenet 2.0: More productive end-to-end speech recognition toolkit B Zhang, D Wu, Z Peng, X Song, Z Yao, H Lv, L Xie, C Yang, F Pan, J Niu arXiv preprint arXiv:2203.15455, 2022	86*	2022
Wespeaker: A research and production oriented speaker embedding learning toolkit H Wang, C Liang, S Wang, Z Chen, B Zhang, X Xiang, Y Deng, Y Qian ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	37	2023
Fast-u2++: Fast and accurate end-to-end speech recognition in joint ctc/attention frames C Liang, XL Zhang, BB Zhang, D Wu, S Li, X Song, Z Peng, F Pan ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	5	2023
Trimtail: Low-latency streaming asr with simple but effective spectrogram-level length penalty X Song, D Wu, Z Wu, B Zhang, Y Zhang, Z Peng, W Li, F Pan, C Zhu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	5	2023
Zeroprompt: Streaming acoustic encoders are zero-shot masked lms X Song, D Wu, B Zhang, Z Peng, B Dang, F Pan, Z Wu arXiv preprint arXiv:2305.10649, 2023	5	2023
Empirical evaluation of parallel training algorithms on acoustic modeling W Li, B Zhang, L Xie, D Yu arXiv preprint arXiv:1703.05880, 2017	5	2017
Lightgrad: Lightweight diffusion probabilistic model for text-to-speech J Chen, X Song, Z Peng, B Zhang, F Pan, Z Wu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	4	2023
WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit J Wang, M Xu, J Hou, B Zhang, XL Zhang, L Xie, F Pan ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	4	2023
Branch-ECAPA-TDNN: A parallel branch architecture to capture local and global features for speaker verification J Yao, C Liang, Z Peng, B Zhang, XL Zhang Proc. Interspeech, 1943-1947, 2023	3	2023
The iscslp 2022 intelligent cockpit speech recognition challenge (icsrc): Dataset, tracks, baseline and results A Zhang, F Yu, K Huang, L Xie, L Wang, ES Chng, H Bu, B Zhang, ... 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022	3	2022
U2++ MoE: Scaling 4.7 x parameters with minimal impact on RTF X Song, D Wu, B Zhang, D Zhou, Z Peng, B Dang, F Pan, C Yang arXiv preprint arXiv:2404.16407, 2024		2024
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge H Wang, P Guo, Y Li, A Zhang, J Sun, L Xie, W Chen, P Zhou, H Bu, X Xu, ... arXiv preprint arXiv:2401.03473, 2024		2024
Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition K Huang, A Zhang, B Zhang, T Xu, X Song, L Xie 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023		2023
The GUA-Speech System Description for CNVSRC Challenge 2023 S Li, C Lei, B Ma, B Zhang, F Pan arXiv preprint arXiv:2312.07254, 2023		2023
FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition X Song, D Wu, B Zhang, Z Wu, W Li, D Li, P Zhang, Z Peng, F Pan, C Zhu, ... arXiv preprint arXiv:2210.17079, 2022		2022
Advancing Speaker Embedding Learning: Wespeaker Toolkit for Research and Production S Wang, Z Chen, B Han, H Wang, C Liang, B Zhang, X Xiang, W Ding, ... Available at SSRN 4748855, 0
ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs B Zhang, Z Peng, B Dang, F Pan, Z Wu

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors