Speech emotion recognition using capsule networks X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 130 | 2019 |
Adversarial attacks on GMM i-vector based speaker verification systems X Li, J Zhong, X Wu, J Yu, X Liu, H Meng ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 98 | 2020 |
Any-to-many voice conversion with location-relative sequence-to-sequence modeling S Liu, Y Cao, D Wang, X Wu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1717-1728, 2021 | 88 | 2021 |
Channel-wise gated res2net: Towards robust detection of synthetic speech attacks X Li, X Wu, H Lu, X Liu, H Meng arXiv preprint arXiv:2107.08803, 2021 | 71 | 2021 |
Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance. S Liu, J Zhong, L Sun, X Wu, X Liu, H Meng Interspeech, 496-500, 2018 | 65 | 2018 |
Learning discriminative features from spectrograms using center loss for speech emotion recognition D Dai, Z Wu, R Li, X Wu, J Jia, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 62 | 2019 |
Uniaudio: An audio foundation model toward universal audio generation D Yang, J Tian, X Tan, R Huang, S Liu, X Chang, J Shi, S Zhao, J Bian, ... arXiv preprint arXiv:2310.00704, 2023 | 60 | 2023 |
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus. J Yu, X Xie, S Liu, S Hu, MWY Lam, X Wu, KH Wong, X Liu, H Meng Interspeech, 2938-2942, 2018 | 60 | 2018 |
Investigating robustness of adversarial samples detection for automatic speaker verification X Li, N Li, J Zhong, X Wu, X Liu, D Su, D Yu, H Meng arXiv preprint arXiv:2006.06186, 2020 | 46 | 2020 |
End-to-end accent conversion without using native utterances S Liu, D Wang, Y Cao, L Sun, X Wu, S Kang, Z Wu, X Liu, D Su, D Yu, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 43 | 2020 |
End-to-end code-switched tts with mix of monolingual recordings Y Cao, X Wu, S Liu, J Yu, X Li, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 43 | 2019 |
Improved end-to-end dysarthric speech recognition via meta-learning based model re-initialization D Wang, J Yu, X Wu, L Sun, X Liu, H Meng 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 42 | 2021 |
End-to-end voice conversion via cross-modal knowledge distillation for dysarthric speech reconstruction D Wang, J Yu, X Wu, S Liu, L Sun, X Liu, H Meng ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 41 | 2020 |
Intonation classification for L2 English speech using multi-distribution deep neural networks K Li, X Wu, H Meng Computer Speech & Language 43, 18-33, 2017 | 37 | 2017 |
Coupling global and local context for unsupervised aspect extraction M Liao, J Li, H Zhang, L Wang, X Wu, KF Wong Proceedings of the 2019 conference on empirical methods in natural language …, 2019 | 29 | 2019 |
Sail: Search-augmented instruction learning H Luo, YS Chuang, Y Gong, T Zhang, Y Kim, X Wu, D Fox, H Meng, ... arXiv preprint arXiv:2305.15225, 2023 | 27 | 2023 |
Acoustic to articulatory mapping with deep neural network Z Wu, K Zhao, X Wu, X Lan, H Meng Multimedia Tools and Applications 74, 9889-9907, 2015 | 27 | 2015 |
Speech emotion recognition using sequential capsule networks X Wu, Y Cao, H Lu, S Liu, D Wang, Z Wu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3280-3291, 2021 | 26 | 2021 |
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT. D Dai, Z Wu, S Kang, X Wu, J Jia, D Su, D Yu, H Meng Interspeech, 2090-2094, 2019 | 25 | 2019 |
Interpretable unified language checking T Zhang, H Luo, YS Chuang, W Fang, L Gaitskell, T Hartvigsen, X Wu, ... arXiv preprint arXiv:2304.03728, 2023 | 23 | 2023 |