Speaker augmentation for low resource speech recognition | C Du, K Yu | ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 24 | 2020 |
Data augmentation for end-to-end code-switching speech recognition | C Du, H Li, Y Lu, L Wang, Y Qian | 2021 IEEE Spoken Language Technology Workshop (SLT), 194-200, 2021 | 18 | 2021 |
VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature | C Du, Y Guo, X Chen, K Yu | Proc. ISCA Interspeech, 1596-1600, 2022 | 16 | 2022 |
Rich prosody diversity modelling with phone-level mixture density network | C Du, K Yu | Proc. ISCA Interspeech, 3136-3140, 2021 | 12* | 2021 |
Phone-level prosody modelling with GMM-based MDN for diverse and controllable speech synthesis | C Du, K Yu | IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 190-201, 2021 | 10* | 2021 |
Towards data selection on TTS data for children’s speech recognition | W Wang, Z Zhou, Y Lu, H Wang, C Du, Y Qian | ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 8 | 2021 |
Unsupervised word-level prosody tagging for controllable speech synthesis | Y Guo, C Du, K Yu | ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 6 | 2022 |
SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification | C Du, B Han, S Wang, Y Qian, K Yu | ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 5 | 2021 |
Neural Fusion For Voice Cloning | B Chen, C Du, K Yu | IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1993-2001, 2022 | 2 | 2022 |
EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance | Y Guo, C Du, X Chen, K Yu | ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
Acoustic Word Embeddings for End-to-End Speech Synthesis | F Shen, C Du, K Yu | Applied Sciences 11 (19), 9010, 2021 | 1 | 2021 |
Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Challenge | C Du, Y Guo, F Shen, K Yu | ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder | C Du, Q Chen, T He, X Tan, X Chen, K Yu, S Zhao, J Bian | arXiv preprint arXiv:2303.17550, 2023 | | 2023 |