Chenpeng Du

Cited by

	All	Since 2019
Citations	268	268
h-index	10	10
i10-index	10	10

140

105

202020212022202320242 17 45 127 77

Public access

View all

3 articles

1 article

available

not available

Based on funding mandates

Co-authors

Kai Yu（俞凯）Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Xie ChenShanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Yiwei GuoShanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Yanmin QianProfessor, Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Feiyu ShenShanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Shuai WangSRIBDVerified email at sribd.cn

Chenpeng Du

Shanghai Jiao Tong University

Verified email at sjtu.edu.cn - Homepage

Speech Synthesis Speech Recognition


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
VQTTS: High-fidelity text-to-speech synthesis with self-supervised VQ acoustic feature C Du, Y Guo, X Chen, K Yu Proc. Interspeech 2022, 1596-1600, 2022	49	2022
Speaker augmentation for low resource speech recognition C Du, K Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	35	2020
Data augmentation for end-to-end code-switching speech recognition C Du, H Li, Y Lu, L Wang, Y Qian 2021 IEEE Spoken Language Technology Workshop (SLT), 194-200, 2021	23	2021
Rich prosody diversity modelling with phone-level mixture density network C Du, K Yu Proc. Interspeech 2021, 3136-3140, 2021	23*	2021
Dae-talker: High fidelity speech-driven talking face generation with diffusion autoencoder C Du, Q Chen, T He, X Tan, X Chen, K Yu, S Zhao, J Bian Proceedings of the 31st ACM International Conference on Multimedia, 4281-4289, 2023	22	2023
Emodiff: Intensity controllable emotional text-to-speech with soft-label guidance Y Guo, C Du, X Chen, K Yu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	20	2023
Phone-level prosody modelling with gmm-based mdn for diverse and controllable speech synthesis C Du, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 190-201, 2021	18*	2021
UniCATS: A unified context-aware text-to-speech framework with contextual vq-diffusion and vocoding C Du, Y Guo, F Shen, Z Liu, Z Liang, X Chen, S Wang, H Zhang, K Yu Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 17924 …, 2024	15	2024
Towards data selection on tts data for children’s speech recognition W Wang, Z Zhou, Y Lu, H Wang, C Du, Y Qian ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	14	2021
Unsupervised word-level prosody tagging for controllable speech synthesis Y Guo, C Du, K Yu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	12	2022
Synaug: Synthesis-based data augmentation for text-dependent speaker verification C Du, B Han, S Wang, Y Qian, K Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	8	2021
Neural fusion for voice cloning B Chen, C Du, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1993-2001, 2022	6	2022
VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching Y Guo, C Du, Z Ma, X Chen, K Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	4	2024
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech C Du, Y Guo, H Wang, Y Yang, Z Niu, S Wang, H Zhang, X Chen, K Yu arXiv preprint arXiv:2401.14321, 2024	3	2024
Speaker Adaptive Text-to-Speech with Timbre-Normalized Vector-Quantized Feature C Du, Y Guo, X Chen, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 3446-3456, 2023	3	2023
DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech S Liu, Y Guo, C Du, X Chen, K Yu Proc. Interspeech 2023, 616-620, 2023	3	2023
Acoustic Word Embeddings for End-to-End Speech Synthesis F Shen, C Du, K Yu Applied Sciences 11 (19), 9010, 2021	3	2021
Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS Y Yang, F Shen, C Du, Z Ma, K Yu, D Povey, X Chen ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	2	2024
Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation Z Liang, Z Song, Z Ma, C Du, K Yu, X Chen Proc. Interspeech 2023, 919-923, 2023	2	2023
Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Challenge C Du, Y Guo, F Shen, K Yu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	2	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors