Improving prosody modelling with cross-utterance bert embeddings for end-to-end speech synthesis G Xu, W Song, Z Zhang, C Zhang, X He, B Zhou ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 55 | 2021 |
Building a mixed-lingual neural TTS system with only monolingual data L Xue, W Song, G Xu, L Xie, Z Wu arXiv preprint arXiv:1904.06063, 2019 | 36 | 2019 |
Dian: Duration informed auto-regressive network for voice cloning W Song, X Yuan, Z Zhang, C Zhang, Y Wu, X He, B Zhou ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 11 | 2021 |
Efficient WaveGlow: An Improved WaveGlow Vocoder with Enhanced Speed. W Song, G Xu, Z Zhang, C Zhang, X He, B Zhou INTERSPEECH, 225-229, 2020 | 7 | 2020 |
Prosody modelling with pre-trained cross-utterance representations for improved speech synthesis YJ Zhang, C Zhang, W Song, Z Zhang, Y Wu, X He IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 2812-2823, 2023 | 5 | 2023 |
MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy YJ Zhang, W Song, Y Yue, Z Zhang, Y Wu, X He arXiv preprint arXiv:2211.06170, 2022 | 5 | 2022 |
Multi-speaker Multi-style Speech Synthesis with Timbre and Style Disentanglement W Song, Y Yue, Y Zhang, Z Zhang, Y Wu, X He National Conference on Man-Machine Speech Communication, 132-140, 2022 | 4 | 2022 |
基于关键块空间分布与 Gabor 滤波的人脸表情识别算法 宋伟, 赵清杰, 宋红, 樊茜 中南大学学报: 自然科学版, 239-243, 2013 | 4 | 2013 |
Singing voice synthesis with vibrato modeling and latent energy representation Y Song, W Song, W Zhang, Z Zhang, D Zeng, Z Liu, Y Yu 2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP …, 2022 | 3 | 2022 |
Butterfly image retrieval based on SIFT feature analysis H Hao, C Cai, Y Meng, W Song, X Qin, H Zhao PIAGENG 2009: Image Processing and Photonics for Agricultural Engineering …, 2009 | 3 | 2009 |
Wood image retrieval algorithm based on keyblock distribution S Wei, C Cheng IEEE Int Conf Comput Intell Softw Eng, 2009 | 2 | 2009 |
Wood image retrieval algorithm based on keyblock distribution W Song, C Cai 2009 International Conference on Computational Intelligence and Software …, 2009 | 1 | 2009 |
Apple Physalospora recognition by using Gabor feature-based PCA X Qin, C Cai, W Song, H Hao, Y Meng, J Zhu PIAGENG 2009: Image Processing and Photonics for Agricultural Engineering …, 2009 | 1 | 2009 |
Speech synthesis method, device and computer readable storage medium WU Zhizheng, Z Zhang, W Song, RAO Yonghui, XIE Zhihang, G Xu, ... US Patent 11,881,205, 2024 | | 2024 |
Custom tone and vocal synthesis method and apparatus, electronic device, and storage medium Z Zhang, J Wu, CAI Yuyu, X Yuan, W Song, X He US Patent App. 18/252,186, 2023 | | 2023 |
Multi-speaker Multi-style Speech Synthesis with Timbre and Style Disentanglement W Song, Y Yue, Y Zhang, Z Zhang, Y Wu, X He Man-Machine Speech Communication: 17th National Conference, NCMMSC 2022 …, 2023 | | 2023 |
Text information processing method and apparatus XUE Liumeng, W Song, WU Zhizheng US Patent App. 17/789,513, 2022 | | 2022 |
Speech synthesis method and apparatus, and storage medium WU Zhizheng, W Song US Patent App. 17/629,483, 2022 | | 2022 |
Content-based butterfly image retrieval based on keyblock distribution W Song, C Cai, X Qin, Y Meng, H Hao PIAGENG 2009: Image Processing and Photonics for Agricultural Engineering …, 2009 | | 2009 |
Apple lesion recognition based on Fisherapples Y Meng, C Cai, H Hao, X Qin, W Song, L Huang PIAGENG 2009: Image Processing and Photonics for Agricultural Engineering …, 2009 | | 2009 |