Yao Qian
Title
Cited by
Year
WavLM: Large-scale self-supervised pre-training for full stack speech processing
S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022
Cited by: 909; Year: 2022
TTS synthesis with bidirectional LSTM based recurrent neural networks
Y Fan, Y Qian, FL Xie, FK Soong
Fifteenth annual conference of the international speech communication …, 2014
Cited by: 598; Year: 2014
On the training aspects of deep neural network (DNN) for parametric TTS synthesis
Y Qian, Y Fan, W Hu, FK Soong
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
Cited by: 262; Year: 2014
Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers
W Hu, Y Qian, FK Soong, Y Wang
Speech Communication 67, 154-166, 2015
Cited by: 236; Year: 2015
Part-of-speech tagging with bidirectional long short-term memory recurrent neural network
P Wang, Y Qian, FK Soong, L He, H Zhao
arXiv preprint arXiv:1510.06168, 2015
Cited by: 153; Year: 2015
Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis
Y Fan, Y Qian, FK Soong, L He
2015 IEEE international conference on acoustics, speech and signal …, 2015
Cited by: 150; Year: 2015
SpeechT5: Unified-modal encoder-decoder pre-training for spoken language processing
J Ao, R Wang, L Zhou, C Wang, S Ren, Y Wu, S Liu, T Ko, Q Li, Y Zhang, ...
arXiv preprint arXiv:2110.07205, 2021
Cited by: 141; Year: 2021
Using bidirectional LSTM recurrent neural networks to learn high-level abstractions of sequential features for automated scoring of non-native spontaneous speech
Z Yu, V Ramanarayanan, D Suendermann-Oeft, X Wang, K Zechner, ...
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
Cited by: 127; Year: 2015
A unified tagging solution: Bidirectional LSTM recurrent neural network with word embedding
P Wang, Y Qian, FK Soong, L He, H Zhao
arXiv preprint arXiv:1511.00215, 2015
Cited by: 117; Year: 2015
A report on the 2017 native language identification shared task
S Malmasi, K Evanini, A Cahill, J Tetreault, R Pugh, C Hamill, ...
Proceedings of the 12th Workshop on Innovative Use of NLP for Building …, 2017
Cited by: 113; Year: 2017
Locating boundaries for prosodic constituents in unrestricted Mandarin texts
M Chu, Y Qian
International Journal of Computational Linguistics & Chinese Language …, 2001
Cited by: 107; Year: 2001
A new DNN-based high quality pronunciation evaluation for computer-aided language learning (CALL).
W Hu, Y Qian, FK Soong
Interspeech, 1886-1890, 2013
Cited by: 105; Year: 2013
UniSpeech: Unified speech representation learning with labeled and unlabeled data
C Wang, Y Wu, Y Qian, K Kumatani, S Liu, F Wei, M Zeng, X Huang
International Conference on Machine Learning, 10937-10947, 2021
Cited by: 103; Year: 2021
Large-scale self-supervised speech representation learning for automatic speaker verification
Z Chen, S Chen, Y Wu, Y Qian, C Wang, S Liu, Y Qian, M Zeng
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 86; Year: 2022
A cross-language state sharing and mapping approach to bilingual (Mandarin–English) TTS
Y Qian, H Liang, FK Soong
IEEE Transactions on Audio, Speech, and Language Processing 17 (6), 1231-1239, 2009
Cited by: 85; Year: 2009
End-to-end neural network based automated speech scoring
L Chen, J Tao, S Ghaffarzadegan, Y Qian
2018 IEEE international conference on acoustics, speech and signal …, 2018
Cited by: 82; Year: 2018
Exploring ASR-free end-to-end modeling to improve spoken language understanding in a cloud-based dialog system
Y Qian, R Ubale, V Ramanarayanan, P Lange, D Suendermann-Oeft, ...
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
Cited by: 78; Year: 2017
An HMM-based Mandarin Chinese text-to-speech system
Y Qian, F Soong, Y Chen, M Chu
Chinese Spoken Language Processing: 5th International Symposium, ISCSLP 2006 …, 2006
Cited by: 73; Year: 2006
Word embedding for recurrent neural network based TTS synthesis
P Wang, Y Qian, FK Soong, L He, H Zhao
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
Cited by: 69; Year: 2015
A frame mapping based HMM approach to cross-lingual voice transformation
Y Qian, J Xu, FK Soong
2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011
Cited by: 66; Year: 2011