Shi-Xiong (Austin) ZHANG
Shi-Xiong (Austin) ZHANG
Microsoft --> Principal Researcher, Tencent AI Lab
Verified email at tencent.com
Title
Cited by
Cited by
Year
End-to-end attention based text-dependent speaker verification
SX Zhang, Z Chen, Y Zhao, J Li, Y Gong
2016 IEEE Spoken Language Technology Workshop (SLT), 171-178, 2016
1632016
Investigation of Multilingual Deep Neural Networks for Spoken Term Detection
K Knill, MJF Gales, S Rath, P Woodland, SX Zhang
ASRU, 2013
872013
SIMPLIFYING LONG SHORT-TERM MEMORY ACOUSTIC MODELS FOR FAST TRAINING AND DECODING
Y Miao, J Li, Y Wang, S Zhang, Y Gong
ICASSP, 2016
692016
Structured SVMs for automatic speech recognition
SX Zhang, MJF Gales
IEEE Transactions on Audio, Speech, and Language Processing 21 (3), 544-555, 2012
422012
A comprehensive study of speech separation: spectrogram vs waveform separation
F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu
arXiv preprint arXiv:1905.07497, 2019
392019
New era for robust speech recognition: exploiting deep learning
S Watanabe, M Delcroix, F Metze, JR Hershey, et al.
Springer, 2017
392017
DEEP NEURAL SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION
SX Zhang, C Liu, K Yao, Y Gong
ICASSP 2015, 2015
342015
Structured log linear models for noise robust speech recognition
SX Zhang, A Ragni, MJF Gales
IEEE Signal Processing Letters 17 (11), 945-948, 2010
342010
Structured Support Vector Machines for Noise Robust Continuous Speech Recognition.
SX Zhang, MJF Gales
INTERSPEECH, 989-990, 2011
302011
End-to-End Multi-Channel Speech Separation
R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
https://arxiv.org/abs/1905.06286, 2019
29*2019
Time Domain Audio Visual Speech Separation
J Wu, Y Xu, SX Zhang, LW Chen, M Yu, L Xie, D Yu
Automatic Speech Recognition and Understanding Workshop, ASRU 2019,, 2019
282019
Multi-modal multi-channel target speech separation
R Gu, SX Zhang, Y Xu, L Chen, Y Zou, D Yu
IEEE Journal of Selected Topics in Signal Processing 14 (3), 530-541, 2020
222020
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information
R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu
222019
Speaker verification via high-level feature based phonetic-class pronunciation modeling
SX Zhang, MW Mak, H Meng
IEEE Transactions on Computers 56 (9), 1189-1198, 2007
202007
Domain and speaker adaptation for Cortana Speech Recognition
Y Zhao, J Li, L Chen, Gong, SX Zhang
162018
Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning
R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
132020
Recurrent Support Vector Machines for Speech Recognition
SX Zhang, R Zhao, C Liu, J Li, Y Gong
ICASSP, 2016
132016
An overview of deep-learning-based audio-visual speech enhancement and separation
D Michelsanti, ZH Tan, SX Zhang, Y Xu, M Yu, D Yu, J Jensen
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021
122021
Neural Spatio-Temporal Beamformer for Target Speech Separation
Y Xu, M Yu, SX Zhang, L Chen, C Weng, J Liu, D Yu
arXiv preprint arXiv:2005.03889, 2020
122020
Optimized discriminative kernel for SVM scoring and its application to speaker verification
SX Zhang, MW Mak
IEEE transactions on neural networks 22 (2), 173-185, 2010
122010
The system can't perform the operation now. Try again later.
Articles 1–20