|SUPERB: Speech processing Universal PERformance Benchmark|
S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ...
arXiv preprint arXiv:2105.01051, 2021
|Angular Softmax for Short-Duration Text-independent Speaker Verification.|
Z Huang, S Wang, K Yu
Interspeech, 3623-3627, 2018
|Speaker diarization with region proposal network|
Z Huang, S Watanabe, Y Fujita, P García, Y Shao, D Povey, S Khudanpur
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
|Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis|
D Raj, P Denisov, Z Chen, H Erdogan, Z Huang, M He, S Watanabe, J Du, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 897-904, 2021
|DOVER-Lap: A method for combining overlap-aware diarization outputs|
D Raj, LP Garcia-Perera, Z Huang, S Watanabe, D Povey, A Stolcke, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 881-888, 2021
|Recover missing sensor data with iterative imputing network|
J Zhou, Z Huang
Workshops at the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
|Discriminative neural embedding learning for short-duration text-independent speaker verification|
S Wang, Z Huang, Y Qian, K Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (11 …, 2019
|SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities|
HS Tsai, HJ Chang, WC Huang, Z Huang, K Lakhotia, S Yang, S Dong, ...
arXiv preprint arXiv:2203.06849, 2022
|The hitachi-jhu dihard iii system: Competitive end-to-end neural diarization and x-vector clustering systems combined by dover-lap|
S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ...
arXiv preprint arXiv:2102.01363, 2021
|Joint i-vector with end-to-end system for short duration text-independent speaker verification|
Z Huang, S Wang, Y Qian
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
|Multi-class spectral clustering with overlaps for speaker diarization|
D Raj, Z Huang, S Khudanpur
2021 IEEE Spoken Language Technology Workshop (SLT), 582-589, 2021
|Investigating self-supervised learning for speech enhancement and separation|
Z Huang, S Watanabe, S Yang, P García, S Khudanpur
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
|Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker|
M He, D Raj, Z Huang, J Du, Z Chen, S Watanabe
arXiv preprint arXiv:2108.03342, 2021
|JHU Diarization System Description.|
Z Huang, LP García-Perera, J Villalba, D Povey, N Dehak
IberSPEECH, 236-239, 2018
|SUPERB@ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning|
T Feng, A Dong, CF Yeh, S Yang, TQ Lin, J Shi, KW Chang, Z Huang, ...
arXiv preprint arXiv:2210.08634, 2022
|Joint speaker diarization and speech recognition based on region proposal networks|
Z Huang, M Delcroix, LP Garcia, S Watanabe, D Raj, S Khudanpur
Computer Speech & Language 72, 101316, 2022
|Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition|
Z Huang, Z Chen, N Kanda, J Wu, Y Wang, J Li, T Yoshioka, X Wang, ...
arXiv preprint arXiv:2211.05564, 2022
|Adapting self-supervised models to multi-talker speech recognition using speaker embeddings|
Z Huang, D Raj, P García, S Khudanpur
arXiv preprint arXiv:2211.00482, 2022