Kaizhi Qian
MIT-IBM Watson AI Lab
Verified email at ibm.com
Title
Cited by
Year
Autovc: Zero-shot voice style transfer with only autoencoder loss
K Qian, Y Zhang, S Chang, X Yang, M Hasegawa-Johnson
International Conference on Machine Learning, 5210-5219, 2019
Cited by: 477 | Year: 2019
Unsupervised speech decomposition via triple information bottleneck
K Qian, Y Zhang, S Chang, M Hasegawa-Johnson, D Cox
International Conference on Machine Learning, 7836-7846, 2020
Cited by: 178 | Year: 2020
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder
K Qian, Z Jin, M Hasegawa-Johnson, GJ Mysore
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 109 | Year: 2020
Speech Enhancement Using Bayesian Wavenet.
K Qian, Y Zhang, S Chang, X Yang, D Florêncio, M Hasegawa-Johnson
Interspeech, 2013-2017, 2017
Cited by: 101 | Year: 2017
Contentvec: An improved self-supervised speech representation by disentangling speakers
K Qian, Y Zhang, H Gao, J Ni, CI Lai, D Cox, M Hasegawa-Johnson, ...
International Conference on Machine Learning, 18003-18017, 2022
Cited by: 70 | Year: 2022
Parp: Prune, adjust and re-prune for self-supervised speech recognition
CIJ Lai, Y Zhang, AH Liu, S Chang, YL Liao, YS Chuang, K Qian, ...
Advances in Neural Information Processing Systems 34, 21256-21272, 2021
Cited by: 58 | Year: 2021
Deep learning based speech beamforming
K Qian, Y Zhang, S Chang, X Yang, D Florencio, M Hasegawa-Johnson
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 44 | Year: 2018
Global prosody style transfer without text transcriptions
K Qian, Y Zhang, S Chang, J Xiong, C Gan, D Cox, M Hasegawa-Johnson
International Conference on Machine Learning, 8650-8660, 2021
Cited by: 36 | Year: 2021
Speechsplit2.0: Unsupervised speech disentanglement for voice conversion without tuning autoencoder bottlenecks
CH Chan, K Qian, Y Zhang, M Hasegawa-Johnson
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 34 | Year: 2022
Unsupervised text-to-speech synthesis by unsupervised automatic speech recognition
J Ni, L Wang, H Gao, K Qian, Y Zhang, S Chang, M Hasegawa-Johnson
arXiv preprint arXiv:2203.15796, 2022
Cited by: 30 | Year: 2022
Wavprompt: Towards few-shot spoken language understanding with frozen language models
H Gao, J Ni, K Qian, Y Zhang, S Chang, M Hasegawa-Johnson
arXiv preprint arXiv:2203.15863, 2022
Cited by: 23 | Year: 2022
Physics-driven diffusion models for impact sound synthesis from videos
K Su, K Qian, E Shlizerman, A Torralba, C Gan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 13 | Year: 2023
Zero-Shot Cross-Lingual Phonetic Recognition with External Language Embedding.
H Gao, J Ni, Y Zhang, K Qian, S Chang, M Hasegawa-Johnson
Interspeech, 1304-1308, 2021
Cited by: 13 | Year: 2021
Speech denoising with auditory models
MR Saddler, A Francl, J Feather, K Qian, Y Zhang, JH McDermott
arXiv preprint arXiv:2011.10706, 2020
Cited by: 8* | Year: 2020
Continuous cnn for nonuniform time series
H Shi, Y Zhang, H Wu, S Chang, K Qian, M Hasegawa-Johnson, J Zhao
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 7* | Year: 2021
Master-ASR: achieving multilingual scalability and low-resource adaptation in ASR with modular learning
Z Yu, Y Zhang, K Qian, C Wan, Y Fu, Y Zhang, YC Lin
International Conference on Machine Learning, 40475-40487, 2023
Cited by: 6 | Year: 2023
Losses can be blessings: Routing self-supervised speech representations towards efficient multilingual and multitask speech processing
Y Fu, Y Zhang, K Qian, Z Ye, Z Yu, CIJ Lai, C Lin
Advances in Neural Information Processing Systems 35, 20902-20920, 2022
Cited by: 5 | Year: 2022
On the interplay between sparsity, naturalness, intelligibility, and prosody in speech synthesis
CIJ Lai, E Cooper, Y Zhang, S Chang, K Qian, YL Liao, YS Chuang, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 4 | Year: 2022
Application of local binary patterns for SVM based stop consonant detection
K Qian, Y Zhang, M Hasegawa-Johnson
Proc. Speech Prosody, 1114-1118, 2016
Cited by: 4 | Year: 2016
Domain Generalization for Language-Independent Automatic Speech Recognition
H Gao, J Ni, Y Zhang, K Qian, S Chang, M Hasegawa-Johnson
Frontiers in Artificial Intelligence 5, 806274, 2022
Cited by: 1 | Year: 2022