Di He
Amazon Alexa Speech
Verified email at amazon.com
Title
Cited by
Year
Machine learning on FPGAs to face the IoT revolution
X Zhang, A Ramachandran, C Zhuge, D He, W Zuo, Z Cheng, K Rupnow, ...
2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 894-901, 2017
Cited by 78
2017
Wav2vec-C: A self-supervised model for speech representation learning
S Sadhu, D He, CW Huang, SH Mallidi, M Wu, A Rastrow, A Stolcke, ...
arXiv preprint arXiv:2103.08393, 2021
Cited by 50
2021
Poisoning attack on load forecasting
Y Liang, D He, D Chen
2019 IEEE Innovative Smart Grid Technologies - Asia (ISGT Asia), 1230-1235, 2019
Cited by 24
2019
Optimal blocking device placement for geomagnetic disturbance mitigation
Y Liang, D He, H Zhu, D Chen
IEEE Transactions on Power Delivery 34 (6), 2219-2231, 2019
Cited by 14
2019
Acoustic landmarks contain more information about the phone string than other frames for automatic speech recognition with deep neural network acoustic model
D He, BP Lim, X Yang, M Hasegawa-Johnson, D Chen
The Journal of the Acoustical Society of America 143 (6), 3207-3219, 2018
Cited by 13
2018
When CTC training meets acoustic landmarks
D He, X Yang, BP Lim, Y Liang, M Hasegawa-Johnson, D Chen
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 9
2019
Improved ASR for under-resourced languages through multi-task learning with acoustic landmarks
D He, BP Lim, X Yang, M Hasegawa-Johnson, D Chen
arXiv preprint arXiv:1805.05574, 2018
Cited by 9
2018
Selecting frames for automatic speech recognition based on acoustic landmarks
D He, BPP Lim, X Yang, M Hasegawa-Johnson, D Chen
The Journal of the Acoustical Society of America 141 (5_Supplement), 3468-3468, 2017
Cited by 4
2017
Using Approximated Auditory Roughness as a Pre-Filtering Feature for Human Screaming and Affective Speech AED.
D He, Z Cheng, M Hasegawa-Johnson, D Chen
INTERSPEECH, 1914-1918, 2017
Cited by 4
2017
VADOI: Voice-activity-detection overlapping inference for end-to-end long-form speech recognition
J Wang, X Tong, J Guo, D He, R Maas
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 2
2022
Personalized predictive ASR for latency reduction in voice assistants
A Schwarz, D He, M Van Segbroeck, M Hethnawi, A Rastrow
arXiv preprint arXiv:2305.13794, 2023
Cited by 1
2023
Turn-taking and backchannel prediction with acoustic and large language model fusion
J Wang, L Chen, A Khare, A Raju, P Dheram, D He, M Wu, A Stolcke, ...
arXiv preprint arXiv:2401.14717, 2024
2024
Two-pass endpoint detection for speech recognition
A Raju, A Khare, D He, I Sklyar, L Chen, S Alptekin, VA Trinh, Z Zhang, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
2023
Towards Accurate and Real-Time End-of-Speech Estimation
Y Fan, C Vaz, D He, J Heymann, VA Trinh, Z Zhang, V Ravichandran
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
Adaptive Endpointing with Deep Contextual Multi-Armed Bandits
A Stolcke, A Raju, C Vaz, D He, V Ravichandran, VA Trinh
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits
DJ Min, A Stolcke, A Raju, C Vaz, D He, V Ravichandran, VA Trinh
arXiv preprint arXiv:2303.13407, 2023
2023
The benefits of acoustic perceptual information for speech processing systems
D He
University of Illinois at Urbana-Champaign, 2019
2019
Augmenting Input Method Language Model with user Location Type Information
D He
arXiv preprint arXiv:1809.08349, 2018
2018
Accelerating lattice scoring of automatic speech recognition through acoustic pre-pruning on GPU
D He
University of Illinois at Urbana-Champaign, 2015
2015