Attentive statistics pooling for deep speaker embedding K Okabe, T Koshinaka, K Shinoda arXiv preprint arXiv:1803.10963, 2018 | 450 | 2018 |
MDL-based context-dependent subword modeling for speech recognition K Shinoda, T Watanabe Acoustical Science and Technology 21 (2), 79-86, 2000 | 366 | 2000 |
A structural Bayes approach to speaker adaptation K Shinoda, CH Lee IEEE Transactions on Speech and Audio Processing 9 (3), 276-287, 2001 | 220 | 2001 |
Acoustic modeling based on the MDL principle for speech recognition K Shinoda, T Watanabe Fifth European Conference on Speech Communication and Technology, 1997 | 196 | 1997 |
Structural MAP speaker adaptation using hierarchical priors K Shinoda, CH Lee 1997 IEEE Workshop on Automatic Speech Recognition and Understanding …, 1997 | 117 | 1997 |
GINGA observation of the X-ray pulsar 1E 2259+ 586 in the supernova remnant G109. 1-1.0 K Koyama, F Nagase, Y Ogawara, K Shinoda, N Kawai, MH Jones, ... Publications of the Astronomical Society of Japan 41, 461-471, 1989 | 84 | 1989 |
Multimodal fusion of bert-cnn and gated cnn representations for depression detection M Rodrigues Makiuchi, T Warnita, K Uto, K Shinoda Proceedings of the 9th International on Audio/Visual Emotion Challenge and …, 2019 | 73 | 2019 |
An online attention-based model for speech recognition R Fan, P Zhou, W Chen, J Jia, G Liu arXiv preprint arXiv:1811.05247, 2018 | 73* | 2018 |
Technique for adaptation of hidden markov models for speech recognition CH Lee, K Shinoda US Patent 6,151,574, 2000 | 73 | 2000 |
Speaker adaptation with autonomous model complexity control by MDL principle K Shinoda, T Watanabe 1996 IEEE International Conference on Acoustics, Speech, and Signal …, 1996 | 67 | 1996 |
Discovery of the quasi-periodic oscillations from the X-ray pulsar X1627-673 K Shinoda, T Kii, K Mitsuda, F Nagase, Y Tanaka, K Makishima, ... Publications of the Astronomical Society of Japan 42, L27-L32, 1990 | 65 | 1990 |
A fast and accurate video semantic-indexing system using fast MAP adaptation and GMM supervectors N Inoue, K Shinoda IEEE Transactions on Multimedia 14 (4), 1196-1205, 2012 | 58 | 2012 |
High speed speech recognition using tree-structured probability density function T Watanabe, K Shinoda, K Takagi, KI Iso 1995 International Conference on Acoustics, Speech, and Signal Processing 1 …, 1995 | 54 | 1995 |
Speaker adaptation techniques for automatic speech recognition K Shinoda Proc. APSIPA ASC 2011, 2011 | 50 | 2011 |
Speech recognition apparatus K Shinoda US Patent 7,437,288, 2008 | 47 | 2008 |
Speaker adaptation with autonomous control using tree structure K Shinoda, T Watanabe Fourth European Conference on Speech Communication and Technology, 1995 | 47 | 1995 |
Detecting Alzheimer's disease using gated convolutional neural network from audio data T Warnita, N Inoue, K Shinoda arXiv preprint arXiv:1803.11344, 2018 | 46 | 2018 |
Spectral graph skeletons for 3D action recognition T Kerola, N Inoue, K Shinoda Computer Vision--ACCV 2014: 12th Asian Conference on Computer Vision …, 2015 | 44 | 2015 |
Hidden Markov model for automatic transcription of MIDI signals H Takeda, N Saito, T Otsuki, M Nakai, H Shimodaira, S Sagayama 2002 IEEE Workshop on Multimedia Signal Processing., 428-431, 2002 | 44 | 2002 |
Tokyotech+ canon at trecvid 2011 N Inoue, Y Kamishima, T Wada, K Shinoda, S Sato Proc. TRECVID Workshop 2011, 2011 | 43 | 2011 |