Long short-term memory recurrent neural network architectures for large scale acoustic modeling H Sak, A Senior, F Beaufays Fifteenth Annual Conference of the International Speech Communication …, 2014 | 3389 | 2014 |
Convolutional, long short-term memory, fully connected deep neural networks TN Sainath, O Vinyals, A Senior, H Sak Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International …, 2015 | 1912 | 2015 |
Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition H Sak, A Senior, F Beaufays arXiv preprint arXiv:1402.1128, 2014 | 1485 | 2014 |
Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition H Sak, A Senior, K Rao, F Beaufays arXiv preprint arXiv:1507.06947, 2015 | 553 | 2015 |
Exploring architectures, data and units for streaming end-to-end speech recognition with RNN-transducer K Rao, H Sak, R Prabhavalkar Automatic Speech Recognition and Understanding Workshop (ASRU), 2017 IEEE …, 2017 | 410 | 2017 |
Neural speech recognizer: Acoustic-to-word LSTM model for large vocabulary speech recognition H Soltau, H Liao, H Sak arXiv preprint arXiv:1610.09975, 2016 | 398 | 2016 |
Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis H Zen, H Sak Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International …, 2015 | 387 | 2015 |
Grapheme-to-phoneme conversion using long short-term memory recurrent neural networks K Rao, F Peng, H Sak, F Beaufays Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International …, 2015 | 280 | 2015 |
Turkish language resources: Morphological parser, morphological disambiguator and web corpus H Sak, T Güngör, M Saraçlar Advances in natural language processing, 417-427, 2008 | 229 | 2008 |
Personalized speech recognition on mobile devices I McGraw, R Prabhavalkar, R Alvarez, MG Arenas, K Rao, D Rybach, ... Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International …, 2016 | 228 | 2016 |
Learning acoustic frame labeling for speech recognition with recurrent neural networks H Sak, A Senior, K Rao, O Irsoy, A Graves, F Beaufays, J Schalkwyk Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International …, 2015 | 228 | 2015 |
Automatic language identification using long short-term memory recurrent neural networks J Gonzalez-Dominguez, I Lopez-Moreno, H Sak, J Gonzalez-Rodriguez, ... Fifteenth Annual Conference of the International Speech Communication …, 2014 | 214 | 2014 |
Acoustic modeling for google home B Li, T Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, I Shafran, ... INTERSPEECH-2017, 399-403, 2017 | 205 | 2017 |
Large-Scale Visual Speech Recognition B Shillingford, Y Assael, MW Hoffman, T Paine, C Hughes, U Prabhu, ... arXiv preprint arXiv:1807.05162, 2018 | 201 | 2018 |
Sequence discriminative distributed training of long short-term memory recurrent neural networks H Sak, O Vinyals, G Heigold, A Senior, E McDermott, R Monga, M Mao Fifteenth Annual Conference of the International Speech Communication …, 2014 | 172 | 2014 |
Acoustic modelling with cd-ctc-smbr lstm rnns A Senior, H Sak, F de Chaumont Quitry, TN Sainath, K Rao | 159 | 2015 |
Recurrent Neural Aligner: An Encoder-Decoder Neural Network Model for Sequence to Sequence Mapping H Sak, M Shannon, K Rao, F Beaufays Proc. Interspeech 2017, 1298-1302, 2017 | 151 | 2017 |
Morphological disambiguation of Turkish text with perceptron algorithm H Sak, T Güngör, M Saraçlar International Conference on Intelligent Text Processing and Computational …, 2007 | 146 | 2007 |
Turkish broadcast news transcription and retrieval E Arisoy, D Can, S Parlak, H Sak, M Saraçlar IEEE Transactions on Audio, Speech, and Language Processing 17 (5), 874-883, 2009 | 143 | 2009 |
Multi-accent speech recognition with hierarchical grapheme based models K Rao, H Sak Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International …, 2017 | 89 | 2017 |