Streaming end-to-end speech recognition for mobile devices Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 741 | 2019 |
Location-relative attention mechanisms for robust long-form speech synthesis E Battenberg, RJ Skerry-Ryan, S Mariooryad, D Stanton, D Kao, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 132 | 2020 |
Semi-supervised generative modeling for controllable speech synthesis R Habib, S Mariooryad, M Shannon, E Battenberg, RJ Skerry-Ryan, ... arXiv preprint arXiv:1910.01709, 2019 | 61 | 2019 |
Effective use of variational embedding capacity in expressive end-to-end speech synthesis E Battenberg, S Mariooryad, D Stanton, RJ Skerry-Ryan, M Shannon, ... arXiv preprint arXiv:1906.03402, 2019 | 57 | 2019 |
Efficient implementation of recurrent neural network transducer in tensorflow T Bagby, K Rao, KC Sim 2018 IEEE Spoken Language Technology Workshop (SLT), 506-512, 2018 | 42 | 2018 |
Speaker generation D Stanton, M Shannon, S Mariooryad, RJ Skerry-Ryan, E Battenberg, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 34 | 2022 |
End-to-End Training of Acoustic Models for Large Vocabulary Continuous Speech Recognition with TensorFlow. E Variani, T Bagby, E McDermott, M Bacchiani Interspeech, 1641-1645, 2017 | 31 | 2017 |
Non-saturating GAN training as divergence minimization M Shannon, B Poole, S Mariooryad, T Bagby, E Battenberg, D Kao, ... arXiv preprint arXiv:2010.08029, 2020 | 21 | 2020 |
Complex evolution recurrent neural networks (cernns) I Shafran, T Bagby, RJ Skerry-Ryan 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 14 | 2018 |
Improving the efficiency of forward-backward algorithm using batched computation in TensorFlow KC Sim, A Narayanan, T Bagby, TN Sainath, M Bacchiani 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 14 | 2017 |
Sampled connectionist temporal classification E Variani, T Bagby, K Lahouel, E McDermott, M Bacchiani 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 10 | 2018 |
Learning the joint distribution of two sequences using little or no paired data S Mariooryad, M Shannon, S Ma, T Bagby, D Kao, D Stanton, ... arXiv preprint arXiv:2212.03232, 2022 | 3 | 2022 |
Last: Scalable Lattice-Based Speech Modelling in Jax K Wu, E Variani, T Bagby, M Riley ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |
Generative semi-supervised learning with a neural seq2seq noisy channel S Mariooryad, M Shannon, S Ma, T Bagby, DTH Kao, D Stanton, ... ICML 2023 Workshop on Structured Probabilistic Inference {\&} Generative …, 0 | | |