Follow
Shubham Toshniwal
Shubham Toshniwal
Senior Research Scientist, NVIDIA
Verified email at nvidia.com - Homepage
Title
Cited by
Cited by
Year
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
TMLR, 2023
10192023
Multilingual speech recognition with a single end-to-end model
S Toshniwal, TN Sainath, RJ Weiss, B Li, P Moreno, E Weinstein, K Rao
ICASSP 2018, 2018
2922018
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
2092019
A comparison of techniques for language model integration in encoder-decoder speech recognition
S Toshniwal, A Kannan, CC Chiu, Y Wu, TN Sainath, K Livescu
SLT 2018, 2018
1912018
Multitask learning with low-level auxiliary tasks for encoder-decoder based speech recognition
S Toshniwal, H Tang, L Lu, K Livescu
Interspeech 2017, 2017
1302017
Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis
T Hayashi, S Watanabe, T Toda, K Takeda, S Toshniwal, K Livescu
Interspeech 2019, 2019
912019
Parsing speech: a neural approach to integrating lexical and acoustic-prosodic information
T Tran, S Toshniwal, M Bansal, K Gimpel, K Livescu, M Ostendorf
NAACL 2018, 2017
82*2017
Generating natural language dialog using a questions corpus
J Ajmera, AK Gupta, S Joshi, S Toshniwal
US Patent 10,049,152, 2018
562018
Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks
S Toshniwal, S Wiseman, A Ettinger, K Livescu, K Gimpel
EMNLP 2020, 2020
552020
Jointly learning to align and convert graphemes to phonemes with neural attention models
S Toshniwal, K Livescu
SLT 2016, 2016
542016
Hierarchical multitask learning for ctc-based speech recognition
K Krishna, S Toshniwal, K Livescu
arXiv preprint arXiv:1807.06234, 2018
532018
A Cross-Task Analysis of Text Span Representations
S Toshniwal, H Shi, B Shi, L Gao, K Livescu, K Gimpel
RepL4NLP 2020, 2020
432020
On Generalization in Coreference Resolution
S Toshniwal, P Xia, S Wiseman, K Livescu, K Gimpel
CRAC@EMNLP 2021, 2021
412021
Chess as a Testbed for Language Model State Tracking
S Toshniwal, S Wiseman, K Livescu, K Gimpel
AAAI 2022 36 (10), 11385-11393, 2022
33*2022
Openmathinstruct-1: A 1.8 million math instruction tuning dataset
S Toshniwal, I Moshkov, S Narenthiran, D Gitman, F Jia, I Gitman
NeurIPS Datasets and Benchmark, 2024
262024
Adapting pretrained text-to-text models for long text sequences
W Xiong, A Gupta, S Toshniwal, Y Mehdad, W Yih
Findings of EMNLP 2023, 2023
222023
Learning to reason and memorize with self-notes
J Lanchantin, S Toshniwal, J Weston, S Sukhbaatar
NeurIPS 2023, 2023
202023
VibRein: an engaging and assistive mobile learning companion for students with intellectual disabilities
S Toshniwal, P Dey, N Rajput, S Srivastava
Proceedings of the annual meeting of the Australian special interest group …, 2015
152015
Nemotron-4 340B Technical Report
B Adler, N Agarwal, A Aithal, DH Anh, P Bhattacharya, A Brundyn, ...
arXiv preprint arXiv:2406.11704, 2024
142024
PeTra: A Sparsely Supervised Memory Model for People Tracking
S Toshniwal, A Ettinger, K Gimpel, K Livescu
ACL 2020, 2020
82020
The system can't perform the operation now. Try again later.
Articles 1–20