Sourish Chaudhuri
Google Inc, Carnegie Mellon University
Verified email at google.com
Title
Cited by
Year
CNN architectures for large-scale audio classification
S Hershey, S Chaudhuri, DPW Ellis, JF Gemmeke, A Jansen, RC Moore, ...
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
Cited by: 3157; Year: 2017
AVA Active Speaker: An audio-visual dataset for active speaker detection
J Roth, S Chaudhuri, O Klejch, R Marvin, A Gallagher, L Kaver, ...
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 189; Year: 2020
Non-negative matrix factorization based compensation of music for automatic speech recognition.
B Raj, T Virtanen, S Chaudhuri, R Singh
Interspeech, 717-720, 2010
Cited by: 158; Year: 2010
Associating faces with voices for speaker diarization within videos
S Chaudhuri, K Hoover
US Patent 10,497,382, 2019
Cited by: 82; Year: 2019
Audio event detection from acoustic unit occurrence patterns
A Kumar, P Dighe, R Singh, S Chaudhuri, B Raj
2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012
Cited by: 76; Year: 2012
Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification.
S Chaudhuri, M Harvilla, B Raj
Interspeech, 2265-2268, 2011
Cited by: 76; Year: 2011
Engaging collaborative learners with helping agents
S Chaudhuri, R Kumar, I Howley, CP Rosé
Artificial Intelligence in Education, 365-372, 2009
Cited by: 56; Year: 2009
AVA-Speech: A densely labeled dataset of speech activity in movies
S Chaudhuri, J Roth, DPW Ellis, A Gallagher, L Kaver, R Marvin, ...
arXiv preprint arXiv:1808.00606, 2018
Cited by: 49; Year: 2018
Using audio-visual information to understand speaker activity: Tracking active speakers on and off screen
K Hoover, S Chaudhuri, C Pantofaru, I Sturdy, M Slaney
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 49*; Year: 2018
Unsupervised structure discovery for semantic analysis of audio
S Chaudhuri, B Raj
Advances in Neural Information Processing Systems 25, 2012
Cited by: 34; Year: 2012
It’s not easy being green: Supporting collaborative “green design” learning
S Chaudhuri, R Kumar, M Joshi, E Terrell, F Higgs, V Aleven, ...
Intelligent Tutoring Systems: 9th International Conference, ITS 2008 …, 2008
Cited by: 33; Year: 2008
An HMM based part-of-speech tagger and statistical chunker for 3 Indian languages
GMR Sastry, S Chaudhuri, PN Reddy
Shallow Parsing for South Asian Languages 13, 2007
Cited by: 26; Year: 2007
Unsupervised hierarchical structure induction for deeper semantic analysis of audio
S Chaudhuri, B Raj
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
Cited by: 24; Year: 2013
Unsupervised word discovery from phonetic input using nested Pitman-Yor language modeling
O Walter, R Haeb-Umbach, S Chaudhuri, B Raj
ICRA Workshop on Autonomous Learning, 2013
Cited by: 22; Year: 2013
Exploiting Temporal Sequence Structure for Semantic Analysis of Multimedia.
S Chaudhuri, R Singh, B Raj
INTERSPEECH, 1728-1731, 2012
Cited by: 19; Year: 2012
Structured Models for Semantic Analysis of Audio Content
S Chaudhuri
PhD thesis, Carnegie Mellon University. 46, 47, 2013
Cited by: 17*; Year: 2013
Automatic smoothed captioning of non-speech sounds from audio
F Wang, S Chaudhuri, D Ellis, N Reale
US Patent 10,037,313, 2018
Cited by: 16; Year: 2018
Learning contextual relevance of audio segments using discriminative models over AUD sequences
S Chaudhuri, B Raj
2011 IEEE Workshop on Applications of Signal Processing to Audio and …, 2011
Cited by: 16; Year: 2011
Helping agents in VMT
Y Cui, R Kumar, S Chaudhuri, G Gweon, CP Rosé
Studying virtual math teams, 335-354, 2009
Cited by: 16; Year: 2009
VMT-Basilica: an environment for rapid prototyping of collaborative learning environments with dynamic support.
R Kumar, S Chaudhuri, IK Howley, CP Rosé
CSCL (2), 192-194, 2009
Cited by: 14; Year: 2009