CNN architectures for large-scale audio classification S Hershey, S Chaudhuri, DPW Ellis, JF Gemmeke, A Jansen, RC Moore, ... 2017 ieee international conference on acoustics, speech and signal …, 2017 | 1673 | 2017 |
Learning the speech front-end with raw waveform CLDNNs T Sainath, RJ Weiss, K Wilson, AW Senior, O Vinyals | 523 | 2015 |
Looking to listen at the cocktail party: A speaker-independent audio-visual model for speech separation A Ephrat, I Mosseri, O Lang, T Dekel, K Wilson, A Hassidim, WT Freeman, ... arXiv preprint arXiv:1804.03619, 2018 | 494 | 2018 |
First Results from CUORE: A Search for Lepton Number Violation via Decay of C Alduino, F Alessandria, K Alfonso, E Andreotti, C Arnaboldi, ... Physical review letters 120 (13), 132501, 2018 | 387 | 2018 |
Speech denoising using nonnegative matrix factorization with priors KW Wilson, B Raj, P Smaragdis, A Divakaran 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 328 | 2008 |
Speech acoustic modeling from raw multichannel waveforms Y Hoshen, RJ Weiss, KW Wilson 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 244 | 2015 |
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ... arXiv preprint arXiv:1810.04826, 2018 | 216 | 2018 |
Multichannel signal processing with deep neural networks for automatic speech recognition TN Sainath, RJ Weiss, KW Wilson, B Li, A Narayanan, E Variani, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (5), 965-979, 2017 | 194 | 2017 |
Improved Limit on Neutrinoless Double-Beta Decay in with CUORE DQ Adams, C Alduino, K Alfonso, FT Avignone III, O Azzolini, G Bari, ... Physical review letters 124 (12), 122501, 2020 | 172 | 2020 |
Acoustic Modeling for Google Home. B Li, TN Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, ... Interspeech, 399-403, 2017 | 153 | 2017 |
Processing multi-channel audio waveforms TN Sainath, RJ Weiss, KW Wilson, AW Senior, A Narayanan, Y Hoshen, ... US Patent 9,697,826, 2017 | 133 | 2017 |
Regularized non-negative matrix factorization with temporal dependencies for speech denoising. KW Wilson, B Raj, P Smaragdis Interspeech, 411-414, 2008 | 118 | 2008 |
Neural network adaptive beamforming for robust multichannel speech recognition B Li, TN Sainath, RJ Weiss, KW Wilson, M Bacchiani | 114 | 2016 |
Universal sound separation I Kavalerov, S Wisdom, H Erdogan, B Patton, K Wilson, J Le Roux, ... 2019 IEEE Workshop on Applications of Signal Processing to Audio and …, 2019 | 113 | 2019 |
Low latency video storyboard delivery with selectable resolution levels NO Krahnstoever, KW Wilson US Patent App. 13/785,913, 2014 | 107 | 2014 |
Visual speech recognition with loosely synchronized feature streams K Saenko, K Livescu, M Siracusa, K Wilson, J Glass, T Darrell Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 2 …, 2005 | 100 | 2005 |
Multiple person and speaker activity tracking with a particle filter N Checka, KW Wilson, MR Siracusa, T Darrell 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 99 | 2004 |
Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms TN Sainath, RJ Weiss, KW Wilson, A Narayanan, M Bacchiani, A Senior 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 80 | 2015 |
Indexing a recording of audiovisual content to enable rich navigation SW Fu, B Keating, S Vedula, K Wilson, S Ahmad US Patent App. 11/282,318, 2007 | 75 | 2007 |
Factored spatial and spectral multichannel raw waveform CLDNNs TN Sainath, RJ Weiss, KW Wilson, A Narayanan, M Bacchiani 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 73 | 2016 |