AudioLM: a language modeling approach to audio generation Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 2523-2533, 2023 | 458 | 2023 |
Fréchet audio distance: A metric for evaluating music enhancement algorithms K Kilgour, M Zuluaga, D Roblek, M Sharifi arXiv preprint arXiv:1812.08466, 2018 | 320 | 2018 |
Efficient utterance-specific endpointer triggering for always-on hotwording M Sharifi, D Roblek, S Siddhartha US Patent 8,775,191, 2014 | 168 | 2014 |
Speaker identification using a text-independent model and a text-dependent model M Sharifi, D Roblek US Patent 10,255,922, 2019 | 121 | 2019 |
SPICE: Self-supervised pitch estimation B Gfeller, C Frank, D Roblek, M Sharifi, M Tagliasacchi, M Velimirović IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1118-1128, 2020 | 93 | 2020 |
Text-dependent speaker identification D Roblek, M Sharifi, RA Guevara US Patent 9,542,948, 2017 | 92 | 2017 |
Large-scale speaker identification M Sharifi, D Roblek US Patent 9,123,330, 2015 | 76 | 2015 |
SEANet: A multi-modal speech enhancement network M Tagliasacchi, Y Li, K Misiunas, D Roblek arXiv preprint arXiv:2009.02095, 2020 | 58 | 2020 |
Training keyword spotters with limited and synthesized speech data J Lin, K Kilgour, D Roblek, M Sharifi ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 58 | 2020 |
Pre-training audio representations with self-supervision M Tagliasacchi, B Gfeller, F de Chaumont Quitry, D Roblek IEEE Signal Processing Letters 27, 600-604, 2020 | 58 | 2020 |
Now playing: Continuous low-power music recognition BA y Arcas, B Gfeller, R Guo, K Kilgour, S Kumar, J Lyon, J Odell, M Ritter, ... arXiv preprint arXiv:1711.10958 [cs, eess], 2017 | 51* | 2017 |
Real-time speech frequency bandwidth extension Y Li, M Tagliasacchi, O Rybakov, V Ungureanu, D Roblek ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 49 | 2021 |
Self-supervised audio representation learning for mobile devices M Tagliasacchi, B Gfeller, FC Quitry, D Roblek arXiv preprint arXiv:1905.11796, 2019 | 49 | 2019 |
Dual model speaker identification M Sharifi, D Roblek US Patent 9,711,148, 2017 | 43 | 2017 |
Container for welding wire C Gelmetti US Patent 6,938,767, 2005 | 43 | 2005 |
One-shot conditional audio filtering of arbitrary sounds B Gfeller, D Roblek, M Tagliasacchi ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 41 | 2021 |
Frequency based audio analysis using neural networks D Roblek, M Sharifi US Patent 10,460,747, 2019 | 36 | 2019 |
Aggregation of related media content Y Matias, M Sharifi, T Bugnon, D Roblek, A Chen US Patent 9,159,364, 2015 | 35 | 2015 |
Personalized entity repository M Sharifi, J Pereira, D Roblek, J Odell, C Li, D Petrou US Patent 10,178,527, 2019 | 31 | 2019 |
Inverted client-side fingerprinting and matching M Wiseman, M Sharifi, Y Bernstein, A Chen, D Roblek US Patent 9,113,202, 2015 | 31 | 2015 |