Tie your embeddings down: Cross-modal latent spaces for end-to-end spoken language understanding B Agrawal, M Müller, S Choudhary, M Radfar, A Mouchtaris, R McGowan, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 39 | 2022 |
Convrnn-t: Convolutional augmented recurrent neural network transducers for streaming speech recognition M Radfar, R Barnwal, RV Swaminathan, FJ Chang, GP Strimel, N Susanj, ... arXiv preprint arXiv:2209.14868, 2022 | 14 | 2022 |
Revisiting pretraining with adapters S Kim, A Shum, N Susanj, J Hilgart Proceedings of the 6th Workshop on Representation Learning for NLP (RepL4NLP …, 2021 | 13 | 2021 |
Knowledge distillation via module replacing for automatic speech recognition with recurrent neural network transducer K Zhao, HD Nguyen, A Jain, N Susanj, A Mouchtaris, L Gupta, M Zhao 23rd Interspeech Conference, 2022 | 9 | 2022 |
Attentive contextual carryover for multi-turn end-to-end spoken language understanding K Wei, T Tran, FJ Chang, KM Sathyendra, T Muniyappa, J Liu, A Raju, ... 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 9 | 2021 |
Sub-8-bit quantization for on-device speech recognition: A regularization-free approach K Zhen, M Radfar, H Nguyen, GP Strimel, N Susanj, A Mouchtaris 2022 IEEE Spoken Language Technology Workshop (SLT), 15-22, 2023 | 7 | 2023 |
Sub-8-bit quantization aware training for 8-bit neural network accelerator with on-device speech recognition K Zhen, HD Nguyen, R Chinta, N Susanj, A Mouchtaris, T Afzal, ... arXiv preprint arXiv:2206.15408, 2022 | 7 | 2022 |
Gated contextual adapters for selective contextual biasing in neural transducers A Alexandridis, KM Sathyendra, GP Strimel, FJ Chang, A Rastrow, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |
Conmer: Streaming Conformer without self-attention for interactive voice assistants M Radfar, P Lyskawa, B Trujillo, Y Xie, K Zhen, J Heymann, D Filimonov, ... | 5 | 2023 |
Adaptive global-local context fusion for multi-turn spoken language understanding T Tran, K Wei, W Ruan, R McGowan, N Susanj, GP Strimel Proceedings of the AAAI Conference on Artificial Intelligence 36 (11), 12622 …, 2022 | 5 | 2022 |
A neural prosody encoder for end-to-end dialogue act classification K Wei, D Knox, M Radfar, T Tran, M Müller, GP Strimel, N Susanj, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 4 | 2022 |
Comparing data augmentation and annotation standardization to improve end-to-end spoken language understanding models L Nicolich-Henkin, T Nakatani, Z Trozenski, J Whiteman, N Susanj | 2 | 2021 |
Accelerator-aware training for transducer-based speech recognition SM Shakiah, RV Swaminathan, HD Nguyen, R Chinta, T Afzal, N Susanj, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 100-107, 2023 | 1 | 2023 |
Max-margin transducer loss: Improving sequence-discriminative training using a large-margin learning strategy RV Swaminathan, GP Strimel, A Rastrow, H Mallidi, K Zhen, HD Nguyen, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Multilingual end-to-end spoken language understanding for ultra-low footprint applications M Müller, A Alexandridis, Z Trozenski, J Whiteman, G Strimel, N Susanj, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |