Follow
Abhinav Garg
Abhinav Garg
Verified email at stanford.edu
Title
Cited by
Cited by
Year
Improved Vocal Tract Length Perturbation for a State-of-the-Art End-to-End Speech Recognition System.
C Kim, M Shin, A Garg, D Gowda
Interspeech, 739-743, 2019
452019
A review of on-device fully neural end-to-end automatic speech recognition algorithms
C Kim, D Gowda, D Lee, J Kim, A Kumar, S Kim, A Garg, C Han
ACSSC 2020: Asilomar Conference on Signals, Systems, and Computers, 2020
372020
end-to-end training of a large vocabulary end-to-end speech recognition system
C Kim, S Kim, K Kim, M Kumar, J Kim, K Lee, C Han, A Garg, E Kim, ...
ASRU 2019 : IEEE Workshop on Automatic Speech Recognition & Understanding, 2019
292019
Utterance Confidence Measure for End-to-End Speech Recognition with Applications to Distributed Speech Recognition Scenarios.
A Kumar, S Singh, D Gowda, A Garg, S Singh, C Kim
Interspeech 2020, 4357-4361, 2020
232020
Improved multi-stage training of online attention-based encoder-decoder models
A Garg, D Gowda, A Kumar, K Kim, M Kumar, C Kim
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 70-77, 2019
222019
Multi-Task Multi-Resolution Char-to-BPE Cross-Attention Decoder for End-to-End Speech Recognition.
D Gowda, A Garg, K Kim, M Kumar, C Kim
Interspeech, 2783-2787, 2019
212019
Hierarchical Multi-Stage Word-to-Grapheme Named Entity Corrector for Automatic Speech Recognition.
A Garg, A Gupta, D Gowda, S Singh, C Kim
INTERSPEECH, 1793-1797, 2020
202020
Streaming On-Device End-to-End ASR System for Privacy-Sensitive Voice-Typing.
A Garg, GP Vadisetti, D Gowda, S Jin, A Jayasimha, Y Han, J Kim, J Park, ...
INTERSPEECH, 3371-3375, 2020
172020
Streaming end-to-end speech recognition with jointly trained neural feature enhancement
C Kim, A Garg, D Gowda, S Mun, C Han
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
112021
Utterance Invariant Training for Hybrid Two-Pass End-to-End Speech Recognition.
D Gowda, A Kumar, K Kim, H Yang, A Garg, S Singh, J Kim, M Kumar, ...
Interspeech, 2827-2831, 2020
72020
Voice recognition device and method
C Kim, DN Gowda, S Kim, M Shin, LP Heck, A Garg, KIM Kwangyoun, ...
US Patent App. 17/296,806, 2022
52022
A comparison of streaming models and data augmentation methods for robust speech recognition
J Kim, M Kumar, D Gowda, A Garg, C Kim
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
52021
Method and device for speech recognition
DN Gowda, KIM Kwangyoun, A Garg, C Kim
US Patent 11,302,331, 2022
32022
Self-supervised accent learning for under-resourced accents using native language data
M Kumar, J Kim, D Gowda, A Garg, C Kim
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
22023
Semi-supervised transfer learning for language expansion of end-to-end speech recognition models to low-resource languages
J Kim, M Kumar, D Gowda, A Garg, C Kim
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
22021
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech
A Garg, J Kim, S Khyalia, C Kim, D Gowda
arXiv preprint arXiv:2401.10465, 2024
2024
System and method for modifying speech recognition result
C Kim, DN Gowda, A Garg, K Lee
US Patent 11,521,619, 2022
2022
HiTNet: Byte-to-BPE Hierarchical Transcription Network for End-to-End Speech Recognition
D Gowda, A Garg, J Kim, M Kumar, S Singh, A Gupta, A Kumar, ...
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–18