A comparative study on transformer vs rnn in speech applications S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ... 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 162 | 2019 |
Acoustic-to-word attention-based model complemented with character-level CTC-based model S Ueno, H Inaguma, M Mimura, T Kawahara 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 36 | 2018 |
ESPnet-ST: All-in-one speech translation toolkit H Inaguma, S Kiyono, K Duh, S Karita, NEY Soplin, T Hayashi, ... arXiv preprint arXiv:2004.10234, 2020 | 22 | 2020 |
Multilingual end-to-end speech translation H Inaguma, K Duh, T Kawahara, S Watanabe 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 19 | 2019 |
Transfer learning of language-independent end-to-end ASR with language model fusion H Inaguma, J Cho, MK Baskar, T Kawahara, S Watanabe ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 18 | 2019 |
Leveraging sequence-to-sequence speech synthesis for enhancing acoustic-to-word speech recognition M Mimura, S Ueno, H Inaguma, S Sakai, T Kawahara 2018 IEEE Spoken Language Technology Workshop (SLT), 477-484, 2018 | 15 | 2018 |
Minimum latency training strategies for streaming sequence-to-sequence ASR H Inaguma, Y Gaur, L Lu, J Li, Y Gong ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 14 | 2020 |
Social Signal Detection in Spontaneous Dialogue Using Bidirectional LSTM-CTC H Inaguma, K Inoue, M Mimura, T Kawahara INTERSPEECH, 1691-1695, 2017 | 8 | 2017 |
Prediction of ice-breaking between participants using prosodic features in the first meeting dialogue H Inaguma, K Inoue, S Nakamura, K Takanashi, T Kawahara Proceedings of the 2nd Workshop on Advancements in Social Signal Processing …, 2016 | 7 | 2016 |
Language model integration based on memory control for sequence to sequence speech recognition J Cho, S Watanabe, T Hori, MK Baskar, H Inaguma, J Villalba, N Dehak ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 5 | 2019 |
The JHU/KyotoU speech translation system for IWSLT 2018 H Inaguma, X Zhang, Z Wang, A Renduchintala, S Watanabe, K Duh International Workshop on Spoken Language Translation, 153-159, 2018 | 4 | 2018 |
Enhancing monotonic multihead attention for streaming asr H Inaguma, M Mimura, T Kawahara arXiv preprint arXiv:2005.09394, 2020 | 3 | 2020 |
Recent Developments on ESPnet Toolkit Boosted by Conformer P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ... arXiv preprint arXiv:2010.13956, 2020 | 2 | 2020 |
Improving OOV detection and resolution with external language models in acoustic-to-word ASR H Inaguma, M Mimura, S Sakai, T Kawahara 2018 IEEE Spoken Language Technology Workshop (SLT), 212-218, 2018 | 2 | 2018 |
Improved Mask-CTC for Non-Autoregressive End-to-End ASR Y Higuchi, H Inaguma, S Watanabe, T Ogawa, T Kobayashi arXiv preprint arXiv:2010.13270, 2020 | 1 | 2020 |
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder H Inaguma, Y Higuchi, K Duh, T Kawahara, S Watanabe arXiv preprint arXiv:2010.13047, 2020 | 1 | 2020 |
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR H Futami, H Inaguma, S Ueno, M Mimura, S Sakai, T Kawahara arXiv preprint arXiv:2008.03822, 2020 | 1 | 2020 |
CTC-synchronous training for monotonic attention model H Inaguma, M Mimura, T Kawahara arXiv preprint arXiv:2005.04712, 2020 | 1 | 2020 |
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ... arXiv preprint arXiv:2012.13006, 2020 | | 2020 |
End-to-end speech-to-dialog-act recognition VT Dang, T Zhao, S Ueno, H Inaguma, T Kawahara arXiv preprint arXiv:2004.11419, 2020 | | 2020 |