Takaaki Hori
Title
Cited by
Cited by
Year
Joint CTC-attention based end-to-end speech recognition using multi-task learning
S Kim, T Hori, S Watanabe
2017 IEEE international conference on acoustics, speech and signal …, 2017
3032017
Espnet: End-to-end speech processing toolkit
S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ...
arXiv preprint arXiv:1804.00015, 2018
2162018
Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition
T Hori, C Hori, Y Minami, A Nakamura
IEEE Transactions on audio, speech, and language processing 15 (4), 1352-1365, 2007
1642007
Attention-based multimodal fusion for video description
C Hori, T Hori, TY Lee, Z Zhang, B Harsham, JR Hershey, TK Marks, ...
Proceedings of the IEEE international conference on computer vision, 4193-4202, 2017
1602017
Hybrid CTC/attention architecture for end-to-end speech recognition
S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi
IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017
1582017
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM
T Hori, S Watanabe, Y Zhang, W Chan
arXiv preprint arXiv:1706.02737, 2017
1532017
Open-vocabulary spoken utterance retrieval using confusion networks
T Hori, IL Hetherington, TJ Hazen, JR Glass
2007 IEEE International Conference on Acoustics, Speech and Signal …, 2007
1172007
Linear prediction-based dereverberation with advanced speech enhancement and recognition technologies for the REVERB challenge
M Delcroix, T Yoshioka, A Ogawa, Y Kubo, M Fujimoto, N Ito, K Kinoshita, ...
Reverb workshop, 2014
1012014
A comparative study on transformer vs RNN in speech applications
S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
922019
Low-latency real-time meeting recognition and understanding using distant microphones and omni-directional camera
T Hori, S Araki, T Yoshioka, M Fujimoto, S Watanabe, T Oba, A Ogawa, ...
IEEE transactions on audio, speech, and language processing 20 (2), 499-513, 2011
892011
Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition
T Hori, C Hori, Y Minami
Eighth International Conference on Spoken Language Processing, 2004
582004
Language independent end-to-end architecture for joint language identification and speech recognition
S Watanabe, T Hori, JR Hershey
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
542017
Multi-channel speech recognition: Lstms all the way through
H Erdogan, T Hayashi, JR Hershey, T Hori, C Hori, WN Hsu, S Kim, ...
CHiME-4 workshop, 1-4, 2016
542016
The MERL/SRI system for the 3rd CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition
T Hori, Z Chen, H Erdogan, JR Hershey, J Le Roux, V Mitra, S Watanabe
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
532015
Strategies for distant speech recognitionin reverberant environments
M Delcroix, T Yoshioka, A Ogawa, Y Kubo, M Fujimoto, N Ito, K Kinoshita, ...
EURASIP Journal on Advances in Signal Processing 2015 (1), 60, 2015
502015
Multichannel end-to-end speech recognition
T Ochiai, S Watanabe, T Hori, JR Hershey
arXiv preprint arXiv:1703.04783, 2017
492017
Student-teacher network learning with enhanced features
S Watanabe, T Hori, J Le Roux, JR Hershey
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
492017
Context adaptive deep neural networks for fast acoustic model adaptation
M Delcroix, K Kinoshita, T Hori, T Nakatani
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
492015
End-to-end audio visual scene-aware dialog using multimodal attention-based video features
C Hori, H Alamri, J Wang, G Wichern, T Hori, A Cherian, TK Marks, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
482019
Duration-controlled LSTM for polyphonic sound event detection
T Hayashi, S Watanabe, T Toda, T Hori, J Le Roux, K Takeda
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (11 …, 2017
462017
The system can't perform the operation now. Try again later.
Articles 1–20