Segui
Naohiro Tawara
Naohiro Tawara
NTT Corporation
Email verificata su ieee.org
Titolo
Citata da
Citata da
Anno
Improving speaker discrimination of target speech extraction with time-domain speakerbeam
M Delcroix, T Ochiai, K Zmolikova, K Kinoshita, N Tawara, T Nakatani, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1002020
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds
K Kinoshita, M Delcroix, N Tawara
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
612021
Multi-Channel Speech Enhancement Using Time-Domain Convolutional Denoising Autoencoder.
N Tawara, T Kobayashi, T Ogawa
INTERSPEECH, 86-90, 2019
392019
Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech
K Kinoshita, M Delcroix, N Tawara
arXiv preprint arXiv:2105.09040, 2021
382021
Speaker invariant feature extraction for zero-resource languages with adversarial learning
T Tsuchiya, N Tawara, T Ogawa, T Kobayashi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
362018
Frame-level phoneme-invariant speaker embedding for text-independent speaker recognition on extremely short utterances
N Tawara, A Ogawa, T Iwata, M Delcroix, T Ogawa
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
262020
Age-vox-celeb: Multi-modal corpus for facial and speech estimation
N Tawara, A Ogawa, Y Kitagishi, H Kamiyama
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
212021
Language model domain adaptation via recurrent neural networks with domain-shared and domain-specific representations
T Moriokal, N Tawara, T Ogawa, A Ogawa, T Iwata, T Kobayashi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
102018
Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering
N Tawara, T Ogawa, S Watanabe, T Kobayashi
2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012
92012
Speaker age estimation using age-dependent insensitive loss
Y Kitagishi, H Kamiyama, A Ando, N Tawara, T Mori, S Kobashikawa
2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020
82020
Sequential fish catch forecasting using Bayesian state space models
Y Kokaki, N Tawara, T Kobayashi, K Hashimoto, T Ogawa
2018 24th International Conference on Pattern Recognition (ICPR), 776-781, 2018
72018
A comparative study of spectral clustering for i-vector-based speaker clustering under noisy conditions
N Tawara, T Ogawa, T Kobayashi
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
72015
Speaker Adversarial Training of DPGMM-Based Feature Extractor for Zero-Resource Languages.
Y Higuchi, N Tawara, T Kobayashi, T Ogawa
INTERSPEECH, 266-270, 2019
62019
Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model.
N Tawara, S Watanabe, T Ogawa, T Kobayashi
INTERSPEECH, 2905-2908, 2011
62011
Blstm-based confidence estimation for end-to-end speech recognition
A Ogawa, N Tawara, T Kano, M Delcroix
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
52021
Adversarial autoencoder for reducing nonlinear distortion
N Tawara, T Kobayashi, M Fujieda, K Katagiri, T Yazu, T Ogawa
2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018
52018
Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model.
N Tawara, T Ogawa, S Watanabe, A Nakamura, T Kobayashi
INTERSPEECH, 2166-2169, 2012
52012
A sampling-based speaker clustering using utterance-oriented Dirichlet process mixture model and its evaluation on large-scale data
N Tawara, T Ogawa, S Watanabe, A Nakamura, T Kobayashi
APSIPA Transactions on Signal and Information Processing 4, e16, 2015
42015
Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
M Delcroix, N Tawara, M Diez, F Landini, A Silnova, A Ogawa, T Nakatani, ...
arXiv preprint arXiv:2305.13580, 2023
32023
Language Model Data Augmentation Based on Text Domain Transfer.
A Ogawa, N Tawara, M Delcroix
INTERSPEECH, 4926-4930, 2020
32020
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–20