Yao Qian
Title
Cited by
Year
TTS synthesis with bidirectional LSTM based recurrent neural networks
Y Fan, Y Qian, FL Xie, FK Soong
Fifteenth annual conference of the international speech communication …, 2014
Cited by: 459; Year: 2014
On the training aspects of deep neural network (DNN) for parametric TTS synthesis
Y Qian, Y Fan, W Hu, FK Soong
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
Cited by: 187; Year: 2014
Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers
W Hu, Y Qian, FK Soong, Y Wang
Speech Communication 67, 154-166, 2015
Cited by: 145; Year: 2015
Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis
Y Fan, Y Qian, FK Soong, L He
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
Cited by: 109; Year: 2015
Locating boundaries for prosodic constituents in unrestricted Mandarin texts
M Chu, Y Qian
International Journal of Computational Linguistics & Chinese Language …, 2001
Cited by: 97; Year: 2001
A unified tagging solution: Bidirectional LSTM recurrent neural network with word embedding
P Wang, Y Qian, FK Soong, L He, H Zhao
arXiv preprint arXiv:1511.00215, 2015
Cited by: 95; Year: 2015
Part-of-speech tagging with bidirectional long short-term memory recurrent neural network
P Wang, Y Qian, FK Soong, L He, H Zhao
arXiv preprint arXiv:1510.06168, 2015
Cited by: 95; Year: 2015
A report on the 2017 native language identification shared task
S Malmasi, K Evanini, A Cahill, J Tetreault, R Pugh, C Hamill, ...
Proceedings of the 12th Workshop on Innovative Use of NLP for Building …, 2017
Cited by: 72; Year: 2017
A new DNN-based high quality pronunciation evaluation for computer-aided language learning (CALL).
W Hu, Y Qian, FK Soong
Interspeech, 1886-1890, 2013
Cited by: 72; Year: 2013
A cross-language state sharing and mapping approach to bilingual (Mandarin–English) TTS
Y Qian, H Liang, FK Soong
IEEE Transactions on Audio, Speech, and Language Processing 17 (6), 1231-1239, 2009
Cited by: 71; Year: 2009
An HMM-based Mandarin Chinese text-to-speech system
Y Qian, F Soong, Y Chen, M Chu
International Symposium on Chinese Spoken Language Processing, 223-232, 2006
Cited by: 68; Year: 2006
Using bidirectional LSTM recurrent neural networks to learn high-level abstractions of sequential features for automated scoring of non-native spontaneous speech
Z Yu, V Ramanarayanan, D Suendermann-Oeft, X Wang, K Zechner, ...
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
Cited by: 65; Year: 2015
Segmenting unrestricted Chinese text into prosodic words instead of lexical words
Y Qian, M Chu, H Peng
2001 IEEE International Conference on Acoustics, Speech, and Signal …, 2001
Cited by: 62; Year: 2001
A frame mapping based HMM approach to cross-lingual voice transformation
Y Qian, J Xu, FK Soong
2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011
Cited by: 53; Year: 2011
A unified trajectory tiling approach to high quality speech rendering
Y Qian, FK Soong, ZJ Yan
IEEE Transactions on Audio, Speech, and Language Processing 21 (2), 280-290, 2012
Cited by: 52; Year: 2012
Word embedding for recurrent neural network based TTS synthesis
P Wang, Y Qian, FK Soong, L He, H Zhao
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
Cited by: 51; Year: 2015
Automatic prosody prediction and detection with Conditional Random Field (CRF) models
Y Qian, Z Wu, X Ma, F Soong
2010 7th International Symposium on Chinese Spoken Language Processing, 135-138, 2010
Cited by: 45; Year: 2010
Improved prosody generation by maximizing joint probability of state and longer units
Y Qian, Z Wu, B Gao, FK Soong
IEEE Transactions on Audio, Speech, and Language Processing 19 (6), 1702-1710, 2010
Cited by: 44; Year: 2010
Method and apparatus for identifying prosodic word boundaries
M Chu, Y Qian
US Patent 7,263,488, 2007
Cited by: 44; Year: 2007
Exploring ASR-free end-to-end modeling to improve spoken language understanding in a cloud-based dialog system
Y Qian, R Ubale, V Ramanarayanan, P Lange, D Suendermann-Oeft, ...
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
Cited by: 42; Year: 2017