Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... arXiv preprint arXiv:1712.05884, 2017 | 1642 | 2017 |
TACOTRON: TOWARDS END-TO-END SPEECH SYN Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135, 2017 | 1433* | 2017 |
A leaf recognition algorithm for plant classification using probabilistic neural network SG Wu, FS Bao, EY Xu, YX Wang, YF Chang, QL Xiang Signal Processing and Information Technology, 2007 IEEE International …, 2007 | 1011 | 2007 |
On training targets for supervised speech separation Y Wang, A Narayanan, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 …, 2014 | 898 | 2014 |
Complex ratio masking for monaural speech separation DS Williamson, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (3), 483-492, 2016 | 494 | 2016 |
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis Y Wang, D Stanton, Y Zhang, RJ Skerry-Ryan, E Battenberg, J Shor, ... arXiv preprint arXiv:1803.09017, 2018 | 487 | 2018 |
Towards scaling up classification-based speech separation Y Wang, DL Wang IEEE Transactions on Audio, Speech, and Language Processing 21 (7), 1381-1390, 2013 | 451 | 2013 |
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron RJ Skerry-Ryan, E Battenberg, Y Xiao, Y Wang, D Stanton, J Shor, ... arXiv preprint arXiv:1803.09047, 2018 | 389 | 2018 |
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition Q Kong, Y Cao, T Iqbal, Y Wang, W Wang, MD Plumbley arXiv preprint arXiv:1912.10211, 2019 | 273 | 2019 |
An algorithm to improve speech recognition in noise for hearing-impaired listeners EW Healy, SE Yoho, Y Wang, DL Wang The Journal of the Acoustical Society of America 134 (4), 3029-3038, 2013 | 232 | 2013 |
Learning spectral mapping for speech dereverberation and denoising K Han, Y Wang, DL Wang, WS Woods, I Merks, T Zhang IEEE Transactions on Audio, Speech, and Language Processing 23 (6), 982-992, 2015 | 229 | 2015 |
Exploring monaural features for classification-based speech segregation Y Wang, K Han, DL Wang IEEE Transactions on Audio, Speech, and Language Processing 21 (2), 270-279, 2013 | 214 | 2013 |
A feature study for classification-based speech separation at low signal-to-noise ratios J Chen, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 …, 2014 | 197 | 2014 |
A feature study for classification-based speech separation at low signal-to-noise ratios J Chen, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 …, 2014 | 197 | 2014 |
Hierarchical Generative Modeling for Controllable Speech Synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... arXiv preprint arXiv:1810.07217, 2018 | 166 | 2018 |
Robust speaker identification in noisy and reverberant conditions X Zhao, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 (4 …, 2014 | 155 | 2014 |
Large-scale training to increase speech intelligibility for hearing-impaired listeners in novel noises J Chen, Y Wang, SE Yoho, DL Wang, EW Healy The Journal of the Acoustical Society of America 139 (5), 2604-2612, 2016 | 147 | 2016 |
A deep neural network for time-domain signal reconstruction Y Wang, DL Wang Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International …, 2015 | 104 | 2015 |
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization WN Hsu, Y Zhang, RJ Weiss, YA Chung, Y Wang, Y Wu, J Glass | 94 | 2018 |
Trainable frontend for robust and far-field keyword spotting Y Wang, P Getreuer, T Hughes, RF Lyon, RA Saurous Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International …, 2017 | 94 | 2017 |