Ego4d: Around the world in 3,000 hours of egocentric video K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 805 | 2022 |
Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition and Alignment S Deena, M Hasan, M Doulaty, O Saz, T Hain IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 27 (3 …, 2019 | 35 | 2019 |
Combining feature and model-based adaptation of RNNLMs for multi-genre broadcast speech recognition S Deena, M Hasan, M Doulaty, O Saz, T Hain Proceedings of the Annual Conference of the International Speech …, 2016 | 30 | 2016 |
The 2015 Sheffield System for Transcription of Multi-Genre Broadcast Media O Saz, M Doulaty, S Deena, R Milner, RWM Ng, M Hasan, Y Liu, T Hain IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015), 2015 | 25 | 2015 |
Data-Selective Transfer Learning for Multi-Domain Speech Recognition M Doulaty, O Saz, T Hain Sixteenth Annual Conference of the International Speech Communication …, 2015 | 21 | 2015 |
Timetabling: A State-of-the-Art Evolutionary Approach M Doulaty, MRF Derakhshi, M Abdi International Journal of Machine Learning and Computing 3 (3), 255, 2013 | 20 | 2013 |
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision X Liu, E Lakomkin, K Vougioukas, P Ma, H Chen, R Xie, M Doulaty, ... Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 16 | 2023 |
Automatic Genre and Show Identification of Broadcast Media M Doulaty, O Saz, RWM Ng, T Hain 17th Annual Conference of the International Speech Communication Association …, 2016 | 16 | 2016 |
Unsupervised Domain Discovery Using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition M Doulaty, O Saz, T Hain Sixteenth Annual Conference of the International Speech Communication …, 2015 | 16 | 2015 |
The 2015 Sheffield system for longitudinal diarisation of broadcast media R Milner, O Saz, S Deena, M Doulaty, RWM Ng, T Hain 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 15 | 2015 |
Automatic Optimization of Data Perturbation Distributions for Multi-Style Training in Speech Recognition M Doulaty, R Rose, O Siohan IEEE Workshop on Spoken Language Technology (SLT), 2016 | 14 | 2016 |
Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation M Doulaty, O Saz, RWM Ng, T Hain IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015), 2015 | 14 | 2015 |
The USFD spoken language translation system for IWSLT 2014 RWM Ng, M Doulaty, R Doddipatla, W Aziz, K Shah, O Saz, M Hasan, ... Proc. IWSLT, 86-91, 2014, 2015 | 11 | 2015 |
Long-term Statistical Feature Extraction from Speech Signal and its Application in Emotion Recognition E Loweimi, M Doulaty, J Barker, T Hain 3rd International Conference on Statistical Language and Speech Processing …, 2015 | 10 | 2015 |
Background-tracking acoustic features for genre identification of broadcast shows O Saz, M Doulaty, T Hain 2014 IEEE Spoken Language Technology Workshop (SLT), 118-123, 2014 | 10 | 2014 |
The Sheffield language recognition system in NIST LRE 2015 RWM Ng, M Nicolao, O Saz, M Hasan, B Chettri, M Doulaty, T Lee, T Hain The Speaker and Language Recognition Workshop Odyssey 2016, 2016 | 8 | 2016 |
Lightly supervised alignment of subtitles on multi-genre broadcasts O Saz, S Deena, M Doulaty, M Hasan, B Khaliq, R Milner, RWM Ng, ... Multimedia Tools and Applications 77 (23), 30533-30550, 2018 | 5 | 2018 |
webASR 2-Improved Cloud Based Speech Technology. T Hain, J Christian, O Saz, S Deena, M Hasan, RWM Ng, R Milner, ... INTERSPEECH, 1613-1617, 2016 | 5 | 2016 |
Latent Dirichlet Allocation Based Acoustic Data Selection for Automatic Speech Recognition M Doulaty, T Hain Interspeech 2019, 2019 | 4 | 2019 |
Speech ReaLLM--Real-time Streaming Speech Recognition with Multimodal LLMs by Teaching the Flow of Time F Seide, M Doulaty, Y Shi, Y Gaur, J Jia, C Wu arXiv preprint arXiv:2406.09569, 2024 | 3 | 2024 |