TUT database for acoustic scene classification and sound event detection A Mesaros, T Heittola, T Virtanen 2016 24th European Signal Processing Conference (EUSIPCO), 1128-1132, 2016 | 471 | 2016 |
DCASE 2017 challenge setup: Tasks, datasets and baseline system A Mesaros, T Heittola, A Diment, B Elizalde, A Shah, E Vincent, B Raj, ... DCASE 2017-Workshop on Detection and Classification of Acoustic Scenes and …, 2017 | 377 | 2017 |
Metrics for polyphonic sound event detection A Mesaros, T Heittola, T Virtanen Applied Sciences 6 (6), 162, 2016 | 362 | 2016 |
Acoustic event detection in real life recordings A Mesaros, T Heittola, A Eronen, T Virtanen 2010 18th European Signal Processing Conference, 1267-1271, 2010 | 289 | 2010 |
Context-dependent sound event detection T Heittola, A Mesaros, A Eronen, T Virtanen EURASIP Journal on Audio, Speech, and Music Processing 2013 (1), 1-13, 2013 | 208 | 2013 |
Detection and classification of acoustic scenes and events: Outcome of the DCASE 2016 challenge A Mesaros, T Heittola, E Benetos, P Foster, M Lagrange, T Virtanen, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (2), 379-393, 2017 | 202 | 2017 |
A multi-device dataset for urban acoustic scene classification A Mesaros, T Heittola, T Virtanen arXiv preprint arXiv:1807.09840, 2018 | 187 | 2018 |
Sound event detection in multisource environments using source separation T Heittola, A Mesaros, T Virtanen, A Eronen Machine Listening in Multisource Environments, 2011 | 126 | 2011 |
Singer identification in polyphonic music using vocal separation and pattern recognition methods. A Mesaros, T Virtanen, A Klapuri ISMIR, 375-378, 2007 | 120 | 2007 |
Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations A Mesaros, T Heittola, O Dikmen, T Virtanen 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 99 | 2015 |
Automatic recognition of lyrics in singing A Mesaros, T Virtanen EURASIP Journal on Audio Speech and Music Processing 2010, 33, 2009 | 97 | 2009 |
Combining pitch-based inference and non-negative spectrogram factorization in separating vocals from polyphonic music. T Virtanen, A Mesaros, M Ryynänen SAPA@ INTERSPEECH, 17-22, 2008 | 95 | 2008 |
Audio context recognition using audio event histograms T Heittola, A Mesaros, A Eronen, T Virtanen 2010 18th European Signal Processing Conference, 1272-1276, 2010 | 86 | 2010 |
Supervised model training for overlapping sound events based on unsupervised source separation T Heittola, A Mesaros, T Virtanen, M Gabbouj 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 72 | 2013 |
Sound event detection in the DCASE 2017 challenge A Mesaros, A Diment, B Elizalde, T Heittola, E Vincent, B Raj, T Virtanen IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (6), 992-1006, 2019 | 50 | 2019 |
Sound event detection using non-negative dictionaries learned from annotated overlapping events O Dikmen, A Mesaros 2013 IEEE Workshop on Applications of Signal Processing to Audio and …, 2013 | 50 | 2013 |
Latent semantic analysis in sound event detection A Mesaros, T Heittola, A Klapuri 2011 19th European Signal Processing Conference, 1307-1311, 2011 | 50 | 2011 |
Acoustic scene classification: an overview of DCASE 2017 Challenge entries A Mesaros, T Heittola, T Virtanen 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC …, 2018 | 46 | 2018 |
Automatic alignment of music audio and lyrics A Mesaros, T Virtanen Proceedings of the 11th Int. Conference on Digital Audio Effects (DAFx-08), 2008 | 43 | 2008 |
The Mel-Frequency Cepstral Coefficients in the Context of Singer Identification. A Mesaros, J Astola ISMIR, 610-613, 2005 | 42 | 2005 |