Yutong Ban
Yutong Ban
Massachusetts Institute of Technology / Massachusetts General Hospital
Verified email at mit.edu - Homepage
Title
Cited by
Cited by
Year
Tracking multiple persons based on a variational bayesian model
Y Ban, S Ba, X Alameda-Pineda, R Horaud
European Conference on Computer Vision, 52-67, 2016
572016
Exploiting the complementarity of audio and visual data in multi-speaker tracking
Y Ban, L Girin, X Alameda-Pineda, R Horaud
Proceedings of the IEEE International Conference on Computer Vision …, 2017
202017
Tracking a varying number of people with a visually-controlled robotic head
Y Ban, X Alameda-Pineda, F Badeig, S Ba, R Horaud
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2017
192017
A deep network for arousal-valence emotion prediction with acoustic-visual cues
S Peng, L Zhang, Y Ban, M Fang, S Winkler
arXiv preprint arXiv:1805.00638, 2018
172018
Variational bayesian inference for audio-visual tracking of multiple speakers
Y Ban, X Alameda-Pineda, L Girin, R Horaud
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019
162019
DeepMOT: A differentiable framework for training multiple object trackers
Y Xu, Y Ban, X Alameda-Pineda, R Horaud
arXiv preprint arXiv:1906.06618, 2019
102019
How To Train Your Deep Multi-Object Tracker
Y Xu, A Osep, Y Ban, R Horaud, L Leal-Taixé, X Alameda-Pineda
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
92020
Tracking multiple audio sources with the von mises distribution and variational em
Y Ban, X Alameda-Pineda, C Evers, R Horaud
IEEE Signal Processing Letters 26 (6), 798-802, 2019
92019
Online localization and tracking of multiple moving speakers in reverberant environments
X Li, Y Ban, L Girin, X Alameda-Pineda, R Horaud
IEEE Journal of Selected Topics in Signal Processing 13 (1), 88-103, 2019
92019
Accounting for room acoustics in audio-visual multi-speaker tracking
Y Ban, X Li, X Alameda-Pineda, L Girin, R Horaud
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
92018
A cascaded multiple-speaker localization and tracking system
X Li, Y Ban, L Girin, X Alameda-Pineda, R Horaud
arXiv preprint arXiv:1812.04417, 2018
12018
Training with Pooled Annotations from Multiple Surgeons Has No Effect on a Deep Learning Artificial Intelligence Model's Performance
TM Ward, D Hashimoto, Y Ban, ER Witkowski, KD Lillemoe, G Rosman, ...
Journal of the American College of Surgeons 231 (4), e203, 2020
2020
Aggregating Long-Term Context for Learning Surgical Workflows
Y Ban, G Rosman, T Ward, D Hashimoto, T Kondo, O Meireles, D Rus
arXiv preprint arXiv:2009.00681, 2020
2020
Automated operative phase identification in peroral endoscopic myotomy
TM Ward, DA Hashimoto, Y Ban, DW Rattner, H Inoue, KD Lillemoe, ...
Surgical Endoscopy, 1-8, 2020
2020
Audio-Visual Variational Fusion for Multi-Person Tracking with Robots
X Alameda-Pineda, S Arias, Y Ban, G Delorme, L Girin, R Horaud, X Li, ...
Proceedings of the 27th ACM International Conference on Multimedia, 1059-1061, 2019
2019
2019 Index IEEE Journal of Selected Topics in Signal Processing Vol. 13
S Adavanne, P Agrawal, N Al-Dhahir, MS Alam, X Alameda-Pineda, ...
IEEE Journal of Selected Topics in Signal Processing 13 (6), 2019
2019
Suivi multi-locuteurs avec information audio-visuel pour la perception du robot
Y Ban
Université Grenoble Alpes (ComUE), 2019
2019
Audio-visual multiple-speaker tracking for robot perception
Y Ban
2019
How To Train Your Deep Multi-Object Tracker Download PDF
Y Xu, A Osep, Y Ban, R Horaud, L Leal-Taixe, X Alameda-Pineda
Supplementary Material: How To Train Your Deep Multi-Object Tracker
Y Xu, A Osep, Y Ban, R Horaud, LLTX Alameda-Pineda
The system can't perform the operation now. Try again later.
Articles 1–20