Yutong Ban
Citata da
Citata da
How to train your deep multi-object tracker
Y Xu, A Osep, Y Ban, R Horaud, L Leal-Taixé, X Alameda-Pineda
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Tracking multiple persons based on a variational bayesian model
Y Ban, S Ba, X Alameda-Pineda, R Horaud
European Conference on Computer Vision, 52-67, 2016
Deepmot: a differentiable framework for training multiple object trackers
Y Xu, Y Ban, X Alameda-Pineda, R Horaud
arXiv preprint arXiv:1906.06618, 2019
Variational Bayesian inference for audio-visual tracking of multiple speakers
Y Ban, X Alameda-Pineda, L Girin, R Horaud
IEEE transactions on pattern analysis and machine intelligence, 2019
Exploiting the complementarity of audio and visual data in multi-speaker tracking
Y Ban, L Girin, X Alameda-Pineda, R Horaud
Proceedings of the IEEE International Conference on Computer Vision …, 2017
Online localization and tracking of multiple moving speakers in reverberant environments
X Li, Y Ban, L Girin, X Alameda-Pineda, R Horaud
IEEE Journal of Selected Topics in Signal Processing 13 (1), 88-103, 2019
Tracking a varying number of people with a visually-controlled robotic head
Y Ban, X Alameda-Pineda, F Badeig, S Ba, R Horaud
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2017
Automated operative phase identification in peroral endoscopic myotomy
TM Ward, DA Hashimoto, Y Ban, DW Rattner, H Inoue, KD Lillemoe, ...
Surgical Endoscopy 35 (7), 4008-4015, 2021
A deep network for arousal-valence emotion prediction with acoustic-visual cues
S Peng, L Zhang, Y Ban, M Fang, S Winkler
arXiv preprint arXiv:1805.00638, 2018
Tracking multiple audio sources with the von mises distribution and variational em
Y Ban, X Alameda-Pineda, C Evers, R Horaud
IEEE Signal Processing Letters 26 (6), 798-802, 2019
Accounting for room acoustics in audio-visual multi-speaker tracking
Y Ban, X Li, X Alameda-Pineda, L Girin, R Horaud
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Computer vision in surgery
TM Ward, P Mascagni, Y Ban, G Rosman, N Padoy, O Meireles, ...
Surgery 169 (5), 1253-1256, 2021
TransCenter: Transformers with Dense Queries for Multiple-Object Tracking
Y Xu, Y Ban, G Delorme, C Gan, D Rus, X Alameda-Pineda
arXiv preprint arXiv:2103.15145, 2021
Challenges in surgical video annotation
TM Ward, DM Fer, Y Ban, G Rosman, OR Meireles, DA Hashimoto
Computer Assisted Surgery 26 (1), 58-68, 2021
Aggregating Long-Term Context for Learning Laparoscopic and Robot-Assisted Surgical Workflows
Y Ban, G Rosman, T Ward, D Hashimoto, T Kondo, H Iwaki, O Meireles, ...
IEEE International Conference on Robotics and Automation (ICRA), 2021, 2021
SAGES consensus recommendations on an annotation framework for surgical video
OR Meireles, G Rosman, MS Altieri, L Carin, G Hager, A Madani, N Padoy, ...
Surgical endoscopy 35 (9), 4918-4929, 2021
Audio-Visual Variational Fusion for Multi-Person Tracking with Robots
X Alameda-Pineda, S Arias, Y Ban, G Delorme, L Girin, R Horaud, X Li, ...
Proceedings of the 27th ACM International Conference on Multimedia, 1059-1061, 2019
A cascaded multiple-speaker localization and tracking system
X Li, Y Ban, L Girin, X Alameda-Pineda, R Horaud
arXiv preprint arXiv:1812.04417, 2018
SUrgical PRediction GAN for Events Anticipation
Y Ban, G Rosman, T Ward, D Hashimoto, T Kondo, H Iwaki, O Meireles, ...
arXiv preprint arXiv:2105.04642, 2021
Enhancing Direct-Path Relative Transfer Function Using Deep Neural Network for Robust Sound Source Localization
B Yang, R Ding, Y Ban, X Li, H Liu
CAAI Transactions on Intelligence Technology, 2021
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–20