Xinyuan Qian

Cited by

	All	Since 2019
Citations	674	666
h-index	13	13
i10-index	15	15

Citations per year
Year	Citations
2017	3
2018	4
2019	21
2020	21
2021	34
2022	95
2023	187
2024	306

Public access

13 articles

7 articles

available

not available

Based on funding mandates

Co-authors

Haizhou Li - The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China; NUS, Singapore - Verified email at u.nus.edu
Pan Zexu - Alibaba; MERL; National University of Singapore - Verified email at u.nus.edu
Tao Ruijie - Research Fellow, National University of Singapore - Verified email at u.nus.edu
Alessio Brutti - FBK - Verified email at fbk.eu
Andrea Cavallaro - Director, Idiap Research Institute; Professor, EPFL - Verified email at epfl.ch
Oswald Lanz - Free University of Bozen-Bolzano - Verified email at inf.unibz.it
Alessio Xompero - Queen Mary University of London - Verified email at qmul.ac.uk
Wei Xue - HKUST - Verified email at ust.hk
Hao Tang - Peking University | CMU | ETH Zurich | University of Oxford | University of Trento - Verified email at pku.edu.cn
Maurizio Omologo - Principal Applied Scientist, Amazon Alexa, Italy and USA

Xinyuan Qian

Associate Professor, University of Science and Technology Beijing, China

Verified email at nus.edu.sg

speech processing, multimedia, human robot interaction


Title	Cited by	Year
Is someone speaking? exploring long-term temporal features for audio-visual active speaker detection - R Tao, Z Pan, RK Das, X Qian, MZ Shou, H Li - Proceedings of the 29th ACM international conference on multimedia, 3927-3935, 2021	163	2021
Seeing what you said: Talking face generation guided by a lip reading expert - J Wang, X Qian, M Zhang, RT Tan, H Li - Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	68	2023
Multi-speaker tracking from an audio–visual sensing device - X Qian, A Brutti, O Lanz, M Omologo, A Cavallaro - IEEE Transactions on Multimedia 21 (10), 2576-2588, 2019	61	2019
Multi-target DoA estimation with an audio-visual fusion mechanism - X Qian, M Madhavi, Z Pan, J Wang, H Li - ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	40	2021
3D audio-visual speaker tracking with an adaptive particle filter - X Qian, A Brutti, M Omologo, A Cavallaro - 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017	38	2017
A time-frequency attention module for neural speech enhancement - Q Zhang, X Qian, Z Ni, A Nicolson, E Ambikairajah, H Li - IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 462-475, 2022	32	2022
Audio-visual cross-attention network for robotic speaker tracking - X Qian, Z Wang, J Wang, G Guan, H Li - IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 550-562, 2022	28	2022
Audio-visual tracking of concurrent speakers - X Qian, A Brutti, O Lanz, M Omologo, A Cavallaro - IEEE Transactions on Multimedia 24, 942-954, 2021	27	2021
3D mouth tracking from a compact microphone array co-located with a camera - X Qian, A Xompero, A Cavallaro, A Brutti, O Lanz, M Omologo - 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	24	2018
Speaker extraction with co-speech gestures cue - Z Pan, X Qian, H Li - IEEE Signal Processing Letters 29, 1467-1471, 2022	23	2022
L³ F-TOUCH: A Wireless GelSight with Decoupled Tactile and Three-axis Force Sensing - W Li, M Wang, J Li, Y Su, DK Jha, X Qian, K Althoefer, H Liu - IEEE Robotics and Automation Letters, 2023	20	2023
Mamba in Speech: Towards an Alternative to Self-Attention - X Zhang, Q Zhang, H Liu, T Xiao, X Qian, B Ahmed, E Ambikairajah, H Li, ... - arXiv preprint arXiv:2405.12609, 2024	18	2024
GCC-PHAT with speech-oriented attention for robotic sound source localization - J Wang, X Qian, Z Pan, M Zhang, H Li - 2021 IEEE International Conference on Robotics and Automation (ICRA), 5876-5883, 2021	14	2021
Deep audio-visual beamforming for speaker localization - X Qian, Q Zhang, G Guan, W Xue - IEEE Signal Processing Letters 29, 1132-1136, 2022	12	2022
Predict-and-update network: Audio-visual speech recognition inspired by human speech perception - J Wang, X Qian, H Li - arXiv preprint arXiv:2209.01768, 2022	10	2022
Speech-oriented sparse attention denoising for voice user interface toward industry 5.0 - H Zhu, Q Zhang, P Gao, X Qian - IEEE Transactions on Industrial Informatics 19 (2), 2151-2160, 2022	9	2022
A miniaturised camera-based multi-modal tactile sensor - K Althoefer, Y Ling, W Li, X Qian, WW Lee, P Qi - 2023 IEEE International Conference on Robotics and Automation (ICRA), 12570 …, 2023	7	2023
Neural-Free Attention for Monaural Speech Enhancement Toward Voice User Interface for Consumer Electronics - M Chen, Q Zhang, Q Song, X Qian, R Guo, M Wang, D Chen - IEEE Transactions on Consumer Electronics 69 (4), 765-774, 2023	7	2023
Iterative Sound Source Localization for Unknown Number of Sources - Y Fu, M Ge, H Yin, X Qian, L Wang, G Zhang, J Dang - arXiv preprint arXiv:2206.12273, 2022	7	2022
Device features based on linear transformation with parallel training data for replay speech detection - L Xu, J Yang, CH You, X Qian, D Huang - IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1574-1586, 2023	6	2023
