Yaodong Yang

Cited by

	All	Since 2019
Citations	5432	5312
h-index	35	35
i10-index	66	65

Citations per year

Year	Citations
2017	30
2018	81
2019	173
2020	317
2021	551
2022	883
2023	1561
2024	1813

Public access (based on funding mandates)

30 articles available
0 articles not available

Co-authors

Jun Wang - Professor, Computer Science, University College London - Verified email at cs.ucl.ac.uk
Ying Wen - Associate Professor, Shanghai Jiao Tong University - Verified email at sjtu.edu.cn
Weinan Zhang - Associate Professor, Shanghai Jiao Tong University - Verified email at sjtu.edu.cn
Jiaming Ji (吉嘉铭) - Peking University - Verified email at stu.pku.edu.cn
David Mguni - Lecturer, Computer Science, Queen Mary University of London - Verified email at qmul.ac.uk
Stephen McAleer - OpenAI - Verified email at openai.com
Jakub Grudzien Kuba - UC Berkeley - Verified email at berkeley.edu
Nicolas Perez-Nieves - Research Engineer, DeepMind - Verified email at google.com
Haitham Bou-Ammar - RL-Team Leader, BO-Team Leader, MAS-Team Leader @ Huawei London & H. Assistant Professor @ UCL - Verified email at huawei.com
Xiaotie Deng - Chair Professor of Computer Science, Peking University, Beijing, China - Verified email at pku.edu.cn
Jieping Ye, IEEE Fellow & ACM Distin... - Alibaba Group - Verified email at umich.edu
Matthew E. Taylor - Professor, University of Alberta - Verified email at ualberta.ca

Yaodong Yang

BOYA (博雅) Assistant Professor at Peking University

Verified email at pku.edu.cn

AI Alignment, Reinforcement Learning, Multi-Agent Reinforcement Learning, Game Theory


Title	Cited by	Year
Mean field multi-agent reinforcement learning. Y Yang, R Luo, M Li, M Zhou, W Zhang, J Wang. ICML 2018, Long Talk, 5571-5580, 2018	749	2018
Multiagent bidirectionally-coordinated nets: Emergence of human-level coordination in learning to play StarCraft combat games. P Peng, Y Wen, Y Yang, Q Yuan, Z Tang, H Long, J Wang. NeurIPS 2017 Workshop: Emergent Communication, 2017	573	2017
An Overview of Multi-Agent Reinforcement Learning from Game Theoretical Perspective. Y Yang, J Wang. arXiv preprint arXiv:2011.00583, 2020	292	2020
Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning. M Li, Y Jiao, T Qin, Y Yang, Z Gong, J Wang, C Wang, G Wu, J Ye. WWW 2019 (oral), 2019	283	2019
Baichuan 2: Open Large-scale Language Models. A Yang, B Xiao, B Wang, B Zhang, C Yin, C Lv, D Pan, D Wang, D Yan, ... arXiv preprint arXiv:2309.10305, 2023	272*	2023
A Review of Safe Reinforcement Learning: Methods, Theory and Applications. S Gu, L Yang, Y Du, G Chen, F Walter, J Wang, Y Yang, A Knoll. arXiv preprint arXiv:2205.10330, 2022	206	2022
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning. JG Kuba, R Chen, M Wen, Y Wen, F Sun, J Wang, Y Yang. ICLR 2022, 2021	198	2021
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving. M Zhou, J Luo, J Villela, Y Yang, D Rusu, J Miao, W Zhang, M Alban, ... Conference on Robot Learning 2020 (Best System Paper Award), 2020	186*	2020
Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning. Y Wen, Y Yang, R Luo, J Wang, W Pan. ICLR 2019, 2019	160	2019
BeaverTails: Towards improved safety alignment of LLM via a human-preference dataset. J Ji, M Liu, J Dai, X Pan, C Zhang, C Bian, R Sun, Y Wang, Y Yang. NeurIPS 2023, 2023	136	2023
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem. M Wen, JG Kuba, R Lin, W Zhang, Y Wen, J Wang, Y Yang. NeurIPS 2022, 2022	134	2022
Can deep learning predict risky retail investors? A case study in financial risk behavior forecasting. A Kim, Y Yang, S Lessmann, T Ma, MC Sung, JEV Johnson. European Journal of Operational Research 283 (1), 217-234, 2020	113	2020
AI alignment: A comprehensive survey. J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang, Y Duan, Z He, J Zhou, ... arXiv preprint arXiv:2310.19852, 2023	100	2023
Safe RLHF: Safe Reinforcement Learning from Human Feedback. J Dai, X Pan, R Sun, J Ji, X Xu, M Liu, Y Wang, Y Yang. arXiv preprint arXiv:2310.12773, 2023	90	2023
Bi-level Actor-Critic for Multi-agent Coordination. H Zhang, W Chen, Z Huang, M Li, Y Yang, W Zhang, J Wang. AAAI 2020, 2019	89	2019
Multi-Agent Determinantal Q-Learning. Y Yang, Y Wen, L Chen, J Wang, K Shao, D Mguni, W Zhang. ICML 2020, 2020	74	2020
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning. Y Chen, Y Yang, T Wu, S Wang, X Feng, J Jiang, SM McAleer, H Dong, ... NeurIPS 2022, 2022	72	2022
Factorized Q-learning for large-scale multi-agent systems. M Zhou, Y Chen, Y Wen, Y Yang, Y Su, W Zhang, D Zhang, J Wang. International Conference on Distributed Artificial Intelligence, 1-7, 2019	72	2019
Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning. Y Wen, Y Yang, R Luo, J Wang. IJCAI 2020, 2019	64	2019
Modelling Behavioural Diversity for Learning in Open-Ended Games. NP Nieves, Y Yang, O Slumbers, DH Mguni, J Wang. ICML 2021, Long Oral, 2021	60*	2021
