Hengyuan Hu
Verified email at stanford.edu
Title
Cited by
Year
Network trimming: A data-driven neuron pruning approach towards efficient deep architectures
H Hu, R Peng, YW Tai, CK Tang
arXiv preprint arXiv:1607.03250, 2016
Cited by: 1035, Year: 2016
“Other-Play” for Zero-Shot Coordination
H Hu, A Lerer, A Peysakhovich, J Foerster
International Conference on Machine Learning, 4399-4410, 2020
Cited by: 152, Year: 2020
Human-level play in the game of Diplomacy by combining language models with strategic reasoning
Meta Fundamental AI Research Diplomacy Team (FAIR)†, A Bakhtin, ...
Science 378 (6624), 1067-1074, 2022
Cited by: 142, Year: 2022
Simplified action decoder for deep multi-agent reinforcement learning
H Hu, JN Foerster
ICLR 2019, 2019
Cited by: 87, Year: 2019
Trajectory diversity for zero-shot coordination
A Lupu, B Cui, H Hu, J Foerster
International Conference on Machine Learning, 7204-7213, 2021
Cited by: 82, Year: 2021
Improving policies via search in cooperative partially observable games
A Lerer, H Hu, J Foerster, N Brown
Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7187-7194, 2020
Cited by: 70, Year: 2020
Hierarchical decision making by generating and following natural language instructions
H Hu, D Yarats, Q Gong, Y Tian, M Lewis
Advances in neural information processing systems 32, 2019
Cited by: 61, Year: 2019
Off-belief learning
H Hu, A Lerer, B Cui, L Pineda, N Brown, J Foerster
International Conference on Machine Learning, 4369-4379, 2021
Cited by: 57, Year: 2021
Polygames: Improved zero learning
T Cazenave, YC Chen, GW Chen, SY Chen, XD Chiu, J Dehos, M Elsa, ...
ICGA Journal 42 (4), 244-256, 2020
Cited by: 43, Year: 2020
Modeling strong and human-like gameplay with KL-regularized search
AP Jacob, DJ Wu, G Farina, A Lerer, H Hu, A Bakhtin, J Andreas, N Brown
International Conference on Machine Learning, 9695-9728, 2022
Cited by: 38, Year: 2022
Language instructed reinforcement learning for human-ai coordination
H Hu, D Sadigh
International Conference on Machine Learning, 13584-13598, 2023
Cited by: 31, Year: 2023
K-level Reasoning for Zero-Shot Coordination in Hanabi
B Cui, H Hu, L Pineda, J Foerster
Advances in Neural Information Processing Systems 34, 8215-8228, 2021
Cited by: 26, Year: 2021
Ridge rider: Finding diverse solutions by following eigenvectors of the hessian
J Parker-Holder, L Metz, C Resnick, H Hu, A Lerer, A Letcher, ...
Advances in Neural Information Processing Systems 33, 753-765, 2020
Cited by: 25, Year: 2020
Scalable online planning via reinforcement learning fine-tuning
A Fickinger, H Hu, B Amos, S Russell, N Brown
Advances in Neural Information Processing Systems 34, 16951-16963, 2021
Cited by: 14, Year: 2021
Adversarial Diversity in Hanabi
B Cui, A Lupu, S Sokota, H Hu, DJ Wu, JN Foerster
The Eleventh International Conference on Learning Representations, 2022
Cited by: 12, Year: 2022
A fine-tuning approach to belief state modeling
S Sokota, H Hu, DJ Wu, JZ Kolter, JN Foerster, N Brown
International Conference on Learning Representations, 2021
Cited by: 6, Year: 2021
Human-AI Coordination via Human-Regularized Search and Learning
H Hu, DJ Wu, A Lerer, J Foerster, N Brown
arXiv preprint arXiv:2210.05125, 2022
Cited by: 5, Year: 2022
Learned belief search: Efficiently improving policies in partially observable settings
H Hu, A Lerer, N Brown, J Foerster
arXiv preprint arXiv:2106.09086, 2021
Cited by: 5, Year: 2021
Toward Grounded Social Reasoning
M Kwon, H Hu, V Myers, S Karamcheti, A Dragan, D Sadigh
arXiv preprint arXiv:2306.08651, 2023
Cited by: 4, Year: 2023
Imitation Bootstrapped Reinforcement Learning
H Hu, S Mirchandani, D Sadigh
arXiv preprint arXiv:2311.02198, 2023
Cited by: 2, Year: 2023
Articles 1–20