Leonard Hasenclever

Citata da

	Tutte	Dal 2019
Citazioni	1954	1821
Indice H	23	23
i10-index	29	28

480

240

120

360

20152016201720182019202020212022202320249 14 19 82 128 191 252 352 479 415

Accesso pubblico

Visualizza tutto

5 articoli

0 articoli

Disponibili

Non disponibili

In base ai mandati di finanziamento

Segui

Leonard Hasenclever

Research Scientist at DeepMind

Email verificata su google.com - Home page

Machine Learning Reinforcement Learning Statistics


Titolo Ordina per citazioni Ordina per anno Ordina per titolo	Citata da Citata da	Anno
Sylvester Normalizing Flows for Variational Inference R Berg, L Hasenclever, JM Tomczak, M Welling UAI, 2018	246	2018
Language to rewards for robotic skill synthesis W Yu, N Gileadi, C Fu, S Kirmani, KH Lee, MG Arenas, HTL Chiang, ... arXiv preprint arXiv:2306.08647, 2023	152	2023
Neural probabilistic motor primitives for humanoid control J Merel, L Hasenclever, A Galashov, A Ahuja, V Pham, G Wayne, YW Teh, ... arXiv preprint arXiv:1811.11711, 2018	149	2018
Meta reinforcement learning as task inference J Humplik, A Galashov, L Hasenclever, PA Ortega, YW Teh, N Heess arXiv preprint arXiv:1905.06424, 2019	136	2019
Catch & carry: reusable neural controllers for vision-guided whole-body tasks J Merel, S Tunyasuvunakool, A Ahuja, Y Tassa, L Hasenclever, V Pham, ... ACM Transactions on Graphics (TOG) 39 (4), 39: 1-39: 12, 2020	124	2020
From motor control to team play in simulated humanoid football S Liu, G Lever, Z Wang, J Merel, SMA Eslami, D Hennes, WM Czarnecki, ... Science Robotics 7 (69), eabo0235, 2022	112	2022
Information asymmetry in KL-regularized RL A Galashov, SM Jayakumar, L Hasenclever, D Tirumala, J Schwarz, ... International Conference on Learning Representations, 2018	104	2018
Mix & match agent curricula for reinforcement learning W Czarnecki, S Jayakumar, M Jaderberg, L Hasenclever, YW Teh, ... International Conference on Machine Learning, 1087-1095, 2018	90	2018
Distributed Bayesian learning with stochastic natural gradient expectation propagation and the posterior server L Hasenclever, S Webb, T Lienart, S Vollmer, B Lakshminarayanan, ... Journal of Machine Learning Research 18 (106), 1-37, 2017	80*	2017
A distributional view on multi-objective policy optimization A Abdolmaleki, S Huang, L Hasenclever, M Neunert, F Song, M Zambelli, ... International conference on machine learning, 11-22, 2020	77	2020
Observational learning by reinforcement learning D Borsa, B Piot, R Munos, O Pietquin arXiv preprint arXiv:1706.06617, 2017	72	2017
The true cost of stochastic gradient Langevin dynamics T Nagapetyan, AB Duncan, L Hasenclever, SJ Vollmer, L Szpruch, ... arXiv preprint arXiv:1706.02692, 2017	62	2017
Learning agile soccer skills for a bipedal robot with deep reinforcement learning T Haarnoja, B Moran, G Lever, SH Huang, D Tirumala, J Humplik, ... Science Robotics 9 (89), eadi8022, 2024	58	2024
CoMic: Complementary task learning & mimicry for reusable skills L Hasenclever, F Pardo, R Hadsell, N Heess, J Merel International Conference on Machine Learning, 4105-4115, 2020	49	2020
Relativistic Monte Carlo X Lu, V Perrone, L Hasenclever, YW Teh, SJ Vollmer AISTATS, 2017	48	2017
Exploiting hierarchy for learning and transfer in kl-regularized rl D Tirumala, H Noh, A Galashov, L Hasenclever, A Ahuja, G Wayne, ... arXiv preprint arXiv:1903.07438, 2019	44	2019
Towards a unified agent with foundation models N Di Palo, A Byravan, L Hasenclever, M Wulfmeier, N Heess, ... arXiv preprint arXiv:2307.09668, 2023	39	2023
Imitate and repurpose: Learning reusable robot movement skills from human and animal behaviors S Bohez, S Tunyasuvunakool, P Brakel, F Sadeghi, L Hasenclever, ... arXiv preprint arXiv:2203.17138, 2022	36	2022
Behavior priors for efficient reinforcement learning D Tirumala, A Galashov, H Noh, L Hasenclever, R Pascanu, J Schwarz, ... Journal of Machine Learning Research 23 (221), 1-68, 2022	33	2022
Divide-and-conquer monte carlo tree search for goal-directed planning G Parascandolo, L Buesing, J Merel, L Hasenclever, J Aslanides, ... arXiv preprint arXiv:2004.11410, 2020	32	2020

Il sistema al momento non può eseguire l'operazione. Riprova più tardi.

Articoli 1–20

Citazioni per anno

Citazioni duplicate

Citazioni unite

Aggiungi coautoriCoautori

Segui

Citata da