Segui
Jan Humplik
Jan Humplik
Research Scientist, DeepMind
Email verificata su deepmind.com
Titolo
Citata da
Citata da
Anno
Meta reinforcement learning as task inference
J Humplik, A Galashov, L Hasenclever, PA Ortega, YW Teh, N Heess
arXiv preprint arXiv:1905.06424, 2019
1282019
Language to rewards for robotic skill synthesis
W Yu, N Gileadi, C Fu, S Kirmani, KH Lee, MG Arenas, HTL Chiang, ...
arXiv preprint arXiv:2306.08647, 2023
912023
Learning agile soccer skills for a bipedal robot with deep reinforcement learning
T Haarnoja, B Moran, G Lever, SH Huang, D Tirumala, M Wulfmeier, ...
arXiv preprint arXiv:2304.13653, 2023
352023
Probabilistic models for neural populations that naturally capture global coupling and criticality
J Humplik, G Tkačik
PLoS computational biology 13 (9), e1005763, 2017
332017
Imitate and repurpose: Learning reusable robot movement skills from human and animal behaviors
S Bohez, S Tunyasuvunakool, P Brakel, F Sadeghi, L Hasenclever, ...
arXiv preprint arXiv:2203.17138, 2022
292022
Nerf2real: Sim2real transfer of vision-guided bipedal motion skills using neural radiance fields
A Byravan, J Humplik, L Hasenclever, A Brussee, F Nori, T Haarnoja, ...
2023 IEEE International Conference on Robotics and Automation (ICRA), 9362-9369, 2023
212023
Towards real robot learning in the wild: A case study in bipedal locomotion
M Bloesch, J Humplik, V Patraucean, R Hafner, T Haarnoja, A Byravan, ...
Conference on Robot Learning, 1502-1511, 2022
202022
Neural belief states for partially observed domains
P Moreno, J Humplik, G Papamakarios, BA Pires, L Buesing, N Heess, ...
NeurIPS 2018 workshop on reinforcement learning under partial observability, 2018
192018
Evolutionary dynamics of infectious diseases in finite populations
J Humplik, AL Hill, MA Nowak
Journal of theoretical biology 360, 149-162, 2014
192014
Forgetting and imbalance in robot lifelong learning with off-policy data
W Zhou, S Bohez, J Humplik, N Heess, A Abdolmaleki, D Rao, ...
Conference on Lifelong Learning Agents, 294-309, 2022
62022
Inferring couplings in networks across order-disorder phase transitions
V Ngampruetikorn, V Sachdeva, J Torrence, J Humplik, DJ Schwab, ...
Physical review research 4 (2), 023240, 2022
42022
Importance weighted policy learning and adaptation
A Galashov, J Sygnowski, G Desjardins, J Humplik, L Hasenclever, ...
arXiv preprint arXiv:2009.04875, 2020
42020
Semiparametric energy-based probabilistic models
J Humplik, G Tkačik
arXiv preprint arXiv:1605.07371, 2016
42016
Skills: Adaptive skill sequencing for efficient temporally-extended exploration
G Vezzani, D Tirumala, M Wulfmeier, D Rao, A Abdolmaleki, B Moran, ...
arXiv preprint arXiv:2211.13743, 2022
32022
Learning to Learn Faster from Human Feedback with Language Model Predictive Control
J Liang, F Xia, W Yu, A Zeng, MG Arenas, M Attarian, M Bauza, M Bennice, ...
arXiv preprint arXiv:2402.11450, 2024
12024
Offline Distillation for Robot Lifelong Learning with Imbalanced Experience
W Zhou, S Bohez, J Humplik, A Abdolmaleki, D Rao, M Wulfmeier, ...
CoRR abs/2204.05893, 2022
12022
Semiparametric energy-based models of systems exhibiting criticality
J Humplik, G Tkacik
APS March Meeting Abstracts 2016, F41. 002, 2016
2016
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–17