Yonathan Efroni
Yonathan Efroni
Microsoft Research, New York
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
Adaptive trust region policy optimization: Global convergence and faster rates for regularized mdps
L Shani, Y Efroni, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5668-5675, 2020
262020
Action Robust Reinforcement Learning and Applications in Continuous Control
C Tessler, Y Efroni, S Mannor
arXiv preprint arXiv:1901.09184, 2019
262019
Tight regret bounds for model-based reinforcement learning with greedy policies
Y Efroni, N Merlis, M Ghavamzadeh, S Mannor
Advances in Neural Information Processing Systems, 12203-12213, 2019
212019
Universality of local weak interactions and its application for interferometric alignment
J Dziewior, L Knips, D Farfurnik, K Senkalla, N Benshalom, J Efroni, ...
Proceedings of the National Academy of Sciences 116 (8), 2881-2890, 2019
202019
Beyond the one step greedy approach in reinforcement learning
Y Efroni, G Dalal, B Scherrer, S Mannor
arXiv preprint arXiv:1802.03654, 2018
192018
Multiple-step greedy policies in approximate and online reinforcement learning
Y Efroni, G Dalal, B Scherrer, S Mannor
Advances in Neural Information Processing Systems, 5238-5247, 2018
15*2018
Optimistic Policy Optimization with Bandit Feedback
Y Efroni, L Shani, A Rosenberg, S Mannor
arXiv preprint arXiv:2002.08243, 2020
102020
How to combine tree-search methods in reinforcement learning
Y Efroni, G Dalal, B Scherrer, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence 33, 3494-3501, 2019
92019
Exploration-Exploitation in Constrained MDPs
Y Efroni, S Mannor, M Pirotta
arXiv preprint arXiv:2003.02189, 2020
72020
Online Planning with Lookahead Policies
Y Efroni, M Ghavamzadeh, S Mannor
Advances in Neural Information Processing Systems 33, 2020
6*2020
Topological transitions and fractional charges induced by strain and a magnetic field in carbon nanotubes
Y Efroni, S Ilani, E Berg
Physical Review Letters 119 (14), 147704, 2017
62017
Exploration conscious reinforcement learning revisited
L Shani, Y Efroni, S Mannor
International Conference on Machine Learning, 5680-5689, 2019
5*2019
Mirror Descent Policy Optimization
M Tomar, L Shani, Y Efroni, M Ghavamzadeh
arXiv preprint arXiv:2005.09814, 2020
12020
Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning
M Tomar, Y Efroni, M Ghavamzadeh
arXiv preprint arXiv:1910.02919, 2019
12019
Training Deep Neural Networks by optimizing over nonlocal paths in hyperparameter space
V Pushkarov, J Efroni, M Maksymenko, M Koch-Janusz
arXiv preprint arXiv:1909.04013, 2019
12019
Reinforcement Learning with Trajectory Feedback
Y Efroni, N Merlis, S Mannor
arXiv preprint arXiv:2008.06036, 2020
2020
Bandits with Partially Observable Offline Data
G Tennenholtz, U Shalit, S Mannor, Y Efroni
arXiv preprint arXiv:2006.06731, 2020
2020
The system can't perform the operation now. Try again later.
Articles 1–17