Follow
Supratik Paul
Supratik Paul
Waymo
Verified email at google.com
Title
Cited by
Cited by
Year
Learning from demonstration in the wild
F Behbahani, K Shiarlis, X Chen, V Kurin, S Kasewa, C Stirbu, J Gomes, ...
2019 International Conference on Robotics and Automation (ICRA), 775-781, 2019
742019
Fast efficient hyperparameter tuning for policy gradient methods
S Paul, V Kurin, S Whiteson
Advances in Neural Information Processing Systems 32, 2019
70*2019
Hierarchical model-based imitation learning for planning in autonomous driving
E Bronstein, M Palatucci, D Notz, B White, A Kuefler, Y Lu, S Paul, ...
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022
362022
Alternating optimisation and quadrature for robust control
S Paul, K Chatzilygeroudis, K Ciosek, JB Mouret, M Osborne, S Whiteson
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
31*2018
Fingerprint policy optimisation for robust reinforcement learning
S Paul, MA Osborne, S Whiteson
International Conference on Machine Learning, 5082-5091, 2019
202019
Embedding synthetic off-policy experience for autonomous driving via zero-shot curricula
E Bronstein, S Srinivasan, S Paul, A Sinha, M O’Kelly, P Nikdel, ...
Conference on Robot Learning, 188-198, 2023
82023
Hierarchical model-based imitation learning for planning in autonomous driving. In 2022 IEEE
E Bronstein, M Palatucci, D Notz, B White, A Kuefler, Y Lu, S Paul, ...
RSJ International Conference on Intelligent Robots and Systems (IROS), 8652-8659, 0
5
Contextual policy optimisation
S Paul, MA Osborne, S Whiteson
CoRR, vol. abs/1805.10662, 2018
32018
Robust reinforcement learning with Bayesian optimisation and quadrature
S Paul, K Chatzilygeroudis, K Ciosek, JB Mouret, MA Osborne, ...
Journal of Machine Learning Research 21 (151), 1-31, 2020
2020
Towards robust reinforcement learning
S Paul
University of Oxford, 2020
2020
The system can't perform the operation now. Try again later.
Articles 1–10