Matteo Papini
Titolo
Citata da
Citata da
Anno
Stochastic variance-reduced policy gradient
M Papini, D Binaghi, G Canonaco, M Pirotta, M Restelli
Proceedings of the 35th International Conference on Machine Learning 80 …, 2018
852018
Policy optimization via importance sampling
AM Metelli, M Papini, F Faccio, M Restelli
arXiv preprint arXiv:1809.06098, 2018
582018
Adaptive batch size for safe policy gradients
M Papini, M Pirotta, M Restelli
The Thirty-first Annual Conference on Neural Information Processing Systems …, 2017
292017
Feature selection via mutual information: new theoretical insights
M Beraha, AM Metelli, M Papini, A Tirinzoni, M Restelli
2019 International Joint Conference on Neural Networks (IJCNN), 1-9, 2019
172019
Smoothing policies and safe policy gradients
M Papini, M Pirotta, M Restelli
arXiv preprint arXiv:1905.03231, 2019
142019
Importance Sampling Techniques for Policy Optimization.
AM Metelli, M Papini, N Montali, M Restelli
J. Mach. Learn. Res. 21, 141:1-141:75, 2020
112020
Risk-averse trust region optimization for reward-volatility reduction
L Bisi, L Sabbioni, E Vittori, M Papini, M Restelli
arXiv preprint arXiv:1912.03193, 2019
112019
Optimistic policy optimization via multiple importance sampling
M Papini, AM Metelli, L Lupo, M Restelli
International Conference on Machine Learning, 4989-4999, 2019
112019
Gradient-aware model-based policy search
P D'Oro, AM Metelli, A Tirinzoni, M Papini, M Restelli
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 3801-3808, 2020
92020
Leveraging good representations in linear contextual bandits
M Papini, A Tirinzoni, M Restelli, A Lazaric, M Pirotta
arXiv preprint arXiv:2104.03781, 2021
72021
Balancing learning speed and stability in policy gradient via adaptive exploration
M Papini, A Battistello, M Restelli
International Conference on Artificial Intelligence and Statistics, 1188-1199, 2020
62020
Policy Optimization as Online Learning with Mediator Feedback
AM Metelli, M Papini, P D'Oro, M Restelli
arXiv preprint arXiv:2012.08225, 2020
42020
Safely Exploring Policy Gradient
M Papini, A Battistello, M Restelli, A Battistello
European Workshop on Reinforcement Learning 14, 2018
12018
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
M Papini, A Tirinzoni, A Pacchiano, M Restelli, A Lazaric, M Pirotta
Advances in Neural Information Processing Systems 34, 2021
2021
Safe policy optimization
M Papini
Italy, 2021
2021
Automated Reasoning for Reinforcement Learning Agents in Structured Environments
A Gianola, M Montali, M Papini
2021
Safe Exploration in Gaussian Policy Gradient
M Papini, A Battistello, M Restelli
NeurIPS-2019 Workshop on Safety and Robustness in Decision Making, 2019
2019
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–17