Samuele Tosatto

Cited by

	All	Since 2019
Citations	180	171
h-index	5	5
i10-index	4	4

Citations per year
2018	2019	2020	2021	2022	2023	2024
8	18	23	29	47	42	12

Public access

4 articles available
0 articles not available

Based on funding mandates

Co-authors

Jan Peters	Professor for Intelligent Autonomous Systems/TU Darmstadt, Dept. Head/German AI Research Center DFKI	Verified email at ias.tu-darmstadt.de
Marcello Restelli	Associate Professor, Politecnico di Milano	Verified email at polimi.it
Carlo D'Eramo	Professor of Reinforcement Learning @ University of Würzburg | Group leader @ TU Darmstadt	Verified email at uni-wuerzburg.de
Matteo Pirotta	Research Scientist, Meta (FAIR)	Verified email at fb.com
João Carvalho	Technische Universität Darmstadt	Verified email at ias.informatik.tu-darmstadt.de
Hany Abdulsamad	Postdoc, Aalto University	Verified email at aalto.fi
Riad Akrour	Inria Scool	Verified email at inria.fr
Joni Pajarinen	Assistant Professor at Aalto University	Verified email at aalto.fi

Samuele Tosatto

Assistant Professor @ Universität Innsbruck

Verified email at uibk.ac.at

Robot Learning, Reinforcement Learning, Machine Learning


Title	Cited by	Year
Learning inverse dynamics models in O(n) time with LSTM networks. E Rueckert, M Nakatenus, S Tosatto, J Peters. 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids …, 2017	73	2017
Boosted Fitted Q-Iteration. S Tosatto, C D'Eramo, M Pirotta, M Restelli. International Conference on Machine Learning, 2017	48	2017
A Nonparametric Off-Policy Policy Gradient. S Tosatto, J Carvalho, H Abdulsamad, J Peters. International Conference on Artificial Intelligence and Statistics (AISTATS), 2020	14	2020
Contextual latent-movements off-policy optimization for robotic manipulation skills. S Tosatto, G Chalvatzaki, J Peters. 2021 IEEE International Conference on Robotics and Automation (ICRA), 10815 …, 2021	13	2021
Model-free Policy Learning with Reward Gradients. Q Lan, S Tosatto, H Farrahi, A Mahmood. arXiv preprint arXiv:2103.05147, 2021	8	2021
Batch reinforcement learning with a nonparametric off-policy policy gradient. S Tosatto, J Carvalho, J Peters. IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (10), 5996 …, 2021	5	2021
An upper bound of the bias of Nadaraya-Watson kernel regression under Lipschitz assumptions. S Tosatto, R Akrour, J Peters. Stats 4 (1), 1-17, 2020	5	2020
Exploration Driven By an Optimistic Bellman Equation. S Tosatto, C D'Eramo, J Pajarinen, M Restelli, J Peters. International Joint Conference on Neural Networks, 2019	4	2019
Deep probabilistic movement primitives with a Bayesian aggregator. M Przystupa, F Haghverd, M Jagersand, S Tosatto. 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023	3	2023
An alternate policy gradient estimator for softmax policies. S Garg, S Tosatto, Y Pan, M White, AR Mahmood. arXiv preprint arXiv:2112.11622, 2021	3	2021
A temporal-difference approach to policy gradient estimation. S Tosatto, A Patterson, M White, R Mahmood. International Conference on Machine Learning, 21609-21632, 2022	2	2022
Dynamic decision frequency with continuous options. A Karimi, J Jin, J Luo, AR Mahmood, M Jagersand, S Tosatto. 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023	1	2023
A Gradient Critic for Policy Gradient Estimation. S Tosatto, A Patterson, M White, AR Mahmood. Sixteenth European Workshop on Reinforcement Learning, 2023	1	2023
Variable-Decision Frequency Option Critic. A Karimi, J Jin, J Luo, AR Mahmood, M Jägersand, S Tosatto. CoRR, 2022		2022
Off-Policy Reinforcement Learning for Robotics. S Tosatto. Technische Universität, 2021		2021
Dimensionality Reduction of Movement Primitives in Parameter Space. S Tosatto, J Stadtmüller, J Peters. arXiv preprint arXiv:2003.02634, 2020		2020
An Upper Bound of the Bias of Nadaraya–Watson Kernel Regression under Lipschitz Assumptions. S Tosatto, R Akrour, J Peters. Stats 2021, 4, 1–17		2020
Technical Report: “Exploration Driven by an Optimistic Bellman Equation”. S Tosatto, C D’Eramo, J Pajarinen, M Restelli, J Peters		2018
Making Policy Gradient Estimators for Softmax Policies More Robust to Non-stationarities. S Garg, S Tosatto, Y Pan, M White, AR Mahmood
Balloon Estimators for Improving and Scaling the Nonparametric Off-Policy Policy Gradient. FA Hilt, JN Kolf, C Weiland, J Carvalho, S Tosatto
