A. Rupam Mahmood

Citata da

	Tutte	Dal 2019
Citazioni	1406	1119
Indice H	17	15
i10-index	19	17

280

140

210

2013201420152016201720182019202020212022202320247 19 34 55 59 104 122 175 233 234 262 93

Accesso pubblico

Visualizza tutto

6 articoli

0 articoli

Disponibili

Non disponibili

In base ai mandati di finanziamento

Coautori

Richard S. SuttonKeen, Amii, and University of AlbertaEmail verificata su richsutton.com
Martha WhiteUniversity of AlbertaEmail verificata su ualberta.ca
Gautham VasanAmii, University of AlbertaEmail verificata su ualberta.ca
Dmytro KorenkevychMeta AIEmail verificata su meta.com
James BergstraPrincipal Engineer, Ocado TechnologyEmail verificata su ocado.com
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLEmail verificata su google.com
Patrick M. PilarskiUniversity of Alberta, Amii (Alberta Machine Intelligence Institute)Email verificata su ualberta.ca
Qingfeng LanPhD student @ University of AlbertaEmail verificata su ualberta.ca
Shibhansh DoharePhD Student, University of AlbertaEmail verificata su ualberta.ca
Harm van SeijenSony AIEmail verificata su sony.com
Brent KomerPhD Student, University of WaterlooEmail verificata su uwaterloo.ca
Marlos C. MachadoUniversity of AlbertaEmail verificata su ualberta.ca
Doina PrecupDeepMind and McGill UniversityEmail verificata su cs.mcgill.ca
Thomas DegrisDeepMindEmail verificata su google.com
Oliver LimoyoUniversity of Toronto Institute for Aerospace StudiesEmail verificata su mail.utoronto.ca
Bryan ChanUniversity of AlbertaEmail verificata su ualberta.ca
Jonathan KellyUniversity of Toronto Institute for Aerospace StudiesEmail verificata su utias.utoronto.ca

Segui

A. Rupam Mahmood

University of Alberta, Amii

Email verificata su ualberta.ca - Home page

Reinforcement learning robotics artificial intelligence machine learning


Titolo Ordina per citazioni Ordina per anno Ordina per titolo	Citata da Citata da	Anno
An emphatic approach to the problem of off-policy temporal-difference learning RS Sutton, AR Mahmood, M White (JMLR) Journal of Machine Learning Research 17, 2016	277	2016
Benchmarking reinforcement learning algorithms on real-world robots AR Mahmood, D Korenkevych, G Vasan, W Ma, J Bergstra (CoRL) Proceedings of the 2nd Annual Conference on Robot Learning, 2018	184	2018
Weighted importance sampling for off-policy learning with linear function approximation AR Mahmood, H van Hasselt, RS Sutton (NeurIPS) Advances in Neural Information Processing Systems 27, 2014	165	2014
True online temporal-difference learning H van Seijen, AR Mahmood, PM Pilarski, MC Machado, RS Sutton (JMLR) Journal of Machine Learning Research 17, 2016	110	2016
Setting up a reinforcement learning task with a real-world robot AR Mahmood, D Korenkevych, BJ Komer, J Bergstra (IROS) 2018 IEEE/RSJ International Conference on Intelligent Robots and …, 2018	84	2018
Tuning-free step-size adaptation AR Mahmood, RS Sutton, T Degris, PM Pilarski (ICASSP) Acoustics, Speech and Signal Processing, 2012 IEEE International …, 2012	78	2012
Maintaining plasticity in deep continual learning S Dohare, JF Hernandez-Garcia, P Rahman, AR Mahmood, RS Sutton arXiv preprint arXiv:2306.13812, 2024	53*	2024
Multi-step off-policy learning without importance sampling ratios AR Mahmood, H Yu, RS Sutton arXiv preprint arXiv:1702.03006, 2017	49	2017
Representation Search through Generate and Test AR Mahmood, RS Sutton Workshops at the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013	48	2013
Off-policy TD (λ) with a true online equivalence H van Hasselt, AR Mahmood, RS Sutton (UAI) Proceedings of the 30th Conference on Uncertainty in Artificial …, 2014	45	2014
A new Q (λ) with interim forward view and Monte Carlo equivalence RS Sutton, AR Mahmood, D Precup, M CA, H van Hasselt, U CA (ICML) In International Conference on Machine Learning, 2014	40	2014
On generalized Bellman equations and temporal-difference learning H Yu, AR Mahmood, RS Sutton (JMLR) The Journal of Machine Learning Research 19 (1), 1864-1912, 2018	39	2018
Emphatic temporal-difference learning AR Mahmood, H Yu, M White, RS Sutton In European Workshops on Reinforcement Learning, 2015	37	2015
Off-policy learning based on weighted importance sampling with linear computational complexity AR Mahmood, RS Sutton (UAI) Proceedings of the 31st Conference on Uncertainty in Artificial …, 2015	30	2015
Autoregressive policies for continuous control deep reinforcement learning D Korenkevych, AR Mahmood, G Vasan, J Bergstra (IJCAI) Proceedings of the 28th International Joint Conference on Artificial …, 2019	26	2019
Incremental Off-policy Reinforcement Learning Algorithms A Mahmood University of Alberta, 2017	18	2017
Greedification operators for policy optimization: investigating forward and reverse KL divergences A Chan, H Silva, S Lim, T Kozuno, AR Mahmood, M White (JMLR) Journal of Machine Learning Research, 2022	17	2022
Structure Learning of Causal Bayesian Networks: A Survey A Mahmood Department of Computing Science, University of Alberta, Edmonton, Canada …, 2011	11	2011
Automatic Step-size Adaptation In Incremental Supervised Learning A Mahmood University of Alberta, 2010	11	2010
Asynchronous reinforcement learning for real-time control of physical robots Y Yuan, AR Mahmood (ICRA) In Proceedings of the 2022 International Conference on Robotics and …, 2022	9	2022

Il sistema al momento non può eseguire l'operazione. Riprova più tardi.

Articoli 1–20

Citazioni per anno

Citazioni duplicate

Citazioni unite

Aggiungi coautoriCoautori

Segui

Citata da

Coautori