Richard S. Sutton
DeepMind, Amii, and University of Alberta
Verified email at richsutton.com - Homepage
Title
Cited by
Year
Reinforcement learning: An Introduction, 2nd edition
RS Sutton, AG Barto
MIT press, 2018
Cited by: 43557; Year: 2018
Learning to predict by the methods of temporal differences
RS Sutton
Machine learning 3 (1), 9-44, 1988
Cited by: 6301; Year: 1988
Policy gradient methods for reinforcement learning with function approximation.
RS Sutton, DA McAllester, SP Singh, Y Mansour
NIPs 99, 1057-1063, 1999
Cited by: 4316; Year: 1999
Neuronlike adaptive elements that can solve difficult learning control problems
AG Barto, RS Sutton, CW Anderson
IEEE transactions on systems, man, and cybernetics 13 (5), 834-846, 1983
Cited by: 4235; Year: 1983
Reinforcement learning: An Introduction, 1st edition
RS Sutton, AG Barto
MIT press, 1998
Cited by: 3909*; Year: 1998
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
RS Sutton, D Precup, S Singh
Artificial intelligence 112 (1-2), 181-211, 1999
Cited by: 2852; Year: 1999
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
RS Sutton
Proceedings of the International Conference on Machine Learning, 216-224, 1990
Cited by: 1753; Year: 1990
Toward a modern theory of adaptive networks: Expectation and prediction.
RS Sutton, AG Barto
Psychological review 88 (2), 135, 1981
Cited by: 1662; Year: 1981
Generalization in reinforcement learning: Successful examples using sparse coarse coding
RS Sutton
Advances in neural information processing systems, 1038-1044, 1996
Cited by: 1549; Year: 1996
Neural networks for control
WT Miller, PJ Werbos, RS Sutton
MIT press, 1990
Cited by: 1496; Year: 1990
Guidelines on management (diagnosis and treatment) of syncope–update 2004: the Task Force on Syncope, European Society of Cardiology
M Brignole, P Alboni, DG Benditt, L Bergfeldt, JJ Blanc, PEB Thomsen, ...
European heart journal 25 (22), 2054-2072, 2004
Cited by: 1241; Year: 2004
Temporal credit assignment in reinforcement learning
RS Sutton
University of Massachusetts, Amherst, http://www.incompleteideas.net/papers …, 1984
Cited by: 953; Year: 1984
Reinforcement learning with replacing eligibility traces
SP Singh, RS Sutton
Machine learning 22 (1), 123-158, 1996
Cited by: 884; Year: 1996
Time-derivative models of Pavlovian reinforcement.
RS Sutton, AG Barto
Learning and Computational Neuroscience: Foundations of Adaptive Networks …, 1990
Cited by: 744; Year: 1990
A menu of designs for reinforcement learning over time
PJ Werbos, WT Miller, RS Sutton
Neural networks for control, 67-95, 1990
Cited by: 658; Year: 1990
Incremental natural actor-critic algorithms
S Bhatnagar, RS Sutton, M Ghavamzadeh, M Lee
Advances in neural information processing systems, 2008
Cited by: 621*; Year: 2008
A bradford book
RS Sutton, AG Barto
Reinforcement learning: An introduction, 1998
Cited by: 619; Year: 1998
Dyna, an integrated architecture for learning, planning, and reacting
RS Sutton
ACM Sigart Bulletin 2 (4), 160-163, 1991
Cited by: 610; Year: 1991
Predictive representations of state
ML Littman, RS Sutton, S Singh
Advances in neural information processing systems, 1555-1561, 2002
Cited by: 603; Year: 2002
Learning and sequential decision making
AG Barto, RS Sutton, CJCH Watkins
Learning and Computational Neuroscience: Foundations of Adaptive Networks, 1990
Cited by: 594; Year: 1990
Articles 1–20