Thomas Degris
DeepMind
Verified email at google.com
Title
Cited by
Year
Deterministic policy gradient algorithms
D Silver, G Lever, N Heess, T Degris, D Wierstra, M Riedmiller
International conference on machine learning, 387-395, 2014
Cited by: 2101, Year: 2014
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup
The 10th International Conference on Autonomous Agents and Multiagent …, 2011
Cited by: 365, Year: 2011
Vector-based navigation using grid-like representations in artificial agents
A Banino, C Barry, B Uria, C Blundell, T Lillicrap, P Mirowski, A Pritzel, ...
Nature 557 (7705), 429-433, 2018
Cited by: 339, Year: 2018
Off-policy actor-critic
T Degris, M White, RS Sutton
arXiv preprint arXiv:1205.4839, 2012
Cited by: 307, Year: 2012
Deep reinforcement learning in large discrete action spaces
G Dulac-Arnold, R Evans, H van Hasselt, P Sunehag, T Lillicrap, J Hunt, ...
arXiv preprint arXiv:1512.07679, 2015
Cited by: 222, Year: 2015
The predictron: End-to-end learning and planning
D Silver, H Hasselt, M Hessel, T Schaul, A Guez, T Harley, ...
International Conference on Machine Learning, 3191-3199, 2017
Cited by: 201, Year: 2017
Model-free reinforcement learning with continuous action in practice
T Degris, PM Pilarski, RS Sutton
2012 American Control Conference (ACC), 2177-2182, 2012
Cited by: 181, Year: 2012
Online human training of a myoelectric prosthesis controller via actor-critic reinforcement learning
PM Pilarski, MR Dawson, T Degris, F Fahimi, JP Carey, RS Sutton
2011 IEEE international conference on rehabilitation robotics, 1-7, 2011
Cited by: 147, Year: 2011
Learning the structure of factored Markov decision processes in reinforcement learning problems
T Degris, O Sigaud, PH Wuillemin
Proceedings of the 23rd international conference on Machine learning, 257-264, 2006
Cited by: 141, Year: 2006
Adaptive artificial limbs: A real-time approach to prediction and anticipation
PM Pilarski, MR Dawson, T Degris, JP Carey, KM Chan, JS Hebert, ...
IEEE Robotics & Automation Magazine 20 (1), 53-64, 2013
Cited by: 68, Year: 2013
Dynamic switching and real-time machine learning for improved human control of assistive biomedical robots
PM Pilarski, MR Dawson, T Degris, JP Carey, RS Sutton
2012 4th IEEE RAS & EMBS International Conference on Biomedical Robotics and …, 2012
Cited by: 49, Year: 2012
Tuning-free step-size adaptation
AR Mahmood, RS Sutton, T Degris, PM Pilarski
2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012
Cited by: 42, Year: 2012
A spiking neuron model of head-direction cells for robot orientation
T Degris, L Lacheze, C Boucheny, A Arleo
Proceedings of the eighth int. conf. on the simulation of adaptive behavior …, 2004
Cited by: 27, Year: 2004
Factored Markov decision processes
T Degris, O Sigaud
Markov Decision Processes in Artificial Intelligence, 99-126, 2013
Cited by: 20, Year: 2013
Chi-square tests driven method for learning the structure of factored MDPs
T Degris, O Sigaud, PH Wuillemin
arXiv preprint arXiv:1206.6842, 2012
Cited by: 20, Year: 2012
Rapid response of head direction cells to reorienting visual cues: a computational model
T Degris, O Sigaud, SI Wiener, A Arleo
Neurocomputing 58, 675-682, 2004
Cited by: 20, Year: 2004
Scaling-up knowledge for a cognizant robot
T Degris, J Modayil
AAAI Spring Symposium on Designing Intelligent Robots: Reintegrating AI., 2012
Cited by: 12, Year: 2012
Apprentissage par renforcement dans les processus de décision Markoviens factorisés
T Degris
Paris 6, 2007
Cited by: 11, Year: 2007
Meta-descent for online, continual prediction
A Jacobsen, M Schlegel, C Linke, T Degris, A White, M White
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 3943-3950, 2019
Cited by: 9, Year: 2019
Exploiting additive structure in factored MDPs for reinforcement learning
T Degris, O Sigaud, PH Wuillemin
European Workshop on Reinforcement Learning, 15-26, 2008
Cited by: 8, Year: 2008