Jean Harb
Titolo
Citata da
Citata da
Anno
Multi-agent actor-critic for mixed cooperative-competitive environments
R Lowe, Y Wu, A Tamar, J Harb, OAIP Abbeel, I Mordatch
Advances in neural information processing systems, 6379-6390, 2017
5862017
The option-critic architecture
PL Bacon, J Harb, D Precup
Thirty-First AAAI Conference on Artificial Intelligence, 2017
3512017
When waiting is not an option: Learning options with a deliberation cost
J Harb, PL Bacon, M Klissarov, D Precup
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
402018
Learnings options end-to-end for continuous action tasks
M Klissarov, PL Bacon, J Harb, D Precup
arXiv preprint arXiv:1712.00004, 2017
142017
Investigating recurrence and eligibility traces in deep Q-networks
J Harb, D Precup
arXiv preprint arXiv:1704.05495, 2017
102017
The Barbados 2018 List of Open Issues in Continual Learning
T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ...
arXiv preprint arXiv:1811.07004, 2018
22018
Policy Evaluation Networks
J Harb, T Schaul, D Precup, PL Bacon
arXiv preprint arXiv:2002.11833, 2020
2020
Asynchronous Advantage Option-Critic with Deliberation Cost
J Harb, PL Bacon, D Precup
RLDM, 2017
2017
Learning options in deep reinforcement learning
J Merheb-Harb
McGill University, 2017
2017
Il sistema al momento non pu eseguire l'operazione. Riprova pi tardi.
Articoli 1–9