Shibl Mourad
Shibl Mourad
DeepMind
Email verificata su google.com - Home page
Titolo
Citata da
Citata da
Anno
The hanabi challenge: A new frontier for ai research
N Bard, JN Foerster, S Chandar, N Burch, M Lanctot, HF Song, E Parisotto, ...
Artificial Intelligence 280, 103216, 2020
1082020
The option keyboard: Combining skills in reinforcement learning
A Barreto, D Borsa, S Hou, G Comanici, E Aygn, P Hamel, D Toyama, ...
arXiv preprint arXiv:2106.13105, 2021
172021
Task switching or task launching based on a ranked list of tasks
PJ Beaudoin, TC Huang, R Lee, PA Manzagol, RDP McFarlane, ...
US Patent App. 14/446,760, 2018
122018
The Hanabi challenge: a new frontier for AI research. CoRR abs/1902.00506 (2019)
N Bard, JN Foerster, S Chandar, N Burch, M Lanctot, HF Song, E Parisotto, ...
arXiv preprint arXiv:1902.00506, 2019
82019
Shaping representations through communication: community size effect in artificial learning systems
O Tieleman, A Lazaridou, S Mourad, C Blundell, D Precup
arXiv preprint arXiv:1912.06208, 2019
72019
Learning to prove from synthetic theorems
E Aygn, Z Ahmed, A Anand, V Firoiu, X Glorot, L Orseau, D Precup, ...
arXiv preprint arXiv:2006.11259, 2020
62020
The barbados 2018 list of open issues in continual learning
T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ...
arXiv preprint arXiv:1811.07004, 2018
52018
Shaping representations through communication
O Tieleman, A Lazaridou, S Mourad, C Blundell, D Precup
52018
Knowledge representation for reinforcement learning using general value functions
G Comanici, D Precup, A Barreto, DK Toyama, E Aygn, P Hamel, ...
52018
Learning representations of logical formulae using graph neural networks
X Glorot, A Anand, E Aygun, S Mourad, P Kohli, D Precup
Neural Information Processing Systems, Workshop on Graph Representation Learning, 2019
42019
Anonymous personalized recommendation method
S Mourad, CK Phillips, MA Courteau, P Beaudoin
US Patent 8,745,049, 2014
42014
Anonymous personalized recommendation method
S Mourad, CK Phillips, MA Courteau, P Beaudoin
US Patent 8,521,735, 2013
32013
AndroidEnv: A Reinforcement Learning Platform for Android
D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ...
arXiv preprint arXiv:2105.13231, 2021
2021
Training a First-Order Theorem Prover from Synthetic Data
V Firoiu, E Aygun, A Anand, Z Ahmed, X Glorot, L Orseau, L Zhang, ...
arXiv preprint arXiv:2103.03798, 2021
2021
Community size effect in artificial learning systems.
O Tieleman, A Lazaridou, S Mourad, C Blundell, D Precup
ViGIL@ NeurIPS, 2019
2019
Il sistema al momento non pu eseguire l'operazione. Riprova pi tardi.
Articoli 1–15