Victor Gabillon
Unknown affiliation
No verified email - Homepage
Title
Cited by
Year
Best arm identification: A unified approach to fixed budget and fixed confidence
V Gabillon, M Ghavamzadeh, A Lazaric
NIPS, Neural Information Processing Systems, 2012
338 2012
Approximate modified policy iteration and its application to the game of Tetris
B Scherrer, M Ghavamzadeh, V Gabillon, B Lesner, M Geist
JMLR, Journal of Machine Learning Research 16, 2015
148 2015
Multi-bandit best arm identification
V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck
NIPS, Neural Information Processing Systems, 2011
122 2011
Approximate dynamic programming finally performs well in the game of Tetris
V Gabillon, M Ghavamzadeh, B Scherrer
NIPS, Neural Information Processing Systems, 2013
76 2013
Adaptive submodular maximization in bandit setting
V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan
NIPS, Neural Information Processing Systems, 2013
62 2013
Approximate modified policy iteration
B Scherrer, V Gabillon, M Ghavamzadeh, M Geist
ICML, International Conference on Machine Learning, 2012
58 2012
Best of both worlds: Stochastic & adversarial best-arm identification
Y Abbasi-Yadkori, P Bartlett, V Gabillon, A Malek, M Valko
Conference on Learning Theory, 918-949, 2018
46 2018
Improved learning complexity in combinatorial pure exploration bandits
V Gabillon, A Lazaric, M Ghavamzadeh, R Ortner, P Bartlett
AISTATS, Artificial Intelligence and Statistics, 2016
44 2016
A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption
PL Bartlett, V Gabillon, M Valko
ALT, Algorithmic Learning Theory, 2019
39 2019
Classification-based policy iteration with a critic
V Gabillon, A Lazaric, M Ghavamzadeh, B Scherrer
ICML, International Conference on Machine Learning, 2011
30 2011
MANAS: Multi-agent neural architecture search
V Lopes, FM Carlucci, PM Esperança, M Singh, V Gabillon, A Yang, H Xu, ...
arXiv preprint arXiv:1909.01051, 2019
24* 2019
Hit-and-Run for Sampling and Planning in Non-Convex Spaces
Y Abbasi-Yadkori, PL Bartlett, V Gabillon, A Malek
AISTATS, Artificial Intelligence and Statistics, 2017
22 2017
Large-Scale Optimistic Adaptive Submodularity
V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan
AAAI, Association for the Advancement of Artificial Intelligence, 2014
17 2014
Near Minimax Optimal Players for the Finite-Time 3-Expert Prediction Problem
Y Abbasi-Yadkori, PL Bartlett, V Gabillon
NIPS, Neural Information Processing Systems, 2017
14 2017
Rollout allocation strategies for classification-based policy iteration
V Gabillon, A Lazaric, M Ghavamzadeh
Workshop on Reinforcement Learning and Search in Very Large Spaces, 2010
14 2010
Derivative-Free & Order-Robust Optimisation
V Gabillon, R Tutunov, M Valko, HB Ammar
AISTATS, Artificial Intelligence and Statistics, 2020
8* 2020
Scale-free adaptive planning for deterministic dynamics & discounted rewards
P Bartlett, V Gabillon, J Healey, M Valko
ICML, International Conference on Machine Learning, 495-504, 2019
7 2019
Multi-media content-recommender system that learns how to elicit user preferences
VF Gabillon, B Kveton, B Eriksson
US Patent App. 14/489,703, 2016
5 2016
Machine learning tools for online advertisement
V Gabillon
Technical report, INRIA Lille, France, 2009
5 2009
Adaptive multi-fidelity optimization with fast learning rates
C Fiegel, V Gabillon, M Valko
International Conference on Artificial Intelligence and Statistics, 3493-3502, 2020
4 2020