Victor Gabillon
Victor Gabillon
Affiliazione sconosciuta
Email verificata su huawei.com - Home page
Titolo
Citata da
Citata da
Anno
Best arm identification: A unified approach to fixed budget and fixed confidence
V Gabillon, M Ghavamzadeh, A Lazaric
NIPS, Neural Information Processing Systems, 2012
1882012
Multi-bandit best arm identification
V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck
NIPS, Neural Information Processing Systems, 2011
872011
Approximate dynamic programming finally performs well in the game of Tetris
V Gabillon, M Ghavamzadeh, B Scherrer
NIPS, Neural Information Processing systems, 2013
612013
Approximate modified policy iteration and its application to the game of Tetris.
B Scherrer, M Ghavamzadeh, V Gabillon, B Lesner, M Geist
JMLR, Journal of Machine Learning Research 16, 2015
602015
Approximate modified policy iteration
B Scherrer, V Gabillon, M Ghavamzadeh, M Geist
ICML, International Conference on Machine Learning, 2012
432012
Adaptive submodular maximization in bandit setting
V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan
NIPS, Neural Information Processing Systems, 2013
392013
Classification-based policy iteration with a critic
V Gabillon, A Lazaric, M Ghavamzadeh, B Scherrer
ICML, International Conference on Machine Learning, 2011
302011
Improved learning complexity in combinatorial pure exploration bandits
V Gabillon, A Lazaric, M Ghavamzadeh, R Ortner, P Bartlett
AISTATS, Artificial Intelligence and Statistics, 2016
262016
Best of both worlds: Stochastic & adversarial best-arm identification
Y Abbasi-Yadkori, P Bartlett, V Gabillon, A Malek, M Valko
COLT, Conference on Learning Theory, 2018
122018
Rollout allocation strategies for classification-based policy iteration
V Gabillon, A Lazaric, M Ghavamzadeh
Workshop on Reinforcement Learning and Search in Very Large Spaces, 2010
122010
Large-Scale Optimistic Adaptive Submodularity.
V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan
AAAI, Association for the Advancement of Artificial Intelligence, 2014
102014
A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption
PL Bartlett, V Gabillon, M Valko
ALT, Algorithmic Learning Theory, 2019
92019
Hit-and-Run for Sampling and Planning in Non-Convex Spaces
Y Abbasi-Yadkori, PL Bartlett, V Gabillon, A Malek
AISTATS, Artificial Intelligence and Statistics, 2017
82017
MANAS: multi-agent neural architecture search
FM Carlucci, P Esperanca, R Tutunov, M Singh, V Gabillon, A Yang, H Xu, ...
arXiv preprint arXiv:1909.01051, 2019
72019
Near Minimax Optimal Players for the Finite-Time 3-Expert Prediction Problem
Y Abbasi-Yadkori, PL Bartlett, V Gabillon
NIPS, Neural Information Processing Systems, 2017
62017
Scale-free adaptive planning for deterministic dynamics & discounted rewards
P Bartlett, V Gabillon, J Healey, M Valko
ICML, International Conference on Machine Learning, 495-504, 2019
32019
Machine learning tools for online advertisement
V Gabillon
Technical report, INRIA, Lille, France, 2009
32009
Multi-media content-recommender system that learns how to elicit user preferences
VF Gabillon, B Kveton, B Eriksson
US Patent App. 14/489,703, 2016
22016
Asymptotic performance analysis of PCA algorithms based on the weighted subspace criterion
JP Delmas, V Gabillon
ICASSP, Acoustics, Speech and Signal Processing, 2009
22009
Derivative-Free & Order-Robust Optimisation
V Gabillon, R Tutunov, M Valko, HB Ammar
AISTATS, Artificial Intelligence and Statistics, 2020
12020
Il sistema al momento non pu˛ eseguire l'operazione. Riprova pi¨ tardi.
Articoli 1–20