Ziyu Wang
DeepMind
Verified email at google.com - Homepage
Title, Cited by, Year
Taking the human out of the loop: A review of Bayesian optimization
B Shahriari, K Swersky, Z Wang, RP Adams, N De Freitas
Proceedings of the IEEE 104 (1), 148-175, 2015
Cited by: 985, Year: 2015
Dueling network architectures for deep reinforcement learning
Z Wang, T Schaul, M Hessel, H Van Hasselt, M Lanctot, N De Freitas
arXiv preprint arXiv:1511.06581, 2015
Cited by: 921, Year: 2015
Emergence of locomotion behaviours in rich environments
N Heess, D TB, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, T Erez, ...
arXiv preprint arXiv:1707.02286, 2017
Cited by: 301, Year: 2017
Sample efficient actor-critic with experience replay
Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ...
arXiv preprint arXiv:1611.01224, 2016
Cited by: 286, Year: 2016
Deep fried convnets
Z Yang, M Moczulski, M Denil, N de Freitas, A Smola, L Song, Z Wang
Proceedings of the IEEE International Conference on Computer Vision, 1476-1483, 2015
Cited by: 202, Year: 2015
Bayesian optimization in high dimensions via random embeddings
Z Wang, M Zoghi, F Hutter, D Matheson, N De Freitas
Twenty-Third International Joint Conference on Artificial Intelligence, 2013
Cited by: 182, Year: 2013
Bayesian optimization in a billion dimensions via random embeddings
Z Wang, F Hutter, M Zoghi, D Matheson, N de Freitas
Journal of Artificial Intelligence Research 55, 361-387, 2016
Cited by: 149, Year: 2016
AlphaStar: Mastering the real-time strategy game StarCraft II
O Vinyals, I Babuschkin, J Chung, M Mathieu, M Jaderberg, ...
DeepMind blog, 2, 2019
Cited by: 117, Year: 2019
Adaptive Hamiltonian and Riemann manifold Monte Carlo
Z Wang, S Mohamed, N de Freitas
International conference on machine learning, 1462-1470, 2013
Cited by: 96*, Year: 2013
Reinforcement and imitation learning for diverse visuomotor skills
Y Zhu, Z Wang, J Merel, A Rusu, T Erez, S Cabi, S Tunyasuvunakool, ...
arXiv preprint arXiv:1802.09564, 2018
Cited by: 78, Year: 2018
Playing hard exploration games by watching YouTube
Y Aytar, T Pfaff, D Budden, T Paine, Z Wang, N de Freitas
Advances in Neural Information Processing Systems, 2930-2941, 2018
Cited by: 78, Year: 2018
Parallel multiscale autoregressive density estimation
S Reed, A van den Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, ...
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
Cited by: 78, Year: 2017
Learning an embedding space for transferable robot skills
K Hausman, JT Springenberg, Z Wang, N Heess, M Riedmiller
Cited by: 69, Year: 2018
Robust imitation of diverse behaviors
Z Wang, JS Merel, SE Reed, N de Freitas, G Wayne, N Heess
Advances in Neural Information Processing Systems, 5320-5329, 2017
Cited by: 66, Year: 2017
Adaptive MCMC with Bayesian optimization
N Mahendran, Z Wang, F Hamze, N De Freitas
Artificial Intelligence and Statistics, 751-760, 2012
Cited by: 64, Year: 2012
Learning human behaviors from motion capture by adversarial imitation
J Merel, Y Tassa, D TB, S Srinivasan, J Lemmon, Z Wang, G Wayne, ...
arXiv preprint arXiv:1707.02201, 2017
Cited by: 63, Year: 2017
Bayesian Multi-Scale Optimistic Optimization
Z Wang, B Shakibi, L Jin, N de Freitas
Cited by: 55, Year: 2014
Theoretical analysis of Bayesian optimisation with unknown Gaussian process hyper-parameters
Z Wang, N de Freitas
arXiv preprint arXiv:1406.7758, 2014
Cited by: 32, Year: 2014
An entropy search portfolio for Bayesian optimization
B Shahriari, Z Wang, MW Hoffman, A Bouchard-Côté, N de Freitas
arXiv preprint arXiv:1406.4625, 2014
Cited by: 32, Year: 2014
Grandmaster level in StarCraft II using multi-agent reinforcement learning
O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...
Nature 575 (7782), 350-354, 2019
Cited by: 26, Year: 2019