Wesley Chung
Wesley Chung
Email verificata su mail.mcgill.ca
Titolo
Citata da
Citata da
Anno
Two-timescale networks for nonlinear value function approximation
W Chung, S Nath, A Joseph, M White
International conference on learning representations, 2018
212018
Importance resampling for off-policy prediction
M Schlegel, W Chung, D Graves, J Qian, M White
arXiv preprint arXiv:1906.04328, 2019
142019
High-confidence error estimates for learned value functions
T Sajed, W Chung, M White
arXiv preprint arXiv:1808.09127, 2018
62018
Beyond variance reduction: Understanding the true impact of baselines on policy optimization
W Chung, V Thomas, MC Machado, NL Roux
arXiv preprint arXiv:2008.13773, 2020
12020
Incrementally Learning Functions of the Return
B Bennett, W Chung, M Zaheer, V Liu
arXiv preprint arXiv:1907.04651, 2019
12019
Importance Resampling for Off-policy Policy Evaluation
M Schlegel, W Chung, D Graves, M White
2018
Il sistema al momento non pu eseguire l'operazione. Riprova pi tardi.
Articoli 1–6