Xiaoyu Chen
Title
Cited by
Year
Q-learning with UCB exploration is sample efficient for infinite-horizon MDP
K Dong, Y Wang, X Chen, L Wang
International Conference on Learning Representations, 2020
Cited by: 112, Year: 2020
Distributed bandit learning: Near-optimal regret with efficient communication
Y Wang, J Hu, X Chen, L Wang
International Conference on Learning Representations, 2020
Cited by: 86, Year: 2020
Understanding Domain Randomization for Sim-to-real Transfer
X Chen, J Hu, C Jin, L Li, L Wang
International Conference on Learning Representations, 2022
Cited by: 57, Year: 2022
Near-Optimal Representation Learning for Linear Bandits and Linear RL
J Hu, X Chen, C Jin, L Li, L Wang
International Conference on Machine Learning, 4349-4358, 2021
Cited by: 44, Year: 2021
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation
X Chen, H Zhong, Z Yang, Z Wang, L Wang
International Conference on Machine Learning, 3773-3793, 2022
Cited by: 31, Year: 2022
Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL
X Chen, J Hu, L Li, L Wang
International Conference on Learning Representations, 2021
Cited by: 21, Year: 2021
Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver
X Chen, J Hu, LF Yang, L Wang
International Conference on Learning Representations, 2022
Cited by: 17, Year: 2022
(Locally) Differentially Private Combinatorial Semi-Bandits
X Chen, K Zheng, Z Zhou, Y Yang, W Chen, L Wang
International Conference on Machine Learning, 1757-1767, 2020
Cited by: 3, Year: 2020
On the power of pre-training for generalization in RL: Provable benefits and hardness
H Ye, X Chen, L Wang, SS Du
International Conference on Machine Learning, 39770-39800, 2023
Cited by: 1, Year: 2023
Articles 1–9