Yijun Yang
Tencent AI Lab
Verified email at tencent.com
Title
Cited by
Year
Active disturbance rejection control for small unmanned helicopters via Levy flight-based pigeon-inspired optimization
D Zhang, H Duan, Y Yang
Aircraft Engineering and Aerospace Technology 89 (6), 946-952, 2017
Cited by 34, 2017
Embodied multi-modal agent trained by an LLM from a parallel TextWorld
Y Yang, T Zhou, K Li, D Tao, L Li, L Shen, X He, J Jiang, Y Shi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by 32, 2024
Pareto Policy Pool for Model-based Offline Reinforcement Learning
Y Yang, J Jiang, T Zhou, J Ma, Y Shi
International Conference on Learning Representations, 2022
Cited by 22, 2022
Continual Task Allocation in Meta-Policy Network via Sparse Prompting
Y Yang, T Zhou, J Jiang, G Long, Y Shi
Fortieth International Conference on Machine Learning (ICML), 2023
Cited by 7, 2023
PyPop7: A pure-Python library for population-based black-box optimization
Q Duan, G Zhou, C Shao, Z Wang, M Feng, Y Huang, Y Tan, Y Yang, ...
Journal of Machine Learning Research 25 (296), 1-28, 2024
Cited by 6, 2024
Critical Developments and Applications of Swarm Intelligence
Y Shi
IGI Global, 2018
Cited by 6, 2018
Pendulum-like oscillation controller for UAV based on Lévy-flight pigeon-inspired optimization and LQR
Z Liu, H Duan, Y Yang, X Hu
2016 IEEE Symposium Series on Computational Intelligence (SSCI), 1-6, 2016
Cited by 6, 2016
Collective learning of low-memory matrix adaptation for large-scale black-box optimization
Q Duan, G Zhou, C Shao, Y Yang, Y Shi
International Conference on Parallel Problem Solving from Nature, 281-294, 2022
Cited by 5, 2022
Distributed evolution strategies for large-scale optimization
Q Duan, G Zhou, C Shao, Y Yang, Y Shi
Proceedings of the Genetic and Evolutionary Computation Conference Companion …, 2022
Cited by 3, 2022
BiES: adaptive policy optimization for model-based offline reinforcement learning
Y Yang, J Jiang, Z Wang, Q Duan, Y Shi
Australasian Joint Conference on Artificial Intelligence, 570-581, 2022
Cited by 3, 2022
WALL-E: World alignment by rule learning improves world model-based LLM agents
S Zhou, T Zhou, Y Yang, G Long, D Ye, J Jiang, C Zhang
arXiv preprint arXiv:2410.07484, 2024
Cited by 2, 2024
MuEP: A Multimodal Benchmark for Embodied Planning with Foundation Models
K Li, B Yu, Q Zheng, Y Zhan, Y Zhang, T Zhang, Y Yang, Y Chen, L Sun, ...
International Joint Conferences on Artificial Intelligence (IJCAI), 129-138, 2024
Cited by 1, 2024
Building Versatile Reinforcement Learning Agents with Prior Knowledge
Y Yang
PQDT-Global, 2023
Cited by 1, 2023
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models
H Cai, Y Yang, W Hu
arXiv preprint arXiv:2502.00698, 2025
2025
System-2 Mathematical Reasoning via Enriched Instruction Tuning
H Cai, Y Yang, Z Li
arXiv preprint arXiv:2412.16964, 2024
2024
Distributed Population-Based Simultaneous Perturbation Stochastic Approximation for Fine-Tuning Large Language Models
Y Tan, Y Huang, Q Duan, Y Yang, Y Shi
Pacific Rim International Conference on Artificial Intelligence, 21-26, 2024
2024
Controllable Pareto Trade-off between Fairness and Accuracy
Y Du, J Zhao, Y Yang, T Zhou
Articles 1–17