Probabilistic end-to-end noise correction for learning with noisy labels K Yi, J Wu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 548 | 2019 |
Seed-x: Multimodal models with unified multi-granularity comprehension and generation Y Ge, S Zhao, J Zhu, Y Ge, K Yi, L Song, C Li, X Ding, Y Shan arXiv preprint arXiv:2404.14396, 2024 | 77 | 2024 |
Masked image modeling with denoising contrast K Yi, Y Ge, X Li, S Yang, D Li, J Wu, Y Shan, X Qie arXiv preprint arXiv:2205.09616, 2022 | 53 | 2022 |
mc-BEiT: Multi-choice Discretization for Image BERT Pre-training X Li, Y Ge, K Yi, Z Hu, Y Shan, LY Duan European Conference on Computer Vision, 231-246, 2022 | 44 | 2022 |
Vit-lens: Towards omni-modal representations W Lei, Y Ge, K Yi, J Zhang, D Gao, D Sun, Y Ge, Y Shan, MZ Shou Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 16 | 2024 |
Rils: Masked visual reconstruction in language semantic space S Yang, Y Ge, K Yi, D Li, Y Shan, X Qie, X Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 7 | 2023 |
Vit-lens-2: Gateway to omni-modal intelligence W Lei, Y Ge, K Yi, J Zhang, D Gao, D Sun, Y Ge, Y Shan, MZ Shou CoRR, 2023 | 5 | 2023 |
Masked visual reconstruction in language semantic space S Yang, Y Ge, K Yi, D Li, Y Shan, X Qie, X Wang arXiv preprint arXiv:2301.06958 2, 2023 | 3 | 2023 |
ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights W Lei, Y Ge, J Zhang, D Sun, K Yi, Y Shan, MZ Shou arXiv preprint arXiv:2308.10185, 2023 | 1 | 2023 |
PENCIL: Deep learning with noisy labels K Yi, GH Wang, J Wu arXiv preprint arXiv:2202.08436, 2022 | 1 | 2022 |
SEED-X: Multimodal Models in Real World Y Ge, S Zhao, J Zhu, Y Ge, K Yi, L Song, C Li, Y Shan | | |
A Robustly and Effectively Optimized Pretraining Approach for Masked Autoencoder R Xu, Y Ge, K Yi, X XU, Y Wang, YC Chen, H Chen, Y Shan | | |