Pseudo Numerical Methods for Diffusion Models on Manifolds L Liu, Y Ren, Z Lin, Z Zhao International Conference on Learning Representations (ICLR 2022), 2022 | 363 | 2022 |
Make-an-audio: Text-to-audio generation with prompt-enhanced diffusion models R Huang, J Huang, D Yang, Y Ren, L Liu, M Li, Z Ye, J Liu, X Yin, Z Zhao International Conference on Machine Learning, 13916-13932, 2023 | 133 | 2023 |
Ptqd: Accurate post-training quantization for diffusion models Y He, L Liu, J Liu, W Wu, H Zhou, B Zhuang Advances in Neural Information Processing Systems 36, 2024 | 16 | 2024 |
Make-a-voice: Unified voice synthesis with discrete representation R Huang, C Zhang, Y Wang, D Yang, L Liu, Z Ye, Z Jiang, C Weng, ... arXiv preprint arXiv:2305.19269, 2023 | 14 | 2023 |
Era-solver: Error-robust adams solver for fast sampling of diffusion probabilistic models S Li, L Liu, Z Chai, R Li, X Tan arXiv preprint arXiv:2301.12935, 2023 | 9 | 2023 |
Detector guidance for multi-object text-to-image generation L Liu, Z Zhang, Y Ren, R Huang, X Yin, Z Zhao arXiv preprint arXiv:2306.02236, 2023 | 6 | 2023 |
Diffusion denoising process for perceptron bias in out-of-distribution detection L Liu, Y Ren, X Cheng, R Huang, C Li, Z Zhao arXiv preprint arXiv:2211.11255, 2022 | 6 | 2022 |
Extending multi-modal contrastive representations Z Wang, Z Zhang, L Liu, Y Zhao, H Huang, T Jin, Z Zhao arXiv preprint arXiv:2310.08884, 2023 | 2 | 2023 |
Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers H Huang, Z Wang, R Huang, L Liu, X Cheng, Y Zhao, T Jin, Z Zhao arXiv preprint arXiv:2312.08168, 2023 | 1 | 2023 |
Molecule-Space: Free Lunch in Unified Multimodal Space via Knowledge Fusion Z Wang, Z Zhang, X Cheng, R Huang, L Liu, Z Ye, H Huang, Y Zhao, T Jin, ... arXiv preprint arXiv:2405.04883, 2024 | | 2024 |
Listen to Motion: Robustly Learning Correlated Audio-Visual Representations Z Wang, X Cheng, L Tang, L Liu, Y Zhao, T Jin, C Cai, W HongFa, W Liu, ... | | |