Animatediff: Animate your personalized text-to-image diffusion models without specific tuning Y Guo, C Yang, A Rao, Z Liang, Y Wang, Y Qiao, M Agrawala, D Lin, ... International Conference on Learning Representations (ICLR) 2024, 2023 | 343 | 2023 |
Lavie: High-quality video generation with cascaded latent diffusion models Y Wang, X Chen, X Ma, S Zhou, Z Huang, Y Wang, C Yang, Y He, J Yu, ... arXiv preprint arXiv:2309.15103, 2023 | 124 | 2023 |
Sparsectrl: Adding sparse controls to text-to-video diffusion models Y Guo, C Yang, A Rao, M Agrawala, D Lin, B Dai The 18th European Conference on Computer Vision (ECCV) 2024, 2023 | 34 | 2023 |
Cameractrl: Enabling camera control for text-to-video generation H He, Y Xu, Y Guo, G Wetzstein, B Dai, H Li, C Yang arXiv preprint arXiv:2404.02101, 2024 | 17 | 2024 |
Dynamic storyboard generation in an engine-based virtual environment for video production A Rao, X Jiang, Y Guo, L Xu, L Yang, L Jin, D Lin, B Dai ACM SIGGRAPH 2023 Posters, 1-2, 2023 | 14 | 2023 |
Temporal and contextual transformer for multi-camera editing of TV shows A Rao, X Jiang, S Wang, Y Guo, Z Liu, B Dai, L Pang, X Wu, D Lin, L Jin arXiv preprint arXiv:2210.08737, 2022 | 5 | 2022 |
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation Z Wang, Y Li, Y Zeng, Y Fang, Y Guo, W Liu, J Tan, K Chen, T Xue, B Dai, ... arXiv preprint arXiv:2407.17438, 2024 | | 2024 |