Motionctrl: A unified and flexible motion controller for video generation Z Wang, Z Yuan, X Wang, Y Li, T Chen, M Xia, P Luo, Y Shan ACM SIGGRAPH 2024 Conference Papers, 1-11, 2024 | 119 | 2024 |
Smartedit: Exploring complex instruction-based image editing with multimodal large language models Y Huang, L Xie, X Wang, Z Yuan, X Cun, Y Ge, J Zhou, C Dong, R Huang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 60 | 2024 |
Make encoder great again in 3d gan inversion through geometry and occlusion-aware encoding Z Yuan, Y Zhu, Y Li, H Liu, C Yuan Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 33 | 2023 |
One model to edit them all: Free-form text-driven image manipulation with semantic modulations Y Zhu, H Liu, Y Song, Z Yuan, X Han, C Yuan, Q Chen, J Wang Advances in Neural Information Processing Systems 35, 25146-25159, 2022 | 33 | 2022 |
Miradata: A large-scale video dataset with long durations and structured captions X Ju, Y Gao, Z Zhang, Z Yuan, X Wang, A Zeng, Y Xiong, Q Xu, Y Shan arXiv preprint arXiv:2407.06358, 2024 | 28 | 2024 |
Customnet: Object customization with variable-viewpoints in text-to-image diffusion models Z Yuan, M Cao, X Wang, Z Qi, C Yuan, Y Shan Proceedings of the 32nd ACM International Conference on Multimedia, 10976-10984, 2024 | 21* | 2024 |
Image conductor: Precision control for interactive video synthesis Y Li, X Wang, Z Zhang, Z Wang, Z Yuan, L Xie, Y Zou, Y Shan arXiv preprint arXiv:2406.15339, 2024 | 11 | 2024 |
Blind Face Restoration under Extreme Conditions: Leveraging 3D-2D Prior Fusion for Superior Structural and Texture Recovery Z Chen, L Lu, Z Yuan, Y Zhu, Y Li, C Yuan, W Deng Proceedings of the AAAI Conference on Artificial Intelligence 38 (2), 1263-1271, 2024 | 2 | 2024 |
Improving Video Generation with Human Feedback J Liu, G Liu, J Liang, Z Yuan, X Liu, M Zheng, X Wu, Q Wang, W Qin, ... arXiv preprint arXiv:2501.13918, 2025 | | 2025 |
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning Y Huang, Z Yuan, Q Liu, Q Wang, X Wang, R Zhang, P Wan, D Zhang, ... arXiv preprint arXiv:2501.04698, 2025 | | 2025 |
Consistent Human Image and Video Generation with Spatially Conditioned Diffusion M Cao, C Mou, Z Yuan, X Wang, Z Zhang, Y Shan, Y Zheng arXiv preprint arXiv:2412.14531, 2024 | | 2024 |
3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation X Fu, X Liu, X Wang, S Peng, M Xia, X Shi, Z Yuan, P Wan, D Zhang, ... arXiv preprint arXiv:2412.07759, 2024 | | 2024 |
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints J Bai, M Xia, X Wang, Z Yuan, X Fu, Z Liu, H Hu, P Wan, D Zhang arXiv preprint arXiv:2412.07760, 2024 | | 2024 |
Self-Conditioned Diffusion Model for Consistent Human Image and Video Synthesis M Cao, C Mou, X Wang, Z Yuan, Z Zhang, Y Shan, Y Zheng | | |