
DANCER: Dance ANimation via Condition Enhancement and Rendering with diffusion model

Published: October 31, 2025 | arXiv ID: 2510.27169v1

By: Yucheng Xing, Jinxing Yin, Xiaodong Liu

Potential Business Impact:

Makes realistic dancing videos from a picture.

Business Areas:

Motion Capture, Media and Entertainment, Video

Recently, diffusion models have shown impressive ability in visual generation tasks. Beyond static images, increasing research attention has been drawn to the generation of realistic videos. Video generation not only places higher demands on quality but also brings the challenge of ensuring video continuity. Among video generation tasks, human-involved content, such as human dancing, is even more difficult to generate because of the high degrees of freedom associated with human motion. In this paper, we propose a novel framework named DANCER (Dance ANimation via Condition Enhancement and Rendering with Diffusion Model) for realistic single-person dance synthesis, based on the most recent stable video diffusion model. As video generation is generally guided by a reference image and a video sequence, we introduce two modules into our framework to fully benefit from both inputs. More specifically, we design an Appearance Enhancement Module (AEM) that focuses more on the details of the reference image during generation, and we extend the motion guidance through a Pose Rendering Module (PRM) that captures pose conditions from extra domains. To further improve the generation capability of our model, we also collect a large amount of video data from the Internet and build a novel dataset, TikTok-3K, to enhance model training. The effectiveness of the proposed model is evaluated through extensive experiments on real-world datasets, where it outperforms state-of-the-art methods. All data and code will be released upon acceptance.
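
The abstract describes two conditioning paths: appearance from the reference image and motion from a pose sequence. The following is a minimal sketch of that data flow only, not the authors' implementation. All names (`FrameCondition`, `enhance_appearance`, `render_pose`, `build_conditions`) are hypothetical, and plain list concatenation stands in for whatever the AEM and PRM actually compute.

```python
from dataclasses import dataclass
from typing import List, Sequence

Vector = List[float]


@dataclass
class FrameCondition:
    """Conditioning for one output frame (hypothetical structure)."""
    appearance: Vector  # reference-image features, shared by every frame
    pose: Vector        # pose features for this frame


def enhance_appearance(reference: Vector, details: Vector) -> Vector:
    # Placeholder for an appearance-enhancement step:
    # append extra detail features to the base reference features.
    return list(reference) + list(details)


def render_pose(pose_domains: Sequence[Vector]) -> Vector:
    # Placeholder for a pose-rendering step:
    # merge pose cues that come from several domains into one vector.
    merged: Vector = []
    for domain in pose_domains:
        merged.extend(domain)
    return merged


def build_conditions(
    reference: Vector,
    details: Vector,
    pose_sequence: Sequence[Sequence[Vector]],
) -> List[FrameCondition]:
    # One appearance vector is reused across frames; pose varies per frame.
    appearance = enhance_appearance(reference, details)
    return [
        FrameCondition(appearance=appearance, pose=render_pose(domains))
        for domains in pose_sequence
    ]
```

In this sketch the appearance condition is computed once and shared, while each frame receives its own pose condition. That mirrors the abstract's split between a single reference image and a per-frame motion sequence; a real model would feed these conditions into the diffusion denoiser.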

Reframing Music-Driven 2D Dance Pose Generation as Multi-Channel Image Generation

CV and Pattern Recognition

Makes dancing robots move to music perfectly.

12 Dec 2025 0

88%

TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model

CV and Pattern Recognition

Creates long, smooth talking animations from pictures.

30 Nov 2025 1

88%

Dress&Dance: Dress up and Dance as You Like It - Technical Preview

CV and Pattern Recognition

Lets you try on clothes in a video.

28 Aug 2025 0


Country of Origin

🇺🇸 United States

Page Count

12 pages
