Abstract
This paper presents a method that allows users to design cinematic video shots in the context of image-to-video generation. Shot design, a critical aspect of filmmaking, involves meticulously planning both camera movements and object motions in a scene. However, enabling intuitive shot design in modern image-to-video generation systems presents two main challenges: first, effectively capturing user intentions on the motion design, where both camera movements and scene-space object motions must be specified jointly; and second, representing motion information that can be effectively utilized by a video diffusion model to synthesize the image animations. To address these challenges, we introduce MotionCanvas, a method that integrates user-driven controls into image-to-video (I2V) generation models, allowing users to control both object and camera motions in a scene-aware manner. By connecting insights from classical computer graphics and contemporary video generation techniques, we demonstrate the ability to achieve 3D-aware motion control in I2V synthesis without requiring costly 3D-related training data. MotionCanvas enables users to intuitively depict scene-space motion intentions, and translates them into spatiotemporal motion-conditioning signals for video diffusion models. We demonstrate the effectiveness of our method on a wide range of real-world image content and shot-design scenarios, highlighting its potential to enhance the creative workflows in digital content creation and adapt to various image and video editing applications.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - SIGGRAPH 2025 Conference Papers |
| Editors | Stephen N. Spencer |
| Place of Publication | New York NY USA |
| Publisher | Association for Computing Machinery (ACM) |
| Number of pages | 11 |
| ISBN (Electronic) | 9798400715402 |
| DOIs | |
| Publication status | Published - 27 Jul 2025 |
| Event | Special Interest Group on Computer Graphics and Interactive Techniques Conference 2025 - Vancouver, Canada Duration: 10 Aug 2025 → 14 Oct 2025 https://dl.acm.org/doi/proceedings/10.1145/3721238 (Proceedings) https://s2025.siggraph.org/ (Website) |
Conference
| Conference | Special Interest Group on Computer Graphics and Interactive Techniques Conference 2025 |
|---|---|
| Abbreviated title | SIGGRAPH 2025 |
| Country/Territory | Canada |
| City | Vancouver |
| Period | 10/08/25 → 14/10/25 |
| Internet address |
|
Keywords
- motion control
- Video generative models
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver