CVPR 2026 Workshop · Exploring multi-shot, multi-minute video generation with human–AI co-creation
This workshop investigates how far current AI models and systems are from producing long-form, multi-shot videos that truly satisfy real users and creators. While recent generative models excel at short clips, long video creation introduces challenges in narrative structure, temporal consistency, multimodal alignment, and scalable human-in-the-loop editing.
A key motivation of this workshop is grounded in Bilibili’s large-scale creator ecosystem, where millions of creators produce narrative-driven, multi-minute videos spanning animation, education, entertainment, and storytelling. Such real-world production settings expose gaps between current academic benchmarks and practical creator needs, particularly in maintaining cross-shot consistency, narrative coherence, and efficient human–AI collaboration over long temporal horizons.
Leveraging production-grade data, creator workflows, and user feedback from the Bilibili platform, this workshop aims to bridge academic research with real-world impact. By bringing together researchers, industry engineers, and content creators, we seek to define technical roadmaps, creator-centered evaluation protocols, and reproducible benchmarks that measure not only visual quality, but also narrative satisfaction, usability, and audience engagement in AI-assisted long video creation.
We invite submissions that advance methods, datasets, systems, or evaluations for generating multi-shot, multi-minute videos that are coherent, controllable, and ethically responsible.
Submission Platform: All submissions will be handled via OpenReview.
Paper Format: Submissions should follow the standard CVPR 2026 workshop paper format and are limited to 4 pages (excluding references).
| Event | Date |
|---|---|
| Call for Papers Opens | December 22, 2025 |
| Submission Deadline | March 1, 2026 |
| Reviews Released | April 4, 2026 |
| Camera-ready Deadline | April 11, 2026 |
| Time | Session |
|---|---|
| 09:00 – 09:15 | Opening & Motivation |
| 09:15 – 09:45 | Invited Talk 1 |
| 09:45 – 10:15 | Invited Talk 2 |
| 10:15 – 10:30 | Coffee Break |
| 10:30 – 11:30 | Oral Paper Presentations |
| 11:30 – 12:15 | Panel Discussion |
| 12:15 – 12:30 | Closing Remarks |
The workshop will feature invited talks from leading researchers and practitioners in video generation, multimedia understanding, and AI-assisted creation.
Additional speakers may be announced.
AI-assisted long video creation has the potential to democratize storytelling and empower creators. At the same time, it raises concerns around misinformation, deepfakes, copyright, and labor impacts. This workshop explicitly encourages ethical analysis, mitigation strategies, and responsible deployment of generative video technologies.