AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Minheng Ni, Chenfei Wu, Huaying Yuan, Zhengyuan Yang, Ming Gong,, Lijuan Wang, Zicheng Liu, Wangmeng Zuo, Nan Duan

TL;DR
AutoDirector is an interactive AI framework that automates online scheduling and multi-sensory composition, enabling efficient, user-guided creation of complex multi-sensory films with diverse elements.
Contribution
It introduces a novel online scheduling and interactive framework for multi-sensory film production, addressing dependency management and iterative user feedback.
Findings
Supports long shots, special effects, music, dubbing, lip-syncing
Improves efficiency of multi-sensory film production
Demonstrates AI-human collaboration in film directing
Abstract
With the advancement of generative models, the synthesis of different sensory elements such as music, visuals, and speech has achieved significant realism. However, the approach to generate multi-sensory outputs has not been fully explored, limiting the application on high-value scenarios such as of directing a film. Developing a movie director agent faces two major challenges: (1) Lack of parallelism and online scheduling with production steps: In the production of multi-sensory films, there are complex dependencies between different sensory elements, and the production time for each element varies. (2) Diverse needs and clear communication demands with users: Users often cannot clearly express their needs until they see a draft, which requires human-computer interaction and iteration to continually adjust and optimize the film content based on user feedback. To address these issues,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOlfactory and Sensory Function Studies
