PipeFlow: Pipelined Processing and Motion-Aware Frame Selection for Long-Form Video Editing
Mustafa Munir, Md Mostafijur Rahman, Kartikeya Bhardwaj, Paul Whatmough, Radu Marculescu

TL;DR
PipeFlow introduces a scalable, motion-aware, pipelined approach for long-form video editing that significantly reduces computational costs and speeds up processing by dividing videos into segments and interpolating skipped frames.
Contribution
The paper presents a novel pipelined video editing method that combines motion analysis, parallel processing, and frame interpolation to enable efficient editing of arbitrarily long videos.
Findings
Achieves up to 9.6X speedup over TokenFlow.
Achieves up to 31.7X speedup over DMT.
Enables linear scaling of editing time with video length.
Abstract
Long-form video editing poses unique challenges due to the exponential increase in the computational cost from joint editing and Denoising Diffusion Implicit Models (DDIM) inversion across extended sequences. To address these limitations, we propose PipeFlow, a scalable, pipelined video editing method that introduces three key innovations: First, based on a motion analysis using Structural Similarity Index Measure (SSIM) and Optical Flow, we identify and propose to skip editing of frames with low motion. Second, we propose a pipelined task scheduling algorithm that splits a video into multiple segments and performs DDIM inversion and joint editing in parallel based on available GPU memory. Lastly, we leverage a neural network-based interpolation technique to smooth out the border frames between segments and interpolate the previously skipped frames. Our method uniquely scales to longer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Video Coding and Compression Technologies · Generative Adversarial Networks and Image Synthesis
