GeCo: Evaluating Geometric Consistency for Video Generation via Motion and Structure
Leslie Gu, Junhwa Hur, Charles Herrmann, Fangneng Zhan, Todd Zickler, Deqing Sun, Hanspeter Pfister

TL;DR
GeCo is a geometry-based metric that detects geometric and occlusion artifacts in static scenes, aiding benchmarking and improving video generation quality.
Contribution
We introduce GeCo, a novel geometry-grounded metric that jointly detects artifacts and guides video generation without additional training.
Findings
GeCo produces interpretable, dense consistency maps revealing artifacts.
Using GeCo as guidance reduces deformation artifacts in video generation.
Benchmarking with GeCo uncovers common failure modes in recent models.
Abstract
We introduce GeCo, a geometry-grounded metric for jointly detecting geometric deformation and occlusion-inconsistency artifacts in static scenes. By fusing residual motion and depth priors, GeCo produces interpretable, dense consistency maps that reveal these artifacts. We use GeCo to systematically benchmark recent video generation models, uncovering common failure modes, and further employ it as a training-free guidance loss to reduce deformation artifacts during video generation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
