A Physical Coherence Benchmark for Evaluating Video Generation Models   via Optical Flow-guided Frame Prediction

Yongfan Chen; Xiuwen Zhu; Tianyu Li

arXiv:2502.05503·cs.CV·March 6, 2025

A Physical Coherence Benchmark for Evaluating Video Generation Models via Optical Flow-guided Frame Prediction

Yongfan Chen, Xiuwen Zhu, Tianyu Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces PhyCoBench, a benchmark for assessing physical coherence in video generation, along with PhyCoPredictor, an automated evaluation model that aligns well with human judgments.

Contribution

The paper presents a new benchmark and an automated evaluation model specifically designed to measure physical coherence in generated videos.

Findings

01

PhyCoPredictor closely matches human evaluations.

02

The benchmark covers 7 categories of physical principles.

03

The dataset and tools are publicly available on GitHub.

Abstract

Recent advances in video generation models demonstrate their potential as world simulators, but they often struggle with videos deviating from physical laws, a key concern overlooked by most text-to-video benchmarks. We introduce a benchmark designed specifically to assess the Physical Coherence of generated videos, PhyCoBench. Our benchmark includes 120 prompts covering 7 categories of physical principles, capturing key physical laws observable in video content. We evaluated four state-of-the-art (SoTA) T2V models on PhyCoBench and conducted manual assessments. Additionally, we propose an automated evaluation model: PhyCoPredictor, a diffusion model that generates optical flow and video frames in a cascade manner. Through a consistency evaluation comparing automated and manual sorting, the experimental results show that PhyCoPredictor currently aligns most closely with human…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Jeckinchen/PhyCoBench
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image and Video Quality Assessment · Advanced Image Processing Techniques

MethodsDiffusion