Causally Steered Diffusion for Automated Video Counterfactual Generation

Nikos Spyrou; Athanasios Vlontzos; Paraskevas Pegios; Thomas Melistas; Nefeli Gkouti; Yannis Panagakis; Giorgos Papanastasiou; Sotirios A. Tsaftaris

arXiv:2506.14404·cs.CV·August 6, 2025

Causally Steered Diffusion for Automated Video Counterfactual Generation

Nikos Spyrou, Athanasios Vlontzos, Paraskevas Pegios, Thomas Melistas, Nefeli Gkouti, Yannis Panagakis, Giorgos Papanastasiou, Sotirios A. Tsaftaris

PDF

Open Access 1 Repo

TL;DR

This paper introduces CSVC, a framework that uses causal knowledge and prompt optimization to generate causally faithful counterfactual videos, improving realism and causal consistency without needing system fine-tuning.

Contribution

It presents a novel causally faithful counterfactual video generation method that leverages causal graphs and vision-language models, compatible with black-box video editing systems.

Findings

01

Achieves state-of-the-art causal effectiveness in counterfactual video generation.

02

Maintains high visual quality and temporal consistency.

03

Works across diverse real-world facial videos.

Abstract

Adapting text-to-image (T2I) latent diffusion models (LDMs) to video editing has shown strong visual fidelity and controllability, but challenges remain in maintaining causal relationships inherent to the video data generating process. Edits affecting causally dependent attributes often generate unrealistic or misleading outcomes if these relationships are ignored. In this work, we introduce a causally faithful framework for counterfactual video generation, formulated as an Out-of-Distribution (OOD) prediction problem. We embed prior causal knowledge by encoding the relationships specified in a causal graph into text prompts and guide the generation process by optimizing these prompts using a vision-language model (VLM)-based textual loss. This loss encourages the latent space of the LDMs to capture OOD variations in the form of counterfactuals, effectively steering generation toward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nysp78/counterfactual-video-generation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Advanced Steganography and Watermarking Techniques · Generative Adversarial Networks and Image Synthesis

MethodsCounterfactuals Explanations · Diffusion