Unbiased Scene Graph Generation from Biased Training

Kaihua Tang; Yulei Niu; Jianqiang Huang; Jiaxin Shi; Hanwang Zhang

arXiv:2002.11949·cs.CV·October 28, 2025·39 cites

Unbiased Scene Graph Generation from Biased Training

Kaihua Tang, Yulei Niu, Jianqiang Huang, Jiaxin Shi, Hanwang Zhang

PDF

Open Access 5 Repos 1 Video

TL;DR

This paper introduces a causal inference-based framework for unbiased scene graph generation that effectively distinguishes and removes bad biases, leading to significant improvements on benchmark datasets.

Contribution

It proposes a novel causal inference approach using Total Direct Effect to debias scene graph generation, applicable across various models.

Findings

01

Significant performance improvements on Visual Genome benchmark.

02

Effective removal of bad biases while preserving good context.

03

Framework is model-agnostic and widely applicable.

Abstract

Today's scene graph generation (SGG) task is still far from practical, mainly due to the severe training bias, e.g., collapsing diverse "human walk on / sit on / lay on beach" into "human on beach". Given such SGG, the down-stream tasks such as VQA can hardly infer better scene structures than merely a bag of objects. However, debiasing in SGG is not trivial because traditional debiasing methods cannot distinguish between the good and bad bias, e.g., good context prior (e.g., "person read book" rather than "eat") and bad long-tailed bias (e.g., "near" dominating "behind / in front of"). In this paper, we present a novel SGG framework based on causal inference but not the conventional likelihood. We first build a causal graph for SGG, and perform traditional biased training with the graph. Then, we propose to draw the counterfactual causality from the trained graph to infer the effect…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Unbiased Scene Graph Generation From Biased Training· youtube

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques

MethodsCausal inference