NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation
Lin Li, Jun Xiao, Hanrong Shi, Hanwang Zhang, Yi Yang and, Wei Liu, Long Chen

TL;DR
This paper introduces NICEST, a novel approach for scene graph generation that corrects noisy labels and employs a multi-teacher training strategy, significantly improving model robustness and generalization on a new challenging benchmark.
Contribution
NICEST is a new framework that detects and reassigns noisy labels and uses multi-teacher knowledge distillation for unbiased scene graph generation.
Findings
NICEST improves performance across various backbone architectures.
The approach effectively reduces bias and noise in training data.
Results demonstrate strong generalization on the VG-OOD benchmark.
Abstract
Nearly all existing scene graph generation (SGG) models have overlooked the ground-truth annotation qualities of mainstream SGG datasets, i.e., they assume: 1) all the manually annotated positive samples are equally correct; 2) all the un-annotated negative samples are absolutely background. In this paper, we argue that neither of the assumptions applies to SGG: there are numerous noisy ground-truth predicate labels that break these two assumptions and harm the training of unbiased SGG models. To this end, we propose a novel NoIsy label CorrEction and Sample Training strategy for SGG: NICEST. Specifically, it consists of two parts: NICE and NIST, which rule out these noisy label issues by generating high-quality samples and the effective training strategy, respectively. NICE first detects noisy samples and then reassigns them more high-quality soft predicate labels. NIST is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization
MethodsTest · Normalizing Flows · Affine Coupling · Non-linear Independent Component Estimation · Knowledge Distillation
