Learning from Synthetic Data via Provenance-Based Input Gradient Guidance

Koshiro Nagano; Ryo Fujii; Ryo Hachiuma; Fumiaki Sato; Taiki Sekii; Hideo Saito

arXiv:2604.02946·cs.CV·April 6, 2026

Learning from Synthetic Data via Provenance-Based Input Gradient Guidance

Koshiro Nagano, Ryo Fujii, Ryo Hachiuma, Fumiaki Sato, Taiki Sekii, Hideo Saito

PDF

TL;DR

This paper introduces a provenance-guided input gradient method that enhances model robustness by focusing on target regions during training with synthetic data, reducing reliance on spurious correlations.

Contribution

It proposes a novel framework leveraging provenance information to explicitly guide models toward target regions, improving discrimination and robustness across various tasks.

Findings

01

Effective in weakly supervised object localization

02

Improves spatio-temporal action localization accuracy

03

Enhances image classification robustness

Abstract

Learning methods using synthetic data have attracted attention as an effective approach for increasing the diversity of training data while reducing collection costs, thereby improving the robustness of model discrimination. However, many existing methods improve robustness only indirectly through the diversification of training samples and do not explicitly teach the model which regions in the input space truly contribute to discrimination; consequently, the model may learn spurious correlations caused by synthesis biases and artifacts. Motivated by this limitation, this paper proposes a learning framework that uses provenance information obtained during the training data synthesis process, indicating whether each region in the input space originates from the target object, as an auxiliary supervisory signal to promote the acquisition of representations focused on target regions.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.