SADGE: Structure and Appearance Domain Gap Estimation of Synthetic and Real Data
Patryk Bartkowiak, Bartosz Kotrys, Dominik Michels, Soren Pirk, Wojtek Palubicki

TL;DR
SADGE is a new metric that predicts how well synthetic datasets will perform on real-world computer vision tasks by analyzing the complex interplay of appearance and structural similarities.
Contribution
This paper introduces SADGE, the first metric to effectively combine appearance and geometry similarities for predicting synthetic data utility.
Findings
SADGE correlates strongly with downstream performance (Pearson r=0.88).
Combining appearance and geometry metrics outperforms individual metrics.
Fusing DINOv3 and MASt3R yields the best predictive results.
Abstract
We propose SADGE, a quantitative similarity metric that predicts the performance of synthetic image datasets for common computer vision tasks without downstream model training. Estimating whether a synthetic dataset will lead to a model that performs well on real-world data remains a bottleneck in model development. Existing evaluation metrics (e.g., PSNR, FID, CLIP) primarily measure semantic alignment between real and synthetic images (Appearance Similarity Score). Less commonly, structural similarity between images is considered to assess the domain gap (Geometric Similarity Score). However, to the best of our knowledge there exists no studies that evaluate which similarity metric is the best downstream predictor for a given synthetic dataset. In this paper, we show over a wide variety of different synthetic datasets and downstream tasks that neither appearance nor geometry alone can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
