The frame-level leakage trap: rethinking evaluation protocols for intrinsic image decomposition, with source-separable uncertainty as a case study
Jihwan Woo

TL;DR
This paper highlights the importance of scene-level evaluation splits in intrinsic image decomposition, demonstrates the benefits of source-separable uncertainty modeling, and provides empirical evidence for improved evaluation protocols and uncertainty quantification.
Contribution
It advocates for scene-level splits over frame-level splits in evaluation, introduces a source-separable uncertainty model, and verifies its effectiveness and interpretability in intrinsic image decomposition.
Findings
Frame-level splits inflate performance metrics by up to 10 dB.
Scene-level splits provide more accurate evaluation protocols.
Source-separable uncertainty correlates with residual errors and improves downstream tasks.
Abstract
Evaluation protocols for learned intrinsic image decomposition on MPI Sintel have been inconsistent. Several prior works split the dataset by frames, which allows spatially similar frames of the same scene to appear in both train and test partitions. We quantify this leakage effect for the first time, across three architectures: a frame-level split inflates test R_PSNR by 1.6 to 2.0 dB (p less than 0.01 for all three, paired t-test across 3 seeds) relative to a scene-level split, confirming an architecture-independent protocol effect. A three-point gradient (random/temporal/scene) shows the gap is continuous, and under extended training the frame-level inflation exceeds 10 dB. We advocate scene-level splits as the community standard and provide reference numbers for six representative models under this protocol. As a case study within the corrected protocol, we present a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
