The Matching Principle: A Geometric Theory of Loss Functions for Nuisance-Robust Representation Learning
Vishal Rajput

TL;DR
This paper introduces a geometric theory of loss functions for nuisance-robust representation learning, unifying various robustness methods under the matching principle and providing theoretical and empirical validation.
Contribution
It formalizes the matching principle as a unified statistical framework for robustness, introduces the Trajectory Deviation Index, and offers a falsifiable theory with empirical tests.
Findings
Closed-form optimality in linear-Gaussian models
Range coverage is necessary for quadratic Jacobian penalties
Most tested robustness methods pass the proposed geometric tests
Abstract
Robustness, domain adaptation, photometric and occlusion invariance, compositional generalisation, temporal robustness, alignment safety, and classical anisotropic regularisation are usually treated as separate problems with separate method families. This paper argues that much of their shared structure is one statistical problem: estimate the covariance of label-preserving deployment nuisance, then regularise the encoder Jacobian along a matrix whose range covers that covariance (the matching principle). CORAL, adversarial training, IRM, augmentation, metric learning, Jacobian penalties, and alignment-style constraints are different estimators of that object, not independent robustness tricks. In the linear-Gaussian model we prove closed-form optimality (Theorem A), including cube-root water-filling within the matched range; necessity of range coverage for quadratic Jacobian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
