Towards Mitigating Systematics in Large-Scale Surveys via Few-Shot Optimal Transport-Based Feature Alignment
Sultan Hassan, Sambatra Andrianomena, Benjamin D. Wandelt

TL;DR
This paper introduces a feature alignment method using optimal transport to mitigate systematics in large-scale surveys, improving the robustness of models against distribution shifts in observational data.
Contribution
The paper presents a novel optimal transport-based feature alignment technique to address systematic contamination in large-scale survey data, especially effective with limited data and unknown ID-OOD parity.
Findings
Optimal transport effectively aligns OOD features.
Method improves robustness in large-scale survey analysis.
Limited data scenarios still benefit from the approach.
Abstract
Systematics contaminate observables, leading to distribution shifts relative to theoretically simulated signals-posing a major challenge for using pre-trained models to label such observables. Since systematics are often poorly understood and difficult to model, removing them directly and entirely may not be feasible. To address this challenge, we propose a novel method that aligns learned features between in-distribution (ID) and out-of-distribution (OOD) samples by optimizing a feature-alignment loss on the representations extracted from a pre-trained ID model. We first experimentally validate the method on the MNIST dataset using possible alignment losses, including mean squared error and optimal transport, and subsequently apply it to large-scale maps of neutral hydrogen. Our results show that optimal transport is particularly effective at aligning OOD features when parity between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMorphological variations and asymmetry · Species Distribution and Climate Change · Genomics and Phylogenetic Studies
