Exploring Optimal Substructure for Out-of-distribution Generalization via Feature-targeted Model Pruning
Yingchun Wang, Jingcai Guo, Song Guo, Weizhan Zhang, Jie Zhang

TL;DR
This paper introduces SFP, a feature-targeted model pruning method that enhances out-of-distribution generalization by removing reliance on spurious features, supported by theoretical analysis and superior experimental results.
Contribution
The paper proposes a novel feature-targeted pruning framework, SFP, which improves OOD generalization by focusing on invariant features and providing theoretical guarantees.
Findings
SFP outperforms state-of-the-art OOD methods with up to 4.72% accuracy improvement.
Theoretical analysis links model sparsity to OOD structure robustness.
SFP effectively reduces dependence on spurious features in biased datasets.
Abstract
Recent studies show that even highly biased dense networks contain an unbiased substructure that can achieve better out-of-distribution (OOD) generalization than the original model. Existing works usually search the invariant subnetwork using modular risk minimization (MRM) with out-domain data. Such a paradigm may bring about two potential weaknesses: 1) Unfairness, due to the insufficient observation of out-domain data during training; and 2) Sub-optimal OOD generalization, due to the feature-untargeted model pruning on the whole data distribution. In this paper, we propose a novel Spurious Feature-targeted model Pruning framework, dubbed SFP, to automatically explore invariant substructures without referring to the above weaknesses. Specifically, SFP identifies in-distribution (ID) features during training using our theoretically verified task loss, upon which, SFP can perform ID…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications · Infrastructure Maintenance and Monitoring
MethodsPruning
