Statistical Inference for Feature Selection after Optimal   Transport-based Domain Adaptation

Nguyen Thang Loi; Duong Tan Loc; Vo Nguyen Le Duy

arXiv:2410.15022·stat.ML·October 22, 2024

Statistical Inference for Feature Selection after Optimal Transport-based Domain Adaptation

Nguyen Thang Loi, Duong Tan Loc, Vo Nguyen Le Duy

PDF

Open Access

TL;DR

This paper introduces SFS-DA, a statistical method for feature selection under domain adaptation that guarantees false positive control and improves true positive detection, validated through experiments.

Contribution

The paper develops a novel statistical testing framework for feature selection under domain adaptation using selective inference, ensuring FPR control and enhanced detection.

Findings

01

SFS-DA controls FPR below the specified level.

02

SFS-DA improves true positive detection rate.

03

Experimental results validate theoretical guarantees.

Abstract

Feature Selection (FS) under domain adaptation (DA) is a critical task in machine learning, especially when dealing with limited target data. However, existing methods lack the capability to guarantee the reliability of FS under DA. In this paper, we introduce a novel statistical method to statistically test FS reliability under DA, named SFS-DA (statistical FS-DA). The key strength of SFS-DA lies in its ability to control the false positive rate (FPR) below a pre-specified level $α$ (e.g., 0.05) while maximizing the true positive rate. Compared to the literature on statistical FS, SFS-DA presents a unique challenge in addressing the effect of DA to ensure the validity of the inference on FS results. We overcome this challenge by leveraging the Selective Inference (SI) framework. Specifically, by carefully examining the FS process under DA whose operations can be characterized by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning