Domain Adaptation Principal Component Analysis: base linear method for learning with out-of-distribution data
Evgeny M Mirkes, Jonathan Bac, Aziz Fouch\'e, Sergey V. Stasenko,, Andrei Zinovyev, Alexander N. Gorban

TL;DR
This paper introduces DAPCA, a linear method for domain adaptation that reduces data divergence by embedding source and target datasets into a common space, offering a simpler alternative to neural network-based approaches.
Contribution
The paper proposes DAPCA, a novel linear domain adaptation technique that generalizes PCA with weighted data point pairs, providing an efficient and convergent solution.
Findings
DAPCA effectively reduces domain divergence in benchmark datasets.
DAPCA improves downstream classification performance.
DAPCA is useful in biomedical single-cell data analysis.
Abstract
Domain adaptation is a popular paradigm in modern machine learning which aims at tackling the problem of divergence (or shift) between the labeled training and validation datasets (source domain) and a potentially large unlabeled dataset (target domain). The task is to embed both datasets red into a common space in which the source dataset is informative for training while the divergence between source and target is minimized. The most popular domain adaptation solutions are based on training neural networks that combine classification and adversarial learning modules, frequently making them both data-hungry and difficult to train. We present a method called Domain Adaptation Principal Component Analysis (DAPCA) that identifies a linear reduced data representation useful for solving the domain adaptation task. DAPCA algorithm introduces positive and negative weights between pairs of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Cancer-related molecular mechanisms research · Single-cell and spatial transcriptomics
