Domain Adaptation Principal Component Analysis: base linear method for   learning with out-of-distribution data

Evgeny M Mirkes; Jonathan Bac; Aziz Fouch\'e; Sergey V. Stasenko,; Andrei Zinovyev; Alexander N. Gorban

arXiv:2208.13290·cs.LG·November 10, 2023·1 cites

Domain Adaptation Principal Component Analysis: base linear method for learning with out-of-distribution data

Evgeny M Mirkes, Jonathan Bac, Aziz Fouch\'e, Sergey V. Stasenko,, Andrei Zinovyev, Alexander N. Gorban

PDF

Open Access 1 Repo

TL;DR

This paper introduces DAPCA, a linear method for domain adaptation that reduces data divergence by embedding source and target datasets into a common space, offering a simpler alternative to neural network-based approaches.

Contribution

The paper proposes DAPCA, a novel linear domain adaptation technique that generalizes PCA with weighted data point pairs, providing an efficient and convergent solution.

Findings

01

DAPCA effectively reduces domain divergence in benchmark datasets.

02

DAPCA improves downstream classification performance.

03

DAPCA is useful in biomedical single-cell data analysis.

Abstract

Domain adaptation is a popular paradigm in modern machine learning which aims at tackling the problem of divergence (or shift) between the labeled training and validation datasets (source domain) and a potentially large unlabeled dataset (target domain). The task is to embed both datasets red into a common space in which the source dataset is informative for training while the divergence between source and target is minimized. The most popular domain adaptation solutions are based on training neural networks that combine classification and adversarial learning modules, frequently making them both data-hungry and difficult to train. We present a method called Domain Adaptation Principal Component Analysis (DAPCA) that identifies a linear reduced data representation useful for solving the domain adaptation task. DAPCA algorithm introduces positive and negative weights between pairs of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mirkes/dapca
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Cancer-related molecular mechanisms research · Single-cell and spatial transcriptomics