# Domain Adaptation on Graphs by Learning Graph Topologies: Theoretical   Analysis and an Algorithm

**Authors:** Elif Vural

arXiv: 1812.06944 · 2019-02-26

## TL;DR

This paper provides a theoretical analysis and an algorithm for domain adaptation on graphs, showing how graph topology influences classification performance and proposing a method to learn graph structures and labels simultaneously.

## Contribution

It introduces a theoretical framework with performance bounds for graph domain adaptation and proposes a novel algorithm that learns graph topologies and labels jointly.

## Key findings

- Balanced graph topologies improve classification accuracy.
- Avoiding weak and distant edges enhances generalization.
- The proposed method outperforms baseline approaches on real data.

## Abstract

Traditional machine learning algorithms assume that the training and test data have the same distribution, while this assumption does not necessarily hold in real applications. Domain adaptation methods take into account the deviations in the data distribution. In this work, we study the problem of domain adaptation on graphs. We consider a source graph and a target graph constructed with samples drawn from data manifolds. We study the problem of estimating the unknown class labels on the target graph using the label information on the source graph and the similarity between the two graphs. We particularly focus on a setting where the target label function is learnt such that its spectrum is similar to that of the source label function. We first propose a theoretical analysis of domain adaptation on graphs and present performance bounds that characterize the target classification error in terms of the properties of the graphs and the data manifolds. We show that the classification performance improves as the topologies of the graphs get more balanced, i.e., as the numbers of neighbors of different graph nodes become more proportionate, and weak edges with small weights are avoided. Our results also suggest that graph edges between too distant data samples should be avoided for good generalization performance. We then propose a graph domain adaptation algorithm inspired by our theoretical findings, which estimates the label functions while learning the source and target graph topologies at the same time. The joint graph learning and label estimation problem is formulated through an objective function relying on our performance bounds, which is minimized with an alternating optimization scheme. Experiments on synthetic and real data sets suggest that the proposed method outperforms baseline approaches.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.06944/full.md

## Figures

22 figures with captions in the complete paper: https://tomesphere.com/paper/1812.06944/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/1812.06944/full.md

---
Source: https://tomesphere.com/paper/1812.06944