Covariate-assisted graph matching

Trisha Dawn; Jes\'us Arroyo

arXiv:2512.11761·stat.ME·December 15, 2025

Covariate-assisted graph matching

Trisha Dawn, Jes\'us Arroyo

PDF

Open Access

TL;DR

This paper introduces two covariate-assisted graph matching methods that improve alignment accuracy by integrating node and edge features, with theoretical guarantees and practical applications demonstrated.

Contribution

The paper proposes novel covariate-assisted seeded graph matching algorithms that incorporate auxiliary information, enhancing accuracy and scalability over existing methods.

Findings

01

Improved matching accuracy using covariate information.

02

Theoretical guarantees for model estimation and exact recovery.

03

Successful application to academic genealogy and collaboration networks.

Abstract

Data integration is essential across diverse domains, from historical records to biomedical research, facilitating joint statistical inference. A crucial initial step in this process involves merging multiple data sources based on matching individual records, often in the absence of unique identifiers. When the datasets are networks, this problem is typically addressed through graph matching methodologies. For such cases, auxiliary features or covariates associated with nodes or edges can be instrumental in achieving improved accuracy. However, most existing graph matching techniques do not incorporate this information, limiting their performance against non-identifiable and erroneous matches. To overcome these limitations, we propose two novel covariate-assisted seeded graph matching methods, where a partial alignment for a set of nodes, called seeds, is known. The first one solves a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks · Data Quality and Management