Incentive Compatible Pareto Alignment for Multi-Source Large Graphs

Jian Liang; Fangrui Lv; Di Liu; Zehui Dai; Xu Tian; Shuang Li; Fei; Wang; Han Li

arXiv:2112.02792·stat.ML·December 7, 2021

Incentive Compatible Pareto Alignment for Multi-Source Large Graphs

Jian Liang, Fangrui Lv, Di Liu, Zehui Dai, Xu Tian, Shuang Li, Fei, Wang, Han Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces ICPA, a novel method for multi-source large-scale entity matching that optimizes source alignments via Pareto front optimization to reduce negative transfer and improve learning effectiveness.

Contribution

The paper proposes the Incentive Compatible Pareto Alignment (ICPA) method, which uniquely combines Pareto front optimization with alignment to address entangled challenges in multi-source large-scale data.

Findings

01

ICPA effectively reduces negative transfer in large-scale entity matching.

02

Empirical results show ICPA outperforms baseline methods on four datasets.

03

Online A/B tests confirm ICPA's practical benefits in production environments.

Abstract

In this paper, we focus on learning effective entity matching models over multi-source large-scale data. For real applications, we relax typical assumptions that data distributions/spaces, or entity identities are shared between sources, and propose a Relaxed Multi-source Large-scale Entity-matching (RMLE) problem. Challenges of the problem include 1) how to align large-scale entities between sources to share information and 2) how to mitigate negative transfer from joint learning multi-source data. What's worse, one practical issue is the entanglement between both challenges. Specifically, incorrect alignments may increase negative transfer; while mitigating negative transfer for one source may result in poorly learned representations for other sources and then decrease alignment accuracy. To handle the entangled challenges, we point out that the key is to optimize information sharing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anonmlresearcher/icpa
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Advanced Graph Neural Networks