Incentive Compatible Pareto Alignment for Multi-Source Large Graphs
Jian Liang, Fangrui Lv, Di Liu, Zehui Dai, Xu Tian, Shuang Li, Fei, Wang, Han Li

TL;DR
This paper introduces ICPA, a novel method for multi-source large-scale entity matching that optimizes source alignments via Pareto front optimization to reduce negative transfer and improve learning effectiveness.
Contribution
The paper proposes the Incentive Compatible Pareto Alignment (ICPA) method, which uniquely combines Pareto front optimization with alignment to address entangled challenges in multi-source large-scale data.
Findings
ICPA effectively reduces negative transfer in large-scale entity matching.
Empirical results show ICPA outperforms baseline methods on four datasets.
Online A/B tests confirm ICPA's practical benefits in production environments.
Abstract
In this paper, we focus on learning effective entity matching models over multi-source large-scale data. For real applications, we relax typical assumptions that data distributions/spaces, or entity identities are shared between sources, and propose a Relaxed Multi-source Large-scale Entity-matching (RMLE) problem. Challenges of the problem include 1) how to align large-scale entities between sources to share information and 2) how to mitigate negative transfer from joint learning multi-source data. What's worse, one practical issue is the entanglement between both challenges. Specifically, incorrect alignments may increase negative transfer; while mitigating negative transfer for one source may result in poorly learned representations for other sources and then decrease alignment accuracy. To handle the entangled challenges, we point out that the key is to optimize information sharing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Advanced Graph Neural Networks
