Semi-Supervised Variational User Identity Linkage via Noise-Aware   Self-Learning

Chaozhuo Li; Senzhang Wang; Zheng Liu; Xing Xie; Lei Chen; Philip S.; Yu

arXiv:2112.07373·cs.SI·December 15, 2021·1 cites

Semi-Supervised Variational User Identity Linkage via Noise-Aware Self-Learning

Chaozhuo Li, Senzhang Wang, Zheng Liu, Xing Xie, Lei Chen, Philip S., Yu

PDF

Open Access

TL;DR

This paper introduces NSVUIL, a semi-supervised model for user identity linkage across social platforms that uses Gaussian distributions and noise-aware self-learning to improve accuracy amid uncertain data and limited annotations.

Contribution

The paper proposes a novel variational model with Gaussian embeddings and a noise-aware self-learning module to enhance semi-supervised user identity linkage.

Findings

01

Gaussian distribution representations improve identity modeling.

02

Noise-aware self-learning effectively filters pseudo-label noise.

03

Model outperforms existing semi-supervised linkage methods.

Abstract

User identity linkage, which aims to link identities of a natural person across different social platforms, has attracted increasing research interest recently. Existing approaches usually first embed the identities as deterministic vectors in a shared latent space, and then learn a classifier based on the available annotations. However, the formation and characteristics of real-world social platforms are full of uncertainties, which makes these deterministic embedding based methods sub-optimal. In addition, it is intractable to collect sufficient linkage annotations due to the tremendous gaps between different platforms. Semi-supervised models utilize the unlabeled data to help capture the intrinsic data distribution, which are more promising in practical usage. However, the existing semi-supervised linkage methods heavily rely on the heuristically defined similarity measurements to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Mental Health via Writing