Semi-Supervised Variational User Identity Linkage via Noise-Aware Self-Learning
Chaozhuo Li, Senzhang Wang, Zheng Liu, Xing Xie, Lei Chen, Philip S., Yu

TL;DR
This paper introduces NSVUIL, a semi-supervised model for user identity linkage across social platforms that uses Gaussian distributions and noise-aware self-learning to improve accuracy amid uncertain data and limited annotations.
Contribution
The paper proposes a novel variational model with Gaussian embeddings and a noise-aware self-learning module to enhance semi-supervised user identity linkage.
Findings
Gaussian distribution representations improve identity modeling.
Noise-aware self-learning effectively filters pseudo-label noise.
Model outperforms existing semi-supervised linkage methods.
Abstract
User identity linkage, which aims to link identities of a natural person across different social platforms, has attracted increasing research interest recently. Existing approaches usually first embed the identities as deterministic vectors in a shared latent space, and then learn a classifier based on the available annotations. However, the formation and characteristics of real-world social platforms are full of uncertainties, which makes these deterministic embedding based methods sub-optimal. In addition, it is intractable to collect sufficient linkage annotations due to the tremendous gaps between different platforms. Semi-supervised models utilize the unlabeled data to help capture the intrinsic data distribution, which are more promising in practical usage. However, the existing semi-supervised linkage methods heavily rely on the heuristically defined similarity measurements to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Mental Health via Writing
