node2bits: Compact Time- and Attribute-aware Node Representations for User Stitching
Di Jin, Mark Heimann, Ryan Rossi, and Danai Koutra

TL;DR
This paper introduces node2bits, a novel framework for user identity stitching that creates compact, binary node representations in heterogeneous networks, significantly improving accuracy and efficiency over traditional methods.
Contribution
node2bits is the first application-independent method that uses binary hashcodes and feature-based temporal walks for efficient user stitching in large-scale web networks.
Findings
Outperforms traditional techniques by up to 5.16% in F1 score.
Uses only 1.56% of storage compared to real-valued embeddings.
Effectively captures multi-dimensional node features with binary representations.
Abstract
Identity stitching, the task of identifying and matching various online references (e.g., sessions over different devices and timespans) to the same user in real-world web services, is crucial for personalization and recommendations. However, traditional user stitching approaches, such as grouping or blocking, require quadratic pairwise comparisons between a massive number of user activities, thus posing both computational and storage challenges. Recent works, which are often application-specific, heuristically seek to reduce the amount of comparisons, but they suffer from low precision and recall. To solve the problem in an application-independent way, we take a heterogeneous network-based approach in which users (nodes) interact with content (e.g., sessions, websites), and may have attributes (e.g., location). We propose node2bits, an efficient framework that represents…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Human Pose and Action Recognition
