Measuring similarity in co-occurrence data using ego-networks
Xiaomeng Wang, Yijun Ran, Tao Jia

TL;DR
This paper introduces a new ego-network-based similarity measure for co-occurrence data that outperforms traditional methods and offers a novel perspective by capturing changes in entity centrality.
Contribution
The paper proposes a simple, physically meaningful similarity index based on ego networks, addressing limitations of aggregated network methods in co-occurrence data analysis.
Findings
The new index outperforms traditional network-based similarity measures.
It can sometimes surpass embedding methods in effectiveness.
The measure provides a different dimension for quantifying similarities.
Abstract
The co-occurrence association is widely observed in many empirical data. Mining the information in co-occurrence data is essential for advancing our understanding of systems such as social networks, ecosystem, and brain network. Measuring similarity of entities is one of the important tasks, which can usually be achieved using a network-based approach. Here we show that traditional methods based on the aggregated network can bring unwanted in-directed relationship. To cope with this issue, we propose a similarity measure based on the ego network of each entity, which effectively considers the change of an entity's centrality from one ego network to another. The index proposed is easy to calculate and has a clear physical meaning. Using two different data sets, we compare the new index with other existing ones. We find that the new index outperforms the traditional network-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
