Identifying User Survival Types via Clustering of Censored Social Network Data

S Chandra Mouli; Abhishek Naik; Bruno Ribeiro; Jennifer Neville

arXiv:1703.03401 · cs.SI · March 10, 2017 · 1 cites

TL;DR

This paper introduces a decision-tree-based clustering method that identifies distinct user survival types in social networks, handles censored data, and outperforms competing methods on a simple survival prediction task.

Contribution

It presents a novel algorithm that normalizes p-values globally to cluster users by survival characteristics in large social network datasets.

Findings

01

The proposed method outperforms competing clustering techniques on a simple survival prediction task.

02

Clusters identified are significantly associated with different survival distributions.

03

The approach is effective for large, censored social network data.

Abstract

The goal of cluster analysis in survival data is to identify clusters that are decidedly associated with the survival outcome. Previous research has explored this problem primarily in the medical domain with relatively small datasets, but the need for such a clustering methodology could arise in other domains with large datasets, such as social networks. Concretely, we wish to identify different survival classes in a social network by clustering the users based on their lifespan in the network. In this paper, we propose a decision-tree-based algorithm that uses a global normalization of $p$-values to identify clusters with significantly different survival distributions. We evaluate the clusters from our model with the help of a simple survival prediction task and show that our model outperforms other competing methods.
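
To make "significantly different survival distributions" concrete, here is a minimal sketch of the standard two-sample log-rank test, a common way to compare two groups of right-censored lifetimes. It is an illustration only and is not the paper's algorithm: the abstract does not say which test the decision tree uses or how the global normalization of $p$-values is computed. The function name `logrank_test` and the toy data are hypothetical.

```python
import math

def logrank_test(times_a, events_a, times_b, events_b):
    """Two-sample log-rank test (illustrative sketch, not the paper's method).

    times_*  : observed lifetimes of users in each group
    events_* : 1 if the user was observed leaving, 0 if right-censored
    Returns (chi-square statistic with 1 degree of freedom, p-value).
    """
    # Pool both groups, tagging each record with its group id (0 or 1).
    pooled = [(t, e, 0) for t, e in zip(times_a, events_a)]
    pooled += [(t, e, 1) for t, e in zip(times_b, events_b)]

    # Only times at which at least one event occurred contribute.
    event_times = sorted({t for t, e, _ in pooled if e == 1})

    observed_a = expected_a = variance = 0.0
    for t in event_times:
        # Users still at risk just before time t, per group.
        n_a = sum(1 for u, _, g in pooled if g == 0 and u >= t)
        n_b = sum(1 for u, _, g in pooled if g == 1 and u >= t)
        # Events occurring exactly at time t, per group.
        d_a = sum(1 for u, e, g in pooled if g == 0 and u == t and e == 1)
        d_b = sum(1 for u, e, g in pooled if g == 1 and u == t and e == 1)
        n, d = n_a + n_b, d_a + d_b

        observed_a += d_a
        expected_a += d * n_a / n          # expected events under the null
        if n > 1:                          # hypergeometric variance term
            variance += n_a * n_b * d * (n - d) / (n * n * (n - 1))

    if variance == 0.0:
        return 0.0, 1.0
    chi2 = (observed_a - expected_a) ** 2 / variance
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value

# Toy data (hypothetical): short-lived users vs. long-lived users,
# where some long-lived users are still active (censored).
short_t, short_e = [1, 2, 3, 4, 5], [1, 1, 1, 1, 1]
long_t, long_e = [10, 11, 12, 13, 14], [1, 0, 1, 0, 1]
chi2, p = logrank_test(short_t, short_e, long_t, long_e)
print(f"chi2={chi2:.2f}, p={p:.4f}")
```

A small p-value here suggests the two groups follow different survival distributions. A tree-based clustering method could apply a test of this kind to candidate splits, but how the paper selects splits and normalizes $p$-values across them is not specified in this summary.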

Taxonomy

Topics: Complex Network Analysis Techniques · Advanced Clustering Algorithms Research · Data Mining Algorithms and Applications