Mutual Clustering on Comparative Texts via Heterogeneous Information Networks
Jianping Cao, Senzhang Wang (Corresponding author), Danyan Wen,, Zhaohui Peng, Philip S. Yu, Fei-yue Wang

TL;DR
This paper introduces HINT, a novel framework using heterogeneous information networks to jointly cluster comparative texts from multiple sources, effectively connecting semantically related texts despite data format differences.
Contribution
The paper proposes a new HINT framework that models multi-source texts as heterogeneous networks and introduces a mutual clustering algorithm with consistency constraints.
Findings
HINT effectively clusters comparative texts from diverse sources.
The method demonstrates robustness across multiple datasets.
Experimental results outperform baseline approaches.
Abstract
Currently, many intelligence systems contain the texts from multi-sources, e.g., bulletin board system (BBS) posts, tweets and news. These texts can be ``comparative'' since they may be semantically correlated and thus provide us with different perspectives toward the same topics or events. To better organize the multi-sourced texts and obtain more comprehensive knowledge, we propose to study the novel problem of Mutual Clustering on Comparative Texts (MCCT), which aims to cluster the comparative texts simultaneously and collaboratively. The MCCT problem is difficult to address because 1) comparative texts usually present different data formats and structures and thus they are hard to organize, and 2) there lacks an effective method to connect the semantically correlated comparative texts to facilitate clustering them in an unified way. To this aim, in this paper we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Web Data Mining and Analysis · Advanced Graph Neural Networks
