Top-k Representative Search for Comparative Tree Summarization
Yuqi Chen, Xin Huang, Bilian Chen

TL;DR
This paper introduces a novel method for comparative hierarchical data summarization of two trees with the same topology, balancing commonality and differences using a new distance measure and an efficient greedy algorithm.
Contribution
It proposes a new framework for comparative tree summarization that handles two trees simultaneously, including a novel distance measure and an extension for different structures.
Findings
The SVDT algorithm achieves high-quality summaries with approximation guarantees.
The method effectively balances similarity and difference in summaries.
Experiments show superior efficiency and effectiveness over existing methods.
Abstract
Data summarization aims at utilizing a small-scale summary to represent massive datasets as a whole, which is useful for visualization and information sipped generation. However, most existing studies of hierarchical summarization only work on \emph{one single tree} by selecting representative nodes, which neglects an important problem of comparative summarization on two trees. In this paper, given two trees with the same topology structure and different node weights, we aim at finding representative nodes, where nodes summarize the common relationship between them and nodes highlight significantly different sub-trees meanwhile satisfying . To optimize summarization results, we introduce a scaling coefficient for balancing the summary view between two sub-trees in terms of similarity and difference. Additionally, we propose a novel definition based on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Semantic Web and Ontologies · Advanced Database Systems and Queries
