Graph-Community Detection for Cross-Document Topic Segment Relationship Identification
Pedro Mota, Maxine Eskenazi, Luisa Coheur

TL;DR
This paper introduces a graph-community detection method to identify cross-document topic segment relationships, improving multi-document browsing by clustering similar content across related texts.
Contribution
It presents a novel approach combining graph-community detection with different weighting and mapping functions to find equivalent topic segments across multiple documents.
Findings
Better discovery of equivalence relationships in educational materials
Effective clustering in social dialogue scenarios
Approach enhances multi-document content access
Abstract
In this paper we propose a graph-community detection approach to identify cross-document relationships at the topic segment level. Given a set of related documents, we automatically find these relationships by clustering segments with similar content (topics). In this context, we study how different weighting mechanisms influence the discovery of word communities that relate to the different topics found in the documents. Finally, we test different mapping functions to assign topic segments to word communities, determining which topic segments are considered equivalent. By performing this task it is possible to enable efficient multi-document browsing, since when a user finds relevant content in one document we can provide access to similar topics in other documents. We deploy our approach in two different scenarios. One is an educational scenario where equivalence relationships…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Text Analysis Techniques · Topic Modeling
