Improving Long Document Topic Segmentation Models With Enhanced Coherence Modeling
Hai Yu, Chong Deng, Qinglin Zhang, Jiaqing Liu, Qian Chen, Wen Wang

TL;DR
This paper enhances supervised neural models for long document topic segmentation by improving coherence modeling through structural and semantic similarity tasks, leading to significant performance gains.
Contribution
It introduces TSSP and CSSL methods that better capture coherence, improving segmentation accuracy over previous state-of-the-art models.
Findings
Significantly outperforms previous SOTA methods.
Improves F1 score by 3.42 points on WIKI-727K.
Reduces P_k by 1.11 points on WIKI-727K.
Abstract
Topic segmentation is critical for obtaining structured documents and improving downstream tasks such as information retrieval. Due to its ability of automatically exploring clues of topic shift from abundant labeled data, recent supervised neural models have greatly promoted the development of long document topic segmentation, but leaving the deeper relationship between coherence and topic segmentation underexplored. Therefore, this paper enhances the ability of supervised models to capture coherence from both logical structure and semantic similarity perspectives to further improve the topic segmentation performance, proposing Topic-aware Sentence Structure Prediction (TSSP) and Contrastive Semantic Similarity Learning (CSSL). Specifically, the TSSP task is proposed to force the model to comprehend structural information by learning the original relations between adjacent sentences in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Text and Document Classification Technologies
MethodsHow do I get a human at Expedia immediately? (2025-2026) · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · How do I complain to Expedia?*ComplainByAgent · WordPiece · Dropout · Weight Decay · Linear Warmup With Linear Decay · How do I make a claim with Expedia?*Make FastClaimService
