Fixed versus Dynamic Co-Occurrence Windows in TextRank Term Weights for Information Retrieval
Wei Lu, Qikai Cheng, Christina Lioma

TL;DR
This paper investigates the impact of using fixed versus dynamic co-occurrence windows in TextRank for IR, demonstrating that dynamic windows aligned with document structure improve early search precision.
Contribution
It introduces a method for dynamically adjusting co-occurrence windows in TextRank based on sentence and paragraph boundaries, enhancing IR performance.
Findings
Dynamic co-occurrence windows improve early precision in IR
Adjusting co-occurrence context benefits term weighting
TextRank with dynamic windows outperforms fixed window approaches
Abstract
TextRank is a variant of PageRank typically used in graphs that represent documents, and where vertices denote terms and edges denote relations between terms. Quite often the relation between terms is simple term co-occurrence within a fixed window of k terms. The output of TextRank when applied iteratively is a score for each vertex, i.e. a term weight, that can be used for information retrieval (IR) just like conventional term frequency based term weights. So far, when computing TextRank term weights over co- occurrence graphs, the window of term co-occurrence is al- ways ?xed. This work departs from this, and considers dy- namically adjusted windows of term co-occurrence that fol- low the document structure on a sentence- and paragraph- level. The resulting TextRank term weights are used in a ranking function that re-ranks 1000 initially returned search results in order to improve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Data Visualization and Analytics · Information Retrieval and Search Behavior
