Automatic forming lists of semantically related terms based on texts rating in the corpus with hyperlinks and categories (In Russian)
A. Krizhanovsky

TL;DR
This paper presents Synarcher, a program that uses an adapted HITS algorithm to find semantically related terms in structured text corpora like Wikipedia, visualizing results as interactive graphs for enhanced exploration.
Contribution
The paper introduces a novel application of the HITS algorithm for synonym and related term search in structured text corpora, along with a dedicated program architecture and evaluation.
Findings
Effective synonym search in the Wikipedia corpus
Interactive graph visualization of related terms
Potential for extending search queries and building synonym dictionaries
Abstract
The paper presents an adapted HITS algorithm for synonym search, the program architecture, and an evaluation of the program on test examples. The Synarcher program was developed to search for synonyms (and related terms) in a text corpus with a special structure (Wikipedia). Search results are presented as a graph, which can be explored and searched interactively. The proposed algorithm could be applied to search query expansion and to building synonym dictionaries.
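For orientation, the sketch below shows the standard HITS iteration (hub and authority scores) on a tiny link graph. It is not the adapted algorithm from the paper, whose details are not given on this page; the function name `hits`, the `iterations` parameter, and the page titles in `links` are all illustrative assumptions.

```python
from math import sqrt


def hits(links, iterations=50):
    """Standard HITS. `links` maps a page to the list of pages it links to."""
    # Collect every page that appears as a source or as a link target.
    nodes = set(links)
    for targets in links.values():
        nodes.update(targets)

    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}

    for _ in range(iterations):
        # Authority update: sum the hub scores of pages linking to each page.
        new_auth = {n: 0.0 for n in nodes}
        for src, targets in links.items():
            for dst in targets:
                new_auth[dst] += hub[src]
        norm = sqrt(sum(v * v for v in new_auth.values())) or 1.0
        auth = {n: v / norm for n, v in new_auth.items()}

        # Hub update: sum the authority scores of the pages each page links to.
        new_hub = {n: 0.0 for n in nodes}
        for src, targets in links.items():
            for dst in targets:
                new_hub[src] += auth[dst]
        norm = sqrt(sum(v * v for v in new_hub.values())) or 1.0
        hub = {n: v / norm for n, v in new_hub.items()}

    return hub, auth


# Hypothetical toy graph: page titles and links are made up for illustration.
links = {
    "Robot": ["Automaton", "Android"],
    "Cyborg": ["Android", "Automaton"],
    "Android": ["Robot"],
}
hub, auth = hits(links)

# Pages sorted by authority score, highest first.
ranking = sorted(auth, key=auth.get, reverse=True)
```

In this toy graph, "Automaton" and "Android" are linked from the same two pages, so they receive equal authority scores and rank above "Robot". Pages that share many strong hubs in this way are the kind of candidates a related-term search would surface.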
Taxonomy
Topics: Natural Language Processing Techniques · Advanced Text Analysis Techniques · Topic Modeling
