Real-Time Construction Algorithm of Co-Occurrence Network Based on Inverted Index
Jiahao Cheng

TL;DR
This paper introduces an optimized algorithm based on inverted indexing and breadth-first search for constructing co-occurrence networks efficiently, significantly reducing time and memory usage compared to traditional methods, especially for large-scale text data.
Contribution
The paper presents a novel, efficient algorithm for co-occurrence network construction that outperforms traditional traversal algorithms in speed and memory consumption.
Findings
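The optimized algorithm constructs the co-occurrence network in significantly less time than the traditional traversal algorithm.
Memory usage is substantially lower than with the traditional traversal algorithm.
Experiments on the CSL large-scale Chinese scientific literature dataset confirm both improvements.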
Abstract
Co-occurrence networks are an important method in the field of natural language processing and text mining for discovering semantic relationships within texts. However, the traditional traversal algorithm for constructing co-occurrence networks has high time complexity and space complexity when dealing with large-scale text data. In this paper, we propose an optimized algorithm based on inverted indexing and breadth-first search to improve the efficiency of co-occurrence network construction and reduce memory consumption. Firstly, the traditional traversal algorithm is analyzed, and its performance issues in constructing co-occurrence networks are identified. Then, the detailed implementation process of the optimized algorithm is presented. Subsequently, the CSL large-scale Chinese scientific literature dataset is used for experimental validation, comparing the performance of the…
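The abstract names two ingredients, an inverted index and breadth-first search, without giving the implementation details here. The following Python sketch shows one plausible way they could fit together; it is not the paper's algorithm. It assumes each document is already tokenized into a list of terms and that an edge weight is the number of documents in which two terms co-occur. The function names (build_inverted_index, cooccurrence_bfs, cooccurrence_traversal) and the max_depth parameter are hypothetical, introduced only for illustration.

```python
from collections import Counter, defaultdict, deque
from itertools import combinations


def build_inverted_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, terms in enumerate(docs):
        for term in terms:
            index[term].add(doc_id)
    return index


def cooccurrence_bfs(docs, seed, max_depth=2):
    """Build the co-occurrence network around `seed` layer by layer.

    Only documents that contain the term being expanded are read, so the
    full corpus is never scanned pair by pair. (Illustrative assumption,
    not the paper's exact procedure.)
    """
    index = build_inverted_index(docs)
    edges = {}
    if seed not in index:
        return edges
    visited = {seed}
    queue = deque([(seed, 0)])
    while queue:
        term, depth = queue.popleft()
        if depth >= max_depth:
            continue
        # Count, for every neighbour, the documents it shares with `term`.
        counts = Counter()
        for doc_id in index[term]:
            counts.update(set(docs[doc_id]) - {term})
        for other, weight in counts.items():
            edges[frozenset((term, other))] = weight
            if other not in visited:
                visited.add(other)
                queue.append((other, depth + 1))
    return edges


def cooccurrence_traversal(docs):
    """Baseline: enumerate every term pair in every document."""
    edges = Counter()
    for terms in docs:
        for a, b in combinations(sorted(set(terms)), 2):
            edges[frozenset((a, b))] += 1
    return dict(edges)
```

Under these assumptions, the breadth-first version returns a subset of the baseline's edges with identical weights, limited to the terms reachable from the seed within max_depth steps. That locality is one way an on-demand construction could save both time and memory on a large corpus.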
Taxonomy
Topics: Advanced Graph Neural Networks · Semantic Web and Ontologies
Methods: Circular Smooth Label
