LGDE: Local Graph-based Dictionary Expansion
Juni Schindler, Sneha Jha, Xixuan Zhang, Kilian Buehling, Annett Heft, Mauricio Barahona

TL;DR
LGDE is a novel method that uses manifold learning and network science to discover semantic neighborhoods of words, enabling more effective dictionary expansion for information retrieval tasks.
Contribution
LGDE introduces a graph diffusion-based approach for local community detection in word similarity graphs, enhancing dictionary expansion beyond direct similarities.
Findings
LGDE improves keyword expansion performance over traditional methods.
The method effectively captures complex nonlinear semantic relationships.
LGDE is validated on multiple corpora and a real-world communication science case.
Abstract
We present Local Graph-based Dictionary Expansion (LGDE), a method for data-driven discovery of the semantic neighbourhood of words using tools from manifold learning and network science. At the heart of LGDE lies the creation of a word similarity graph from the geometry of word embeddings followed by local community detection based on graph diffusion. The diffusion in the local graph manifold allows the exploration of the complex nonlinear geometry of word embeddings to capture word similarities based on paths of semantic association, over and above direct pairwise similarities. Exploiting such semantic neighbourhoods enables the expansion of dictionaries of pre-selected keywords, an important step for tasks in information retrieval, such as database queries and online data collection. We validate LGDE on two user-generated English-language corpora and show that LGDE enriches the list…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Topic Modeling
MethodsDiffusion
