Knowledge-based Word Sense Disambiguation using Topic Models

Devendra Singh Chaplot; Ruslan Salakhutdinov

arXiv:1801.01900·cs.CL·January 9, 2018·20 cites

Knowledge-based Word Sense Disambiguation using Topic Models

Devendra Singh Chaplot, Ruslan Salakhutdinov

PDF

Open Access

TL;DR

This paper introduces a scalable, unsupervised knowledge-based Word Sense Disambiguation method using topic models, leveraging entire documents as context and outperforming existing systems on multiple datasets.

Contribution

The paper presents a novel variant of LDA that incorporates synset information and WordNet priors, enabling efficient use of full document context for WSD.

Findings

01

Outperforms state-of-the-art unsupervised WSD systems on multiple datasets

02

Scales linearly with the number of words, allowing full document context utilization

03

Demonstrates significant accuracy improvements in disambiguation tasks

Abstract

Word Sense Disambiguation is an open problem in Natural Language Processing which is particularly challenging and useful in the unsupervised setting where all the words in any given text need to be disambiguated without using any labeled data. Typically WSD systems use the sentence or a small window of words around the target word as the context for disambiguation because their computational complexity scales exponentially with the size of the context. In this paper, we leverage the formalism of topic model to design a WSD system that scales linearly with the number of words in the context. As a result, our system is able to utilize the whole document as the context for a word to be disambiguated. The proposed method is a variant of Latent Dirichlet Allocation in which the topic proportions for a document are replaced by synset proportions. We further utilize the information in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques