Word Sense Induction with Neural biLM and Symmetric Patterns

Asaf Amrami; Yoav Goldberg

arXiv:1808.08518·cs.CL·August 31, 2018

Word Sense Induction with Neural biLM and Symmetric Patterns

Asaf Amrami, Yoav Goldberg

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach for Word Sense Induction using a recurrent language model combined with dynamic symmetric patterns, significantly improving state-of-the-art results on the SemEval 2013 WSI task.

Contribution

It replaces ngram-based models with a recurrent language model and introduces dynamic symmetric patterns for better sense induction.

Findings

01

Surpasses previous state-of-the-art on SemEval 2013 WSI

02

Recurrent LM provides more accurate substitute vectors

03

Dynamic symmetric patterns enhance query effectiveness

Abstract

An established method for Word Sense Induction (WSI) uses a language model to predict probable substitutes for target words, and induces senses by clustering these resulting substitute vectors. We replace the ngram-based language model (LM) with a recurrent one. Beyond being more accurate, the use of the recurrent LM allows us to effectively query it in a creative way, using what we call dynamic symmetric patterns. The combination of the RNN-LM and the dynamic symmetric patterns results in strong substitute vectors for WSI, allowing to surpass the current state-of-the-art on the SemEval 2013 WSI shared task by a large margin.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

asafamr/SymPatternWSI
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems