A Neural Generative Model for Joint Learning Topics and Topic-Specific   Word Embeddings

Lixing Zhu; Yulan He; Deyu Zhou

arXiv:2008.04702·cs.CL·August 12, 2020

A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings

Lixing Zhu, Yulan He, Deyu Zhou

PDF

1 Repo

TL;DR

This paper introduces a neural generative model that jointly learns topics and topic-specific word embeddings, effectively capturing polysemy and improving performance in word similarity, sense disambiguation, and topic coherence.

Contribution

It presents a novel generative framework that integrates topic modeling with word embeddings, addressing polysemy and enhancing downstream NLP tasks.

Findings

01

Outperforms existing word embedding methods in similarity and disambiguation tasks.

02

Produces more coherent and meaningful topics than previous models.

03

Can be integrated with deep contextualized embeddings for better downstream performance.

Abstract

We propose a novel generative model to explore both local and global context for joint learning topics and topic-specific word embeddings. In particular, we assume that global latent topics are shared across documents, a word is generated by a hidden semantic vector encoding its contextual semantic meaning, and its context words are generated conditional on both the hidden semantic vector and global latent topics. Topics are trained jointly with the word embeddings. The trained model maps words to topic-dependent embeddings, which naturally addresses the issue of word polysemy. Experimental results show that the proposed model outperforms the word-level embedding methods in both word similarity evaluation and word sense disambiguation. Furthermore, the model also extracts more coherent topics compared with existing neural topic models or other models for joint learning of topics and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

somethingx02/topical_wordvec_models
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.