Contrastive Learning for Neural Topic Model

Thong Nguyen; Anh Tuan Luu

arXiv:2110.12764·cs.CL·October 26, 2021

Contrastive Learning for Neural Topic Model

Thong Nguyen, Anh Tuan Luu

PDF

Open Access 2 Repos 1 Datasets 1 Video

TL;DR

This paper introduces a novel neural topic modeling approach that incorporates relations among similar documents and external information, improving topic coherence across diverse datasets.

Contribution

It reformulates adversarial neural topic models as an optimization problem and designs a new sampling method to better integrate external variables and document relations.

Findings

01

Outperforms state-of-the-art models on benchmark datasets

02

Achieves higher topic coherence scores

03

Effectively incorporates external information into topic modeling

Abstract

Recent empirical studies show that adversarial topic models (ATM) can successfully capture semantic patterns of the document by differentiating a document with another dissimilar sample. However, utilizing that discriminative-generative architecture has two important drawbacks: (1) the architecture does not relate similar documents, which has the same document-word distribution of salient words; (2) it restricts the ability to integrate external information, such as sentiments of the document, which has been shown to benefit the training of neural topic model. To address those issues, we revisit the adversarial topic architecture in the viewpoint of mathematical analysis, propose a novel approach to re-formulate discriminative goal as an optimization problem, and design a novel sampling method which facilitates the integration of external variables. The reformulation encourages the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Kylan12/Synthetic-AI-ML-Dataset
dataset· 42 dl
42 dl

Videos

Contrastive Learning for Neural Topic Model· slideslive

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods · Generative Adversarial Networks and Image Synthesis