Graph Contrastive Topic Model

Zheheng Luo; Lei Liu; Qianqian Xie; Sophia Ananiadou

arXiv:2307.02078·cs.CL·July 6, 2023

Graph Contrastive Topic Model

Zheheng Luo, Lei Liu, Qianqian Xie, Sophia Ananiadou

PDF

Open Access 1 Repo

TL;DR

This paper introduces GCTM, a novel graph contrastive topic model that uses graph-based sampling to generate semantically relevant positive and negative samples, improving topic coherence and document representation.

Contribution

GCTM employs a graph contrastive learning framework with a new sampling strategy based on document-word bipartite graphs and word co-occurrence graphs, addressing sample bias in NTMs.

Findings

01

Outperforms existing methods on benchmark datasets

02

Enhances topic coherence and document representation

03

Effectively models semantic relevance among words

Abstract

Existing NTMs with contrastive learning suffer from the sample bias problem owing to the word frequency-based sampling strategy, which may result in false negative samples with similar semantics to the prototypes. In this paper, we aim to explore the efficient sampling strategy and contrastive learning in NTMs to address the aforementioned issue. We propose a new sampling assumption that negative samples should contain words that are semantically irrelevant to the prototype. Based on it, we propose the graph contrastive topic model (GCTM), which conducts graph contrastive learning (GCL) using informative positive and negative samples that are generated by the graph-based sampling strategy leveraging in-depth correlation and irrelevance among documents and words. In GCTM, we first model the input document as the document word bipartite graph (DWBG), and construct positive and negative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhehengluok/gctm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Advanced Graph Neural Networks

MethodsContrastive Learning