Graph Contrastive Topic Model
Zheheng Luo, Lei Liu, Qianqian Xie, Sophia Ananiadou

TL;DR
This paper introduces GCTM, a novel graph contrastive topic model that uses graph-based sampling to generate semantically relevant positive and negative samples, improving topic coherence and document representation.
Contribution
GCTM employs a graph contrastive learning framework with a new sampling strategy based on document-word bipartite graphs and word co-occurrence graphs, addressing sample bias in NTMs.
Findings
Outperforms existing methods on benchmark datasets
Enhances topic coherence and document representation
Effectively models semantic relevance among words
Abstract
Existing NTMs with contrastive learning suffer from the sample bias problem owing to the word frequency-based sampling strategy, which may result in false negative samples with similar semantics to the prototypes. In this paper, we aim to explore the efficient sampling strategy and contrastive learning in NTMs to address the aforementioned issue. We propose a new sampling assumption that negative samples should contain words that are semantically irrelevant to the prototype. Based on it, we propose the graph contrastive topic model (GCTM), which conducts graph contrastive learning (GCL) using informative positive and negative samples that are generated by the graph-based sampling strategy leveraging in-depth correlation and irrelevance among documents and words. In GCTM, we first model the input document as the document word bipartite graph (DWBG), and construct positive and negative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text and Document Classification Technologies · Advanced Graph Neural Networks
MethodsContrastive Learning
