Self-Supervised Learning for Neural Topic Models with   Variance-Invariance-Covariance Regularization

Weiran Xu; Kengo Hirami; Koji Eguchi

arXiv:2502.09944·cs.LG·February 27, 2025

Self-Supervised Learning for Neural Topic Models with Variance-Invariance-Covariance Regularization

Weiran Xu, Kengo Hirami, Koji Eguchi

PDF

Open Access

TL;DR

This paper introduces a self-supervised neural topic model that leverages variance-invariance-covariance regularization and adversarial data augmentation to improve topic coherence and model performance, outperforming existing methods.

Contribution

The study presents a novel self-supervised neural topic model combining regularization techniques and adversarial augmentation, enhancing topic quality and outperforming state-of-the-art models.

Findings

01

Outperforms baseline models quantitatively.

02

Produces more coherent and meaningful topics.

03

Effective regularization improves latent representations.

Abstract

In our study, we propose a self-supervised neural topic model (NTM) that combines the power of NTMs and regularized self-supervised learning methods to improve performance. NTMs use neural networks to learn latent topics hidden behind the words in documents, enabling greater flexibility and the ability to estimate more coherent topics compared to traditional topic models. On the other hand, some self-supervised learning methods use a joint embedding architecture with two identical networks that produce similar representations for two augmented versions of the same input. Regularizations are applied to these representations to prevent collapse, which would otherwise result in the networks outputting constant or redundant representations for all inputs. Our model enhances topic quality by explicitly regularizing latent topic representations of anchor and positive samples. We also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Expert finding and Q&A systems · Topic Modeling