Self-Supervised Learning for Neural Topic Models with Variance-Invariance-Covariance Regularization
Weiran Xu, Kengo Hirami, Koji Eguchi

TL;DR
This paper introduces a self-supervised neural topic model that leverages variance-invariance-covariance regularization and adversarial data augmentation to improve topic coherence and model performance, outperforming existing methods.
Contribution
The study presents a novel self-supervised neural topic model combining regularization techniques and adversarial augmentation, enhancing topic quality and outperforming state-of-the-art models.
Findings
Outperforms baseline models quantitatively.
Produces more coherent and meaningful topics.
Effective regularization improves latent representations.
Abstract
In our study, we propose a self-supervised neural topic model (NTM) that combines the power of NTMs and regularized self-supervised learning methods to improve performance. NTMs use neural networks to learn latent topics hidden behind the words in documents, enabling greater flexibility and the ability to estimate more coherent topics compared to traditional topic models. On the other hand, some self-supervised learning methods use a joint embedding architecture with two identical networks that produce similar representations for two augmented versions of the same input. Regularizations are applied to these representations to prevent collapse, which would otherwise result in the networks outputting constant or redundant representations for all inputs. Our model enhances topic quality by explicitly regularizing latent topic representations of anchor and positive samples. We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Expert finding and Q&A systems · Topic Modeling
