Neural Topic Modeling with Continual Lifelong Learning
Pankaj Gupta, Yatin Chaudhary, Thomas Runkler, Hinrich, Sch\"utze

TL;DR
This paper introduces a lifelong neural topic modeling framework that continuously learns from document streams, transferring knowledge to improve topic coherence and retrieval in sparse data scenarios.
Contribution
It proposes a novel lifelong learning approach for neural topic modeling that mitigates data sparsity issues through knowledge transfer and prevents catastrophic forgetting.
Findings
Improved perplexity scores on sparse document collections.
Enhanced topic coherence and quality.
Better performance in information retrieval tasks.
Abstract
Lifelong learning has recently attracted attention in building machine learning systems that continually accumulate and transfer knowledge to help future learning. Unsupervised topic modeling has been popularly used to discover topics from document collections. However, the application of topic modeling is challenging due to data sparsity, e.g., in a small collection of (short) documents and thus, generate incoherent topics and sub-optimal document representations. To address the problem, we propose a lifelong learning framework for neural topic modeling that can continuously process streams of document collections, accumulate topics and guide future topic modeling tasks by knowledge transfer from several sources to better deal with the sparse data. In the lifelong process, we particularly investigate jointly: (1) sharing generative homologies (latent topics) over lifetime to transfer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
