Few-shot Learning for Topic Modeling

Tomoharu Iwata

arXiv:2104.09011·cs.CL·April 20, 2021

Few-shot Learning for Topic Modeling

Tomoharu Iwata

PDF

Open Access

TL;DR

This paper introduces a neural network-based few-shot learning approach for topic modeling, enabling effective topic extraction from very limited documents by integrating EM algorithm differentiability and episodic training.

Contribution

It presents a novel neural network method that learns topic model priors from few documents, improving upon traditional models that require many documents for training.

Findings

01

Achieves better perplexity than existing methods on real-world datasets

02

Effectively learns from just a few documents using episodic training

03

Integrates EM algorithm into neural network training for topic modeling

Abstract

Topic models have been successfully used for analyzing text documents. However, with existing topic models, many documents are required for training. In this paper, we propose a neural network-based few-shot learning method that can learn a topic model from just a few documents. The neural networks in our model take a small number of documents as inputs, and output topic model priors. The proposed method trains the neural networks such that the expected test likelihood is improved when topic model parameters are estimated by maximizing the posterior probability using the priors based on the EM algorithm. Since each step in the EM algorithm is differentiable, the proposed method can backpropagate the loss through the EM algorithm to train the neural networks. The expected test likelihood is maximized by a stochastic gradient descent method using a set of multiple text corpora with an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Text and Document Classification Technologies