Text Classification: A Sequential Reading Approach

Gabriel Dulac-Arnold; Ludovic Denoyer; Patrick Gallinari

arXiv:1107.1322·cs.AI·March 19, 2015

Text Classification: A Sequential Reading Approach

Gabriel Dulac-Arnold, Ludovic Denoyer, Patrick Gallinari

PDF

TL;DR

This paper introduces a novel sequential reading approach to text classification, modeling it as a Markov Decision Process and using reinforcement learning to improve classification efficiency and accuracy, especially with limited training data.

Contribution

It presents a new reinforcement learning-based method that models text classification as a sequential decision process, allowing adaptive reading and decision-making.

Findings

01

Performs comparably to SVM on large datasets

02

Outperforms SVM on small datasets

03

Automatically adapts reading process based on training data quantity

Abstract

We propose to model the text classification process as a sequential decision process. In this process, an agent learns to classify documents into topics while reading the document sentences sequentially and learns to stop as soon as enough information was read for deciding. The proposed algorithm is based on a modelisation of Text Classification as a Markov Decision Process and learns by using Reinforcement Learning. Experiments on four different classical mono-label corpora show that the proposed approach performs comparably to classical SVM approaches for large training sets, and better for small training sets. In addition, the model automatically adapts its reading process to the quantity of training information provided.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.