End-to-End Multi-View Networks for Text Classification

Hongyu Guo; Colin Cherry; Jiang Su

arXiv:1704.05907·cs.CL·April 21, 2017·6 cites

End-to-End Multi-View Networks for Text Classification

Hongyu Guo, Colin Cherry, Jiang Su

PDF

Open Access

TL;DR

This paper introduces a multi-view network architecture for text classification that automatically generates diverse attention-based views of input text, leading to improved accuracy and robustness.

Contribution

It presents a novel multi-view architecture that stacks and concatenates attention-based views, achieving state-of-the-art results in text classification.

Findings

01

Achieved new state-of-the-art accuracy on benchmark datasets.

02

Demonstrated robustness through diverse attention views.

03

Enabled faster training convergence.

Abstract

We propose a multi-view network for text classification. Our method automatically creates various views of its input text, each taking the form of soft attention weights that distribute the classifier's focus among a set of base features. For a bag-of-words representation, each view focuses on a different subset of the text's words. Aggregating many such views results in a more discriminative and robust representation. Through a novel architecture that both stacks and concatenates views, we produce a network that emphasizes both depth and width, allowing training to converge quickly. Using our multi-view architecture, we establish new state-of-the-art accuracies on two benchmark tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Natural Language Processing Techniques