Efficient Character-level Document Classification by Combining   Convolution and Recurrent Layers

Yijun Xiao; Kyunghyun Cho

arXiv:1602.00367·cs.CL·February 2, 2016·172 cites

Efficient Character-level Document Classification by Combining Convolution and Recurrent Layers

Yijun Xiao, Kyunghyun Cho

PDF

Open Access

TL;DR

This paper introduces a neural network combining convolutional and recurrent layers for character-level document classification, achieving similar accuracy to convolution-only models but with fewer parameters.

Contribution

The novel architecture efficiently encodes character inputs by integrating convolution and recurrent layers, reducing model complexity while maintaining performance.

Findings

01

Achieves comparable accuracy to convolution-only models

02

Uses significantly fewer parameters

03

Validated on eight large-scale datasets

Abstract

Document classification tasks were primarily tackled at word level. Recent research that works with character-level inputs shows several benefits over word-level approaches such as natural incorporation of morphemes and better handling of rare words. We propose a neural network architecture that utilizes both convolution and recurrent layers to efficiently encode character inputs. We validate the proposed model on eight large scale document classification tasks and compare with character-level convolution-only models. It achieves comparable performances with much less parameters.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Natural Language Processing Techniques

MethodsConvolution