Comparative Study of CNN and RNN for Natural Language Processing

Wenpeng Yin; Katharina Kann; Mo Yu; Hinrich Schütze

arXiv:1702.01923·cs.CL·February 8, 2017·897 cites

TL;DR

This paper systematically compares CNN and RNN architectures across various NLP tasks to guide the selection of the most suitable deep neural network model for specific applications.

Contribution

The paper presents what the authors describe as the first systematic comparison of CNN and RNN performance across a wide range of representative NLP tasks, offering basic guidance for model choice.

Findings

01 CNNs are expected to be good at extracting position-invariant features.

02 RNNs are expected to be good at modeling units in sequence.

03 Relative performance of the two architectures varies with the specific NLP task.

Abstract

Deep neural networks (DNN) have revolutionized the field of natural language processing (NLP). Convolutional neural network (CNN) and recurrent neural network (RNN), the two main types of DNN architectures, are widely explored to handle various NLP tasks. CNN is supposed to be good at extracting position-invariant features and RNN at modeling units in sequence. The state of the art on many NLP tasks often switches due to the battle between CNNs and RNNs. This work is the first systematic comparison of CNN and RNN on a wide range of representative NLP tasks, aiming to give basic guidance for DNN selection.
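The contrast the abstract draws between position-invariant features and sequence modeling can be illustrated with a toy sketch. This is not the authors' experimental setup: the scalar "embeddings", the two-tap kernel, the recurrence weights, and the function names `conv_max_pool` and `rnn_final_state` are all assumptions chosen for illustration.

```python
import math

def conv_max_pool(seq, kernel):
    """Slide `kernel` over `seq` (valid 1-D convolution), keep the max response."""
    k = len(kernel)
    responses = [
        sum(w * x for w, x in zip(kernel, seq[i:i + k]))
        for i in range(len(seq) - k + 1)
    ]
    return max(responses)

def rnn_final_state(seq, w_in=1.0, w_rec=0.5):
    """Scalar Elman-style recurrence: h_t = tanh(w_in * x_t + w_rec * h_{t-1})."""
    h = 0.0
    for x in seq:
        h = math.tanh(w_in * x + w_rec * h)
    return h

# Hypothetical inputs: the same two-token pattern (2.0, -1.0) placed
# at the start of one sequence and at the end of the other.
kernel = [1.0, -1.0]
early = [2.0, -1.0, 0.0, 0.0, 0.0]
late = [0.0, 0.0, 0.0, 2.0, -1.0]

# Max-over-time pooling reports the pattern regardless of where it occurs.
cnn_early = conv_max_pool(early, kernel)
cnn_late = conv_max_pool(late, kernel)

# The recurrent state depends on what follows the pattern, so position matters.
rnn_early = rnn_final_state(early)
rnn_late = rnn_final_state(late)

print("CNN feature:", cnn_early, cnn_late)
print("RNN state:  ", round(rnn_early, 3), round(rnn_late, 3))
```

In this sketch the pooled convolution feature is identical for both inputs, while the final recurrent state differs because the trailing zeros in `early` let the pattern's contribution decay. Real models use learned, multi-dimensional filters and gated recurrences, so the effect is less clear-cut than in this toy case.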

Taxonomy

Topics: Topic Modeling · Speech Recognition and Synthesis · Natural Language Processing Techniques