Effects of padding on LSTMs and CNNs

Mahidhar Dwarampudi; N V Subba Reddy

arXiv:1903.07288·cs.LG·March 19, 2019·19 cites

Effects of padding on LSTMs and CNNs

Mahidhar Dwarampudi, N V Subba Reddy

PDF

Open Access

TL;DR

This paper investigates how different padding strategies affect the performance of LSTM and CNN models in sentiment analysis, providing guidelines for optimal padding techniques.

Contribution

It systematically compares padding methods for LSTMs and CNNs using a sentiment analysis task, offering practical recommendations for preprocessing.

Findings

01

Padding significantly impacts model accuracy and performance.

02

Optimal padding strategies vary between LSTMs and CNNs.

03

Preprocessing techniques can enhance model effectiveness.

Abstract

Long Short-Term Memory (LSTM) Networks and Convolutional Neural Networks (CNN) have become very common and are used in many fields as they were effective in solving many problems where the general neural networks were inefficient. They were applied to various problems mostly related to images and sequences. Since LSTMs and CNNs take inputs of the same length and dimension, input images and sequences are padded to maximum length while testing and training. This padding can affect the way the networks function and can make a great deal when it comes to performance and accuracies. This paper studies this and suggests the best way to pad an input sequence. This paper uses a simple sentiment analysis task for this purpose. We use the same dataset on both the networks with various padding to show the difference. This paper also discusses some preprocessing techniques done on the data to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques