An Exploration of Dropout with RNNs for Natural Language Inference

Amit Gajbhiye; Sardar Jaf; Noura Al Moubayed; A. Stephen McGough,; Steven Bradley

arXiv:1810.08606·cs.CL·October 23, 2018

An Exploration of Dropout with RNNs for Natural Language Inference

Amit Gajbhiye, Sardar Jaf, Noura Al Moubayed, A. Stephen McGough,, Steven Bradley

PDF

TL;DR

This paper investigates the effects of applying dropout at different layers and rates in RNN models for Natural Language Inference, revealing layer-specific impacts on accuracy across datasets.

Contribution

It introduces a novel RNN model for NLI and provides empirical analysis of dropout effects at various layers and rates, which was previously underexplored.

Findings

01

Dropout at feed-forward connections significantly impacts accuracy.

02

Regularizing embedding layers benefits SNLI performance.

03

Regularizing recurrent layers improves SciTail accuracy.

Abstract

Dropout is a crucial regularization technique for the Recurrent Neural Network (RNN) models of Natural Language Inference (NLI). However, dropout has not been evaluated for the effectiveness at different layers and dropout rates in NLI models. In this paper, we propose a novel RNN model for NLI and empirically evaluate the effect of applying dropout at different layers in the model. We also investigate the impact of varying dropout rates at these layers. Our empirical evaluation on a large (Stanford Natural Language Inference (SNLI)) and a small (SciTail) dataset suggest that dropout at each feed-forward connection severely affects the model accuracy at increasing dropout rates. We also show that regularizing the embedding layer is efficient for SNLI whereas regularizing the recurrent layer improves the accuracy for SciTail. Our model achieved an accuracy 86.14% on the SNLI dataset and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDropout