Regularizing Recurrent Neural Networks via Sequence Mixup

Armin Karamzade; Amir Najafi; Seyed Abolfazl Motahari

arXiv:2012.07527·cs.CL·December 15, 2020

Regularizing Recurrent Neural Networks via Sequence Mixup

Armin Karamzade, Amir Najafi, Seyed Abolfazl Motahari

PDF

Open Access

TL;DR

This paper adapts mixup regularization techniques from feed-forward neural networks to RNNs, improving performance on sequence tasks with minimal added complexity.

Contribution

It introduces sequence mixup methods for RNNs, providing an easy-to-implement regularization approach validated through experiments and theoretical analysis.

Findings

01

Improved F-1 score on NER task

02

Reduced loss in RNN training

03

Validated with real-world datasets

Abstract

In this paper, we extend a class of celebrated regularization techniques originally proposed for feed-forward neural networks, namely Input Mixup (Zhang et al., 2017) and Manifold Mixup (Verma et al., 2018), to the realm of Recurrent Neural Networks (RNN). Our proposed methods are easy to implement and have a low computational complexity, while leverage the performance of simple neural architectures in a variety of tasks. We have validated our claims through several experiments on real-world datasets, and also provide an asymptotic theoretical analysis to further investigate the properties and potential impacts of our proposed techniques. Applying sequence mixup to BiLSTM-CRF model (Huang et al., 2015) to Named Entity Recognition task on CoNLL-2003 data (Sang and De Meulder, 2003) has improved the F-1 score on the test stage and reduced the loss, considerably.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Topic Modeling · Generative Adversarial Networks and Image Synthesis

MethodsManifold Mixup · Mixup