SpanDrop: Simple and Effective Counterfactual Learning for Long   Sequences

Peng Qi; Guangtao Wang; Jing Huang

arXiv:2208.02169·cs.LG·August 4, 2022

SpanDrop: Simple and Effective Counterfactual Learning for Long Sequences

Peng Qi, Guangtao Wang, Jing Huang

PDF

Open Access

TL;DR

SpanDrop is a data augmentation method that improves model performance on long sequence tasks by identifying true supervision signals through sequence ablation, especially effective with limited data.

Contribution

The paper introduces SpanDrop, a novel simple augmentation technique for counterfactual learning in long sequences, with a beta-Bernoulli variant for diverse augmentation.

Findings

01

Improves performance on long sequence reasoning tasks.

02

Effective with both scarce and abundant data.

03

Enhances model understanding of true supervision signals.

Abstract

Distilling supervision signal from a long sequence to make predictions is a challenging task in machine learning, especially when not all elements in the input sequence contribute equally to the desired output. In this paper, we propose SpanDrop, a simple and effective data augmentation technique that helps models identify the true supervision signal in a long sequence with very few examples. By directly manipulating the input sequence, SpanDrop randomly ablates parts of the sequence at a time and ask the model to perform the same task to emulate counterfactual learning and achieve input attribution. Based on theoretical analysis of its properties, we also propose a variant of SpanDrop based on the beta-Bernoulli distribution, which yields diverse augmented sequences while providing a learning objective that is more consistent with the original dataset. We demonstrate the effectiveness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)