Learning to Learn to be Right for the Right Reasons

Pride Kavumba; Benjamin Heinzerling; Ana Brassard; Kentaro Inui

arXiv:2104.11514·cs.CL·April 26, 2021·1 cites

Learning to Learn to be Right for the Right Reasons

Pride Kavumba, Benjamin Heinzerling, Ana Brassard, Kentaro Inui

PDF

Open Access

TL;DR

This paper introduces a meta-learning approach to improve model generalization in commonsense reasoning by performing well on both easy and hard test sets, addressing superficial cue overfitting.

Contribution

It proposes a novel meta-learning method that explicitly trains models to perform well on both superficial and non-superficial test instances, enhancing robustness.

Findings

01

Up to 16.5 percentage points improvement over baseline.

02

Effective on COPA and Commonsense Explanation datasets.

03

Balances performance on easy and hard test sets.

Abstract

Improving model generalization on held-out data is one of the core objectives in commonsense reasoning. Recent work has shown that models trained on the dataset with superficial cues tend to perform well on the easy test set with superficial cues but perform poorly on the hard test set without superficial cues. Previous approaches have resorted to manual methods of encouraging models not to overfit to superficial cues. While some of the methods have improved performance on hard instances, they also lead to degraded performance on easy instances. Here, we propose to explicitly learn a model that does well on both the easy test set with superficial cues and hard test set without superficial cues. Using a meta-learning objective, we learn such a model that improves performance on both the easy test set and the hard test set. By evaluating our models on Choice of Plausible Alternatives…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Bayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI)