Towards Automated Error Analysis: Learning to Characterize Errors

Tong Gao; Shivang Singh; Raymond J. Mooney

arXiv:2201.05017·cs.CL·February 15, 2022

Towards Automated Error Analysis: Learning to Characterize Errors

Tong Gao, Shivang Singh, Raymond J. Mooney

PDF

Open Access

TL;DR

This paper introduces a meta learning approach that automatically learns interpretable rules to characterize and understand errors in NLP systems, leading to insights and modest performance improvements.

Contribution

It presents a novel method for automatically learning interpretable error characterization rules using meta features, applicable to different NLP systems.

Findings

01

Learned interpretable error rules for VilBERT and RoBERTa

02

Provided insights into systemic errors of NLP models

03

Achieved modest performance improvements using error insights

Abstract

Characterizing the patterns of errors that a system makes helps researchers focus future development on increasing its accuracy and robustness. We propose a novel form of "meta learning" that automatically learns interpretable rules that characterize the types of errors that a system makes, and demonstrate these rules' ability to help understand and improve two NLP systems. Our approach works by collecting error cases on validation data, extracting meta-features describing these samples, and finally learning rules that characterize errors using these features. We apply our approach to VilBERT, for Visual Question Answering, and RoBERTa, for Common Sense Question Answering. Our system learns interpretable rules that provide insights into systemic errors these systems make on the given tasks. Using these insights, we are also able to "close the loop" and modestly improve performance of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Multi-Head Attention · Weight Decay · Softmax · Linear Warmup With Linear Decay · Attention Dropout · Residual Connection · Layer Normalization