Towards Automated Error Analysis: Learning to Characterize Errors
Tong Gao, Shivang Singh, Raymond J. Mooney

TL;DR
This paper introduces a meta learning approach that automatically learns interpretable rules to characterize and understand errors in NLP systems, leading to insights and modest performance improvements.
Contribution
It presents a novel method for automatically learning interpretable error characterization rules using meta features, applicable to different NLP systems.
Findings
Learned interpretable error rules for VilBERT and RoBERTa
Provided insights into systemic errors of NLP models
Achieved modest performance improvements using error insights
Abstract
Characterizing the patterns of errors that a system makes helps researchers focus future development on increasing its accuracy and robustness. We propose a novel form of "meta learning" that automatically learns interpretable rules that characterize the types of errors that a system makes, and demonstrate these rules' ability to help understand and improve two NLP systems. Our approach works by collecting error cases on validation data, extracting meta-features describing these samples, and finally learning rules that characterize errors using these features. We apply our approach to VilBERT, for Visual Question Answering, and RoBERTa, for Common Sense Question Answering. Our system learns interpretable rules that provide insights into systemic errors these systems make on the given tasks. Using these insights, we are also able to "close the loop" and modestly improve performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsAttention Is All You Need · Linear Layer · Dense Connections · Multi-Head Attention · Weight Decay · Softmax · Linear Warmup With Linear Decay · Attention Dropout · Residual Connection · Layer Normalization
