TL;DR
iSEA is an interactive tool that automatically identifies and analyzes semantically-grounded subpopulations in NLP models to help developers understand and diagnose model errors more effectively.
Contribution
This paper introduces iSEA, a novel interactive pipeline that discovers and analyzes semantic subpopulations with high error rates in NLP models, supporting human-in-the-loop error diagnosis.
Findings
iSEA effectively identifies error-prone subpopulations in NLP models.
The tool enables validation and hypothesis testing of model errors.
Expert interviews confirm iSEA's usefulness in error analysis.
Abstract
Error analysis in NLP models is essential to successful model development and deployment. One common approach for diagnosing errors is to identify subpopulations in the dataset where the model produces the most errors. However, existing approaches typically define subpopulations based on pre-defined features, which requires users to form hypotheses of errors in advance. To complement these approaches, we propose iSEA, an Interactive Pipeline for Semantic Error Analysis in NLP Models, which automatically discovers semantically-grounded subpopulations with high error rates in the context of a human-in-the-loop interactive system. iSEA enables model developers to learn more about their model errors through discovered subpopulations, validate the sources of errors through interactive analysis on the discovered subpopulations, and test hypotheses about model errors by defining custom…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
