Azimuth: Systematic Error Analysis for Text Classification

Gabrielle Gauthier-Melan\c{c}on; Orlando Marquez Ayala; Lindsay Brin,; Chris Tyler; Fr\'ed\'eric Branchaud-Charron; Joseph Marinier; Karine Grande,; Di Le

arXiv:2212.08216·cs.LG·December 20, 2022·1 cites

Azimuth: Systematic Error Analysis for Text Classification

Gabrielle Gauthier-Melan\c{c}on, Orlando Marquez Ayala, Lindsay Brin,, Chris Tyler, Fr\'ed\'eric Branchaud-Charron, Joseph Marinier, Karine Grande,, Di Le

PDF

Open Access

TL;DR

Azimuth is an open-source tool that streamlines error analysis in text classification, helping practitioners identify and address model weaknesses to improve AI system reliability.

Contribution

It introduces a systematic approach and a comprehensive tool for error analysis in text classification, integrating multiple ML techniques in one platform.

Findings

01

Facilitates discovery of model weaknesses

02

Integrates diverse ML techniques for error analysis

03

Enhances reliability of text classification models

Abstract

We present Azimuth, an open-source and easy-to-use tool to perform error analysis for text classification. Compared to other stages of the ML development cycle, such as model training and hyper-parameter tuning, the process and tooling for the error analysis stage are less mature. However, this stage is critical for the development of reliable and trustworthy AI systems. To make error analysis more systematic, we propose an approach comprising dataset analysis and model quality assessment, which Azimuth facilitates. We aim to help AI practitioners discover and address areas where the model does not generalize by leveraging and integrating a range of ML techniques, such as saliency maps, similarity, uncertainty, and behavioral analyses, all in one tool. Our code and documentation are available at github.com/servicenow/azimuth.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification