Evaluating Machine Learning Models with NERO: Non-Equivariance Revealed on Orbits
Zhuokai Zhao, Takumi Matsuzawa, William Irvine, Michael Maire, Gordon, L Kindlmann

TL;DR
NERO evaluation offers a novel, interactive approach to assess and visualize model equivariance and robustness across various tasks, moving beyond traditional scalar metrics to improve interpretability and model understanding.
Contribution
The paper introduces NERO, a new evaluation workflow with visualizations that reveal model equivariance and robustness, applicable across multiple research domains.
Findings
NERO effectively visualizes model equivariance properties.
NERO helps identify and interpret model weaknesses.
The approach is applicable to diverse tasks like recognition, detection, and 3D classification.
Abstract
Proper evaluations are crucial for better understanding, troubleshooting, interpreting model behaviors and further improving model performance. While using scalar-based error metrics provides a fast way to overview model performance, they are often too abstract to display certain weak spots and lack information regarding important model properties, such as robustness. This not only hinders machine learning models from being more interpretable and gaining trust, but also can be misleading to both model developers and users. Additionally, conventional evaluation procedures often leave researchers unclear about where and how model fails, which complicates model comparisons and further developments. To address these issues, we propose a novel evaluation workflow, named Non-Equivariance Revealed on Orbits (NERO) Evaluation. The goal of NERO evaluation is to turn focus from traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning
MethodsFocus
