Effective Guidance for Model Attention with Simple Yes-no Annotations
Seongmin Lee, Ali Payani, Duen Horng Chau

TL;DR
CRAYON is a practical method that uses simple yes-no annotations to effectively guide and refine model attention, improving interpretability and performance across multiple datasets.
Contribution
CRAYON introduces a scalable approach to correct model attention using minimal annotations, combining classical and modern interpretation techniques.
Findings
CRAYON outperforms 12 methods on 3 benchmark datasets.
It effectively refines model attention with simple yes-no annotations.
CRAYON enhances interpretability and generalization of models.
Abstract
Modern deep learning models often make predictions by focusing on irrelevant areas, leading to biased performance and limited generalization. Existing methods aimed at rectifying model attention require explicit labels for irrelevant areas or complex pixel-wise ground truth attention maps. We present CRAYON (Correcting Reasoning with Annotations of Yes Or No), offering effective, scalable, and practical solutions to rectify model attention using simple yes-no annotations. CRAYON empowers classical and modern model interpretation techniques to identify and guide model reasoning: CRAYON-ATTENTION directs classic interpretations based on saliency maps to focus on relevant image regions, while CRAYON-PRUNING removes irrelevant neurons identified by modern concept-based methods to mitigate their influence. Through extensive experiments with both quantitative and human evaluation, we showcase…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Semantic Web and Ontologies · Scientific Computing and Data Management
MethodsSoftmax · Attention Is All You Need · Focus
