Automatic Concept Embedding Model (ACEM): No train-time concepts, No   issue!

Rishabh Jain

arXiv:2309.03970·cs.LG·September 11, 2023

Automatic Concept Embedding Model (ACEM): No train-time concepts, No issue!

Rishabh Jain

PDF

Open Access

TL;DR

ACEM introduces an automatic approach to learn concept embeddings in neural networks, eliminating the need for manual concept annotations and enhancing interpretability without sacrificing performance.

Contribution

This paper presents ACEM, a novel concept embedding model that automatically learns concept annotations, reducing annotation costs and improving explainability in neural networks.

Findings

01

ACEM effectively learns concepts without manual annotations.

02

ACEM maintains high performance while improving interpretability.

03

The approach reduces the need for costly data labeling.

Abstract

Interpretability and explainability of neural networks is continuously increasing in importance, especially within safety-critical domains and to provide the social right to explanation. Concept based explanations align well with how humans reason, proving to be a good way to explain models. Concept Embedding Models (CEMs) are one such concept based explanation architectures. These have shown to overcome the trade-off between explainability and performance. However, they have a key limitation -- they require concept annotations for all their training data. For large datasets, this can be expensive and infeasible. Motivated by this, we propose Automatic Concept Embedding Models (ACEMs), which learn the concept annotations automatically.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Data Stream Mining Techniques · Topic Modeling

MethodsALIGN