Towards Robust and Reliable Concept Representations:   Reliability-Enhanced Concept Embedding Model

Yuxuan Cai; Xiyu Wang; Satoshi Tsutsui; Winnie Pang; Bihan Wen

arXiv:2502.01191·cs.CV·February 4, 2025

Towards Robust and Reliable Concept Representations: Reliability-Enhanced Concept Embedding Model

Yuxuan Cai, Xiyu Wang, Satoshi Tsutsui, Winnie Pang, Bihan Wen

PDF

Open Access

TL;DR

This paper introduces RECEM, a novel model that improves the reliability and robustness of concept representations in concept bottleneck models by disentangling irrelevant features and aligning semantics across samples.

Contribution

RECEM is the first to combine concept-level disentanglement with a concept mixup mechanism to enhance concept reliability and robustness in CBMs.

Findings

01

RECEM outperforms baselines on multiple datasets.

02

It maintains performance under background and domain shifts.

03

Demonstrates improved interpretability and stability.

Abstract

Concept Bottleneck Models (CBMs) aim to enhance interpretability by predicting human-understandable concepts as intermediates for decision-making. However, these models often face challenges in ensuring reliable concept representations, which can propagate to downstream tasks and undermine robustness, especially under distribution shifts. Two inherent issues contribute to concept unreliability: sensitivity to concept-irrelevant features (e.g., background variations) and lack of semantic consistency for the same concept across different samples. To address these limitations, we propose the Reliability-Enhanced Concept Embedding Model (RECEM), which introduces a two-fold strategy: Concept-Level Disentanglement to separate irrelevant features from concept-relevant information and a Concept Mixup mechanism to ensure semantic alignment across samples. These mechanisms work together to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Data Stream Mining Techniques