SHARCS: Shared Concept Space for Explainable Multimodal Learning
Gabriele Dominici, Pietro Barbiero, Lucie Charlotte Magister, Pietro, Li\`o, Nikola Simidjievski

TL;DR
SHARCS introduces a shared concept space for explainable multimodal learning, enabling interpretable cross-modal analysis, improved predictive performance, and effective handling of missing modalities across diverse data types.
Contribution
It proposes a novel unified concept manifold that maps heterogeneous modalities into an interpretable space, enhancing explainability and performance in multimodal tasks.
Findings
SHARCS improves interpretability of multimodal models.
It outperforms existing methods in missing modality retrieval.
The approach is model-agnostic and versatile across modalities.
Abstract
Multimodal learning is an essential paradigm for addressing complex real-world problems, where individual data modalities are typically insufficient to accurately solve a given modelling task. While various deep learning approaches have successfully addressed these challenges, their reasoning process is often opaque; limiting the capabilities for a principled explainable cross-modal analysis and any domain-expert intervention. In this paper, we introduce SHARCS (SHARed Concept Space) -- a novel concept-based approach for explainable multimodal learning. SHARCS learns and maps interpretable concepts from different heterogeneous modalities into a single unified concept-manifold, which leads to an intuitive projection of semantically similar cross-modal concepts. We demonstrate that such an approach can lead to inherently explainable task predictions while also improving downstream…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications
