Intermediate Entity-based Sparse Interpretable Representation Learning

Diego Garcia-Olano; Yasumasa Onoe; Joydeep Ghosh; Byron C. Wallace

arXiv:2212.01641·cs.CL·December 6, 2022

Intermediate Entity-based Sparse Interpretable Representation Learning

Diego Garcia-Olano, Yasumasa Onoe, Joydeep Ghosh, Byron C. Wallace

PDF

Open Access 1 Repo

TL;DR

This paper introduces ItsIRL, a novel method for learning sparse, interpretable entity representations that enhance biomedical task performance while preserving interpretability and enabling model debugging through counterfactual manipulation.

Contribution

ItsIRL improves upon prior IERs by maintaining interpretability and boosting predictive accuracy, especially in biomedical applications, and introduces techniques for global semantic analysis and counterfactual entity manipulation.

Findings

01

Enhanced performance on biomedical tasks compared to previous IERs.

02

Maintains interpretability and supports counterfactual entity type manipulation.

03

Provides methods for global semantic property analysis of learned classes.

Abstract

Interpretable entity representations (IERs) are sparse embeddings that are "human-readable" in that dimensions correspond to fine-grained entity types and values are predicted probabilities that a given entity is of the corresponding type. These methods perform well in zero-shot and low supervision settings. Compared to standard dense neural embeddings, such interpretable representations may permit analysis and debugging. However, while fine-tuning sparse, interpretable representations improves accuracy on downstream tasks, it destroys the semantics of the dimensions which were enforced in pre-training. Can we maintain the interpretable semantics afforded by IERs while improving predictive performance on downstream tasks? Toward this end, we propose Intermediate enTity-based Sparse Interpretable Representation Learning (ItsIRL). ItsIRL realizes improved performance over prior IERs on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

diegoolano/itsirl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Explainable Artificial Intelligence (XAI) · Topic Modeling