Projective Latent Interventions for Understanding and Fine-tuning   Classifiers

Andreas Hinterreiter; Marc Streit; Bernhard Kainz

arXiv:2006.12902·cs.LG·September 8, 2021

Projective Latent Interventions for Understanding and Fine-tuning Classifiers

Andreas Hinterreiter, Marc Streit, Bernhard Kainz

PDF

1 Repo

TL;DR

This paper introduces Projective Latent Interventions (PLIs), a method that enables domain experts to intuitively modify and interpret high-dimensional neural network latent spaces by back-propagating manual adjustments in low-dimensional embeddings, improving classifier understanding and performance.

Contribution

The paper proposes PLIs, a novel technique for retraining classifiers through manual modifications of low-dimensional latent embeddings, enhancing interpretability and control in high-dimensional spaces.

Findings

01

PLIs allow intuitive control over latent decision spaces.

02

Manual separation of class clusters improves classification performance.

03

Demonstrated effectiveness on fetal ultrasound imaging data.

Abstract

High-dimensional latent representations learned by neural network classifiers are notoriously hard to interpret. Especially in medical applications, model developers and domain experts desire a better understanding of how these latent representations relate to the resulting classification performance. We present Projective Latent Interventions (PLIs), a technique for retraining classifiers by back-propagating manual changes made to low-dimensional embeddings of the latent space. The back-propagation is based on parametric approximations of t-distributed stochastic neighbourhood embeddings. PLIs allow domain experts to control the latent decision space in an intuitive way in order to better match their expectations. For instance, the performance for specific pairs of classes can be enhanced by manually separating the class clusters in the embedding. We evaluate our technique on a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

einbandi/latent-projective-interventions
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.