On Learning Prediction-Focused Mixtures

Abhishek Sharma; Catherine Zeng; Sanjana Narayanan; Sonali Parbhoo and; Finale Doshi-Velez

arXiv:2110.13221·cs.LG·October 29, 2021

On Learning Prediction-Focused Mixtures

Abhishek Sharma, Catherine Zeng, Sanjana Narayanan, Sonali Parbhoo and, Finale Doshi-Velez

PDF

Open Access

TL;DR

This paper introduces a prediction-focused approach for mixture models that automatically selects relevant features to improve prediction performance in low-capacity settings, enhancing interpretability and optimization.

Contribution

It proposes a novel prediction-focused modeling method for mixtures that identifies relevant input signals and outperforms traditional models in low-capacity scenarios.

Findings

01

Outperforms non-prediction-focused models in relevant signal detection

02

Eases optimization of mixture models with prediction focus

03

Provides theoretical characterization of when prediction-focused modeling is effective

Abstract

Probabilistic models help us encode latent structures that both model the data and are ideally also useful for specific downstream tasks. Among these, mixture models and their time-series counterparts, hidden Markov models, identify discrete components in the data. In this work, we focus on a constrained capacity setting, where we want to learn a model with relatively few components (e.g. for interpretability purposes). To maintain prediction performance, we introduce prediction-focused modeling for mixtures, which automatically selects the dimensions relevant to the prediction task. Our approach identifies relevant signal from the input, outperforms models that are not prediction-focused, and is easy to optimize; we also characterize when prediction-focused modeling can be expected to work.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Explainable Artificial Intelligence (XAI) · Machine Learning in Healthcare