Knowledge Elicitation via Sequential Probabilistic Inference for High-Dimensional Prediction

Pedram Daee; Tomi Peltola; Marta Soare; Samuel Kaski

arXiv:1612.03328 · cs.AI · July 14, 2017

TL;DR

This paper introduces a sequential probabilistic inference method to efficiently incorporate expert knowledge into high-dimensional prediction models, significantly improving accuracy in small-sample scenarios.

Contribution

The paper proposes an algorithm for knowledge elicitation in sparse linear regression that chooses which expert queries to ask, so that a small number of interactions improves prediction accuracy.

Findings

01 Improved prediction accuracy with minimal expert effort.

02 Effective identification of the most informative features for querying.

03 Validated on both simulated and real user data.

Abstract

Prediction in a small-sized sample with a large number of covariates, the "small n, large p" problem, is challenging. This setting is encountered in multiple applications, such as precision medicine, where obtaining additional samples can be extremely costly or even impossible, and extensive research effort has recently been dedicated to finding principled solutions for accurate prediction. However, a valuable source of additional information, domain experts, has not yet been efficiently exploited. We formulate knowledge elicitation generally as a probabilistic inference process, where expert knowledge is sequentially queried to improve predictions. In the specific case of sparse linear regression, where we assume the expert has knowledge about the values of the regression coefficients or about the relevance of the features, we propose an algorithm and computational approximation for…
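To make the idea of sequentially querying an expert concrete, here is a minimal Python sketch. It is not the authors' algorithm: it swaps the sparse regression model the abstract refers to for a plain Gaussian prior, so the posterior has a closed form. It also assumes the expert reports a noisy value of one regression coefficient per query. All function names and parameters (`posterior`, `next_query`, `elicit`, `fb_var`, and so on) are hypothetical and chosen only for illustration.

```python
import numpy as np

def posterior(X, y, noise_var, prior_var, fb_idx, fb_val, fb_var):
    """Gaussian posterior over weights given data and expert feedback.

    Assumed model (a simplification, not the paper's):
      w ~ N(0, prior_var * I), y ~ N(X w, noise_var * I),
      feedback f_j ~ N(w_j, fb_var) for each queried feature j.
    """
    p = X.shape[1]
    precision = np.eye(p) / prior_var + X.T @ X / noise_var
    rhs = X.T @ y / noise_var
    # Each answer acts like one extra noisy observation of w_j.
    for j, f in zip(fb_idx, fb_val):
        precision[j, j] += 1.0 / fb_var
        rhs[j] += f / fb_var
    cov = np.linalg.inv(precision)
    return cov @ rhs, cov

def next_query(cov, fb_var, asked):
    """Pick the unasked feature with the largest expected information gain.

    For this Gaussian model, the gain from a noisy observation of w_j
    is 0.5 * log(1 + Var[w_j] / fb_var).
    """
    gain = 0.5 * np.log1p(np.diag(cov) / fb_var)
    for j in asked:
        gain[j] = -np.inf  # do not ask about the same feature twice
    return int(np.argmax(gain))

def elicit(X, y, expert, n_queries, noise_var=1.0, prior_var=1.0, fb_var=0.01):
    """Alternate between choosing a query and updating the posterior."""
    fb_idx, fb_val = [], []
    for _ in range(n_queries):
        _, cov = posterior(X, y, noise_var, prior_var, fb_idx, fb_val, fb_var)
        j = next_query(cov, fb_var, fb_idx)
        fb_idx.append(j)
        fb_val.append(expert(j))  # expert is any callable: feature index -> value
    mean, cov = posterior(X, y, noise_var, prior_var, fb_idx, fb_val, fb_var)
    return mean, cov, fb_idx
```

In this sketch, `expert` could be a simulated oracle such as `lambda j: w_true[j]` when testing on synthetic data. Under the Gaussian assumption, the selection rule reduces to asking about the coefficient with the largest posterior variance. A sparsity-promoting prior would generally need an approximate posterior instead of the closed form used here.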

Code & Models

Repositories

HIIT/knowledge-elicitation-for-linear-regression
Official
