Integrating Learning from Examples into the Search for Diagnostic Policies
V. Bayer-Zubek, T. G. Dietterich

TL;DR
This paper presents a formal framework and new algorithms for learning cost-effective diagnostic policies from data, balancing test costs and misdiagnosis costs, and demonstrates their effectiveness on benchmark datasets.
Contribution
It introduces a Markov Decision Process formulation for diagnostic decision making and develops efficient systematic search algorithms with heuristics and regularization to learn policies from examples.
Findings
Systematic search algorithms outperform greedy methods in most cases.
The proposed algorithms are practical on standard desktop computers.
Regularization reduces overfitting in learned diagnostic policies.
Abstract
This paper studies the problem of learning diagnostic policies from training examples. A diagnostic policy is a complete description of the decision-making actions of a diagnostician (i.e., tests followed by a diagnostic decision) for all possible combinations of test results. An optimal diagnostic policy is one that minimizes the expected total cost, which is the sum of measurement costs and misdiagnosis costs. In most diagnostic settings, there is a tradeoff between these two kinds of costs. This paper formalizes diagnostic decision making as a Markov Decision Process (MDP). The paper introduces a new family of systematic search algorithms based on the AO* algorithm to solve this MDP. To make AO* efficient, the paper describes an admissible heuristic that enables AO* to prune large parts of the search space. The paper also introduces several greedy algorithms including some…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
