TL;DR
This paper introduces AI-Interpret, a novel method that transforms opaque reinforcement learning policies into simple, interpretable decision rules, significantly enhancing human decision-making in various sequential problems.
Contribution
AI-Interpret combines imitation learning and program induction to generate human-understandable decision rules from complex policies, aiding the design of decision aids.
Findings
Flowcharts based on AI-Interpret improved human decision strategies.
The approach outperformed performance feedback in training effectiveness.
Ablation studies confirmed AI-Interpret's essential role in discovering interpretable rules.
Abstract
When making decisions, people often overlook critical information or are overly swayed by irrelevant information. A common approach to mitigate these biases is to provide decision-makers, especially professionals such as medical doctors, with decision aids, such as decision trees and flowcharts. Designing effective decision aids is a difficult problem. We propose that recently developed reinforcement learning methods for discovering clever heuristics for good decision-making can be partially leveraged to assist human experts in this design process. One of the biggest remaining obstacles to leveraging the aforementioned methods is that the policies they learn are opaque to people. To solve this problem, we introduce AI-Interpret: a general method for transforming idiosyncratic policies into simple and interpretable descriptions. Our algorithm combines recent advances in imitation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
