Interpretable and Personalized Apprenticeship Scheduling: Learning Interpretable Scheduling Policies from Heterogeneous User Demonstrations
Rohan Paleja, Andrew Silva, Letian Chen, Matthew Gombolay

TL;DR
This paper introduces a personalized, interpretable apprenticeship scheduling method that learns from heterogeneous human demonstrations to produce scalable, understandable scheduling policies, outperforming baselines and enhancing interpretability.
Contribution
The paper presents a novel personalized, interpretable apprenticeship learning framework that effectively captures diverse human strategies for resource scheduling.
Findings
Achieved 88.22% accuracy on real-world planning domain
Produced more interpretable models than neural networks ($p < 0.05$)
Near-perfect learning from synthetic demonstrations
Abstract
Resource scheduling and coordination is an NP-hard optimization requiring an efficient allocation of agents to a set of tasks with upper- and lower bound temporal and resource constraints. Due to the large-scale and dynamic nature of resource coordination in hospitals and factories, human domain experts manually plan and adjust schedules on the fly. To perform this job, domain experts leverage heterogeneous strategies and rules-of-thumb honed over years of apprenticeship. What is critically needed is the ability to extract this domain knowledge in a heterogeneous and interpretable apprenticeship learning framework to scale beyond the power of a single human expert, a necessity in safety-critical domains. We propose a personalized and interpretable apprenticeship scheduling algorithm that infers an interpretable representation of all human task demonstrators by extracting decision-making…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Machine Learning and Data Classification
