Human-Machine Collaborative Optimization via Apprenticeship Scheduling

Matthew Gombolay, Reed Jensen, Jessica Stigile, Toni Golen, Neel Shah, Sung-Hyun Son, and Julie Shah

arXiv:1805.04220·cs.AI·May 14, 2018

TL;DR

This paper introduces a model-free apprenticeship learning approach that captures human scheduling heuristics to improve optimization efficiency in complex, constrained scheduling problems, outperforming human experts and traditional methods.

Contribution

A novel pairwise ranking formulation for capturing human heuristics without enumerating large state spaces, enabling scalable human-machine collaborative optimization.
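The pairwise ranking idea can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes each candidate task is described by a small feature vector, turns every expert decision into difference vectors (chosen task minus each unchosen task, plus the mirrored negatives), and fits a plain logistic-regression classifier on those differences. The feature choice (deadline, duration), the synthetic "earliest deadline first" expert, the classifier, and all function names are hypothetical stand-ins chosen for the example.

```python
import math
import random


def pairwise_examples(candidates, chosen):
    """Turn one expert decision into labeled difference vectors.

    candidates: list of feature vectors, one per schedulable task.
    chosen: index of the task the expert scheduled at this decision point.
    """
    examples = []
    x_star = candidates[chosen]
    for j, x_j in enumerate(candidates):
        if j == chosen:
            continue
        diff = [a - b for a, b in zip(x_star, x_j)]
        examples.append((diff, 1))                # chosen ranked above task j
        examples.append(([-d for d in diff], 0))  # mirrored negative example
    return examples


def _sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clip to keep math.exp well-behaved
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(examples, epochs=200, lr=0.5, seed=0):
    """Fit weights by stochastic gradient ascent on the log-likelihood.

    Logistic regression is an assumption made for brevity; any binary
    classifier over difference vectors could be substituted.
    """
    rng = random.Random(seed)
    dim = len(examples[0][0])
    w = [0.0] * dim
    data = list(examples)
    for _ in range(epochs):
        rng.shuffle(data)
        for x, y in data:
            p = _sigmoid(sum(wk * xk for wk, xk in zip(w, x)))
            for k in range(dim):
                w[k] += lr * (y - p) * x[k]
    return w


def pick_task(w, candidates):
    """Return the index of the task that wins the most pairwise comparisons."""
    def prefers(x_i, x_j):
        return _sigmoid(sum(wk * (a - b) for wk, a, b in zip(w, x_i, x_j)))

    scores = [
        sum(prefers(x_i, x_j) for j, x_j in enumerate(candidates) if j != i)
        for i, x_i in enumerate(candidates)
    ]
    return max(range(len(candidates)), key=scores.__getitem__)


# Synthetic demonstrations: a hypothetical expert who always schedules the
# task with the earliest deadline. Features are [deadline, duration].
rng = random.Random(1)
training = []
for _ in range(60):
    tasks = [[rng.random(), rng.random()] for _ in range(4)]
    expert_choice = min(range(4), key=lambda i: tasks[i][0])
    training.extend(pairwise_examples(tasks, expert_choice))

weights = train_logistic(training)
```

Training only ever looks at pairs of tasks observed at the expert's decision points, which is why no enumeration of the scheduling state space is needed. At decision time, `pick_task(weights, tasks)` scores each candidate by its summed pairwise preference and returns the top-ranked one.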

Findings

01. Accurately learns heuristics on synthetic and real-world datasets.

02. Policies significantly improve search efficiency in optimization.

03. Outperforms human experts in solution quality and speed.

Abstract

Coordinating agents to complete a set of tasks with intercoupled temporal and resource constraints is computationally challenging, yet human domain experts can solve these difficult scheduling problems using paradigms learned through years of apprenticeship. A process for manually codifying this domain knowledge within a computational framework is necessary to scale beyond the "single-expert, single-trainee" apprenticeship model. However, human domain experts often have difficulty describing their decision-making processes, causing the codification of this knowledge to become laborious. We propose a new approach for capturing domain-expert heuristics through a pairwise ranking formulation. Our approach is model-free and does not require enumerating or iterating through a large state space. We empirically demonstrate that this approach accurately learns multifaceted heuristics on a…

Taxonomy

Topics: Reinforcement Learning in Robotics · Optimization and Search Problems · Constraint Satisfaction and Optimization