Design and Scheduling of an AI-based Queueing System
Jiung Lee, Hongseok Namkoong, Yibo Zeng

TL;DR
This paper develops a queueing system model that integrates AI-based predictions to optimize scheduling, analyzing the effects of prediction errors on congestion and proposing near-optimal policies for AI-assisted triage systems.
Contribution
It introduces an index-based scheduling policy that effectively incorporates predictive class information and provides a model selection procedure focused on queueing performance.
Findings
Mispredictions significantly affect congestion costs in heavy traffic.
The proposed policy achieves near-optimal performance in simulated environments.
Application to content moderation demonstrates practical effectiveness.
Abstract
To leverage prediction models to make optimal scheduling decisions in service systems, we must understand how predictive errors impact congestion due to externalities on the delay of other jobs. Motivated by applications where prediction models interact with human servers (e.g., content moderation), we consider a large queueing system comprising of many single server queues where the class of a job is estimated using a prediction model. By characterizing the impact of mispredictions on congestion cost in heavy traffic, we design an index-based policy that incorporates the predicted class information in a near-optimal manner. Our theoretical results guide the design of predictive models by providing a simple model selection procedure with downstream queueing performance as a central concern, and offer novel insights on how to design queueing systems with AI-based triage. We illustrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms
Methodstravel james
