LLMs can Schedule

Henrik Abgaryan; Ararat Harutyunyan; Tristan Cazenave

arXiv:2408.06993·cs.AI·August 14, 2024·3 cites

LLMs can Schedule

Henrik Abgaryan, Ararat Harutyunyan, Tristan Cazenave

PDF

Open Access 1 Repo

TL;DR

This paper investigates the application of Large Language Models to the job shop scheduling problem, introducing a new dataset and methods that enable LLMs to perform competitively with specialized neural approaches.

Contribution

The paper presents the first supervised dataset for training LLMs on JSSP and proposes a sampling method to improve their scheduling performance.

Findings

01

LLMs can achieve performance comparable to other neural approaches in JSSP.

02

A new 120k supervised dataset for LLM training on scheduling tasks.

03

Sampling methods enhance LLM effectiveness in job scheduling.

Abstract

The job shop scheduling problem (JSSP) remains a significant hurdle in optimizing production processes. This challenge involves efficiently allocating jobs to a limited number of machines while minimizing factors like total processing time or job delays. While recent advancements in artificial intelligence have yielded promising solutions, such as reinforcement learning and graph neural networks, this paper explores the potential of Large Language Models (LLMs) for JSSP. We introduce the very first supervised 120k dataset specifically designed to train LLMs for JSSP. Surprisingly, our findings demonstrate that LLM-based scheduling can achieve performance comparable to other neural approaches. Furthermore, we propose a sampling method that enhances the effectiveness of LLMs in tackling JSSP.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

starjob42/datasetjsp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenetics, Bioinformatics, and Biomedical Research