Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing
Kevin Sim, Quentin Renau, Emma Hart

TL;DR
This paper conducts the first comprehensive benchmarking of LLM-evolved heuristics for bin packing, revealing limited generalization and emphasizing the importance of thorough evaluation over narrow testing.
Contribution
It provides a rigorous benchmarking framework for LLM-generated heuristics in bin packing, comparing them to existing methods across diverse instances and analyzing their performance in the feature space.
Findings
Most LLM heuristics do not generalize well across benchmarks.
Existing simple heuristics outperform LLM heuristics on broad tests.
Specialist heuristics may not justify their high generation costs.
Abstract
Coupling Large Language Models (LLMs) with Evolutionary Algorithms has recently shown significant promise as a technique to design new heuristics that outperform existing methods, particularly in the field of combinatorial optimisation. An escalating arms race is both rapidly producing new heuristics and improving the efficiency of the processes evolving them. However, driven by the desire to quickly demonstrate the superiority of new approaches, evaluation of the new heuristics produced for a specific domain is often cursory: testing on very few datasets in which instances all belong to a specific class from the domain, and on few instances per class. Taking bin-packing as an example, to the best of our knowledge we conduct the first rigorous benchmarking study of new LLM-generated heuristics, comparing them to well-known existing heuristics across a large suite of benchmark instances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Packing Problems · Advanced Manufacturing and Logistics Optimization · Manufacturing Process and Optimization
