BikeBench: A Bicycle Design Benchmark for Generative Models with Objectives and Constraints
Lyle Regenwetter, Yazan Abu Obaideh, Fabien Chiotti, Ioanna Lykourentzou, Faez Ahmed

TL;DR
BikeBench is a comprehensive benchmark for evaluating AI models' ability to generate bicycle designs that meet multiple real-world objectives and constraints, advancing AI capabilities in engineering design.
Contribution
It introduces BikeBench, a novel benchmark with datasets and evaluation metrics for assessing generative models on multi-objective bicycle design tasks.
Findings
LLMs and tabular models underperform compared to hybrid algorithms.
Hybrid GenAI+optimization approaches achieve better design quality.
The benchmark facilitates systematic evaluation of AI in constrained engineering design.
Abstract
We introduce BikeBench, an engineering design benchmark for evaluating generative models on problems with multiple real-world objectives and constraints. As generative AI's reach continues to grow, evaluating its capability to understand physical laws, human guidelines, and hard constraints grows increasingly important. Engineering product design lies at the intersection of these difficult tasks, providing new challenges for AI capabilities. BikeBench evaluates AI models' capabilities to generate bicycle designs that not only resemble the dataset, but meet specific performance objectives and constraints. To do so, BikeBench quantifies a variety of human-centered and multiphysics performance characteristics, such as aerodynamics, ergonomics, structural mechanics, human-rated usability, and similarity to subjective text or image prompts. Supporting the benchmark are several datasets of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Multi-Objective Optimization Algorithms · 3D Shape Modeling and Analysis · Model Reduction and Neural Networks
