Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
Lyle Regenwetter, Akash Srivastava, Dan Gutfreund, Faez Ahmed

TL;DR
This paper reviews and proposes evaluation metrics tailored for deep generative models in engineering design, emphasizing the importance of design-specific criteria over traditional statistical metrics, and demonstrates their application through case studies and real-world examples.
Contribution
It introduces a set of design-specific evaluation metrics for deep generative models, addressing limitations of traditional metrics in engineering applications.
Findings
Traditional statistical metrics often fail to capture design requirements.
Proposed metrics effectively evaluate constraint satisfaction, novelty, and performance.
Application to bicycle and topology design problems demonstrates practical utility.
Abstract
Deep generative models such as Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), Diffusion Models, and Transformers, have shown great promise in a variety of applications, including image and speech synthesis, natural language processing, and drug discovery. However, when applied to engineering design problems, evaluating the performance of these models can be challenging, as traditional statistical metrics based on likelihood may not fully capture the requirements of engineering applications. This paper doubles as a review and practical guide to evaluation metrics for deep generative models (DGMs) in engineering design. We first summarize the well-accepted `classic' evaluation metrics for deep generative models grounded in machine learning theory. Using case studies, we then highlight why these metrics seldom translate well to design problems but see frequent use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBIM and Construction Integration · Design Education and Practice
MethodsDiffusion
