Scene Graph-Guided Generative AI Framework for Synthesizing and Evaluating Industrial Hazard Scenarios

Sanjay Acharjee; Abir Khan Ratul; Diego Patino; Md Nazmus Sakib

arXiv:2511.13970·cs.AI·November 19, 2025

Scene Graph-Guided Generative AI Framework for Synthesizing and Evaluating Industrial Hazard Scenarios

Sanjay Acharjee, Abir Khan Ratul, Diego Patino, Md Nazmus Sakib

PDF

Open Access

TL;DR

This paper introduces a scene graph-guided generative AI framework that synthesizes realistic hazardous workplace images from OSHA reports, aiding hazard detection model training and evaluation.

Contribution

It presents a novel method combining scene graphs and diffusion models to generate and evaluate hazardous scenarios grounded in safety reports.

Findings

01

VQA Graph Score outperforms CLIP and BLIP metrics

02

Generated hazard scenes are realistic and semantically accurate

03

Framework improves hazard dataset diversity and quality

Abstract

Training vision models to detect workplace hazards accurately requires realistic images of unsafe conditions that could lead to accidents. However, acquiring such datasets is difficult because capturing accident-triggering scenarios as they occur is nearly impossible. To overcome this limitation, this study presents a novel scene graph-guided generative AI framework that synthesizes photorealistic images of hazardous scenarios grounded in historical Occupational Safety and Health Administration (OSHA) accident reports. OSHA narratives are analyzed using GPT-4o to extract structured hazard reasoning, which is converted into object-level scene graphs capturing spatial and contextual relationships essential for understanding risk. These graphs guide a text-to-image diffusion model to generate compositionally accurate hazard scenes. To evaluate the realism and semantic fidelity of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Topic Modeling