Template-as-Ontology: Configurable Synthetic Data Infrastructure for Cross-Domain Manufacturing AI Validation
Grama Chethan

TL;DR
This paper presents a unified, schema-driven synthetic data infrastructure using a Python configuration module that guarantees alignment between manufacturing simulation and AI validation tools, enabling cross-domain manufacturing AI validation.
Contribution
It introduces the Template-as-Ontology principle, a single configuration schema serving as both simulator specification and domain schema, ensuring structural alignment and data consistency.
Findings
Validated across six industry templates with consistent KPI ranges.
Demonstrated elimination of tool-parameter hallucination through ontology constraints.
Confirmed parametric controllability and structural alignment in manufacturing data simulation.
Abstract
LLarge language model (LLM)-based AI agents deployed in manufacturing environments require populated, schema-correct data for validation, yet production MES data is proprietary, privacy-encumbered, and vendor-specific. This paper introduces the Template-as-Ontology principle: a single Python configuration module (700-770 lines, 45 validated exports) serves simultaneously as the specification for a time-stepped manufacturing simulator and as the runtime domain schema for AI analytics tools, producing alignment by construction rather than integration. We formally define the domain template as a typed relational configuration schema and prove that structural alignment between simulation and tool layers is guaranteed by single-source consumption. A five-layer pipeline--simulation, PostgreSQL, CDC/Iceberg lakehouse, star schema, and 12 parameterized AI tools--generates causally coherent,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
