Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction
Brian Freeman, Adam Kicklighter, Matt Erdman, Zach Gordon

TL;DR
This paper compares five prompt engineering strategies to reduce hallucinations in industrial large language models, aiming for more consistent and reliable outputs without altering model weights.
Contribution
It introduces and evaluates five novel prompt engineering methods for enhancing the factual consistency of LLM outputs in industrial applications.
Findings
Enhanced Data Registry (M4) achieved 100% positive verdicts in tests.
Method M2 improved from 34% to 80% accuracy after revisions.
M3 and M5 reached 80% and 77% success rates, respectively.
Abstract
Hallucinations in large language models (LLMs) are outputs that are syntactically coherent but factually incorrect or contextually inconsistent. They are persistent obstacles in high-stakes industrial settings such as engineering design, enterprise resource planning, and IoT telemetry platforms. We present and compare five prompt engineering strategies intended to reduce the variance of model outputs and move toward repeatable, grounded results without modifying model weights or creating complex validation models. These methods include: (M1) Iterative Similarity Convergence, (M2) Decomposed Model-Agnostic Prompting, (M3) Single-Task Agent Specialization, (M4) Enhanced Data Registry, and (M5) Domain Glossary Injection. Each method is evaluated against an internal baseline using an LLM-as-Judge framework over 100 repeated runs per method (same fixed task prompt, stochastic decoding at tau…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
