Reducing Hallucinations in LLM-Generated Code via Semantic Triangulation

Yihan Dai; Sijie Liang; Haotian Xu; Peichu Xie; Sergey Mechtaev

arXiv:2511.12288·cs.SE·March 31, 2026

Reducing Hallucinations in LLM-Generated Code via Semantic Triangulation

Yihan Dai, Sijie Liang, Haotian Xu, Peichu Xie, Sergey Mechtaev

PDF

TL;DR

This paper proposes semantic triangulation, a framework that reduces hallucinations in LLM-generated code by checking consistency between solutions to transformed problem variants, improving correctness confidence.

Contribution

It introduces a theory-grounded framework with four concrete methods that decorrelate model errors and enhance program correctness verification in code generation.

Findings

01

Increases correct program selection probability by 24% over baselines

02

Achieves 26% higher F1 score in selection-or-abstention scenarios

03

Consistently handles inexact problems with multiple valid solutions

Abstract

Large language models (LLMs) can generate executable code from natural language descriptions, but the resulting programs frequently contain bugs due to hallucinations. In the absence of formal specifications, existing approaches attempt to assess correctness using LLM-generated proxies such as tests or auto-formalized specifications. However, these proxies are produced by the same imperfect models and thus often corroborate rather than catch errors, especially when the model exhibits correlated errors. We introduce semantic triangulation, a theory-grounded framework that decorrelates model errors by transforming the original problem into a dissociative variant - one likely requiring a fundamentally different algorithm - and checks consistency between independently sampled solutions to both problems. We identify theoretical requirements for this framework, and we prove that under a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.