False Security Confidence in Benign LLM Code Generation

Xiaolei Ren

arXiv:2604.17014·cs.CR·April 22, 2026

False Security Confidence in Benign LLM Code Generation

Xiaolei Ren

PDF

TL;DR

This paper introduces the concept of False Security Confidence (FSC) in LLM code generation, focusing on how often security failures occur in correct outputs without explicit attacks, and evaluates detection reliability.

Contribution

It formalizes FSC rate, distinguishes it from prior metrics, and proposes a framework for studying its manifestation across different programming and deployment contexts.

Findings

01

Defines FSC rate as security failures in correct outputs

02

Distinguishes FSC from SAFE and CWEval metrics

03

Proposes a three-ecosystem task view for FSC study

Abstract

Prior work has demonstrated that functionally correct yet vulnerable outputs arise systematically in threat-oriented settings, where adversarial or implicit channels are used to induce security failures in code agents and automated patching workflows. This note introduces a complementary but distinct framing: False Security Confidence (FSC), which studies the same surface phenomenon from a measurement-first perspective in ordinary, non-attack-framed generation tasks. Our interest is not in whether attacks can produce such outputs, but in how frequently and in what forms they appear absent explicit attack pressure, and whether conventional functional evaluation reliably detects them. We formalize FSC rate as the prevalence of security failure within the set of functionally correct outputs, distinguish it from prior joint functional-security metrics such as SAFE and outcome-driven…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.