Lumos: Let there be Language Model System Certification

Isha Chaudhary; Vedaant Jain; Prineet Parhar; Kavya Sachdeva; Avaljot Singh; Sayan Ranu; Gagandeep Singh

arXiv:2512.02966·cs.PL·April 2, 2026

Lumos: Let there be Language Model System Certification

Isha Chaudhary, Vedaant Jain, Prineet Parhar, Kavya Sachdeva, Avaljot Singh, Sayan Ranu, Gagandeep Singh

PDF

TL;DR

Lumos is a formal framework for specifying and certifying language model system behaviors, enabling safety verification and robustness guarantees for diverse prompt distributions.

Contribution

Lumos introduces a novel probabilistic programming DSL with formal semantics for LMS certification, including safety specifications for vision-language models.

Findings

01

Lumos reveals safety failures in Qwen-VL under rainy driving conditions.

02

Lumos can encode complex LMS specifications with few constructs.

03

State-of-the-art models can generate Lumos specifications in zero-shot settings.

Abstract

We introduce the first principled framework, Lumos, for specifying and formally certifying Language Model System (LMS) behaviors. Lumos is an imperative probabilistic programming DSL over graphs, with constructs to generate independent and identically distributed prompts for LMS. It offers a structured view of prompt distributions via graphs, forming random prompts from sampled subgraphs. Lumos supports certifying LMS for arbitrary prompt distributions via integration with statistical certifiers. We provide hybrid (operational and denotational) semantics for Lumos, providing a rigorous way to interpret the specifications. Using only a small set of composable constructs, Lumos can encode existing LMS specifications, including complex relational and temporal specifications. It also facilitates specifying new properties - we present the first safety specifications for vision-language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.