The LLMbda Calculus: AI Agents, Conversations, and Information Flow

Zac Garby; Andrew D. Gordon; David Sands

arXiv:2602.20064·cs.PL·February 24, 2026

The LLMbda Calculus: AI Agents, Conversations, and Information Flow

Zac Garby, Andrew D. Gordon, David Sands

PDF

Open Access

TL;DR

This paper introduces a formal lambda calculus model for AI agent conversations with LLMs, enabling rigorous reasoning about security vulnerabilities and defenses in prompt-based interactions.

Contribution

It develops a semantic framework that models LLM interactions and vulnerabilities, providing formal guarantees for safety and information flow in AI agent systems.

Findings

01

The calculus models prompt injection and its impact on subsequent reasoning.

02

Semantics support reasoning about defenses like quarantining conversations.

03

A noninterference theorem guarantees confidentiality and integrity.

Abstract

A conversation with a large language model (LLM) is a sequence of prompts and responses, with each response generated from the preceding conversation. AI agents build such conversations automatically: given an initial human prompt, a planner loop interleaves LLM calls with tool invocations and code execution. This tight coupling creates a new and poorly understood attack surface. A malicious prompt injected into a conversation can compromise later reasoning, trigger dangerous tool calls, or distort final outputs. Despite the centrality of such systems, we currently lack a principled semantic foundation for reasoning about their behaviour and safety. We address this gap by introducing an untyped call-by-value lambda calculus enriched with dynamic information-flow control and a small number of primitives for constructing prompt-response conversations. Our language includes a primitive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSecurity and Verification in Computing · Logic, programming, and type systems · Formal Methods in Verification