Execution Envelopes: A Shared Admission Contract for Backend AI Execution Requests
Krti Tallam

TL;DR
The paper proposes the execution envelope, a standardized internal admission object for AI backends, enabling shared governance, observability, and resource management across heterogeneous execution requests.
Contribution
It introduces a formalized, descriptive admission seam that can be integrated into backend workflows without replacing existing request models or scheduling mechanisms.
Findings
Defines the structure and invariants of the execution envelope.
Formalizes the distinction between requested and granted resources.
Demonstrates application in POST /serving/deploy_model workflow.
Abstract
Enterprise AI backends increasingly admit heterogeneous execution requests across model deployment, inference, evaluation, data movement, and agentic workflows. In many systems, those requests arrive in service-specific shapes, which makes it difficult to attach shared admission-time behavior such as logging, governance hints, resource accounting, authorization-aware policy hooks, and later runtime review without rebuilding the same contract in each subsystem. This paper introduces the execution envelope, a normalized internal admission object that records who is asking for what kind of execution, what resources were requested, what policy-relevant scope accompanied the request, and what the backend ultimately granted. The proposal is intentionally narrow. It does not replace service-specific request models, perform scheduling, or introduce a new authority token. Instead, it defines a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
