Trustworthy AI in the Agentic Lakehouse: from Concurrency to Governance
Jacopo Tagliabue, Federico Bianchi, Ciro Greco

TL;DR
This paper introduces Bauplan, an agent-first lakehouse architecture that enhances trustworthiness by redesigning data and compute isolation around transactions, enabling reliable agent workflows in enterprise environments.
Contribution
It proposes Bauplan, a novel lakehouse design that addresses agent access challenges and integrates governance, with a reference implementation demonstrating its effectiveness.
Findings
Bauplan improves agent trustworthiness in lakehouses.
The design enables seamless coupling of agent reasoning with guarantees.
A self-healing pipeline showcases practical application.
Abstract
Even as AI capabilities improve, most enterprises do not consider agents trustworthy enough to work on production data. In this paper, we argue that the path to trustworthy agentic workflows begins with solving the infrastructure problem first: traditional lakehouses are not suited for agent access patterns, but if we design one around transactions, governance follows. In particular, we draw an operational analogy to MVCC in databases and show why a direct transplant fails in a decoupled, multi-language setting. We then propose an agent-first design, Bauplan, that reimplements data and compute isolation in the lakehouse. We conclude by sharing a reference implementation of a self-healing pipeline in Bauplan, which seamlessly couples agent reasoning with all the desired guarantees for correctness and trust.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Access Control and Trust · Business Process Modeling and Analysis
