Trustworthy AI in the Agentic Lakehouse: from Concurrency to Governance

Jacopo Tagliabue; Federico Bianchi; Ciro Greco

arXiv:2511.16402·cs.AI·November 21, 2025

Trustworthy AI in the Agentic Lakehouse: from Concurrency to Governance

Jacopo Tagliabue, Federico Bianchi, Ciro Greco

PDF

Open Access

TL;DR

This paper introduces Bauplan, an agent-first lakehouse architecture that enhances trustworthiness by redesigning data and compute isolation around transactions, enabling reliable agent workflows in enterprise environments.

Contribution

It proposes Bauplan, a novel lakehouse design that addresses agent access challenges and integrates governance, with a reference implementation demonstrating its effectiveness.

Findings

01

Bauplan improves agent trustworthiness in lakehouses.

02

The design enables seamless coupling of agent reasoning with guarantees.

03

A self-healing pipeline showcases practical application.

Abstract

Even as AI capabilities improve, most enterprises do not consider agents trustworthy enough to work on production data. In this paper, we argue that the path to trustworthy agentic workflows begins with solving the infrastructure problem first: traditional lakehouses are not suited for agent access patterns, but if we design one around transactions, governance follows. In particular, we draw an operational analogy to MVCC in databases and show why a direct transplant fails in a decoupled, multi-language setting. We then propose an agent-first design, Bauplan, that reimplements data and compute isolation in the lakehouse. We conclude by sharing a reference implementation of a self-healing pipeline in Bauplan, which seamlessly couples agent reasoning with all the desired guarantees for correctness and trust.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Access Control and Trust · Business Process Modeling and Analysis