SoftPipe: A Soft-Guided Reinforcement Learning Framework for Automated Data Preparation

Jing Chang; Chang Liu; Jinbin Huang; Shuyuan Zheng; Rui Mao; Jianbin Qin

arXiv:2507.13710·cs.DB·July 29, 2025

SoftPipe: A Soft-Guided Reinforcement Learning Framework for Automated Data Preparation

Jing Chang, Chang Liu, Jinbin Huang, Shuyuan Zheng, Rui Mao, Jianbin Qin

PDF

TL;DR

SoftPipe introduces a flexible reinforcement learning framework for automated data preparation, replacing rigid constraints with soft guidance from a language model, leading to better and faster pipeline optimization.

Contribution

It proposes a novel RL framework that uses soft guidance via Bayesian inference and LLMs, improving data pipeline quality and convergence speed.

Findings

01

Up to 13.9% improvement in pipeline quality

02

2.8× faster convergence than existing methods

03

Effective on 18 diverse datasets

Abstract

Data preparation is a foundational yet notoriously challenging component of the machine learning lifecycle, characterized by a vast combinatorial search space. While reinforcement learning (RL) offers a promising direction, state-of-the-art methods suffer from a critical limitation: to manage the search space, they rely on rigid ``hard constraints'' that prematurely prune the search space and often preclude optimal solutions. To address this, we introduce SoftPipe, a novel RL framework that replaces these constraints with a flexible ``soft guidance'' paradigm. SoftPipe formulates action selection as a Bayesian inference problem. A high-level strategic prior, generated by a Large Language Model (LLM), probabilistically guides exploration. This prior is combined with empirical estimators from two sources through a collaborative process: a fine-grained quality score from a supervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.