Evolving Demonstration Optimization for Chain-of-Thought Feature Transformation

Xinyuan Wang; Kunpeng Liu; Arun Vignesh Malarkkan; Yanjie Fu

arXiv:2603.09987·cs.CL·March 12, 2026

Evolving Demonstration Optimization for Chain-of-Thought Feature Transformation

Xinyuan Wang, Kunpeng Liu, Arun Vignesh Malarkkan, Yanjie Fu

PDF

Open Access

TL;DR

This paper introduces a dynamic, reinforcement learning-based framework for optimizing feature transformations in data-centric AI, leveraging evolving demonstrations to enhance the diversity, validity, and alignment of transformations with downstream tasks.

Contribution

It presents a novel closed-loop approach that evolves demonstration contexts for LLM-driven feature transformation, improving over static methods and enhancing performance and stability.

Findings

01

Outperforms classical and LLM-based baselines on tabular benchmarks.

02

More stable and diverse transformation generation compared to one-shot methods.

03

Framework generalizes across different LLMs and downstream evaluators.

Abstract

Feature Transformation (FT) is a core data-centric AI task that improves feature space quality to advance downstream predictive performance. However, discovering effective transformations remains challenging due to the large space of feature-operator combinations. Existing solutions rely on discrete search or latent generation, but they are frequently limited by sample inefficiency, invalid candidates, and redundant generations with limited coverage. Large Language Models (LLMs) offer strong priors for producing valid transformations, but current LLM-based FT methods typically rely on static demonstrations, resulting in limited diversity, redundant outputs, and weak alignment with downstream objectives. We propose a framework that optimizes context data for LLM-driven FT by evolving trajectory-level experiences in a closed loop. Starting from high-performing feature transportation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)