Evolving Demonstration Optimization for Chain-of-Thought Feature Transformation
Xinyuan Wang, Kunpeng Liu, Arun Vignesh Malarkkan, Yanjie Fu

TL;DR
This paper introduces a dynamic, reinforcement learning-based framework for optimizing feature transformations in data-centric AI, leveraging evolving demonstrations to enhance the diversity, validity, and alignment of transformations with downstream tasks.
Contribution
It presents a novel closed-loop approach that evolves demonstration contexts for LLM-driven feature transformation, improving over static methods and enhancing performance and stability.
Findings
Outperforms classical and LLM-based baselines on tabular benchmarks.
More stable and diverse transformation generation compared to one-shot methods.
Framework generalizes across different LLMs and downstream evaluators.
Abstract
Feature Transformation (FT) is a core data-centric AI task that improves feature space quality to advance downstream predictive performance. However, discovering effective transformations remains challenging due to the large space of feature-operator combinations. Existing solutions rely on discrete search or latent generation, but they are frequently limited by sample inefficiency, invalid candidates, and redundant generations with limited coverage. Large Language Models (LLMs) offer strong priors for producing valid transformations, but current LLM-based FT methods typically rely on static demonstrations, resulting in limited diversity, redundant outputs, and weak alignment with downstream objectives. We propose a framework that optimizes context data for LLM-driven FT by evolving trajectory-level experiences in a closed loop. Starting from high-performing feature transportation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)
