# GPT-FT: An Efficient Automated Feature Transformation Using GPT for Sequence Reconstruction and Performance Enhancement

**Authors:** Yang Gao, Dongjie Wang, Scott Piersall, Ye Zhang, Liqiang Wang

arXiv: 2508.20824 · 2025-08-29

## TL;DR

This paper introduces GPT-FT, a transformer-based framework that automates feature transformation for machine learning, achieving high performance with reduced computational costs through a novel multi-step process.

## Contribution

The paper presents a new GPT-based method for automated feature transformation that improves efficiency and scalability over existing encoder-decoder approaches.

## Key findings

- Matches or exceeds baseline performance on benchmarks
- Significantly reduces computational costs
- Enhances scalability of feature transformation processes

## Abstract

Feature transformation plays a critical role in enhancing machine learning model performance by optimizing data representations. Recent state-of-the-art approaches address this task as a continuous embedding optimization problem, converting discrete search into a learnable process. Although effective, these methods often rely on sequential encoder-decoder structures that cause high computational costs and parameter requirements, limiting scalability and efficiency. To address these limitations, we propose a novel framework that accomplishes automated feature transformation through four steps: transformation records collection, embedding space construction with a revised Generative Pre-trained Transformer (GPT) model, gradient-ascent search, and autoregressive reconstruction. In our approach, the revised GPT model serves two primary functions: (a) feature transformation sequence reconstruction and (b) model performance estimation and enhancement for downstream tasks by constructing the embedding space. Such a multi-objective optimization framework reduces parameter size and accelerates transformation processes. Experimental results on benchmark datasets show that the proposed framework matches or exceeds baseline performance, with significant gains in computational efficiency. This work highlights the potential of transformer-based architectures for scalable, high-performance automated feature transformation.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.20824/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/2508.20824/full.md

## References

49 references — full list in the complete paper: https://tomesphere.com/paper/2508.20824/full.md

---
Source: https://tomesphere.com/paper/2508.20824