P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models
Shuo Yang, Chenchen Yuan, Yao Rong, Felix Steinbauer, Gjergji, Kasneci

TL;DR
This paper introduces a novel method combining Proximal Policy Optimization with Large Language Models to improve tabular data augmentation, resulting in more accurate synthetic data generation for better model training.
Contribution
The paper presents a new approach that integrates PPO with LLMs to enhance tabular data synthesis, addressing limitations of existing GAN and LLM methods.
Findings
PPO-guided LLMs improve data quality.
Achieved 4% accuracy increase on real-world datasets.
Outperforms state-of-the-art data augmentation methods.
Abstract
A multitude of industries depend on accurate and reasonable tabular data augmentation for their business processes. Contemporary methodologies in generating tabular data revolve around utilizing Generative Adversarial Networks (GAN) or fine-tuning Large Language Models (LLM). However, GAN-based approaches are documented to produce samples with common-sense errors attributed to the absence of external knowledge. On the other hand, LLM-based methods exhibit a limited capacity to capture the disparities between synthesized and actual data distribution due to the absence of feedback from a discriminator during training. Furthermore, the decoding of LLM-based generation introduces gradient breakpoints, impeding the backpropagation of loss from a discriminator, thereby complicating the integration of these two approaches. To solve this challenge, we propose using proximal policy optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsData Quality and Management · Topic Modeling · Text Readability and Simplification
MethodsEntropy Regularization · Proximal Policy Optimization
