FAMOSE: A ReAct Approach to Automated Feature Discovery
Keith Burghardt, Jienan Liu, Sadman Sakib, Yuning Hao, Bo Li

TL;DR
FAMOSE introduces an innovative agent-based ReAct framework for automated feature engineering in machine learning, significantly improving performance on classification and regression tasks by enabling autonomous feature discovery and selection.
Contribution
This work is the first to apply an agentic ReAct approach to automated feature engineering for both regression and classification tasks, demonstrating state-of-the-art results.
Findings
Achieves 0.23% ROC-AUC improvement on classification tasks with over 10K instances.
Reduces RMSE by 2.0% on average for regression tasks.
Demonstrates robustness and inventiveness in feature discovery.
Abstract
Feature engineering remains a critical yet challenging bottleneck in machine learning, particularly for tabular data, as identifying optimal features from an exponentially large feature space traditionally demands substantial domain expertise. To address this challenge, we introduce FAMOSE (Feature AugMentation and Optimal Selection agEnt), a novel framework that leverages the ReAct paradigm to autonomously explore, generate, and refine features while integrating feature selection and evaluation tools within an agent architecture. To our knowledge, FAMOSE represents the first application of an agentic ReAct framework to automated feature engineering, especially for both regression and classification tasks. Extensive experiments demonstrate that FAMOSE is at or near the state-of-the-art on classification tasks (especially tasks with more than 10K instances, where ROC-AUC increases 0.23%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Data Mining Algorithms and Applications · Data Stream Mining Techniques
