LLM-Driven Reasoning for Constraint-Aware Feature Selection in Industrial Systems
Yuhang Zhou, Zhuokai Zhao, Ke Li, Spilios Evmorfos, G\"okalp Demirci, Mingyi Wang, Qiao Liu, Qifei Wang, Serena Li, Weiwei Li, Tingting Wang, Mingze Gao, Gedi Zhou, Abhishek Kumar, Xiangjun Fan, Lizhu Zhang, Jiayi Liu

TL;DR
This paper introduces MoFA, a reasoning-based framework leveraging large language models for constraint-aware feature selection in industrial systems, improving model performance and efficiency with limited labeled data.
Contribution
The paper presents MoFA, a novel model-driven, reasoning-based feature selection method that incorporates semantic and quantitative feature information for industrial applications.
Findings
MoFA improves prediction accuracy in real-world industrial tasks.
MoFA reduces feature set complexity while maintaining performance.
MoFA discovers high-value feature interactions enhancing online engagement.
Abstract
Feature selection is a crucial step in large-scale industrial machine learning systems, directly affecting model accuracy, efficiency, and maintainability. Traditional feature selection methods rely on labeled data and statistical heuristics, making them difficult to apply in production environments where labeled data are limited and multiple operational constraints must be satisfied. To address this, we propose Model Feature Agent (MoFA), a model-driven framework that performs sequential, reasoning-based feature selection using both semantic and quantitative feature information. MoFA incorporates feature definitions, importance scores, correlations, and metadata (e.g., feature groups or types) into structured prompts and selects features through interpretable, constraint-aware reasoning. We evaluate MoFA in three real-world industrial applications: (1) True Interest and Time-Worthiness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Data Mining Algorithms and Applications · Constraint Satisfaction and Optimization
