AutoML-Med: A Framework for Automated Machine Learning in Medical Tabular Data
Riccardo Francia, Maurizio Leone, Giorgio Leonardi, Stefania Montani, Marzio Pennisi, Manuel Striani, Sandra D'Alfonso

TL;DR
AutoML-Med is a specialized automated machine learning framework designed to handle complex medical tabular data issues, improving predictive accuracy and sensitivity in clinical tasks with minimal user intervention.
Contribution
It introduces a novel AutoML architecture tailored for medical data challenges, integrating Latin Hypercube Sampling and PRCC for optimized preprocessing and model selection.
Findings
Outperforms existing tools in balanced accuracy and sensitivity.
Effectively handles missing data, class imbalance, and heterogeneous features.
Demonstrates significant improvements in clinical prediction tasks.
Abstract
Medical datasets are typically affected by issues such as missing values, class imbalance, a heterogeneous feature types, and a high number of features versus a relatively small number of samples, preventing machine learning models from obtaining proper results in classification and regression tasks. This paper introduces AutoML-Med, an Automated Machine Learning tool specifically designed to address these challenges, minimizing user intervention and identifying the optimal combination of preprocessing techniques and predictive models. AutoML-Med's architecture incorporates Latin Hypercube Sampling (LHS) for exploring preprocessing methods, trains models using selected metrics, and utilizes Partial Rank Correlation Coefficient (PRCC) for fine-tuned optimization of the most influential preprocessing steps. Experimental results demonstrate AutoML-Med's effectiveness in two different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Machine Learning in Healthcare · Artificial Intelligence in Healthcare
