LightAutoDS-Tab: Multi-AutoML Agentic System for Tabular Data
Aleksey Lapin, Igor Hromov, Stanislav Chumakov, Mile Mitrovic, Dmitry Simakov, Nikolay O. Nikitin, Andrey V. Savchenko

TL;DR
LightAutoDS-Tab is a multi-AutoML system that integrates LLM-based code generation with various AutoML tools to enhance flexibility and performance on tabular data tasks, outperforming existing solutions.
Contribution
It introduces a novel multi-AutoML agentic system combining LLMs with multiple AutoML tools for improved tabular data task performance.
Findings
Outperforms state-of-the-art open-source AutoML solutions on Kaggle datasets
Enhances pipeline design flexibility and robustness
Demonstrates effectiveness across diverse tabular data tasks
Abstract
AutoML has advanced in handling complex tasks using the integration of LLMs, yet its efficiency remains limited by dependence on specific underlying tools. In this paper, we introduce LightAutoDS-Tab, a multi-AutoML agentic system for tasks with tabular data, which combines an LLM-based code generation with several AutoML tools. Our approach improves the flexibility and robustness of pipeline design, outperforming state-of-the-art open-source solutions on several data science tasks from Kaggle. The code of LightAutoDS-Tab is available in the open repository https://github.com/sb-ai-lab/LADS
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Text and Document Classification Technologies · vaccines and immunoinformatics approaches
