BudgetMLAgent: A Cost-Effective LLM Multi-Agent system for Automating   Machine Learning Tasks

Shubham Gandhi; Manasi Patwardhan; Lovekesh Vig; Gautam Shroff

arXiv:2411.07464·cs.MA·January 9, 2025

BudgetMLAgent: A Cost-Effective LLM Multi-Agent system for Automating Machine Learning Tasks

Shubham Gandhi, Manasi Patwardhan, Lovekesh Vig, Gautam Shroff

PDF

Open Access

TL;DR

BudgetMLAgent introduces a cost-effective multi-agent system using low-cost LLMs and expert collaboration to improve success rates in ML tasks while significantly reducing costs.

Contribution

The paper presents a novel multi-agent LLM system that combines low-cost models with expert calls, achieving better performance at a fraction of the cost compared to single GPT-4 systems.

Findings

01

94.2% cost reduction over GPT-4 single-agent system

02

Improved success rate of 32.95% versus 22.72% for GPT-4

03

Effective use of low-cost models with expert collaboration

Abstract

Large Language Models (LLMs) excel in diverse applications including generation of code snippets, but often struggle with generating code for complex Machine Learning (ML) tasks. Although existing LLM single-agent based systems give varying performance depending on the task complexity, they purely rely on larger and expensive models such as GPT-4. Our investigation reveals that no-cost and low-cost models such as Gemini-Pro, Mixtral and CodeLlama perform far worse than GPT-4 in a single-agent setting. With the motivation of developing a cost-efficient LLM based solution for solving ML tasks, we propose an LLM Multi-Agent based system which leverages combination of experts using profiling, efficient retrieval of past observations, LLM cascades, and ask-the-expert calls. Through empirical analysis on ML engineering tasks in the MLAgentBench benchmark, we demonstrate the effectiveness of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical and Computational Modeling

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Label Smoothing · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection