Patched MOA: optimizing inference for diverse software development tasks

Asankhaya Sharma

arXiv:2407.18521·cs.SE·May 1, 2025·1 cites

Patched MOA: optimizing inference for diverse software development tasks

Asankhaya Sharma

PDF

Open Access 2 Repos 1 Datasets

TL;DR

Patched MOA is a novel inference optimization technique that improves the performance of smaller language models on software development tasks, surpassing larger models in efficiency and cost-effectiveness.

Contribution

This paper introduces Patched MOA, a model-agnostic inference optimization method that enhances LLM performance without fine-tuning or larger models, applicable across diverse tasks.

Findings

01

Boosts smaller model performance by over 15% on benchmarks

02

Outperforms larger models like gpt-4-turbo at lower cost

03

Consistently improves task completion rates in software workflows

Abstract

This paper introduces Patched MOA (Mixture of Agents), an inference optimization technique that significantly enhances the performance of large language models (LLMs) across diverse software development tasks. We evaluate three inference optimization algorithms - Best of N, Mixture of Agents, and Monte Carlo Tree Search and demonstrate that Patched MOA can boost the performance of smaller models to surpass that of larger, more expensive models. Notably, our approach improves the gpt-4o-mini model's performance on the Arena-Hard-Auto benchmark by 15.52%, outperforming gpt-4-turbo at a fraction of the cost. We also apply Patched MOA to various software development workflows, showing consistent improvements in task completion rates. Our method is model-agnostic, transparent to end-users, and can be easily integrated into existing LLM pipelines. This work contributes to the growing field of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

patched-codes/static-analysis-eval
dataset· 588 dl
588 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Techniques and Practices · Model-Driven Software Engineering Techniques · Business Process Modeling and Analysis