Multi-objective Evolutionary Merging Enables Efficient Reasoning Models

Mario Iacobelli; Adrian Robert Minut; Tommaso Mencattini; Donato Crisostomi; Andrea Santilli; Iacopo Masi; Emanuele Rodol\`a

arXiv:2604.06465·cs.CL·April 9, 2026

Multi-objective Evolutionary Merging Enables Efficient Reasoning Models

Mario Iacobelli, Adrian Robert Minut, Tommaso Mencattini, Donato Crisostomi, Andrea Santilli, Iacopo Masi, Emanuele Rodol\`a

PDF

TL;DR

Evo-L2S introduces a multi-objective evolutionary framework to optimize reasoning models for shorter outputs without sacrificing accuracy, significantly reducing inference costs.

Contribution

It formulates the Long-to-Short reasoning as a multi-objective optimization problem and proposes an entropy-based sampling method for efficient model merging.

Findings

01

Reduces reasoning trace length by over 50%

02

Maintains or improves accuracy across multiple model scales

03

Demonstrates effectiveness on six mathematical benchmarks

Abstract

Reasoning models have demonstrated remarkable capabilities in solving complex problems by leveraging long chains of thought. However, this more deliberate reasoning comes with substantial computational overhead at inference time. The Long-to-Short (L2S) reasoning problem seeks to maintain high accuracy using fewer tokens, but current training-free model merging approaches rely on scalarized, fixed-hyperparameter arithmetic methods that are highly brittle and force suboptimal compromises. To address this gap, we introduce Evo-L2S, a novel framework that formulates L2S reasoning as a multi-objective optimization challenge. By leveraging evolutionary model merging, Evo-L2S explicitly optimizes the trade-off between accuracy and output length to produce a robust Pareto front of merged models. To make this search computationally tractable for large language models, we propose an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.