Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost

Yinggan Xu; Risto Miikkulainen; Xin Qiu

arXiv:2602.03120·cs.LG·February 4, 2026

Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost

Yinggan Xu, Risto Miikkulainen, Xin Qiu

PDF

Open Access

TL;DR

This paper introduces Quantized Evolution Strategies (QES), a novel method enabling high-precision fine-tuning of quantized large language models directly in their discrete parameter space, overcoming previous limitations of gradient-based methods.

Contribution

QES is the first approach to perform full-parameter fine-tuning in the quantized space using a backpropagation-free optimization with error feedback and seed replay, enabling scalable LLM fine-tuning.

Findings

01

QES outperforms state-of-the-art zeroth-order fine-tuning methods.

02

QES enables effective fine-tuning of quantized models for arithmetic reasoning.

03

QES reduces memory usage to low-precision inference levels.

Abstract

Post-Training Quantization (PTQ) is essential for deploying Large Language Models (LLMs) on memory-constrained devices, yet it renders models static and difficult to fine-tune. Standard fine-tuning paradigms, including Reinforcement Learning (RL), fundamentally rely on backpropagation and high-precision weights to compute gradients. Thus they cannot be used on quantized models, where the parameter space is discrete and non-differentiable. While Evolution Strategies (ES) offer a backpropagation-free alternative, optimization of the quantized parameters can still fail due to vanishing or inaccurate gradient. This paper introduces Quantized Evolution Strategies (QES), an optimization paradigm that performs full-parameter fine-tuning directly in the quantized space. QES is based on two innovations: (1) it integrates accumulated error feedback to preserve high-precision gradient signals, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification