The Blessing of Dimensionality in LLM Fine-tuning: A Variance-Curvature Perspective

Qiyao Liang; Jinyeop Song; Yizhou Liu; Jeff Gore; Ila Fiete; Risto Miikkulainen; Xin Qiu

arXiv:2602.00170·cs.LG·February 3, 2026

The Blessing of Dimensionality in LLM Fine-tuning: A Variance-Curvature Perspective

Qiyao Liang, Jinyeop Song, Yizhou Liu, Jeff Gore, Ila Fiete, Risto Miikkulainen, Xin Qiu

PDF

Open Access

TL;DR

This paper reveals that the low-dimensional curvature structure of fine-tuning landscapes allows small population evolution strategies to effectively optimize large language models, explaining observed non-monotonic training behaviors.

Contribution

It introduces a geometric perspective on fine-tuning landscapes, demonstrating that high-curvature directions dominate and enable efficient optimization with small populations.

Findings

01

Small populations can effectively fine-tune billion-parameter models.

02

Reward improvements often rise and then degrade due to landscape geometry.

03

High-curvature directions dominate the fine-tuning landscape.

Abstract

Weight-perturbation evolution strategies (ES) can fine-tune billion-parameter language models with surprisingly small populations (e.g., $N \approx 30$ ), contradicting classical zeroth-order curse-of-dimensionality intuition. We also observe a second seemingly separate phenomenon: under fixed hyperparameters, the stochastic fine-tuning reward often rises, peaks, and then degrades in both ES and GRPO. We argue that both effects reflect a shared geometric property of fine-tuning landscapes: they are low-dimensional in curvature. A small set of high-curvature dimensions dominates improvement, producing (i) heterogeneous time scales that yield rise-then-decay under fixed stochasticity, as captured by a minimal quadratic stochastic-ascent model, and (ii) degenerate improving updates, where many random perturbations share similar components along these directions. Using ES as a geometric…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics