Ladder Up, Memory Down: Low-Cost Fine-Tuning With Side Nets

Estelle Zheng (LORIA; ALE); Nathan Cerisara (LORIA); S\'ebastien Warichet (ALE); Emmanuel Helbert (ALE); Christophe Cerisara (SYNALP; LORIA)

arXiv:2512.14237·cs.CL·December 17, 2025

Ladder Up, Memory Down: Low-Cost Fine-Tuning With Side Nets

Estelle Zheng (LORIA, ALE), Nathan Cerisara (LORIA), S\'ebastien Warichet (ALE), Emmanuel Helbert (ALE), Christophe Cerisara (SYNALP, LORIA)

PDF

Open Access

TL;DR

This paper introduces Ladder Side Tuning (LST), a memory-efficient PEFT method for fine-tuning large language models that matches existing methods in accuracy while significantly reducing memory usage, enabling larger models to be fine-tuned on consumer hardware.

Contribution

The paper revisits and demonstrates the effectiveness of Ladder Side Tuning, a rarely explored PEFT technique, showing it reduces memory usage by 50% and scales similarly to QLoRA, with the addition of a new depth-extended variant called xLadder.

Findings

01

LST cuts peak memory by 50% compared to full fine-tuning.

02

LST achieves comparable accuracy to QLoRA across benchmarks.

03

xLadder enables deeper reasoning with no additional memory overhead.

Abstract

Fine-tuning large language models (LLMs) is often limited by the memory available on commodity GPUs. Parameter-efficient fine-tuning (PEFT) methods such as QLoRA reduce the number of trainable parameters, yet still incur high memory usage induced by the backward pass in the full model. We revisit Ladder Side Tuning (LST), a rarely explored PEFT technique that adds a lightweight side network, and show that it matches QLoRA's compute scaling slope while cutting peak memory by 50\%. Across different downstream benchmarks spanning natural language understanding, mathematical and LLM-critic tasks, LST has competitive performance with QLoRA's accuracy on average while being much more memory-efficient. This efficiency enables fine-tuning of 7B-parameter models on a single 12 GB consumer GPU with 2k-token contexts, requiring no gradient checkpointing\textemdash conditions under which QLoRA…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Advanced Neural Network Applications