EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction

Hsi-Che Lin; Yu-Chu Yu; Kai-Po Chang; Yu-Chiang Frank Wang

arXiv:2506.12015·cs.LG·January 6, 2026

EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction

Hsi-Che Lin, Yu-Chu Yu, Kai-Po Chang, Yu-Chiang Frank Wang

PDF

Open Access 1 Video

TL;DR

EMLoC introduces a memory-efficient fine-tuning method for large models using emulator construction and LoRA correction, enabling cost-effective adaptation on standard hardware.

Contribution

The paper presents a novel emulator-based fine-tuning framework with a compensation algorithm, reducing memory requirements and allowing large model fine-tuning on consumer GPUs.

Findings

01

Outperforms baseline methods across multiple datasets and modalities.

02

Enables fine-tuning of a 38B model on a 24GB GPU without quantization.

03

Supports flexible compression ratios and standard training pipelines.

Abstract

Open-source foundation models have seen rapid adoption and development, enabling powerful general-purpose capabilities across diverse domains. However, fine-tuning large foundation models for domain-specific or personalized tasks remains prohibitively expensive for most users due to the significant memory overhead beyond that of inference. We introduce EMLoC, an Emulator-based Memory-efficient fine-tuning framework with LoRA Correction, which enables model fine-tuning within the same memory budget required for inference. EMLoC constructs a task-specific light-weight emulator using activation-aware singular value decomposition (SVD) on a small downstream calibration set. Fine-tuning then is performed on this lightweight emulator via LoRA. To tackle the misalignment between the original model and the compressed emulator, we propose a novel compensation algorithm to correct the fine-tuned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction· slideslive

Taxonomy

TopicsEmbedded Systems Design Techniques · Parallel Computing and Optimization Techniques · Digital Filter Design and Implementation