LoRA in LoRA: Towards Parameter-Efficient Architecture Expansion for Continual Visual Instruction Tuning

Chang Che; Ziqi Wang; Pengwan Yang; Qi Wang; Hui Ma; Zenglin Shi

arXiv:2508.06202·cs.CV·May 15, 2026

LoRA in LoRA: Towards Parameter-Efficient Architecture Expansion for Continual Visual Instruction Tuning

Chang Che, Ziqi Wang, Pengwan Yang, Qi Wang, Hui Ma, Zenglin Shi

PDF

1 Repo 1 Video

TL;DR

LiLoRA is a parameter-efficient architecture expansion method for continual visual instruction tuning, reducing redundancy and preserving shared representations to improve sequential task learning in multimodal models.

Contribution

Introduces LiLoRA, a novel low-rank, shared matrix approach with stability loss for efficient continual learning in multimodal models.

Findings

01

LiLoRA outperforms existing methods in sequential task learning.

02

LiLoRA significantly reduces parameter overhead.

03

LiLoRA maintains better shared representation stability.

Abstract

Continual Visual Instruction Tuning (CVIT) enables Multimodal Large Language Models (MLLMs) to incrementally learn new tasks over time. However, this process is challenged by catastrophic forgetting, where performance on previously learned tasks deteriorates as the model adapts to new ones. A common approach to mitigate forgetting is architecture expansion, which introduces task-specific modules to prevent interference. Yet, existing methods often expand entire layers for each task, leading to significant parameter overhead and poor scalability. To overcome these issues, we introduce LoRA in LoRA (LiLoRA), a highly efficient architecture expansion method tailored for CVIT in MLLMs. LiLoRA shares the LoRA matrix A across tasks to reduce redundancy, applies an additional low-rank decomposition to matrix B to minimize task-specific parameters, and incorporates a cosine-regularized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chanceche/LiLoRA
github

Videos

LoRA in LoRA: Towards Parameter-Efficient Architecture Expansion for Continual Visual Instruction Tuning· underline