Alignment Adapter to Improve the Performance of Compressed Deep Learning Models

Rohit Raj Rai; Abhishek Dhaka; Amit Awekar

arXiv:2602.14635·cs.LG·February 17, 2026

Alignment Adapter to Improve the Performance of Compressed Deep Learning Models

Rohit Raj Rai, Abhishek Dhaka, Amit Awekar

PDF

Open Access

TL;DR

This paper introduces Alignment Adapter (AlAd), a lightweight module that enhances the performance of compressed deep learning models by aligning their embeddings with those of larger models, applicable across various compression methods.

Contribution

The paper presents AlAd, a novel, flexible, and compression-agnostic adapter that improves compressed model performance without significant overhead.

Findings

01

AlAd significantly boosts compressed model accuracy.

02

AlAd maintains low size and latency overhead.

03

Effective across multiple NLP tasks.

Abstract

Compressed Deep Learning (DL) models are essential for deployment in resource-constrained environments. But their performance often lags behind their large-scale counterparts. To bridge this gap, we propose Alignment Adapter (AlAd): a lightweight, sliding-window-based adapter. It aligns the token-level embeddings of a compressed model with those of the original large model. AlAd preserves local contextual semantics, enables flexible alignment across differing dimensionalities or architectures, and is entirely agnostic to the underlying compression method. AlAd can be deployed in two ways: as a plug-and-play module over a frozen compressed model, or by jointly fine-tuning AlAd with the compressed model for further performance gains. Through experiments on BERT-family models across three token-level NLP tasks, we demonstrate that AlAd significantly boosts the performance of compressed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Big Data and Digital Economy · IoT and Edge/Fog Computing