TL;DR
AdaLoRA-QAT is a two-stage fine-tuning framework for medical image segmentation that combines adaptive low-rank adaptation with quantization-aware training, achieving high accuracy with significantly reduced model size.
Contribution
It introduces a novel combination of adaptive low-rank encoder adaptation and quantization-aware training for efficient, reliable medical image segmentation models.
Findings
Achieves 95.6% Dice score, matching full-precision models.
Reduces trainable parameters by 16.6 times.
Yields 2.24 times model compression without significant accuracy loss.
Abstract
Chest X-ray (CXR) segmentation is an important step in computer-aided diagnosis, yet deploying large foundation models in clinical settings remains challenging due to computational constraints. We propose AdaLoRA-QAT, a two-stage fine-tuning framework that combines adaptive low-rank encoder adaptation with full quantization-aware training. Adaptive rank allocation improves parameter efficiency, while selective mixed-precision INT8 quantization preserves structural fidelity crucial for clinical reliability. Evaluated across large-scale CXR datasets, AdaLoRA-QAT achieves 95.6% Dice, matching full-precision SAM decoder fine-tuning while reducing trainable parameters by 16.6\times and yielding 2.24\times model compression. A Wilcoxon signed-rank test confirms that quantization does not significantly degrade segmentation accuracy. These results demonstrate that AdaLoRA-QAT effectively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
