APT: Improving Specialist LLM Performance with Weakness Case Acquisition and Iterative Preference Training

Jun Rao; Zepeng Lin; Xuebo Liu; Xiaopeng Ke; Lian Lian; Dong Jin; Shengjun Cheng; Jun Yu; Min Zhang

arXiv:2506.03483·cs.CL·June 5, 2025

APT: Improving Specialist LLM Performance with Weakness Case Acquisition and Iterative Preference Training

Jun Rao, Zepeng Lin, Xuebo Liu, Xiaopeng Ke, Lian Lian, Dong Jin, Shengjun Cheng, Jun Yu, Min Zhang

PDF

Open Access 1 Video

TL;DR

The paper introduces APT, a training method that improves domain-specific performance of LLMs by focusing on error cases and similar samples, maintaining general capabilities while enhancing targeted task performance.

Contribution

APT is a novel training approach that selectively uses error and similar cases for domain adaptation, avoiding interference with the model's general knowledge.

Findings

01

APT improves domain-specific task performance.

02

APT maintains the model's general capabilities.

03

Experimental results outperform existing methods.

Abstract

Large Language Models (LLMs) often require domain-specific fine-tuning to address targeted tasks, which risks degrading their general capabilities. Maintaining a balance between domain-specific enhancements and general model utility is a key challenge. This paper proposes a novel approach named APT (Weakness Case Acquisition and Iterative Preference Training) to enhance domain-specific performance with self-generated dis-preferred weakness data (bad cases and similar cases). APT uniquely focuses on training the model using only those samples where errors occur, alongside a small, similar set of samples retrieved for this purpose. This targeted training minimizes interference with the model's existing knowledge base, effectively retaining generic capabilities. Experimental results on the LLama-2 and Mistral-V0.3 models across various benchmarks demonstrate that APT ensures no reduction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

APT: Improving Specialist LLM Performance with Weakness Case Acquisition and Iterative Preference Training· underline

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Artificial Intelligence in Healthcare and Education