Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning

Jiaru Zou; Yikun Ban; Zihao Li; Yunzhe Qi; Ruizhong Qiu; Ling Yang; Jingrui He

arXiv:2505.16270 · cs.CL · November 17, 2025

TL;DR

Transformer Copilot introduces a novel framework where a secondary model learns from a log of the primary model's mistakes to refine its outputs, significantly improving performance across diverse tasks with minimal additional computation.

Contribution

The paper proposes a new Pilot-Copilot framework that leverages mistake logs for continuous learning and logits rectification, enhancing LLM fine-tuning effectiveness.
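As an illustration of what logits rectification could look like at inference time, the sketch below adds a Copilot-produced correction to the Pilot's logits before the next token is chosen. The additive form, the `strength` parameter, and the names `rectify_logits`, `pilot_logits`, and `copilot_correction` are assumptions made for this example; the listing does not give the paper's exact formulation, and the official repository should be treated as the reference.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def rectify_logits(pilot_logits, copilot_correction, strength=1.0):
    """Return pilot logits shifted by a scaled correction term.

    Assumption: the correction is additive and has one value per
    vocabulary entry. `strength` is a hypothetical scaling knob.
    """
    if len(pilot_logits) != len(copilot_correction):
        raise ValueError("logits and correction must have the same length")
    return [p + strength * c for p, c in zip(pilot_logits, copilot_correction)]


def argmax(values):
    """Index of the largest value."""
    return max(range(len(values)), key=lambda i: values[i])


# Toy three-token vocabulary with made-up numbers.
pilot_logits = [2.0, 1.5, 0.1]
copilot_correction = [-1.0, 0.5, 0.0]

rectified = rectify_logits(pilot_logits, copilot_correction)
token_before = argmax(pilot_logits)
token_after = argmax(rectified)
probs_after = softmax(rectified)
```

In this toy case the Pilot alone would pick token 0, while the rectified logits favor token 1. Setting `strength=0.0` recovers the Pilot's original prediction.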

Findings

01 Up to 34.5% performance improvement on benchmarks

02 Effective across diverse tasks including commonsense and arithmetic

03 Minimal additional computational overhead

Abstract

Large language models are typically adapted to downstream tasks through supervised fine-tuning on domain-specific data. While standard fine-tuning focuses on minimizing generation loss to optimize model parameters, we take a deeper step by retaining and leveraging the model's own learning signals, analogous to how human learners reflect on past mistakes to improve future performance. We first introduce the concept of Mistake Log to systematically track the model's learning behavior and recurring errors throughout fine-tuning. Treating the original transformer-based model as the Pilot, we correspondingly design a Copilot model to refine the Pilot's inference performance via logits rectification. We name the overall Pilot-Copilot framework the Transformer Copilot, which introduces (i) a novel Copilot model design, (ii) a joint training paradigm where the Copilot continuously learns from…
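To make the Mistake Log idea concrete, here is a minimal sketch of a structure that records, for each fine-tuning step, how far the model's predicted distribution was from the target, and then reports examples that were mispredicted repeatedly. The fields, the error definition (predicted probability minus one-hot target), and the names `MistakeEntry`, `MistakeLog`, and `recurring` are all hypothetical choices for this example; the abstract does not specify what the paper's log actually stores.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class MistakeEntry:
    step: int             # fine-tuning step at which the entry was recorded
    example_id: str       # hypothetical identifier for the training example
    predicted_index: int  # token the model ranked highest
    target_index: int     # ground-truth token
    error: list           # predicted probability minus one-hot target


@dataclass
class MistakeLog:
    entries: list = field(default_factory=list)

    def record(self, step, example_id, probs, target_index):
        """Store one entry describing the model's discrepancy at this step."""
        error = [
            p - (1.0 if i == target_index else 0.0)
            for i, p in enumerate(probs)
        ]
        predicted_index = max(range(len(probs)), key=lambda i: probs[i])
        self.entries.append(
            MistakeEntry(step, example_id, predicted_index, target_index, error)
        )

    def recurring(self, min_count=2):
        """Example ids mispredicted at least `min_count` times, sorted."""
        wrong = Counter(
            e.example_id
            for e in self.entries
            if e.predicted_index != e.target_index
        )
        return sorted(k for k, n in wrong.items() if n >= min_count)


# Toy usage with made-up probabilities over a three-token vocabulary.
log = MistakeLog()
log.record(0, "ex-1", [0.7, 0.2, 0.1], target_index=1)  # wrong
log.record(1, "ex-1", [0.5, 0.4, 0.1], target_index=1)  # wrong again
log.record(1, "ex-2", [0.1, 0.8, 0.1], target_index=1)  # correct
recurring_ids = log.recurring()
```

A log like this could serve as training data for a secondary model, since each entry pairs a step and example with a signed error signal. How the Copilot actually consumes its log is described in the paper, not here.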

Code & Models

Repositories

jiaruzouu/transformercopilot (Official)

Taxonomy

Topics: Topic Modeling · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques

Methods: Attention Is All You Need · Linear Layer · Layer Normalization · Multi-Head Attention · Dense Connections · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Residual Connection · Byte Pair Encoding