Label Supervised LLaMA Finetuning
Zongxi Li, Xianming Li, Yuzhang Liu, Haoran Xie, Jing Li, Fu-lee Wang,, Qing Li, Xiaoqin Zhong

TL;DR
This paper introduces a label-supervised finetuning method for LLaMA-2-7B that leverages latent representations for improved classification and NER tasks, outperforming larger models and traditional baselines without complex prompts.
Contribution
It proposes a novel label-supervised adaptation approach for LLMs using latent representations and LoRA, achieving state-of-the-art results in classification and NER with small-scale models.
Findings
LS-LLaMA outperforms larger LLMs in classification tasks.
Removing causal masks enhances NER performance.
The approach requires only a single GPU for finetuning.
Abstract
The recent success of Large Language Models (LLMs) has gained significant attention in both academia and industry. Substantial efforts have been made to enhance the zero- and few-shot generalization capabilities of open-source LLMs through finetuning. Currently, the prevailing approach is instruction-tuning, which trains LLMs to complete real-world tasks by generating responses guided by natural language instructions. It is worth noticing that such an approach may underperform in sequence and token classification tasks. Unlike text generation tasks, classification tasks have a limited label space, where precise label prediction is more appreciated than generating diverse and human-like responses. Prior research has unveiled that instruction-tuned LLMs cannot outperform BERT, prompting us to explore the potential of leveraging latent representations from LLMs for supervised label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Dropout · WordPiece · Attention Dropout · Dense Connections · Linear Layer · Weight Decay · Adam
