Label Supervised LLaMA Finetuning

Zongxi Li; Xianming Li; Yuzhang Liu; Haoran Xie; Jing Li; Fu-lee Wang,; Qing Li; Xiaoqin Zhong

arXiv:2310.01208·cs.CL·October 3, 2023·20 cites

Label Supervised LLaMA Finetuning

Zongxi Li, Xianming Li, Yuzhang Liu, Haoran Xie, Jing Li, Fu-lee Wang,, Qing Li, Xiaoqin Zhong

PDF

Open Access 2 Repos 2 Models

TL;DR

This paper introduces a label-supervised finetuning method for LLaMA-2-7B that leverages latent representations for improved classification and NER tasks, outperforming larger models and traditional baselines without complex prompts.

Contribution

It proposes a novel label-supervised adaptation approach for LLMs using latent representations and LoRA, achieving state-of-the-art results in classification and NER with small-scale models.

Findings

01

LS-LLaMA outperforms larger LLMs in classification tasks.

02

Removing causal masks enhances NER performance.

03

The approach requires only a single GPU for finetuning.

Abstract

The recent success of Large Language Models (LLMs) has gained significant attention in both academia and industry. Substantial efforts have been made to enhance the zero- and few-shot generalization capabilities of open-source LLMs through finetuning. Currently, the prevailing approach is instruction-tuning, which trains LLMs to complete real-world tasks by generating responses guided by natural language instructions. It is worth noticing that such an approach may underperform in sequence and token classification tasks. Unlike text generation tasks, classification tasks have a limited label space, where precise label prediction is more appreciated than generating diverse and human-like responses. Prior research has unveiled that instruction-tuned LLMs cannot outperform BERT, prompting us to explore the potential of leveraging latent representations from LLMs for supervised label…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Dropout · WordPiece · Attention Dropout · Dense Connections · Linear Layer · Weight Decay · Adam