Contrastive Instruction Tuning

Tianyi Lorena Yan; Fei Wang; James Y. Huang; Wenxuan Zhou; Fan Yin; Aram Galstyan; Wenpeng Yin; Muhao Chen

arXiv:2402.11138·cs.CL·June 7, 2024·1 cites

TL;DR

Contrastive Instruction Tuning enhances large language models' robustness to unseen instructions by aligning semantically equivalent instruction representations, leading to more consistent outputs across varied instruction phrasings.

Contribution

This paper introduces a novel contrastive training method for instruction tuning that improves LLM robustness to instruction variations by augmenting instruction data with paraphrases.

Findings

01 Improves robustness to unseen instructions by +2.5% accuracy on PromptBench

02 Enhances consistency of LLM outputs across instruction variations

03 Demonstrates effectiveness across multiple levels of textual variation

Abstract

Instruction tuning has been used as a promising approach to improve the performance of large language models (LLMs) on unseen tasks. However, current LLMs exhibit limited robustness to unseen instructions, generating inconsistent outputs when the same instruction is phrased with slightly varied forms or language styles. This behavior indicates LLMs' lack of robustness to textual variations and generalizability to unseen instructions, potentially leading to trustworthiness issues. Accordingly, we propose Contrastive Instruction Tuning (CoIN), which maximizes the similarity between the hidden representations of semantically equivalent instruction-instance pairs while minimizing the similarity between semantically different ones. To facilitate this approach, we augment the existing FLAN collection by paraphrasing task instructions. Experiments on the PromptBench benchmark show that CoIN…
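The objective described in the abstract can be illustrated with a generic InfoNCE-style contrastive loss. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the use of cosine similarity, the temperature value, and the choice of in-batch negatives are all illustrative, since this page does not specify them. Each anchor is taken to be the hidden representation of an instruction-instance pair, and the positive at the same index is the representation of the same instance under a paraphrased instruction.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def contrastive_loss(anchors, positives, temperature=0.1):
    """Generic InfoNCE loss with in-batch negatives (illustrative only).

    anchors[i] and positives[i] are assumed to represent semantically
    equivalent instruction-instance pairs; positives[j] for j != i act
    as semantically different negatives. The temperature is a
    hypothetical default, not a value taken from the paper.
    """
    total = 0.0
    for i, anchor in enumerate(anchors):
        logits = [cosine(anchor, p) / temperature for p in positives]
        # Numerically stable log-sum-exp over all candidates.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(x - m) for x in logits))
        # Negative log-probability of picking the true paraphrase.
        total += log_denom - logits[i]
    return total / len(anchors)


# Toy 2-D "hidden representations".
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]   # paraphrases close to their anchors
swapped = [[0.1, 0.9], [0.9, 0.1]]   # paraphrases close to the wrong anchor

loss_aligned = contrastive_loss(anchors, aligned)
loss_swapped = contrastive_loss(anchors, swapped)
print(loss_aligned < loss_swapped)
```

In this toy setup the loss is small when each paraphrase lies near its own anchor and large when it lies near a different one, which is the behavior a contrastive objective of this kind rewards.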

Code & Models

Repositories

luka-group/CoIN
PyTorch · Official

Datasets

LorenaYannnnn/contrastive_instruction_tuning
dataset · 18 dl

Videos

Contrastive Instruction Tuning · Underline

Taxonomy

Topics: Education and Technology Integration · Innovative Teaching and Learning Methods