ICL Markup: Structuring In-Context Learning using Soft-Token Tags

Marc-Etienne Brunet; Ashton Anderson; Richard Zemel

arXiv:2312.07405·cs.CL·December 13, 2023·1 cites

ICL Markup: Structuring In-Context Learning using Soft-Token Tags

Marc-Etienne Brunet, Ashton Anderson, Richard Zemel

PDF

Open Access

TL;DR

This paper introduces ICL Markup, a method using soft-token tags to structure in-context learning prompts, reducing arbitrary choices and improving performance on various tasks through a meta-learning approach.

Contribution

It proposes a novel soft-token tagging method for ICL prompt design, learned via parameter-efficient fine-tuning, enhancing adaptability without extra training on new tasks.

Findings

01

Improved LLM performance on intent detection and text classification tasks.

02

Reduces arbitrary prompt design decisions in ICL.

03

Promising results in enterprise applications.

Abstract

Large pretrained language models (LLMs) can be rapidly adapted to a wide variety of tasks via a text-to-text approach, where the instruction and input are fed to the model in natural language. Combined with in-context learning (ICL), this paradigm is impressively flexible and powerful. However, it also burdens users with an overwhelming number of choices, many of them arbitrary. Inspired by markup languages like HTML, we contribute a method of using soft-token tags to compose prompt templates. This approach reduces arbitrary decisions and streamlines the application of ICL. Our method is a form of meta-learning for ICL; it learns these tags in advance during a parameter-efficient fine-tuning ``warm-up'' process. The tags can subsequently be used in templates for ICL on new, unseen tasks without any additional fine-tuning. Our experiments with this approach yield promising initial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Data Quality and Management · Machine Learning and Data Classification