Metric-Based In-context Learning: A Case Study in Text Simplification

Subha Vadlamannati; G\"ozde G\"ul \c{S}ahin

arXiv:2307.14632·cs.CL·July 28, 2023·1 cites

Metric-Based In-context Learning: A Case Study in Text Simplification

Subha Vadlamannati, G\"ozde G\"ul \c{S}ahin

PDF

Open Access 1 Repo

TL;DR

This paper introduces Metric-Based In-context Learning (MBL) for text simplification, which selects examples based on specific metrics to improve large language model performance and robustness across different models and datasets.

Contribution

The paper proposes a novel metric-based example selection method for ICL, demonstrating its effectiveness and robustness in text simplification tasks across various GPT models and benchmarks.

Findings

01

Top SARI scores yield best results on larger models

02

Compression ratio favors smaller models

03

MBL outperforms strong baselines and finetuned models

Abstract

In-context learning (ICL) for large language models has proven to be a powerful approach for many natural language processing tasks. However, determining the best method to select examples for ICL is nontrivial as the results can vary greatly depending on the quality, quantity, and order of examples used. In this paper, we conduct a case study on text simplification (TS) to investigate how to select the best and most robust examples for ICL. We propose Metric-Based in-context Learning (MBL) method that utilizes commonly used TS metrics such as SARI, compression ratio, and BERT-Precision for selection. Through an extensive set of experiments with various-sized GPT models on standard TS benchmarks such as TurkCorpus and ASSET, we show that examples selected by the top SARI scores perform the best on larger models such as GPT-175B, while the compression ratio generally performs better on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nlp-ku/metric-based-in-context-learning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Topic Modeling · Natural Language Processing Techniques

MethodsAttention Is All You Need · Cosine Annealing · Linear Layer · Linear Warmup With Cosine Annealing · Adam · Discriminative Fine-Tuning · Multi-Head Attention · Attention Dropout · Byte Pair Encoding · Residual Connection