Metric-Based In-context Learning: A Case Study in Text Simplification
Subha Vadlamannati, G\"ozde G\"ul \c{S}ahin

TL;DR
This paper introduces Metric-Based In-context Learning (MBL) for text simplification, which selects examples based on specific metrics to improve large language model performance and robustness across different models and datasets.
Contribution
The paper proposes a novel metric-based example selection method for ICL, demonstrating its effectiveness and robustness in text simplification tasks across various GPT models and benchmarks.
Findings
Top SARI scores yield best results on larger models
Compression ratio favors smaller models
MBL outperforms strong baselines and finetuned models
Abstract
In-context learning (ICL) for large language models has proven to be a powerful approach for many natural language processing tasks. However, determining the best method to select examples for ICL is nontrivial as the results can vary greatly depending on the quality, quantity, and order of examples used. In this paper, we conduct a case study on text simplification (TS) to investigate how to select the best and most robust examples for ICL. We propose Metric-Based in-context Learning (MBL) method that utilizes commonly used TS metrics such as SARI, compression ratio, and BERT-Precision for selection. Through an extensive set of experiments with various-sized GPT models on standard TS benchmarks such as TurkCorpus and ASSET, we show that examples selected by the top SARI scores perform the best on larger models such as GPT-175B, while the compression ratio generally performs better on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification · Topic Modeling · Natural Language Processing Techniques
MethodsAttention Is All You Need · Cosine Annealing · Linear Layer · Linear Warmup With Cosine Annealing · Adam · Discriminative Fine-Tuning · Multi-Head Attention · Attention Dropout · Byte Pair Encoding · Residual Connection
