SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning

Joseph Marvin Imperial; Harish Tayyar Madabushi

arXiv:2407.13297·cs.CL·October 7, 2024·1 cites

SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning

Joseph Marvin Imperial, Harish Tayyar Madabushi

PDF

Open Access 1 Repo 1 Video

TL;DR

SpeciaLex is a comprehensive benchmark designed to evaluate large language models' ability to understand and apply specialized lexicons across various tasks, aiding the development of more precise content generation tools.

Contribution

Introduces SpeciaLex, a new benchmark with 18 tasks and 1,785 instances, to assess LLMs' handling of specialized lexicon constraints in content tasks.

Findings

01

Larger models generally perform better on the benchmark.

02

Open-source models show competitive performance with closed-source counterparts.

03

Model recency and setup significantly influence performance.

Abstract

Specialized lexicons are collections of words with associated constraints such as special definitions, specific roles, and intended target audiences. These constraints are necessary for content generation and documentation tasks (e.g., writing technical manuals or children's reading materials), where the goal is to reduce the ambiguity of text content and increase its overall readability for a specific group of audience. Understanding how large language models can capture these constraints can help researchers build better, more impactful tools for wider use beyond the NLP community. Towards this end, we introduce SpeciaLex, a benchmark for evaluating a language model's ability to follow specialized lexicon-based constraints across 18 diverse subtasks with 1,785 test instances covering core tasks of Checking, Identification, Rewriting, and Open Generation. We present an empirical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

imperialite/specialex
pytorchOfficial

Videos

SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning· underline

Taxonomy

TopicsNatural Language Processing Techniques · linguistics and terminology studies · Lexicography and Language Studies