SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning
Joseph Marvin Imperial, Harish Tayyar Madabushi

TL;DR
SpeciaLex is a comprehensive benchmark designed to evaluate large language models' ability to understand and apply specialized lexicons across various tasks, aiding the development of more precise content generation tools.
Contribution
Introduces SpeciaLex, a new benchmark with 18 tasks and 1,785 instances, to assess LLMs' handling of specialized lexicon constraints in content tasks.
Findings
Larger models generally perform better on the benchmark.
Open-source models show competitive performance with closed-source counterparts.
Model recency and setup significantly influence performance.
Abstract
Specialized lexicons are collections of words with associated constraints such as special definitions, specific roles, and intended target audiences. These constraints are necessary for content generation and documentation tasks (e.g., writing technical manuals or children's reading materials), where the goal is to reduce the ambiguity of text content and increase its overall readability for a specific group of audience. Understanding how large language models can capture these constraints can help researchers build better, more impactful tools for wider use beyond the NLP community. Towards this end, we introduce SpeciaLex, a benchmark for evaluating a language model's ability to follow specialized lexicon-based constraints across 18 diverse subtasks with 1,785 test instances covering core tasks of Checking, Identification, Rewriting, and Open Generation. We present an empirical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · linguistics and terminology studies · Lexicography and Language Studies
