Can Language Models Be Specific? How?

Jie Huang; Kevin Chen-Chuan Chang; Jinjun Xiong; Wen-mei Hwu

arXiv:2210.05159·cs.CL·May 29, 2023

Can Language Models Be Specific? How?

Jie Huang, Kevin Chen-Chuan Chang, Jinjun Xiong, Wen-mei Hwu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a benchmark to measure the specificity of pre-trained language models and proposes prompt-based methods to enhance their specificity without additional training.

Contribution

It presents a novel benchmark for specificity testing and two prompt-based techniques to improve model specificity, highlighting an understudied aspect of language model behavior.

Findings

01

Existing PLMs show only slight preference for more specific answers.

02

Proposed methods improve model specificity without additional training.

03

The work raises awareness of the importance of specificity in language models.

Abstract

"He is a person", "Paris is located on the earth". Both statements are correct but meaningless - due to lack of specificity. In this paper, we propose to measure how specific the language of pre-trained language models (PLMs) is. To achieve this, we introduce a novel approach to build a benchmark for specificity testing by forming masked token prediction tasks with prompts. For instance, given "Toronto is located in [MASK].", we want to test whether a more specific answer will be better filled in by PLMs, e.g., Ontario instead of Canada. From our evaluations, we show that existing PLMs have only a slight preference for more specific answers. We identify underlying factors affecting the specificity and design two prompt-based methods to improve the specificity. Results show that the specificity of the models can be improved by the proposed methods without additional training. We hope…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jeffhj/s-test
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsTest