Discriminative Language Model as Semantic Consistency Scorer for   Prompt-based Few-Shot Text Classification

Zhipeng Xie; Yahe Li

arXiv:2210.12763·cs.CL·October 25, 2022

Discriminative Language Model as Semantic Consistency Scorer for Prompt-based Few-Shot Text Classification

Zhipeng Xie, Yahe Li

PDF

Open Access

TL;DR

This paper introduces DLM-SCS, a prompt-based few-shot text classification method that uses ELECTRA as a semantic consistency scorer, effectively distinguishing true labels from false ones without extra parameters.

Contribution

The paper presents a novel approach leveraging ELECTRA's discriminative capabilities to assess semantic consistency in prompts, improving few-shot classification performance.

Findings

01

Outperforms several state-of-the-art prompt-based few-shot methods

02

Utilizes ELECTRA without adding extra parameters

03

Effectively measures semantic consistency for label verification

Abstract

This paper proposes a novel prompt-based finetuning method (called DLM-SCS) for few-shot text classification by utilizing the discriminative language model ELECTRA that is pretrained to distinguish whether a token is original or generated. The underlying idea is that the prompt instantiated with the true label should have higher semantic consistency score than other prompts with false labels. Since a prompt usually consists of several components (or parts), its semantic consistency can be decomposed accordingly. The semantic consistency of each component is then computed by making use of the pretrained ELECTRA model, without introducing extra parameters. Extensive experiments have shown that our model outperforms several state-of-the-art prompt-based few-shot methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Sentiment Analysis and Opinion Mining · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Softmax · Weight Decay · Adam · Attention Dropout · Linear Warmup With Linear Decay