Template-free Prompt Tuning for Few-shot NER

Ruotian Ma; Xin Zhou; Tao Gui; Yiding Tan; Linyang Li; Qi Zhang,; Xuanjing Huang

arXiv:2109.13532·cs.CL·November 24, 2022·27 cites

Template-free Prompt Tuning for Few-shot NER

Ruotian Ma, Xin Zhou, Tao Gui, Yiding Tan, Linyang Li, Qi Zhang,, Xuanjing Huang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a template-free prompt tuning approach for few-shot Named Entity Recognition (NER) that reformulates the task as a language modeling problem, eliminating template design and significantly speeding up decoding.

Contribution

The work proposes a novel template-free method for few-shot NER that simplifies the process and improves speed by predicting label words directly without templates.

Findings

01

Outperforms template-based methods in few-shot NER accuracy

02

Decoding speed is up to 1930 times faster than template-based approaches

03

Effective automatic search for appropriate label words enhances model adaptation

Abstract

Prompt-based methods have been successfully applied in sentence-level few-shot learning tasks, mostly owing to the sophisticated design of templates and label words. However, when applied to token-level labeling tasks such as NER, it would be time-consuming to enumerate the template queries over all potential entity spans. In this work, we propose a more elegant method to reformulate NER tasks as LM problems without any templates. Specifically, we discard the template construction process while maintaining the word prediction paradigm of pre-training models to predict a class-related pivot word (or label word) at the entity position. Meanwhile, we also explore principled ways to automatically search for appropriate label words that the pre-trained models can easily adapt to. While avoiding complicated template-based process, the proposed LM objective also reduces the gap between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rtmaww/EntLM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications