Prompt-Based Metric Learning for Few-Shot NER

Yanru Chen; Yanan Zheng; Zhilin Yang

arXiv:2211.04337·cs.CL·November 9, 2022

Prompt-Based Metric Learning for Few-Shot NER

Yanru Chen, Yanan Zheng, Zhilin Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a prompt-based metric learning approach for few-shot NER that enhances label semantics and combines multiple representations, achieving state-of-the-art results across various settings.

Contribution

It proposes a novel prompt schema and architecture to improve metric learning for few-shot NER, significantly outperforming previous methods.

Findings

01

Achieves state-of-the-art results in 16 out of 18 settings.

02

Substantially outperforms previous SOTA with up to 34.51% relative F1 gain.

03

Demonstrates effectiveness of prompt schemas in few-shot NER.

Abstract

Few-shot named entity recognition (NER) targets generalizing to unseen labels and/or domains with few labeled examples. Existing metric learning methods compute token-level similarities between query and support sets, but are not able to fully incorporate label semantics into modeling. To address this issue, we propose a simple method to largely improve metric learning for NER: 1) multiple prompt schemas are designed to enhance label semantics; 2) we propose a novel architecture to effectively combine multiple prompt-based representations. Empirically, our method achieves new state-of-the-art (SOTA) results under 16 of the 18 considered settings, substantially outperforming the previous SOTA by an average of 8.84% and a maximum of 34.51% in relative gains of micro F1. Our code is available at https://github.com/AChen-qaq/ProML.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

achen-qaq/proml
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Natural Language Processing Techniques