Unifying Token and Span Level Supervisions for Few-Shot Sequence Labeling
Zifeng Cheng, Qingyu Zhou, Zhiwei Jiang, Xuemin Zhao, Yunbo Cao, Qing, Gu

TL;DR
This paper introduces a unified model that combines token and span supervision for few-shot sequence labeling, improving performance by jointly training at multiple granularities and aligning their outputs.
Contribution
The novel CDAP network unifies token and span level supervision with a consistent loss and greedy inference, advancing few-shot sequence labeling methods.
Findings
Achieves state-of-the-art results on three benchmarks.
Effectively unifies token and span supervision.
Demonstrates improved accuracy over single-granularity models.
Abstract
Few-shot sequence labeling aims to identify novel classes based on only a few labeled samples. Existing methods solve the data scarcity problem mainly by designing token-level or span-level labeling models based on metric learning. However, these methods are only trained at a single granularity (i.e., either token level or span level) and have some weaknesses of the corresponding granularity. In this paper, we first unify token and span level supervisions and propose a Consistent Dual Adaptive Prototypical (CDAP) network for few-shot sequence labeling. CDAP contains the token-level and span-level networks, jointly trained at different granularities. To align the outputs of two networks, we further propose a consistent loss to enable them to learn from each other. During the inference phase, we propose a consistent greedy inference algorithm that first adjusts the predicted probability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Video Analysis and Summarization · AI in cancer detection
MethodsALIGN
