GNN-SL: Sequence Labeling Based on Nearest Examples via GNN
Shuhe Wang, Yuxian Meng, Rongbin Ouyang, Jiwei Li, Tianwei Zhang,, Lingjuan Lyu, Guoyin Wang

TL;DR
This paper introduces GNN-SL, a novel sequence labeling method that uses graph neural networks to incorporate similar training examples, significantly improving performance on tasks like NER, POS tagging, and Chinese Word Segmentation.
Contribution
The paper proposes a new GNN-based approach for sequence labeling that effectively leverages retrieved similar examples to enhance prediction accuracy, especially for long-tail cases.
Findings
Achieved state-of-the-art results on multiple sequence labeling tasks.
Significantly improved performance on Chinese Word Segmentation.
Demonstrated effectiveness across NER and POS datasets.
Abstract
To better handle long-tail cases in the sequence labeling (SL) task, in this work, we introduce graph neural networks sequence labeling (GNN-SL), which augments the vanilla SL model output with similar tagging examples retrieved from the whole training set. Since not all the retrieved tagging examples benefit the model prediction, we construct a heterogeneous graph, and leverage graph neural networks (GNNs) to transfer information between the retrieved tagging examples and the input word sequence. The augmented node which aggregates information from neighbors is used to do prediction. This strategy enables the model to directly acquire similar tagging examples and improves the general quality of predictions. We conduct a variety of experiments on three typical sequence labeling tasks: Named Entity Recognition (NER), Part of Speech Tagging (POS), and Chinese Word Segmentation (CWS) to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
