Win-Win Cooperation: Bundling Sequence and Span Models for Named Entity Recognition
Bin Ji, Shasha Li, Jie Yu, Jun Ma, Huijun Liu

TL;DR
This paper introduces Bundling Learning (BL), a method that combines sequence labeling and span-based models for NER, demonstrating consistent performance improvements and error reduction across multiple datasets and models.
Contribution
The paper proposes a novel bundling learning paradigm that effectively integrates sequence and span models for NER, enhancing existing SOTA models and providing insights into when and why BL works.
Findings
BL improves performance of various NER models across datasets
BL reduces boundary and type prediction errors
Incorporating BL into SOTA models yields new state-of-the-art results
Abstract
For Named Entity Recognition (NER), sequence labeling-based and span-based paradigms are quite different. Previous research has demonstrated that the two paradigms have clear complementary advantages, but few models have attempted to leverage these advantages in a single NER model as far as we know. In our previous work, we proposed a paradigm known as Bundling Learning (BL) to address the above problem. The BL paradigm bundles the two NER paradigms, enabling NER models to jointly tune their parameters by weighted summing each paradigm's training loss. However, three critical issues remain unresolved: When does BL work? Why does BL work? Can BL enhance the existing state-of-the-art (SOTA) NER models? To address the first two issues, we implement three NER models, involving a sequence labeling-based model--SeqNER, a span-based NER model--SpanNER, and BL-NER that bundles SeqNER and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
