Transformer-based Named Entity Recognition with Combined Data   Representation

Micha{\l} Marci\'nczuk

arXiv:2406.17474·cs.CL·June 26, 2024

Transformer-based Named Entity Recognition with Combined Data Representation

Micha{\l} Marci\'nczuk

PDF

Open Access

TL;DR

This paper explores transformer-based models for named entity recognition, proposing a combined data representation training method that enhances model stability and adaptability across multiple languages and datasets.

Contribution

It introduces a novel combined training procedure utilizing multiple data representation strategies to improve NER performance and robustness.

Findings

01

Combined training improves model stability across data representations

02

Effective across four languages and various datasets

03

Outperforms single-strategy training methods

Abstract

This study examines transformer-based models and their effectiveness in named entity recognition tasks. The study investigates data representation strategies, including single, merged, and context, which respectively use one sentence, multiple sentences, and sentences joined with attention to context per vector. Analysis shows that training models with a single strategy may lead to poor performance on different data representations. To address this limitation, the study proposes a combined training procedure that utilizes all three strategies to improve model stability and adaptability. The results of this approach are presented and discussed for four languages (English, Polish, Czech, and German) across various datasets, demonstrating the effectiveness of the combined strategy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management

MethodsSoftmax · Attention Is All You Need