LMN at SemEval-2022 Task 11: A Transformer-based System for English   Named Entity Recognition

Ngoc Minh Lai

arXiv:2203.03546·cs.CL·March 8, 2022·1 cites

LMN at SemEval-2022 Task 11: A Transformer-based System for English Named Entity Recognition

Ngoc Minh Lai

PDF

Open Access

TL;DR

This paper presents a Transformer-based system for English complex named entity recognition, achieving competitive results in SemEval-2022, and explores data augmentation with entity linking which did not improve performance.

Contribution

The paper introduces a simple Transformer-based baseline for complex NER and evaluates data augmentation with entity linking in this context.

Findings

01

Achieved a macro F1 score of 72.50% on the test set.

02

Ranked 12th out of 30 teams in the leaderboard.

03

Data augmentation with entity linking did not improve performance.

Abstract

Processing complex and ambiguous named entities is a challenging research problem, but it has not received sufficient attention from the natural language processing community. In this short paper, we present our participation in the English track of SemEval-2022 Task 11: Multilingual Complex Named Entity Recognition. Inspired by the recent advances in pretrained Transformer language models, we propose a simple yet effective Transformer-based baseline for the task. Despite its simplicity, our proposed approach shows competitive results in the leaderboard as we ranked 12 over 30 teams. Our system achieved a macro F1 score of 72.50% on the held-out test set. We have also explored a data augmentation approach using entity linking. While the approach does not improve the final performance, we also discuss it in this paper.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Label Smoothing · Softmax · Adam · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding