Multilinguals at SemEval-2022 Task 11: Transformer Based Architecture for Complex NER
Amit Pandey, Swayatta Daw, Vikram Pudi

TL;DR
This paper explores the use of transformer-based models, specifically BERT, for complex named entity recognition in English, achieving significant improvements over baseline methods.
Contribution
It introduces a transformer-based architecture for complex NER and demonstrates its effectiveness through qualitative analysis and performance gains.
Findings
Best model beats baseline F1-score by over 9%
All models outperform baseline significantly
Qualitative analysis of multiple architectures
Abstract
We investigate the task of complex NER for the English language. The task is non-trivial due to the semantic ambiguity of the textual structure and the rarity of occurrence of such entities in the prevalent literature. Using pre-trained language models such as BERT, we obtain a competitive performance on this task. We qualitatively analyze the performance of multiple architectures for this task. All our models are able to outperform the baseline by a significant margin. Our best performing model beats the baseline F1-score by over 9%.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Computational Physics and Python Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dense Connections · Attention Dropout · Multi-Head Attention · Residual Connection · WordPiece · Adam · Dropout
