KARL-Trans-NER: Knowledge Aware Representation Learning for Named Entity   Recognition using Transformers

Avi Chawla; Nidhi Mulay; Vikas Bishnoi; Gaurav Dhama

arXiv:2111.15436·cs.CL·December 1, 2021·1 cites

KARL-Trans-NER: Knowledge Aware Representation Learning for Named Entity Recognition using Transformers

Avi Chawla, Nidhi Mulay, Vikas Bishnoi, Gaurav Dhama

PDF

Open Access

TL;DR

This paper introduces KARL-Trans-NER, a knowledge-aware transformer-based model that enhances named entity recognition by integrating large-scale knowledge bases, leading to improved accuracy and generalization on multiple datasets.

Contribution

The paper proposes a novel Knowledge Aware Representation Learning (KARL) network that incorporates world knowledge into transformer models for NER, addressing existing challenges and boosting performance.

Findings

01

Significantly improved NER results on CoNLL 2003, CoNLL++, and OntoNotes v5 datasets.

02

Enhanced generalization to unseen entities in real-world scenarios.

03

Effective integration of knowledge bases as fact triplets for feature augmentation.

Abstract

The inception of modeling contextual information using models such as BERT, ELMo, and Flair has significantly improved representation learning for words. It has also given SOTA results in almost every NLP task - Machine Translation, Text Summarization and Named Entity Recognition, to name a few. In this work, in addition to using these dominant context-aware representations, we propose a Knowledge Aware Representation Learning (KARL) Network for Named Entity Recognition (NER). We discuss the challenges of using existing methods in incorporating world knowledge for NER and show how our proposed methods could be leveraged to overcome those challenges. KARL is based on a Transformer Encoder that utilizes large knowledge bases represented as fact triplets, converts them to a graph context, and extracts essential entity information residing inside to generate contextualized triplet…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Linear Warmup With Linear Decay · Sigmoid Activation · Layer Normalization · WordPiece · Byte Pair Encoding · Label Smoothing · Dense Connections