MINER: Improving Out-of-Vocabulary Named Entity Recognition from an   Information Theoretic Perspective

Xiao Wang; Shihan Dou; Limao Xiong; Yicheng Zou; Qi Zhang; Tao Gui,; Liang Qiao; Zhanzhan Cheng; Xuanjing Huang

arXiv:2204.04391·cs.CL·May 4, 2022·1 cites

MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective

Xiao Wang, Shihan Dou, Limao Xiong, Yicheng Zou, Qi Zhang, Tao Gui,, Liang Qiao, Zhanzhan Cheng, Xuanjing Huang

PDF

Open Access 2 Repos

TL;DR

MINER is a new NER framework that improves out-of-vocabulary entity recognition by using mutual information objectives to enhance context understanding and reduce reliance on memorized entity names.

Contribution

It introduces an information-theoretic approach with mutual information objectives to improve OOV NER performance, addressing over-reliance on entity mention memorization.

Findings

01

Better OOV entity recognition performance across datasets

02

Effective use of mutual information for NER training

03

Reduces bias from memorized entity names

Abstract

NER model has achieved promising performance on standard NER benchmarks. However, recent studies show that previous approaches may over-rely on entity mention information, resulting in poor performance on out-of-vocabulary (OOV) entity recognition. In this work, we propose MINER, a novel NER learning framework, to remedy this issue from an information-theoretic perspective. The proposed approach contains two mutual information-based training objectives: i) generalizing information maximization, which enhances representation via deep understanding of context and entity surface forms; ii) superfluous information minimization, which discourages representation from rote memorizing entity names or exploiting biased cues in data. Experiments on various settings and datasets demonstrate that it achieves better performance in predicting OOV entities.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management