TIER-A: Denoising Learning Framework for Information Extraction

Yongkang Li; Ming Zhang

arXiv:2211.11527·cs.CL·November 22, 2022·1 cites

TIER-A: Denoising Learning Framework for Information Extraction

Yongkang Li, Ming Zhang

PDF

Open Access

TL;DR

This paper introduces TIER-A, a co-regularization framework that uses temperature calibration and entropy regularization to reduce overfitting in neural information extraction models trained on noisy datasets.

Contribution

The paper proposes a novel joint-training framework leveraging information entropy regularization to combat overconfidence and overfitting in neural models for information extraction.

Findings

01

Effective in reducing overfitting on noisy datasets

02

Improves extraction accuracy on TACRED and CoNLL03

03

Validates the entropy-based overfitting hypothesis

Abstract

With the development of deep neural language models, great progress has been made in information extraction recently. However, deep learning models often overfit on noisy data points, leading to poor performance. In this work, we examine the role of information entropy in the overfitting process and draw a key insight that overfitting is a process of overconfidence and entropy decreasing. Motivated by such properties, we propose a simple yet effective co-regularization joint-training framework TIER-A, Aggregation Joint-training Framework with Temperature Calibration and Information Entropy Regularization. Our framework consists of several neural models with identical structures. These models are jointly trained and we avoid overfitting by introducing temperature and information entropy regularization. Extensive experiments on two widely-used but noisy datasets, TACRED and CoNLL03,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech Recognition and Synthesis · Music and Audio Processing

MethodsEntropy Regularization