Improving Downstream Task Performance by Treating Numbers as Entities

Dhanasekar Sundararaman; Vivek Subramanian; Guoyin Wang; Liyan Xu,; Lawrence Carin

arXiv:2205.03559·cs.CL·September 20, 2022

Improving Downstream Task Performance by Treating Numbers as Entities

Dhanasekar Sundararaman, Vivek Subramanian, Guoyin Wang, Liyan Xu,, Lawrence Carin

PDF

Open Access

TL;DR

This paper proposes classifying numbers as entities in NLP models to enhance performance on tasks like fill-in-the-blank and question answering, leveraging the models' inherent numeracy.

Contribution

It introduces a novel classification of numbers as entities, improving NLP task performance beyond standard BERT and RoBERTa baselines.

Findings

01

Improved accuracy on fill-in-the-blank tasks

02

Enhanced question answering performance

03

Outperforms baseline models

Abstract

Numbers are essential components of text, like any other word tokens, from which natural language processing (NLP) models are built and deployed. Though numbers are typically not accounted for distinctly in most NLP tasks, there is still an underlying amount of numeracy already exhibited by NLP models. In this work, we attempt to tap this potential of state-of-the-art NLP models and transfer their ability to boost performance in related tasks. Our proposed classification of numbers into entities helps NLP models perform well on several tasks, including a handcrafted Fill-In-The-Blank (FITB) task and on question answering using joint embeddings, outperforming the BERT and RoBERTa baseline classification.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · Layer Normalization · Linear Warmup With Linear Decay · Residual Connection · Attention Dropout · Weight Decay