Context-NER : Contextual Phrase Generation at Scale
Himanshu Gupta, Shreyas Verma, Santosh Mashetty, Swaroop Mishra

TL;DR
This paper introduces CONTEXT-NER, a new task focused on generating relevant entity contexts in financial texts, supported by the EDGAR10-Q dataset, and demonstrates baseline models achieving promising results and surpassing larger models on downstream tasks.
Contribution
The paper presents the first dataset and baseline models for the novel task of context generation in NER, specifically in financial documents, and shows how pre-finetuning improves downstream performance.
Findings
Baseline models achieve ROUGE-L scores up to 49%.
Pre-finetuned T5-large outperforms vanilla models by 10.81 points on finance tasks.
Small pre-finetuned models surpass larger finance-specific LLMs like BloombergGPT-50B.
Abstract
Named Entity Recognition (NER) has seen significant progress in recent years, with numerous state-of-the-art (SOTA) models achieving high performance. However, very few studies have focused on the generation of entities' context. In this paper, we introduce CONTEXT-NER, a task that aims to generate the relevant context for entities in a sentence, where the context is a phrase describing the entity but not necessarily present in the sentence. To facilitate research in this task, we also present the EDGAR10-Q dataset, which consists of annual and quarterly reports from the top 1500 publicly traded companies. The dataset is the largest of its kind, containing 1M sentences, 2.8M entities, and an average of 35 tokens per sentence, making it a challenging dataset. We propose a baseline approach that combines a phrase generation algorithm with inferencing using a 220M language model, achieving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques
MethodsTest
