Large Language Models for Simultaneous Named Entity Extraction and Spelling Correction
Edward Whittaker, Ikuo Kitagishi

TL;DR
This paper explores using decoder-only Large Language Models to simultaneously extract Named Entities and correct spelling errors in OCR-processed Japanese receipt text, demonstrating comparable performance to BERT-based models.
Contribution
It introduces a novel approach of using generative LLMs for joint NE extraction and spelling correction, extending beyond traditional classification methods.
Findings
Best LLM performs as well as BERT models in NE extraction.
LLMs can automatically correct some OCR errors.
Fine-tuned LLMs show potential for joint NE extraction and spelling correction.
Abstract
Language Models (LMs) such as BERT, have been shown to perform well on the task of identifying Named Entities (NE) in text. A BERT LM is typically used as a classifier to classify individual tokens in the input text, or to classify spans of tokens, as belonging to one of a set of possible NE categories. In this paper, we hypothesise that decoder-only Large Language Models (LLMs) can also be used generatively to extract both the NE, as well as potentially recover the correct surface form of the NE, where any spelling errors that were present in the input text get automatically corrected. We fine-tune two BERT LMs as baselines, as well as eight open-source LLMs, on the task of producing NEs from text that was obtained by applying Optical Character Recognition (OCR) to images of Japanese shop receipts; in this work, we do not attempt to find or evaluate the location of NEs in the text.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · Linear Layer · WordPiece · Layer Normalization · Dropout · Multi-Head Attention · Attention Dropout · Linear Warmup With Linear Decay
