Large Language Models for Simultaneous Named Entity Extraction and   Spelling Correction

Edward Whittaker; Ikuo Kitagishi

arXiv:2403.00528·cs.CL·March 4, 2024·3 cites

Large Language Models for Simultaneous Named Entity Extraction and Spelling Correction

Edward Whittaker, Ikuo Kitagishi

PDF

Open Access

TL;DR

This paper explores using decoder-only Large Language Models to simultaneously extract Named Entities and correct spelling errors in OCR-processed Japanese receipt text, demonstrating comparable performance to BERT-based models.

Contribution

It introduces a novel approach of using generative LLMs for joint NE extraction and spelling correction, extending beyond traditional classification methods.

Findings

01

Best LLM performs as well as BERT models in NE extraction.

02

LLMs can automatically correct some OCR errors.

03

Fine-tuned LLMs show potential for joint NE extraction and spelling correction.

Abstract

Language Models (LMs) such as BERT, have been shown to perform well on the task of identifying Named Entities (NE) in text. A BERT LM is typically used as a classifier to classify individual tokens in the input text, or to classify spans of tokens, as belonging to one of a set of possible NE categories. In this paper, we hypothesise that decoder-only Large Language Models (LLMs) can also be used generatively to extract both the NE, as well as potentially recover the correct surface form of the NE, where any spelling errors that were present in the input text get automatically corrected. We fine-tune two BERT LMs as baselines, as well as eight open-source LLMs, on the task of producing NEs from text that was obtained by applying Optical Character Recognition (OCR) to images of Japanese shop receipts; in this work, we do not attempt to find or evaluate the location of NEs in the text.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · Linear Layer · WordPiece · Layer Normalization · Dropout · Multi-Head Attention · Attention Dropout · Linear Warmup With Linear Decay