# Generative Annotation for ASR Named Entity Correction

**Authors:** Yuanchang Luo, Daimeng Wei, Shaojun Li, Hengchao Shang, Jiaxin Guo, Zongyao Li, Zhanglin Wu, Xiaoyu Chen, Zhiqiang Rao, Jinlong Yang, Hao Yang

arXiv: 2508.20700 · 2025-10-27

## TL;DR

This paper introduces a novel generative named entity correction method for ASR transcripts that leverages speech sound features to improve correction accuracy, especially when transcribed words differ significantly from ground-truth entities.

## Contribution

The paper proposes a new NEC approach using speech sound features and generative annotation, addressing limitations of existing phonetic-level edit distance methods.

## Key findings

- Significant improvement in entity accuracy demonstrated.
- Effective correction in cases of word form differences.
- Method outperforms existing NEC models on tested datasets.

## Abstract

End-to-end automatic speech recognition systems often fail to transcribe domain-specific named entities, causing catastrophic failures in downstream tasks. Numerous fast and lightweight named entity correction (NEC) models have been proposed in recent years. These models, mainly leveraging phonetic-level edit distance algorithms, have shown impressive performances. However, when the forms of the wrongly-transcribed words(s) and the ground-truth entity are significantly different, these methods often fail to locate the wrongly transcribed words in hypothesis, thus limiting their usage. We propose a novel NEC method that utilizes speech sound features to retrieve candidate entities. With speech sound features and candidate entities, we inovatively design a generative method to annotate entity errors in ASR transcripts and replace the text with correct entities. This method is effective in scenarios of word form difference. We test our method using open-source and self-constructed test sets. The results demonstrate that our NEC method can bring significant improvement to entity accuracy. The self-constructed training data and test set is publicly available at github.com/L6-NLP/Generative-Annotation-NEC.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.20700/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/2508.20700/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/2508.20700/full.md

---
Source: https://tomesphere.com/paper/2508.20700