DEXTER: Deep Encoding of External Knowledge for Named Entity Recognition   in Virtual Assistants

Deepak Muralidharan; Joel Ruben Antony Moniz; Weicheng Zhang; Stephen; Pulman; Lin Li; Megan Barnes; Jingjing Pan; Jason Williams; Alex Acero

arXiv:2108.06633·cs.CL·October 30, 2023

DEXTER: Deep Encoding of External Knowledge for Named Entity Recognition in Virtual Assistants

Deepak Muralidharan, Joel Ruben Antony Moniz, Weicheng Zhang, Stephen, Pulman, Lin Li, Megan Barnes, Jingjing Pan, Jason Williams, Alex Acero

PDF

TL;DR

This paper introduces DEXTER, a deep encoding method for external knowledge to improve named entity recognition in noisy voice assistant inputs, achieving significant error reduction and aiding related semantic parsing tasks.

Contribution

The paper presents a novel NER system that incorporates external knowledge and indirect labeling techniques, specifically designed for noisy voice assistant data, outperforming baseline models.

Findings

01

6% reduction in NER error rate

02

Up to 5% improvement in semantic parsing error rate

03

Effective handling of noisy, user-derived data

Abstract

Named entity recognition (NER) is usually developed and tested on text from well-written sources. However, in intelligent voice assistants, where NER is an important component, input to NER may be noisy because of user or speech recognition error. In applications, entity labels may change frequently, and non-textual properties like topicality or popularity may be needed to choose among alternatives. We describe a NER system intended to address these problems. We test and train this system on a proprietary user-derived dataset. We compare with a baseline text-only NER system; the baseline enhanced with external gazetteers; and the baseline enhanced with the search and indirect labelling techniques we describe below. The final configuration gives around 6% reduction in NER error rate. We also show that this technique improves related tasks, such as semantic parsing, with an improvement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.