Overcoming Conflicting Data when Updating a Neural Semantic Parser

David Gaddy; Alex Kouzemtchenko; Pavankumar Reddy Muddireddy; Prateek; Kolhar; and Rushin Shah

arXiv:2010.12675·cs.CL·December 13, 2021

Overcoming Conflicting Data when Updating a Neural Semantic Parser

David Gaddy, Alex Kouzemtchenko, Pavankumar Reddy Muddireddy, Prateek, Kolhar, and Rushin Shah

PDF

Open Access 1 Repo

TL;DR

This paper investigates updating neural semantic parsers with minimal new data amidst conflicting old data, proposing methods that significantly improve update accuracy and address the challenge of outdated labels.

Contribution

It introduces an experimental setup for updating semantic parsers with conflicting data and proposes multi-task and data selection methods to mitigate their negative effects.

Findings

01

Conflicting data significantly impairs learning updates.

02

Proposed methods improve accuracy over naive data mixing.

03

Best method closes 86% of the accuracy gap to an oracle.

Abstract

In this paper, we explore how to use a small amount of new data to update a task-oriented semantic parsing model when the desired output for some examples has changed. When making updates in this way, one potential problem that arises is the presence of conflicting data, or out-of-date labels in the original training set. To evaluate the impact of this understudied problem, we propose an experimental setup for simulating changes to a neural semantic parser. We show that the presence of conflicting data greatly hinders learning of an update, then explore several methods to mitigate its effect. Our multi-task and data selection methods lead to large improvements in model accuracy compared to a naive data-mixing strategy, and our best method closes 86% of the accuracy gap between this baseline and an oracle upper bound.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google/overcoming-conflicting-data
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications