From Language to Programs: Bridging Reinforcement Learning and Maximum   Marginal Likelihood

Kelvin Guu; Panupong Pasupat; Evan Zheran Liu; Percy Liang

arXiv:1704.07926·cs.AI·April 27, 2017·41 cites

From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood

Kelvin Guu, Panupong Pasupat, Evan Zheran Liu, Percy Liang

PDF

Open Access 3 Repos

TL;DR

This paper introduces a novel learning algorithm that combines reinforcement learning and maximum marginal likelihood to improve neural semantic parsing from indirect supervision, effectively handling spurious programs and achieving state-of-the-art results.

Contribution

A new algorithm that merges RL and MML for semantic parsing, enhancing search and parameter updating to better handle spurious programs.

Findings

01

Significant performance improvements over existing methods.

02

Effective handling of spurious programs in semantic parsing.

03

Achieved state-of-the-art results on a context-dependent task.

Abstract

Our goal is to learn a semantic parser that maps natural language utterances into executable programs when only indirect supervision is available: examples are labeled with the correct execution result, but not the program itself. Consequently, we must search the space of programs for those that output the correct result, while not being misled by spurious programs: incorrect programs that coincidentally output the correct result. We connect two common learning paradigms, reinforcement learning (RL) and maximum marginal likelihood (MML), and then present a new learning algorithm that combines the strengths of both. The new algorithm guards against spurious programs by combining the systematic search traditionally employed in MML with the randomized exploration of RL, and by updating parameters such that probability is spread more evenly across consistent programs. We apply our learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications