Learning with Latent Language

Jacob Andreas; Dan Klein; Sergey Levine

arXiv:1711.00482·cs.CL·November 3, 2017

TL;DR

This paper shows that treating the space of natural language strings as a parameter space, with language used only during pretraining to impose structure, improves the generality and efficiency of learned classifiers and control policies.

Contribution

The paper introduces a method that pretrains a language interpretation model and then learns each new concept by searching over natural language descriptions, so no language data is needed when learning the new concept itself.

Findings

01 Models with linguistic parameterization outperform non-linguistic models.

02 Pretraining with language structures benefits image classification, text editing, and reinforcement learning.

03 Language-based parameter space improves generalization and learning speed.

Abstract

The named concepts and compositional operators present in natural language provide a rich source of information about the kinds of abstractions humans use to navigate the world. Can this linguistic background knowledge improve the generality and efficiency of learned classifiers and control policies? This paper aims to show that using the space of natural language strings as a parameter space is an effective way to capture natural task structure. In a pretraining phase, we learn a language interpretation model that transforms inputs (e.g. images) into outputs (e.g. labels) given natural language descriptions. To learn a new concept (e.g. a classifier), we search directly in the space of descriptions to minimize the interpreter's loss on training examples. Crucially, our models do not require language data to learn these concepts: language is used only in pretraining to impose structure…
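
The search step described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the lookup-table `INTERPRETER`, the four candidate descriptions, and the helper names `interpret`, `loss`, and `learn_concept` are all hypothetical stand-ins, and exhaustive search over a fixed candidate list is an assumption made to keep the example small. The only part taken from the abstract is the idea of choosing the description that minimizes the interpreter's loss on training examples.

```python
from typing import Callable, Dict, Sequence, Tuple

Example = Tuple[int, bool]  # (input, label)

# Hypothetical stand-in for a pretrained interpreter. In the paper this is a
# learned model; here it is a fixed lookup table so the sketch runs on its own.
INTERPRETER: Dict[str, Callable[[int], bool]] = {
    "is even": lambda x: x % 2 == 0,
    "is odd": lambda x: x % 2 == 1,
    "is greater than five": lambda x: x > 5,
    "is less than three": lambda x: x < 3,
}


def interpret(description: str, x: int) -> bool:
    """Predict a label for input x under the given description."""
    return INTERPRETER[description](x)


def loss(description: str, examples: Sequence[Example]) -> float:
    """Fraction of training examples the description labels incorrectly."""
    errors = sum(interpret(description, x) != y for x, y in examples)
    return errors / len(examples)


def learn_concept(candidates: Sequence[str], examples: Sequence[Example]) -> str:
    """Return the candidate description with the lowest interpreter loss.

    Assumption: candidates are enumerated exhaustively. No language
    annotations are used here, only (input, label) pairs.
    """
    return min(candidates, key=lambda d: loss(d, examples))


# A new concept is specified purely by labeled examples.
train = [(2, True), (4, True), (7, False), (9, False)]
best = learn_concept(list(INTERPRETER), train)
print(best, loss(best, train))
```

The learned "parameter" is the string `best` itself, so predicting on a new input only requires calling `interpret(best, x)` with the frozen interpreter.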

Code & Models

Repositories

jacobandreas/l3 · tf · Official
