LaMDA: Language Models for Dialog Applications

Romal Thoppilan; Daniel De Freitas; Jamie Hall; Noam Shazeer; Apoorv; Kulshreshtha; Heng-Tze Cheng; Alicia Jin; Taylor Bos; Leslie Baker; Yu Du,; YaGuang Li; Hongrae Lee; Huaixiu Steven Zheng; Amin Ghafouri; Marcelo; Menegali; Yanping Huang; Maxim Krikun; Dmitry Lepikhin; James Qin; Dehao; Chen; Yuanzhong Xu; Zhifeng Chen; Adam Roberts; Maarten Bosma; Vincent Zhao,; Yanqi Zhou; Chung-Ching Chang; Igor Krivokon; Will Rusch; Marc Pickett,; Pranesh Srinivasan; Laichee Man; Kathleen Meier-Hellstern; Meredith Ringel; Morris; Tulsee Doshi; Renelito Delos Santos; Toju Duke; Johnny Soraker; Ben; Zevenbergen; Vinodkumar Prabhakaran; Mark Diaz; Ben Hutchinson; Kristen; Olson; Alejandra Molina; Erin Hoffman-John; Josh Lee; Lora Aroyo; Ravi; Rajakumar; Alena Butryna; Matthew Lamm; Viktoriya Kuzmina; Joe Fenton; Aaron; Cohen; Rachel Bernstein; Ray Kurzweil; Blaise Aguera-Arcas; Claire Cui,; Marian Croak; Ed Chi; Quoc Le

arXiv:2201.08239·cs.CL·February 11, 2022·705 cites

LaMDA: Language Models for Dialog Applications

Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv, Kulshreshtha, Heng-Tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du,, YaGuang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo, Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin

PDF

Open Access 2 Repos

TL;DR

LaMDA is a large Transformer-based dialog model that improves safety and factual grounding through fine-tuning and external knowledge consultation, demonstrating enhanced performance in dialog applications.

Contribution

Introduces LaMDA, a scalable dialog-specific language model, and shows how fine-tuning and external knowledge integration improve safety and factual accuracy.

Findings

01

Filtering responses with a LaMDA classifier improves safety.

02

Consulting external sources enhances factual grounding.

03

LaMDA performs well in education and content recommendation domains.

Abstract

We present LaMDA: Language Models for Dialog Applications. LaMDA is a family of Transformer-based neural language models specialized for dialog, which have up to 137B parameters and are pre-trained on 1.56T words of public dialog data and web text. While model scaling alone can improve quality, it shows less improvements on safety and factual grounding. We demonstrate that fine-tuning with annotated data and enabling the model to consult external knowledge sources can lead to significant improvements towards the two key challenges of safety and factual grounding. The first challenge, safety, involves ensuring that the model's responses are consistent with a set of human values, such as preventing harmful suggestions and unfair bias. We quantify safety using a metric based on an illustrative set of human values, and we find that filtering candidate responses using a LaMDA classifier…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems