LaMDA: Language Models for Dialog Applications

Aaron Daniel Cohen; Adam Roberts; Alejandra Molina; Alena Butryna; Alicia Jin; Apoorv Kulshreshtha; Ben Hutchinson; Ben Zevenbergen; Blaise Hilary Aguera-Arcas; Chung-ching Chang; Claire Cui; Cosmo Du; Daniel De Freitas Adiwardana; Dehao Chen; Dmitry (Dima) Lepikhin; Ed H. Chi; Erin Hoffman-John; Heng-Tze Cheng; Hongrae Lee; Igor Krivokon; James Qin; Jamie Hall; Joe Fenton; Johnny Soraker; Kathy Meier-Hellstern; Kristen Olson; Lora Mois Aroyo; Maarten Paul Bosma; Marc Joseph Pickett; Marcelo Amorim Menegali; Marian Croak; Mark Díaz; Matthew Lamm; Maxim Krikun; Meredith Ringel Morris; Noam Shazeer; Quoc V. Le; Rachel Bernstein; Ravi Rajakumar; Ray Kurzweil; Romal Thoppilan; Steven Zheng; Taylor Bos; Toju Duke; Tulsee Doshi; Vincent Y. Zhao; Vinodkumar Prabhakaran; Will Rusch; YaGuang Li; Yanping Huang; Yanqi Zhou; Yuanzhong Xu; Zhifeng Chen

LaMDA: Language Models for Dialog Applications

Aaron Daniel Cohen

Adam Roberts

Alejandra Molina

Alena Butryna

Alicia Jin

Apoorv Kulshreshtha

Ben Hutchinson

Ben Zevenbergen

Blaise Hilary Aguera-Arcas

Chung-ching Chang

Claire Cui

Cosmo Du

Daniel De Freitas Adiwardana

Dehao Chen

Dmitry (Dima) Lepikhin

Ed H. Chi

Erin Hoffman-John

Heng-Tze Cheng

Hongrae Lee

Igor Krivokon

James Qin

Jamie Hall

Joe Fenton

Johnny Soraker

Kathy Meier-Hellstern

Kristen Olson

Lora Mois Aroyo

Maarten Paul Bosma

Marc Joseph Pickett

Marcelo Amorim Menegali

Marian Croak

Mark Díaz

Matthew Lamm

Maxim Krikun

Meredith Ringel Morris

Noam Shazeer

Quoc V. Le

Rachel Bernstein

Ravi Rajakumar

Ray Kurzweil

Romal Thoppilan

Steven Zheng

Taylor Bos

Toju Duke

Tulsee Doshi

Vincent Y. Zhao

Vinodkumar Prabhakaran

Will Rusch

YaGuang Li

Yanping Huang

Yanqi Zhou

Yuanzhong Xu

Zhifeng Chen

arXiv (2022)

Google Scholar

Abstract

We present LaMDA: Language Models for Dialog Applications. LaMDA is a family of Transformer-based neural language models specialized for dialog, which have up to 137B parameters and are pre-trained on 1.56T words of public dialog data and web text. While model scaling alone can improve quality, it shows less improvements on safety and factual grounding. We demonstrate that fine-tuning with annotated data and enabling the model to consult external knowledge sources can lead to significant improvements towards the two key challenges of safety and factual grounding.The first challenge, safety, involves ensuring that the model’s responses are consistent with a set of human values, such as preventing harmful suggestions and unfair bias. We quantify safety using a metric based on an illustrative set of values, and we find that filtering candidate responses using aLaMDA classifier fine-tuned with a small amount of crowdworker-annotated data offers a promising approach to improving model safety. The second challenge, factual grounding, involves enabling the model to consult external knowledge sources, such as an information retrieval system, a language translator, and a calculator. We quantify factuality using a groundedness metric, and we find that our approach enables the model to generate responses grounded in known sources, rather than responses that merely sound plausible. Finally, we explore the use of LaMDA in the domains of education and content recommendations, and analyze their helpfulness and role consistency.

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

LaMDA: Language Models for Dialog Applications

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs