On the Opportunities and Risks of Foundation Models

Rishi Bommasani; Drew A. Hudson; Ehsan Adeli; Russ Altman; Simran Arora; Sydney von Arx; Michael S. Bernstein; Jeannette Bohg; Antoine Bosselut; Emma Brunskill; Erik Brynjolfsson; Shyamal Buch; Dallas Card; Rodrigo Castellon; Niladri Chatterji; Annie Chen; Kathleen Creel; Jared Quincy Davis; Dora Demszky; Chris Donahue; Moussa Doumbouya; Esin Durmus; Stefano Ermon; John Etchemendy; Kawin Ethayarajh; Li Fei-Fei; Chelsea Finn; Trevor Gale; Lauren Gillespie; Karan Goel; Noah Goodman; Shelby Grossman; Neel Guha; Tatsunori Hashimoto; Peter Henderson; John Hewitt; Daniel E. Ho; Jenny Hong; Kyle Hsu; Jing Huang; Thomas Icard; Saahil Jain; Dan Jurafsky; Pratyusha Kalluri; Siddharth Karamcheti; Geoff Keeling; Fereshte Khani; Omar Khattab; Pang Wei Koh; Mark Krass; Ranjay Krishna; Rohith Kuditipudi; Ananya Kumar; Faisal Ladhak; Mina Lee; Tony Lee; Jure Leskovec; Isabelle Levent; Xiang Lisa Li; Xuechen Li; Tengyu Ma; Ali Malik; Christopher D. Manning; Suvir Mirchandani; Eric Mitchell; Zanele Munyikwa; Suraj Nair; Avanika Narayan; Deepak Narayanan; Ben Newman; Allen Nie; Juan Carlos Niebles; Hamed Nilforoshan; Julian Nyarko; Giray Ogut; Laurel Orr; Isabel Papadimitriou; Joon Sung Park; Chris Piech; Eva Portelance; Christopher Potts; Aditi Raghunathan; Rob Reich; Hongyu Ren; Frieda Rong; Yusuf Roohani; Camilo Ruiz; Jack Ryan; Christopher Ré; Dorsa Sadigh; Shiori Sagawa; Keshav Santhanam; Andy Shih; Krishnan Srinivasan; Alex Tamkin; Rohan Taori; Armin W. Thomas; Florian Tramèr; Rose E. Wang; William Wang; Bohan Wu; Jiajun Wu; Yuhuai Wu; Sang Michael Xie; Michihiro Yasunaga; Jiaxuan You; Matei Zaharia; Michael Zhang; Tianyi Zhang; Xikun Zhang; Yuhui Zhang; Lucia Zheng; Kaitlyn Zhou; Percy Liang

arXiv:2108.07258·cs.LG·July 14, 2022·2.2k cites



TL;DR

This paper discusses the potential benefits and dangers of foundation models like GPT-3 and DALL-E, emphasizing their capabilities, societal impacts, and the need for interdisciplinary research to understand and manage their widespread deployment.

Contribution

It provides a comprehensive overview of foundation models, highlighting their emergent capabilities, societal implications, and the importance of cautious, interdisciplinary study.

Findings

01. Foundation models exhibit emergent capabilities at scale.

02. Homogenization of models can propagate defects downstream.

03. Understanding their workings and failures requires interdisciplinary research.

Abstract

AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their capabilities (e.g., language, vision, robotics, reasoning, human interaction) and technical principles (e.g., model architectures, training procedures, data, systems, security, evaluation, theory) to their applications (e.g., law, healthcare, education) and societal impact (e.g., inequity, misuse, economic and environmental impact, legal and ethical considerations). Though foundation models are based on standard deep learning and transfer learning, their scale results in new emergent…


Code & Models


Videos

Foundation Models | On the opportunities and risks of calling pre-trained models “Foundation Models” · YouTube

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Topic Modeling

Methods: Attention Is All You Need · Linear Layer · WordPiece · Attention Dropout · Residual Connection · Dropout · Adam · Dense Connections · Multi-Head Attention · Softmax