On the Opportunities and Risks of Foundation Models
Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran, Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine, Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card,, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel

TL;DR
This paper discusses the potential benefits and dangers of foundation models like GPT-3 and DALL-E, emphasizing their capabilities, societal impacts, and the need for interdisciplinary research to understand and manage their widespread deployment.
Contribution
It provides a comprehensive overview of foundation models, highlighting their emergent capabilities, societal implications, and the importance of cautious, interdisciplinary study.
Findings
Foundation models exhibit emergent capabilities at scale.
Homogenization of models can propagate defects downstream.
Understanding their workings and failures requires interdisciplinary research.
Abstract
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their capabilities (e.g., language, vision, robotics, reasoning, human interaction) and technical principles(e.g., model architectures, training procedures, data, systems, security, evaluation, theory) to their applications (e.g., law, healthcare, education) and societal impact (e.g., inequity, misuse, economic and environmental impact, legal and ethical considerations). Though foundation models are based on standard deep learning and transfer learning, their scale results in new emergent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Foundation Models | On the opportunities and risks of calling pre-trained models “Foundation Models”· youtube
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Topic Modeling
MethodsAttention Is All You Need · Linear Layer · WordPiece · Attention Dropout · Residual Connection · Dropout · Adam · Dense Connections · Multi-Head Attention · Softmax
