OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models
Anas Awadalla, Irena Gao, Josh Gardner, Jack Hessel, Yusuf, Hanafy, Wanrong Zhu, Kalyani Marathe, Yonatan Bitton, Samir Gadre, and Shiori Sagawa, Jenia Jitsev, Simon Kornblith, Pang Wei Koh and, Gabriel Ilharco, Mitchell Wortsman, Ludwig Schmidt

TL;DR
OpenFlamingo is an open-source framework that replicates large autoregressive vision-language models, achieving comparable performance to DeepMind's Flamingo on multiple datasets, and provides accessible models and code for the community.
Contribution
It introduces an open-source family of large vision-language models that closely replicate Flamingo's performance, facilitating broader research and development.
Findings
Achieves 80-89% of Flamingo's performance on seven datasets.
Provides open-source models and training code.
Demonstrates effective replication of proprietary models.
Abstract
We introduce OpenFlamingo, a family of autoregressive vision-language models ranging from 3B to 9B parameters. OpenFlamingo is an ongoing effort to produce an open-source replication of DeepMind's Flamingo models. On seven vision-language datasets, OpenFlamingo models average between 80 - 89% of corresponding Flamingo performance. This technical report describes our models, training data, hyperparameters, and evaluation suite. We share our models and code at https://github.com/mlfoundations/open_flamingo.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗openflamingo/OpenFlamingo-3B-vitl-mpt1b-langinstructmodel· ♡ 5♡ 5
- 🤗openflamingo/OpenFlamingo-3B-vitl-mpt1bmodel· ♡ 13♡ 13
- 🤗openflamingo/OpenFlamingo-4B-vitl-rpj3bmodel· ♡ 5♡ 5
- 🤗openflamingo/OpenFlamingo-4B-vitl-rpj3b-langinstructmodel· ♡ 2♡ 2
- 🤗openflamingo/OpenFlamingo-9B-vitl-mpt7bmodel· ♡ 45♡ 45
- 🤗sugiv/Spoonbill-Llama2OtterFlamingoAreFriends-7B-Chatmodel
- 🤗matthieufp/multilingual_open_flamingomodel· ♡ 3♡ 3
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques
