Towards artificial general intelligence via a multimodal foundation   model

Nanyi Fei; Zhiwu Lu; Yizhao Gao; Guoxing Yang; Yuqi Huo; Jingyuan Wen,; Haoyu Lu; Ruihua Song; Xin Gao; Tao Xiang; Hao Sun; Ji-Rong Wen

arXiv:2110.14378·cs.AI·June 9, 2022

Towards artificial general intelligence via a multimodal foundation model

Nanyi Fei, Zhiwu Lu, Yizhao Gao, Guoxing Yang, Yuqi Huo, Jingyuan Wen,, Haoyu Lu, Ruihua Song, Xin Gao, Tao Xiang, Hao Sun, Ji-Rong Wen

PDF

1 Repo

TL;DR

This paper introduces a multimodal foundation model trained on large-scale internet data, demonstrating promising results across diverse tasks and exhibiting strong imagination capabilities, marking progress towards artificial general intelligence.

Contribution

It presents a novel multimodal foundation model trained with self-supervised learning on weakly correlated data, advancing towards AGI with interpretability and imagination abilities.

Findings

01

Effective across various downstream tasks

02

Exhibits strong imagination capabilities

03

Progresses towards generalized AI

Abstract

The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of human. Despite tremendous success in the AI research, most of existing methods have only single-cognitive ability. To overcome this limitation and take a solid step towards artificial general intelligence (AGI), we develop a foundation model pre-trained with huge multimodal data, which can be quickly adapted for various downstream cognitive tasks. To achieve this goal, we propose to pre-train our foundation model by self-supervised learning with weak semantic correlation data crawled from the Internet and show that promising results can be obtained on a wide range of downstream tasks. Particularly, with the developed model-interpretability tools, we demonstrate that strong imagination ability is now possessed by our foundation model. We believe that our work makes a transformative stride…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

neilfei/brivl-nmi
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.