On the Power of Foundation Models

Yang Yuan

arXiv:2211.16327·cs.AI·April 27, 2026·5 cites

On the Power of Foundation Models

Yang Yuan

PDF

1 Video

TL;DR

This paper uses category theory to analyze the theoretical capabilities of infinite foundation models, revealing limits of prompt-based learning and the potential of fine tuning for generalization.

Contribution

It introduces a categorical framework to understand foundation models, proving limits of prompt-based learning and showing fine tuning can solve tasks with sufficient resources.

Findings

01

Prompt-based learning is limited to representable tasks.

02

Fine tuning can theoretically solve any task within the model’s capacity.

03

Foundation models can generalize to unseen objects using structural information.

Abstract

With infinitely many high-quality data points, infinite computational power, an infinitely large foundation model with a perfect training algorithm and guaranteed zero generalization error on the pretext task, can the model be used for everything? This question cannot be answered by the existing theory of representation, optimization or generalization, because the issues they mainly investigate are assumed to be nonexistent here. In this paper, we show that category theory provides powerful machinery to answer this question. We have proved three results. The first one limits the power of prompt-based learning, saying that the model can solve a downstream task with prompts if and only if the task is representable. The second one says fine tuning does not have this limit, as a foundation model with the minimum required power (up to symmetry) can theoretically solve downstream tasks for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

On the Power of Foundation Models· slideslive