Function graph transformers universally approximate operators between function spaces
Takashi Furuya, David Mis, Ivan Dokmani\'c, Maarten V. de Hoop, Matti Lassas

TL;DR
This paper introduces function graph transformers, a measure-theoretic framework that universally approximates nonlinear operators between function spaces, enhancing understanding and capabilities of transformer-based operator learning.
Contribution
It develops a measure-theoretic approach to model operators with transformers, introduces function graph transformers, and proves their universal approximation capabilities.
Findings
Function graph transformers can approximate broad classes of nonlinear operators.
The framework accommodates regularized negative-order Sobolev inputs and multi-domain queries.
Universal approximation is achieved through compositions of softmax self-attention layers and MLPs.
Abstract
We study the approximation of nonlinear operators between function spaces by transformers. Our approach is to lift functions to measures supported on their graphs and leverage a recently introduced measure-theoretic view of transformers. A function is represented by its graph measure , with finite tokens being its empirical approximations. We show that this framework elegantly models discretization refinement via convergence of measures and provides a natural setting for operator learning. Within this framework, we introduce function graph transformers, a graph-preserving subclass of measure-theoretic transformers that maps graph measures to graph measures, which is to say that outputs remain single-valued functions. Crucially, this additional structure does not reduce generality: we prove that the resulting graph-preserving maps can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
