Function graph transformers universally approximate operators between function spaces

Takashi Furuya; David Mis; Ivan Dokmani\'c; Maarten V. de Hoop; Matti Lassas

arXiv:2605.17968·cs.LG·May 19, 2026

Function graph transformers universally approximate operators between function spaces

Takashi Furuya, David Mis, Ivan Dokmani\'c, Maarten V. de Hoop, Matti Lassas

PDF

TL;DR

This paper introduces function graph transformers, a measure-theoretic framework that universally approximates nonlinear operators between function spaces, enhancing understanding and capabilities of transformer-based operator learning.

Contribution

It develops a measure-theoretic approach to model operators with transformers, introduces function graph transformers, and proves their universal approximation capabilities.

Findings

01

Function graph transformers can approximate broad classes of nonlinear operators.

02

The framework accommodates regularized negative-order Sobolev inputs and multi-domain queries.

03

Universal approximation is achieved through compositions of softmax self-attention layers and MLPs.

Abstract

We study the approximation of nonlinear operators between function spaces by transformers. Our approach is to lift functions to measures supported on their graphs and leverage a recently introduced measure-theoretic view of transformers. A function $h$ is represented by its graph measure $γ_{h}$ , with finite tokens ${(x_{j}, h (x_{j}))}_{j = 1}^{N}$ being its empirical approximations. We show that this framework elegantly models discretization refinement via convergence of measures and provides a natural setting for operator learning. Within this framework, we introduce function graph transformers, a graph-preserving subclass of measure-theoretic transformers that maps graph measures to graph measures, which is to say that outputs remain single-valued functions. Crucially, this additional structure does not reduce generality: we prove that the resulting graph-preserving maps can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.