Universal In-Context Approximation By Prompting Fully Recurrent Models
Aleksandar Petrov, Tom A. Lamb, Alasdair Paren, Philip H.S. Torr, Adel, Bibi

TL;DR
This paper proves that various fully recurrent neural architectures, including RNNs, LSTMs, and SSMs, can serve as universal in-context approximators, extending previous transformer-based results to a broader class of models.
Contribution
It introduces a formal framework and a programming language, LSRL, to demonstrate the universal in-context approximation capability of fully recurrent models.
Findings
Recurrent models like RNNs, LSTMs, and SSMs can approximate any function in-context.
Gated architectures such as LSTMs and GRUs offer more stable operation implementations.
The LSRL language facilitates further analysis and interpretability of recurrent models.
Abstract
Zero-shot and in-context learning enable solving tasks without model fine-tuning, making them essential for developing generative model solutions. Therefore, it is crucial to understand whether a pretrained model can be prompted to approximate any function, i.e., whether it is a universal in-context approximator. While it was recently shown that transformer models do possess this property, these results rely on their attention mechanism. Hence, these findings do not apply to fully recurrent architectures like RNNs, LSTMs, and the increasingly popular SSMs. We demonstrate that RNNs, LSTMs, GRUs, Linear RNNs, and linear gated architectures such as Mamba and Hawk/Griffin can also serve as universal in-context approximators. To streamline our argument, we introduce a programming language called LSRL that compiles to these fully recurrent architectures. LSRL may be of independent interest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Image Enhancement Techniques
