What Formal Languages Can Transformers Express? A Survey

Lena Strobl; William Merrill; Gail Weiss; David Chiang; Dana; Angluin

arXiv:2311.00208·cs.LG·September 5, 2024·2 cites

What Formal Languages Can Transformers Express? A Survey

Lena Strobl, William Merrill, Gail Weiss, David Chiang, Dana, Angluin

PDF

Open Access 1 Video

TL;DR

This survey reviews recent theoretical research on the expressive power of transformer models in formal language recognition, clarifying their capabilities, limitations, and the influence of architectural choices.

Contribution

It provides a comprehensive overview and unifies diverse findings on what formal languages transformers can recognize, highlighting assumptions and frameworks used.

Findings

01

Transformers can recognize certain classes of formal languages.

02

Architectural choices significantly influence the computational power of transformers.

03

The survey clarifies conflicting results in existing research.

Abstract

As transformers have gained prominence in natural language processing, some researchers have investigated theoretically what problems they can and cannot solve, by treating problems as formal languages. Exploring such questions can help clarify the power of transformers relative to other models of computation, their fundamental capabilities and limits, and the impact of architectural choices. Work in this subarea has made considerable progress in recent years. Here, we undertake a comprehensive survey of this work, documenting the diverse assumptions that underlie different results and providing a unified framework for harmonizing seemingly contradictory findings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

What Formal Languages Can Transformers Express? A Survey· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems