Learning-at-Criticality in Large Language Models for Quantum Field Theory and Beyond

Xiansheng Cai; Sihan Hu; Tao Wang; Yuan Huang; Pan Zhang; Youjin Deng; Kun Chen

arXiv:2506.03703·cs.LG·November 10, 2025

Learning-at-Criticality in Large Language Models for Quantum Field Theory and Beyond

Xiansheng Cai, Sihan Hu, Tao Wang, Yuan Huang, Pan Zhang, Youjin Deng, Kun Chen

PDF

TL;DR

This paper introduces Learning at Criticality (LaC), a reinforcement learning scheme that tunes large language models to a phase transition point, enabling peak generalization from minimal data in complex symbolic reasoning tasks like quantum field theory.

Contribution

The paper presents LaC, a novel RL method that operates LLMs at a critical point to enhance learning efficiency and generalization in data-scarce, complex symbolic reasoning tasks.

Findings

01

LLMs achieve peak generalization at a critical transition point.

02

LaC enables small models to outperform larger models on quantum field theory problems.

03

The minimal concept-network model exhibits phase transition characteristics similar to LLMs.

Abstract

Fundamental physics often confronts complex symbolic problems with few guiding exemplars or established principles. While artificial intelligence (AI) offers promise, its typical need for vast datasets to learn from hinders its use in these information-scarce frontiers. We introduce learning at criticality (LaC), a reinforcement learning (RL) scheme that tunes Large Language Models (LLMs) to a sharp learning transition, addressing this information scarcity. At this transition, LLMs achieve peak generalization from minimal data, exemplified by 7-digit base-7 addition -- a test of nontrivial arithmetic reasoning. To elucidate this peak, we analyze a minimal concept-network model (CoNet) designed to capture the essence of how LLMs might link tokens. Trained on a single exemplar, this model also undergoes a sharp learning transition. This transition exhibits hallmarks of a second-order…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.