Shaping Schema via Language Representation as the Next Frontier for LLM Intelligence Expanding

Zhiqin Yang; Yuhan Liu; Jingwen Fu; Pei Fu; Bo Han; Masashi Sugiyama; Nanning Zheng

arXiv:2605.09271·cs.AI·May 13, 2026

TL;DR

This paper argues that shaping schemas through advanced language representation is crucial for expanding LLM intelligence. It supports the claim with a formalization and with empirical evidence that deliberate language design improves performance.

Contribution

The paper formalizes the role of language representation in LLMs and provides empirical evidence that deliberate language design improves performance without changing model scale.

Findings

01 Performance gains from deliberate language representation design

02 LLM internal features vary with different language representations

03 Controlled experiments show language structure impacts task understanding

Abstract

Although natural language is the default medium for Large Language Models (LLMs), its limited expressive capacity creates a profound bottleneck for complex problem-solving. While recent advancements in AI have relied heavily on scaling, merely internalizing knowledge does not guarantee its effective application. Defining language representation as the linguistic and symbolic constructs used to map and model the real world, this paper argues that shaping schemas through advanced language representation is the next frontier for expanding LLM intelligence. We posit that an LLM's knowledge activation and organization -- its schema -- depends heavily on the structural and symbolic sophistication of the language used to represent a given task. This paper contributes both a formalization of this claim and the empirical evidence to support it. With a new formalization, we present multiple lines…
