Beyond the Prompt in Large Language Models: Comprehension, In-Context Learning, and Chain-of-Thought

Yuling Jiao; Yanming Lai; Huazhen Lin; Wensen Ma; Houduo Qi; Defeng Sun

arXiv:2603.10000·cs.CL·March 13, 2026

Beyond the Prompt in Large Language Models: Comprehension, In-Context Learning, and Chain-of-Thought

Yuling Jiao, Yanming Lai, Huazhen Lin, Wensen Ma, Houduo Qi, Defeng Sun

PDF

Open Access

TL;DR

This paper investigates the theoretical foundations of how large language models understand prompts, perform in-context learning, and utilize chain-of-thought reasoning, revealing mechanisms behind their emergent capabilities.

Contribution

It provides a theoretical analysis explaining how LLMs decode prompt semantics, facilitate in-context learning, and activate reasoning through chain-of-thought prompting, which was previously not well understood.

Findings

01

LLMs infer token transition probabilities via autoregressive processes

02

In-Context Learning reduces prompt ambiguity and improves task focus

03

Chain-of-Thought prompts enable task decomposition and complex reasoning

Abstract

Large Language Models (LLMs) have demonstrated remarkable proficiency across diverse tasks, exhibiting emergent properties such as semantic prompt comprehension, In-Context Learning (ICL), and Chain-of-Thought (CoT) reasoning. Despite their empirical success, the theoretical mechanisms driving these phenomena remain poorly understood. This study dives into the foundations of these observations by addressing three critical questions: (1) How do LLMs accurately decode prompt semantics despite being trained solely on a next-token prediction objective? (2) Through what mechanism does ICL facilitate performance gains without explicit parameter updates? and (3) Why do intermediate reasoning steps in CoT prompting effectively unlock capabilities for complex, multi-step problems? Our results demonstrate that, through the autoregressive process, LLMs are capable of exactly inferring the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning