Geometry of Reason: Spectral Signatures of Valid Mathematical Reasoning
Valentin No\"el

TL;DR
This paper introduces a training-free spectral analysis method to detect valid mathematical reasoning in large language models by analyzing attention patterns, achieving high accuracy without additional training.
Contribution
The authors develop a novel spectral diagnostics approach that distinguishes valid from invalid reasoning in transformer models without training or fine-tuning.
Findings
Spectral diagnostics significantly differentiate valid and invalid proofs with high effect sizes.
The method achieves 85-95% classification accuracy across multiple models.
Attention mechanism design influences which spectral features are most indicative of reasoning validity.
Abstract
We present a training-free method for detecting valid mathematical reasoning in large language models through spectral analysis of attention patterns. By treating attention matrices as adjacency matrices of dynamic graphs over tokens, we extract four interpretable spectral diagnostics, the Fiedler value (algebraic connectivity), high-frequency energy ratio (HFER), graph signal smoothness, and spectral entropy, that exhibit statistically significant differences between valid and invalid mathematical proofs. Experiments across seven transformer models from four independent architectural families (Meta Llama, Alibaba Qwen, Microsoft Phi, and Mistral AI) demonstrate that this spectral signature produces effect sizes up to Cohen's (), enabling 85.0--95.6\% classification accuracy under rigorous evaluation, with calibrated thresholds reaching 93--95\% on the full…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Adversarial Robustness in Machine Learning
