TL;DR
This paper evaluates the impact of System 1 and System 2 reasoning in small language models for decentralized governance, revealing that simpler, faster reasoning (System 1) offers superior robustness and efficiency over complex, iterative reasoning (System 2).
Contribution
It introduces Sentinel-Bench, an empirical framework for ablation studies on reasoning modes in SLMs, demonstrating the superiority of System 1 reasoning in adversarial decentralized environments.
Findings
System 1 reasoning achieved 100% adversarial robustness and juridical consistency.
System 2 reasoning caused a 26.7% non-convergence rate and increased latency by 17x.
Models exhibited 'Reasoning-Induced Sycophancy' with longer internal monologues when failing adversarial traps.
Abstract
Decentralized Autonomous Organizations (DAOs) are inclined explore Small Language Models (SLMs) as edge-native constitutional firewalls to vet proposals and mitigate semantic social engineering. While scaling inference-time compute (System 2) enhances formal logic, its efficacy in highly adversarial, cryptoeconomic governance environments remains underexplored. To address this, we introduce Sentinel-Bench, an 840-inference empirical framework executing a strict intra-model ablation on Qwen-3.5-9B. By toggling latent reasoning across frozen weights, we isolate the impact of inference-time compute against an adversarial Optimism DAO dataset. Our findings reveal a severe compute-accuracy inversion. The autoregressive baseline (System 1) achieved 100% adversarial robustness, 100% juridical consistency, and state finality in under 13 seconds. Conversely, System 2 reasoning introduced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
