Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models
Zongjie Li, Shuai Wang

TL;DR
This paper advocates for explicitly managing reasoning depth in code generation models to optimize the balance between speed and accuracy, enhancing performance and deployment flexibility.
Contribution
It introduces a novel framework for controlling reasoning resources in code generation, enabling dynamic trade-offs between fast and slow thinking modes.
Findings
Proposes reasoning as a controllable resource in model design
Suggests adaptive reasoning control improves accuracy-latency-cost trade-offs
Outlines new benchmarks and deployment strategies based on reasoning budgets
Abstract
This position paper proposes a fundamental shift in designing code generation models: treating reasoning depth as a controllable resource. Rather than being an incidental byproduct of prompting, we argue that the trade-off between rapid, direct answers ("fast thinking") and elaborate, chain-of-thought deliberation ("slow thinking") must be explicitly managed. We contend that optimizing reasoning budgets across the entire model lifecycle - from synthetic data creation and benchmarking to real-world deploymen - can unlock superior trade-offs among accuracy, latency, and cost. This paper outlines how adaptive control over reasoning can enrich supervision signals, motivate new multi-dimensional benchmarks, and inform cost-aware, security-conscious deployment policies. By viewing fast and slow thinking as complementary modes to be scheduled, we envision coding agents that think deep when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Scientific Computing and Data Management · Model-Driven Software Engineering Techniques
