TL;DR
This paper introduces C-voting, a confidence-based test-time scaling method for recurrent neural models that improves reasoning task performance without needing explicit energy functions.
Contribution
It proposes C-voting, a novel test-time scaling strategy applicable to recurrent models, and demonstrates its effectiveness with a new attention-based model on reasoning benchmarks.
Findings
C-voting achieves 4.9% higher accuracy on Sudoku-hard than energy-based voting.
Combined with ItrSA++, C-voting outperforms HRM on Sudoku-extreme and Maze tasks.
C-voting improves reasoning performance without requiring explicit energy functions.
Abstract
Neural network models with latent recurrent processing, where identical layers are recursively applied to the latent state, have gained attention as promising models for performing reasoning tasks. A strength of such models is that they enable test-time scaling, where the models can enhance their performance in the test phase without additional training. Models such as the Hierarchical Reasoning Model (HRM) and Artificial Kuramoto Oscillatory Neurons (AKOrN) can facilitate deeper reasoning by increasing the number of recurrent steps, thereby enabling the completion of challenging tasks, including Sudoku, Maze solving, and AGI benchmarks. In this work, we introduce confidence-based voting (C-voting), a test-time scaling strategy designed for recurrent models with multiple latent candidate trajectories. Initializing the latent state with multiple candidates using random variables,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
