Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling

Falcon LLM Team; Iheb Chaabane; Puneesh Khanna; Suhail Mohmad; Slim Frikha; Shi Hu; Abdalgader Abubaker; Reda Alami; Mikhail Lubinets; Mohamed El Amine Seddik; Hakim Hacid

arXiv:2601.02346·cs.AI·January 6, 2026

TL;DR

Falcon-H1R is a 7B-parameter, reasoning-optimized model with a hybrid architecture. Through careful data curation and targeted training (SFT and RL), it matches or outperforms SOTA reasoning models that are 2× to 7× larger, while also improving test-time scaling efficiency.

Contribution

Introduces Falcon-H1R, a small, efficient reasoning model whose hybrid architecture and targeted training strategies allow it to match or outperform larger models on reasoning tasks.

Findings

01 Matches or exceeds larger SOTA reasoning models

02 Achieves high test-time scaling efficiency

03 Demonstrates strong reasoning performance with fewer parameters

Abstract

This work introduces Falcon-H1R, a 7B-parameter reasoning-optimized model that establishes the feasibility of achieving competitive reasoning performance with small language models (SLMs). Falcon-H1R stands out for its parameter efficiency, consistently matching or outperforming SOTA reasoning models that are $2 \times$ to $7 \times$ larger across a variety of reasoning-intensive benchmarks. These results underscore the importance of careful data curation and targeted training strategies (via both efficient SFT and RL scaling) in delivering significant performance gains without increasing model size. Furthermore, Falcon-H1R advances the 3D limits of reasoning efficiency by combining faster inference (through its hybrid-parallel architecture design), token efficiency, and higher accuracy. This unique blend makes Falcon-H1R-7B a practical backbone for scaling advanced reasoning systems,…

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification