Tool-Augmented Hybrid Ensemble Reasoning with Distillation for Bilingual Mathematical Problem Solving

Peiqing Lu; Yuan Zhang; Haoyun Zhang; Jiasen Zheng; Kejian Tong; Wenjun Wu

arXiv:2512.19093·cs.AI·December 23, 2025

Tool-Augmented Hybrid Ensemble Reasoning with Distillation for Bilingual Mathematical Problem Solving

Peiqing Lu, Yuan Zhang, Haoyun Zhang, Jiasen Zheng, Kejian Tong, Wenjun Wu

PDF

Open Access

TL;DR

HERALD is a novel framework that combines language reasoning, symbolic calculation, and adaptive ensemble techniques to improve bilingual mathematical problem solving accuracy and stability.

Contribution

The paper introduces HERALD, a tool-augmented hybrid ensemble framework with adaptive routing and knowledge distillation for bilingual math reasoning.

Findings

01

HERALD improves reasoning accuracy and calculation precision in bilingual math problems.

02

Adaptive routing and reinforcement learning reduce redundancy and enhance stability.

03

Knowledge distillation accelerates inference without sacrificing accuracy.

Abstract

Bilingual mathematical problem solving needs a clear link between language reasoning and symbolic calculation. Large language models often handle language well but are weak in accurate computation. This paper presents HERALD (Hybrid Ensemble Reasoning with Adaptive Learning and Distillation), a framework that joins reasoning and calculation using NuminaMath-7B-TIR, GPT-4o, and Mistral-7B. HERALD uses adaptive routing, tool-based reinforcement learning, and knowledge distillation to connect different reasoning paths. Confidence calibration keeps weighting stable, and dual-path checking keeps results correct. Reinforcement learning controls tool use to cut redundancy, and distillation lowers delay without hurting accuracy. The system shows that combining symbolic checking, adaptive ensembles, and bilingual fine-tuning helps achieve both fluent reasoning and precise calculation. HERALD…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsConstraint Satisfaction and Optimization · Natural Language Processing Techniques · Topic Modeling