CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis

Bohan Zhang; Xiaokang Zhang; Jing Zhang; Jifan Yu; Sijia Luo; Jie Tang

arXiv:2501.01668·cs.CL·June 17, 2025

CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis

Bohan Zhang, Xiaokang Zhang, Jing Zhang, Jifan Yu, Sijia Luo, Jie Tang

PDF

Open Access 1 Repo 1 Models 1 Datasets 1 Video

TL;DR

This paper introduces CoT-based Synthesizer, a novel inference scaling method that synthesizes better answers from multiple flawed candidates using Chain-of-Thought reasoning, improving large language models' accuracy efficiently.

Contribution

The paper presents a new inference scaling strategy leveraging Chain-of-Thought reasoning to synthesize superior answers, even with all candidates flawed, and introduces an automated data generation pipeline for training smaller models.

Findings

01

Significant accuracy improvements on benchmark datasets.

02

Enhanced performance for both small and large LLMs.

03

Open-source code and data for reproducibility.

Abstract

Current inference scaling methods, such as Self-consistency and Best-of-N, have proven effective in improving the accuracy of LLMs on complex reasoning tasks. However, these methods rely heavily on the quality of candidate responses and are unable to produce correct answers when all candidates are incorrect. In this paper, we propose a novel inference scaling strategy, CoT-based Synthesizer, which leverages CoT reasoning to synthesize superior answers by analyzing complementary information from multiple candidate responses, even when all candidate responses are flawed. To enable a lightweight and cost-effective implementation, we introduce an automated data generation pipeline that creates diverse training data. This allows smaller LLMs trained on this data to improve the inference accuracy of larger models, including API-based LLMs. Experimental results across four benchmark datasets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ruckbreasoning/cot-based-synthesizer
pytorchOfficial

Models

🤗
BoHanMint/Synthesizer-8B-math
model

Datasets

BoHanMint/Synthesizer-8B-math-train-data
dataset· 4 dl
4 dl

Videos

CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Software Engineering Research

MethodsAttention Is All You Need · Linear Layer · Softmax · Multi-Head Attention · Synthesizer