Dynamic Parallel Tree Search for Efficient LLM Reasoning
Yifu Ding, Wentao Jiang, Shunyu Liu, Yongcheng Jing, Jinyang Guo,, Yingjie Wang, Jing Zhang, Zengmao Wang, Ziwei Liu, Bo Du, Xianglong Liu,, Dacheng Tao

TL;DR
This paper introduces Dynamic Parallel Tree Search (DPTS), a novel framework that enhances the efficiency of Tree of Thoughts reasoning in large language models by dynamically optimizing reasoning paths and reducing redundant exploration.
Contribution
The paper proposes DPTS, a new parallelism framework with fine-grained cache management and candidate filtering to improve reasoning efficiency and scalability in LLMs.
Findings
DPTS achieves 2-4x efficiency improvement on average.
DPTS maintains or surpasses existing accuracy levels.
Effective reduction of redundant exploration in reasoning paths.
Abstract
Tree of Thoughts (ToT) enhances Large Language Model (LLM) reasoning by structuring problem-solving as a spanning tree. However, recent methods focus on search accuracy while overlooking computational efficiency. The challenges of accelerating the ToT lie in the frequent switching of reasoning focus, and the redundant exploration of suboptimal solutions. To alleviate this dilemma, we propose Dynamic Parallel Tree Search (DPTS), a novel parallelism framework that aims to dynamically optimize the reasoning path in inference. It includes the Parallelism Streamline in the generation phase to build up a flexible and adaptive parallelism with arbitrary paths by fine-grained cache management and alignment. Meanwhile, the Search and Transition Mechanism filters potential candidates to dynamically maintain the reasoning focus on more possible solutions and have less redundancy. Experiments on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Digital Rights Management and Security · Semantic Web and Ontologies
MethodsFocus
