Tree-of-Mixed-Thought: Combining Fast and Slow Thinking for Multi-hop   Visual Reasoning

Pengbo Hu; Ji Qi; Xingyu Li; Hong Li; Xinqi Wang; Bing Quan; Ruiyu; Wang; Yi Zhou

arXiv:2308.09658·cs.CL·August 22, 2023·5 cites

Tree-of-Mixed-Thought: Combining Fast and Slow Thinking for Multi-hop Visual Reasoning

Pengbo Hu, Ji Qi, Xingyu Li, Hong Li, Xinqi Wang, Bing Quan, Ruiyu, Wang, Yi Zhou

PDF

Open Access

TL;DR

This paper introduces a hierarchical plan-search algorithm inspired by human cognition to improve multi-hop visual reasoning with large language models, balancing accuracy and efficiency effectively.

Contribution

It proposes a novel Tree-of-Mixed-Thought algorithm combining fast and slow reasoning processes for better multi-hop visual reasoning performance.

Findings

01

Outperforms existing methods in accuracy and efficiency.

02

Reduces inference steps significantly.

03

Provides a systematic evaluation framework for reasoning tasks.

Abstract

There emerges a promising trend of using large language models (LLMs) to generate code-like plans for complex inference tasks such as visual reasoning. This paradigm, known as LLM-based planning, provides flexibility in problem solving and endows better interpretability. However, current research is mostly limited to basic scenarios of simple questions that can be straightforward answered in a few inference steps. Planning for the more challenging multi-hop visual reasoning tasks remains under-explored. Specifically, under multi-hop reasoning situations, the trade-off between accuracy and the complexity of plan-searching becomes prominent. The prevailing algorithms either address the efficiency issue by employing the fast one-stop generation or adopt a complex iterative generation method to improve accuracy. Both fail to balance the need for efficiency and performance. Drawing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Machine Learning in Materials Science · Software Engineering Research

Methodsfail