Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning
Jinyuan Wang, Junlong Li, Hai Zhao

TL;DR
This paper introduces Self-prompted Chain-of-Thought (SP-CoT), an automated method for enhancing large language models' multi-hop reasoning in open-domain question answering by generating high-quality reasoning chains.
Contribution
The paper presents a novel automated framework, SP-CoT, that improves reasoning quality and scalability for LLMs in open-domain multi-hop QA tasks, outperforming previous methods.
Findings
SP-CoT significantly surpasses previous SOTA on large LLMs.
Nearly doubles zero-shot performance of small-scale LLMs.
Recalls ~50% of intermediate answers on MuSiQue-Ans dataset.
Abstract
In open-domain question-answering (ODQA), most existing questions require single-hop reasoning on commonsense. To further extend this task, we officially introduce open-domain multi-hop reasoning (ODMR) by answering multi-hop questions with explicit reasoning steps in open-domain setting. Recently, large language models (LLMs) have found significant utility in facilitating ODQA without external corpus. Furthermore, chain-of-thought (CoT) prompting boosts the reasoning capability of LLMs to a greater extent with manual or automated paradigms. However, existing automated methods lack of quality assurance, while manual approaches suffer from limited scalability and poor diversity, hindering the capabilities of LLMs. In this paper, we propose Self-prompted Chain-of-Thought (SP-CoT), an automated framework to mass-produce high quality CoTs of LLMs, by LLMs and for LLMs. SP-CoT introduces an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
