O$^2$-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering

Jianbiao Mei; Tao Hu; Daocheng Fu; Licheng Wen; Xuemeng Yang; Rong Wu; Pinlong Cai; Xinyu Cai; Xing Gao; Yu Yang; Chengjun Xie; Botian Shi; Yong Liu; Yu Qiao

arXiv:2505.16582·cs.CL·May 27, 2025

O$^2$-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering

Jianbiao Mei, Tao Hu, Daocheng Fu, Licheng Wen, Xuemeng Yang, Rong Wu, Pinlong Cai, Xinyu Cai, Xing Gao, Yu Yang, Chengjun Xie, Botian Shi, Yong Liu, Yu Qiao

PDF

Open Access 1 Repo

TL;DR

O$^2$-Searcher is a reinforcement learning-based search agent designed for open-domain, open-ended question answering, effectively handling diverse question types by dynamically acquiring knowledge and adapting answer strategies, outperforming existing models.

Contribution

The paper introduces O$^2$-Searcher, a novel search agent that uses reinforcement learning and local search environments to improve open-ended question answering in open domains.

Findings

01

Outperforms leading LLM agents on the O$^2$-QA benchmark.

02

Achieves state-of-the-art results on various closed-ended QA benchmarks.

03

Uses only a 3B parameter model to surpass larger models.

Abstract

Large Language Models (LLMs), despite their advancements, are fundamentally limited by their static parametric knowledge, hindering performance on tasks requiring open-domain up-to-date information. While enabling LLMs to interact with external knowledge environments is a promising solution, current efforts primarily address closed-end problems. Open-ended questions, which characterized by lacking a standard answer or providing non-unique and diverse answers, remain underexplored. To bridge this gap, we present O $^{2}$ -Searcher, a novel search agent leveraging reinforcement learning to effectively tackle both open-ended and closed-ended questions in the open domain. O $^{2}$ -Searcher leverages an efficient, locally simulated search environment for dynamic knowledge acquisition, effectively decoupling the external world knowledge from model's sophisticated reasoning processes. It employs a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

acade-mate/o2-searcher
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Expert finding and Q&A systems · Multimodal Machine Learning Applications