Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning

Jinzheng Li; Sibo Ju; Yanzhou Su; Hongguang Li; Yiqing Shen

arXiv:2505.18831·cs.IR·May 27, 2025

Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning

Jinzheng Li, Sibo Ju, Yanzhou Su, Hongguang Li, Yiqing Shen

PDF

Open Access

TL;DR

This paper introduces SearchExpert, a novel training framework for large language models that enhances reasoning and multimedia search capabilities through fine-tuning, reinforcement learning, and multimedia understanding, resulting in significant performance improvements.

Contribution

The paper presents a new training method combining supervised fine-tuning and reinforcement learning to improve LLMs' reasoning and multimedia search abilities, along with an automated benchmark for evaluation.

Findings

01

SearchExpert outperforms Perplexity Pro by 36.60% on FinSearchBench-24.

02

SearchExpert achieves 54.54% higher accuracy on SearchExpertBench-25.

03

Human evaluations show improved readability of SearchExpert's outputs.

Abstract

Existing large language models (LLMs) driven search agents typically rely on prompt engineering to decouple the user queries into search plans, limiting their effectiveness in complex scenarios requiring reasoning. Furthermore, they suffer from excessive token consumption due to Python-based search plan representations and inadequate integration of multimedia elements for both input processing and response generation. To address these challenges, we introduce SearchExpert, a training method for LLMs to improve their multimedia search capabilities in response to complex search queries. Firstly, we reformulate the search plan in an efficient natural language representation to reduce token consumption. Then, we propose the supervised fine-tuning for searching (SFTS) to fine-tune LLM to adapt to these representations, together with an automated dataset construction pipeline. Secondly, to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOpen Education and E-Learning · Digital Rights Management and Security · Peer-to-Peer Network Technologies