AutoS$^2$earch: Unlocking the Reasoning Potential of Large Models for Web-based Source Search
Zhengqiu Zhu, Yatai Ji, Jiaheng Huang, Yong Zhao, Sihang Qiu, Rusheng, Ju

TL;DR
AutoS$^2$earch is a novel framework that uses large models for zero-shot source search in web-based systems, mimicking human reasoning to improve safety and efficiency without relying on crowdsourcing.
Contribution
This work introduces AutoS$^2$earch, a new approach leveraging large models and visual-to-language conversion for autonomous source search in web applications.
Findings
Achieves near-human performance in source search tasks.
Eliminates need for crowdsourcing in hazard detection.
Demonstrates effectiveness in industrial safety scenarios.
Abstract
Web-based management systems have been widely used in risk control and industrial safety. However, effectively integrating source search capabilities into these systems, to enable decision-makers to locate and address the hazard (e.g., gas leak detection) remains a challenge. While prior efforts have explored using web crowdsourcing and AI algorithms for source search decision support, these approaches suffer from overheads in recruiting human participants and slow response times in time-sensitive situations. To address this, we introduce AutoSearch, a novel framework leveraging large models for zero-shot source search in web applications. AutoSearch operates on a simplified visual environment projected through a web-based display, utilizing a chain-of-thought prompt designed to emulate human reasoning. The multi-modal large language model (MLLMs) dynamically converts visual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Biomedical Text Mining and Ontologies · Data Management and Algorithms
