Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning
Yulong Wang, Tianhao Shen, Lifeng Liu, Jian Xie

TL;DR
Sibyl is a novel LLM-based agent framework that enhances complex reasoning by integrating a global workspace and multi-agent debate, achieving state-of-the-art results with GPT-4 on the GAIA benchmark.
Contribution
Introduces Sibyl, a simple yet effective agent framework leveraging minimal tools, inspired by cognitive theories, to improve long-term reasoning and system scalability.
Findings
Achieves 34.55% average score on GAIA benchmark with GPT-4.
Outperforms existing LLM agents in complex reasoning tasks.
Demonstrates improved reasoning capabilities with a debate-based self-refinement process.
Abstract
Existing agents based on large language models (LLMs) demonstrate robust problem-solving capabilities by integrating LLMs' inherent knowledge, strong in-context learning and zero-shot capabilities, and the use of tools combined with intricately designed LLM invocation workflows by humans. However, these agents still exhibit shortcomings in long-term reasoning and under-use the potential of existing tools, leading to noticeable deficiencies in complex real-world reasoning scenarios. To address these limitations, we introduce Sibyl, a simple yet powerful LLM-based agent framework designed to tackle complex reasoning tasks by efficiently leveraging a minimal set of tools. Drawing inspiration from Global Workspace Theory, Sibyl incorporates a global workspace to enhance the management and sharing of knowledge and conversation history throughout the system. Furthermore, guided by Society of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · AI-based Problem Solving and Planning · Logic, Reasoning, and Knowledge
MethodsAttention Is All You Need · Sparse Evolutionary Training · Residual Connection · Byte Pair Encoding · Layer Normalization · Label Smoothing · Linear Layer · Adam · Dropout · Multi-Head Attention
