Sibyl: Simple yet Effective Agent Framework for Complex Real-world   Reasoning

Yulong Wang; Tianhao Shen; Lifeng Liu; Jian Xie

arXiv:2407.10718·cs.AI·July 17, 2024·1 cites

Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning

Yulong Wang, Tianhao Shen, Lifeng Liu, Jian Xie

PDF

Open Access 1 Repo

TL;DR

Sibyl is a novel LLM-based agent framework that enhances complex reasoning by integrating a global workspace and multi-agent debate, achieving state-of-the-art results with GPT-4 on the GAIA benchmark.

Contribution

Introduces Sibyl, a simple yet effective agent framework leveraging minimal tools, inspired by cognitive theories, to improve long-term reasoning and system scalability.

Findings

01

Achieves 34.55% average score on GAIA benchmark with GPT-4.

02

Outperforms existing LLM agents in complex reasoning tasks.

03

Demonstrates improved reasoning capabilities with a debate-based self-refinement process.

Abstract

Existing agents based on large language models (LLMs) demonstrate robust problem-solving capabilities by integrating LLMs' inherent knowledge, strong in-context learning and zero-shot capabilities, and the use of tools combined with intricately designed LLM invocation workflows by humans. However, these agents still exhibit shortcomings in long-term reasoning and under-use the potential of existing tools, leading to noticeable deficiencies in complex real-world reasoning scenarios. To address these limitations, we introduce Sibyl, a simple yet powerful LLM-based agent framework designed to tackle complex reasoning tasks by efficiently leveraging a minimal set of tools. Drawing inspiration from Global Workspace Theory, Sibyl incorporates a global workspace to enhance the management and sharing of knowledge and conversation history throughout the system. Furthermore, guided by Society of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ag2s1/sibyl-system
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · AI-based Problem Solving and Planning · Logic, Reasoning, and Knowledge

MethodsAttention Is All You Need · Sparse Evolutionary Training · Residual Connection · Byte Pair Encoding · Layer Normalization · Label Smoothing · Linear Layer · Adam · Dropout · Multi-Head Attention