Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models
Pan Lu, Baolin Peng, Hao Cheng, Michel Galley, Kai-Wei Chang, Ying, Nian Wu, Song-Chun Zhu, Jianfeng Gao

TL;DR
Chameleon enhances large language models with plug-and-play modules for compositional reasoning, significantly improving performance on complex multi-modal knowledge tasks by integrating various external tools through an LLM-based planner.
Contribution
This paper introduces Chameleon, a novel system that augments LLMs with modular tools and a planning mechanism to address their limitations in reasoning and information access.
Findings
Achieves 86.54% accuracy on ScienceQA, surpassing previous results by 11.37%.
Reaches 98.78% accuracy on TabMWP, setting a new state-of-the-art.
GPT-4-powered planner demonstrates more rational tool selection than ChatGPT.
Abstract
Large language models (LLMs) have achieved remarkable progress in solving various natural language processing tasks due to emergent reasoning abilities. However, LLMs have inherent limitations as they are incapable of accessing up-to-date information (stored on the Web or in task-specific knowledge bases), using external tools, and performing precise mathematical and logical reasoning. In this paper, we present Chameleon, an AI system that mitigates these limitations by augmenting LLMs with plug-and-play modules for compositional reasoning. Chameleon synthesizes programs by composing various tools (e.g., LLMs, off-the-shelf vision models, web search engines, Python functions, and heuristic-based modules) for accomplishing complex reasoning tasks. At the heart of Chameleon is an LLM-based planner that assembles a sequence of tools to execute to generate the final response. We showcase…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research
MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Dropout · Byte Pair Encoding · Softmax · Layer Normalization · Position-Wise Feed-Forward Layer · Linear Layer · Absolute Position Encodings
