Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
Joongwon Kim, Bhargavi Paranjape, Tushar Khot, Hannaneh Hajishirzi

TL;DR
Husky is an open-source, unified language agent capable of multi-step reasoning across diverse tasks, outperforming prior models and matching larger models like GPT-4 on complex reasoning benchmarks.
Contribution
Husky introduces a holistic, open-source language agent with a unified action space and curated expert models, enabling versatile reasoning over multiple complex tasks.
Findings
Husky outperforms prior agents on 14 datasets.
Husky matches or exceeds GPT-4 on mixed-tool reasoning tasks.
Husky effectively handles numerical, tabular, and knowledge-based reasoning.
Abstract
Language agents perform complex tasks by using tools to execute each step precisely. However, most existing agents are based on proprietary models or designed to target specific tasks, such as mathematics or multi-hop question answering. We introduce Husky, a holistic, open-source language agent that learns to reason over a unified action space to address a diverse set of complex tasks involving numerical, tabular, and knowledge-based reasoning. Husky iterates between two stages: 1) generating the next action to take towards solving a given task and 2) executing the action using expert models and updating the current solution state. We identify a thorough ontology of actions for addressing complex tasks and curate high-quality data to train expert models for executing these actions. Our experiments show that Husky outperforms prior language agents across 14 evaluation datasets.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Natural Language Processing Techniques
MethodsAttention Is All You Need · Sparse Evolutionary Training · Softmax · Focus · Layer Normalization · Ontology · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam
