Gorilla: Large Language Model Connected with Massive APIs
Shishir G. Patil, Tianjun Zhang, Xin Wang, Joseph E. Gonzalez

TL;DR
Gorilla is a fine-tuned LLaMA-based model that significantly improves API call accuracy over GPT-4, effectively uses document retrieval for dynamic updates, and reduces hallucinations in tool usage.
Contribution
The paper introduces Gorilla, a novel fine-tuned LLaMA model that outperforms GPT-4 in API call generation and demonstrates enhanced adaptability and reliability through integration with a retrieval system.
Findings
Gorilla surpasses GPT-4 in API call accuracy.
Integration with retrieval improves adaptability to document changes.
Reduces hallucinations in API usage compared to prompting GPT-4.
Abstract
Large Language Models (LLMs) have seen an impressive wave of advances recently, with models now excelling in a variety of tasks, such as mathematical reasoning and program synthesis. However, their potential to effectively use tools via API calls remains unfulfilled. This is a challenging task even for today's state-of-the-art LLMs such as GPT-4, largely due to their inability to generate accurate input arguments and their tendency to hallucinate the wrong usage of an API call. We release Gorilla, a finetuned LLaMA-based model that surpasses the performance of GPT-4 on writing API calls. When combined with a document retriever, Gorilla demonstrates a strong capability to adapt to test-time document changes, enabling flexible user updates or version changes. It also substantially mitigates the issue of hallucination, commonly encountered when prompting LLMs directly. To evaluate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗gorilla-llm/gorilla-7b-hf-delta-v0model· 11 dl· ♡ 5411 dl♡ 54
- 🤗TheBloke/gorilla-7B-GGMLmodel· ♡ 45♡ 45
- 🤗TheBloke/gorilla-7B-fp16model· 28 dl· ♡ 1228 dl♡ 12
- 🤗TheBloke/gorilla-7B-GPTQmodel· 23 dl· ♡ 2623 dl♡ 26
- 🤗gorilla-llm/gorilla-7b-tf-delta-v0model· 6 dl· ♡ 66 dl♡ 6
- 🤗gorilla-llm/gorilla-7b-th-delta-v0model· 4 dl· ♡ 54 dl♡ 5
- 🤗gorilla-llm/gorilla-mpt-7b-hf-v0model· 20 dl· ♡ 1320 dl♡ 13
- 🤗gorilla-llm/gorilla-falcon-7b-hf-v0model· 15 dl· ♡ 3815 dl♡ 38
- 🤗gorilla-llm/gorilla-7b-hf-delta-v1model· 7 dl· ♡ 267 dl♡ 26
- 🤗TheBloke/gorilla-7B-GGUFmodel· 100 dl· ♡ 3100 dl♡ 3
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research
MethodsAttention Is All You Need · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing · Adam · Dense Connections · Residual Connection
