Meta-Tool: Efficient Few-Shot Tool Adaptation for Small Language Models
Sachin Kumar

TL;DR
This study evaluates whether small language models can effectively use tools through simple prompting versus complex adaptation methods, finding prompts alone often suffice.
Contribution
The paper demonstrates that hypernetwork-based adaptation offers no significant advantage over few-shot prompting in small language models.
Findings
Few-shot prompting contributes +21.5% to performance.
Documentation encoding adds +5.0%.
Hypernetwork adaptation adds 0%.
Abstract
Can small language models achieve strong tool-use performance without complex adaptation mechanisms? This paper investigates this question through Meta-Tool, a controlled empirical study comparing hypernetwork-based LoRA adaptation against carefully designed few-shot prompting. Using a Llama-3.2-3B-Instruct backbone, we evaluate four adaptation mechanisms--few-shot prompting, documentation encoding, hypernetwork-generated LoRA weights, and value-guided beam search--across four diverse benchmarks: Gorilla APIBench, Spider 2.0, WebArena, and InterCode. Our central finding is a well-supported negative result: despite generating non-trivial weight matrices, the 227.8M-parameter hypernetwork provides no measurable improvement over few-shot prompting alone. Comprehensive ablation studies reveal that few-shot examples contribute +21.5% to performance and documentation contributes +5.0%, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
