Small Language Models (SLMs) Can Still Pack a Punch: A survey (updated 2026)

Akanksha Gupta; Bijo Thomas; Harshita Asnani; Phanindra Reddy Madduru; Samia Feroze; Shreyas Subramanian; Vikram Elango; Mecit Gungor

arXiv:2501.05465·cs.CL·May 15, 2026·3 cites

Small Language Models (SLMs) Can Still Pack a Punch: A survey (updated 2026)

Akanksha Gupta, Bijo Thomas, Harshita Asnani, Phanindra Reddy Madduru, Samia Feroze, Shreyas Subramanian, Vikram Elango, Mecit Gungor

PDF

TL;DR

This survey reviews small language models (1-8 billion parameters) showing they can match or outperform larger models, emphasizing efficiency and scalability in AI development.

Contribution

It introduces a comprehensive overview of SLMs, their capabilities, and techniques for balancing performance with cost, guiding future model development.

Findings

01

SLMs can perform as well or better than larger models.

02

Techniques exist to optimize SLMs for efficiency and scalability.

03

SLMs' effective sizes reflect increased capabilities relative to LLMs.

Abstract

As foundation AI models continue to increase in size, an important question arises - is massive scale the only path forward? This survey of about 160 papers presents a family of Small Language Models (SLMs) in the 1 to 8 billion parameter range that demonstrate smaller models can perform as well, or even outperform large models. We explore task agnostic, general purpose SLMs, task-specific SLMs and techniques to create SLMs that can guide the community to build models while balancing performance, efficiency, scalability and cost. Furthermore we define and characterize SLMs' effective sizes, representing increased capability with respect to LLMs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.