SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments

Syed Abdul Gaffar Shakhadri; Kruthika KR; and Rakshit Aralimatti

arXiv:2410.11331·cs.CL·June 23, 2025

SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments

Syed Abdul Gaffar Shakhadri, Kruthika KR, and Rakshit Aralimatti

PDF

Open Access

TL;DR

Shakti is a 2.5 billion parameter language model optimized for edge devices, offering high performance and efficiency for real-time NLP tasks in resource-limited environments, including support for vernacular languages.

Contribution

The paper introduces Shakti, a compact yet powerful language model designed specifically for edge AI, combining high accuracy with low resource consumption.

Findings

01

Performs competitively against larger models on benchmarks.

02

Maintains low latency and high efficiency on edge devices.

03

Supports vernacular languages and domain-specific tasks.

Abstract

We introduce Shakti, a 2.5 billion parameter language model specifically optimized for resource-constrained environments such as edge devices, including smartphones, wearables, and IoT systems. Shakti combines high-performance NLP with optimized efficiency and precision, making it ideal for real-time AI applications where computational resources and memory are limited. With support for vernacular languages and domain-specific tasks, Shakti excels in industries such as healthcare, finance, and customer service. Benchmark evaluations demonstrate that Shakti performs competitively against larger models while maintaining low latency and on-device efficiency, positioning it as a leading solution for edge AI.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

Methodstravel james · Residual Connection · Attention Is All You Need · Softmax · Multi-Head Attention · Linear Layer · Layer Normalization · Refunds@Expedia|||How do I get a full refund from Expedia? · Byte Pair Encoding · Attention Dropout