Quecto-V1: Empirical Analysis of 8-bit Quantized Small Language Models for On-Device Legal Retrieval
Subrit Dikshit

TL;DR
Quecto-V1 is a small, 8-bit quantized legal language model trained on Indian statutes, enabling high-accuracy legal retrieval on resource-constrained devices while maintaining privacy and domain specificity.
Contribution
This paper introduces Quecto-V1, a domain-specific legal language model with 8-bit quantization, optimized for offline deployment and high retrieval accuracy in resource-limited environments.
Findings
8-bit quantization reduces model size by 74% with minimal accuracy loss
Quecto-V1 outperforms generalist models in legal retrieval tasks
Model runs efficiently on consumer-grade CPUs offline
Abstract
The rapid proliferation of Large Language Models (LLMs) has revolutionized Natural Language Processing (NLP) but has simultaneously created a "resource divide." State-of-the-art legal intelligence systems typically rely on massive parameter counts (7B+) and cloud-based inference, rendering them inaccessible to practitioners in resource-constrained environments and posing significant data sovereignty risks. This paper introduces Quecto-V1, a domain-specific Small Language Model (SLM) engineered to democratize access to Indian legal intelligence. Built upon a custom configuration of the GPT-2 architecture (124 million parameters), Quecto-V1 was trained from scratch exclusively on a corpus of Indian statutes, including the Indian Penal Code (IPC), the Code of Criminal Procedure (CrPC), and the Constitution of India. Unlike generalist models, which prioritize broad world knowledge, our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Law · Artificial Intelligence in Healthcare and Education
