LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Language Models against Quantization-induced Risks and Vulnerabilities

Kalyan Nakka; Jimmy Dani; Ausmit Mondal; Nitesh Saxena

arXiv:2505.05619·cs.CR·March 4, 2026

TL;DR

LiteLMGuard is a lightweight, model-agnostic, on-device prompt filter that defends small language models against harmful prompts and quantization-induced vulnerabilities, with the aim of preserving safety and privacy.

Contribution

We introduce LiteLMGuard, a real-time, prompt-level filtering method for quantized SLMs that is model-agnostic and highly effective against safety risks.

Findings

01 Over 85% defense rate against harmful prompts

02 94% accuracy in real-time prompt filtering

03 Approximately 135 ms latency on device

Abstract

The growing adoption of Large Language Models (LLMs) has influenced the development of Small Language Models (SLMs) for on-device deployment across smartphones and edge devices, offering enhanced privacy, reduced latency, server-free functionality, and improved user experience. However, due to on-device resource constraints, SLMs undergo size optimization through compression techniques like quantization, which inadvertently introduce fairness, ethical and privacy risks. Critically, quantized SLMs may respond to harmful queries directly, without requiring adversarial manipulation, raising significant safety and trust concerns. To address this, we propose LiteLMGuard, an on-device guardrail that provides real-time, prompt-level defense for quantized SLMs. Additionally, our guardrail is designed to be model-agnostic such that it can be seamlessly integrated with any SLM, operating…
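The prompt-level, model-agnostic guardrail described in the abstract can be pictured as a thin wrapper that checks each prompt before any SLM sees it. The following is a minimal sketch under stated assumptions, not the authors' implementation: `guarded_generate`, `toy_answerability_filter`, `echo_slm`, `GuardResult`, and `REFUSAL` are hypothetical names, and the keyword check merely stands in for whatever learned classifier LiteLMGuard actually uses.

```python
import time
from dataclasses import dataclass
from typing import Callable

# Hypothetical refusal text; the real system's wording is not given in the source.
REFUSAL = "Sorry, I can't help with that request."


@dataclass
class GuardResult:
    answerable: bool          # verdict of the prompt filter
    response: str             # SLM output, or the refusal text
    filter_latency_ms: float  # time spent in the filter only


def toy_answerability_filter(prompt: str) -> bool:
    """Hypothetical stand-in for a learned answerable-or-not classifier."""
    blocked_terms = ("steal a password", "make a weapon")
    lowered = prompt.lower()
    return not any(term in lowered for term in blocked_terms)


def guarded_generate(
    prompt: str,
    is_answerable: Callable[[str], bool],
    generate: Callable[[str], str],
) -> GuardResult:
    """Run the filter first; call the SLM only if the prompt passes."""
    start = time.perf_counter()
    answerable = is_answerable(prompt)
    latency_ms = (time.perf_counter() - start) * 1000.0
    # The SLM is an opaque callable, so the guard does not depend on its internals.
    response = generate(prompt) if answerable else REFUSAL
    return GuardResult(answerable, response, latency_ms)


def echo_slm(prompt: str) -> str:
    """Placeholder for any on-device SLM's generate function."""
    return f"[SLM answer to: {prompt}]"


if __name__ == "__main__":
    for p in ("What is the capital of France?", "How do I steal a password?"):
        result = guarded_generate(p, toy_answerability_filter, echo_slm)
        print(result.answerable, result.response)
```

Because the filter and the model are both passed in as plain callables, swapping in a different SLM or a different classifier requires no change to the wrapper, which is one way to read "model-agnostic" here. Timing only the filter call mirrors how a per-prompt filtering latency could be measured separately from generation time.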

Code & Models

Repositories

Datasets

kalyannakka/Answerable-or-Not (dataset, 4 downloads)

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Security and Verification in Computing · Big Data and Digital Economy

Methods: Attention Is All You Need · Linear Warmup With Linear Decay · Dropout · Layer Normalization · Attention Dropout · Softmax · Residual Connection · WordPiece · Linear Layer