JavelinGuard: Low-Cost Transformer Architectures for LLM Security
Yash Datta, Sharath Rajasekar

TL;DR
JavelinGuard introduces a suite of low-cost, high-performance transformer architectures optimized for detecting malicious intent in LLM interactions, balancing accuracy, speed, and resource efficiency for practical deployment.
Contribution
The paper presents novel transformer-based models specifically designed for LLM security, with comprehensive benchmarking across diverse datasets and comparison to existing guardrail models.
Findings
Raudra offers the most robust performance among the architectures.
All models outperform existing open-source guardrails in cost-performance trade-offs.
Models achieve high accuracy with approximately 400M parameters and run quickly even on standard CPU hardware.
Abstract
We present JavelinGuard, a suite of low-cost, high-performance model architectures designed for detecting malicious intent in Large Language Model (LLM) interactions, optimized specifically for production deployment. Recent advances in transformer architectures, including compact BERT (Devlin et al. 2019) variants (e.g., ModernBERT (Warner et al. 2024)), allow us to build highly accurate classifiers with as few as approximately 400M parameters that achieve rapid inference speeds even on standard CPU hardware. We systematically explore five progressively sophisticated transformer-based architectures: Sharanga (baseline transformer classifier), Mahendra (enhanced attention-weighted pooling with deeper heads), Vaishnava and Ashwina (hybrid neural ensemble architectures), and Raudra (an advanced multi-task framework with specialized loss functions). Our models are rigorously benchmarked…
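The abstract describes Mahendra as using attention-weighted pooling. As a rough illustration of that general idea only, and not the authors' implementation, the Python sketch below pools per-token embeddings into a single vector using softmax weights. The function name `attention_weighted_pool`, the single `score_vector` standing in for a learned scoring layer, and the toy inputs are all assumptions made for illustration; this summary does not specify the paper's actual scoring layer or classification head.

```python
import math
from typing import List, Sequence


def attention_weighted_pool(
    token_embeddings: Sequence[Sequence[float]],
    attention_mask: Sequence[int],
    score_vector: Sequence[float],
) -> List[float]:
    """Pool token embeddings into one vector using softmax attention weights.

    Hypothetical helper: `score_vector` stands in for a learned scoring layer.
    Tokens whose mask entry is 0 (padding) are ignored entirely.
    """
    if len(token_embeddings) != len(attention_mask):
        raise ValueError("token_embeddings and attention_mask must have equal length")

    # Keep only real (unmasked) tokens.
    kept = [emb for emb, m in zip(token_embeddings, attention_mask) if m]
    if not kept:
        raise ValueError("at least one unmasked token is required")

    # One scalar score per token: dot product with the scoring vector.
    scores = [sum(e * w for e, w in zip(emb, score_vector)) for emb in kept]

    # Numerically stable softmax over the token scores.
    max_score = max(scores)
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    weights = [x / total for x in exps]

    # Weighted sum of token embeddings, one output value per dimension.
    dim = len(score_vector)
    return [sum(w * emb[d] for w, emb in zip(weights, kept)) for d in range(dim)]


if __name__ == "__main__":
    embeddings = [[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]  # last row is padding
    mask = [1, 1, 0]
    print(attention_weighted_pool(embeddings, mask, score_vector=[0.0, 0.0]))
```

With an all-zero scoring vector every unmasked token gets the same weight, so the sketch reduces to plain mean pooling. A non-zero scoring vector shifts weight toward the tokens it scores highest, which is the motivation for this kind of pooling in a classifier.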
Taxonomy
Topics: Adversarial Robustness in Machine Learning · Topic Modeling · Explainable Artificial Intelligence (XAI)
