JavelinGuard: Low-Cost Transformer Architectures for LLM Security

Yash Datta; Sharath Rajasekar

arXiv:2506.07330·cs.LG·June 10, 2025

JavelinGuard: Low-Cost Transformer Architectures for LLM Security

Yash Datta, Sharath Rajasekar

PDF

Open Access

TL;DR

JavelinGuard introduces a suite of low-cost, high-performance transformer architectures optimized for detecting malicious intent in LLM interactions, balancing accuracy, speed, and resource efficiency for practical deployment.

Contribution

The paper presents novel transformer-based models specifically designed for LLM security, with comprehensive benchmarking across diverse datasets and comparison to existing guardrail models.

Findings

01

Raudra offers the most robust performance among the architectures.

02

All models outperform existing open-source guardrails in cost-performance trade-offs.

03

Models achieve high accuracy with fewer than 400M parameters.

Abstract

We present JavelinGuard, a suite of low-cost, high-performance model architectures designed for detecting malicious intent in Large Language Model (LLM) interactions, optimized specifically for production deployment. Recent advances in transformer architectures, including compact BERT(Devlin et al. 2019) variants (e.g., ModernBERT (Warner et al. 2024)), allow us to build highly accurate classifiers with as few as approximately 400M parameters that achieve rapid inference speeds even on standard CPU hardware. We systematically explore five progressively sophisticated transformer-based architectures: Sharanga (baseline transformer classifier), Mahendra (enhanced attention-weighted pooling with deeper heads), Vaishnava and Ashwina (hybrid neural ensemble architectures), and Raudra (an advanced multi-task framework with specialized loss functions). Our models are rigorously benchmarked…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Explainable Artificial Intelligence (XAI)