TL;DR
RAGnaroX is a secure, on-premise ChatOps assistant built with small language models, offering competitive accuracy and resource efficiency on commodity hardware, with full auditable control and modular architecture.
Contribution
It introduces RAGnaroX, a fully local, resource-efficient ChatOps assistant with modular design, implemented in Rust, and evaluated on multiple QA datasets.
Findings
Achieves 0.90 context precision on single-hop QA
Response time averages 2.5 seconds per request
Maintains competitive accuracy with resource efficiency
Abstract
This paper introduces RAGnaroX, a resource-efficient ChatOps assistant that operates entirely on commodity hardware. Unlike existing solutions that often rely on external providers such as Azure or OpenAI, RAGnaroX offers a fully auditable, on-premise stack implemented in Rust. Its architecture integrates modular data ingestion, hybrid retrieval, and function calling, enabling flexible yet secure deployment. Our evaluation focuses on the RAG pipeline, with benchmarks conducted on the SQuAD (single-hop QA), MultiHopRAG (multi-hop QA), and MLQA (cross-lingual QA) datasets. Results show that RAGnaroX achieves competitive accuracy while maintaining strong resource efficiency, for example, reaching 0.90 context precision on single-hop questions with an average response time of 2.5 seconds per request. A replication package containing the tool, the demonstration video…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
