MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation
Marco Simoni, Andrea Saracino, Vinod P., Mauro Conti

TL;DR
MoRSE is a specialized AI chatbot for cybersecurity that uses two retrieval-augmented generation (RAG) systems to answer from continuously updated non-parametric knowledge bases, reporting over 10% better answer relevance and correctness than general-purpose large language models such as GPT-4.
Contribution
Introduces MoRSE, the first cybersecurity-specific RAG-based AI chatbot that retrieves real-time information from non-parametric sources for improved accuracy.
Findings
Over 10% improvement in answer relevance and correctness over GPT-4 and Mixtral 8x7B
Effective retrieval from multidimensional cybersecurity contexts
Real-time knowledge updates enhance answer accuracy
Abstract
In this paper, we introduce MoRSE (Mixture of RAGs Security Experts), the first specialised AI chatbot for cybersecurity. MoRSE aims to provide comprehensive and complete knowledge about cybersecurity. MoRSE uses two RAG (Retrieval Augmented Generation) systems designed to retrieve and organize information from multidimensional cybersecurity contexts. MoRSE differs from traditional RAGs by using parallel retrievers that work together to retrieve semantically related information in different formats and structures. Unlike traditional Large Language Models (LLMs) that rely on Parametric Knowledge Bases, MoRSE retrieves relevant documents from Non-Parametric Knowledge Bases in response to user queries. Subsequently, MoRSE uses this information to generate accurate answers. In addition, MoRSE benefits from real-time updates to its knowledge bases, enabling continuous knowledge enrichment…
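The parallel-retriever idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Retriever` class, the token-overlap scoring, the two toy knowledge bases, and every function name below are assumptions made for illustration only. It shows two retrievers queried concurrently over differently structured sources, their hits merged into one context for a generator, and a knowledge base updated at run time without retraining anything.

```python
from concurrent.futures import ThreadPoolExecutor
import re


def tokens(text):
    """Lowercase alphanumeric tokens of a string, as a set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap_score(query, text):
    """Fraction of query tokens that also occur in the text (toy relevance score)."""
    q, t = tokens(query), tokens(text)
    return len(q & t) / len(q) if q else 0.0


class Retriever:
    """Hypothetical retriever over an in-memory, non-parametric knowledge base."""

    def __init__(self, name, documents):
        self.name = name
        self.documents = list(documents)

    def add(self, document):
        # Knowledge update: new documents are searchable immediately.
        self.documents.append(document)

    def retrieve(self, query, k=2):
        scored = [(overlap_score(query, d), d) for d in self.documents]
        scored = [(s, d) for s, d in scored if s > 0]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [(s, self.name, d) for s, d in scored[:k]]


def parallel_retrieve(query, retrievers, k=2):
    """Query all retrievers concurrently and merge their hits by score."""
    with ThreadPoolExecutor(max_workers=len(retrievers)) as pool:
        futures = [pool.submit(r.retrieve, query, k) for r in retrievers]
        hits = [hit for f in futures for hit in f.result()]
    hits.sort(key=lambda h: h[0], reverse=True)
    return hits


def build_prompt(query, hits):
    """Assemble retrieved passages into a context block for a generator model."""
    context = "\n".join(f"[{source}] {doc}" for _, source, doc in hits)
    return f"Context:\n{context}\n\nQuestion: {query}"


# Two toy knowledge bases with different content styles (illustrative text only).
structured = Retriever("structured", [
    "SQL injection: attacker-controlled input alters a database query.",
    "Buffer overflow: writing past the end of a memory buffer.",
])
unstructured = Retriever("unstructured", [
    "Parameterized queries are a common defence against SQL injection.",
    "Phishing emails often imitate trusted senders.",
])

query = "How does SQL injection work?"
hits = parallel_retrieve(query, [structured, unstructured])
prompt = build_prompt(query, hits)

# A later knowledge-base update changes what can be retrieved.
xss_query = "What is cross-site scripting?"
hits_before = parallel_retrieve(xss_query, [structured, unstructured])
unstructured.add("Cross-site scripting injects script into pages viewed by other users.")
hits_after = parallel_retrieve(xss_query, [structured, unstructured])
```

In a real system the overlap score would be replaced by a proper retrieval model and `prompt` would be passed to a language model. The sketch only shows the control flow of parallel retrieval, merging, and run-time updates.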
Taxonomy
Topics: Data Quality and Management · Topic Modeling · Web Data Mining and Analysis
Methods: Linear Warmup With Linear Decay · Weight Decay · WordPiece · Attention Dropout · Adam · Label Smoothing · Linear Layer · Byte Pair Encoding · Layer Normalization
