ReGAIN: Retrieval-Grounded AI Framework for Network Traffic Analysis

Shaghayegh Shajarian; Kennedy Marsh; James Benson; Sajad Khorsandroo; Mahmoud Abdelsalam

arXiv:2512.22223·cs.LG·March 18, 2026

ReGAIN: Retrieval-Grounded AI Framework for Network Traffic Analysis

Shaghayegh Shajarian, Kennedy Marsh, James Benson, Sajad Khorsandroo, Mahmoud Abdelsalam

PDF

Open Access

TL;DR

ReGAIN is a novel framework that combines traffic summarization, retrieval-augmented generation, and LLM reasoning to improve transparency and accuracy in network traffic analysis, outperforming existing methods.

Contribution

ReGAIN introduces a multi-stage, retrieval-grounded AI framework that enhances interpretability and reduces false positives in network traffic analysis.

Findings

01

Achieves 95.95% to 98.82% accuracy on real-world attack data.

02

Outperforms rule-based, classical ML, and deep learning baselines.

03

Provides explainability through verifiable, evidence-based responses.

Abstract

Modern networks generate vast, heterogeneous traffic that must be continuously analyzed for security and performance. Traditional network traffic analysis systems, whether rule-based or machine learning-driven, often suffer from high false positives and lack interpretability, limiting analyst trust. In this paper, we present ReGAIN, a multi-stage framework that combines traffic summarization, retrieval-augmented generation (RAG), and Large Language Model (LLM) reasoning for transparent and accurate network traffic analysis. ReGAIN creates natural-language summaries from network traffic, embeds them into a multi-collection vector database, and utilizes a hierarchical retrieval pipeline to ground LLM responses with evidence citations. The pipeline features metadata-based filtering, MMR sampling, a two-stage cross-encoder reranking mechanism, and an abstention mechanism to reduce…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInternet Traffic Analysis and Secure E-voting · Network Security and Intrusion Detection · Web Application Security Vulnerabilities