REFN: A Reinforcement-Learning-From-Network Framework against 1-day/n-day Exploitations
Tianlong Yu, Lihong Liu, Ziyi Zhou, Fudu Xing, Kailong Wang, Yang Yang

TL;DR
REFN is a novel reinforcement learning framework that autonomously generates network filters using large language models to prevent rapid exploitations across large-scale networks, addressing scalability, compatibility, and robustness issues.
Contribution
REFN introduces a scalable, compatible, and robust framework that trains LLMs with reinforcement learning to generate network filters for exploit prevention, overcoming limitations of existing defenses.
Findings
21.1% higher accuracy than alternatives
Mean Time To Patch reduced to 3.65 hours
Easily scales to 10,000 devices
Abstract
The exploitation of 1 day or n day vulnerabilities poses severe threats to networked devices due to massive deployment scales and delayed patching (average Mean Time To Patch exceeds 60 days). Existing defenses, including host based patching and network based filtering, are inadequate due to limited scalability across diverse devices, compatibility issues especially with embedded or legacy systems, and error prone deployment process (manual patch validation). To address these issues, we introduce REFN (Reinforcement Learning From Network), a novel framework that trains Large Language Models (LLMs) to autonomously generate network filters to prevent 1 day or n day exploitations. REFN ensures scalability by uniquely employs Reinforcement Learning (RL) driven by online network rewards instead of traditional Human Feedback (RLHF). REFN guarantees compatibility via unified deployment on edge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
