RLShield: Practical Multi-Agent RL for Financial Cyber Defense with Attack-Surface MDPs and Real-Time Response Orchestration
Srikumar Nayak

TL;DR
RLShield introduces a multi-agent reinforcement learning framework for real-time, adaptive cyber defense in financial systems, effectively balancing containment speed, operational disruption, and response costs.
Contribution
The paper presents RLShield, a novel multi-agent RL pipeline that models the attack surface as an MDP and optimizes coordinated responses under operational constraints.
Findings
RLShield reduces time-to-containment and residual exposure.
It outperforms static rules and single-agent RL in operational metrics.
The approach balances containment, disruption, and response costs effectively.
Abstract
Financial systems run nonstop and must stay reliable even during cyber incidents. Modern attacks move across many services (apps, APIs, identity, payment rails), so defenders must make a sequence of actions under time pressure. Most security tools still use fixed rules or static playbooks, which can be slow to adapt when the attacker changes behavior. Reinforcement learning (RL) is a good fit for sequential decisions, but much of the RL-in-finance literature targets trading and does not model real cyber response limits such as action cost, service disruption, and defender coordination across many assets. This paper proposes RLShield, a practical multi-agent RL pipeline for financial cyber defense. We model the enterprise attack surface as a Markov decision process (MDP) where states summarize alerts, asset exposure, and service health, and actions represent real response steps (e.g.,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation and Cyber Security · Network Security and Intrusion Detection · Smart Grid Security and Resilience
