# QFlip: An Adaptive Reinforcement Learning Strategy for the FlipIt   Security Game

**Authors:** Lisa Oakley, Alina Oprea

arXiv: 1906.11938 · 2019-12-24

## TL;DR

This paper introduces QFlip, an adaptive reinforcement learning strategy for the FlipIt security game, demonstrating convergence and superior performance against various opponent strategies, and providing an open-source environment for future research.

## Contribution

We develop QFlip, a novel RL-based adaptive strategy for FlipIt, with proven convergence and improved performance over existing strategies, along with an open-source simulation environment.

## Key findings

- QFlip converges to optimal strategies against periodic opponents.
- QFlip outperforms greedy strategies in multiple distributions.
- The open-source environment facilitates further research.

## Abstract

A rise in Advanced Persistent Threats (APTs) has introduced a need for robustness against long-running, stealthy attacks which circumvent existing cryptographic security guarantees. FlipIt is a security game that models attacker-defender interactions in advanced scenarios such as APTs. Previous work analyzed extensively non-adaptive strategies in FlipIt, but adaptive strategies rise naturally in practical interactions as players receive feedback during the game. We model the FlipIt game as a Markov Decision Process and introduce QFlip, an adaptive strategy for FlipIt based on temporal difference reinforcement learning. We prove theoretical results on the convergence of our new strategy against an opponent playing with a Periodic strategy. We confirm our analysis experimentally by extensive evaluation of QFlip against specific opponents. QFlip converges to the optimal adaptive strategy for Periodic and Exponential opponents using associated state spaces. Finally, we introduce a generalized QFlip strategy with composite state space that outperforms a Greedy strategy for several distributions including Periodic and Uniform, without prior knowledge of the opponent's strategy. We also release an OpenAI Gym environment for FlipIt to facilitate future research.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.11938/full.md

## Figures

25 figures with captions in the complete paper: https://tomesphere.com/paper/1906.11938/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/1906.11938/full.md

---
Source: https://tomesphere.com/paper/1906.11938