# Branes with Brains: Exploring String Vacua with Deep Reinforcement   Learning

**Authors:** James Halverson, Brent Nelson, and Fabian Ruehle

arXiv: 1903.11616 · 2019-06-26

## TL;DR

This paper introduces a deep reinforcement learning approach to explore string vacua, significantly improving the efficiency of finding consistent string models and discovering new strategies in the complex landscape of string theory solutions.

## Contribution

It applies an asynchronous advantage actor-critic reinforcement learning method to string compactifications, demonstrating improved solution discovery and autonomous strategy learning in string theory.

## Key findings

- Agent finds 200 times more solutions than random search.
- Learns human-like strategies for model building.
- Discovers new strategies without human guidance.

## Abstract

We propose deep reinforcement learning as a model-free method for exploring the landscape of string vacua. As a concrete application, we utilize an artificial intelligence agent known as an asynchronous advantage actor-critic to explore type IIA compactifications with intersecting D6-branes. As different string background configurations are explored by changing D6-brane configurations, the agent receives rewards and punishments related to string consistency conditions and proximity to Standard Model vacua. These are in turn utilized to update the agent's policy and value neural networks to improve its behavior. By reinforcement learning, the agent's performance in both tasks is significantly improved, and for some tasks it finds a factor of O(200) more solutions than a random walker. In one case, we demonstrate that the agent learns a human-derived strategy for finding consistent string models. In another case, where no human-derived strategy exists, the agent learns a genuinely new strategy that achieves the same goal twice as efficiently per unit time. Our results demonstrate that the agent learns to solve various string theory consistency conditions simultaneously, which are phrased in terms of non-linear, coupled Diophantine equations.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.11616/full.md

## Figures

29 figures with captions in the complete paper: https://tomesphere.com/paper/1903.11616/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/1903.11616/full.md

---
Source: https://tomesphere.com/paper/1903.11616