# Constructing ancestral recombination graphs through reinforcement learning

**Authors:** Mélanie Raymond, Marie-Hélène Descary, Cédric Beaulac, Fabrice Larribe

PMC · DOI: 10.3389/fgene.2025.1569358 · Frontiers in Genetics · 2025-04-29

## TL;DR

This paper introduces a new method using reinforcement learning to build ancestral recombination graphs with fewer recombination events.

## Contribution

The novelty is applying reinforcement learning to construct maximum parsimony ancestral recombination graphs.

## Key findings

- Reinforcement learning can build ARGs with as few recombination events as heuristic algorithms.
- The method generalizes to new samples not used during training.
- It produces a distribution of ARGs with minimal recombination events.

## Abstract

Over the years, many approaches have been proposed to build ancestral recombination graphs (ARGs), graphs used to represent the genetic relationship between individuals. Among these methods, many rely on the assumption that the most likely graph is among those with the fewest recombination events. In this paper, we propose a new approach to build maximum parsimony ARGs: Reinforcement Learning (RL).

We exploit the similarities between finding the shortest path between a set of genetic sequences and their most recent common ancestor and finding the shortest path between the entrance and exit of a maze, a classic RL problem. In the maze problem, the learner, called the agent, must learn the directions to take in order to escape as quickly as possible, whereas in our problem, the agent must learn the actions to take between coalescence, mutation, and recombination in order to reach the most recent common ancestor as quickly as possible.

Our results show that RL can be used to build ARGs with as few recombination events as those built with a heuristic algorithm optimized to build minimal ARGs, and sometimes even fewer. Moreover, our method allows to build a distribution of ARGs with few recombination events for a given sample, and can also generalize learning to new samples not used during the learning process.

RL is a promising and innovative approach to build ARGs. By learning to construct ARGs just from the data, our method differs from conventional methods that rely on heuristic rules or complex theoretical models.

## Full-text entities

- **Genes:** ABL2 (ABL proto-oncogene 2, non-receptor tyrosine kinase) [NCBI Gene 27] {aka ABLL, ARG}
- **Diseases:** RL (MESH:D007859), ARGs (MESH:C535296)
- **Species:** Homo sapiens (human, species) [taxon 9606], Crohivirus B (no rank) [taxon 2169854]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12069460/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12069460/full.md

## References

62 references — full list in the complete paper: https://tomesphere.com/paper/PMC12069460/full.md

---
Source: https://tomesphere.com/paper/PMC12069460