Constructing Ancestral Recombination Graphs through Reinforcement Learning
M\'elanie Raymond (1), Marie-H\'el\`ene Descary (1), C\'edric Beaulac (1), Fabrice Larribe (1) ((1) Universit\'e du Qu\'ebec \`a Montr\'eal)

TL;DR
This paper introduces a reinforcement learning approach to construct ancestral recombination graphs (ARGs), achieving comparable or shorter graphs than heuristic methods and enabling distributional analysis and generalization to new samples.
Contribution
The paper presents a novel RL-based method for building short ARGs, outperforming heuristic algorithms and allowing for distributional and generalization capabilities.
Findings
RL can produce ARGs as short as or shorter than heuristic methods
The method enables generating a distribution of short ARGs for a sample
The approach generalizes well to new, unseen samples
Abstract
Over the years, many approaches have been proposed to build ancestral recombination graphs (ARGs), graphs used to represent the genetic relationship between individuals. Among these methods, many rely on the assumption that the most likely graph is among the shortest ones. In this paper, we propose a new approach to build short ARGs: Reinforcement Learning (RL). We exploit the similarities between finding the shortest path between a set of genetic sequences and their most recent common ancestor and finding the shortest path between the entrance and exit of a maze, a classic RL problem. In the maze problem, the learner, called the agent, must learn the directions to take in order to escape as quickly as possible, whereas in our problem, the agent must learn the actions to take between coalescence, mutation, and recombination in order to reach the most recent common ancestor as quickly as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing
MethodsSparse Evolutionary Training
