Constructing Ancestral Recombination Graphs through Reinforcement Learning

M\'elanie Raymond (1); Marie-H\'el\`ene Descary (1); C\'edric Beaulac (1); Fabrice Larribe (1) ((1) Universit\'e du Qu\'ebec \`a Montr\'eal)

arXiv:2406.12022·cs.LG·June 16, 2025

Constructing Ancestral Recombination Graphs through Reinforcement Learning

M\'elanie Raymond (1), Marie-H\'el\`ene Descary (1), C\'edric Beaulac (1), Fabrice Larribe (1) ((1) Universit\'e du Qu\'ebec \`a Montr\'eal)

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach to construct ancestral recombination graphs (ARGs), achieving comparable or shorter graphs than heuristic methods and enabling distributional analysis and generalization to new samples.

Contribution

The paper presents a novel RL-based method for building short ARGs, outperforming heuristic algorithms and allowing for distributional and generalization capabilities.

Findings

01

RL can produce ARGs as short as or shorter than heuristic methods

02

The method enables generating a distribution of short ARGs for a sample

03

The approach generalizes well to new, unseen samples

Abstract

Over the years, many approaches have been proposed to build ancestral recombination graphs (ARGs), graphs used to represent the genetic relationship between individuals. Among these methods, many rely on the assumption that the most likely graph is among the shortest ones. In this paper, we propose a new approach to build short ARGs: Reinforcement Learning (RL). We exploit the similarities between finding the shortest path between a set of genetic sequences and their most recent common ancestor and finding the shortest path between the entrance and exit of a maze, a classic RL problem. In the maze problem, the learner, called the agent, must learn the directions to take in order to escape as quickly as possible, whereas in our problem, the agent must learn the actions to take between coalescence, mutation, and recombination in order to reach the most recent common ancestor as quickly as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDNA and Biological Computing

MethodsSparse Evolutionary Training