# py_ped_sim: a flexible forward pedigree and genetic simulator for complex family pedigree analysis

**Authors:** Miguel Guardado, Cynthia Perez, Sthen Campana, Berenice Chavez Rojas, Joaquín Magaña, Shalom Jackson, Emily Samperio, Selena Hernandez, Kaela Syas, Ryan D. Hernandez, Elena I. Zavala, Rori V. Rohlfs

PMC · DOI: 10.1186/s12859-025-06142-z · BMC Bioinformatics · 2025-05-07

## TL;DR

py_ped_sim is a flexible tool for simulating family pedigrees and genomes, useful for medical, forensic, and evolutionary genetics research.

## Contribution

py_ped_sim introduces a flexible and accurate simulation of pedigrees and genomes with features like variable offspring and misattributed paternity.

## Key findings

- py_ped_sim accurately simulates pedigree structures and corresponding genomes.
- The tool allows for variable sibship sizes and simulates misattributed paternity events.
- Validation shows accurate levels of kinship in simulated genomes.

## Abstract

Large-scale family pedigrees are commonly used across medical, evolutionary, and forensic genetics. These pedigrees are tools for identifying genetic disorders, tracking evolutionary patterns, and establishing familial relationships via forensic genetic identification. However, there is a lack of software to accurately simulate different pedigree structures along with genomes corresponding to those individuals in a family pedigree. This limits simulation-based evaluations of methods that use pedigrees.

We have developed a python command-line-based tool called py_ped_sim that facilitates the simulation of pedigree structures and the genomes of individuals in a pedigree. py_ped_sim represents pedigrees as directed acyclic graphs, enabling conversion between standard pedigree formats and integration with the forward population genetic simulator, SLiM. Notably, py_ped_sim allows the simulation of varying numbers of offspring for a set of parents, with the capacity to shift the distribution of sibship sizes over generations. We additionally add simulations for events of misattributed paternity, which offers a way to simulate half-sibling relationships, and simulations to extend the breadth of a family pedigree. We validated the accuracy of both our genome simulator and pedigree simulator. We show that we can simulate genomes onto family pedigrees with levels of expected kinship.

py_ped_sim is a user-friendly and open-source solution for simulating pedigree structures and conducting pedigree genome simulations. It empowers medical, forensic, and evolutionary genetics researchers to gain deeper insights into the dynamics of genetic inheritance and relatedness within families.

The online version contains supplementary material available at 10.1186/s12859-025-06142-z.

## Full-text entities

- **Diseases:** genetic disorders (MESH:D030342)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12060417/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12060417/full.md

---
Source: https://tomesphere.com/paper/PMC12060417