# MUSICiAn: genome-wide identification of genes involved in DNA repair via control-free mutational spectra analysis

**Authors:** Colm Seale, Marco Barazas, Robin van Schendel, Marcel Tijsterman, Joana P Gonçalves

PMC · DOI: 10.1093/nargab/lqaf202 · NAR Genomics and Bioinformatics · 2026-01-08

## TL;DR

This paper introduces MUSICiAn, a new method to identify genes involved in DNA repair by analyzing mutation patterns without needing control groups.

## Contribution

MUSICiAn is a novel control-free method for identifying DNA repair genes using mutational spectra deviations.

## Key findings

- MUSICiAn successfully estimates pseudo-controls for Repair-seq screens and identifies 476 genes.
- The method recovers known DNA repair genes and highlights the spliceosome as a potential player.
- MUSICiAn reveals new candidate genes for further investigation in DNA repair.

## Abstract

Understanding DNA double-strand break (DSB) repair is crucial for the development of targeted anticancer therapies, yet the roles of many genes remain unclear. Recent studies show that disruption of known DSB repair genes can alter the sequence-specific distribution of mutations arising after DSB repair, suggesting that genome-wide perturbation screens could be leveraged to identify new DSB genes leading to distinct deviations from the expected wild-type distribution. Given the challenges of designing controls for a genome-wide screen, we explore the high gene throughput to forgo the use of traditional controls by reframing the analysis as an outlier detection problem, assuming that most genes have minimal influence on DSB repair outcomes. We propose MUSICiAn (Mutational Signature Catalogue Analysis), a compositional data analysis method that ranks gene perturbation impact on mutational spectra without controls by measuring deviations from the central tendency considering the distribution of all spectra. We show that MUSICiAn effectively estimates pseudo-controls for the Repair-seq screen, yielding 476 genes and 60 nontargeting controls. We further apply MUSICiAn to the first genome-wide screen of 18 406 genes with mutational spectra readout, MUSIC, reporting that MUSICiAn successfully recovers known DSB repair genes, highlights the spliceosome as a lesser-appreciated player, and reveals candidates for further investigation.

## Full-text entities

- **Genes:** MRE11 (MRE11 double strand break repair nuclease) [NCBI Gene 4361] {aka ATLD, HNGS1, MRE11A, MRE11B}, XRCC6 (X-ray repair cross complementing 6) [NCBI Gene 2547] {aka CTC75, CTCBF, G22P1, KU70, ML8, TLAA}, MSH6 (mutS homolog 6) [NCBI Gene 2956] {aka GTBP, GTMBP, HNPCC5, HSAP, LYNCH5, MMRCS3}, MSH2 (mutS homolog 2) [NCBI Gene 4436] {aka COCA1, FCC1, HNPCC, HNPCC1, LCFS2, LYNCH1}, BRCA1 (BRCA1 DNA repair associated) [NCBI Gene 672] {aka BRCAI, BRCC1, BROVCA1, FANCS, IRIS, PNCA4}, MLYCD (malonyl-CoA decarboxylase) [NCBI Gene 23417] {aka MCD}, DCLRE1C (DNA cross-link repair 1C) [NCBI Gene 64421] {aka A-SCID, DCLREC1C, RS-SCID, SCIDA, SNM1C}, ATP6V1G1 (ATPase H+ transporting V1 subunit G1) [NCBI Gene 9550] {aka ATP6G, ATP6G1, ATP6GL, ATP6J, Vma10}, RAD50 (RAD50 double strand break repair protein) [NCBI Gene 10111] {aka NBSLD, RAD502, hRad50}, COL11A2 (collagen type XI alpha 2 chain) [NCBI Gene 1302] {aka DFNA13, DFNB53, FBCG2, HKE5, OSMEDA, OSMEDB}, RNF168 (ring finger protein 168) [NCBI Gene 165918] {aka RIDL, hRNF168}, RNF8 (ring finger protein 8) [NCBI Gene 9025] {aka hRNF8}, HELQ (helicase, POLQ like) [NCBI Gene 113510] {aka HEL308}, XRCC5 (X-ray repair cross complementing 5) [NCBI Gene 7520] {aka KARP-1, KARP1, KU80, KUB2, Ku86, NFIV}, H2AX (H2A.X variant histone) [NCBI Gene 3014] {aka H2A.X, H2A/X, H2AFX}, PRKDC (protein kinase, DNA-activated, catalytic subunit) [NCBI Gene 5591] {aka DNA-PKC, DNA-PKcs, DNAPK, DNAPKc, DNPK1, HYRC}, NBN (nibrin) [NCBI Gene 4683] {aka AT-V1, AT-V2, ATV, NBS, NBS1, P95}, ATM (ATM serine/threonine kinase) [NCBI Gene 472] {aka AT1, ATA, ATC, ATD, ATDC, ATE}, METAP2 (methionyl aminopeptidase 2) [NCBI Gene 10988] {aka MAP2, MNPEP, p67eIF2}
- **Diseases:** Fanconi anemia (MESH:D005199), FA (MESH:C565561), Fanconi anaemia (MESH:D000743), cancer (MESH:D009369), HDR (MESH:D006086), deficient (MESH:D007153), DSB (MESH:D019457)
- **Chemicals:** DSB (-)
- **Species:** Mus musculus (house mouse, species) [taxon 10090]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12783044/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12783044/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/PMC12783044/full.md

---
Source: https://tomesphere.com/paper/PMC12783044