# AmalgaMo: flexible DNA motif merging

**Authors:** Orsolya Lapohos, Gregory J Fonseca

PMC · DOI: 10.1093/bioadv/vbag043 · Bioinformatics Advances · 2026-02-11

## TL;DR

AmalgaMo is a command-line tool that merges highly similar DNA motifs, reducing redundancy in motif databases so that regression-based motif enrichment analysis predicts upstream regulators more reliably.

## Contribution

AmalgaMo introduces a novel motif merging algorithm optimized for regression-based motif enrichment analysis.

## Key findings

- Merging motifs with AmalgaMo improves regression-based motif enrichment analysis.
- AmalgaMo is an efficient and flexible command-line tool for motif merging.
- Detailed documentation accompanies the tool and can serve as a reference for researchers inferring upstream regulators from genomic data.

## Abstract

Inference of candidate upstream regulators via motif enrichment analysis is a common step in the interpretation of genomic data. However, redundancy in motif databases can negatively impact predictive value, especially when relying on regression-based motif enrichment analysis. Although various forms of motif clustering have been used to mitigate problems caused by redundancy, an algorithm optimized for downstream regression-based analysis is needed.
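One way to see why redundancy hurts regression in particular: two near-duplicate motifs produce nearly collinear predictors, which inflates the variance of their fitted coefficients. The sketch below is illustrative only and is not taken from the paper. The per-region match scores are made-up values, and the variance inflation factor (VIF) for a pair of predictors, 1 / (1 - r²), is used here as a generic collinearity diagnostic rather than as anything AmalgaMo computes.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equal-length score vectors."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def variance_inflation_factor(r):
    """VIF for one of two predictors whose correlation is r."""
    return 1.0 / (1.0 - r * r)

# Hypothetical motif match scores across six genomic regions (assumed data).
motif_a = [0.90, 0.10, 0.80, 0.20, 0.70, 0.30]
motif_b = [0.88, 0.12, 0.79, 0.22, 0.71, 0.28]  # near-duplicate of motif_a
motif_c = [0.20, 0.90, 0.40, 0.10, 0.60, 0.50]  # unrelated motif

r_redundant = pearson(motif_a, motif_b)
r_distinct = pearson(motif_a, motif_c)

print(f"a vs b: r = {r_redundant:.3f}, VIF = {variance_inflation_factor(r_redundant):.1f}")
print(f"a vs c: r = {r_distinct:.3f}, VIF = {variance_inflation_factor(r_distinct):.2f}")
```

In this toy data the near-duplicate pair has a VIF in the hundreds while the unrelated pair stays close to 1, which is the kind of instability that merging redundant motifs is meant to avoid.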

We introduce AmalgaMo, an efficient and flexible command-line tool for merging highly similar motifs. Using publicly available human datasets, we demonstrate that merging motifs with our optimized settings greatly benefits regression-based motif enrichment analysis, and we provide detailed documentation that can serve as a reference for researchers inferring upstream regulators from genomic data.
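To make "merging highly similar motifs" concrete, here is a minimal sketch of one generic approach. It is not AmalgaMo's algorithm or interface, neither of which this summary describes. It assumes that motifs are position probability matrices of equal length that are already aligned, that similarity is the mean per-column Pearson correlation, and that merging is a simple column-wise average. The function names and the 0.9 threshold are hypothetical.

```python
from math import sqrt

def column_correlation(p, q):
    """Pearson correlation between two columns of base probabilities (A, C, G, T)."""
    mean_p = sum(p) / len(p)
    mean_q = sum(q) / len(q)
    cov = sum((a - mean_p) * (b - mean_q) for a, b in zip(p, q))
    var_p = sum((a - mean_p) ** 2 for a in p)
    var_q = sum((b - mean_q) ** 2 for b in q)
    if var_p == 0 or var_q == 0:
        return 0.0  # a uniform column carries no signal to correlate
    return cov / sqrt(var_p * var_q)

def motif_similarity(m1, m2):
    """Mean per-column correlation; assumes equal-length, pre-aligned motifs."""
    if len(m1) != len(m2):
        raise ValueError("this sketch only handles motifs of equal length")
    return sum(column_correlation(p, q) for p, q in zip(m1, m2)) / len(m1)

def merge_if_similar(m1, m2, threshold=0.9):
    """Return the column-wise average if similarity meets the (hypothetical) threshold."""
    if motif_similarity(m1, m2) < threshold:
        return None
    return [[(a + b) / 2 for a, b in zip(p, q)] for p, q in zip(m1, m2)]

# Toy motifs (assumed data): each row is one position, columns are A, C, G, T.
m1 = [[0.80, 0.10, 0.05, 0.05], [0.10, 0.70, 0.10, 0.10], [0.05, 0.05, 0.10, 0.80]]
m2 = [[0.70, 0.10, 0.10, 0.10], [0.10, 0.80, 0.05, 0.05], [0.10, 0.10, 0.10, 0.70]]
m3 = [[0.10, 0.10, 0.70, 0.10], [0.70, 0.10, 0.10, 0.10], [0.10, 0.70, 0.10, 0.10]]

merged = merge_if_similar(m1, m2)       # similar pair: returns an averaged motif
not_merged = merge_if_similar(m1, m3)   # dissimilar pair: returns None
```

A practical tool would also need to handle motifs of different lengths, alignment offsets, and reverse complements. Consult the AmalgaMo documentation on GitHub for how the tool itself addresses these.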

AmalgaMo is available on GitHub at https://github.com/lapohosorsolya/AmalgaMo.

## Full-text entities

- **Genes:** NFIC (nuclear factor I C) [NCBI Gene 4782] {aka CTF, CTF5, NF-I, NF-I/C, NF1-C, NFI}, F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}, NR4A2 (nuclear receptor subfamily 4 group A member 2) [NCBI Gene 4929] {aka HZF-3, IDLDP, NOT, NURR1, RNR1, TINUR}, NR4A1 (nuclear receptor subfamily 4 group A member 1) [NCBI Gene 3164] {aka GFRP1, HMR, N10, NAK-1, NGFIB, NP10}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, HIC1 (HIC ZBTB transcriptional repressor 1) [NCBI Gene 3090] {aka ZBTB29, ZNF901, hic-1}, XCL1 (X-C motif chemokine ligand 1) [NCBI Gene 6375] {aka ATAC, LPTN, LTN, SCM-1, SCM-1a, SCM1}
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12947577/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12947577/full.md

## References

20 references — full list in the complete paper: https://tomesphere.com/paper/PMC12947577/full.md

---
Source: https://tomesphere.com/paper/PMC12947577