Reliable scaling of Position Weight Matrices for binding strength comparisons between transcription factors
Xiaoyan Ma, Daphne Ezer, Carmen Navarro, Boris Adryan

TL;DR
This paper introduces two methods to accurately scale PWM scores, enabling direct comparison of transcription factor binding strengths across different PWMs and biological contexts.
Contribution
The authors present novel approaches to determine the scaling parameter λ for PWMs, facilitating reliable energy inference and cross-PWM comparisons for transcription factors.
Findings
λ distributions align with DNA binding properties across TF families
Methods produce consistent scaling parameters
Enables direct comparison of PWMs from different sources
Abstract
Scoring DNA sequences against Position Weight Matrices (PWMs) is a widely adopted method to identify putative transcription factor binding sites. While common bioinformatics tools produce scores that can reflect the binding strength between a specific transcription factor and the DNA, these scores are not directly comparable between different transcription factors. Here, we provide two different ways to find the scaling parameter that allows us to infer binding energy from a PWM score. The first approach uses a PWM and background genomic sequence as input to estimate for a specific transcription factor, which we applied to show that distributions for different transcription factor families correspond with their DNA binding properties. Our second method can reliably convert between different PWMs of the same transcription factor, which allows us to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Chromatin Dynamics · RNA and protein synthesis mechanisms · Genetic and phenotypic traits in livestock
