Y-Trim: Evidence-gated Adaptase tail trimming for single-stranded bisulfite sequencing
Yihan Fang

TL;DR
Y-Trim is an evidence-gated framework for adaptive Adaptase tail trimming in single-stranded whole-genome bisulfite sequencing (ssWGBS); it improves trimming accuracy by explicitly modeling uncertainty and tailoring decisions to the sequencing chemistry.
Contribution
The paper introduces Y-Trim, a chemistry-specific, evidence-based approach to tail trimming that separates admission (whether to trim) from inference (where to trim), improving accuracy in ssWGBS preprocessing.
Findings
Y-Trim demonstrates stable performance across public and simulated datasets.
It explicitly models uncertainty, leading to more accurate tail trimming.
Y-Trim outperforms fixed boundary rules in diverse conditions.
Abstract
Background: Single-stranded whole-genome bisulfite sequencing (ssWGBS) enables DNA methylation profiling in low-input and highly fragmented material, including cell-free DNA. In widely used post-bisulfite protocols, Adaptase-mediated tailing adds stochastic, template-free end sequence. Unlike adapter-defined junctions, these tails lack a fixed sequence template, so trimming must be decided from FASTQ-stage observables under intrinsic uncertainty. Results: We show that bisulfite-induced compositional degeneracy implies a strictly positive error floor for any fixed per-read boundary rule under a finite nucleotide alphabet. Guided by this limit, we introduce Y-Trim, an evidence-gated framework that separates admission (should we trim) from inference (where to trim). For Read 2, Y-Trim performs per-read adaptive cut placement via a fixed, chemistry-typed matrix-linear texture scoring…
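The split between admission and inference described in the abstract can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function names (`admit`, `infer_cut`, `trim_read`), the per-base weights, the thresholds, and the assumption that the tail sits at the start of the read are all hypothetical, chosen only to show the two-stage structure. The simple additive score stands in for the chemistry-typed texture scoring, whose actual form is not given in the text above.

```python
from typing import Dict, Tuple

# Hypothetical per-base weights: positive = tail-like, negative = genomic-like.
# Illustrative placeholders only; they do not describe real Adaptase chemistry.
WEIGHTS: Dict[str, float] = {"G": 1.0, "A": 0.5, "C": -1.0, "T": -1.0, "N": 0.0}
TAIL_LIKE = {"G", "A"}  # hypothetical set of tail-like bases


def admit(seq: str, window: int = 8, min_fraction: float = 0.75) -> bool:
    """Admission: is there enough evidence that a tail is present at all?"""
    head = seq[:window]
    if not head:
        return False
    tail_like = sum(1 for base in head if base in TAIL_LIKE)
    return tail_like / len(head) >= min_fraction


def infer_cut(seq: str, max_tail: int = 20) -> Tuple[int, float]:
    """Inference: pick the prefix length with the highest cumulative score."""
    best_k, best_score, running = 0, 0.0, 0.0
    for k, base in enumerate(seq[:max_tail], start=1):
        running += WEIGHTS.get(base, 0.0)
        if running > best_score:  # strict '>' keeps the earliest maximum
            best_k, best_score = k, running
    return best_k, best_score


def trim_read(seq: str) -> str:
    """Trim only when the admission gate passes; otherwise return the read as-is."""
    if not admit(seq):
        return seq
    cut, _ = infer_cut(seq)
    return seq[cut:]


if __name__ == "__main__":
    print(trim_read("GGAGGGAGTTCATTCTATTC"))  # tail-like prefix is removed
    print(trim_read("TTCATTCTATTCAATTC"))     # gate fails, read is unchanged
```

Keeping the gate separate from cut placement means a read with weak evidence is left untouched instead of being cut at a default position, which is the behavior a fixed per-read boundary rule cannot provide.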
Taxonomy
Topics: Epigenetics and DNA Methylation · Genomics and Phylogenetic Studies · Genomics and Rare Diseases
