Identification of Capture Phases in Nanopore Protein Sequencing Data Using a Deep Learning Model
Annabelle Martin, Daphne Kontogiorgos-Heintz, Jeff Nivala

TL;DR
This paper presents a lightweight deep learning model that accurately detects capture phases in nanopore protein sequencing data, significantly reducing analysis time and enabling real-time processing.
Contribution
The authors developed CaptureNet-Deep, a simple yet effective 1D CNN that outperforms other models in detecting capture phases, facilitating rapid, real-time analysis in nanopore sequencing workflows.
Findings
CaptureNet-Deep achieved an F1 score of 0.94 on test data.
The model reduced analysis time from days to under thirty minutes.
Integration into a dashboard enables real-time capture detection.
Abstract
Nanopore protein sequencing produces long, noisy ionic current traces in which key molecular phases, such as protein capture and translocation, are embedded. Capture phases mark the successful entry of a protein into the pore and serve as both a checkpoint and a signal that a channel merits further analysis. However, manual identification of capture phases is time-intensive, often requiring several days for expert reviewers to annotate the data due to the need for domain-specific interpretation of complex signal patterns. To address this, a lightweight one-dimensional convolutional neural network (1D CNN) was developed and trained to detect capture phases in down-sampled signal windows. Evaluated against CNN-LSTM (Long Short-Term Memory) hybrids, histogram-based classifiers, and other CNN variants using run-level data splits, our best model, CaptureNet-Deep, achieved an F1 score of 0.94…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNanopore and Nanochannel Transport Studies · Genomics and Phylogenetic Studies · Protein Structure and Dynamics
