NanoSSL: attention mechanism-based self-supervised learning method for protein identification using nanopores
Yong Xie, Jindong Li, Ziyan Zhang, Bin Meng, Shuaijian Dai, Yuchen Zhou, Eamonn Kennedy, Niandong Jiao, Haobin Chen, Zhuxin Dong

TL;DR
NanoSSL is a new machine learning method that improves protein identification from nanopore data using self-supervised learning and attention mechanisms.
Contribution
NanoSSL introduces a novel self-supervised learning framework with attention mechanisms for protein identification using nanopores.
Findings
NanoSSL achieved high performance in classifying mutated Aβ1-42 proteins using four metrics: accuracy, precision, recall, and F1 score.
The method leverages a masked autoencoder and attention mechanisms to learn useful representations from fragmented nanopore signals.
Self-supervised pretraining combined with supervised fine-tuning improves molecular identification in nanopore-based proteomics.
Abstract
Nanopores are cutting-edge interdisciplinary tools that can analyze biomolecules at the single-molecule level for many applications, e.g. DNA sequencing. Efforts are underway to extend nanopores to proteomics, including the development of machine learning algorithms for protein sequencing and identification. However, single-molecule data are intrinsically noisy and hard to process. Moreover, the development and performance of machine learning for nanopore is jeopardized by data scarcity. Self-supervised learning is an emerging method that may yield advantages in nanopore scenarios. We propose and experimentally validate Nanopore analysis using Self-Supervised Learning (NanoSSL), a generative self-supervised learning framework based on attention mechanisms for the identification of protein signals from nanopores. Leveraging a two-step approach consisting of self-supervised pre-training…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNanopore and Nanochannel Transport Studies · MicroRNA in disease regulation · Ion-surface interactions and analysis
