Daisy: An integrated repeat protein curation service

Manuel Bezerra-Brandao; Ronaldo Romario Tunque Cahui; Layla Hirsh

arXiv:2407.07817·cs.SE·July 11, 2024

Daisy: An integrated repeat protein curation service

Manuel Bezerra-Brandao, Ronaldo Romario Tunque Cahui, Layla Hirsh

PDF

TL;DR

Daisy is a web service that streamlines the identification, classification, and curation of protein tandem repeats by integrating multiple databases and algorithms, thereby accelerating research in repeat protein analysis.

Contribution

It introduces an integrated platform combining databases and algorithms for repeat protein curation, enhancing efficiency and accuracy over manual methods.

Findings

01

Supports PDB and AlphaFold entries for repeat identification

02

Uses HMM library searches for sequence classification

03

Accelerates repeat unit detection with integrated tools

Abstract

Tandem repeats in proteins identification, classification and curation is a complex process that requires manual processing from experts, processing power and time. There are recent and relevant advances applying machine learning for protein structure prediction and repeat classification that are useful for this process. However, no service contemplates required databases and software to supplement researching on repeat proteins. In this publication we present Daisy, an integrated repeat protein curation web service. This service can process Protein Data Bank (PDB) and the AlphaFold Database entries for tandem repeats identification. In addition, it uses an algorithm to search a sequence against a library of Pfam hidden Markov model (HMM). Repeat classifications are associated with the identified families through RepeatsDB. This prediction is considered for enhancing the ReUPred…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.