A Brief Review of Data Mining Application Involving Protein Sequence Classification
Suprativ Saha, Rituparna Chaki

TL;DR
This paper reviews various data mining techniques for protein sequence classification, compares models like neural networks and fuzzy ARTMAP, and proposes a new method to improve accuracy and reduce computational costs.
Contribution
It provides a comparative review of existing classification models and introduces a novel technique to enhance efficiency and accuracy in protein sequence classification.
Findings
Neural networks, Fuzzy ARTMAP, and Rough Set classifiers are commonly used.
The proposed technique reduces computational overheads.
The new method aims to increase classification accuracy.
Abstract
Data mining techniques have been used by researchers for analyzing protein sequences. In protein analysis, especially in protein sequence classification, selection of feature is most important. Popular protein sequence classification techniques involve extraction of specific features from the sequences. Researchers apply some well-known classification techniques like neural networks, Genetic algorithm, Fuzzy ARTMAP, Rough Set Classifier etc for accurate classification. This paper presents a review is with three different classification models such as neural network model, fuzzy ARTMAP model and Rough set classifier model. A new technique for classifying protein sequences have been proposed in the end. The proposed technique tries to reduce the computational overheads encountered by earlier approaches and increase the accuracy of classification.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Fractal and DNA sequence analysis · Algorithms and Data Compression
