Feature extraction from complex networks: A case of study in genomic sequences classification
Bruno Mendes Moro Conque, Andr\'e Yoshiaki Kashiwabara and, Fabr\'icio Martins Lopes

TL;DR
This paper introduces a novel feature extraction method from complex network measures and information theory for genomic sequence classification, achieving high accuracy with various machine learning classifiers.
Contribution
It proposes a new approach combining complex network and information theory measures for genomic sequence feature extraction, improving classification performance.
Findings
Random Forest achieved 91.2% accuracy
J48 classifier reached 89.1% accuracy
SVM obtained 84.8% accuracy
Abstract
This work presents a new approach for classification of genomic sequences from measurements of complex networks and information theory. For this, it is considered the nucleotides, dinucleotides and trinucleotides of a genomic sequence. For each of them, the entropy, sum entropy and maximum entropy values are calculated.For each of them is also generated a network, in which the nodes are the nucleotides, dinucleotides or trinucleotides and its edges are estimated by observing the respective adjacency among them in the genomic sequence. In this way, it is generated three networks, for which measures of complex networks are extracted.These measures together with measures of information theory comprise a feature vector representing a genomic sequence. Thus, the feature vector is used for classification by methods such as SVM, MultiLayer Perceptron, J48, IBK, Naive Bayes and Random Forest in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · RNA and protein synthesis mechanisms · Bioinformatics and Genomic Networks
MethodsSupport Vector Machine
