Peano Count Trees (P-Trees) and Rule Association Mining for Gene Expression Profiling of Microarray Data
Willy Valdivia-Granda, William Perrizo, Edward Deckard, Francis Larson

TL;DR
This paper introduces P-trees and association rule mining to analyze gene expression microarray data across species, aiming to better interpret complex biological signals and pathways.
Contribution
It presents a novel integrative approach combining P-trees, super-chip data representation, and association rules for gene expression analysis across different organisms.
Findings
P-trees effectively represent multidimensional genomic data.
Association rules reveal meaningful gene expression and repression patterns.
The approach enhances understanding of gene regulation and evolutionary relationships.
Abstract
The greatest challenge in maximizing the use of gene expression data is to develop new computational tools capable of interconnecting and interpreting the results from different organisms and experimental settings. We propose an integrative and comprehensive approach including a super-chip containing data from microarray experiments collected on different species subjected to hypoxic and anoxic stress. A data mining technology called Peano count tree (P-trees) is used to represent genomic data in multidimensions. Each microarray spot is presented as a pixel with its corresponding red/green intensity feature bands. Each bad is stored separately in a reorganized 8-separate (bSQ) file format. Each bSQ is converted to a quadrant base tree structure (P-tree) from which a superchip is represented as expression P-trees (EP-trees) and repression P-trees (RP-trees). The use of association rule…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Gene Regulatory Network Analysis
