EBIC: an open source software for high-dimensional and big data biclustering analyses
Patryk Orzechowski, Jason H. Moore

TL;DR
EBIC is an advanced biclustering software designed for high-dimensional, large-scale genetic data analysis, featuring GPU acceleration, R integration, and handling of missing data to improve efficiency and scalability.
Contribution
This paper introduces support for big data in EBIC, enabling efficient genomic data mining with GPU acceleration and integration with R and Bioconductor.
Findings
Over 6.6-fold speedup on large datasets using 8 GPUs
Successful application to a DNA methylation dataset with 436,444 rows
High scalability demonstrated across datasets of various sizes
Abstract
Motivation: In this paper we present the latest release of EBIC, a next-generation biclustering algorithm for mining genetic data. The major contribution of this paper is adding support for big data, making it possible to efficiently run large genomic data mining analyses. Additional enhancements include integration with R and Bioconductor and an option to remove influence of missing value on the final result. Results: EBIC was applied to datasets of different sizes, including a large DNA methylation dataset with 436,444 rows. For the largest dataset we observed over 6.6 fold speedup in computation time on a cluster of 8 GPUs compared to running the method on a single GPU. This proves high scalability of the algorithm. Availability: The latest version of EBIC could be downloaded from http://github.com/EpistasisLab/ebic . Installation and usage instructions are also available online.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Algorithms and Data Compression
