Towards a HPC-oriented parallel implementation of a learning algorithm for bioinformatics applications
Gianni D'Angelo, Salvatore Rampone

TL;DR
This paper discusses a high-performance computing implementation of the U-BRAIN learning algorithm, designed for bioinformatics data analysis, addressing challenges in parallelization and load balancing to handle large datasets efficiently.
Contribution
It presents a parallel HPC-oriented implementation of U-BRAIN, optimizing its computational performance for large-scale bioinformatics applications.
Findings
Achieved significant speedup in execution time.
Reduced memory usage through parallelization.
Demonstrated scalability on large datasets.
Abstract
Background: The huge quantity of data produced in Biomedical research needs sophisticated algorithmic methodologies for its storage, analysis, and processing. High Performance Computing (HPC) appears as a magic bullet in this challenge. However, several hard to solve parallelization and load balancing problems arise in this context. Here we discuss the HPC-oriented implementation of a general purpose learning algorithm, originally conceived for DNA analysis and recently extended to treat uncertainty on data (U BRAIN). The U-BRAIN algorithm is a learning algorithm that finds a Boolean formula in disjunctive normal form (DNF), of approximately minimum complexity, that is consistent with a set of data (instances) which may have missing bits. The conjunctive terms of the formula are computed in an iterative way by identifying, from the given data, a family of sets of conditions that must be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
