Fighting against uncertainty: An essential issue in bioinformatics
Michiaki Hamada

TL;DR
This paper reviews methods to address the challenge of solution uncertainty in bioinformatics problems, emphasizing strategies like avoiding point estimates and using maximum expected accuracy to improve prediction reliability.
Contribution
It introduces and explains several approaches to handle uncertainty in bioinformatics estimations, providing practical examples and emphasizing their broad applicability.
Findings
Methods to combat uncertainty include avoiding point estimates and MEA.
Strategies involve designing multi-step prediction pipelines.
Handling uncertainty improves bioinformatics prediction accuracy.
Abstract
Many bioinformatics problems, such as sequence alignment, gene prediction, phylogenetic tree estimation and RNA secondary structure prediction, are often affected by the "uncertainty" of a solution; that is, the probability of the solution is extremely small. This situation arises for estimation problems on high-dimensional discrete spaces in which the number of possible discrete solutions is immense. In the analysis of biological data or the development of prediction algorithms, this uncertainty should be handled carefully and appropriately. In this review, I will explain several methods to combat this uncertainty, presenting a number of examples in bioinformatics. The methods include (i) avoiding point estimation, (ii) maximum expected accuracy (MEA) estimations, and (iii) several strategies to design a pipeline involving several prediction methods. I believe that the basic concepts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Machine Learning in Bioinformatics
