The impact of Gene Ontology evolution on GO-Term Information Content
Pietro Hiram Guzzi, Giuseppe Agapito, Marianna Milano, and Mario, Cannataro

TL;DR
This study investigates how the evolution of the Gene Ontology from 2005 to 2014 significantly impacts the information content of GO terms, affecting semantic similarity measures in bioinformatics.
Contribution
It provides a comprehensive analysis of GO evolution's effect on information content calculations, highlighting biases that influence semantic similarity assessments.
Findings
Significant differences in IC calculations across GO versions.
Statistically significant impact of GO evolution on semantic similarity measures.
Biases in GO evolution affecting bioinformatics analyses.
Abstract
The Gene Ontology (GO) is a major bioinformatics ontology that provides structured controlled vocabularies to classify gene and proteins function and role. The GO and its annotations to gene products are now an integral part of functional analysis. Recently, the evaluation of similarity among gene products starting from their annotations (also referred to as semantic similarities) has become an increasing area in bioinformatics. While many research on updates to the structure of GO and on the annotation corpora have been made, the impact of GO evolution on semantic similarities is quite unobserved. Here we extensively analyze how GO changes that should be carefully considered by all users of semantic similarities. GO changes in particular have a big impact on information content (IC) of GO terms. Since many semantic similarities rely on calculation of IC it is obvious that the study of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Biomedical Text Mining and Ontologies · Genomics and Phylogenetic Studies
