AOC: Analysis of Orthologous Collections -- an application for the characterization of natural selection in protein-coding sequences
Alexander Lucaci, Sergei Pond

TL;DR
AOC is a comprehensive tool that automates the analysis of natural selection in protein-coding sequences, providing an accessible way to interpret evolutionary patterns across genes and lineages.
Contribution
It introduces an automated, user-friendly pipeline for detecting various types of natural selection in orthologous gene collections, integrating all analysis steps with visualization.
Findings
Automates detection of negative, positive, and differential selection.
Provides complete analysis pipeline from sequences to visual results.
Accessible to domain experts for self-conducted evolutionary studies.
Abstract
Motivation Modern molecular sequence analysis increasingly relies on automated and robust software tools for interpretation, annotation, and biological insight. The Analysis of Orthologous Collections (AOC) application automates the identification of genomic sites and species/lineages influenced by natural selection in coding sequence analysis. AOC quantifies different types of selection: negative, diversifying or directional positive, or differential selection between groups of branches. We include all steps necessary to go from unaligned homologous sequences to complete results and interactive visualizations that are designed to aid in the useful interpretation and contextualization. Results We are motivated by a desire to make evolutionary analyses as simple as possible, and to close the disparity in the literature between genes which draw a significant amount of interest and those…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Machine Learning in Bioinformatics · Identification and Quantification in Food
