Mean-field methods in evolutionary duplication-innovation-loss models   for the genome-level repertoire of protein domains

A. Angelini; A. Amato; G. Bianconi; B. Bassetti; M. Cosentino; Lagomarsino

arXiv:0910.4253·q-bio.GN·May 14, 2015

Mean-field methods in evolutionary duplication-innovation-loss models for the genome-level repertoire of protein domains

A. Angelini, A. Amato, G. Bianconi, B. Bassetti, M. Cosentino, Lagomarsino

PDF

TL;DR

This paper combines mean-field and simulation methods to analyze models of genome evolution, focusing on duplication, innovation, and loss of protein domain classes, to explain observed scaling behaviors across species.

Contribution

It introduces a unified approach to study genome-level protein domain repertoire models, highlighting the effects of element loss and class specificity on scaling behaviors.

Findings

01

Models reproduce observed genome domain scaling patterns

02

Loss of elements significantly impacts model outcomes

03

Class specificity influences the dynamics of domain repertoires

Abstract

We present a combined mean-field and simulation approach to different models describing the dynamics of classes formed by elements that can appear, disappear or copy themselves. These models, related to a paradigm duplication-innovation model known as Chinese Restaurant Process, are devised to reproduce the scaling behavior observed in the genome-wide repertoire of protein domains of all known species. In view of these data, we discuss the qualitative and quantitative differences of the alternative model formulations, focusing in particular on the roles of element loss and of the specificity of empirical domain classes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.