Reverse engineering time discrete finite dynamical systems: A feasible undertaking?
Edgar Delgado-Eckert

TL;DR
This paper investigates the minimal data requirements and optimal data set properties for reverse engineering biochemical networks using discrete dynamical systems, revealing fundamental limitations in data sufficiency.
Contribution
It characterizes minimal and optimal data sets for a top-down reverse engineering algorithm and introduces a generalized method with a probability formula for correct model identification.
Findings
Optimal data sets are characterized by a geometric 'general position' property.
The probability of correctly identifying the model decreases exponentially with the number of variables.
Even with optimal data, reverse engineering remains infeasible without enormous data quantities.
Abstract
With the advent of high-throughput profiling methods, interest in reverse engineering the structure and dynamics of biochemical networks is high. Recently an algorithm for reverse engineering of biochemical networks was developed by Laubenbacher and Stigler. It is a top-down approach using time discrete dynamical systems. One of its key steps includes the choice of a term order. The aim of this paper is to identify minimal requirements on data sets to be used with this algorithm and to characterize optimal data sets. We found minimal requirements on a data set based on how many terms the functions to be reverse engineered display. Furthermore, we identified optimal data sets, which we characterized using a geometric property called "general position". Moreover, we developed a constructive method to generate optimal data sets, provided a codimensional condition is fulfilled. In addition,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
