Kolmogorov's Structure Functions and Model Selection
Nikolai Vereshchagin (Moscow State University), Paul Vitanyi (CWI, and University of Amsterdam)

TL;DR
This paper explores Kolmogorov's 1974 non-probabilistic approach to statistics, showing how the structure function of data determines all stochastic properties and the best-fitting models within complexity constraints.
Contribution
It demonstrates that the structure function uniquely characterizes data properties and model fit, and analyzes the computability of related functions and statistics.
Findings
The structure function determines all stochastic properties of data.
Every graph can be realized by some data's structure function.
The paper characterizes the computability of the structure function and minimal sufficient statistic.
Abstract
In 1974 Kolmogorov proposed a non-probabilistic approach to statistics and model selection. Let data be finite binary strings and models be finite sets of binary strings. Consider model classes consisting of models of given maximal (Kolmogorov) complexity. The ``structure function'' of the given data expresses the relation between the complexity level constraint on a model class and the least log-cardinality of a model in the class containing the data. We show that the structure function determines all stochastic properties of the data: for every constrained model class it determines the individual best-fitting model in the class irrespective of whether the ``true'' model is in the model class considered or not. In this setting, this happens {\em with certainty}, rather than with high probability as is in the classical case. We precisely quantify the goodness-of-fit of an individual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputability, Logic, AI Algorithms · AI-based Problem Solving and Planning · Evolutionary Algorithms and Applications
