Adjusted Kolmogorov Complexity of Binary Words with Empirical Entropy Normalization
Brani Vidakovic

TL;DR
This paper introduces a new way to measure the complexity of binary sequences by adjusting for symbol frequency imbalances.
Contribution
The novelty is an entropy-normalized Kolmogorov complexity measure that isolates intrinsic complexity from combinatorial effects.
Findings
Adjusted complexity converges to one for Martin–Löf random sequences under exchangeable measures.
Symbol imbalance affects standard complexity measures, but not the normalized version.
Regularity of the underlying measure is crucial for the adjusted complexity behavior.
Abstract
Kolmogorov complexity of a finite binary word reflects both algorithmic structure and the empirical distribution of symbols appearing in the word. Words with symbol frequencies far from one half belong to smaller combinatorial classes and therefore appear less complex under the standard definition. In this paper, an entropy-normalized complexity measure is introduced that divides the Kolmogorov complexity of a word by the empirical entropy of its observed distribution of zeros and ones. This adjustment isolates intrinsic descriptive complexity from the purely combinatorial effect of symbol imbalance. For Martin–Löf random sequences under constructive exchangeable measures, the adjusted complexity grows linearly and converges to one. A pathological construction shows that regularity of the underlying measure is essential. The proposed framework connects Kolmogorov complexity, empirical…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputability, Logic, AI Algorithms · Benford’s Law and Fraud Detection · semigroups and automata theory
