# Adjusted Kolmogorov Complexity of Binary Words with Empirical Entropy Normalization

**Authors:** Brani Vidakovic

PMC · DOI: 10.3390/e28020176 · 2026-02-03

## TL;DR

This paper introduces a new way to measure the complexity of binary sequences by adjusting for symbol frequency imbalances.

## Contribution

The novelty is an entropy-normalized Kolmogorov complexity measure that isolates intrinsic complexity from combinatorial effects.

## Key findings

- Adjusted complexity converges to one for Martin–Löf random sequences under exchangeable measures.
- Symbol imbalance affects standard complexity measures, but not the normalized version.
- Regularity of the underlying measure is crucial for the adjusted complexity behavior.

## Abstract

Kolmogorov complexity of a finite binary word reflects both algorithmic structure and the empirical distribution of symbols appearing in the word. Words with symbol frequencies far from one half belong to smaller combinatorial classes and therefore appear less complex under the standard definition. In this paper, an entropy-normalized complexity measure is introduced that divides the Kolmogorov complexity of a word by the empirical entropy of its observed distribution of zeros and ones. This adjustment isolates intrinsic descriptive complexity from the purely combinatorial effect of symbol imbalance. For Martin–Löf random sequences under constructive exchangeable measures, the adjusted complexity grows linearly and converges to one. A pathological construction shows that regularity of the underlying measure is essential. The proposed framework connects Kolmogorov complexity, empirical entropy, and randomness in a natural manner and suggests applications in randomness testing and in the analysis of structured binary data.

## Full-text entities

- **Diseases:** injury to (MESH:D014947), shell deficiency (MESH:D003130), Kolmogorov Complexity (MESH:D048090)
- **Chemicals:** LZ76 (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12939360/full.md

---
Source: https://tomesphere.com/paper/PMC12939360