# What Is Redundancy?

**Authors:** Clifford Bohm, Christoph Adami, Arend Hintze

PMC · DOI: 10.3390/e28020167 · Entropy · 2026-02-01

## TL;DR

The paper clarifies the ambiguous concept of redundancy in information theory by distinguishing between two types: operational and informational redundancy.

## Contribution

The paper introduces a formal distinction between operational and informational redundancy to resolve conceptual confusion in multivariate information theory.

## Key findings

- Operational redundancy relates to input sufficiency for prediction, while informational redundancy concerns shared content among inputs.
- Statistical correlations from partial observation can create apparent content overlap without reflecting true functional redundancy.
- The distinction explains why redundancy is elusive and why no single measure can capture all intuitions about it.

## Abstract

Redundancy is a central yet persistently ambiguous concept in multivariate information theory. Across the literature, the same term is used to describe fundamentally distinct phenomena. Operational redundancy concerns how different inputs relate to the prediction of output states, while informational redundancy concerns content overlap among inputs relevant to an output. These notions are routinely conflated in decompositions of mutual information, leading to incompatible definitions, contradictory interpretations, and apparent paradoxes—particularly when inputs are statistically independent. We argue that the difficulty in defining redundancy is not primarily technical, but conceptual: the field has not converged on what redundancy is meant to signify. We formalize this distinction by identifying two classes of redundancy. Operational redundancy encompasses task-relative properties and covers conditions when inputs are sufficient or substitutable for prediction. Informational redundancy concerns shared content among inputs, grounded in mutual information between them. Using functional examples and biased input ensembles, we demonstrate the practical distinction between these classes: inputs with no informational overlap can exhibit operational redundancy, while partial observation can induce statistical correlations that create content overlap without reflecting the underlying functional structure. We conclude by proposing a clear separation of these concepts and outlining minimal commitments for each. This separation clarifies why redundancy remains elusive, why no single measure can satisfy all intuitions, and how future work can proceed without redefining information itself.

## Full-text entities

- **Genes:** CIRSR (corepressor of RBPJ and splicing regulator) [NCBI Gene 9541] {aka CIR, CIR1, THE1B/CIR1}, GPR15 (G protein-coupled receptor 15) [NCBI Gene 2838] {aka BOB}, PGR (progesterone receptor) [NCBI Gene 5241] {aka NR3C3, PR}
- **Diseases:** PID (MESH:D004828), injury to (MESH:D014947)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12939084/full.md

## References

20 references — full list in the complete paper: https://tomesphere.com/paper/PMC12939084/full.md

---
Source: https://tomesphere.com/paper/PMC12939084