# On the emergence of Zipf's law in music

**Authors:** Juan Ignacio Perotti, Orlando Vito Billoni

arXiv: 1902.06678 · 2020-02-17

## TL;DR

This paper investigates the emergence of Zipf's law in music, demonstrating that a combination of chords and notes as units reveals the law, supported by statistical analysis and a theoretical model inspired by compression algorithms.

## Contribution

It identifies the appropriate Zipfian units in music and links the emergence of Zipf's law to language evolution and communication efficiency.

## Key findings

- Zipf's law appears when chords and notes are combined as units in music.
- The study provides a statistical analysis of music and texts supporting this emergence.
- A theoretical model based on compression explains the law's origin in language evolution.

## Abstract

Zipf's law is found when the vocabulary of long written texts is ranked according to the frequency of word occurrences, establishing a power-law decay for the frequency vs rank relation. This law is a robust statistical property observed even in ancient untranslated languages. Interestingly, this law seems to be also manifested in music records when several metrics---functioning as words in written texts---are used. Even though music can be regarded as a language, finding an accurate equivalent of the concept of words in music is difficult because it lacks a functional semantic. This raises the question of which is the appropriate choice of Zipfian units in music, which is extensive to other contexts where this law can emerge. In particular, this is still an open question in written texts, where several alternatives have been proposed as Zipfian units besides the canonical use of words. Seeking to validate a natural election of Zipfian units in music, in this work we find that Zipf's law emerges when a combination of chords and notes are chosen as Zipfian units. Our results are grounded on a consistent analysis of the statistical properties of music and texts, complemented with theoretical considerations that combine different reference models, including a simple model inspired in the Lempel-Ziv compression algorithm that we have devised to explain the emergence of Zipf's law as the consequence of languages evolving into more efficient forms of communication.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.06678/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1902.06678/full.md

## References

69 references — full list in the complete paper: https://tomesphere.com/paper/1902.06678/full.md

---
Source: https://tomesphere.com/paper/1902.06678