# Tree Height and the Asymptotic Mean of the Colijn–Plazzotta Rank of Unlabeled Binary Rooted Trees

**Authors:** Luc Devroye, Michael R. Doboli, Noah A. Rosenberg, Stephan Wagner

PMC · DOI: 10.1007/s11538-025-01538-7 · Bulletin of Mathematical Biology · 2025-11-03

## TL;DR

This paper explores how the rank of a tree structure relates to its height and derives asymptotic properties for different tree models used in mathematical phylogenetics.

## Contribution

The paper introduces new asymptotic results for the Colijn–Plazzotta rank of binary trees under three models, resolving open problems in mathematical phylogenetics.

## Key findings

- The expected value of log log f(τn) grows as 2√(πn) for uniform and labeled binary tree models.
- Under the Yule–Harding model, the expected value of log log f(τn) grows proportionally to α log n, where α ≈ 4.31107.
- The mean rank f(τn) is closely tied to the rank of the highest-ranked caterpillar tree.

## Abstract

The Colijn–Plazzotta ranking is a bijective encoding of the unlabeled binary rooted trees with positive integers. We show that the rank f(t) of a tree t is closely related to its height h, the maximal path length from a leaf to the root. We consider the rank \documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$f(\tau _n)$$\end{document}f(τn) of a random n-leaf tree \documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$\tau _n$$\end{document}τn under each of three models: (i) uniformly random unlabeled unordered binary rooted trees, or unlabeled topologies; (ii) uniformly random leaf-labeled binary trees, or labeled topologies under the uniform model; and (iii) random binary search trees, or labeled topologies under the Yule–Harding model. Relying on the close relationship between tree rank and tree height, we obtain results concerning the asymptotic properties of \documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$\log \log f(\tau _n)$$\end{document}loglogf(τn). In particular, we find \documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$${\mathbb {E}}\{\log _2 \log f(\tau _n)\} \sim 2 \sqrt{\pi n}$$\end{document}E{log2logf(τn)}∼2πn for uniformly random unlabeled ordered binary rooted trees and uniformly random leaf-labeled binary trees, and for a constant \documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$\alpha \approx 4.31107$$\end{document}α≈4.31107, \documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$${\mathbb {E}}\{\log _2 \log f(\tau _n)\} \sim \alpha \log n $$\end{document}E{log2logf(τn)}∼αlogn for leaf-labeled binary trees under the Yule–Harding model. We show that the mean of \documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$f(\tau _n)$$\end{document}f(τn) itself under the three models is largely determined by the rank \documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$c_{n-1}$$\end{document}cn-1 of the highest-ranked tree—the caterpillar—obtaining an asymptotic relationship with \documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$\pi _n c_{n-1}$$\end{document}πncn-1, where \documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$\pi _n$$\end{document}πn is a model-specific function of n. The results resolve open problems, providing a new class of results on an encoding useful in mathematical phylogenetics.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12583421/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12583421/full.md

## References

2 references — full list in the complete paper: https://tomesphere.com/paper/PMC12583421/full.md

---
Source: https://tomesphere.com/paper/PMC12583421