On the Average Case of MergeInsertion
Florian Stober, Armin Wei{\ss}

TL;DR
This paper analyzes the average case complexity of the MergeInsertion sorting algorithm, providing new bounds, exact distributions, and experimental insights into its performance and variations.
Contribution
It establishes an upper bound on average comparisons, describes the distribution of insertion chain lengths, and compares different insertion orders experimentally.
Findings
Upper bound of n log n - 1.4005n + o(n) comparisons.
Exact average comparisons computed for n up to 148.
Different insertion orders can improve average case performance.
Abstract
MergeInsertion, also known as the Ford-Johnson algorithm, is a sorting algorithm which, up to today, for many input sizes achieves the best known upper bound on the number of comparisons. Indeed, it gets extremely close to the information-theoretic lower bound. While the worst-case behavior is well understood, only little is known about the average case. This work takes a closer look at the average case behavior. In particular, we establish an upper bound of comparisons. We also give an exact description of the probability distribution of the length of the chain a given element is inserted into and use it to approximate the average number of comparisons numerically. Moreover, we compute the exact average number of comparisons for up to 148. Furthermore, we experimentally explore the impact of different decision trees for binary insertion. To conclude,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Machine Learning and Algorithms · Machine Learning and Data Classification
