Distributional convergence for the number of symbol comparisons used by QuickSort
James Allen Fill

TL;DR
This paper analyzes the distributional limits of symbol comparison counts in QuickSort when keys are symbol sequences from a probabilistic source, revealing convergence properties under mild conditions.
Contribution
It establishes the existence of a limiting distribution for symbol comparisons in QuickSort with i.i.d. keys, extending previous comparison-based analyses to symbol-based costs.
Findings
Limiting distribution exists for normalized symbol comparisons
Convergence of moments for orders p and smaller under certain conditions
Memoryless sources lead to distribution and moment convergence for all orders
Abstract
Most previous studies of the sorting algorithm QuickSort have used the number of key comparisons as a measure of the cost of executing the algorithm. Here we suppose that the n independent and identically distributed (i.i.d.) keys are each represented as a sequence of symbols from a probabilistic source and that QuickSort operates on individual symbols, and we measure the execution cost as the number of symbol comparisons. Assuming only a mild "tameness" condition on the source, we show that there is a limiting distribution for the number of symbol comparisons after normalization: first centering by the mean and then dividing by n. Additionally, under a condition that grows more restrictive as p increases, we have convergence of moments of orders p and smaller. In particular, we have convergence in distribution and convergence of moments of every order whenever the source is memoryless,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
