A Fast Parallel Median Filtering Algorithm Using Hierarchical Tiling
Louis Sugy (NVIDIA)

TL;DR
This paper presents a novel hierarchical tiling algorithm for median filtering that significantly improves speed and scalability, enabling real-time noise removal in digital images on GPUs.
Contribution
It introduces two new variants of median filtering algorithms with unprecedented per-pixel complexity, achieving up to 5x faster performance on GPUs compared to existing methods.
Findings
Achieves per-pixel complexity of O(k log k) and O(k) for two variants.
Up to 5 times faster than current state-of-the-art GPU median filters.
Effective for kernels up to 75x75 pixels across various data types.
Abstract
Median filtering is a non-linear smoothing technique widely used in digital image processing to remove noise while retaining sharp edges. It is particularly well suited to removing outliers (impulse noise) or granular artifacts (speckle noise). However, the high computational cost of median filtering can be prohibitive. Sorting-based algorithms excel with small kernels but scale poorly with increasing kernel diameter, in contrast to constant-time methods characterized by higher constant factors but better scalability, such as histogram-based approaches or the 2D wavelet matrix. This paper introduces a novel algorithm, leveraging the separability of the sorting problem through hierarchical tiling to minimize redundant computations. We propose two variants: a data-oblivious selection network that can operate entirely within registers, and a data-aware version utilizing random-access…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
