A combinatorial interpretation of Gaussian blur

Travis Dillon

arXiv:1812.03569·math.CO·November 18, 2020

A combinatorial interpretation of Gaussian blur

Travis Dillon

PDF

Open Access

TL;DR

This paper introduces the collapsing sum, a new combinatorial operator on matrices, providing a novel interpretation of Gaussian blur and establishing explicit relations between the two.

Contribution

The paper presents the collapsing sum operator and its combinatorial properties, offering a new perspective on Gaussian blur in image processing.

Findings

01

Established a direct relation between Gaussian blur and collapsing sum

02

Provided combinatorial properties of the collapsing sum operator

03

Enhanced understanding of Gaussian blur through combinatorial interpretation

Abstract

Gaussian blur is a commonly-used method to filter image data. This paper introduces the collapsing sum, a new operator on matrices that provides a combinatorial interpretation of Gaussian blur. We study the combinatorial properties of this operator and prove the explicit relation between Gaussian blur and the collapsing sum.

Equations70

σ_{\scaleobj 0.7 ↓} (A)_{i, j} = a_{i, j} + a_{i + 1, j} .

σ_{\scaleobj 0.7 ↓} (A)_{i, j} = a_{i, j} + a_{i + 1, j} .

σ_{\scaleobj 0.7 \to} (A)_{i, j} = a_{i, j} + a_{i, j + 1} .

σ_{\scaleobj 0.7 \to} (A)_{i, j} = a_{i, j} + a_{i, j + 1} .

(K * A)_{p, q} := i, j = - r \sum r k_{i, j} \cdot a_{p - i, q - j} .

(K * A)_{p, q} := i, j = - r \sum r k_{i, j} \cdot a_{p - i, q - j} .

(K * A)_{p, q} = i + k = p j + ℓ = q \sum k_{i, j} \cdot a_{k, ℓ} .

(K * A)_{p, q} = i + k = p j + ℓ = q \sum k_{i, j} \cdot a_{k, ℓ} .

f (x, y) = \frac{1}{2 π s ^{2}} e^{- \frac{x ^{2} + y ^{2}}{2 s ^{2}}},

f (x, y) = \frac{1}{2 π s ^{2}} e^{- \frac{x ^{2} + y ^{2}}{2 s ^{2}}},

(G_{2 r + 1})_{i, j} = \frac{1}{4 ^{2 r}} (i + r 2 r) (j + r 2 r) .

(G_{2 r + 1})_{i, j} = \frac{1}{4 ^{2 r}} (i + r 2 r) (j + r 2 r) .

G_{5} = \frac{1}{256} 1464141624164624362464162416414641 .

G_{5} = \frac{1}{256} 1464141624164624362464162416414641 .

A = (a_{1, 1} a_{2, 1} a_{1, 2} a_{2, 2}) .

A = (a_{1, 1} a_{2, 1} a_{1, 2} a_{2, 2}) .

σ_{\scaleobj 0.7 ↓} (A) = (a_{1, 1} + a_{2, 1} a_{1, 2} + a_{2, 2}) σ_{\scaleobj 0.7 \to} (A) = (a_{1, 2} + a_{2, 2} a_{2, 1} + a_{2, 2})

σ_{\scaleobj 0.7 ↓} (A) = (a_{1, 1} + a_{2, 1} a_{1, 2} + a_{2, 2}) σ_{\scaleobj 0.7 \to} (A) = (a_{1, 2} + a_{2, 2} a_{2, 1} + a_{2, 2})

σ (A) = (a_{1, 1} + a_{1, 2} + a_{2, 1} + a_{2, 2}) .

δ_{i, j} = {10 if i = j if i \neq = j .

δ_{i, j} = {10 if i = j if i \neq = j .

R_{4} = 100110011001

R_{4} = 100110011001

R_{4}^{\underline{2}} = (101101) 100110011001 = (10211201) .

R_{4}^{\underline{2}} = (101101) 100110011001 = (10211201) .

k = 1 \sum m (δ_{i, k} + δ_{i + 1, k}) a_{k, j} = a_{i, j} + a_{i + 1, j} = σ_{\scaleobj 0.7 ↓} (A)_{i, j} .

k = 1 \sum m (δ_{i, k} + δ_{i + 1, k}) a_{k, j} = a_{i, j} + a_{i + 1, j} = σ_{\scaleobj 0.7 ↓} (A)_{i, j} .

(R_{m}^{\underline{k + 1}})_{i, j}

(R_{m}^{\underline{k + 1}})_{i, j}

= (j - i k) + (j - ( i + 1 ) k)

= (j - i k + 1),

i, j \sum (A X B)_{i, j} = i = 1 \sum m j = 1 \sum n [k = 1 \sum m r = 1 \sum n a_{i, k} \cdot x_{k, r} \cdot b_{r, j}] .

i, j \sum (A X B)_{i, j} = i = 1 \sum m j = 1 \sum n [k = 1 \sum m r = 1 \sum n a_{i, k} \cdot x_{k, r} \cdot b_{r, j}] .

\sum_{i=1}^{m}\sum_{j=1}^{n}a_{i,p}b_{q,j}=\Bigg{[}\sum_{i=1}^{m}a_{i,p}\Bigg{]}\Bigg{[}\sum_{j=1}^{n}b_{q,j}\Bigg{]}.

\sum_{i=1}^{m}\sum_{j=1}^{n}a_{i,p}b_{q,j}=\Bigg{[}\sum_{i=1}^{m}a_{i,p}\Bigg{]}\Bigg{[}\sum_{j=1}^{n}b_{q,j}\Bigg{]}.

α_{i} = ℓ = 1 \sum m - s (R_{m}^{\underline{a}})_{ℓ, i} = ℓ = 1 \sum m - a (i - ℓ a) .

α_{i} = ℓ = 1 \sum m - s (R_{m}^{\underline{a}})_{ℓ, i} = ℓ = 1 \sum m - a (i - ℓ a) .

σ^{n - 1} (A)_{1, 1} = i, j \sum σ^{n - 1} (A)_{i, j} = i, j \sum c_{i, j} a_{i, j}

σ^{n - 1} (A)_{1, 1} = i, j \sum σ^{n - 1} (A)_{i, j} = i, j \sum c_{i, j} a_{i, j}

\frac{1}{4 ^{2 r}} i = - r \sum r j = - r \sum r (i + r 2 r) (j + r 2 r) a_{p + i, q + i}^{'} .

\frac{1}{4 ^{2 r}} i = - r \sum r j = - r \sum r (i + r 2 r) (j + r 2 r) a_{p + i, q + i}^{'} .

σ^{2 r} (A^{'})_{p, q}

σ^{2 r} (A^{'})_{p, q}

= i = 0 \sum 2 r j = 0 \sum 2 r (i 2 r) (j 2 r) b_{i, j}

= i = - r \sum r j = - r \sum r (i + r 2 r) (j + r 2 r) a_{p + i, q + i}^{'}

= 4^{2 r} (G_{2 r + 1} * A)_{p, q} . \qed

G_{2 r + 1} = 4^{- 2 r} σ^{2 r} .

G_{2 r + 1} = 4^{- 2 r} σ^{2 r} .

a_{0} a_{1} a_{2} a_{3} a_{- 1} a_{0} a_{1} a_{2} a_{- 2} a_{- 1} a_{0} a_{1} a_{- 3} a_{- 2} a_{- 1} a_{0} a_{- 4} a_{- 3} a_{- 2} a_{- 1} .

a_{0} a_{1} a_{2} a_{3} a_{- 1} a_{0} a_{1} a_{2} a_{- 2} a_{- 1} a_{0} a_{1} a_{- 3} a_{- 2} a_{- 1} a_{0} a_{- 4} a_{- 3} a_{- 2} a_{- 1} .

A = k = - n \sum m toep (0, \dots, 0, a_{k}, 0, \dots, 0) .

A = k = - n \sum m toep (0, \dots, 0, a_{k}, 0, \dots, 0) .

a_{i, j} = {10 if i - j = k otherwise.

a_{i, j} = {10 if i - j = k otherwise.

σ_{\scaleobj 0.7 ↓}^{m} σ_{\scaleobj 0.7 \to}^{n} (A)_{1, 1}

σ_{\scaleobj 0.7 ↓}^{m} σ_{\scaleobj 0.7 \to}^{n} (A)_{1, 1}

= i = 0 \sum m (i m) (i - k n)

= i = 0 \sum m (i m) (( n + k ) - i n) .

σ^{n} (C_{n + 1}^{n}) = i = 0 \sum n j = 0 \sum n (i n) (j n) c_{i, j} = i = 0 \sum n i = 0 \sum n (i n)^{2} (j n)^{2} .

σ^{n} (C_{n + 1}^{n}) = i = 0 \sum n j = 0 \sum n (i n) (j n) c_{i, j} = i = 0 \sum n i = 0 \sum n (i n)^{2} (j n)^{2} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopological and Geometric Data Analysis · Advanced Image Fusion Techniques · Image Retrieval and Classification Techniques

Full text

\newdateformat

daymonthyear\THEDAY \monthname[\THEMONTH] \THEYEAR

A combinatorial interpretation of Gaussian blur

Travis Dillon

(\daymonthyear)

Abstract

Gaussian blur is a commonly-used method to filter image data. This paper introduces the collapsing sum, a new operator on matrices that provides a combinatorial interpretation of Gaussian blur. We study the combinatorial properties of this operator and prove the explicit relation between Gaussian blur and the collapsing sum.

1 Introduction

Image data, and data in general, is often filtered to remove noise, random fluctuations that hide the underlying pattern. For images, one of the most common solutions is to apply Gaussian blur, which smooths the data to remove noise.

Because of its use, there has been much interest in discovering efficient algorithms for Gaussian blur [1, 2, 3]. Waltz and Miller [3] in particular provide a clear example of the ways in which properties of binomial coefficients can be leveraged to create such an algorithm. An analysis of their algorithm in Section 2 leads to the following definitions.

Definition 1.1.

Let $A$ be a real $m\times n$ matrix. If $m\geq 2$ , then the $(m-1)\times n$ matrix $\sigma_{\scaleobj{0.7}{\downarrow}}(A)$ has entries

[TABLE]

If $n\geq 2$ , then the $m\times(n-1)$ matrix $\sigma_{\scaleobj{0.7}{\rightarrow}}(A)$ has entries

[TABLE]

Finally, the matrix $\sigma(A):=\sigma_{\scaleobj{0.7}{\downarrow}}\circ\sigma_{\scaleobj{0.7}{\rightarrow}}(A)=\sigma_{\scaleobj{0.7}{\rightarrow}}\circ\sigma_{\scaleobj{0.7}{\downarrow}}(A)$ is the collapsing sum of $A$ .

The collapsing sum captures mathematically what Waltz and Miller describe computationally in [3]. In this paper, we establish the connection between the collapsing sum and Gaussian blur and provide a theoretical study of the combinatorial properties of this operator.

Section 3 provides the main combinatorial analysis of the operator. We recast the collapsing sum (and therefore Gaussian blur) in terms of matrix multiplication and define a new class of matrices called coefficient matrices that generalize Gaussian blur. The section culminates in Theorem 3.12, which explicitly describes the connection between the collapsing sum and Gaussian blur.

In the remainder of the paper, we turn to the purely combinatorial properties of this operator; for example, we completely describe the fully-collapsed state of Toeplitz matrices (see Proposition 4.3). Finally, we discuss generalizations of Gaussian blur in connection to Waltz and Miller’s algorithm.

2 Background

As the collapsing sum will be motivated by Gaussian blur, we begin with a description of image filtering. Grayscale images are stored as matrices: Shades of gray are represented as numbers in a particular range (for example, integers from 0 to 255, or real numbers from 0 to 1), and each entry represents a pixel.111Whether 0 represents black or white depends on the application; in printing, 0 represents white, whereas in computing, 0 represents black. We won’t need to pick between these conventions for our purposes. We will consider only grayscale images, but this is not an artificial restriction; the same techniques are used to apply a filter to color images. The data for color images are stored as three separate values of red, green, and blue. Applying a filter to a color image consists of separating the data into three matrices by color type, applying the filter to each, and recombining.

It may be the case that the image contains noise, so that the pixel values are randomly perturbed by environmental factors. Because noise is random, it seems possible to eliminate it by averaging pixel values in the neighborhood of a central pixel. This process is known as filtering.

Filters are applied in a process called convolution. The matrix that represents the filter is called a kernel matrix. Typically, kernel matrices are square with dimensions $(2r+1)\times(2r+1)$ . The integer $r$ is the radius of the filter and controls the size of the neighborhood. For simplicity in the convolution formula, kernel matrices are indexed so that the central entry has coordinates (0,0). Convolving the kernel matrix $K=(k_{i,j})$ with an $m\times n$ image matrix $A$ returns the $m\times n$ matrix $K\ast A$ with entries

[TABLE]

The convolution can be equivalently expressed as

[TABLE]

We require that $\sum_{i,j}k_{i,j}=1$ so that the overall intensity of the image does not change.

As written, however, convolution is not well-defined when $a_{p,q}$ is near a boundary of $A$ . In these cases, the convolution formula requires values of entries that don’t exist, such as $a_{-1,0}$ . To fix this problem, we use what are called edge-handling techniques. In this paper, we only consider two common techniques: extending $A$ to have values beyond its edges or applying the filter to only those pixels for which convolution is defined (the latter is called cropping).222See https://en.wikipedia.org/wiki/Kernel_(image_processing) for a list of edge-handling techniques. To apply a kernel matrix of radius $r$ to all pixels in an $m\times n$ matrix $A$ , we need to extend $A$ by $r$ rows and columns on each side, to a matrix of size $(m+2r)\times(n+2r)$ , where the central $m\times n$ block is the matrix $A$ . The filter is applied to each pixel in the central $m\times n$ block of the enlarged matrix.

If extension is chosen as the edge-handling technique, let $A^{\prime}$ denote the corresponding extension of $A$ . If cropping is chosen as the edge-handling technique, then set $A^{\prime}=A$ . Applying the filter to $A$ with the chosen edge-handling technique is equivalent to applying the filter to $A^{\prime}$ with cropping.

The simplest blur filter is the box blur. Let $J_{m\times n}$ represent the $m\times n$ matrix with each entry equal to $1$ , and abbreviate $J_{n\times n}$ by $J_{n}$ .

Definition 2.1.

The kernel matrix $B_{2r+1}$ for the box blur of radius $r$ is $(2r+1)^{-2}J_{2r+1}$ .

As a visual example, consider the following image.

The results of applying box blurs with radii of 1, 2, and 3, respectively, to this image are shown below.

One problem with box blurs, especially ones of large radius, is that pixels are weighted the same regardless of their distance from the central pixel. It makes sense to weight closer pixels more heavily than distant pixels: Pixels that are closer to each other will contain more information about each other than those that are farther away. Because of this, the Gaussian blur, which takes this into account, is more commonly used. The values for the Gaussian blur kernel matrix are derived from the two-dimensional Gaussian curve

[TABLE]

where $s$ represents the standard deviation of the distribution. Sometimes values are directly sampled from this function, but they are often approximated using binomial coefficients.

Definition 2.2.

The $(2r+1)\times(2r+1)$ kernel matrix $G_{2r+1}$ of the approximate Gaussian blur with radius $r$ has entries, for each $-r\leq i,j\leq r$ , of

[TABLE]

Example 2.3.

The kernel matrix for the $5\times 5$ approximate Gaussian blur is

[TABLE]

Notice that the pixels near the center are weighted highest, and that the values taper off toward the edges. Applying Gaussian blurs of radii 1, 2, and 3, respectively, to our example image from above results in the images below. The images appear smooth, while each individual element of the image remains clear.

Each Gaussian blur kernel matrix can be decomposed into the product of a row vector and a column vector. Since it is much faster to compute smaller convolutions than large ones, Gaussian blur algorithms break the computation into two smaller convolutions: one with the row vector, and one with the column vector.

In [3], Waltz and Miller develop an algorithm for computing Gaussian blur that is more efficient than simple decomposition. The key observation that the authors use is that Gaussian blurs of larger radius can be created through repeated convolution with Gaussian blurs of smaller radius. Their algorithm decomposes the Gaussian blur kernel matrix into a row vector and a column vector, and it decomposes each of these vectors into the repeated convolution of the matrices $\begin{pmatrix}1&1\end{pmatrix}$ and $\begin{pmatrix}1&1\end{pmatrix}^{T}$ , respectively. With a bit of clever programming, Waltz and Miller created an algorithm that runs much faster than one that only uses the decomposition property.

Convolution by the matrices $\begin{pmatrix}1&1\end{pmatrix}$ and $\begin{pmatrix}1&1\end{pmatrix}^{T}$ corresponds to the operations $\sigma_{\scaleobj{0.7}{\rightarrow}}$ and $\sigma_{\scaleobj{0.7}{\downarrow}}$ , respectively. This observation leads to Definition 1.1.

Example 2.4.

Take the $2\times 2$ matrix

[TABLE]

Applying the collapsing operations, we get

[TABLE]

3 Equivalence of Gaussian blur and collapsing sum

In this section, we place the collapsing sum on a matrix-theoretic foundation and explicitly connect it with Gaussian blur via Theorem 3.12.

The following properties of the collapsing sum follow directly from Definition 1.1.

Proposition 3.1.

Let $A$ and $B$ be $m\times n$ matrices and $c$ be any real number. Then

$\sigma(A+B)=\sigma(A)+\sigma(B)$ , 2. 2.

$\sigma(cA)=c\cdot\sigma(A)$ , 3. 3.

$\sigma_{\scaleobj{0.7}{\downarrow}}(A^{T})=\sigma_{\scaleobj{0.7}{\rightarrow}}(A)^{T}$ , and 4. 4.

$\sigma(A^{T})=\sigma(A)^{T}$ **

whenever the operations are defined. The first two statements also hold for $\sigma_{\scaleobj{0.7}{\downarrow}}$ and $\sigma_{\scaleobj{0.7}{\rightarrow}}$ .

Much of the investigation will examine repeated application of the collapsing sum. Let $A$ be an $m\times n$ matrix. Then $\sigma^{0}(A)=A$ , and for each positive integer $1\leq s<\min\{m,n\}$ , we define $\sigma^{s}(A)=\sigma(\sigma^{s-1}(A))$ . The operators $\sigma_{\scaleobj{0.7}{\downarrow}}^{s}$ and $\sigma_{\scaleobj{0.7}{\rightarrow}}^{s}$ are defined similarly.

Let $I_{m}$ be the $m\times m$ identity matrix and $\delta_{i,j}$ be the Kronecker delta function

[TABLE]

Definition 3.2.

We denote by $R_{m}$ the $(m-1)\times m$ matrix with entries $r_{i,j}=\delta_{i,j}+\delta_{i+1,j}$ . For a positive integer $k<m$ , we define $R_{m}^{\underline{k}}$ as the product $R_{m-k+1}R_{m-k+2}\cdots R_{m}$ . Further, let $R_{m}^{\underline{0}}=I_{m}$ .

The matrices $R_{m}$ have $1$ ’s on the diagonal and superdiagonal and [math]’s elsewhere. The notation $R_{m}^{\underline{k}}$ is defined analogously to the falling power notation $n^{\underline{k}}=n(n-1)\cdots(n-k+1)$ .

Example 3.3.

We have

[TABLE]

and

[TABLE]

Proposition 3.4.

Let $A$ be an $m\times n$ matrix with $m,n\geq 2$ . Then $\sigma_{\scaleobj{0.7}{\downarrow}}^{s}(A)=R_{m}^{\underline{s}}A$ and $\sigma_{\scaleobj{0.7}{\rightarrow}}^{s}(A)=A(R_{n}^{\underline{s}})^{T}$ .

Proof.

First note that $R_{m}A$ is an $(m-1)\times n$ matrix. Using the definition of $R_{m}$ , the entry $(R_{m}A)_{i,j}$ is

[TABLE]

A quick induction argument shows that $\sigma_{\scaleobj{0.7}{\downarrow}}^{s}(A)=R_{m}^{\underline{s}}A$ . The calculation for the second assertion is similar. ∎

Consequently, $\sigma^{s}(A)=(R_{m}^{\underline{s}})A(R_{n}^{\underline{s}})^{T}$ .

Proposition 3.5.

Let $m$ be a positive integer and $s\leq m$ be a nonnegative integer. Then $R_{m}^{\underline{s}}$ is an $(m-s)\times m$ matrix with entries $(R_{m}^{\underline{s}})_{i,j}=\binom{s}{j-i}$ .

Proof.

We proceed by induction. For $s=0$ , the theorem simplifies to the definition of $I_{m}=R_{m}^{\underline{0}}$ . Now suppose that the theorem holds for some nonnegative integer $k$ . Then $R_{m}^{\underline{k+1}}=R_{m-k}R_{m}^{\underline{k}}$ . Since $R_{m}^{\underline{k}}$ is an $(m-k)\times m$ matrix, $R_{m}^{\underline{k+1}}$ is an $(m-k-1)\times m$ matrix. Further, by writing $(R_{m-k})_{i,r}=\delta_{i,r}+\delta_{i+1,r}$ , we have

[TABLE]

so the formula holds by induction. ∎

We now introduce an object that will facilitate the proof of Theorem 3.12.

Definition 3.6.

Let $a<m$ and $b<n$ be nonnegative integers. The coefficient matrix $C_{m\times n}^{a,b}=(c_{i,j})$ is the unique $m\times n$ matrix such that $\sum_{i,j}\sigma_{\scaleobj{0.7}{\downarrow}}^{a}\sigma_{\scaleobj{0.7}{\rightarrow}}^{b}(A)_{i,j}=\sum_{i,j}c_{i,j}a_{i,j}$ for all $m\times n$ matrices $A$ . We abbreviate $C^{a,b}_{n}:=C^{a,b}_{n\times n}$ and $C^{a}_{m\times n}:=C^{a,a}_{m\times n}$ .

One interpretation of the coefficient matrix uses indeterminates. Let $X=(x_{i,j})$ be an $m\times n$ matrix of indeterminates; that is, the entries of $X$ are distinct symbols, not numbers. The entry $c_{i,j}$ of the coefficient matrix $C^{a,b}_{m\times n}$ is the sum of the coefficients of $x_{i,j}$ across all entries of $\sigma_{\scaleobj{0.7}{\downarrow}}^{a}\sigma_{\scaleobj{0.7}{\rightarrow}}^{b}(X)$ . Thus, one way to think of the coefficient matrix is that its $(i,j)$ th entry represents the number of times that $x_{i,j}$ appears in $\sigma_{\scaleobj{0.7}{\downarrow}}^{a}\sigma_{\scaleobj{0.7}{\rightarrow}}^{b}(X)$ .

We now work to describe the entries of the coefficient matrices explicitly.

Definition 3.7.

Let $A$ be an $m\times n$ matrix and $\mathbf{e}_{n}$ be the $n\times 1$ vector in which each entry is $1$ . The column sum vector of $A$ is $\alpha=A^{T}\mathbf{e}_{m}$ , and the row sum vector of $A$ is $\beta=A\mathbf{e}_{n}$ . That is, $\alpha_{j}$ is the sum of the elements in the $j$ th column of $A$ , and $\beta_{j}$ is the sum of the elements of the $j$ th row of $A$ .

Lemma 3.8.

Let $X=(x_{i,j})$ be a matrix of indeterminates and $A$ and $B$ be matrices such that the product $AXB$ is defined. If $\alpha$ is the column sum vector of $A$ and $\beta$ is the row sum vector of $B$ , then the coefficient of $x_{p,q}$ in the formal expression $\sum_{i,j}(AXB)_{i,j}$ is $\alpha_{p}\beta_{q}$ .

Proof.

Choose any indeterminate $x_{p,q}$ . We have

[TABLE]

We obtain the coefficient of $x_{p,q}$ by summing only those terms where $k=p$ and $r=q$ . This coefficient is thus

[TABLE]

The left term in this product is $\alpha_{p}$ , and the right term is $\beta_{q}$ . ∎

Proposition 3.9.

Let $\alpha$ be the column sum vector of $R_{m}^{\underline{a}}$ and $\beta$ be the column sum vector of $R_{n}^{\underline{b}}$ . Then $C_{m\times n}^{a,b}=\alpha\beta^{T}$ .

Proof.

Let $X$ be an $m\times n$ matrix of indeterminates. Apply Lemma 3.8 to $\sigma_{\scaleobj{0.7}{\downarrow}}^{a}\sigma_{\scaleobj{0.7}{\rightarrow}}^{b}(X)=(R_{m}^{\underline{a}})X(R_{n}^{\underline{b}})^{T}$ . The row sum vector of $(R_{n}^{\underline{b}})^{T}$ is simply the column sum vector of $R_{n}^{\underline{b}}$ . The sum in Lemma 3.8 is the sum that defines the coefficient matrix, so the $(i,j)$ th entry of the coefficient matrix $C^{a,b}_{m\times n}$ is $(\alpha\beta^{T})_{i,j}=\alpha_{i}\beta_{j}$ . ∎

Proposition 3.9 implicitly gives the following formula for coefficient matrices.

Corollary 3.10.

Let $m$ and $n$ be positive integers and $a<m$ and $b<n$ be nonnegative integers. The coefficient matrix $C_{m\times n}^{a,b}$ has entries $c_{i,j}=\big{[}\!\sum_{\ell=1}^{m-a}\binom{a}{i-\ell}\big{]}\!\big{[}\!\sum_{\ell=1}^{n-b}\binom{b}{j-\ell}\big{]}$ .

Proof.

Proposition 3.5 shows that

[TABLE]

A similar calculation holds for $\beta_{j}$ . ∎

Corollary 3.11.

Let $A$ be an $m\times n$ matrix. The value of the single entry of the matrix $\sigma_{\scaleobj{0.7}{\downarrow}}^{m-1}\sigma_{\scaleobj{0.7}{\rightarrow}}^{n-1}(A)$ is $\sum_{i=1}^{m}\sum_{j=1}^{n}\binom{m-1}{i-1}\binom{n-1}{j-1}a_{i,j}$ .

Proof.

Let $C_{m\times n}^{m-1,n-1}=(c_{i,j})$ . Since $\sigma_{\scaleobj{0.7}{\downarrow}}^{m-1}\sigma_{\scaleobj{0.7}{\rightarrow}}^{n-1}(A)$ has a single entry, we have

[TABLE]

by the definition of the coefficient matrix. Corollary 3.10 shows that $c_{i,j}=\binom{m-1}{i-1}\binom{n-1}{j-1}$ . ∎

The entries of $\sigma^{s}(A)$ are determined by the blocks of $A$ of size $(s+1)\times(s+1)$ . From this observation, Corollary 3.11 can be used to find the value of any entry in $\sigma^{s}(A)$ : simply apply the corollary to the submatrix $(a_{p+i,q+j})_{i,j=0}^{s}$ to determine the value of $\sigma^{s}(A)_{i,j}$ .

Recall that to apply a kernel matrix, we need to specify an edge-handling technique, wherein we extend the matrix $A$ to a matrix $A^{\prime}$ . Then applying the filter to $A$ with the edge-handling technique is equivalent (by definition) to applying the filter to $A^{\prime}$ with cropping.

Theorem 3.12.

Suppose a matrix $A$ and an edge-handling technique yielding the extension $A^{\prime}$ of $A$ are given. Then $G_{2r+1}\ast A=4^{-2r}\sigma^{2r}(A^{\prime})$ for all nonnegative integers $r$ .

Proof.

Each entry of $G_{2r+1}\ast A$ corresponds to a block of $A^{\prime}$ of size $(2r+1)\times(2r+1)$ . From equation (1), the value of the entry $(G_{2r+1}\ast A)_{p,q}$ is

[TABLE]

On the other hand, let $B:=(a^{\prime}_{p+i,q+j})_{i,j=-r}^{r}$ be a submatrix of $A^{\prime}$ . Applying Corollary 3.11 and then (1) gives

[TABLE]

Theorem 3.12 may be equivalently stated as an equality of operators:

[TABLE]

4 Further properties of the collapsing sum

4.1 Special classes of matrices

Definition 4.1.

A Toeplitz matrix is an $m\times n$ matrix $A$ with the property that $a_{i,j}=a_{k,\ell}$ whenever $i-j=k-\ell$ . We denote by $\operatorname{toep}(a_{-n+1},\dots,a_{m-1})$ the $m\times n$ Toeplitz matrix $A$ with entries $a_{i,j}=a_{i-j}$ .

Example 4.2.

The general $4\times 5$ Toeplitz matrix $\operatorname{toep}(a_{-4},\dots,a_{3})$ is

[TABLE]

Toeplitz matrices have applications in a wide variety of pure and applied areas, including representation theory, signal processing, differential and integral equations, and quantum mechanics. Moreover, every $n\times n$ matrix can be decomposed as the product of at most $2n+5$ Toeplitz matrices [4].

In what follows, we use $(m+1)\times(n+1)$ Toeplitz matrices $\operatorname{toep}(a_{-n},\dots,a_{m})$ to slightly simplify the statements of the results.

Proposition 4.3.

Let $A=\operatorname{toep}(a_{-n},\dots,a_{m})$ be an $(m+1)\times(n+1)$ Toeplitz matrix. Then $\sum_{k=-n}^{m}\binom{m+n}{n+k}a_{k}$ is the single entry of $\sigma_{\scaleobj{0.7}{\downarrow}}^{m}\sigma_{\scaleobj{0.7}{\rightarrow}}^{n}(A)$ .

Proof.

We can decompose the Toeplitz matrix into “stripes”:

[TABLE]

Since by Proposition 3.1 the collapsing sum distributes over addition, we need only consider the case when one $a_{k}$ is nonzero. Moreover, since $\sigma(cA)=c\sigma(A)$ , we can restrict to $a_{k}=1$ .

Therefore, suppose $A=\operatorname{toep}(0,\dots,0,1,0,\dots,0)$ , so that

[TABLE]

We use the convention that $\binom{n}{r}=0$ if $r<0$ or $r>n$ . Applying Corollary 3.11 and the binomial symmetry $\binom{n}{r}=\binom{n}{n-r}$ gives

[TABLE]

Applying the well-known identity $\sum_{i=0}^{m}\binom{m}{i}\binom{n}{r-i}=\binom{m+n}{r}$ finishes the proof. ∎

A direct application of Proposition 4.3 yields the following.

Corollary 4.4.

The single entry in $\sigma^{n}(I_{n+1})$ is the central binomial coefficient $\binom{2n}{n}$ .

A similar result holds for coefficient matrices.

Proposition 4.5.

The single entry in $\sigma^{n}(C_{n+1}^{n})$ is $\binom{2n}{n}^{2}$ .

Proof.

From Corollary 3.11, we have

[TABLE]

Using the identity $\sum_{k=0}^{n}\binom{n}{k}^{2}=\binom{2n}{n}$ completes the proof. ∎

Recall that $J_{m\times n}$ denotes the $m\times n$ matrix with each entry equal to $1$ .

Proposition 4.6.

Let $a<m$ and $b<n$ be nonnegative integers. Then $\sum_{i,j}(C_{m\times n}^{a,b})_{i,j}=2^{a+b}(m-a)(n-b)$ .

Proof.

It follows from Definition 3.6 that the sum of the entries of $C^{a,b}_{m\times n}$ is equal to the sum of the entries of $\sigma_{\scaleobj{0.7}{\downarrow}}^{a}\sigma_{\scaleobj{0.7}{\rightarrow}}^{b}(J_{m\times n})$ . Each $(a+1)\times(b+1)$ block of $J_{m\times n}$ is $J_{(a+1)\times(b+1)}$ , so $\sigma_{\scaleobj{0.7}{\downarrow}}^{a}\sigma_{\scaleobj{0.7}{\rightarrow}}^{b}(J_{m\times n})_{i,j}=\sigma_{\scaleobj{0.7}{\downarrow}}^{a}\sigma_{\scaleobj{0.7}{\rightarrow}}^{b}(J_{(a+1)\times(b+1)})_{1,1}$ for all $1\leq i\leq m-a$ and $1\leq j\leq n-b$ . Since $J_{(a+1)\times(b+1)}$ is a Toeplitz matrix, Proposition 4.3 gives

[TABLE]

Noting that $\sigma_{\scaleobj{0.7}{\downarrow}}^{a}\sigma_{\scaleobj{0.7}{\rightarrow}}^{b}(J_{m\times n})$ has $(m-a)(n-b)$ entries completes the proof. ∎

The proof of Corollary 3.11 shows that the Gaussian blur kernel $G_{2r+1}$ is proportional to the coefficient matrix $C_{2r+1}^{2r}$ . Similarly, the box blur kernel $B_{2r+1}$ is proportional to the coefficient matrix $C_{2r+1}^{0}$ . The constant of proportionality in both cases is $(\sum_{i,j}(C^{s}_{2r+1})_{i,j})^{-1}$ , where $s=2r$ or $s=0$ , respectively. Thus, the coefficient matrices are a generalization that unite these two filters. That is, the expression $(2^{a+b}(m-a)(n-b))^{-1}C^{a,b}_{m\times n}$ provides an interpolation between box blur and Gaussian blur.

4.2 Further connections with Gaussian blur

Waltz and Miller [3] extend their techniques to non-square blurs, that is, Gaussian-like blurs using non-square kernel matrices. These can be defined in parallel to the square Gaussian blurs. If $G_{a\times b}$ denotes the kernel for the $a\times b$ Gaussian blur, then

[TABLE]

Since either $a$ or $b$ might be even, there may be no central element, so here we index from $(1,1)$ in the top left corner of the matrix.

The kernel matrix $G_{a\times b}$ is proportional to the coefficient matrix for a fully collapsed $a\times b$ matrix. This extends Theorem 3.12, since convolving $G_{a\times b}$ with a matrix $A$ is equivalent to applying $2^{-(a+b-2)}\sigma_{\scaleobj{0.7}{\downarrow}}^{a-1}\sigma_{\scaleobj{0.7}{\rightarrow}}^{b-1}$ to the extended matrix $A^{\prime}$ .

The authors also venture into higher dimensions and discuss higher-dimensional blurs. We can easily transfer this idea to the language of the collapsing sum. Suppose we want to collapse (or, equivalently, blur) an $n$ -dimensional array. We can define $\sigma_{\vec{\imath}}$ , for $1\leq i\leq n$ , to be the operator that “collapses” the array in the $i$ th direction, akin to the effects of $\sigma_{\scaleobj{0.7}{\downarrow}}$ and $\sigma_{\scaleobj{0.7}{\rightarrow}}$ in two dimensions. Define $\sigma_{n}:=\sigma_{\vec{1}}\cdots\sigma_{\vec{n}}$ . Then powers of $2^{-n}\sigma_{n}$ give the higher-dimensional blur that Waltz and Miler describe. As before, general rectangular blurs are obtained by simply composing the operators $\frac{1}{2}\sigma_{\vec{\imath}}$ for various values of $i$ .

4.3 A generalized collapsing sum

Waltz and Miller’s algorithm for Gaussian blur may be extended to an operator that returns weighted sums of entries.

Definition 4.7.

Let $\gamma$ be an $b_{1}\times b_{2}$ matrix and $A$ be an $m\times n$ matrix with $m,n\geq\max\{b_{1},b_{2}\}$ . Then $\sigma_{\gamma}(A)$ is an $(m-b_{1})\times(n-b_{2})$ matrix with

[TABLE]

If $\gamma=\left(\begin{smallmatrix}1&1\\ 1&1\end{smallmatrix}\right)$ , then we recover the original collapsing sum. Moreover, if $\gamma=(1\,1)$ , then $\sigma_{\gamma}=\sigma_{\scaleobj{0.7}{\rightarrow}}$ , and $\sigma_{\gamma^{T}}=\sigma_{\scaleobj{0.7}{\downarrow}}$ .

For any matrix $\gamma$ of rank $1$ , there exist two column vectors $\rho$ and $\varphi$ such that $\gamma=\rho\varphi^{T}$ . Waltz and Miller’s algorithm may be easily adapted for any $2\times 2$ rank-1 matrix. Our previous results on the collapsing sum may also be extended to $\sigma_{\gamma}$ for any (not necessarily square) matrix $\gamma$ of rank $1$ .

Definition 4.8.

Let $\varphi$ be a column vector with $k$ entries. Let $\varphi$ be a column vector with $k$ entries. The $(m-k+1)\times m$ matrix $R^{\varphi}_{m}$ has entries $(R^{\varphi}_{m})_{p,q}=\sum_{i=0}^{k-1}\varphi_{i+1}\delta_{p+i,q}$ .

Again, notice that if $\varphi=(1\,1)$ , then $R^{\varphi}_{m}=R_{m}$ . We define the falling powers of these matrices analogously to those of $R_{m}$ . A generalized form of Proposition 3.9 holds in that, for any $m\times n$ matrix $A$ ,

[TABLE]

In particular, if $\gamma=\rho\varphi^{T}$ , then

[TABLE]

Similar extensions may be obtained for other results, including the entries of the corresponding coefficient matrices.

5 Conclusion

By introducing the collapsing sum operators, we have provided a new combinatorial way to view Gaussian blur. We established the close connection between these concepts and also established a collection of theoretical results on the collapsing sum.

It would be interesting to study the collapsing sum as a matrix operator in its own right. For example, if $G$ is an abelian group and $G^{m\times n}$ represents the additive group of $m\times n$ matrices with entries in $G$ , then the collapsing sum is a map from $G^{m\times n}$ to $G^{(m-1)\times(n-1)}$ . What are the combinatorial and algebraic properties of this map?

Acknowledgements

The author would like to thank Samuel Gutekunst for his invaluable guidance and support, as well as Mike Orrison, Elizabeth Sattler, and the anonymous reviewers, whose insightful comments and suggestions greatly increased the quality of this paper.

Bibliography4

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] D. Charalampidis. Recursive implementation of the Gaussian filter using truncated cosine functions. Transactions on Signal Processing , 64(14):3554–3565, 2016.
2[2] E. Elboher and M. Werman. Efficient and accurate Gaussian image filtering using running sums. In 2012 12th International Conference on Intelligent Systems Design and Applications (ISDA) , pages 897–902. IEEE, 2012.
3[3] F. Waltz and J. Miller. An efficient algorithm for Gaussian blur using finite-state machines. In SPIE Conference on Machine Vision Systems for Inspection and Metrology VII , pages 334–341, 1998.
4[4] K. Ye and L.-H. Lim. Every matrix is a product of Toeplitz matrices. Foundations of Computational Mathematics , 16:577–598, 2016.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

A combinatorial interpretation of Gaussian blur

Abstract

1 Introduction

Definition 1.1**.**

2 Background

Definition 2.1**.**

Definition 2.2**.**

Example 2.3**.**

Example 2.4**.**

3 Equivalence of Gaussian blur and collapsing sum

Proposition 3.1**.**

Definition 3.2**.**

Example 3.3**.**

Proposition 3.4**.**

Proof.

Proposition 3.5**.**

Proof.

Definition 3.6**.**

Definition 3.7**.**

Lemma 3.8**.**

Proof.

Proposition 3.9**.**

Proof.

Corollary 3.10**.**

Proof.

Corollary 3.11**.**

Proof.

Theorem 3.12**.**

Proof.

4 Further properties of the collapsing sum

4.1 Special classes of matrices

Definition 4.1**.**

Example 4.2**.**

Proposition 4.3**.**

Proof.

Corollary 4.4**.**

Proposition 4.5**.**

Proof.

Proposition 4.6**.**

Proof.

4.2 Further connections with Gaussian blur

4.3 A generalized collapsing sum

Definition 4.7**.**

Definition 4.8**.**

5 Conclusion

Acknowledgements

Definition 1.1.

Definition 2.1.

Definition 2.2.

Example 2.3.

Example 2.4.

Proposition 3.1.

Definition 3.2.

Example 3.3.

Proposition 3.4.

Proposition 3.5.

Definition 3.6.

Definition 3.7.

Lemma 3.8.

Proposition 3.9.

Corollary 3.10.

Corollary 3.11.

Theorem 3.12.

Definition 4.1.

Example 4.2.

Proposition 4.3.

Corollary 4.4.

Proposition 4.5.

Proposition 4.6.

Definition 4.7.

Definition 4.8.