A Statistical View on Synthetic Aperture Imaging for Occlusion Removal

Indrajit Kurmi; David C. Schedl; Oliver Bimber

arXiv:1906.06600·cs.GR·June 18, 2019

A Statistical View on Synthetic Aperture Imaging for Occlusion Removal

Indrajit Kurmi, David C. Schedl, Oliver Bimber

PDF

TL;DR

This paper explores the limits of synthetic aperture imaging for occlusion removal, revealing practical constraints on aperture size and sampling density, and applying these insights to drone-based optical imaging.

Contribution

It provides a statistical analysis of sampling limits in synthetic aperture imaging, guiding optimal sensor and pattern design for occlusion removal applications.

Findings

01

Identifies practical limits to aperture size and sampling density.

02

Offers guidelines for designing efficient synthetic aperture sampling patterns.

03

Demonstrates application in drone-based optical sectioning for ground inspection.

Abstract

Synthetic apertures find applications in many fields, such as radar, radio telescopes, microscopy, sonar, ultrasound, LiDAR, and optical imaging. They approximate the signal of a single hypothetical wide aperture sensor with either an array of static small aperture sensors or a single moving small aperture sensor. Common sense in synthetic aperture sampling is that a dense sampling pattern within a wide aperture is required to reconstruct a clear signal. In this article we show that there exists practical limits to both, synthetic aperture size and number of samples for the application of occlusion removal. This leads to an understanding on how to design synthetic aperture sampling patterns and sensors in a most optimal and practically efficient way. We apply our findings to airborne optical sectioning which uses camera drones and synthetic aperture imaging to computationally remove…

Figures19

Click any figure to enlarge with its caption.

Equations32

D = 1 - (1 - D)^{l / o},

D = 1 - (1 - D)^{l / o},

\begin{split}V=&1-\widetilde{D}^{2}-\frac{\widetilde{D}(1-\widetilde{D})}{N^{2}}\times\\ &\Bigg{(}N+4\sum_{i=1}^{\sqrt{N}-1}\sum_{j=0}^{\sqrt{N}-1}(\sqrt{N}-i)(\sqrt{N}-j)\\ &\operatorname{Max}(0,(1-i\hat{d}))\operatorname{Max}(0,(1-j\hat{d}))\Bigg{)},\end{split}

\begin{split}V=&1-\widetilde{D}^{2}-\frac{\widetilde{D}(1-\widetilde{D})}{N^{2}}\times\\ &\Bigg{(}N+4\sum_{i=1}^{\sqrt{N}-1}\sum_{j=0}^{\sqrt{N}-1}(\sqrt{N}-i)(\sqrt{N}-j)\\ &\operatorname{Max}(0,(1-i\hat{d}))\operatorname{Max}(0,(1-j\hat{d}))\Bigg{)},\end{split}

V = 1 - D^{2} - \frac{D ( 1 - D )}{N} .

V = 1 - D^{2} - \frac{D ( 1 - D )}{N} .

V = (V - V_{min}) / (V_{max} - V_{min}) = 1 - \frac{1}{N} .

V = (V - V_{min}) / (V_{max} - V_{min}) = 1 - \frac{1}{N} .

a = (N - 1) b,

a = (N - 1) b,

X = \frac{1}{N} i = 1 \sum N Y_{i} .

X = \frac{1}{N} i = 1 \sum N Y_{i} .

V=1-\operatorname{MSE}=1-\operatorname{E}[{\big{(}X-R\big{)}}^{2}],

V=1-\operatorname{MSE}=1-\operatorname{E}[{\big{(}X-R\big{)}}^{2}],

MSE = E [X^{2}] = E [X]^{2} + Var [X] .

MSE = E [X^{2}] = E [X]^{2} + Var [X] .

\operatorname{E}[X]=\operatorname{E}\bigg{[}\frac{1}{N}\sum_{i=1}^{N}Y_{i}\bigg{]}=\frac{1}{N}\sum_{i=1}^{N}\operatorname{E}\big{[}Y_{i}\big{]}=\operatorname{E}\big{[}Y_{i}\big{]},

\operatorname{E}[X]=\operatorname{E}\bigg{[}\frac{1}{N}\sum_{i=1}^{N}Y_{i}\bigg{]}=\frac{1}{N}\sum_{i=1}^{N}\operatorname{E}\big{[}Y_{i}\big{]}=\operatorname{E}\big{[}Y_{i}\big{]},

\begin{split}\operatorname{Var}[X]=&\operatorname{Var}\bigg{[}\frac{1}{N}\sum_{i=1}^{N}Y_{i}\bigg{]}\\ =&\frac{1}{N^{2}}\bigg{(}\sum_{i=1}^{N}\operatorname{Var}[Y_{i}]+2\sum_{k>i}\operatorname{Cov}[Y_{i},Y_{k}]\bigg{)}\\ =&\frac{1}{N^{2}}\bigg{(}N\operatorname{Var}[Y_{i}]+2\sum_{i=1}^{N}\sum_{k>i}\operatorname{Cov}[Y_{i},Y_{k}]\bigg{)},\end{split}

\begin{split}\operatorname{Var}[X]=&\operatorname{Var}\bigg{[}\frac{1}{N}\sum_{i=1}^{N}Y_{i}\bigg{]}\\ =&\frac{1}{N^{2}}\bigg{(}\sum_{i=1}^{N}\operatorname{Var}[Y_{i}]+2\sum_{k>i}\operatorname{Cov}[Y_{i},Y_{k}]\bigg{)}\\ =&\frac{1}{N^{2}}\bigg{(}N\operatorname{Var}[Y_{i}]+2\sum_{i=1}^{N}\sum_{k>i}\operatorname{Cov}[Y_{i},Y_{k}]\bigg{)},\end{split}

Cov [Y_{i}, Y_{k}] = E [Y_{i} Y_{k}] - E [Y_{i}] E [Y_{k}] .

Cov [Y_{i}, Y_{k}] = E [Y_{i} Y_{k}] - E [Y_{i}] E [Y_{k}] .

\begin{split}\operatorname{E}[Y_{i}Y_{k}]=&\operatorname{E}\big{[}\operatorname{E}[Y_{i}Y_{k}\,|\,Y_{i}]\big{]}=\operatorname{E}\big{[}Y_{i}\operatorname{E}[Y_{k}\,|\,Y_{i}]\big{]}\\ =&\operatorname{E}\big{[}Y_{i}\big{(}(1-q_{ik})Y_{i}+q_{ik}Y_{k}\big{)}\big{]}\\ =&(1-q_{ik})\operatorname{E}[Y_{i}^{2}]+q_{ik}\operatorname{E}[Y_{i}]\operatorname{E}[Y_{k}]\\ =&(1-q_{ik})\big{(}\operatorname{E}[Y_{i}]^{2}+\operatorname{Var}[Y_{i}]\big{)}+q_{ik}\operatorname{E}[Y_{i}]\operatorname{E}[Y_{k}].\end{split}

\begin{split}\operatorname{E}[Y_{i}Y_{k}]=&\operatorname{E}\big{[}\operatorname{E}[Y_{i}Y_{k}\,|\,Y_{i}]\big{]}=\operatorname{E}\big{[}Y_{i}\operatorname{E}[Y_{k}\,|\,Y_{i}]\big{]}\\ =&\operatorname{E}\big{[}Y_{i}\big{(}(1-q_{ik})Y_{i}+q_{ik}Y_{k}\big{)}\big{]}\\ =&(1-q_{ik})\operatorname{E}[Y_{i}^{2}]+q_{ik}\operatorname{E}[Y_{i}]\operatorname{E}[Y_{k}]\\ =&(1-q_{ik})\big{(}\operatorname{E}[Y_{i}]^{2}+\operatorname{Var}[Y_{i}]\big{)}+q_{ik}\operatorname{E}[Y_{i}]\operatorname{E}[Y_{k}].\end{split}

\begin{split}\operatorname{Cov}[Y_{i},Y_{k}]=&(1-q_{ik})\big{(}\operatorname{E}[Y_{i}]^{2}+\operatorname{Var}[Y_{i}]\big{)}+\\ &q_{ik}\operatorname{E}[Y_{i}]\operatorname{E}[Y_{k}]-\operatorname{E}[Y_{i}]\operatorname{E}[Y_{k}]\\ =&(1-q_{ik})\big{(}\operatorname{E}[Y_{i}]^{2}+\operatorname{Var}[Y_{i}]\big{)}+q_{ik}\operatorname{E}[Y_{i}]^{2}-\operatorname{E}[Y_{i}]^{2}\\ =&(1-q_{ik})\operatorname{Var}[Y_{i}].\end{split}

\begin{split}\operatorname{Cov}[Y_{i},Y_{k}]=&(1-q_{ik})\big{(}\operatorname{E}[Y_{i}]^{2}+\operatorname{Var}[Y_{i}]\big{)}+\\ &q_{ik}\operatorname{E}[Y_{i}]\operatorname{E}[Y_{k}]-\operatorname{E}[Y_{i}]\operatorname{E}[Y_{k}]\\ =&(1-q_{ik})\big{(}\operatorname{E}[Y_{i}]^{2}+\operatorname{Var}[Y_{i}]\big{)}+q_{ik}\operatorname{E}[Y_{i}]^{2}-\operatorname{E}[Y_{i}]^{2}\\ =&(1-q_{ik})\operatorname{Var}[Y_{i}].\end{split}

\begin{split}\operatorname{Var}[X]=&\frac{1}{N^{2}}\bigg{(}N\operatorname{Var}[Y_{i}]+2\sum_{i=1}^{N}\sum_{k>i}(1-q_{ik})\operatorname{Var}[Y_{i}]\bigg{)}\\ =&\frac{\operatorname{Var}[Y_{i}]}{N^{2}}\bigg{(}N+2\sum_{i=1}^{N}(N-i)(\operatorname{Max}(0,1-id/o))\bigg{)}.\end{split}

\begin{split}\operatorname{Var}[X]=&\frac{1}{N^{2}}\bigg{(}N\operatorname{Var}[Y_{i}]+2\sum_{i=1}^{N}\sum_{k>i}(1-q_{ik})\operatorname{Var}[Y_{i}]\bigg{)}\\ =&\frac{\operatorname{Var}[Y_{i}]}{N^{2}}\bigg{(}N+2\sum_{i=1}^{N}(N-i)(\operatorname{Max}(0,1-id/o))\bigg{)}.\end{split}

\begin{split}\operatorname{E}[X^{2}]=&\operatorname{E}[Y_{i}]^{2}+\frac{\operatorname{Var}[Y_{i}]}{N^{2}}\times\\ &\bigg{(}N+2\sum_{i=1}^{N}(N-i)(\operatorname{Max}(0,1-id/o))\bigg{)}\\ =&\widetilde{D}^{2}+\frac{\widetilde{D}(1-\widetilde{D})}{N^{2}}\cdot\\ &\bigg{(}N+2\sum_{i=1}^{N}(N-i)\operatorname{Max}\big{(}0,(1-i{d}/{o})\big{)}\bigg{)}.\end{split}

\begin{split}\operatorname{E}[X^{2}]=&\operatorname{E}[Y_{i}]^{2}+\frac{\operatorname{Var}[Y_{i}]}{N^{2}}\times\\ &\bigg{(}N+2\sum_{i=1}^{N}(N-i)(\operatorname{Max}(0,1-id/o))\bigg{)}\\ =&\widetilde{D}^{2}+\frac{\widetilde{D}(1-\widetilde{D})}{N^{2}}\cdot\\ &\bigg{(}N+2\sum_{i=1}^{N}(N-i)\operatorname{Max}\big{(}0,(1-i{d}/{o})\big{)}\bigg{)}.\end{split}

\begin{split}\operatorname{E}[X^{2}]=&\widetilde{D}^{2}+\frac{\widetilde{D}(1-\widetilde{D})}{N^{2}}\times\\ &\Bigg{(}N+4\sum_{i=1}^{\sqrt{N}-1}\sum_{j=0}^{\sqrt{N}-1}(\sqrt{N}-i)(\sqrt{N}-j)\\ &\operatorname{Max}(0,(1-i\hat{d}))\operatorname{Max}(0,(1-j\hat{d}))\Bigg{)}.\end{split}

\begin{split}\operatorname{E}[X^{2}]=&\widetilde{D}^{2}+\frac{\widetilde{D}(1-\widetilde{D})}{N^{2}}\times\\ &\Bigg{(}N+4\sum_{i=1}^{\sqrt{N}-1}\sum_{j=0}^{\sqrt{N}-1}(\sqrt{N}-i)(\sqrt{N}-j)\\ &\operatorname{Max}(0,(1-i\hat{d}))\operatorname{Max}(0,(1-j\hat{d}))\Bigg{)}.\end{split}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A Statistical View on Synthetic Aperture Imaging for Occlusion Removal

Indrajit Kurmi, David C. Schedl, and Oliver Bimber Manuscript received March 26; revised May 3; accepted June 10, 2019.* Johannes Kepler University Linz, e-mail: [email protected].

Abstract

Synthetic apertures find applications in many fields, such as radar, radio telescopes, microscopy, sonar, ultrasound, LiDAR, and optical imaging. They approximate the signal of a single hypothetical wide aperture sensor with either an array of static small aperture sensors or a single moving small aperture sensor. Common sense in synthetic aperture sampling is that a dense sampling pattern within a wide aperture is required to reconstruct a clear signal. In this article we show that there exists practical limits to both, synthetic aperture size and number of samples for the application of occlusion removal. This leads to an understanding on how to design synthetic aperture sampling patterns and sensors in a most optimal and practically efficient way. We apply our findings to airborne optical sectioning which uses camera drones and synthetic aperture imaging to computationally remove occluding vegetation or trees for inspecting ground surfaces.

Index Terms:

Sensor Data Processing, Synthetic Aperture Imaging, Airborne Optical Sectioning, Light Fields.

I Introduction

Synthetic apertures (SA) approximate the signal of a single hypothetical wide aperture sensor with either an array of static small aperture sensors or a single moving small aperture sensor whose individual signals are computationally combined to increase resolution, depth-of-field, frame rate, contrast, and signal-to-noise ratio.

This principle has been used in many fields, such as for synthetic aperture radar (SAR) [1, 2, 3], synthetic aperture radio telescopes (SART) [4, 5], interferometric synthetic aperture microscopy (ISAM) [6], synthetic aperture sonar (SAS) [7, 8], synthetic aperture ultrasound (SAU) [9, 10], and synthetic aperture LiDAR (SAL) / synthetic aperture imaging laser (SAIL) [11, 12].

In the visible range, synthetic aperture imaging (SAI) [13, 14, 15, 16, 17, 18, 19, 20] has been used together with large camera arrays that capture structured light fields (regularly sampled multiscopic scene representations) and enable the computation of virtual views with maximal synthetic apertures that correspond to the physical size of the applied camera array. The wide aperture signal results in a shallow depth of field and consequently in a strong blur of out-of-focus occluders, while images of points in focus remain clearly visible. Shifting focus computationally allows optical slicing through dense occluder structures (such as bushes, leaves, tree branches, and coniferous trees), and discovery and inspection of concealed artefacts or objects behind the occluders.

With airborne optical sectioning (AOS) [21, 22], we apply camera drones for synthetic aperture imaging. It samples the optical signal of wide synthetic apertures (up to $100\text{\,}\mathrm{m}$ diameter) with multiscopic video images as an unstructured (irregularly sampled) light field to support optical slicing by image integration. By computationally removing occluding vegetation or trees when inspecting the ground surface, AOS supports various applications in archaeology, forestry, agriculture, and border control. Fig. 1 illustrates an example where AOS was used to uncover the ruins of a 19th century fortification system that is concealed by dense forest and shrubs. The interested reader is referred to [21, 22] for more details.

Compared to alternative airborne scanning technologies (such as LiDAR) AOS is cheaper, delivers surface colour information, achieves higher sampling resolutions, and (in contrast to photogrammetry) it does not suffer from inaccurate correspondence matches and long processing times. However, as SART and SAI, AOS is passive (it only receives reflected light and does not emit electromagnetic or sound waves to measure the backscattered signal). Thus, it relies on an external energy source (i.e., sunlight).

Common sense in synthetic aperture sampling is that a dense sampling pattern within a wide aperture is required to reconstruct a clear signal. In case of SAI, this implies that volumes of dense occluders require an unrealistically high number of image samples captured over a physically impractical aperture range. The disadvantage of a wide synthetic aperture for SAI is an increase of occlusion density for images captured at far distances and oblique angles (most extreme at the periphery of very wide synthetic apertures). Furthermore, the spatial resolution of camera recordings decreases with an increasing distance from the target object. For SAI, the individual resolutions of all recordings are averaged. Thus, the reconstructed spatial resolution drops with an increasing aperture diameter. The disadvantage of a high sampling rate is the high processing demand that, if too high, prevents from real-time visualization rates. A wide synthetic aperture and a high sampling rate also increase the capturing time (if sampled sequentially, as for AOS) or the complexity of the sensor (if sampled simultaneously, as for camera arrays).

In this article we show that there exists practical limits to both, synthetic aperture size and number of samples. This leads to an understanding on how to design synthetic aperture sampling patterns and sensors in a most optimal and practically efficient way.

We present three basic findings in this article: (1) There exists a limit to the baseline (distance) of sample positions. The minimal (optimal) baseline is the one that results in a disparity equal to the projected occluder size. Larger baselines do not improve visibility. (2) There exists a limit to achievable visibility improvement that depends on the density of the occluder volume. The maximum visibility gain is achieved at a density of $50\text{\,}\mathrm{\char 37\relax}$ . (3) The normalized visibility gain (normalized to the density-dependent, achievable range) is independent of the occlusion density. It is directly correlated to a fixed number of samples.

In the following, we will discuss these findings and explain how they lead to minimal synthetic apertures with a lowest number of samples. We present results based on a simplified mathematical model, simulations with a 3D LiDAR scan of forest (providing ground truth), and real AOS recordings of forest (without ground truth). For AOS recordings, we achieve an approximately $6.512.75$ times reduction of the synthetic aperture area and an approximately $1020$ times lower number of samples without significant loss in visibility.

Although we focus on SAI (in particular AOS), our findings might be transferable to other SA sensors that support occlusion removal.

II A Statistical Model for SAI

The key idea of our mathematical model is to consider the sampling process of SAI not in the common context of optics, where a wide aperture leads to a shallow depth of field and large point-spread of out-of-focus occluders. Instead, we understand SAI sampling and reconstruction as a variation of signal averaging that is explainable by statistical principles. In fact, synthetic aperture rendering (the computational process to reconstruct new images from captured SAI samples) is nothing more than averaging images at proper disparity shifts that correspond to the adjusted synthetic focal plane [23, 24].

If we consider the projected pixel-footprints of occluders as noise and the projected pixel-footprints of the target at a given synthetic focal plane as signal, we attenuate the occluders while amplifying the target when combining all SAI sample images. The reason for this is that all points on the selected synthetic focal plane that are imaged in all samples always project exactly to the same positions in the reconstructed image while all other points will project to different positions. From signal averaging theory we know that the signal-to-noise ratio (S/N) of noisy measurements improves proportional to $\sqrt{N}$ if $N$ samples with constant signal are averaged [25]. Thus, random noise in images is reduced by averaging multiple recordings of the same content. This principle does not apply to SAI as the random noise pattern (projected occluders) is mainly constant in each sample but differs only in disparity shifts that depend on sampling baselines and distances.

Our model assumes a volume of height $l$ , containing binary random occluders (assuming opaque occluders) of uniform size $o$ and cubic shape, uniform distribution (occluders cannot overlap in space), and uniform density $D$ at each slice of the volume (cf. Fig. 2a). Furthermore, it considers an orthographic projection for integrating all slices into sample images.

An occlusion density per slice of $D$ , an occluder size of $o$ , and a volume height of $l$ leads to an integrated occlusion density for each SAI sample after orthographic projection of

[TABLE]

where the ratio $l/o$ is unitless. A light ray passing through the volume in our model follows a binomial distribution. In (1) $(1-D)^{l/o}$ is the probability mass function of a non-occluded ray (interacting with zero occluders within its path through the volume). Thus, statistically, $\widetilde{D}$ is the probability of occlusion in a single SAI sample (cf. Fig. 2b).

When averaging $N$ such SAI samples that are 2D shifted by disparity $d$ , we statistically achieve a visibility probability of

[TABLE]

with $\hat{d}=d/o$ being the ratio of projected occluder size $o$ and disparity shift $d$ . Note that in (2), $o$ and $d$ are in SAI sample units (projected pixel distances in camera resolution), but $\hat{d}$ is unitless. The derivation of (2) is provided in the Appendix.

Fig. 3 illustrates the visibility improvement over varying disparity choices, different SAI sample densities, and an increasing number of SAI samples. In all cases, the visibility improvement settles at a constant maximum after an optimal disparity. This optimal disparity equals the projected occluder size $o$ , and is reached when $\hat{d}=1$ .

Fig. 4 shows visual results of a simulation. Noise reduction (visibility improvement with respect to the black background) settles at $d=5$ ( $\hat{d}=1$ ). With $d>5$ ( $\hat{d}>1$ ) only bokeh artefacts that visualize the sampling grid are emphasized. In Fig. 3, it is also shown that all such simulations match our model (2) precisely and under all conditions.

If physical capturing conditions, such as intrinsic camera parameters, resolution, and recording distance (flying altitude in case of AOS) are known, disparity in pixel directly translates to camera baseline in meters. This explains our first finding: (1) There exists a limit to the baseline (distance) of sample positions. The minimal (optimal) baseline is the one that results in a disparity equal to the projected occluder size. Larger baselines do not improve visibility.

If the optimal (or larger) disparity is achieved, then (2) can be simplified for $\hat{d}\geq 1$ to

[TABLE]

Fig. 5a plots (3) over various $\widetilde{D}$ and $N$ , and illustrates that there exists a lower bound of $V_{\text{min}}=1-\widetilde{D}$ for $N=1$ and an upper bound of $V_{\text{max}}=1-\widetilde{D}^{2}$ for $N=\infty$ . It also shows that the maximal visibility gain is achieved at $\widetilde{D}=$ 50\text{,}\mathrm{\char 37\relax}$$.

This explains our second finding: (2) There exists a limit to achievable visibility improvement that depends on the density of the occluder volume. The maximum visibility gain is achieved at a density of $50\text{\,}\mathrm{\char 37\relax}$ .

If we normalize (3) within the possible range ( $V_{\text{min}}$ to $V_{\text{max}}$ ), it becomes independent of the density (cf. Fig. 5b):

[TABLE]

This explains our third finding: (3) The normalized visibility gain is independent of the occlusion density. It is directly correlated to a fixed number of samples. Thus, a predefined tolerable visibility threshold leads to a constant number of required SAI samples—regardless of the occlusion density.

Note, that in reality we can neither consider sizes, densities, and distributions of occluders to be uniform, nor that orthographic projection applies to regular cameras. However, the idealized model described above opens a statistical take on SAI for making sampling decisions. In the subsequent chapter we compare our model with realistic data and show that our findings still hold.

III SAI under realistic conditions

We can compare our model with the non-uniform occlusion volume of a forest patch that has been scanned using LiDAR [26] (Fig. 6).

Fig. 7a plots $\widetilde{D}$ over $l/o$ for the LiDAR scan and for our model (1) using average values that are determined through the entire volume. The non-uniformity (occluder sizes, densities, disparities) of the LiDAR scan as well as the different projections (orthographic projection in the model vs. perspective projection in the scan) leads to the deviation from our model prediction. However, the tendential behaviour is the same. Furthermore, our model represents a worst-case upper bound in $\widetilde{D}$ since it considers uncorrelated noise while, in reality, individual occluder points are correlated (they form connected segments, such as branches and trunks).

Fig. 7b plots for the LiDAR scan the percentage of occluders whose sizes are projected to the optimal disparity ( $\hat{d}\geq 1$ ) over different choices of baselines $b$ . It indicates, for example, that from an axial sampling altitude of $50\text{\,}\mathrm{m}$ and for a baseline (lateral sampling distance) of $3.5\text{\,}\mathrm{m}$ , $98\text{\,}\mathrm{\char 37\relax}$ of all occluders project to equal or smaller pixel footprints than their corresponding disparities achieved with the baseline.

The size of the synthetic aperture (the lateral scanning area) depends on the chosen scanning baseline $b$ that directly relates to a particular $\hat{d}$ , and on the number of SAI samples $N$ being captured:

[TABLE]

where $a$ is the aperture diameter. Thus, multiple combinations of $N$ and $b$ (corresponds to $\hat{d}$ ) lead to the same $a$ . Our goal is to find a combination that minimizes $a$ (to maximize spatial resolution in the reconstruction and to reduce occlusion density due to oblique viewing angles) and $N$ (to support real-time visualization rates and minimize sampling time) that leads to a tolerable visibility gain.

When plotting the normalized visibility $\widehat{V}$ with respect to aperture diameter that would be needed to achieve a particular $\hat{d}$ with our model (Fig. 8a), it can be seen that for the optimal $\hat{d}=1$ and for a chosen visibility threshold of $\widehat{V}=$ 98\text{,}\mathrm{\char 37\relax} $(which correlates to a fixed $N$ of $50$ samples [(4)](#S2.E4)), the required aperture diameter is $6.1$ (unitless in our model). But [Fig. 8](#S3.F8)a also shows that if $\hat{d}$ is lower than $1$ the maximal visibility cannot be achieved for smaller apertures because projected occluder footprints are larger than the achieved disparities. In this case, the synthetic aperture is oversampled. If $\hat{d}$ is greater than $1$, the maximal visibility can also not be achieved for smaller apertures because of undersampling (the corresponding baselines are too large to support an adequate $N$ within the aperture). The same plot is shown for the LiDAR scan in [Fig. 8](#S3.F8)b, using the previously chosen baseline of $b=$3.5\text{\,}\mathrm{m}$ and $N=50$ which (based on our mode) should lead to $\widehat{V}=$ 98\text{,}\mathrm{\char 37\relax} $. The reason why it leads only to $\widehat{V}=$82.4\text{\,}\mathrm{\char 37\relax}$ is the non-uniformity of the forest that does not perfectly match our uniform model, as explained in Section II.

Fig. 9 illustrates visual synthetic aperture rendering results of the LiDAR scan (green) overlaid over a grayscale EIA resolution chart for increasing $N$ and $b$ . The synthetic focal plane is located at the target plane, and the axial sampling distance is $50\text{\,}\mathrm{m}$ (Fig. 6). The visibility $\widehat{V}$ is computed based on the fraction of non-occluded parts on the target plane. Note, that the theoretical optimum of SAI (for $N=\infty$ ; $\widehat{V}=$ 100\text{,}\mathrm{\char 37\relax} $) is the focussed image of the target at the synthetic focal plane overlaid by a constant bias which corresponds to the mean occlusion density ($\widetilde{D}$). For the LiDAR scan, this is $\widetilde{D}=$56.7\text{\,}\mathrm{\char 37\relax}$ , and the theoretical optimum will be an image of the EIA resolution chart with a $56.7\text{\,}\mathrm{\char 37\relax}$ greenish tint in our simulation. We use the structural similarity index (SSIM) [27] as a quantitative measure of visual difference between this theoretical optimum and other visual synthetic aperture rendering results. The SSIM is a common image quality metric which considers the fact that humans are highly adapted to extract structural information. Fig. 9 illustrates that SSIM values and visibility values are correlated, and that for baselines or number of SAI samples that differ from our selection ( $b=$ 3.5\text{,}\mathrm{m}$$, $N=50$ ), no significant improvement is achieved (only $4\text{\,}\mathrm{\char 37\relax}5\text{\,}\mathrm{\char 37\relax}$ in visibility and SSIM).

We can now apply our findings to the real AOS scan shown in Fig. 1 [21]: This scan was initially brute force sampled from an axial distance (altitude) of $40\text{\,}\mathrm{m}$ , with a baseline of $b=$ 2\text{,}\mathrm{m} $, and with $N=504$ samples within a synthetic aperture of $2500\text{\,}{\mathrm{m}}^{2}$ ($a=$50\text{\,}\mathrm{m}$ ). If we consider the forest in the LiDAR dataset to be statistically representative to the forest in the AOS scan, then our model would suggest a baseline of $b=$ 2.8\text{,}\mathrm{m} $(with respect to the different axial scanning distances in both cases: $b=$3.5\text{\,}\mathrm{m}$\cdot$40\text{\,}\mathrm{m}$/$50\text{\,}\mathrm{m}$ ). This is the baseline at which we expect $98\text{\,}\mathrm{\char 37\relax}$ of all occluders to project to the optimal disparity ( $\hat{d}\geq 1$ ), as shown in Fig. 7.

Fig. 10 illustrates synthetic aperture rendering results of the AOS scan at a varying numbers of SAI samples $N$ that, together with the fixed baseline of $b=$ 2.8\text{,}\mathrm{m} $lead to particular synthetic apertures of diameters $a$. With $N=50$ samples, we expect (based on the chosen normalized visibility threshold of $\widehat{V}=$98\text{\,}\mathrm{\char 37\relax}$ , as shown in Fig. 8a and in Fig. 5b) no significant gain in visibility for a larger number of SAI samples. But we are aware of the fact that the visibility which can be practically achieved will be lower than $98\text{\,}\mathrm{\char 37\relax}$ due to the non-uniformity of the forest, as shown in Fig. 8b. Since for the AOS scan no ground truth exists, we cannot determine visibility in a quantitative way. Instead, we compare new sampling results against the brute-force sampled result (using $N=504$ , $a=$ 50\text{,}\mathrm{m} $, $b=$2\text{\,}\mathrm{m}$ ) that we consider an approximation to the theoretical optimum. Visual similarity is determined again with the structural similarity index metric (SSIM) [27].

Fig. 11 plots all SSIM values for $a\leq$ 50\text{,}\mathrm{m} $, and indicates that even an aperture diameter of $a=$14\text{\,}\mathrm{m}$ with $N=25$ SAI samples (this corresponds to a normalized visibility threshold of $\widehat{V}=96\%$ ) would lead to no significant visual reduction compared to an aperture of $a=$ 50\text{,}\mathrm{m}$$ with $N=504$ samples.

These results imply that the synthetic aperture for this scene can be significantly smaller (by a factor of $\approx$ $6.512.75$ in area and by $\approx$ $1020$ in number of samples).

IV Conclusion and Future Work

In this article we have presented three findings that lead to a basic understanding on how to design synthetic aperture sampling patterns: (1) There exists a limit to the baseline (distance) of sample positions. The minimal (optimal) baseline is the one that results in a disparity equal to the projected occluder size. Larger baselines do not improve visibility. (2) There exists a limit to achievable visibility improvement that depends on the density of the occluder volume. The maximum visibility gain is achieved at a density of $50\text{\,}\mathrm{\char 37\relax}$ . (3) The normalized visibility gain is independent of the occlusion density. It is directly correlated to a fixed number of samples. For AOS and the evaluated datasets, these findings result in much smaller synthetic aperture areas with significantly less samples.

The key idea of our approach is to consider the sampling process of SAI not in the common context of optics, where a wide aperture leads to a shallow depth of field and large point-spread of out-of-focus occluders. Instead, we understand SAI sampling and reconstruction as a variation of signal averaging that is explainable by statistical principles.

In future, we want to explore better models that consider the non-uniformity of known occlusion volumes, such as forests where (from tree crowns to the ground) occluder size increases while density decreases. Furthermore, we want to investigate the potential non-uniform coded sampling patterns, and techniques that enable a measurement or a better approximation of local occluder sizes and densities. Finally, we are interested in evaluating our model with different forest types (such as conifer or rain forest) and datasets. Thereby, measurements and records from forestry research might guide the parametrization of occluder sizes and densities.

[Derivation of the Visibility Probability] For a given focal plane, synthetic aperture rendering integrates $N$ disparity-shifted SAI samples $Y_{i}$ :

[TABLE]

We define visibility $V$ based on the mean squared error (MSE) between the synthetic aperture rendering result $X$ and an image $R$ of the non-occluded target on the focal plane:

[TABLE]

where $\operatorname{E}$ indicates the expectation (i.e., mean). For simplicity, we assume $R=0$ in the following, hence

[TABLE]

Each SAI sample has equal mean and variance (i.e., $\operatorname{E}[Y_{i}]=\operatorname{E}[Y_{k}]$ and $\operatorname{Var}[Y_{i}]=\operatorname{Var}[Y_{k}]$ ), behaves like a Bernoulli trial, and is a mixture of Bernoulli distributions:

[TABLE]

The probability that the same occluder is not projected to the same pixel in two SAI samples ( $Y_{i}$ and $Y_{k}$ ) is $q_{ik}=\operatorname{Min}(1,d_{ik}/o)$ , where $d_{ik}$ is the total disparity of the occluder in $Y_{i}$ and $Y_{k}$ .

Because of the conditional dependence between $Y_{i}$ and $Y_{k}$ (being shifted instances of the same image), $\operatorname{E}[Y_{i}Y_{k}]$ can be expressed as

[TABLE]

Putting (12) into (11) yields

[TABLE]

Thus, if $q_{ik}=1$ (when $d_{ik}/o\geq 1$ ) the covariance $\operatorname{Cov}[Y_{i},Y_{k}]=0$ . Putting (13) into (10) and simplifying yields:

[TABLE]

Note, that only the co-variance depends on $d$ and $o$ . Putting (14) and (9) into (8) yield:

[TABLE]

Note, that (15) considers 1D disparities, only. But it can be easily extended for 2D disparities:

[TABLE]

Thus, $V=1-\operatorname{E}[X^{2}]$ leads to (2).

Acknowledgment

The authors would like to thank Bettina Gruen of Johannes Kepler University Linz for her help on deriving (2). This research was funded by the Austrian Science Fund (FWF) under grant number P 32185-NBL.

Bibliography27

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. Moreira, P. Prats-Iraola, M. Younis, G. Krieger, I. Hajnsek, and K. P. Papathanassiou, “A tutorial on synthetic aperture radar,” IEEE Geoscience and Remote Sensing Magazine , vol. 1, no. 1, pp. 6–43, March 2013.
2[2] C. J. Li and H. Ling, “Synthetic aperture radar imaging using a small consumer drone,” in 2015 IEEE International Symposium on Antennas and Propagation USNC/URSI National Radio Science Meeting , July 2015, pp. 685–686.
3[3] P. A. Rosen, S. Hensley, I. R. Joughin, F. K. Li, S. N. Madsen, E. Rodriguez, and R. M. Goldstein, “Synthetic aperture radar interferometry,” Proceedings of the IEEE , vol. 88, no. 3, pp. 333–382, March 2000.
4[4] R. Levanda and A. Leshem, “Synthetic aperture radio telescopes,” Signal Processing Magazine, IEEE , vol. 27, pp. 14 – 29, 02 2010.
5[5] D. Dravins, T. Lagadec, and P. D. Nuñez, “Optical aperture synthesis with electronically connected telescopes.” Nature communications , vol. 6, p. 6852, Apr 2015.
6[6] T. S. Ralston, D. L. Marks, P. S. Carney, and S. A. Boppart, “Interferometric synthetic aperture microscopy (ISAM),” Nature Physics , pp. 965–1004, 2007.
7[7] M. P. Hayes and P. T. Gough, “Synthetic aperture sonar: a review of current status,” IEEE Journal of Oceanic Engineering , vol. 34, no. 3, pp. 207–224, 2009.
8[8] R. E. Hansen, “Introduction to synthetic aperture sonar,” in Sonar Systems Edited . In Tech Published, 2011. [Online]. Available: http://www.intechopen.com/books/sonar-systems/introduction-to-synthetic-aperture-sonar