A Statistical View on Synthetic Aperture Imaging for Occlusion Removal
Indrajit Kurmi, David C. Schedl, Oliver Bimber

TL;DR
This paper explores the limits of synthetic aperture imaging for occlusion removal, revealing practical constraints on aperture size and sampling density, and applying these insights to drone-based optical imaging.
Contribution
It provides a statistical analysis of sampling limits in synthetic aperture imaging, guiding optimal sensor and pattern design for occlusion removal applications.
Findings
Identifies practical limits to aperture size and sampling density.
Offers guidelines for designing efficient synthetic aperture sampling patterns.
Demonstrates application in drone-based optical sectioning for ground inspection.
Abstract
Synthetic apertures find applications in many fields, such as radar, radio telescopes, microscopy, sonar, ultrasound, LiDAR, and optical imaging. They approximate the signal of a single hypothetical wide aperture sensor with either an array of static small aperture sensors or a single moving small aperture sensor. Common sense in synthetic aperture sampling is that a dense sampling pattern within a wide aperture is required to reconstruct a clear signal. In this article we show that there exists practical limits to both, synthetic aperture size and number of samples for the application of occlusion removal. This leads to an understanding on how to design synthetic aperture sampling patterns and sensors in a most optimal and practically efficient way. We apply our findings to airborne optical sectioning which uses camera drones and synthetic aperture imaging to computationally remove…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A Statistical View on Synthetic Aperture Imaging for Occlusion Removal
Indrajit Kurmi, David C. Schedl, and Oliver Bimber Manuscript received March 26; revised May 3; accepted June 10, 2019.* Johannes Kepler University Linz, e-mail: [email protected].
Abstract
Synthetic apertures find applications in many fields, such as radar, radio telescopes, microscopy, sonar, ultrasound, LiDAR, and optical imaging. They approximate the signal of a single hypothetical wide aperture sensor with either an array of static small aperture sensors or a single moving small aperture sensor. Common sense in synthetic aperture sampling is that a dense sampling pattern within a wide aperture is required to reconstruct a clear signal. In this article we show that there exists practical limits to both, synthetic aperture size and number of samples for the application of occlusion removal. This leads to an understanding on how to design synthetic aperture sampling patterns and sensors in a most optimal and practically efficient way. We apply our findings to airborne optical sectioning which uses camera drones and synthetic aperture imaging to computationally remove occluding vegetation or trees for inspecting ground surfaces.
Index Terms:
Sensor Data Processing, Synthetic Aperture Imaging, Airborne Optical Sectioning, Light Fields.
I Introduction
Synthetic apertures (SA) approximate the signal of a single hypothetical wide aperture sensor with either an array of static small aperture sensors or a single moving small aperture sensor whose individual signals are computationally combined to increase resolution, depth-of-field, frame rate, contrast, and signal-to-noise ratio.
This principle has been used in many fields, such as for synthetic aperture radar (SAR) [1, 2, 3], synthetic aperture radio telescopes (SART) [4, 5], interferometric synthetic aperture microscopy (ISAM) [6], synthetic aperture sonar (SAS) [7, 8], synthetic aperture ultrasound (SAU) [9, 10], and synthetic aperture LiDAR (SAL) / synthetic aperture imaging laser (SAIL) [11, 12].
In the visible range, synthetic aperture imaging (SAI) [13, 14, 15, 16, 17, 18, 19, 20] has been used together with large camera arrays that capture structured light fields (regularly sampled multiscopic scene representations) and enable the computation of virtual views with maximal synthetic apertures that correspond to the physical size of the applied camera array. The wide aperture signal results in a shallow depth of field and consequently in a strong blur of out-of-focus occluders, while images of points in focus remain clearly visible. Shifting focus computationally allows optical slicing through dense occluder structures (such as bushes, leaves, tree branches, and coniferous trees), and discovery and inspection of concealed artefacts or objects behind the occluders.
With airborne optical sectioning (AOS) [21, 22], we apply camera drones for synthetic aperture imaging. It samples the optical signal of wide synthetic apertures (up to diameter) with multiscopic video images as an unstructured (irregularly sampled) light field to support optical slicing by image integration. By computationally removing occluding vegetation or trees when inspecting the ground surface, AOS supports various applications in archaeology, forestry, agriculture, and border control. Fig. 1 illustrates an example where AOS was used to uncover the ruins of a 19th century fortification system that is concealed by dense forest and shrubs. The interested reader is referred to [21, 22] for more details.
Compared to alternative airborne scanning technologies (such as LiDAR) AOS is cheaper, delivers surface colour information, achieves higher sampling resolutions, and (in contrast to photogrammetry) it does not suffer from inaccurate correspondence matches and long processing times. However, as SART and SAI, AOS is passive (it only receives reflected light and does not emit electromagnetic or sound waves to measure the backscattered signal). Thus, it relies on an external energy source (i.e., sunlight).
Common sense in synthetic aperture sampling is that a dense sampling pattern within a wide aperture is required to reconstruct a clear signal. In case of SAI, this implies that volumes of dense occluders require an unrealistically high number of image samples captured over a physically impractical aperture range. The disadvantage of a wide synthetic aperture for SAI is an increase of occlusion density for images captured at far distances and oblique angles (most extreme at the periphery of very wide synthetic apertures). Furthermore, the spatial resolution of camera recordings decreases with an increasing distance from the target object. For SAI, the individual resolutions of all recordings are averaged. Thus, the reconstructed spatial resolution drops with an increasing aperture diameter. The disadvantage of a high sampling rate is the high processing demand that, if too high, prevents from real-time visualization rates. A wide synthetic aperture and a high sampling rate also increase the capturing time (if sampled sequentially, as for AOS) or the complexity of the sensor (if sampled simultaneously, as for camera arrays).
In this article we show that there exists practical limits to both, synthetic aperture size and number of samples. This leads to an understanding on how to design synthetic aperture sampling patterns and sensors in a most optimal and practically efficient way.
We present three basic findings in this article: (1) There exists a limit to the baseline (distance) of sample positions. The minimal (optimal) baseline is the one that results in a disparity equal to the projected occluder size. Larger baselines do not improve visibility. (2) There exists a limit to achievable visibility improvement that depends on the density of the occluder volume. The maximum visibility gain is achieved at a density of . (3) The normalized visibility gain (normalized to the density-dependent, achievable range) is independent of the occlusion density. It is directly correlated to a fixed number of samples.
In the following, we will discuss these findings and explain how they lead to minimal synthetic apertures with a lowest number of samples. We present results based on a simplified mathematical model, simulations with a 3D LiDAR scan of forest (providing ground truth), and real AOS recordings of forest (without ground truth). For AOS recordings, we achieve an approximately times reduction of the synthetic aperture area and an approximately times lower number of samples without significant loss in visibility.
Although we focus on SAI (in particular AOS), our findings might be transferable to other SA sensors that support occlusion removal.
II A Statistical Model for SAI
The key idea of our mathematical model is to consider the sampling process of SAI not in the common context of optics, where a wide aperture leads to a shallow depth of field and large point-spread of out-of-focus occluders. Instead, we understand SAI sampling and reconstruction as a variation of signal averaging that is explainable by statistical principles. In fact, synthetic aperture rendering (the computational process to reconstruct new images from captured SAI samples) is nothing more than averaging images at proper disparity shifts that correspond to the adjusted synthetic focal plane [23, 24].
If we consider the projected pixel-footprints of occluders as noise and the projected pixel-footprints of the target at a given synthetic focal plane as signal, we attenuate the occluders while amplifying the target when combining all SAI sample images. The reason for this is that all points on the selected synthetic focal plane that are imaged in all samples always project exactly to the same positions in the reconstructed image while all other points will project to different positions. From signal averaging theory we know that the signal-to-noise ratio (S/N) of noisy measurements improves proportional to if samples with constant signal are averaged [25]. Thus, random noise in images is reduced by averaging multiple recordings of the same content. This principle does not apply to SAI as the random noise pattern (projected occluders) is mainly constant in each sample but differs only in disparity shifts that depend on sampling baselines and distances.
Our model assumes a volume of height , containing binary random occluders (assuming opaque occluders) of uniform size and cubic shape, uniform distribution (occluders cannot overlap in space), and uniform density at each slice of the volume (cf. Fig. 2a). Furthermore, it considers an orthographic projection for integrating all slices into sample images.
An occlusion density per slice of , an occluder size of , and a volume height of leads to an integrated occlusion density for each SAI sample after orthographic projection of
[TABLE]
where the ratio is unitless. A light ray passing through the volume in our model follows a binomial distribution. In (1) is the probability mass function of a non-occluded ray (interacting with zero occluders within its path through the volume). Thus, statistically, is the probability of occlusion in a single SAI sample (cf. Fig. 2b).
When averaging such SAI samples that are 2D shifted by disparity , we statistically achieve a visibility probability of
[TABLE]
with being the ratio of projected occluder size and disparity shift . Note that in (2), and are in SAI sample units (projected pixel distances in camera resolution), but is unitless. The derivation of (2) is provided in the Appendix.
Fig. 3 illustrates the visibility improvement over varying disparity choices, different SAI sample densities, and an increasing number of SAI samples. In all cases, the visibility improvement settles at a constant maximum after an optimal disparity. This optimal disparity equals the projected occluder size , and is reached when .
Fig. 4 shows visual results of a simulation. Noise reduction (visibility improvement with respect to the black background) settles at (). With () only bokeh artefacts that visualize the sampling grid are emphasized. In Fig. 3, it is also shown that all such simulations match our model (2) precisely and under all conditions.
If physical capturing conditions, such as intrinsic camera parameters, resolution, and recording distance (flying altitude in case of AOS) are known, disparity in pixel directly translates to camera baseline in meters. This explains our first finding: (1) There exists a limit to the baseline (distance) of sample positions. The minimal (optimal) baseline is the one that results in a disparity equal to the projected occluder size. Larger baselines do not improve visibility.
If the optimal (or larger) disparity is achieved, then (2) can be simplified for to
[TABLE]
Fig. 5a plots (3) over various and , and illustrates that there exists a lower bound of for and an upper bound of for . It also shows that the maximal visibility gain is achieved at 50\text{,}\mathrm{\char 37\relax}$$.
This explains our second finding: (2) There exists a limit to achievable visibility improvement that depends on the density of the occluder volume. The maximum visibility gain is achieved at a density of .
If we normalize (3) within the possible range ( to ), it becomes independent of the density (cf. Fig. 5b):
[TABLE]
This explains our third finding: (3) The normalized visibility gain is independent of the occlusion density. It is directly correlated to a fixed number of samples. Thus, a predefined tolerable visibility threshold leads to a constant number of required SAI samples—regardless of the occlusion density.
Note, that in reality we can neither consider sizes, densities, and distributions of occluders to be uniform, nor that orthographic projection applies to regular cameras. However, the idealized model described above opens a statistical take on SAI for making sampling decisions. In the subsequent chapter we compare our model with realistic data and show that our findings still hold.
III SAI under realistic conditions
We can compare our model with the non-uniform occlusion volume of a forest patch that has been scanned using LiDAR [26] (Fig. 6).
Fig. 7a plots over for the LiDAR scan and for our model (1) using average values that are determined through the entire volume. The non-uniformity (occluder sizes, densities, disparities) of the LiDAR scan as well as the different projections (orthographic projection in the model vs. perspective projection in the scan) leads to the deviation from our model prediction. However, the tendential behaviour is the same. Furthermore, our model represents a worst-case upper bound in since it considers uncorrelated noise while, in reality, individual occluder points are correlated (they form connected segments, such as branches and trunks).
Fig. 7b plots for the LiDAR scan the percentage of occluders whose sizes are projected to the optimal disparity () over different choices of baselines . It indicates, for example, that from an axial sampling altitude of and for a baseline (lateral sampling distance) of , of all occluders project to equal or smaller pixel footprints than their corresponding disparities achieved with the baseline.
The size of the synthetic aperture (the lateral scanning area) depends on the chosen scanning baseline that directly relates to a particular , and on the number of SAI samples being captured:
[TABLE]
where is the aperture diameter. Thus, multiple combinations of and (corresponds to ) lead to the same . Our goal is to find a combination that minimizes (to maximize spatial resolution in the reconstruction and to reduce occlusion density due to oblique viewing angles) and (to support real-time visualization rates and minimize sampling time) that leads to a tolerable visibility gain.
When plotting the normalized visibility with respect to aperture diameter that would be needed to achieve a particular with our model (Fig. 8a), it can be seen that for the optimal and for a chosen visibility threshold of 98\text{,}\mathrm{\char 37\relax} (which correlates to a fixed $N$ of $50$ samples [(4)](#S2.E4)), the required aperture diameter is $6.1$ (unitless in our model). But [Fig. 8](#S3.F8)a also shows that if $\hat{d}$ is lower than $1$ the maximal visibility cannot be achieved for smaller apertures because projected occluder footprints are larger than the achieved disparities. In this case, the synthetic aperture is oversampled. If $\hat{d}$ is greater than $1$, the maximal visibility can also not be achieved for smaller apertures because of undersampling (the corresponding baselines are too large to support an adequate $N$ within the aperture). The same plot is shown for the LiDAR scan in [Fig. 8](#S3.F8)b, using the previously chosen baseline of $b=$3.5\text{\,}\mathrm{m} and which (based on our mode) should lead to 98\text{,}\mathrm{\char 37\relax}. The reason why it leads only to $\widehat{V}=$82.4\text{\,}\mathrm{\char 37\relax} is the non-uniformity of the forest that does not perfectly match our uniform model, as explained in Section II.
Fig. 9 illustrates visual synthetic aperture rendering results of the LiDAR scan (green) overlaid over a grayscale EIA resolution chart for increasing and . The synthetic focal plane is located at the target plane, and the axial sampling distance is (Fig. 6). The visibility is computed based on the fraction of non-occluded parts on the target plane. Note, that the theoretical optimum of SAI (for ; 100\text{,}\mathrm{\char 37\relax}) is the focussed image of the target at the synthetic focal plane overlaid by a constant bias which corresponds to the mean occlusion density ($\widetilde{D}$). For the LiDAR scan, this is $\widetilde{D}=$56.7\text{\,}\mathrm{\char 37\relax}, and the theoretical optimum will be an image of the EIA resolution chart with a greenish tint in our simulation. We use the structural similarity index (SSIM) [27] as a quantitative measure of visual difference between this theoretical optimum and other visual synthetic aperture rendering results. The SSIM is a common image quality metric which considers the fact that humans are highly adapted to extract structural information. Fig. 9 illustrates that SSIM values and visibility values are correlated, and that for baselines or number of SAI samples that differ from our selection (3.5\text{,}\mathrm{m}$$, ), no significant improvement is achieved (only in visibility and SSIM).
We can now apply our findings to the real AOS scan shown in Fig. 1 [21]: This scan was initially brute force sampled from an axial distance (altitude) of , with a baseline of 2\text{,}\mathrm{m}, and with $N=504$ samples within a synthetic aperture of $2500\text{\,}{\mathrm{m}}^{2}$ ($a=$50\text{\,}\mathrm{m}). If we consider the forest in the LiDAR dataset to be statistically representative to the forest in the AOS scan, then our model would suggest a baseline of 2.8\text{,}\mathrm{m} (with respect to the different axial scanning distances in both cases: $b=$3.5\text{\,}\mathrm{m}$\cdot$40\text{\,}\mathrm{m}$/$50\text{\,}\mathrm{m}). This is the baseline at which we expect of all occluders to project to the optimal disparity (), as shown in Fig. 7.
Fig. 10 illustrates synthetic aperture rendering results of the AOS scan at a varying numbers of SAI samples that, together with the fixed baseline of 2.8\text{,}\mathrm{m} lead to particular synthetic apertures of diameters $a$. With $N=50$ samples, we expect (based on the chosen normalized visibility threshold of $\widehat{V}=$98\text{\,}\mathrm{\char 37\relax}, as shown in Fig. 8a and in Fig. 5b) no significant gain in visibility for a larger number of SAI samples. But we are aware of the fact that the visibility which can be practically achieved will be lower than due to the non-uniformity of the forest, as shown in Fig. 8b. Since for the AOS scan no ground truth exists, we cannot determine visibility in a quantitative way. Instead, we compare new sampling results against the brute-force sampled result (using , 50\text{,}\mathrm{m}, $b=$2\text{\,}\mathrm{m}) that we consider an approximation to the theoretical optimum. Visual similarity is determined again with the structural similarity index metric (SSIM) [27].
Fig. 11 plots all SSIM values for 50\text{,}\mathrm{m}, and indicates that even an aperture diameter of $a=$14\text{\,}\mathrm{m} with SAI samples (this corresponds to a normalized visibility threshold of ) would lead to no significant visual reduction compared to an aperture of 50\text{,}\mathrm{m}$$ with samples.
These results imply that the synthetic aperture for this scene can be significantly smaller (by a factor of in area and by in number of samples).
IV Conclusion and Future Work
In this article we have presented three findings that lead to a basic understanding on how to design synthetic aperture sampling patterns: (1) There exists a limit to the baseline (distance) of sample positions. The minimal (optimal) baseline is the one that results in a disparity equal to the projected occluder size. Larger baselines do not improve visibility. (2) There exists a limit to achievable visibility improvement that depends on the density of the occluder volume. The maximum visibility gain is achieved at a density of . (3) The normalized visibility gain is independent of the occlusion density. It is directly correlated to a fixed number of samples. For AOS and the evaluated datasets, these findings result in much smaller synthetic aperture areas with significantly less samples.
The key idea of our approach is to consider the sampling process of SAI not in the common context of optics, where a wide aperture leads to a shallow depth of field and large point-spread of out-of-focus occluders. Instead, we understand SAI sampling and reconstruction as a variation of signal averaging that is explainable by statistical principles.
In future, we want to explore better models that consider the non-uniformity of known occlusion volumes, such as forests where (from tree crowns to the ground) occluder size increases while density decreases. Furthermore, we want to investigate the potential non-uniform coded sampling patterns, and techniques that enable a measurement or a better approximation of local occluder sizes and densities. Finally, we are interested in evaluating our model with different forest types (such as conifer or rain forest) and datasets. Thereby, measurements and records from forestry research might guide the parametrization of occluder sizes and densities.
[Derivation of the Visibility Probability] For a given focal plane, synthetic aperture rendering integrates disparity-shifted SAI samples :
[TABLE]
We define visibility based on the mean squared error (MSE) between the synthetic aperture rendering result and an image of the non-occluded target on the focal plane:
[TABLE]
where indicates the expectation (i.e., mean). For simplicity, we assume in the following, hence
[TABLE]
Each SAI sample has equal mean and variance (i.e., and ), behaves like a Bernoulli trial, and is a mixture of Bernoulli distributions:
[TABLE]
[TABLE]
[TABLE]
The probability that the same occluder is not projected to the same pixel in two SAI samples ( and ) is , where is the total disparity of the occluder in and .
Because of the conditional dependence between and (being shifted instances of the same image), can be expressed as
[TABLE]
[TABLE]
Thus, if (when ) the covariance . Putting (13) into (10) and simplifying yields:
[TABLE]
Note, that only the co-variance depends on and . Putting (14) and (9) into (8) yield:
[TABLE]
Note, that (15) considers 1D disparities, only. But it can be easily extended for 2D disparities:
[TABLE]
Thus, leads to (2).
Acknowledgment
The authors would like to thank Bettina Gruen of Johannes Kepler University Linz for her help on deriving (2). This research was funded by the Austrian Science Fund (FWF) under grant number P 32185-NBL.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Moreira, P. Prats-Iraola, M. Younis, G. Krieger, I. Hajnsek, and K. P. Papathanassiou, “A tutorial on synthetic aperture radar,” IEEE Geoscience and Remote Sensing Magazine , vol. 1, no. 1, pp. 6–43, March 2013.
- 2[2] C. J. Li and H. Ling, “Synthetic aperture radar imaging using a small consumer drone,” in 2015 IEEE International Symposium on Antennas and Propagation USNC/URSI National Radio Science Meeting , July 2015, pp. 685–686.
- 3[3] P. A. Rosen, S. Hensley, I. R. Joughin, F. K. Li, S. N. Madsen, E. Rodriguez, and R. M. Goldstein, “Synthetic aperture radar interferometry,” Proceedings of the IEEE , vol. 88, no. 3, pp. 333–382, March 2000.
- 4[4] R. Levanda and A. Leshem, “Synthetic aperture radio telescopes,” Signal Processing Magazine, IEEE , vol. 27, pp. 14 – 29, 02 2010.
- 5[5] D. Dravins, T. Lagadec, and P. D. Nuñez, “Optical aperture synthesis with electronically connected telescopes.” Nature communications , vol. 6, p. 6852, Apr 2015.
- 6[6] T. S. Ralston, D. L. Marks, P. S. Carney, and S. A. Boppart, “Interferometric synthetic aperture microscopy (ISAM),” Nature Physics , pp. 965–1004, 2007.
- 7[7] M. P. Hayes and P. T. Gough, “Synthetic aperture sonar: a review of current status,” IEEE Journal of Oceanic Engineering , vol. 34, no. 3, pp. 207–224, 2009.
- 8[8] R. E. Hansen, “Introduction to synthetic aperture sonar,” in Sonar Systems Edited . In Tech Published, 2011. [Online]. Available: http://www.intechopen.com/books/sonar-systems/introduction-to-synthetic-aperture-sonar
