Generalized orthogonal matching pursuit for multiple measurements - A structural approach
Florian Bo{\ss}mann

TL;DR
This paper introduces a generalized orthogonal matching pursuit algorithm tailored for multiple measurement vectors, emphasizing the preservation of data structures and demonstrating improved approximation in signal processing applications.
Contribution
It presents a novel structural approach to MMV sparse approximation, focusing on data structure preservation rather than just approximation accuracy.
Findings
Numerical comparisons show improved performance over existing methods.
The approach effectively preserves complex data structures.
Applications demonstrate practical benefits in signal processing tasks.
Abstract
Sparse data approximation has become a popular research topic in signal processing. However, in most cases only a single measurement vector (SMV) is considered. In applications, the multiple measurement vector (MMV) case is more usual, i.e., the sparse approximation problem has to be solved for several data vectors coming from closely related measurements. Thus, there is an unknown inter-vector correlation between the data vectors. Using SMV methods typically does not return the best approximation result as the correlation is ignored. In the past few years several algorithms for the MMV case have been designed to overcome this problem. Most of these techniques focus on the approximation quality while quite strong assumptions to the type of inter-vector correlation are made. While we still want to find a sparse approximation, our focus lies on preserving (possibly complex) structures in…
| const. |
|---|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Structural Health Monitoring Techniques · Microwave Imaging and Scattering Analysis
Generalized orthogonal matching pursuit for multiple measurements - A structural approach
Florian Boßmann1 1University of Passau, Mathmatics / Digital Image Processing, [email protected]
Abstract
Sparse data approximation has become a popular research topic in signal processing. However, in most cases only a single measurement vector (SMV) is considered. In applications, the multiple measurement vector (MMV) case is more usual, i.e., the sparse approximation problem has to be solved for several data vectors coming from closely related measurements. Thus, there is an unknown inter-vector correlation between the data vectors. Using SMV methods typically does not return the best approximation result as the correlation is ignored. In the past few years several algorithms for the MMV case have been designed to overcome this problem. Most of these techniques focus on the approximation quality while quite strong assumptions to the type of inter-vector correlation are made.
While we still want to find a sparse approximation, our focus lies on preserving (possibly complex) structures in the data. Structural knowledge is of interest in many applications. It can give information about e.g., type, form, number or size of objects of interest. This may even be more useful than information given by the non-zero amplitudes itself. Moreover, it allows efficient post processing of the data. We numerically compare our new approach with other techniques and demonstrate its benefits in two applications.
Index Terms:
sparse approximation, multiple measurements, greedy algorithm, inter-signal correlation
I Introduction
Sparse approximations of given data are of great interest in many different applications. They are used in image processing for e.g., denoising [2, 1], compression [3] or restoration [4]. Sparsity assumptions appear in face and speech recognition [5, 6], magnetic resonance imaging (MRI) and computer tomography (CT) [7, 8], as well as in non-destructive testing [10, 9] and seismic data processing [13, 14, 11, 12]. A detailed overview can also be found in [16, 15] and the references therein.
The sparse approximation itself can be stated as follows: Given a dictionary matrix and a measurement vector , solve
[TABLE]
for a given . Here is the -quasi-norm, i.e., . The vector is said to be -sparse, if for . The matrix is constructed using a basis, frame or dictionary in which the given data is assumed to be sparse. Typical examples are Fourier [17] or Wavelet bases [18] as well as Curvelet [19] or Shearlet frames [20]. Dictionaries can be designed according to the underlying application, as e.g., the Gabor impulse in ultrasonic testing [21, 9].
The exact solution of (1) can in general only be found combinatorially, i.e., by considering all possible supports of . Hence, finding the exact solution becomes NP-hard. There are two main strategies to find at least an approximate solution of (1): The first strategy is, replacing the -quasi-norm by the -norm what makes the problem convex. This approach is known as convex relaxation or basis pursuit [22, 23]. Greedy algorithms are another strategy to solve (1) approximatively. Those methods iteratively built up a global approximation by solving local subproblems [25]. Matching Pursuit (MP) and Orthogonal Matching Pursuit (OMP) [23] may be the most known algorithms in this context. In recent years, more advanced algorithms have been developed such as Stagewise OMP [26], Compressive Sampling MP [27, 28] and regularized OMP [27, 29]. An overview can be found in [30].
Eq. (1) is known as single measurement vector problem (SMV). It has been studied extensively over the last few years. However, there is an extension known as multiple measurement vector problem (MMV). Instead of only having one data vector , several measurements are given. The problem is stated similar as
[TABLE]
with and , i.e., each vector is sparse. In fact this problem seems to be more common in many applications. It appears e.g., in non-destructive testing [10, 9], in seismic data [13, 14, 11, 12] or Magnetoencephalography (MEG) [24]. The MMV formulation can be used, whenever several measurements of the same or similar objects were made. One straight-forward approach to solve (2) is, to split it into SMV problems and solve these independently using the methods mentioned above. However, our intuition tells us, since the measurements were made in a quite similar set-up, also the obtained data vectors should be correlated somehow. Simply solving SMV problems ignores this correlation and thus the solution quality might suffer.
Recently, new methods have been developed that consider inter-signal correlation. In [24] an extension for MP and the FOCal Underdetermined System Solver (FOCUSS) are presented. Bayesian methods are considered in [31, 32]. In [33, 34] the authors introduce Greedy pursuit and convex relaxation for the MMV problem. Theoretical results have been shown e.g., in [35]. All these methods force a common support in the reconstructed solution, i.e., the reconstructed matrix has only few nonzero rows or, in other words, the columns of have (nearly) the same support. In [36] two joint sparsity models (JSM) for compressed sensing are introduced. JSM-1 considers solutions where all columns can be written as the sum of a common sparse component that is equal for each column and another unique sparse vector . JSM-2 is equal to the support constraint considered above. Another approach is presented in [38, 37] where correlated measurements are assumed to have sparse approximations that are close in the euclidean distance. This idea is related to dynamic compressed sensing [40, 39]. Here neighboring columns are assumed to have similar support. In both cases, the support is allowed to change slowly over different data vectors. In most cases the used methods penalize non-smooth rows in .
However, all methods have quite restricting support assumptions and hence cannot reconstruct simple geometries in the solution. As an example consider to be the identity matrix. There is no common support between all columns and the rows are non-smooth. Nevertheless, the matrix is still clearly structured. The linear structure can be described by a shift of index per column. In the next section we introduce a generalized version of orthogonal matching pursuit for multiple measurements (GM-OMP) that takes complex structures in the data into account. Numerical evidence for the proposed method are shown in the third section of this work.
II The Algorithm
In this section we first introduce OMP and discuss its generalization to the MMV problem. GM-OMP increases the support of the solution in each iteration by adding an index set where is the set of feasible selections. The parametrization and selection of is the main idea of GM-OMP and is discussed in the second subsection. After first theoretical results are shown in the third subsection, we present an a-posteriori denoising technique that is based on the structural component of the reconstructed solution.
II-A OMP and GM-OMP
Orthogonal matching pursuit is a greedy algorithm that seeks to find a sparse solution of (1). For simplicity we assume that the columns of are normalized. Then the iterative scheme of OMP can be summarized as follows:
Set the residual and the support . 2. 2.
Calculate and update . 3. 3.
Solve and set . 4. 4.
Iterate 2-3 until a stopping criterion holds.
Here is the transposed conjugate complex matrix. Hence, the algorithm chooses the column of that correlates most with the residual and adds its index to the support set in step 2. Step 3 calculates the best approximation according to the selected support. The algorithm may e.g., be stopped after iterations (the solution is -sparse then), or when the residuum drops below a threshold, i.e., .
Now, let us consider the MMV problem shown in (2). The idea of GM-OMP is surprisingly simple. We only adapt the second step of OMP, while all other steps stay the same. Therefore, note that OMP chooses one index and adds it to the support set . Since we are now dealing with multiple measurements, GM-OMP is allowed to add not only one index , but multiple indices to the support (e.g., one index per column of ). Let us denote the set of all indices added by . This index set should be chosen from a set of feasible selections where denotes the power set. The second step of GM-OMP now reads as follows:
Choose and update .
The complete scheme of GM-OMP is shown in Alg. 1 where the maximum number of iterations and the minimal residuum norm is included as stopping criterion. Of course we need to define the feasible set and find a suitable choice . This problem will be discussed in the next subsection. Let us first consider three examples to clarify the principle of GM-OMP and the feasible set .
For our first example, consider the set
[TABLE]
i.e., contains all index sets that can be associated with the support of matrices having at most one non-zero element per column. Here is of same size as . Choose such that
[TABLE]
where is the -th column of the residual matrix . This way, GM-OMP is equivalent to OMP parallelly applied to each column of . In our next example define
[TABLE]
as the set of all 1-sparse supports. It is easy to see that the best choice is the index set containing only the position of the maximum absolute value of . Now, GM-OMP is identical to using OMP on the vectorized formulation of (2), i.e., rewrite as column vectors and becomes a block diagonal matrix. As our last example, consider
[TABLE]
containing all support sets with constant row index. A possible choice is given by
[TABLE]
and . Here denotes the -norm of the -th row of the matrix. For GM-OMP becomes the simultaneous OMP (S-OMP) introduced in [33], the case is discussed in [24].
For the three demonstrated choices of , GM-OMP transforms into well known algorithms for the MMV problem. However, neither nor contain structured sets while is bounded to row-sparsity of (see discussion in the introduction). Thus, we introduce a more general choice for in the next subsection. Here, can be adapted by parameters. We will see that the three examples form the extreme cases of the parameter choice.
II-B Feasible set and selection
Alg. 1 demonstrates the generalized scheme of OMP for multiple measurements. The set represents the feasible sparsity patterns that can be chosen per iteration. However, since is a subset of a power set, it can be of exponential size. Thus, it is not sufficient to leave it to the user as input data. Instead we will parametrize and define a selection rule for based on the parameters. This way, the user only has to choose parameters that describe sparsity patterns suitable for his application.
Remark 1
Following, we describe a parametrization idea that, in the authors opinion, can be used in many applications. However, the reader might choose a different description of and an according selection rule that is more suitable for the particular problem.
Note that is a set of two-dimensional elements (row and column indices). Our idea is, to use exactly two parameters to determine . It is clear that we cannot cover all sets with two parameters, since the number of possible choices for grows exponential. Thus, we need a parametrization that generates suitable sets for applications. Analogous to the given examples (3)-(5), we identify an element by its pattern matrix where holds. Since we assume the columns of to be sparse, it is reasonable to permit only matrices with (at most) 1-sparse columns, i.e., in each iteration of GM-OMP the support of should at most grow by one index per column. Given such a matrix we interpret its sparsity pattern as samples of a function, mapping the column indices to corresponding row indices (Fig. 1). Due to the 1-sparse columns of this mapping is unique but not necessarily defined for all rows (there may be zero columns in ). Having this in mind, we postulate the sparsity pattern of to hold two conditions:
- •
The domain of the sparsity pattern should be connected.
- •
The sparsity pattern should be (Lipschitz-)continuous.
Fig. 1 shows a sparsity pattern where both conditions do not hold (see the dotted lines). Next, we formulate our parametrization of that uses two parameters to ensure the above stated conditions. Therefore, we introduce the parameter and measurement space.
Let and be metric spaces. For let and be given. We call and the parameter space and measurement space respectively. The elements are parameters (of the dictionary) and is a measurement (setup).
Remark 2
At first glance it seems quite restricting to require the existence of such spaces and elements. However, they come quite naturally. For example consider being the Fourier matrix. Then is the frequency of the -th column of . If we use a Wavelet dictionary , the parameters contain the shift and scaling of each column. For convolution matrices each is the shift of the -th column. On the other hand, consider the measurement data was obtained using several sensors at different positions, each column of corresponding to one sensor. Thus we can set to the position of the -th sensor. While these parameters give the following formulas a more reasonable interpretation, one can surely just use and .
Now we can formulate the above stated conditions on . Using the points as vertices and defining edges using the metric defined on , we can state the connected sparsity pattern condition as, the graph
[TABLE]
Lipschitz continuity of the pattern is ensured if
[TABLE]
with the metric on . We define by
[TABLE]
By (8) we also ensure 1-sparse columns of () if . For we use the convention . We obtain the relations , and , i.e., our parametrization covers the shown examples.
Given the set we need to choose . Like in OMP, we search for the support that maximizes the correlation between dictionary and residuum, i.e., we would like to solve
[TABLE]
for some (compare Eq. 6). Intuitively, we want all values to be of same order of magnitude, i.e., (or ) may be a good choice. Unfortunately, we can state the following theorem which is proven in the next subsection:
Theorem 1
For and arbitrary problem (10) is NP-hard.
Hence, we use a greedy algorithm to approximatively solve (10). Indeed, the algorithm returns the exact solution of (10) with . Starting with the correlation matrix we iteratively built a matrix such that . Beginning with we add the position of the maximum value of to the support, i.e., we calculate and update . To ensure that (7) is not penalized, we restrict ourself to indices where is the set of all indices for which (7) holds. Afterwards, the chosen element and all elements in that violate (8) are set to zero, This way, it is guaranteed that (8) is fulfilled. The scheme is shown in Alg. 2.
For implementation we recommend to replace by using a reasonable threshold (see also the discussion in the theory part). Furthermore, if from start on has zero entries (or elements below the threshold), they will never be chosen by the algorithm. This assures that the support is not artificially enlarged, i.e., always holds.
It is easy to see that Alg. 2 returns the exact solution for and (i.e,, for and ). Setting (i.e., ) Alg. 2 solves (6) for .
II-C Theoretical results
We first prove Theorem 1 that is the motivation for Alg. 2.
Proof:
We give a polynomial-time reduction of the coloring problem: Given a graph and a number of colors , assign a color to each vertex such that for each edge the vertices have a different color.
Let have vertices, each vertex with at most edges. For the reduction we need each vertex to have the same amount of edges. Thus, we add edges to each vertex until it has exactly edges. Here denotes an edge with no second vertex (or an ”imaginary” vertex that will not be considered for the coloring). Let the total amount of edges be given by where . Now define as the -th unit vector, i.e., for . Furthermore, define , by
[TABLE]
Note that is exactly -sparse. For the -th unit vector set where is the Kronecker product. (For simplicity we keep the two-dimensional indexing of instead of reordering it into a single index.) We obtain
[TABLE]
i.e., for different colors or vertices which are not connected we obtain . Now choose and , then (8) only holds for pairs with either different colors or not connected vertices . Set as identity matrix and to
[TABLE]
Then, there is a feasible coloring of if and only if for and solution of (10). By construction of the value can only be achieved with and for all . It follows that and thus the -th vertex is assigned with the -th color. Since each vertex is colored. On the other hand, each feasible coloring defines an index set with what is maximal since for all . ∎
Following, we prove reconstruction results for GM-OMP given exact data or data obtained with noised sparsity pattern. Therefore, we need results shown in [23]. We briefly summarize Theorem 3.1, Corollary 3.2 and Theorem 3.5 of this work for the reader: Given the Babel function
[TABLE]
OMP recovers the -sparse solution of (1) in the noiseless case () whenever . There exists a weak version of OMP that chooses an index in each iteration such that with a weakness constant holds. Weak OMP recovers an -sparse solution whenever .
Let be the residual matrix after iterations and the greedy choice returned by Alg. 2. We calculate the weakness parameter of GM-OMP by
[TABLE]
Note that can be very small depending on the range of amplitudes. Exemplary, consider
[TABLE]
with , . The greedy choice would either select the first or the second row. In both cases we obtain and hence . However, will be close to for feasible sets where for holds. Before we state our first theorem, we need the following definition.
Definition 1
For sets we say that the sets are -intersecting if there exists , , such that (7) and (8) hold.
Let be the unique -column-sparse solution of and with . We state
Theorem 2
GM-OMP recovers and all elements of the sparsity pattern in iterations whenever are not -intersecting and .
Proof:
The reconstruction of the support of follows directly by the exactness of weak OMP. Since the sets are not -intersecting the indices selected by Alg. 2 belong to the same set . Suppose an index in has not been selected. Then the matrix from Alg. 2 is not zero and Alg. 2 will not stop iterating. Thus, the complete set is recovered. ∎
Theorem 2 has two disadvantages. First, the conditions depend on the solution and hence cannot be checked beforehand. Second, as we have seen can be small and thus the conditions are quite restricting. To overcome the last problem, we define . Now, we can use an adaptive threshold in Alg. 2 by selecting only indices such that holds, i.e., in the -th iteration only indices with
[TABLE]
will be selected. We can state
Theorem 3
Using this threshold strategy in Alg. 2, GM-OMP recovers the solution in iterations where whenever . Furthermore, let be the feasible sets selected in each iteration and . If are not -intersecting, then there exists a partition of such that .
Proof:
Note that the first index selected in Alg. 2 is the maximum of and thus equivalent to an OMP choice, i.e., Alg. 2 selects at least one element per iteration. As this choice is part of . The algorithm terminates after at most iterations what is the number of non-zero entries in . If the sets are not -intersecting the indices selected by Alg. 2 belong to the same set . Since we use a threshold, we can no longer follow that is found in one iteration. However, because is recovered completely after iterations, the existence of such a partition follows. ∎
Now, let us discuss the reconstruction qualities of GM-OMP due to noised sparsity patterns. Instead of with , the noised data with is given. Here is the noised version of the pattern . We consider two different kinds of noise and analyze in which case GM-OMP is able to reconstruct the sets , . Given we can try to recover using a post processing strategy that will be introduced in the next subsection.
Theorem 4
Let be corrupted by uniform noise :
[TABLE]
Set . If (8) holds for with parameter , then (8) holds for with Lippschitz parameter . If are not -intersecting, then are not -intersecting.
Proof:
For and
[TABLE]
holds. Equivalently, , , , with using that are not -intersecting and the inverse triangle inequality:
[TABLE]
∎
The noise assumed in Theorem 4 typically appears in applications where measurements may be corrupted due to shaking apertures. If an upper bound is known, the parameters of GM-OMP can be adapted.
Theorem 5
Let be corrupted by Bernoulli distributed noise , i.e.,
[TABLE]
where is the probability that an index is not in the corrupted set . Let (7) hold for with parameter and be the probability that (7) holds for with parameter , . Then
[TABLE]
Proof:
Note that (7) gives a connected graph. We search for a lower bound of the probability, that the graph is still connected when we remove points with probability but add edges with . For a lower bound it is sufficient to consider the worst case, i.e., . The graph becomes a line with at most points. The graph is connected whenever there is a connection from to . For the new parameter we obtain the edges with . It follows that the graph will no longer be connected whenever consecutive points vanish.
This problem is an application of success runs in Bernoulli trails [41]. In particular, is the probability that the longest success run is shorter than . This probability has an exact but rather complicated analytic expression. The simple lower bound is shown in [42]. Other bounds and the exact analytic form can also be found in [41]. ∎
The noise assumed in Theorem 5 appears in applications e.g., whenever a single measurement is lost or a sensor fails. The parameter can be adapted according to Theorem 5.
Theorem 2 gives two conditions for exact recovery. The condition ensures recovery of the right support set. It was deduced from OMP and was shown to be strict [23]. The -separation condition guarantees the separation of the support into structures. This condition is not strict. The feasible sets may still be reconstructed without having -separation depending on the amplitudes , . Theorem 3 gives reconstruction results if one or both of these conditions were penalized.
In Theorem 4 and 5 we discussed a noised sparsity pattern and how the parameters should be adapted. In the next section, we present a post processing step to reconstruct the original pattern given. Beforehand, we give a statement on two other cases of noisy data. First, consider the case where does not have an exact sparse representation with , but instead we search for a sparse approximation as in problem 2. This problem occurs e.g., when the data is noised. We can easily obtain similar results to Theorem 2 and 3 by replacing the exact recovery condition of (weak) OMP with the optimal -term approximation conditions given in [23].
As another scenario, consider a sampling that is sparse in some dictionary , i.e., there exists a sparse solution of . Now, instead of we have only given the dictionary . Exemplary, let be sparse in Fourier domain but not necessarily containing frequencies given by the discrete Fourier transform. Given the Fourier transform of is it possible to reconstruct the exact frequencies, i.e., given an approximation in is it possible to reconstruct the exact dictionary ? This problem was analyzed under the keyword of super-resolution in [43] for the SMV problem. Only recently, the MMV problem with common support constraint was discussed in [44]. In both cases, an exact recovery is possible whenever the non-zero entries are separated by at least a distance depending on the super-resolution factor. An interesting question we consider for future work, is the connection between this separation and patterns that are not -intersecting. This connection may be used to design a super-resolution method for generalized patterns.
II-D Post processing
So far, we presented the GM-OMP algorithm and proved basic theoretical properties. Before we demonstrate the technique on numerical examples, we discuss how to use GM-OMP for powerful post processing of the data. Consider we reconstructed a solution and its support , where is assumed to be a corrupted sparsity pattern.
While it is a common idea to denoise corrupted amplitude values of , the sparsity pattern has been of minor interest so far. Even though the pattern itself might be noised. Exemplary, in non-destructive testing external forces during the measurement can corrupt the probes positions what influences the geometry and thus the sparsity pattern [10, 9]. As another example, consider being the Fourier matrix. It only contains a fixed amount of Fourier frequencies. However, there are signals that are sparse in Fourier domain but only consist of frequencies not covered by the matrix. Then the reconstructed sparse approximation most likely rounds these frequencies upto the closest frequency of , what can be interpreted as a corrupted sparsity pattern of . As last example, simply assume a failed measurement, i.e., a zero column in . Surely the corresponding column in will also be zero. To reconstruct the original signal, we can apply inpainting ideas on the sparsity pattern.
Remembering Fig. 1, i.e., as a discrete sampling of a function, we can denoise the sparsity pattern for by solving the problems
[TABLE]
with a weight . Here, is a suitable function space (e.g., polynomials, splines, ). Afterwards set the denoised pattern to
[TABLE]
where is rounded to the closest of the elements . For small the support may be large and hence can increase. This gives an inpainting strategy to reconstruct missing structure elements.
A similar approach can be applied to denoise the amplitudes of . Given and assume that for all we solve
[TABLE]
on a function space and update , for all .
III Numerics
We demonstrate the advantages of our proposed algorithm in three examples. First, we compare the technique with other sparse approximation methods for the MMV problem. Afterwards we discuss two practical examples and illustrate the information given by the sparsity pattern.
III-A Numerical comparison
We compare our method to three other techniques: OMP applied to each column separately, S-OMP [33] and MSBL, a technique presented in [45] based on sparse bayesian learning. Let be a convolution matrix of a Gauss kernel with standard deviation . We define the matrix by
[TABLE]
for , i.e., each column of is -sparse. The matrix is clearly structured, it consists of one line with a slope of . For this becomes the identity matrix; gives a matrix with one non-zero row (which is the pattern that S-OMP and the MSBL assume). For we calculate and use all methods to reconstruct . Therefore, we choose , , and (which corresponds to a maximal slope of ). We use iteration since contains exactly one structure. In Fig. 2 the reconstruction error and the number of non-zero elements in the solution are shown for all algorithms. Nearly all methods are able to find a good approximation. Only S-OMP forces row sparsity of and thus produces stare casing effects which corrupt the solution. Both the MSBL and S-OMP assume that is row-sparse, i.e., there are only a few non-zero rows. Once a row contains a non-zero element, the entire row is considered to be non-zero. This leads to an extreme overestimation of the support while OMP and GM-OMP can find the exact number of non-zero entries.
Next, we demonstrate the power of the proposed denoising step. Therefore consider and its noised versions with
[TABLE]
[TABLE]
where is uniform noise and , is Bernoulli distributed. Given or we want to reconstruct . The mean squared error over runs for both cases is plotted in Fig. 3. The noise level gives the values of respectively .
For we adapted the parameters according to Theorem 4 and set . In the second case, we choose for the reconstruction of . Using Theorem 5 this gives us a reconstruction probability of more than even for . In both cases we solve optimization problem (13) with and afterwards to denoise the pattern.
Note that the row-sparsity assumption of S-OMP and MSBL hold for . Nevertheless, only GM-OMP is able to find a good approximation in most cases. All other methods only reconstruct the noised matrices . For a high probability of the MSE of GM-OMP increases, i.e., the parameter does no longer compensate the missing data. Interestingly, S-OMP profits from its stare casing effects when the pattern is uniformly noised and hence returns a slightly better solution.
III-B Application 1: Non-destructive testing
As a first practical example we analyze ultrasonic data obtained from non-destructive testing of steel tubes. The original data shown in Fig. 4 was generated by the ”Time-of-Flight Diffraction” (ToFD) method using an Olympus Omniscan iX system with Mhz transducer, mm diameter and angle of incidence. The tested tube was a large diameter pipe with outer diameter mm and mm wall thickness. Each column of the data represents a measured signal at different positions on the tubes surface. The positions were equidistantly set on a straight line with a distance of mm, hence we set . The signals were measured in time with a sampling ratio of s. Four major events can clearly been seen in the data. The topmost one is an ultrasonic impulse that directly travels through the surface from transducer to receiver - the lateral wave. The bottommost is an impulse that was reflected by the back wall - the back wall echo. The two events in between (recognizable as parabolas) indicate defects in the material. We use GM-OMP to recover and denoise these events.
Ultrasonic data is column-wise sparse when is a convolution matrix based on the Gabor impulse ([9, 11])
[TABLE]
Here is the bandwidth factor, is the frequency and is the phase. Thus, is the shift of the -th column in s. For the given data we choose , and (see [9, 11] for details about the parameter choice). We define our feasible set using such that only for . Note that (8) compares distance in time (s) with a distance in space (mm). The ultrasonic speed in steel is about mm/s, hence we chose what gives stability for noisy data.
After applying iterations of GM-OMP to the data, we use the denoising strategies discussed in the last section and set . Since structures in pipe testing often behave linearly or quadratically, we use , i.e., polynomials upto degree four multiplied by a characteristic function. The characteristic function is needed for the support constraint in (13). Moreover we choose as the space of all constant functions, i.e., the amplitudes of each structure are set to its mean value. This value can give a first hint about the underlying material in the pipe.
In Fig. 5 the four sparsity patterns found by GM-OMP are shown in data domain (i.e., multiplied by ). As we see, the algorithm is able to reconstruct all four structures of the original data. Due to the structural denoising, the pattern looks more smooth and effects caused by shaking probes are no longer visible. In Tab. I the polynomial coefficients of each pattern are shown. As suppoesd, the lateral wave and back wall echo are mostly linear while the defects were approximated by a quadratic polynomial.
III-C Application 2: meteorologic data
In this example we use GM-OMP without post-processing on meteorologic data provided by Deutscher Wetterdienst (DWD) [46]. We analyze the hourly precipitation in Germany from 25th to 28th of November 2008 where we use data of stations shown in Fig. 6. Stations that were moved during this time or had too many missing values were neglected. In Fig. 7 the overall precipitation and data of two stations is plotted exemplary where [math] hour refers to Nov. 25th 2008, . We use a dictionary based on centered cardinal B-splines
[TABLE]
We use the normalized versions of all B-splines with , i.e., contains all shifts of , . We have chosen a time period with low precipitation and thus the data can be sparsely approximated using splines. Note that a B-spline of order has a support of length . Thus, the order directly correlates to the duration of the precipitation. Since a single precipitation (e.g., rain) will be registered at several stations, we have an underlying structure in the data.
Choose to be the position coordinates of the -th station and as the shift and order of the corresponding spline. We use as the geodetic distance and set km. Let and , i.e., neither the duration nor the time of occurrence should change more than hours per km. We perform iterations of GM-OMP.
Fig. 8 shows the time of occurrence of the largest precipitation event. i.e., the structure that includes the most stations (here ). The mean duration is h (mean B-spline order) and one can clearly recognize the event moving from north to south caused e.g., by wind. In Fig. 9 we reconstructed the overall precipitation (see Fig. 7) using only structures. In the left figure we choose the first structure, i.e., those with the strongest precipitation; for the right figure the largest events were used. While the strongest events contain the precipitation peaks, the largest events can better reconstruct the overall structure from Fig. 9.
IV Conclusion
We presented a generalized orthogonal matching pursuit for multiple measurements. The algorithm is able to recognize and reconstruct more general sparsity pattern in the solution as other algorithms for multiple measurements. Moreover, GM-OMP allows efficient post processing for each pattern. These patterns can provide crucial information in application which was exemplary demonstrated in two practical examples. Two parameters allow an adaption of the feasible patterns to the application and make the algorithm more flexible. The advantages of GM-OMP were shown in comparison to other techniques and confirmed by first theoretical results.
Acknowledgements
The author thanks Mannesmann Salzgitter GmbH for providing the ultrasonic data used in this paper. This work is supported by BMBF joined research project ZeMat (grant number: 05M13MGA).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] K. Dabov, A. Foi, V. Katkovnik, K. Egiazarian, ”Image denoising by sparse 3-D transform-domain collaborative filtering”, IEEE Trans. Image Processing, vol. 16(8), pp. 2080-2095, 2007.
- 2[2] M. Elad, M. Aharon, ”Image denoising via sparse and redundant representations over learned dictionaries”, IEEE Trans. Image Processing, vol. 15(12), pp. 3736-3745, 2006.
- 3[3] E. Le Pennec, S. Mallat, ”Sparse geometric image representations with bandelets”, IEEE Trans. Image Processing, vol. 14(4), pp. 423-438, 2005.
- 4[4] J. Mairal, M. Elad, G. Sapiro, ”Sparse representation for color image restoration”, IEEE Trans. Image Processing, vol. 17(1), pp. 53-69, 2008.
- 5[5] S. M. Katz, ”Estimation of probabilities from sparse data for the language model component of a speech recognizer”, IEEE Trans. Acoustics, Speech and Signal Processing, vol. 35(3), pp. 400-401, 1987.
- 6[6] J. Wright, A. Y. Yang, A. Ganesh, S. S. Sastry, Y. Ma, ”Robust face recognition via sparse representation”, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 31(2), pp. 210-227, 2009.
- 7[7] M. Lustig, D. Donoho, J. M. Pauly, ”Sparse MRI: The application of compressed sensing for rapid MR imaging”, Magnetic resonance in medicine, vol. 58(6), pp. 1182-1195, 2007.
- 8[8] E. Y. Sidky, X. Pan, ”Image reconstruction in circular cone-beam computed tomography by constrained, total-variation minimization”, Physics in medicine and biology, vol. 53(17), pp. 4777–4807, 2008.
