A theoretical guarantee for data completion via geometric separation
Emily J. King, James M. Murphy

TL;DR
This paper provides a theoretical guarantee for data completion methods that leverage geometric separation, especially when data can be modeled as a mixture of different structural components.
Contribution
It introduces a unified theoretical framework that guarantees successful data recovery using combined separation and completion techniques for structured, incomplete data.
Findings
Theoretical proof of success for combined separation and completion methods.
Generalization of previous proofs to more complex data structures.
Applicable to data modeled as superpositions of multiple structures.
Abstract
Scientific and commercial data is often incomplete. Recovery of the missing information is an important pre-processing step in data analysis. Real-world data can in many cases be represented as a superposition of two or more different types of structures. For example, images may often be decomposed into texture and cartoon-like components. When incomplete data comes from a distribution well-represented as a mixture of different structures, a sparsity-based method combining concepts from data completion and data separation can successfully recover the missing data. This short note presents a theoretical guarantee for success of the combined separation and completion approach which generalizes proofs from the distinct problems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A theoretical guarantee for data completion via geometric separation.
Emily J. King
James M. Murphy
Center for Industrial Mathematics, University of Bremen, Bremen, Germany
Department of Mathematics, Johns Hopkins University, Baltimore, USA
Abstract
Scientific and commercial data is often incomplete. Recovery of the missing information is an important pre-processing step in data analysis. Real-world data can in many cases be represented as a superposition of two or more different types of structures. For example, images may often be decomposed into texture and cartoon-like components. When incomplete data comes from a distribution well-represented as a mixture of different structures, a sparsity-based method combining concepts from data completion and data separation can successfully recover the missing data. This short note presents a theoretical guarantee for success of the combined separation and completion approach which generalizes proofs from the distinct problems.
For a Hilbert space and coefficient index set , a Parseval frame (or more precisely, the analysis operator of a Parseval frame) is such that for all . Data in a variety of applications of interest admit sparse representations with respect to a Parseval frame. It is known, for example, that if is a shearlet frame for and is a cartoon-like image, then there is some relatively “small” subset such that captures “most” of the important information of , where is the function that is on and [math] off of [1]. This sparsity can be used to perform image and data processing tasks including geometric separation of data into components with different fundamental structures [2, 3] and image inpainting or, more broadly, data recovery [2, 4, 5]. These two tasks can be combined. For example, by noting that natural images are usually a superposition of a cartoon-like part which is sparsely represented by a shearlet or curvelet frame and a textured part which is sparsely represented by a discrete cosine frame, one can use both frames simultaneously to perform empirically successful data completion [6, 7]. What follows is a generalization to the combined problem of the theoretical guarantees for data recovery [2, 4] and for data separation [2, 3].
Theorem 1.1**.**
Let be an orthogonal decomposition of a Hilbert space . Let and be (the analysis operators of) Parseval frames for index sets , , respectively. Suppose may be decomposed as and for there exist some , such that the following -sparsity condition is satisfied:
[TABLE]
Let be orthogonal projections onto , respectively, and let the joint concentration (with respect to and ) be defined as
[TABLE]
Suppose solves the constrained optimization problem
[TABLE]
Then
[TABLE]
In Theorem 1.1, represents the part of the data which is known, while represents the missing part which is to be reconstructed. In the context of image inpainting, represents the known image pixels, while represents the unknown pixels to be recovered. and yield sparse representations of different types of structural information; for example, could be a shearlet frame which sparsely represents cartoon-like images and could sparsely represent texture. If is a superposition of structures which are sparsely represented by the , then relatively small index sets can be chosen yielding both a small and a small .
Proof.
We perform an initial estimate using frame theory and basic analysis:
[TABLE]
Noting that by the constraint in (1), , and hence:
[TABLE]
After some algebraic manipulation and employing -sparsity, we see:
[TABLE]
So, it suffices to bound these terms involving . We note that is a minimizer of (1). Thus,
[TABLE]
Substituting (3) into (2), we obtain
[TABLE]
Thus,
[TABLE]
∎
Further theoretical results concerning the joint concentration and applications are the subjects of a forthcoming paper [8].
Acknowledgement
The authors met and began work on this project during the 2016 Hausdorff Trimester Program “Mathematics of Signal Processing” and are thus grateful to the organizers and the Hausdorff Institute for Mathematics in Bonn, Germany.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] K. Guo and D. Labate. Optimally sparse multidimensional representation using shearlets. SIAM J. Math. Anal , 39(1):298–318, 2007.
- 2[2] E.J. King, G. Kutyniok, and X. Zhuang. Analysis of data separation and recovery problems using clustered sparsity. SPIE Proceedings: Wavelets and Sparsity XIV , 8138, 2011.
- 3[3] D.L. Donoho and G. Kutyniok. Microlocal analysis of the geometric separation problem. Comm. Pure Appl. Math. , 66:1–47, 2013.
- 4[4] E.J. King, G. Kutyniok, and X. Zhuang. Analysis of inpainting via clustered sparsity and microlocal analysis. J. Math. Imaging Vision , 48(2):205–34, 2014.
- 5[5] E.J. King, G. Kutyniok, and W.-Q. Lim. Image inpainting: theoretical analysis and comparison of algorithms. SPIE Proceedings, Wavelets and Sparsity XV , 8858, 2013.
- 6[6] M. Elad, J.-L Starck, P. Querre, and D.L. Donoho. Simultaneous cartoon and texture image inpainting using morphological component analysis (MCA). Appl. Comput. Harmon. Anal. , 19(3):340–358, 2005.
- 7[7] A. Stück. Shearlet-based image inpainting. Master’s thesis, Universität Bremen, Bremen, Germany, 2015.
- 8[8] E.J. King, J. M. Murphy, and A. Stück. Data completion via analysis-side geometric separation. in preparation, 2017.
