Phase Retrieval: Uniqueness and Stability

Philipp Grohs; Sarah Koppensteiner; Martin Rathmair

arXiv:1901.07911·math.FA·February 17, 2020·SIAM Rev.

Phase Retrieval: Uniqueness and Stability

Philipp Grohs, Sarah Koppensteiner, Martin Rathmair

PDF

TL;DR

This paper reviews mathematical results on phase retrieval, focusing on the conditions for unique and stable recovery of functions from Fourier magnitude data across various scientific fields.

Contribution

It summarizes recent advances in understanding the uniqueness and stability of phase retrieval problems, integrating results from harmonic analysis, complex analysis, and geometry.

Findings

01

Conditions for uniqueness in phase retrieval

02

Stability properties of phase retrieval solutions

03

Connections to applications in physics and imaging

Abstract

The problem of phase retrieval, i.e., the problem of recovering a function from the magnitudes of its Fourier transform, naturally arises in various fields of physics, such as astronomy, radar, speech recognition, quantum mechanics and, perhaps most prominently, diffraction imaging. The mathematical study of phase retrieval problems possesses a long history with a number of beautiful and deep results drawing from different mathematical fields, such as harmonic analyis, complex analysis, or Riemannian geometry. The present paper aims to present a summary of some of these results with an emphasis on recent activities. In particular we aim to summarize our current understanding of uniqueness and stability properties of phase retrieval problems.

Figures14

Click any figure to enlarge with its caption.

Equations394

T : X \to Y .

T : X \to Y .

A : f \mapsto ∣ T f ∣, f \in X,

A : f \mapsto ∣ T f ∣, f \in X,

A f = A (c f), f \in X, ∣ c ∣ = 1.

A f = A (c f), f \in X, ∣ c ∣ = 1.

d ([f]_{\sim}, [g]_{\sim}) := ∣ c ∣ = 1 in f ∥ f - c g ∥

d ([f]_{\sim}, [g]_{\sim}) := ∣ c ∣ = 1 in f ∥ f - c g ∥

T^{'} f := (f g_{1}, \dots, f g_{m})

T^{'} f := (f g_{1}, \dots, f g_{m})

A_{Φ} f := (∣ ⟨ f, φ_{λ} ⟩ ∣)_{λ \in Λ},

A_{Φ} f := (∣ ⟨ f, φ_{λ} ⟩ ∣)_{λ \in Λ},

A_{Φ} : B /_{\sim} \to R_{+}^{Λ}

A_{Φ} : B /_{\sim} \to R_{+}^{Λ}

∣ ⟨ f \pm h, φ_{λ} ⟩ ∣^{2} = ∣ ⟨ f, φ_{λ} ⟩ ∣^{2} \pm = 0 2 Re (⟨ f, φ_{λ} ⟩ ⟨ h, φ_{λ} ⟩) + ∣ ⟨ h, φ_{λ} ⟩ ∣^{2} \forall λ \in Λ .

∣ ⟨ f \pm h, φ_{λ} ⟩ ∣^{2} = ∣ ⟨ f, φ_{λ} ⟩ ∣^{2} \pm = 0 2 Re (⟨ f, φ_{λ} ⟩ ⟨ h, φ_{λ} ⟩) + ∣ ⟨ h, φ_{λ} ⟩ ∣^{2} \forall λ \in Λ .

h = \frac{c - 1}{1 + c} f \in (span Φ_{S})_{⊥} \cap (span Φ_{Λ ∖ S})_{⊥},

h = \frac{c - 1}{1 + c} f \in (span Φ_{S})_{⊥} \cap (span Φ_{Λ ∖ S})_{⊥},

d (f, h) := ∣ c ∣ = 1 in f ∥ f - c h ∥_{B} .

d (f, h) := ∣ c ∣ = 1 in f ∥ f - c h ∥_{B} .

C_{Φ} f := (⟨ f, φ_{λ} ⟩)_{λ \in Λ}

C_{Φ} f := (⟨ f, φ_{λ} ⟩)_{λ \in Λ}

α d (f, h) \leq ∥ A_{Φ} (f) - A_{Φ} (h) ∥_{\textfrak B} \leq β d (f, h) \forall f, h \in B

α d (f, h) \leq ∥ A_{Φ} (f) - A_{Φ} (h) ∥_{\textfrak B} \leq β d (f, h) \forall f, h \in B

A ∥ f ∥_{B} \leq ∥ C_{Φ} f ∥_{\textfrak B} \leq B ∥ f ∥_{B} \forall f \in B .

A ∥ f ∥_{B} \leq ∥ C_{Φ} f ∥_{\textfrak B} \leq B ∥ f ∥_{B} \forall f \in B .

R C_{Φ} f = f \forall f \in B .

R C_{Φ} f = f \forall f \in B .

max {A_{opt} (Φ_{S}), A_{opt} (Φ_{Λ ∖ S})} \geq σ \forall S \subseteq Λ .

max {A_{opt} (Φ_{S}), A_{opt} (Φ_{Λ ∖ S})} \geq σ \forall S \subseteq Λ .

α_{opt} \leq C σ_{opt} .

α_{opt} \leq C σ_{opt} .

∥ C_{Φ_{S}} f ∥_{\textfrak B} < σ and ∥ C_{Φ_{Λ ∖ S}} h ∥_{\textfrak B} < σ .

∥ C_{Φ_{S}} f ∥_{\textfrak B} < σ and ∥ C_{Φ_{Λ ∖ S}} h ∥_{\textfrak B} < σ .

∥ A_{Φ} (x) - A_{Φ} (y) ∥_{B}

∥ A_{Φ} (x) - A_{Φ} (y) ∥_{B}

\leq 2∥ C_{Φ_{S}} f ∥_{\textfrak B} + 2∥ C_{Φ_{Λ ∖ S}} h ∥_{\textfrak B}

\leq 4 σ,

α_{opt} d (x, y) \leq ∥ A_{Φ} (x) - A_{Φ} (y) ∥_{B} \leq 4 σ .

α_{opt} d (x, y) \leq ∥ A_{Φ} (x) - A_{Φ} (y) ∥_{B} \leq 4 σ .

d (x, y) = min {∥ x + y ∥_{B}, ∥ x - y ∥_{B}} = 2 min {∥ f ∥_{B}, ∥ h ∥_{B}} = 2.

d (x, y) = min {∥ x + y ∥_{B}, ∥ x - y ∥_{B}} = 2 min {∥ f ∥_{B}, ∥ h ∥_{B}} = 2.

∥ A_{Φ} (x) - A_{Φ} (y) ∥_{B} ≲ σ and d (x, y) ≳ 1 .

∥ A_{Φ} (x) - A_{Φ} (y) ∥_{B} ≲ σ and d (x, y) ≳ 1 .

∥ C_{Φ} f ∥_{\textfrak B} < ε ∥ f ∥_{B} .

∥ C_{Φ} f ∥_{\textfrak B} < ε ∥ f ∥_{B} .

∥ φ_{ω} - φ_{λ} ∥_{B^{'}} < \frac{ε}{∥ χ _{Λ} ∥ _{\textfrak B}} \forall ω \in U_{λ} .

∥ φ_{ω} - φ_{λ} ∥_{B^{'}} < \frac{ε}{∥ χ _{Λ} ∥ _{\textfrak B}} \forall ω \in U_{λ} .

∥ φ_{λ} - φ_{λ_{j}} ∥_{B^{'}} < \frac{ε}{∥ χ _{Λ} ∥ _{\textfrak B}} \forall λ \in U_{j} .

∥ φ_{λ} - φ_{λ_{j}} ∥_{B^{'}} < \frac{ε}{∥ χ _{Λ} ∥ _{\textfrak B}} \forall λ \in U_{j} .

∣ ⟨ f, φ_{λ} ⟩ ∣ \leq ∣ ⟨ f, φ_{λ_{j}} ⟩ ∣ + ∣ ⟨ f, φ_{λ} - φ_{λ_{j}} ⟩ ∣

∣ ⟨ f, φ_{λ} ⟩ ∣ \leq ∣ ⟨ f, φ_{λ_{j}} ⟩ ∣ + ∣ ⟨ f, φ_{λ} - φ_{λ_{j}} ⟩ ∣

A_{Φ} f (λ)

A_{Φ} f (λ)

\leq j = 1 \sum N ∣ ⟨ f, φ_{λ_{j}} ⟩ ∣ χ_{U_{j}} (λ) + j = 1 \sum N ∣ ⟨ f, φ_{λ} - φ_{λ_{j}} ⟩ ∣ χ_{U_{j}} (λ)

< j = 1 \sum N ∣ ⟨ f, φ_{λ_{j}} ⟩ ∣ χ_{U_{j}} (λ) + \frac{ε ∥ f ∥ _{B}}{∥ χ _{Λ} ∥ _{\textfrak B}} χ_{Λ} (λ) .

∥ C_{Φ} f ∥_{\textfrak B} = ∥ A_{Φ} f ∥_{\textfrak B} < j = 1 \sum N ∣ ⟨ f, φ_{λ_{j}} ⟩ ∣∥ χ_{U_{j}} ∥_{\textfrak B} + ε ∥ f ∥_{B}

∥ C_{Φ} f ∥_{\textfrak B} = ∥ A_{Φ} f ∥_{\textfrak B} < j = 1 \sum N ∣ ⟨ f, φ_{λ_{j}} ⟩ ∣∥ χ_{U_{j}} ∥_{\textfrak B} + ε ∥ f ∥_{B}

∥ C_{Φ_{S}} f ∥_{\textfrak B} < ε ∥ f ∥_{B} and ∥ C_{Φ_{Λ ∖ S}} h ∥_{\textfrak B} < ε ∥ h ∥_{B} .

∥ C_{Φ_{S}} f ∥_{\textfrak B} < ε ∥ f ∥_{B} and ∥ C_{Φ_{Λ ∖ S}} h ∥_{\textfrak B} < ε ∥ h ∥_{B} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Phase Retrieval: Uniqueness and Stability

Philipp Grohs

Faculty of Mathematics

University of Vienna

Oskar-Morgenstern-Platz 1

A-1090 Vienna, Austria

[email protected]

,

Sarah Koppensteiner

Faculty of Mathematics

University of Vienna

Oskar-Morgenstern-Platz 1

A-1090 Vienna, Austria

[email protected]

and

Martin Rathmair

Faculty of Mathematics

University of Vienna

Oskar-Morgenstern-Platz 1

A-1090 Vienna, Austria

[email protected]

Abstract.

The problem of phase retrieval, i.e., the problem of recovering a function from the magnitudes of its Fourier transform, naturally arises in various fields of physics, such as astronomy, radar, speech recognition, quantum mechanics, and, perhaps most prominently, diffraction imaging. The mathematical study of phase retrieval problems possesses a long history with a number of beautiful and deep results drawing from different mathematical fields, such as harmonic analysis, complex analysis, and Riemannian geometry. The present paper aims to present a summary of some of these results with an emphasis on recent activities. In particular we aim to summarize our current understanding of uniqueness and stability properties of phase retrieval problems.

1. Introduction

The problem of phase retrieval, i.e., the problem of recovering a function from the magnitudes of its Fourier transform, naturally arises in various fields of physics, such as astronomy [40], radar [67], speech recognition [89], and quantum mechanics [85]. The most prominent example, however, is diffraction imaging, where in a basic experiment an object is placed in front of a laser which emits coherent electromagnetic radiation. The object interacts with the incident wave in a diffractive manner, creating a new wave front, which is described by Kirchhoff’s diffraction equation. An adequate approximation of the resulting wave front in the far field is given by the Fraunhofer diffraction equation, which essentially states that the wave front in a plane at a sufficiently large distance from the object is given by the Fourier transform (with appropriate spatial scaling) of the function representing the object; cf. [50] for an introduction to diffraction theory.

The aim in diffractive imaging is to determine the object from measurements of the diffracted wave. This objective is seriously impeded by the fact that measurement devices usually are only capable of capturing the intensities, and a loss of phase information takes place. Reconstructing the object from the far field diffraction intensities, the so-called diffraction pattern, therefore requires one to solve the Fourier phase retrieval problem

Given $|\hat{f}|$ , find $f$ (up to trivial ambiguities).

The name “phase retrieval” accounts for the fact that recovery of the phase of $\hat{f}$ is equivalent to recovering $f$ itself.

In microscopy a lens is employed to essentially invert the Fourier transformation and create the image of the object. While this is possible in the case of visible light, which has a wavelength of approximately $10^{-7}\text{m}$ , lenses which perform this task are not available for waves of much shorter wavelength (e.g. for x-rays with a wavelength in the range between $10^{-8}\text{m}$ and $10^{-11}\text{m}$ ). Since the spatial resolution of the optical system is proportional to the wavelength of light the direct approach using lenses can only achieve a certain level of resolution. In order to obtain high resolution it is necessary to compute the image from the diffraction pattern.

Determining objects from diffraction patterns—and therefore the question of phase retrieval—for the first time became relevant when Max von Laue discovered in 1912 that x-rays are diffracted when interacting with crystals, an insight for which he would be awarded the Nobel Prize in Physics two years later. The discovery of this phenomenon launched the field of x-ray crystallography. Crystallography seeks to determine the atomic and molecular structure of a crystal, i.e., a material whose atoms are arranged in a periodic fashion. In the diffraction pattern the periodicity of the crystalline sample manifests itself in the form of strong peaks (Bragg peaks) lying on the so-called reciprocal lattice; cf. [82]. From the position and the intensities of these peaks chrystallographers can deduce the electron density of the crystal. Over the course of the past century the methods of x-ray crystallography have developed into the most powerful tool for analyzing the atomic structure of various materials and have enabled scientists to achieve breakthrough results in different fields such as chemistry, medicine, biology, physics, and the material sciences. This is highlighted by the fact that more than a dozen Nobel Prizes have been awarded for work involving x-ray crystallography, the discovery of the double helix structure of DNA [101] being just one example. For an exhaustive introduction to x-ray crystallography the interested reader may have a look at [57, 76].

In 1980 it was proposed by David Sayre [92] to extend the approach of x-ray crystallography to noncrystalline specimens. Almost twenty years later, facilitated by the development of new powerful x-ray sources, Sayre et al.[80] for the first time successfully reconstructed the image of a sample with resolution at nanometer scale from its x-ray diffraction pattern. This approach is nowadays known as Coherent Diffraction Imaging (CDI). The process consists of two principal steps. First, the acquisition of one or multiple diffraction patterns, and second, processing the diffraction patterns in order to obtain the image of the sample, which is usually done by applying iterative phase retrieval algorithms. Plenty of CDI methods have been developed in recent years and have been employed to great success in physics, biology, and chemistry. See [81, 93] for very recent overviews of CDI methods, for their limitations and their achievements in various applications, and for algorithmic phase retrieval methods in diffraction imaging.

Even though the quest for recovering lost phase information has been omnipresent in physics for more than a century now, the phase retrieval problem has only very recently—with a few exceptions—started receiving great attention by the mathematics community. One notable exception is the work of Herbert Hauptman beginning in the 1950s. The direct methods developed by Hauptman [58], together with Jerome Karle have been applied with great success to determine the structure of many crystals. In 1985 Hauptman and Karle were awarded the Nobel Prize in Chemistry.

As a second significant exception we mention the work of Joseph Rosenblatt from the 1980s [91], where the problem of phase retrieval from Fourier magnitudes is studied in great generality.

Phase retrieval in the most general formulation is concerned with reconstructing a function $f$ in a space $\mathcal{X}$ from the phaseless information of some transform of $f$ . The operator describing the transform, which will be denoted by $T$ , is mapping elements of $\mathcal{X}$ into another space $\mathcal{Y}$ of either real- or complex-valued functions and is usually linear, i.e.,

[TABLE]

Furthermore, $T$ is usually nicely invertible, which means that $T:\mathcal{X}\rightarrow\text{ran}T$ has a bounded inverse.

In order to have a concrete example in mind one may think of $\mathcal{X}=\mathcal{Y}=L^{2}(\mathbb{R}^{d})$ and $T=\mathcal{F}$ , the Fourier transform operator. In this case it is well known that $T$ is a unitary map.

Under the above assumptions the linear measurement process does not introduce a loss of information. However, the situation changes significantly if the phase information of the transform is absent. The problem arises of studying the obviously nonlinear mapping

[TABLE]

and its invertibility properties. Well-posedness in the sense of Hadamard of an inverse problem associated with $f\mapsto\mathcal{A}f$ requires

(1)

existence of a solution, i.e., $\mathcal{A}$ to be surjective; 2. (2)

uniqueness, i.e., $\mathcal{A}$ to be injective; and 3. (3)

stability, meaning that the solution continuously depends on the data, i.e., $\mathcal{A}^{-1}$ to be continuous.

For the problem of phase retrieval, condition (1) amounts to identifying the image of the operator $\mathcal{A}$ . The question is often of minor importance compared to (2) and (3) as it is simply assumed that the input data arise from the measurement process described by $\mathcal{A}$ .

Provided that $\mathcal{X}$ is a vector space—excluding trivial cases— $\mathcal{A}$ is not injective due to the simple observation that

[TABLE]

Further ambiguities may occur, such as translations in the Fourier example but also less trivial ones. The first key question in the mathematical analysis of a phase retrieval problem is to identify all ambiguities. Depending on the context a particular source of ambiguities is either classified as trivial or as severe. If there exist severe ambiguities the phase retrieval problem is hopeless as there exist different objects yielding identical measurements. If on the other hand all occurring ambiguities are considered trivial, $f$ and $g$ may be identified ( $f\sim g$ ) whenever $\mathcal{A}f=\mathcal{A}g$ . Let $\tilde{\mathcal{X}}=\mathcal{X}/\sim$ denote the quotient set. Then—by definition— $\mathcal{A}$ is injective as mapping acting on $\tilde{\mathcal{X}}$ and uniqueness in this new sense is ensured.

In order to study stability, $\tilde{\mathcal{X}}$ has to be endowed with a reasonable topology first. In the case where $\mathcal{X}$ is a normed space and the only ambiguities occurring are of the type shown in (1), usually the quotient metric

[TABLE]

is used. If there are other ambiguities, a suitable choice may be less obvious.

Beyond determining whether the mapping $\mathcal{A}$ on $\tilde{\mathcal{X}}$ is continuously invertible further continuity properties of the inverse are often studied such as (local) Lipschitz continuity.

If there are nontrivial ambiguities, i.e., if injectivity is not attained after identifying all trivial ambiguities or if the inverse is not continuous, one or both of the following measures may be taken in order to render the phase retrieval problem well-posed:

(A)

Restriction of $\mathcal{A}$ : The restriction $\mathcal{A}:\tilde{\mathcal{X}}^{\prime}\rightarrow\mathcal{A}(\tilde{\mathcal{X}}^{\prime})$ , where $\tilde{\mathcal{X}}^{\prime}\subset\tilde{\mathcal{X}}$ is eventually injective (has a continuous inverse) if $\tilde{\mathcal{X}}^{\prime}$ is chosen sufficiently small, $\tilde{\mathcal{X}}^{\prime}$ consisting of a single element being the extremal, trivial example.

Restriction of $\mathcal{A}$ to a smaller domain can be understood as imposing additional structural assumptions on the function $f$ to be reconstructed. In applications of the phase retrieval problem from Fourier measurements, for instance, it is typically sensible to demand that $f$ be nonnegative, as other functions do not hold a physical meaning. 2. (B)

Modification of $T$ : The idea is to suitably modify $T$ in order to soften the setback which is suffered by the subsequent removal of the phase information.

In the case of the Fourier phase retrieval problem this can be achieved by applying several different manipulations of $f$ before computation of the Fourier transform, e.g., using

[TABLE]

for known functions $g_{1},\ldots,g_{m}$ instead of $Tf=\hat{f}$ . In the context of diffraction imaging this approach is common practice, as a physical system which produces measurements $|T^{\prime}f|=\left(|\widehat{fg_{1}}|,\ldots,|\widehat{fg_{m}}|\right)$ can often be implemented. In ptychography—a concept proposed by Walter Hoppe in the 1960s [61]—different sections of an object are illuminated one after another and the object is to be reconstructed from several diffraction patterns. For suitable, localized window functions $g_{1},\ldots,g_{m}$ , (2) serves as a reasonable mathematical model.

As a second example let us mention holography, invented by Dennis Gabor in 1947 [47]. In holography the diffracted waves interfere with the wave field of a known object. This idea amounts to an additive distortion of the wave field $T^{\prime}f:=\widehat{f+g}$ , where $g$ is a known reference wave.

To the best of our knowledge, these ideas (the restriction and modification approach that is) have been systematically implemented for the first time in a series of papers by Jaming [67, 68].

When studying a concrete phase retrieval problem with an application in the background it is useful to keep in mind that often there is a certain degree of freedom in the way the measurements are acquired. For instance, in diffraction imaging there is the fundamental observation that the wave in the object plane and the wave in the far field are connected in terms of the Fourier transform. However there are many different options in how to generate one or several diffraction patterns. Instead of viewing a phase retrieval problem as the analysis of a fixed operator $\mathcal{A}$ one may as well include the question of how to design the measurement process in order to get a well-posed problem.

Beyond the question of well-posedness it is desirable to provide a method that recovers a function $f$ (at least the equivalence class $[f]_{\sim}$ ) from the observed measurements $\mathcal{A}f$ . Such a method could be an explicit expression of the inverse of $\mathcal{A}$ . Mostly the aim of coming up with an explicit expression is rather hopeless. In practice iterative algorithms are employed, which serve as approximate inverses of the measurement mapping $\mathcal{A}$ . A framework based on iterative projections which is often simple to implement and has proved to be very flexible was introduced by Gerchberg and Saxton [48] and was later extended by Fienup [44]. These methods have been employed to great empirical success, but due to the absence of convexity there is no guarantee of convergence. In recent years Candes, Strohmer, and Voroninski [32] have studied phase retrieval in a random setup and proposed an algorithm which provably recovers $f$ with high probability.

Phase retrieval problems have been studied in a rich variety of shapes. They can be distinguished between finite- and infinite-dimensional as well as between discrete and continuous phase retrieval problems. Furthermore phase retrieval problems differ in what kind of measurements are considered, i.e., the choice of the operator $T$ . The most common choice is that $T$ involves some sort of Fourier transform [2, 99, 60, 4, 78]. Moreover there is a huge body of research in the more abstract setting of frames, where it is assumed that $T$ is induced by a frame [14, 16, 28, 5]. Phase retrieval problems where the quantity of interest is assumed to arise as the solution of certain differential equations have also been studied [70, 69].

It is the aim of the present paper to present an overview of a selection of the aforementioned developments. In Section 2 we summarize our current understanding of abstract phase retrieval problems, that is, without precisely specifying the nature of the observed measurements.

Then we specialize to phase retrieval problems arising from (masked or windowed) Fourier transform measurements. The finite-dimensional case is considered in section 3, and the continuous infinite-dimensional setting in section 4. We present several (well-known and also new) results on uniqueness and stability of the corresponding phase retrieval problems.

Finally, we believe that phase retrieval offers researchers a unique combination of beautiful and deep mathematics as well as very concrete physical applications. It is our hope to convey some of our enthusiasm for this topic to the reader.

2. Abstract Phase Retrieval

From an abstract point of view, Fourier phase retrieval lends itself to the following interpretation: Of a function $f$ , we are given the absolute values of measurements given by bounded linear functionals. In the case of Fourier phase retrieval, the family of linear functionals are just the pointwise evaluation of the Fourier transform $\{f\mapsto\hat{f}(x):x\in\mathbb{R}^{d}\}$ .

With this interpretation in mind, we can phrase the phase retrieval problem in a more abstract way. Throughout this section let $\mathcal{B}$ denote a Banach space over $\mathbb{K}\in\{\mathbb{R},\mathbb{C}\}$ and $\mathcal{B}^{\prime}$ its topological dual space. Furthermore, let $\Lambda$ be a not necessarily countable index set. For a family of bounded linear functionals $\Phi:=\{\varphi_{\lambda}:\lambda\in\Lambda\}\subseteq\mathcal{B}^{\prime}$ , we define the operator of phaseless measurements by

[TABLE]

where $\langle\,.\,,\,.\,\rangle$ denotes the dual pairing. Due to the linearity, it is clear that $\mathcal{A}_{\Phi}(cf)=\mathcal{A}_{\Phi}f$ for phase factors $|c|=1$ . We therefore introduce the equivalence relation $cf\sim f$ and say $\Phi$ does phase retrieval if the mapping

[TABLE]

is injective.

2.1. Injectivity

Suppose $\Phi:=\{\varphi_{\lambda}:\lambda\in\Lambda\}\subseteq\mathcal{B}^{\prime}$ is a family of bounded linear functionals and $S\subseteq\Lambda$ . We then write $\Phi_{S}:=\{\varphi_{\lambda}:\lambda\in S\}\subseteq\Phi$ . For a linear subspace $V$ of $\mathcal{B}^{\prime}$ , let $V_{\perp}:=\{f\in\mathcal{B}:\langle f,v\rangle=0\quad\forall v\in V\}$ denote the annihilator of $V$ in $\mathcal{B}$ .

Definition 2.1.

The family $\Phi\subseteq\mathcal{B}^{\prime}$ satisfies the complement property in $\mathcal{B}$ if we have $(\operatorname{span}\Phi_{S})_{\perp}=\{0\}$ or $(\operatorname{span}\Phi_{\Lambda\setminus S})_{\perp}=\{0\}$ for every $S\subseteq\Lambda$ .

Then the complement property is necessary for $\mathcal{A}_{\Phi}$ to be injective. In the real case, it is even sufficient.

Theorem 2.2.

Let $\mathcal{B}$ be a Banach space over $\mathbb{K}\in\{\mathbb{R},\mathbb{C}\}$ and $\Phi\subseteq\mathcal{B}^{\prime}$ a family of bounded linear functionals. Then the following hold:

(i)

If $\mathcal{A}_{\Phi}$ is injective, then $\Phi$ satisfies the complement property. 2. (ii)

If $\mathbb{K}=\mathbb{R}$ and $\Phi$ satisfies the complement property, then $\mathcal{A}_{\Phi}$ is injective.

Theorem 2.2 has quite a history. It was first stated for finite dimensions in Balan, Casazza, and Edidin [14]. The arguments for the complex case should have been given more care. Bandeira et al. [16] spotted this oversight and gave an alternative proof for the complex case in finite dimensions. In doing so, they produced a series of characterizations for injectivity in finite dimensions. Moreover, they had the crucial insight for stability of phase retrieval by introducing a “numerical version” of the complement property (see section 2.2).

Ultimately, only a minor correction was necessary to repair Balan et al.’s proof and the same arguments also work in infinite dimensions. This is the proof we present below, which can also be found in [5, 24, 28].

Proof.

(i) Let $\mathcal{A}_{\Phi}$ be injective for $\Phi=\{\varphi_{\lambda}:\lambda\in\Lambda\}$ and $S\subseteq\Lambda$ arbitrary. Assume that there is a nonzero $f\in(\operatorname{span}\Phi_{S})_{\perp}$ and let $h\in(\operatorname{span}\Phi_{\Lambda\setminus S})_{\perp}$ . We have to show that $h=0$ . First note that

[TABLE]

Hence $\mathcal{A}_{\Phi}(f+h)=\mathcal{A}_{\Phi}(f-h)$ . As $\mathcal{A}_{\Phi}$ is assumed to be injective, there exists a phase factor $|c|=1$ such that $f+h=c(f-h)$ . Since $f\neq 0$ we have $c\neq-1$ and then

[TABLE]

which implies that $\mathcal{A}_{\Phi}h=0$ . Now the injectivity of $\mathcal{A}_{\Phi}$ implies $h=0$ as expected.

(ii) Suppose $\mathcal{A}_{\Phi}$ is not injective, this means that there exist $f,h\in\mathcal{B}$ such that $\mathcal{A}_{\Phi}f=\mathcal{A}_{\Phi}h$ . Since $\Phi=\{\varphi_{\lambda}:\lambda\in\Lambda\}$ consists of real-valued linear functionals, the signed measurements of $f$ and $h$ with respect to $\Phi$ can only differ by a factor of $c=-1$ . We therefore consider the following partition of the index set $\Lambda$ : Let $S:=\{\lambda\in\Lambda:\langle f,\varphi_{\lambda}\rangle=\langle h,\varphi_{\lambda}\rangle\}$ ; then $\Lambda\setminus S=\{\lambda\in\Lambda:\langle f,\varphi_{\lambda}\rangle=-\langle h,\varphi_{\lambda}\rangle\}$ .

Consequently, $f-h\in(\operatorname{span}\Phi_{S})_{\perp}$ and $f+h\in(\operatorname{span}\Phi_{\Lambda\setminus S})_{\perp}$ . But by assumption at least one of those annihilators consists only of [math]. Hence $f=h$ or $f=-h$ and therefore $\mathcal{A}_{\Phi}$ is injective. ∎

For the Paley–Wiener space $PW^{p,b}_{\mathbb{R}}:=\{f\in L^{p}(\mathbb{R},\mathbb{R}):\operatorname{supp}\hat{f}\subseteq[-b/2,b/2]\}$ ( $1<p<\infty$ ) of real-valued band-limited functions, one can show that the complement property holds for families of point-evaluations $\Phi=\{\delta_{\lambda}:\lambda\in\Lambda\}$ if the sampling rate exceeds twice the critical density [5]. Since $PW^{p,b}_{\mathbb{R}}$ is a real-valued Banach space, this implies that phase retrieval is possible.

For complex Banach spaces, the complement property is not sufficient. Hence other methods need to be employed to study injectivity. For Fourier-type measurements, these tools often come from complex analysis (see sections 3 and 4).

We now turn to the finite-dimensional case. The complement property implies that $\Phi\subseteq\mathbb{K}^{d}$ needs to span the whole space and must be overcomplete for phase retrieval to be possible. In other words, $\Phi$ must be a frame.

In the remainder of this section, we state necessary and sufficient conditions on the number of frame elements of $\Phi$ to do phase retrieval. The first result is an easy consequence of the complement property.

Corollary 2.3.

If $N<2d-1$ , then $\mathcal{A}_{\Phi}$ cannot be injective for any family $\Phi\subseteq\mathbb{K}^{d}$ with $N$ elements.

Proof.

We partition $\Phi$ into two sets $\Phi_{S},\Phi_{\Lambda\setminus S}$ with at most $d-1$ elements. This yields $\operatorname{span}\Phi_{S}\neq\mathbb{K}^{d}$ and $\operatorname{span}\Phi_{\Lambda\setminus S}\neq\mathbb{K}^{d}$ , clearly violating the complement property. ∎

For $\mathbb{K}=\mathbb{R}$ , the converse statement also holds for “almost all” frames. To make this more precise, we need some terminology of algebraic geometry.

An algebraic variety in $\mathbb{K}^{d}$ is the common zero set of finitely many polynomials in $\mathbb{K}[x_{1},\dots,x_{d}]$ . By defining algebraic varieties in $\mathbb{K}^{d}$ as closed, we obtain the Zariski topology. Note that this topology is coarser than the Euclidean topology on $\mathbb{K}^{d}$ , meaning that every Zariski-open set is also open with respect to the Euclidean topology. Furthermore, nonempty Zariski-open sets are dense with respect to the Euclidean topology and have full Lebesgue measure in $\mathbb{K}^{d}$ [16, 34].

We say a generic point in $\mathbb{K}^{d}$ satisfies a certain property, if there exists a nonempty Zariski-open set with this property. By the above, this means that if a certain property holds for a generic point, it holds for almost all points in $\mathbb{K}^{d}$ .

Now we identify a frame $\Phi\subseteq\mathbb{K}^{d}$ of $N$ elements with a $d\times N$ matrix of full rank. Hence the set of frames with $N$ elements in $\mathbb{K}^{d}$ , i.e., the set of matrices of full rank in $\mathbb{K}^{d\times N}$ , is a nonempty Zariski-open set and it makes sense to study generic points within the set of frames. We call those generic points generic frames.

The following theorem is due to Balan, Casazza, and Edidin [14]. Together with Corollary 2.3, it (almost) characterizes the injectivity of phase retrieval in $\mathbb{R}^{d}$ .

Theorem 2.4.

If $N\geq 2d-1$ , then $\mathcal{A}_{\Phi}$ is injective for a generic frame $\Phi\subseteq\mathbb{R}^{d}$ with $N$ elements.

For phase retrieval in $\mathbb{C}^{d}$ , Bandeira et al. [16] conjectured an analogous characterization with $4d-4$ being the critical number of frame elements. They also gave a proof in dimensions $d=2,3$ . Conca et al. [34] (see also [73]) proved the following theorem, confirming the sufficient part of the $(4d-4)$ -Conjecture.

Theorem 2.5.

Let $d\geq 2$ . If $N\geq 4d-4$ , then $\mathcal{A}_{\Phi}$ is injective for a generic frame $\Phi\subseteq\mathbb{C}^{d}$ with $N$ elements.

Conversely, a frame in $\mathbb{C}^{d}$ with $N<4d-4$ elements does not allow phase retrieval in dimensions $d=2^{k}+1$ [34]. But the $(4d-4)$ -Conjecture does not hold in general: Vinzant [97] gave an example of a frame with $11=4d-5$ elements in $\mathbb{C}^{4}$ which does phase retrieval. For necessary lower bounds in general dimension, we refer the interested reader to Wang and Xu [100]. A more in-depth account of the history of necessary and sufficient bounds for phase retrieval in $\mathbb{C}^{d}$ can be found in [24]. Furthermore, Bodmann and Hammen [20, 21] developed concrete algorithms and error bounds for phase retrieval with low-redundancy frames.

2.2. Stability

Once the question of injectivity is answered positively, the question of stability arises. Stability refers to the continuity of the operator $\mathcal{A}_{\Phi}^{-1}:\operatorname{ran}\mathcal{A}_{\Phi}\to\mathcal{B}/_{\sim}$ . To this end, we need to introduce a topology on $\mathcal{B}/_{\sim}$ and find a suitable Banach space $\textfrak{B}$ with $\operatorname{ran}\mathcal{A}_{\Phi}\subseteq\textfrak{B}\subseteq\mathbb{K}^{\Lambda}$ . The natural choice for $\mathcal{B}/_{\sim}$ is the quotient metric

[TABLE]

The analysis space for frames in separable Hilbert spaces is the sequence space $\ell^{2}(\Lambda)$ . We will consider the stability of phase retrieval for continuous Banach frames in this section. There, the appropriate generalization of $\ell^{2}(\Lambda)$ is an “admissible” Banach space $\textfrak{B}$ such that the range of the coefficient operator

[TABLE]

is contained in $\textfrak{B}$ .

Definition 2.6.

Let $\Lambda$ be a $\sigma$ -compact topological space. A Banach space $\textfrak{B}\subseteq\mathbb{K}^{\Lambda}$ is called admissible if it satisfies the following properties:

(i)

The indicator function $\chi_{K}$ of every compact set $K\subseteq\Lambda$ satisfies $\|\chi_{K}\|_{\textfrak{B}}<\infty$ . 2. (ii)

The Banach space $\textfrak{B}$ is solid; this means that $\|w\|_{\textfrak{B}}\leq\|z\|_{\textfrak{B}}$ whenever ${|w(\lambda)|\leq|z(\lambda)|}$ for all $\lambda\in\Lambda$ . 3. (iii)

The elements of $\textfrak{B}$ with compact support are dense in $\textfrak{B}$ .

These properties are quite reasonable. Indeed, all $L^{p}$ -spaces for $1\leq p<\infty$ are admissible Banach spaces and $L^{\infty}$ violates only the last point unless $\Lambda$ is already compact.

Now we are in a position to define stability of phase retrieval precisely.

Definition 2.7.

Let $\Phi\subseteq\mathcal{B}^{\prime}$ be a family of bounded linear functionals and $\textfrak{B}$ and admissible Banach space such that $C_{\Phi}:\mathcal{B}\to\textfrak{B}$ . We say that the phase retrieval of $\Phi$ is stable (with respect to $\textfrak{B}$ ) if there exist constants $0<\alpha\leq\beta<\infty$ such that

[TABLE]

Moreover, let $\alpha_{\operatorname{opt}}(\Phi),\beta_{\operatorname{opt}}(\Phi)$ denote the optimal lower and upper Lipschitz bound respectively.

Definition 2.8.

Suppose that $\Phi:=\{\varphi_{\lambda}:\lambda\in\Lambda\}\subseteq\mathcal{B}^{\prime}$ is a family of bounded linear functionals such that $\lambda\mapsto\varphi_{\lambda}$ is continuous. We call $\Phi$ a continuous Banach frame if there exists an admissible Banach space such that the following is satisfied:

(i)

There exist positive constants $0<A\leq B<\infty$ such that

[TABLE]

Moreover, let $A_{\operatorname{opt}}(\Phi),B_{\operatorname{opt}}(\Phi)$ denote the optimal constants satisfying (4). 2. (ii)

There exists a continuous operator $R:\textfrak{B}\to\mathcal{B}$ , the so-called reconstruction operator, satisfying

[TABLE]

The requirement for $\Phi$ to be a frame is a natural one. In fact, if $C_{\Phi}$ maps into an admissible Banach space, the solidity implies $\|\mathcal{A}_{\Phi}f\|_{\textfrak{B}}=\|C_{\Phi}f\|_{\textfrak{B}}$ . Hence, stability in the sense of (3) implies the frame inequality (4) by taking $h=0$ . For the upper inequalities, we even have equivalence.

Proposition 2.9.

If $\Phi\subseteq\mathcal{B}^{\prime}$ is a family of continuous linear functionals such that $C_{\Phi}$ maps into an admissible Banach space, then $\beta_{\operatorname{opt}}=B_{\operatorname{opt}}$ .

Again the solidity of the admissible Banach space plays an integral role in the proof. As the rest follows from straightforward estimates, we omit the proof and refer the interested reader to [5, 28].

The remainder of the section deals with the lower inequality in (3). We start by mentioning an interesting result about the continuity of the inverse operator $\mathcal{A}_{\Phi}^{-1}$ , which can be regarded as a weaker form of stability.

Theorem 2.10.

Let $\Phi\subseteq\mathcal{B}^{\prime}$ be a continuous Banach frame and $\mathcal{A}_{\Phi}$ injective. Then $\mathcal{A}_{\Phi}^{-1}$ is continuous on the range of $\mathcal{A}_{\Phi}$ .

Proof idea.

We need to show that the convergence of the image sequence $\mathcal{A}_{\Phi}f_{n}\to\mathcal{A}_{\Phi}f$ in $\textfrak{B}$ implies the convergence of $f_{n}\to f$ in $\mathcal{B}$ . The idea is to link the convergence of $\mathcal{A}_{\Phi}f_{n}$ to the convergence of the signed measurements $C_{\Phi}f_{n}$ . This is the technical and lengthy part of the proof, and we refer the interested reader to [5] for the details. Once this relation is established, one can use the continuous reconstruction operator $R$ to obtain $f_{n}\to f$ . ∎

As an easy consequence of Theorem 2.10, we obtain stability of phase retrieval in finite-dimensional Banach spaces.

Theorem 2.11.

Let $\mathcal{B}$ be a finite-dimensional Banach space. If $\Phi$ is a frame that does phase retrieval, then $\mathcal{A}_{\Phi}$ has a lower Lipschitz bound $\alpha_{\operatorname{opt}}>0$ .

Proof.

Note that the existence of a positive lower Lipschitz bound $\alpha_{\operatorname{opt}}>0$ in (3) is equivalent to $\mathcal{A}_{\Phi}^{-1}:\operatorname{ran}\mathcal{A}_{\Phi}\to\mathcal{B}/_{\sim}$ being Lipschitz continuous with constant $L=\alpha_{\operatorname{opt}}^{-1}$ .

By Theorem 2.10, the inverse $\mathcal{A}_{\Phi}^{-1}$ is continuous on $\operatorname{ran}\mathcal{A}_{\Phi}$ . Since $\mathcal{B}$ is finite-dimensional, the closed unit ball $B(0,1)$ is compact, and therefore $\mathcal{A}_{\Phi}^{-1}$ is uniformly continuous on $\operatorname{ran}\mathcal{A}_{\Phi}\cap B(0,1)$ . By using the scaling invariance of $\mathcal{A}_{\Phi}^{-1}$ and playing everything back into the unit ball $B(0,1)$ , the Lipschitz continuity follows in a series of straightforward estimates. ∎

The result of Theorem 2.11 was proved first for the real case in [15, 16]. Cahill, Casazza, and Daubechies [28] gave a proof for the complex case. The proof above is from [5].

For their proof of stability in finite dimensions, Bandeira et al. [16] introduced the following “numerical” version of the complement property, which relates to stability as the complement property relates to injectivity.

Definition 2.12.

The family $\Phi\subseteq\mathcal{B}^{\prime}$ satisfies the $\sigma$ -strong complement property in $\mathcal{B}$ if there exists a $\sigma>0$ such that

[TABLE]

Moreover, let $\sigma_{\operatorname{opt}}(\Phi)$ denote the supremum over all $\sigma>0$ satisfying (5).

Theorem 2.13.

Let $\mathcal{B}$ be a Banach space over $\mathbb{K}\in\{\mathbb{R},\mathbb{C}\}$ and $\Phi\subseteq\mathcal{B}^{\prime}$ a continuous Banach frame. Then there exists a constant $C>0$ such that

[TABLE]

In the real case, the constant is $C=2$ . For the complex case, the constant can be chosen $C=2B_{\operatorname{opt}}/A_{\operatorname{opt}}$ .

Remark 2.14.

For the real case, one can also show that $\sigma_{\operatorname{opt}}\leq C\alpha_{\operatorname{opt}}$ for some $C>0$ . This implies that the $\sigma$ -strong complement property is not only necessary, but also sufficient for stability in real Banach spaces. In this sense, it mirrors the behavior of the complement property.

Unfortunately, the sufficiency cannot be exploited for (global) stability: On one hand, phase retrieval is always stable in finite dimensions by Theorem 2.11 and on the other hand, we will see that the $\sigma$ -strong complement property can never hold in infinite dimensions.

Proof.

Let $\sigma>\sigma_{\operatorname{opt}}$ . Then there exist a subset $S\subseteq\Lambda$ and $f,h\in\mathcal{B}$ with $\|f\|_{\mathcal{B}}=\|h\|_{\mathcal{B}}=1$ such that

[TABLE]

Now set $x:=f+h$ and $y:=f-h$ . Due to the solidity of $\textfrak{B}$ , we obtain

[TABLE]

where we used the reverse triangle inequality in the second line.

By definition of $\alpha_{\operatorname{opt}}$ , we conclude

[TABLE]

In the real case, we are done since

[TABLE]

The complex case proves to be more difficult. A series of elementary estimates are necessary to bound $d(x,y)$ away from zero. We refer the interested reader to the original article [5]. ∎

Remark 2.15.

The computations in the proof of Theorem 2.13 also yield an estimate on local stability constants. More precisely, suppose a fixed $x\in\mathcal{B}$ can be decomposed according to $x=f+h$ such that $\|f\|_{\mathcal{B}}\asymp 1$ , $\|h\|_{\mathcal{B}}\asymp 1$ and that (6) holds for $\sigma\ll 1$ . Then there exists $y\in\mathcal{B}$ such that

[TABLE]

Thus, $x$ and $y$ yield similar measurements even though they are very different from each other.

Theorem 2.13 implies that the $\sigma$ -strong complement property is necessary for stability. Bandeira et al. [16] gave a proof of this for the real case and conjectured the complex case, which was proved in [5].

For finite dimensions, phase retrieval is always stable by Theorem 2.11. In particular, the $\sigma$ -strong complement property is satisfied. In infinite dimensions, we will see that continuous Banach frames cannot satisfy the $\sigma$ -strong complement property, hence phase retrieval is always unstable in this case. To show this, we follow [5] and prove an intermediate result, which is interesting in its own right. It states that there cannot exist continuous Banach frames in infinite dimensions with compact index set $\Lambda$ .

Proposition 2.16.

Suppose $\mathcal{B}$ is an infinite-dimensional Banach space and $\Lambda$ a compact index set. Then any family $\Phi:=\{\varphi_{\lambda}:\lambda\in\Lambda\}\subseteq\mathcal{B}^{\prime}$ with continuous mapping $\lambda\mapsto\varphi_{\lambda}$ fails to satisfy the lower frame inequality. This means that for every $\varepsilon>0$ there exists an $f\in\mathcal{B}$ such that

[TABLE]

Proof.

Let $\varepsilon>0$ . By continuity of the mapping $\lambda\mapsto\varphi_{\lambda}$ , there exists for every $\lambda\in\Lambda$ an open neighborhood $U_{\lambda}$ such that

[TABLE]

Since $\Lambda$ is compact, the open covering $\{U_{\lambda}:\lambda\in\Lambda\}$ admits a finite subcover $\{U_{\lambda_{1}},\dots,\allowbreak U_{\lambda_{N}}\}$ . Now set $U_{1}:=U_{\lambda_{1}}$ and $U_{j}:=U_{\lambda_{j}}\setminus\bigcup_{k=1}^{j-1}U_{k}$ for $j=2,\dots,N$ to obtain a partition of $\Lambda$ which satisfies the following for all $j=1,\dots,N$ :

[TABLE]

Clearly, we have

[TABLE]

for all $j=1,\dots N$ . After multiplication with the characteristic function $\chi_{U_{j}}$ and summing over $j$ , we obtain

[TABLE]

Now the solidity of $\textfrak{B}$ implies

[TABLE]

for all $f\in\mathcal{B}\setminus\{0\}$ . Since $\mathcal{B}$ is infinite-dimensional, there exists a nonzero $f_{0}\in\mathcal{B}$ such that $\langle f_{0},\varphi_{\lambda_{j}}\rangle=0$ for all $j=1,\dots,N$ . Consequently, the sum on the right-hand side vanishes for $f_{0}$ and we obtain the claim. ∎

Theorem 2.17.

Let $\mathcal{B}$ be an infinite-dimensional Banach space over $\mathbb{K}\in\{\mathbb{R},\mathbb{C}\}$ and $\Phi\subseteq\mathcal{B}^{\prime}$ a continuous Banach frame. Then $\Phi$ does not satisfy the $\sigma$ -strong complement property.

Proof.

We need to show that the $\sigma$ -strong complement property is not satisfied. This means that for every $\varepsilon>0$ we can find a subset $S\subseteq\Lambda$ and $f,h\in\mathcal{B}$ such that

[TABLE]

We start with an arbitrary $f\in\mathcal{B}$ with $\|f\|_{\mathcal{B}}=1$ . Since $\textfrak{B}$ is an admissible Banach space where compact elements are dense, there exists a nested sequence of compact subsets $K_{n}\subseteq K_{n+1}$ with $\bigcup_{n\in\mathbb{N}}K_{n}=\Lambda$ such that

[TABLE]

Hence, there exists a $K_{N}$ such that

[TABLE]

Setting $S:=\Lambda\setminus K_{N}$ , we obtain $\|C_{\Phi_{S}}f\|_{\textfrak{B}}<\varepsilon\|f\|_{\mathcal{B}}$ .

On the other hand, we can use Theorem 2.16 for the compact set $\Lambda\setminus S=K_{N}$ to find an $h\in\mathcal{B}$ such that

[TABLE]

∎

Corollary 2.18.

Let $\mathcal{B}$ be an infinite-dimensional Banach space over $\mathbb{K}\in\{\mathbb{R},\mathbb{C}\}$ and $\Phi\subseteq\mathcal{B}^{\prime}$ a continuous Banach frame. Then $\Phi$ cannot do stable phase retrieval. This means that for every $\varepsilon>0$ , there exist $f,h\in\mathcal{B}$ with $\|\mathcal{A}_{\Phi}(f)-\mathcal{A}_{\Phi}(h)\|_{\textfrak{B}}<\varepsilon$ but $d(f,h)\geq 1$ .

Proof.

This is an immediate consequence of the fact that the $\sigma$ -strong complement property is necessary for stability by Theorem 2.13, but continuous Banach frames in infinite dimensions cannot satisfy it by Theorem 2.17. ∎

Remark 2.19.

Phase retrieval in infinite dimensions cannot be stable for continuous Banach frames by Corollary 2.18. On the other hand, Theorem 2.11 states that it is always stable in finite dimensions. The natural question that arises is the following: Suppose $V_{n}\subseteq\mathcal{B}$ is a sequence of finite-dimensional subspaces and let $\alpha(V_{n})$ denote the stability constant for the subspace $V_{n}$ in (3). How fast does the stability constant $\alpha(V_{n})$ degenerate as the dimension increases?

It turns out, this can be rather rapidly: Cahill, Casazza, and Daubechies [28] considered subspaces of increasing dimension in the Paley–Wiener space and showed that the stability constant degrades exponentially fast in the dimension. Even worse degeneration can be observed for the short-time Fourier transform with Gaussian window on $L^{2}(\mathbb{R})$ : Alaifari and one of the authors [6] constructed a sequence of subspaces whose stability constant degrades quadratically exponentially in the dimension.

3. Finite Dimensional Phase Retrieval

This section is devoted to phase retrieval from Fourier measurements in the finite-dimensional setting. The first emphasis lies on identifying ambiguities for phase retrieval from phaseless discrete time Fourier transform (DTFT) measurements. The second main focus lies on discussing various strategies to remove ambiguities, and yield well-posed reconstruction problems. These strategies include priors such as assuming sparsity of the signals to be reconstructed or tweaking of the measurement process, for example by increasing the number of measurements and/or by introducing randomness.

3.1. The classical Fourier Phase Retrieval Problem

In the following we will discuss the problem of recovering a signal from its phaseless Fourier transform. We consider multidimensional discrete signals. This means that for $n\in\mathbb{N}^{d}$ , a discrete signal is a complex-valued function on

[TABLE]

Definition 3.1.

The discrete-time Fourier transform $\hat{x}$ of a discrete signal $x=(x_{j})_{j\in J_{n}}\in\mathbb{C}^{J_{n}}$ is defined by

[TABLE]

where the normalization $\omega/n:=(\omega_{1}/n_{1},\dots,\omega_{d}/n_{d})$ is understood componentwise and $j\cdot\omega:=\sum_{k=1}^{d}j_{k}\omega_{k}$ denotes the inner product on $\mathbb{R}^{d}$ .

The problem of Fourier phase retrieval can now be stated as follows.

Problem 1 (Fourier phase retrieval, discrete).

Recover $x\in\mathbb{C}^{J_{n}}$ from $|\hat{x}|$ .

Remark 3.2.

For $x\in\mathbb{C}^{J_{n}}$ the squared modulus of its DTFT $|\hat{x}|^{2}$ is a trigonometric polynomial and is uniquely defined by its values on a suitable, finite sampling set $\Omega\subseteq\mathbb{R}^{d}$ . The problem of recovering $x$ from the full Fourier magnitude $|\hat{x}(\omega)|,~{}\omega\in\mathbb{R}^{d}$ is therefore equivalent to the problem of recovering $x$ from finitely many samples of the Fourier magnitude $|\hat{x}(\omega)|,~{}\omega\in\Omega$ .

The goal is to characterize all ambiguous solutions of Problem 1 for a given signal $x\in\mathbb{C}^{J_{n}}$ . Before we do so let us draw the attention of the reader to Fienup’s paper [45] from the 1970s where the following observation is made:

Experimental results suggest that the uniqueness problem is severe for one-dimensional objects but may not be severe for complicated two-dimensional objects.

Within this section we will give a rigorous explanation of this phenomenon.

Before we identify ambiguities, we have to explain what it means to reflect and translate a signal $x\in\mathbb{C}^{J_{n}}$ . We define the reflection operator $R$ on $\mathbb{C}^{J_{n}}$ by

[TABLE]

and the translation operator $T_{\tau}$ for $\tau\in\mathbb{Z}^{d}$ by

[TABLE]

where the modulo operation is to be understood componentwise. Similarly the conjugation operation will be understood componentwise. For $z\in\mathbb{C}^{d}$ and $j\in\mathbb{Z}^{d}$ we will write $z^{-1}:=(z_{1}^{-1},\ldots,z_{d}^{-1})$ and $z^{j}:=z_{1}^{j_{1}}\cdot\ldots\cdot z_{d}^{j_{d}}$ for short.

Proposition 3.3.

Let $x\in\mathbb{C}^{J_{n}}$ . Then each of the following choices of $y$ yields the same Fourier magnitudes as $x$ , i.e., $|\hat{y}|=|\hat{x}|$ :

(i)

$y=cx$ * for $|c|=1$ ;* 2. (ii)

$y=T_{\tau}x$ * for $\tau\in\mathbb{Z}^{d}$ ;* 3. (iii)

$y=\overline{Rx}$ .

Proof.

The statement follows from (i) linearity of the Fourier transform, (ii) translation amounts to modulation in the Fourier domain and (iii) reflection and conjugation amounts to conjugation in the Fourier domain. ∎

The ambiguities described in Proposition 3.3, as well as combinations thereof, are considered trivial. By identifying trivial ambiguities an equivalence relation $\sim$ is introduced on $\mathbb{C}^{J_{n}}$ , i.e.,

[TABLE]

To determine all ambiguities we will study the so-called $Z$ -transform.

Definition 3.4.

For $x\in\mathbb{C}^{J_{n}}$ the $Z$ -transform is defined by

[TABLE]

The question of uniqueness of Problem 1 is closely connected to whether the $Z$ -transform has a nontrivial factorization, as we shall see.

Definition 3.5.

A polynomial $p$ of one or several variables is called reducible if there exist nonconstant polynomials $q$ and $r$ such that $p=q\cdot r$ . Otherwise $p$ is called irreducible.

In what follows, let $p(z)=\sum_{j}c_{j}z^{j}$ denote a multivariate polynomial. Its degree $\deg(p)\in\mathbb{N}_{0}^{d}$ is defined with respect to each coordinate, i.e.,

[TABLE]

Later we will need to consider the mapping $z\mapsto\overline{p(\bar{z}^{-1})}$ . Clearly, its singularities can be removed by multiplication with a suitable monomial. Indeed,

[TABLE]

is again a polynomial. Finally, let $\nu(p)\in\mathbb{N}_{0}^{d}$ denote the largest exponent (component-wise) such that $z^{\nu(p)}$ is a divisor of $p$ . Thus there exists a unique polynomial $p_{0}$ such that

[TABLE]

To shed some more light onto these concepts we consider a concrete example.

Example 3.6.

Let us consider the polynomial $p$ on $\mathbb{C}^{2}$ defined by

[TABLE]

We see that $\deg(p)=(3,2)$ and that $\nu(p)=(1,0)$ . Moreover,

[TABLE]

and

[TABLE]

It is not difficult to verify that for any polynomial $p\neq 0$ it holds that

[TABLE]

and that

[TABLE]

The following theorem characterizes all ambiguities of the discrete Fourier phase retrieval problem.

Note that multiplication of the Z-transform $X$ of $x$ by a unimodular factor $\gamma$ by linearity corresponds to multiplication of the $x$ itself. Multiplication of $z^{\tau}$ for $\tau\in\mathbb{Z}^{d}$ corresponds to translation in the signal domain, and flipping the $Z$ -transform, i.e., passing over to $\overline{X(\bar{z}^{-1})}$ , amounts to reflection and conjugation in the signal domain.

Theorem 3.7.

Let $x,y\in\mathbb{C}^{J_{n}}$ and let $X,Y$ denote their respective $Z$ -transforms. Then $|\hat{x}|=|\hat{y}|$ if and only if there exist a factorization $Y=Y_{1}\cdot Y_{2}$ , a constant $\gamma$ with $|\gamma|=1$ , and $\tau\in\mathbb{Z}^{d}$ such that

[TABLE]

Proof.

First we show the necessity of the statement. Suppose $y$ is an ambiguous solution with respect to $x$ . By definition $X(z)=\sum_{j\in J_{n}}x_{j}z^{j}$ and thus, using the notation

[TABLE]

we observe that $X(e^{-2\pi i\omega/n})=\hat{x}(\omega)$ . For the squared magnitude of the Fourier transform it therefore holds that

[TABLE]

where conjugation and the reciprocal are to be understood component-wise. By the assumption that $|\hat{x}|=|\hat{y}|$ and by analytic continuation, we obtain

[TABLE]

Now factorize $X$ and $Y$ into irreducible polynomials:

[TABLE]

After multiplying both sides of (8) by $z^{n}$ , we obtain the equality

[TABLE]

Since $p_{i}$ is irreducible it follows that $p_{i}^{*}(z)=z^{\deg(p_{i})}\overline{p_{i}(\bar{z}^{-1})}$ is irreducible. Moreover we have that $\nu(p_{i}^{*})=0$ . Obviously the same arguments can be applied to $(p^{\prime}_{i})^{*}(z)=z^{\deg(p^{\prime}_{i})}\overline{p^{\prime}_{i}(\bar{z}^{-1})}$ .

By uniqueness of the factorization it follows that

[TABLE]

and that $L=L^{\prime}$ . Now let $I$ be a maximal subset of $\{1,\ldots,L\}$ such that $\prod_{i\in I}p_{i}$ divides $\prod_{i=1}^{L}p^{\prime}_{i}$ and let $J:=\{1,\ldots,L\}\setminus I$ . Without loss of generality (w.l.o.g.) we assume that $I=\{1,\ldots,l\}$ with $l\leq L$ and that $\prod_{i\leq l}p_{i}$ divides $\prod_{i\leq l}p_{i}^{\prime}$ (this can be achieved by permutation of the index sets). Due to irreducibility it must hold that

[TABLE]

for suitable nonzero constant $a$ , and, consequently that

[TABLE]

where $b$ is another nonzero constant. Use (10) and (11) and cancel (9) by the respective factors to obtain that

[TABLE]

for a constant $c\neq 0$ . From the maximality of $I$ it follows that $\prod_{i>l}p_{i}$ divides $\prod_{i>l}(p_{i}^{\prime})^{*}$ , and thus that $\prod_{i>l}p_{i}^{*}$ divides $\prod_{i>l}p_{i}^{\prime}$ . Therefore there exists $d\neq 0$ such that

[TABLE]

Hence we get that

[TABLE]

Note that $|ad|=1$ , since

[TABLE]

Consequently, we obtain for suitable $m\in\mathbb{Z}^{d}$ and $\gamma=ad$ the factorization

[TABLE]

with $Y_{1}:=\prod_{i\leq l}p^{\prime}_{i}$ and $Y_{2}:=\prod_{i>l}p^{\prime}_{i}$ .

For the sufficiency let $X$ be a polynomial of the form (7). Then

[TABLE]

∎

For $x$ to have nontrivial ambiguities it is therefore necessary that its $Z$ -transform $X$ be reducible. Note that this is not sufficient in general, as the factors of $X$ may possess symmetry properties such that a flipping does not introduce nontrivial ambiguities. Nevertheless, this observation yields an upper bound on the number of ambiguous solutions for $x\in\mathbb{C}^{J_{n}}$ denoted by

[TABLE]

Corollary 3.8.

Let $x\in\mathbb{C}^{J_{n}}$ and let $X$ denote its $Z$ -transform. Then $\mathcal{N}(x)\leq 2^{L-1}$ , where $L$ denotes the number of nontrivial factors of $X$ .

In the one-dimensional case $d=1$ the $Z$ -transform $X$ is a polynomial of one variable of order $k\leq n$ . By the fundamental theorem of algebra, $X$ has $k$ roots and can be expressed as a product of $k$ linear factors. Assuming none of the roots lie on the unit circle and coincides each element in the power set of the set of roots (except for the empty set and the full set) induces a nontrivial ambiguity. The situation in the higher dimensional case is radically different, as shown by Hayes and McClellan [59].

Theorem 3.9 ([59]).

Let $\mathcal{P}^{d,k}$ denote the set of complex polynomials of $d>1$ variables with order $k$ and let $m$ denote the degrees of freedom of $\mathcal{P}^{d,k}$ . We identify $\mathcal{P}^{d,k}$ with $\mathbb{C}^{m}\simeq\mathbb{R}^{2m}$ . Then the set of reducible polynomials in $\mathcal{P}^{d,k}$ is a set of measure zero (as a subset of $\mathbb{C}^{m}$ ).

Corollary 3.8 together with Theorem 3.9 yields the following result.

Corollary 3.10.

If $d=1$ , then for any fixed $n\in\mathbb{N}$ the set $\{x\in\mathbb{C}^{n}:\mathcal{N}(x)<2^{n-1}\}$ is of measure zero.

If $d>1$ , then for any fixed $n\in\mathbb{N}^{d}$ the set $\{x\in\mathbb{C}^{J_{n}}:\mathcal{N}(x)>1\}$ is of measure zero.

A frequently used prior restriction on the signals is to require sparsity. To assume sparsity of the underlying signal appears natural in many practical applications such as crystallography or astronomy. For a thorough discussion of sparse phase retrieval among other topics, we refer the reader to the excellent survey articles [19, 65]. We choose to present at this point one particular result on sparse Fourier phase retrieval which nicely complements the univariate statement in Corollary 3.10.

Theorem 3.11 ([66]).

For $3\leq k\leq n-1$ let $\mathcal{S}_{k}^{n}$ denote the set of $k$ -sparse signals in $\mathbb{C}^{n}$ , i.e., the set of vectors that possess at most $k$ nonzero entries, with aperiodic support. Then almost all $x\in\mathcal{S}_{k}^{n}$ are uniquely determined by $|\hat{x}|$ up to a constant sign factor within $\mathcal{S}_{k}^{n}$ .

3.2. Fourier phase retrieval using masks

In the one-dimensional case the modulus of the DTFT is not a useful representation for most signals. A popular strategy in order to increase information and introduce redundancy to counter the loss of phase is to allow for masked Fourier measurements. By a mask we mean a function $m\in\mathbb{C}^{N}$ with the corresponding phaseless measurement process being described by

[TABLE]

where $\odot$ denotes the pointwise product in $\mathbb{C}^{N}$ . In order to attain a sufficient amount of information it is common to employ not one but several different masks. We essentially distinguish between two types of this kind of measurement. First, in case of the short-time Fourier (STFT) measurements, the various masks are generated by applying shifts to a fixed window function. This is the mathematical model behind ptychography, where an aperture is slid over the sample to illuminate different parts (see Figure 1). Second, the various masks can be chosen in a completely unstructured manner.

We shall see that the uniqueness issues which have been discussed in the first part of this section can be removed if the masks are suitably chosen. In the multivariate setting a generic signal is uniquely (up to trivial ambiguities) determined by the modulus of its DTFT; cf. Corollary 3.10. However, there are deterministic signals—namely, those which possess a reducible Z-transform—for which uniqueness fails to hold. In a randomized setting where one can observe the modulus of the DTFT of $x\odot m$ and the entries of the mask $m$ are drawn randomly according to a suitable distribution, uniqueness holds with probability one provided that the support of the signal $x$ satisfies a rather weak assumption, as shown by Fannjiang [43].

3.2.1. Discrete Short-Time Fourier Phase Retrieval

In this section, we consider finite signals $x$ in the complex Hilbert space $\mathbb{C}^{N}$ with inner product

[TABLE]

The discrete Fourier transform maps finite signals to finite signals and is defined as

[TABLE]

Its inverse is given by

[TABLE]

and with the normalization above, Plancherel’s theorem is of the form

[TABLE]

We define the (circular) translation and modulation operators by

[TABLE]

for $k,l\in\mathbb{Z}_{N}$ . In the following, we identify the finite signal $x\in\mathbb{C}^{N}$ with its periodic extension and just write $(T_{k}x)_{j}=x_{j-k}$ for the circular translated signal.

Since a modulation in time corresponds to a shift in frequency, operators of the form $\pi(\lambda)=\pi(k,l):=M_{l}T_{k}$ are called time-frequency shifts for $\lambda=(k,l)$ . Note that time-frequency shifts do not commute, but satisfy the following commutation relation.

Lemma 3.12.

Let $\lambda=(k,l),\mu=(p,q)\in\mathbb{Z}_{N}^{2}$ . Then

[TABLE]

where $\mathcal{I}=\left(\begin{smallmatrix}0&1\\ -1&0\end{smallmatrix}\right)$ denotes the standard symplectic matrix.

We omit the proof, as it is a straightforward verification.

The discrete short-time Fourier transform of $x\in\mathbb{C}^{N}$ with respect to the window $g\in\mathbb{C}^{N}$ is defined by

[TABLE]

for $\lambda=(k,l)\in\mathbb{Z}_{N}^{2}$ .

For fixed window $g$ , the short-time Fourier transform $V_{g}$ is a linear operator that maps finite signals in $\mathbb{C}^{N}$ to finite signals in $\mathbb{C}^{N\times N}$ . Due to the linearity, we again have the trivial ambiguity $|V_{g}(cx)|=|V_{g}x|$ for phase factors $|c|=1$ . Now the question is whether these are the only ambiguities, and how can the original signal be recovered.

Problem 2 (discrete short-time Fourier phase retrieval).

Suppose $x\in\mathbb{C}^{N}$ . Recover $x$ from $|V_{g}x|$ up to a global phase factor when $g\in\mathbb{C}^{N}$ is known.

Whether Problem 2 has a solution depends on the choice of the window $g$ . A sufficient condition is that the short-time Fourier transform $V_{g}g$ does not vanish anywhere on $\mathbb{Z}_{N}^{2}$ . In the following we aim at proving this fact.

The main insight for short-time Fourier transform phase retrieval comes from the following formula which also appears in [49] and will be proved in what follows.

Proposition 3.13.

Let $x,y,g,h\in\mathbb{C}^{N}$ . Then

[TABLE]

where $\mathcal{I}=\left(\begin{smallmatrix}0&1\\ -1&0\end{smallmatrix}\right)$ denotes the standard symplectic matrix.

The proof of formula (13) is elementary and requires only two things: the covariance property, which is an easy consequence of the commutation relations (12), and a version of Plancherel’s theorem for the short-time Fourier transform.

Lemma 3.14 (Covariance Property).

Let $\lambda,\mu\in\mathbb{Z}_{N}^{2}$ . Then

[TABLE]

Proof.

Note that time-frequency shifts are unitary operators on $\mathbb{C}^{N}$ . Hence

[TABLE]

where we used the commutation relation (12) on the second line. ∎

Proposition 3.15 (Orthogonality Relations).

Let $g,h,x,y\in\mathbb{C}^{N}$ . Then

[TABLE]

Proof.

We write the short-time Fourier transform as $V_{g}x(k,l)=(x\cdot T_{k}\bar{g})\widehat{\phantom{x}}(l)$ and use Plancherel’s theorem in the sum over $l\in\mathbb{Z}_{N}$ :

[TABLE]

∎

Proof of Proposition 3.13.

First note that $\mathcal{I}^{2}=-I$ , where $I$ denotes the identity matrix. Consequently,

[TABLE]

by Lemma 3.14. Hence, we obtain

[TABLE]

where we used the orthogonality relations (14) in the last step.

Note that $\pi(\lambda)^{*}=c\pi(-\lambda)$ for a suitable phase factor $|c|=1$ . But these phase factors cancel when we bring both time-frequency shifts to the other side, hence

[TABLE]

∎

We can now prove a sufficient condition on the window to allow phase retrieval.

Theorem 3.16.

*Let $g\in\mathbb{C}^{N}$ be a window with $V_{g}g(\lambda)\neq 0$ for all $\lambda\in\mathbb{Z}_{N}^{2}$ . Then any $x\in\mathbb{C}^{N}$ can be recovered from $|V_{g}x|$ up to a global phase factor. *

Proof.

By Proposition 3.13 we have

[TABLE]

If $V_{g}g$ has no zeros, we can recover $V_{x}x$ . Now we apply the inverse discrete Fourier transform to $V_{x}x(k,l)=(x\cdot T_{k}\bar{x})\widehat{\phantom{x}}(l)$ and obtain

[TABLE]

Setting $k=j$ yields

[TABLE]

and we recover the signal $x$ up to a global phase factor after dividing by $|x_{0}|$ . ∎

Theorem 3.16 also appears in [22], where it is proved with the methods introduced in [16]. Moreover, the authors also give examples and counterexamples of window functions $g$ satisfying $V_{g}g(\lambda)\neq 0$ for all $\lambda\in\mathbb{Z}_{N}^{2}$ .

Different variants of Problem 2 have been studied over the years. One possible, alternative point of view is to restrict the problem to either sparse signals or signals that do not vanish at all in favor of weaker assumptions imposed on the window. We showcase one illustrative result in this direction and point the reader toward the articles by Jaganathan, Eldar, and Hassibi [64] and Bendory, Beinert, and Eldar [19] which give an excellent overview on both uniqueness and algorithmic aspects of short-time Fourier transform phase retrieval.

Theorem 3.17 ([42]).

Let $g\in\mathbb{C}^{n}$ be a window of length $W\geq 2$ , where the length of $g$ is defined as the length of the smallest interval in $\mathbb{Z}_{n}$ containing the support of $g$ . Then every $x\in\mathbb{C}^{n}$ with nonvanishing entries is defined uniquely by $\left|V_{g}x\right|$ provided

(i)

the discrete Fourier transform of $v$ defined as $v_{n}:=|g_{n}|^{2},\,n\in\mathbb{Z}_{n}$ is nonvanishing; 2. (ii)

$n\geq 2W-1$ ; and 3. (iii)

$n$ * and $W-1$ are coprime.*

Next, we study the case of a randomly picked window.

Theorem 3.18.

There exists a set $E\subset\mathbb{C}^{n}$ of measure zero such that for all $g\in\mathbb{C}^{n}\setminus E$ the family $(\pi(\lambda)g)_{\lambda\in\mathbb{Z}^{2}}$ allows for phase retrieval, i.e., the mapping

[TABLE]

is injective.

Proof.

To prove the theorem we closely follow the proof techniques used by Bojarovska and Flinth [22, Proposition 2.1] where a similar statement is shown.

By Theorem 3.16 and since there are only finitely many $\lambda$ , it suffices to show that there exists a set $E$ of measure zero such that for arbitrary but fixed $\lambda_{0}$ it holds that

[TABLE]

Since $\pi(\lambda_{0})$ is unitary there exists an orthonormal basis $(q_{j})_{j=1}^{n}$ of $\mathbb{C}^{n}$ and $(\alpha_{j})_{j=1}^{n}\subset\mathbb{T}$ such that

[TABLE]

where $q_{j}^{*}$ denotes the conjugate transpose of the row vector $q_{j}$ . If we expand the window $g$ with respect to the basis, i.e., $g=\sum_{j}\beta_{j}q_{j}$ , we get that

[TABLE]

Since $E^{\prime}:=\{\beta\in\mathbb{C}^{n}:\,\sum_{j}\overline{\alpha_{j}}\left|\beta_{j}\right|^{2}=0\}$ is a manifold of codimension one in $\mathbb{C}^{n}\simeq\mathbb{R}^{2n}$ and $\beta\mapsto g=\sum_{j}\beta_{j}q_{j}$ is an isometry it follows that

[TABLE]

is of measure zero, and indeed for all $g\in\mathbb{C}^{n}\setminus E$ it holds that $V_{g}g(\lambda_{0})\neq 0$ . ∎

3.2.2. Phase retrieval with equiangular frames

This subsection is devoted to presenting the work by Balan et al. [13], “Painless reconstruction from magnitudes of frame coefficients.” The main results reveal that the structure of certain, carefully designed frames can be leveraged to derive explicit reconstruction formulas for the corresponding phase retrieval problem.

To specify the properties of the frames that we shall consider we give a few definitions.

Definition 3.19.

Let $\mathcal{H}$ be a $d$ -dimensional Hilbert space. A finite family of vectors $\left\{f_{1},\ldots,f_{N}\right\}\subset\mathcal{H}$ is called

•

$A$ -tight frame, with frame constant $A>0$ if all $x\in\mathcal{H}$ can be reconstructed from the sequence of frame coefficients $\left(\langle x,f_{j}\rangle\right)_{j=1}^{N}$ according to

[TABLE]

•

uniform A-tight frame if it is an $A$ -tight frame and there is $b>0$ such that $\|f_{j}\|=b$ for all $j\in\{1,\ldots,N\}$ .

•

$2$ -uniform A-tight (or equiangular) frame if it is a uniform $A$ -tight frame and there exists $c>0$ such that $\left|\langle f_{j},f_{l}\rangle\right|=c$ for all $j\neq l$ .

A simple example of an equiangular frame is given by the so-called Mercedes-Benz frame.

Example 3.20.

Let $\mathcal{H}=\mathbb{R}^{2}\simeq\mathbb{C}$ . Then the three vectors defined by

[TABLE]

form a $2$ -uniform $3/2$ -tight frame.

The size $N$ of a $2$ -uniform tight frame is bounded from above in terms of the dimension of the space.

Theorem 3.21 ([13, Proposition 2.3]).

Let $N$ denote the number of vectors in a $2$ -uniform tight frame on a $d$ -dimensional real or complex Hilbert space $\mathcal{H}$ . Then $N\leq d(d+1)/2$ in the real case, and $N\leq d^{2}$ in the complex case, respectively.

We shall call a $2$ -uniform tight frame maximal if it is of maximal size, i.e., if $N=d(d+1)/2$ in the real case, and $N=d^{2}$ in the complex case.

Definition 3.22.

Let $\mathcal{H}$ be a real or complex Hilbert space. A family of vectors $(e_{k}^{(j)})_{j\in\mathbb{J},k\in\mathbb{K}}$ with $\mathbb{J}=\{1,\ldots,d\}$ and $\mathbb{K}=\{1,\ldots,m\}$ is said to form $m$ mutually unbiased bases if for all $j,j^{\prime}\in\mathbb{J}$ and $k,k^{\prime}\in\mathbb{K}$ it holds that

[TABLE]

If $(e_{k}^{(j)})$ form $m$ mutually unbiased bases, then it follows from the defining equality (15), that for fixed $j$ we have that $(e_{k}^{(j)})_{k\in\mathbb{K}}$ is an orthonormal basis. Moreover, the magnitudes of the inner products attain just the three values $0,1$ , and $1/\sqrt{d}$ . In that sense the notion of mutually unbiased bases may be regarded as a relaxation of equiangular frames, which allows for only two values.

As in the case of equiangular frames the size of mutually unbiased bases is bounded from above in terms of the dimension.

Proposition 3.23 ([13, Proposition 2.6]).

There are at most $m=d+1$ mutually unbiased bases in a $d$ -dimensional complex Hilbert space $\mathcal{H}$ .

To every $x\in\mathcal{H}$ we associate the operator

[TABLE]

which is (up to a scaling factor) the projection onto the span of $x$ . In particular, $x$ is uniquely determined up to a sign factor by $\mathcal{Q}_{x}$ .

A special case of the reconstruction formula reads as follows.

Theorem 3.24 ([13, Theorem 3.4], special case).

Let $\mathcal{H}$ be a $d$ -dimensional complex Hilbert space and suppose that $F=\{f_{1},\ldots,f_{N}\}$ satisfies one of the following assumptions:

(i)

$F$ * forms a maximal $2$ -uniform $N/d$ -tight frame.* 2. (ii)

$F$ * forms $d+1$ mutually unbiased bases.*

Given a vector $x\in\mathcal{H}$ with associated self-adjoint rank-one operator $\mathcal{Q}_{x}$ , then

[TABLE]

where $I$ denotes the identity operator on $\mathcal{H}$ .

Remark 3.25.

Note that in [13] the preceding theorem is phrased in terms of $2$ -uniform tight frames that give rise to so-called projective $2$ -designs. The assumptions made in the version as stated above are stronger; see [13, Example 3.3].

Furthermore, note that since the frame is assumed to be $N/d$ -tight, it follows that

[TABLE]

thus, the right-hand side of (16) can be computed from the magnitudes of the inner products $\langle x,f_{j}\rangle$ .

We conclude this subsection with a concrete example of $d+1$ mutually unbiased bases that have Gabor structure. The construction we consider goes back to Alltop [8]; see also [94].

Lemma 3.26 (Alltop).

Let $p\geq 5$ be prime, let $\omega=e^{2\pi i/p}$ denote the $p$ -th unit root, and define

[TABLE]

Then it holds that

[TABLE]

With the help of Lemma 3.26 we establish the following.

Theorem 3.27.

Let $p\geq 5$ be prime, let $\omega=e^{2\pi i/p}$ denote the $p$ -th unit root, and define $g(t):=p^{-1/2}\omega^{t^{3}},\,t\in\mathbb{Z}_{p}$ . For $1\leq j,k\leq p$ define vectors

[TABLE]

and let $(e_{k}^{(p+1)})_{k=1}^{p}$ denote the canonical basis of $\mathbb{C}^{p}$ .

Then $(e_{k}^{(j)})_{k,j}$ form $d+1$ mutually unbiased bases, and in particular for all $x\in\mathbb{C}^{p}$ it holds that

[TABLE]

Proof.

We only need to check that (15) holds true. Then the reconstruction formula follows by applying Theorem 3.24. The case $j=j^{\prime}$ follows from the preceding lemma and from the definition of $e_{k}^{(d+1)}$ , respectively. Thus, it remains to show that for $j\neq j^{\prime}$ the magnitude of the inner product equals $1/\sqrt{p}$ . We distinguish the two cases (a) $\max\{j,j^{\prime}\}<d+1$ and (b) $j<j^{\prime}=d+1$ .

Since $M_{j}g=b_{j}$ , where $b_{j}$ is defined as in (17) we have in case (a) that

[TABLE]

where the last equality follows again from Lemma 3.26. In case (b) we have that

[TABLE]

∎

3.2.3. Lifting Methods

Inspired by the pioneering work of Candès et al.[29, 32] which produced the now famous PhaseLift algorithm, the use of methods from semidefinite programming to solve phase retrieval problems has become hugely popular in recent years.

Given $\Phi=(\varphi_{k})_{k=1}^{m}\subset\mathbb{C}^{n}$ let us consider the associated phase retrieval problem, i.e., the problem of finding $x\in\mathbb{C}^{n}$ from observations

[TABLE]

PhaseLift and related methods are based on the idea of lifting the signal $x$ of interest to a rank (at most) one matrix $X$ by virtue of

[TABLE]

where $x^{*}$ denotes the conjugate transpose of $x$ . Note that $X$ determines $x$ up to a constant phase factor.

Since

[TABLE]

the original phase retrieval can be reformulated in terms of an optimization problem in $X$ :

[TABLE]

In order to obtain a feasible problem—rank minimization is NP-hard in general—(19) is relaxed by means of trace minimization, which gives rise to the semidefinite program

[TABLE]

As it turns out (20) and the original phase retrieval problem are equivalent if the measurement vectors are sufficiently many and picked at random.

Theorem 3.28 ([32]).

Consider $x\in\mathbb{C}^{n}$ arbitrary. Suppose that

•

the number of measurements obeys $m\geq c_{0}n\log n$ , where $c_{0}$ is a sufficiently large constant and

•

the measurement vectors $\varphi_{k},\,k=1,\ldots,m,$ are independently and uniformly sampled on the unit sphere of $\mathbb{C}^{n}$ .

Then with probability at least $1-3e^{-\gamma m/n}$ , where $\gamma$ is a positive constant, (20) has $X=xx^{*}$ as its unique solution.

Let us point out that [32] also contains a result that guarantees robust reconstruction of $x$ by an adequate modification of (20) from noisy measurements

[TABLE]

where $\nu$ models the effect of noise. Since our main focus lies on Fourier-type measurements we refrain from going into detail.

In a followup article it was shown by Candès, Li, and Soltanolkotabi [30] that also Fourier-type measurements can be accommodated for in the PhaseLift framework if random masks are employed. In the case of masked Fourier measurements, with masks $m^{(l)}\in\mathbb{C}^{n}$ , $l=1,\dots,L$ the quantities that can be observed are

[TABLE]

where $f_{k}:=(e^{2\pi ikj/n})_{j=0}^{n-1}$ and $D_{l}:=\operatorname{diag}\left(m^{(l)}_{j}\right)_{j=0}^{n-1}.$ To set up the corresponding semidefinite program it is enough to use

[TABLE]

as replacements for $\varphi_{k}$ in (20):

[TABLE]

Again, if the measurements are sufficiently many and suitably randomized the trace relaxation (21) is exact.

Theorem 3.29 ([30]).

Let $x\in\mathbb{C}^{n}$ be arbitrary. Suppose that

•

the number of coded diffraction patterns $L$ obeys $L\geq c\log^{4}n$ for some numerical constant $c$ ; and

•

the diagonal matrices $D_{l},~{}l=1,\ldots,L$ , are independent and identically distributed (i.i.d.) copies of a diagonal matrix $D$ , whose entries are themselves i.i.d. copies of a random variable $d$ , where $d$ is assumed to be symmetric, $|d|\leq M$ as well as

[TABLE]

Then with probability at least $1-1/n$ it holds that $X=xx^{*}$ is the only feasible point of (21).

In particular, Theorem 3.29 implies uniqueness with high probability when using on the order of $\mathcal{O}(\log^{4}n)$ random masks, which amounts to a total number of $\mathcal{O}(n\log^{4}n)$ measurements. In the same setting, Gross, Krahmer, and Kueng [56] were able to prove that in fact $\mathcal{O}(\log^{2}n)$ masks are sufficient.

A major drawback of PhaseLift and related methods is that with increasing dimension the algorithms become computationally demanding. After all, the lifting shifts the problem from an $n$ -dimensional space to an $n^{2}$ -dimensional space. A more efficient procedure that does not rely on lifting the signal and still comes with recovery guarantees is the Wirtinger flow as proposed by Candès, Li, and Soltanolkotabi [31]. Wirtinger flow is based on carefully picking an initialization using a spectral method followed by an iterative scheme akin gradient descent.

3.2.4. Polarization Methods

Within this subsection we outline the approach taken by Alexeev et al. [7] and summarize some results based upon these ideas.

Again, the objective is to reconstruct $x$ up to a phase factor from measurements $y_{k}$ , as given by (18). At the heart of the proposed reconstruction method lies the elementary Mercedes-Benz polarization identity,

[TABLE]

where $\zeta:=e^{2\pi i/3}$ . Apply (22) to $a=\langle x,\varphi_{j}\rangle$ and $b=\langle x,\varphi_{l}\rangle$ to obtain

[TABLE]

thus, in particular the relative phase

[TABLE]

between two measurements can be observed under the assumption that we have access to supplementary phaseless measurements $\left|\langle x,\varphi_{j}+\zeta^{k}\varphi_{l}\rangle\right|^{2}$ , and provided that $y_{j}$ and $y_{l}$ do not vanish. Hence in that case phase information can be propagated, meaning that assuming that, the phase of $\langle x,\varphi_{j}\rangle$ is known, one can also reconstruct the phase and thus the value of $\langle x,\varphi_{l}\rangle$ .

It turns out to be very useful to represent the measurement setup as a graph $G=(V,E)$ with each vertex corresponding to one of the measurement vectors $\varphi_{j}$ and vice versa. To the edge between vertices $\varphi_{j}$ and $\varphi_{l}$ the three new measurement vectors

[TABLE]

are attached. Note that if $G$ is fully connected we end up with $\mathcal{O}(m^{2})$ measurement vectors. The main result in [7] guarantees that if the vectors $(\varphi_{j})_{j=1}^{m}$ are drawn randomly and $m$ is of order $n\log n$ , then the graph $G$ can be adaptively pruned in such a way that the resulting graph $G^{\prime}$ possesses at most $\mathcal{O}(n\log n)$ edges, and, moreover, that $x$ can be uniquely recovered from the corresponding $\mathcal{O}(n\log n)$ measurements. The statement holds true with high probability.

It is important to mention that the proposed algorithm does in general not produce an ensemble of measurement vectors such that any $x$ can be recovered since the process of selecting the measurement vectors depends on the observed intensities $y_{k}$ . Furthermore, the algorithm can be adapted in such a way that also reconstruction from noisy measurements can be accommodated.

In a followup paper [17] it was shown by some of the authors of [7] and collaborators that a similar approach can be taken when dealing with masked Fourier measurements. To be more concrete, the main results reveal that the proposed randomized procedure yields $\mathcal{O}(\log n)$ masks such that any signal is uniquely determined up to global phase by the corresponding phaseless masked Fourier measurements.

A robust version of the algorithm, which is capable of handling noisy masked Fourier measurements, was recently introduced by Pfander and Salanevich [87].

4. Infinite Dimensional Fourier Phase Retrieval

This section is devoted to phase retrieval problems where the underlying operator is the continuous Fourier transform or variants thereof. Such problems are typically studied within the scope of complex analysis. As is widely known analytic functions of several complex variables behave very differently from univariate holomorphic functions. As we shall see a qualitative gap between the one-dimensional and the multidimensional cases is also encountered for the problem of Fourier phase retrieval.

4.1. The Classical Fourier Phase Retrieval

For signals of continuous variables we will use the following normalization of the Fourier transform.

Definition 4.1.

Let $f\in L^{1}(\mathbb{R}^{d})$ . The Fourier transform $\hat{f}$ of $f$ is defined by

[TABLE]

For $f\in L^{2}(\mathbb{R}^{d})$ , the Fourier transform is to be understood as the usual extension.

The problem of continuous Fourier phase retrieval can now be stated as follows.

Problem 3 (Fourier phase retrieval, continuous).

Suppose $f\in L^{2}(\mathbb{R}^{d})$ and is compactly supported. Recover $f$ from $|\hat{f}|$ .

We start with identifying the trivial ambiguities. As in the discrete case, let $T_{\tau}$ denote the translation operator $T_{\tau}f(x)=f(x-\tau)$ for $\tau\in\mathbb{R}^{d}$ and $R$ the reflection operator $Rf(x)=f(-x)$ .

Proposition 4.2.

Let $f\in L^{2}(\mathbb{R}^{d})$ . Then each of the following choices of $g$ yields the same Fourier magnitudes as $f$ , i.e., $|\hat{g}|=|\hat{f}|$ :

(i)

$g=cf$ * for $|c|=1$ ;* 2. (ii)

$g=T_{\tau}f$ * for $\tau\in\mathbb{R}^{d}$ ;* 3. (iii)

$g=\overline{Rf}$ .

The proof is straightforward. Again the ambiguities of Proposition 4.2 and their combinations are considered trivial ambiguities.

A standard assumption is to consider only compactly supported functions. In the context of imaging applications, this restriction is rather mild as it requires the object of interest to be of finite extent. The great advantage of this assumption is that the Fourier transform of compactly supported functions extends analytically to all of $\mathbb{C}^{d}$ and one can draw upon complex analysis and the theory of entire functions in particular. By the well-known Paley–Wiener theorem [84] for functions of one variable the converse also holds true. The extension to higher dimensions is due to Plancherel and Pólya [88].

Theorem 4.3 (Paley–Wiener).

Let $f\in L^{2}(\mathbb{R}^{d})$ be compactly supported. Then its Fourier–Laplace transform,

[TABLE]

is an entire function of exponential type; i.e., there exist $C_{1},C_{2}>0$ such that

[TABLE]

Conversely, suppose $F:\mathbb{C}^{d}\to\mathbb{C}$ is an entire function of exponential type and its restriction to the real plane $F|_{\mathbb{R}^{d}}:\mathbb{R}^{d}\to\mathbb{C}$ is square integrable. Then $F$ is the Fourier–Laplace transform of a compactly supported function $f\in L^{2}(\mathbb{R}^{d})$ .

Remark 4.4.

Let us mention that Theorem 4.3 can be extended to compactly supported distributions. This result is also known by the name of Paley–Wiener–Schwartz. See [62, Chapter 7] for more details.

Definition 4.5.

An entire function $F$ of one or several variables is called reducible if there exist entire functions $G,H\neq 0$ both having a nonempty zero set such that $F=G\cdot H$ . Otherwise $F$ is called irreducible.

The decomposition of an entire function of exponential type into irreducible factors is unique up to nonvanishing factors. For functions of one variable this is due to the Weierstrass factorization theorem [75], and for functions of several variables, to Osgood [83]. A similar result as in the discrete case (cf. Theorem 3.7) can be established.

Theorem 4.6.

Let $f,g\in L^{2}(\mathbb{R}^{d})$ be compactly supported and let $F,G$ denote the Fourier–Laplace transform of $f$ and $g$ respectively. Then $|\hat{f}|=|\hat{g}|$ if and only if there exists a factorization $G=G_{1}\cdot G_{2}$ , a constant $\gamma$ with $|\gamma|=1$ , and an entire function $Q$ where $Q\big{|}_{\mathbb{R}^{d}}$ is real-valued such that

[TABLE]

Proof.

The proof is quite similar to the proof of Theorem 3.7. Therefore we give only a sketch. First, from the assumption that $|\hat{f}|=|\hat{g}|$ , it follows by analytic extension that

[TABLE]

Both $F$ and $G$ can be represented as (infinite) products of irreducible functions, where the representations are essentially unique. By plugging the product expansions into (24), one can finally deduce in a similar way as in the proof of Theorem 3.7 that (23) holds true.

Sufficiency follows from the observation that the function defined by the right-hand side of (23) has the same modulus as $G$ for arguments in $\mathbb{R}^{d}$ . ∎

The constant $\gamma$ and the modulation $e^{2\pi i\tau\cdot z}$ in formula (23) correspond to multiplication by a unimodular constant and translation in the signal domain respectively. Flipping the whole Fourier–Laplace transform, i.e., choosing $G(z)=G_{2}(z)=\overline{F(\bar{z})}$ , amounts to reflection and conjugation of the underlying function.

By making use of the Paley–Wiener theorem, we can characterize all ambiguous solutions of a given function $f$ .

Corollary 4.7.

Let $f\in L^{2}(\mathbb{R}^{d})$ be compactly supported, and let $F$ denote its Fourier–Laplace transform. Furthermore, suppose that $F=F_{1}\cdot F_{2}$ such that the entire function $G(z):=F_{1}(z)\cdot\overline{F_{2}(\bar{z})}$ is of exponential type. Then for any constant $\gamma$ with $|\gamma|=1$ and $\tau\in\mathbb{R}^{d}$ the function

[TABLE]

is ambiguous with respect to $f$ , i.e., $|\hat{g}|=|\hat{f}|$ . Here $G|_{\mathbb{R}^{d}}$ denotes the restriction of $G$ to real-valued inputs and $\mathcal{F}$ is the usual Fourier transform on $\mathbb{R}^{d}$ .

For functions of one variable the question of uniqueness has been studied in the late 1950s by Akutowicz [2, 3] and a few years later independently by Walther [99] and Hofstetter [60]. Their results reveal that all ambiguous solutions of the phase retrieval problem are obtained by flipping a set of zeros of the holomorphic extension of the Fourier transform across the real axis.

Theorem 4.8 (Akutowicz–Walther–Hofstetter).

Let $f,g\in L^{2}(\mathbb{R})$ be compactly supported and let $F,G$ denote their respective Fourier–Laplace transforms. Let $m$ denote the multiplicity of the root of $F$ at the origin, and let $Z(F)$ denote the multiset of the remaining zeros of $F$ , where all zeros appear according to their multiplicity. Then $|\hat{f}|=|\hat{g}|$ if and only if there exist $a,b\in\mathbb{R}$ and $E\subset Z(F)$ such that

[TABLE]

Theorem 4.8 can be deduced similarly to Theorem 4.6 by making use of Hadamard’s factorization theorem (see, for example, [1]), which states that an entire function of one complex variable is essentially determined by its zeros. More precisely, suppose $F$ is an entire function of exponential type with a zero of order $m$ at the origin. Then there exist $a,b\in\mathbb{C}$ such that

[TABLE]

For the sufficiency part of Theorem 4.8 it is important to point out that the right-hand side of (25) constitutes an entire function of exponential type for every choice of $E\subset Z(F)$ . This follows from a result due to Titchmarsh [96].

While for functions of one variable the expectation of uniqueness is in general hopeless, it is commonly asserted that—similar to the finite-dimensional case (cf. theorem 3.9)—the situation changes drastically when switching to multivariate functions; see [18], where it is stated that

Irreducibility extends to general functions of two variables with infinite sets of zeros, so that exact alternative solutions are most unlikely in 2-D phase retrieval.

However, we are not aware of a rigorous argument of this claim.

4.2. Restriction of the 1D Fourier Phase Retrieval Problem

Common restriction approaches to achieve uniqueness include the following: demand the function (1) to be real-valued or even positive; (2) to satisfy certain symmetry properties; (3) to be monotonic; or (4) to be supported in a prescribed region. We will only state an incomplete, deliberate selection of results in this direction. Before that we mention that requiring positivity as the only a priori assumption (apart from compact support) does not suffice for $|\hat{f}|$ to uniquely determine $f$ up to trivial ambiguities, as has been shown in [37].

Theorem 4.9.

Suppose that $f\in L^{2}(\mathbb{R})$ is compactly supported and that there exists $t_{0}\in\mathbb{R}$ such that

[TABLE]

Then $f$ is uniquely (up to trivial ambiguities) determined by $|\hat{f}|$ .

Proof.

As translations are trivial ambiguities, we may assume w.l.o.g. that $t_{0}=0$ . Due to the symmetry of $f$ , its Fourier–Laplace transform $F$ satisfies

[TABLE]

Particularly, the zeros of $F$ appear symmetrically with respect to the real axis. Uniqueness now follows from the observation that if any factor of the Hadamard factorization (26) is to be flipped, then necessarily also the factor corresponding to its complex conjugate must be flipped in order to preserve property (27). Thus the flipping procedure cannot introduce ambiguous solutions. ∎

We have seen in the previous theorem that by requiring $f$ to be symmetric, the zeros of its Fourier–Laplace transform appear in a symmetric way, which ensures uniqueness. By requiring that $f$ be monotonically nondecreasing, it can be shown that all the zeros of the Fourier–Laplace transform are located in the lower half-plane, which gives the following result.

Theorem 4.10 ([74]).

Suppose that $f$ is supported in an interval $[a,b]$ , positive, and monotone on $[a,b]$ . Then $f$ is uniquely (up to trivial ambiguities) determined by $|\hat{f}|$ .

A further method to enforce uniqueness is to require the function to be supported on two intervals which are sufficiently far apart from each other.

Theorem 4.11 ([51, 38]).

Suppose that $f=f_{1}+f_{2}\in L^{2}(\mathbb{R})$ , where the support of $f_{1}$ and $f_{2}$ is contained in finite, disjoint intervals $I_{1}$ and $I_{2}$ respectively. Suppose further that the distance between the intervals $I_{1}$ and $I_{2}$ is greater than the sum of their lengths and that the Fourier–Laplace transforms of $f_{1}$ and $f_{2}$ have no common zeros. Then $f$ is uniquely (up to trivial ambiguities) determined by $|\hat{f}|$ .

4.3. Additional Measurements

The use of a second measurement obtained by additive distortion by a known signal has also been considered.

Theorem 4.12 ([74]).

Suppose $g\in L^{2}(\mathbb{R})$ is compactly supported and its Fourier transform is real valued and suppose $f\in L^{2}(\mathbb{R})$ with compact support in $[0,\infty)$ . Then $f$ is uniquely determined by $|\hat{f}|$ and $|\hat{f}+\hat{g}|$ .

If the additive distortion $g$ is chosen to be a suitable multiple of the delta distribution the magnitude information of $\hat{f}$ is dispensable, i.e., if $c$ is sufficiently large compared to $f$ , then $f$ can be recovered from $|\hat{f}+c|$ . The interference of $f$ with such a $g$ pushes all the zeros of the analytic extension of the Fourier transform to the upper half-plane. In this case the relation between phase and magnitude is described by the Hilbert transform [27, 26] and remarkably, phase retrieval is rendered not only unique but also stable.

Theorem 4.13.

For $a,b>0$ we define $\mathcal{B}_{a,b}:=\{f\in L^{2}(\mathbb{R}):\|f\|_{L^{\infty}(\mathbb{R})}<a\allowbreak\text{and }\operatorname{supp}(f)\subseteq[0,b]\}$ and for $c\in\mathbb{R}$ let $L^{2}_{c}(\mathbb{R}):=\{f+c:f\in L^{2}(\mathbb{R})\}$ endowed with the metric

[TABLE]

Suppose $c>ab$ . Then $\mathcal{A}:f\mapsto|\hat{f}+c|$ is an injective mapping from $\mathcal{B}_{a,b}$ to $L^{2}_{c}(\mathbb{R})$ and $\mathcal{A}^{-1}:\mathcal{A}(\mathcal{B}_{a,b})\subseteq L^{2}_{c}(\mathbb{R})\rightarrow\mathcal{B}_{a,b}$ is uniformly continuous, i.e. there exists a constant $C>0$ such that

[TABLE]

Proof.

In order to show that $\mathcal{A}$ maps from $\mathcal{B}_{a,b}$ to $L^{2}_{c}(\mathbb{R})$ let $f\in\mathcal{B}_{a,b}\subseteq L^{2}(\mathbb{R})$ be arbitrary. We have to verify that $\mathcal{A}f-c\in L^{2}(\mathbb{R})$ . By the reverse triangle inequality we have that

[TABLE]

Since $\hat{f}\in L^{2}(\mathbb{R})$ also $\mathcal{A}f-c\in L^{2}(\mathbb{R})$ .

Let us denote by $g$ the analytic extension of $\hat{f}+c$ , i.e.,

[TABLE]

Then—provided that $g$ has all its zeros in the upper (or lower) half-plane—phase and magnitude of $g$ are related via the Hilbert transform [26], i.e.,

[TABLE]

satisfies $g=|g|e^{i\alpha}$ .

In order to make use of this identity, we check that $g$ has no zeros in the lower half-plane: For $\operatorname{Im}z\leq 0$ it holds that

[TABLE]

and we have $|g(z)|\geq||\hat{f}(z)|-|c||\geq c-ab>0$ in the lower half-plane since $c>ab$ .

For $f_{1},f_{2}\in\mathcal{B}_{a,b}$ let $g_{k}:=\hat{f_{k}}+c$ and let $\alpha_{k}:=H(\ln|g_{k}|)$ . Then we have for $k=1,2$ that

[TABLE]

and similarly that $|g_{k}(x)|\leq c+ab.$ It follows that there exists a constant $C_{1}>0$ (depending on $a,b,c$ ) such that

[TABLE]

which implies that the difference $\ln|g_{1}|-\ln|g_{2}|$ is an element of $L^{2}(\mathbb{R})$ . According to (28) the phase difference $\delta:=\alpha_{1}-\alpha_{2}$ can be computed by $\delta=H\left(\ln|g_{1}|-\ln|g_{2}|\right)$ . By using the well-known fact that the Hilbert transform is an isometry on $L^{2}(\mathbb{R})$ [95] and (29) it follows that there exists a constant $C_{2}$ (depending on $a,b,c$ ) such that

[TABLE]

Thus we obtain by using the elementary estimate $|1-e^{it}|\leq|t|,t\in\mathbb{R}$ , that

[TABLE]

for suitable constant $C_{3}>0$ . ∎

Remark 4.14.

Note that the assumption $\operatorname{supp}f\subseteq[0,b]$ implies not only that $\hat{f}$ is band-limited but also $|\hat{f}|^{2}$ and $\operatorname{Re}\hat{f}$ . Therefore the function

[TABLE]

is also band-limited and $|\hat{f}+c|$ can be uniquely and stably determined from samples. Together with Theorem 4.13, this implies that any $f\in\mathcal{B}_{a,b}$ can be recovered stably from the samples of $|\hat{f}+c|$ on a suitable discrete set.

4.3.1. Phase Retrieval from holomorphic measurements

By the Paley–Wiener theorem there is a one-to-one correspondence between certain holomorphic functions and compactly supported $L^{2}$ functions, in the sense that the Fourier transform of such a function extends to such a holomorphic function. As discussed in the previous section this observation plays a crucial role in identifying ambiguous solutions of the classical Fourier phase retrieval problem.

There are further instances of Fourier-type transforms that produce essentially holomorphic measurements such as the short-time Fourier transform with Gaussian window and the Cauchy-wavelet transform, which leads us to pose

Problem 4 (phase retrieval from holomorphic measurements).

Suppose $D\subset\mathbb{C}$ is open, $\mathcal{X}\subset\mathcal{O}(D)$ is a set of admissible functions and $S\subset D$ . Given $F\in\mathcal{X}$ , find all $G\in\mathcal{X}$ such that

[TABLE]

If $D$ is the complex plane, $S$ is the real line, and $\mathcal{X}$ denotes the set of entire functions of exponential type whose restriction to the real line is square integrable, Theorem 4.8 reveals that there is in general a huge amount of nontrivial ambiguities, each of which is created by flipping a set of zeros across the real axis.

However, if the modulus of the function is known on two suitably picked lines (see Figure 2), uniqueness is guaranteed. We first consider the case with two lines passing through the origin.

Theorem 4.15 ([68, Theorem 3.3]).

Let $\mathcal{X}$ denote the set of entire functions of finite order and $S$ the union of two lines passing through the origin

[TABLE]

where $\alpha_{1},\alpha_{2}\in[0,2\pi)$ satisfy $\alpha_{1}-\alpha_{2}\notin\pi\mathbb{Q}$ .

Suppose that $F,G\in\mathcal{X}$ satisfy that $\left|G(z)\right|=\left|F(z)\right|$ for all $z\in S$ . Then there exists $\theta\in\mathbb{R}$ such that $G=e^{i\theta}F$ .

Similarly to Theorem 4.8 the proof of Theorem 4.15 relies on the idea of comparing two entire functions by making use of Hadamard’s factorization theorem. To highlight where the assumption on the angle between the two lines comes into play we give a sketch of the proof.

Proof sketch.

We assume for simplicity that $F$ and $G$ are functions of exponential type with simple zeros and that $\alpha_{1}=0$ . W.l.o.g. we may assume that $F$ and $G$ do not vanish at the origin.

Let the Weierstrass factors be denoted by

[TABLE]

and let $Z(F)$ and $Z(G)$ denote the set of zeros of $F$ and $G$ respectively. By Hadamard’s factorization theorem we have that

[TABLE]

with $a,b,c,d\in\mathbb{C}$ . Since $|F|$ and $|G|$ coincide on the real line it follows that

[TABLE]

and, since $|F|$ and $|G|$ agree on the line $z=e^{i\alpha_{2}}t,t\in\mathbb{R}$ , that

[TABLE]

From (30) and (31) it follows that $\operatorname{Re}a=\operatorname{Re}c$ and that $b=d$ . Let us define the discrete set $D$ by

[TABLE]

It remains to show that $D$ is the empty set. Note that the identities (30) and (31) imply that $D$ is invariant under the mappings $\zeta\mapsto\bar{\zeta}$ and $\zeta\mapsto e^{i\alpha_{2}}\cdot\overline{e^{-i\alpha_{2}}\zeta}=e^{2i\alpha_{2}}\bar{\zeta}$ , and thus also under their composition, which happens to be a rotation

[TABLE]

Assume that $D\neq\emptyset$ . Then there exists $0\neq\zeta_{0}\in D$ . Since $D$ is invariant under $\rho$ we have that the orbit

[TABLE]

By the assumption on $\alpha_{2}$ the set $\omega$ cannot be discrete—a contradiction. ∎

For functions in the Hardy space of the upper half-plane, knowledge of the modulus of the function on two parallel lines is sufficient.

Theorem 4.16 ([79, Theorem 2.1]).

Let $a>0$ be fixed and

[TABLE]

Suppose that $F,G\in\mathcal{X}$ satisfy that

(i)

$\left|G(x+ia)\right|=\left|F(x+ia)\right|$ * for almost all $x\in\mathbb{R}$ and* 2. (ii)

$\lim_{y\searrow 0}\left|G(x+iy)\right|=\lim_{y\searrow 0}\left|G(x+iy)\right|$ * for almost all $x\in\mathbb{R}$ .*

Then there exists $\theta\in\mathbb{R}$ such that $G=e^{i\theta}F$ .

Since the functions considered in Theorem 4.16 are not entire but only holomorphic on the half-plane, Hadamard factorization cannot be applied in this case. There is, however, a substitute available, that is, functions in the Hardy space have a unique representation as a product of its Blaschke factors, which involves so-called inner and outer functions.

In [79], Theorem 4.16 is used in order to establish uniqueness for the phase retrieval problem associated to the Cauchy wavelet transform. Recall that the Cauchy wavelets of order $p>0$ are defined by

[TABLE]

where $a>1$ denotes a fixed dilation factor. The associated wavelet transform is then given by the operator

[TABLE]

Furthermore recall that the analytic part $f_{+}$ of a function $f\in L^{2}(\mathbb{R})$ is defined by

[TABLE]

Theorem 4.17 ([79, Corollary 2.2]).

Let $(\psi_{j})_{j\in\mathbb{Z}}$ be defined as in (33). Suppose $f,g\in L^{2}(\mathbb{R})$ are such that for some $j\neq k$ it holds that

[TABLE]

Then there exists $\theta\in\mathbb{R}$ such that the analytic parts of $f$ and $g$ satisfy

[TABLE]

Remark 4.18.

The article [79] also studies stability properties of the phase retrieval problem for Cauchy wavelets. The authors observe in numerical experiments that instabilities are of a certain “generic” type and give formal arguments that there cannot be other types of instabilities; cf. [79, introduction of Sec. 5]

The goal of this section is to give a partial formal justification to the fact that has been nonrigorously discussed …: when two functions $g_{1},g_{2}$ satisfy $\left|g_{1}\ast\psi_{j}\right|=\left|g_{2}\ast\psi_{j}\right|$ for all $j$ , then the wavelet transforms $\left\{g_{1}\ast\psi_{j}(t)\right\}_{j}$ and $\left\{g_{2}\ast\psi_{j}(t)\right\}_{j}$ are equal up to a phase whose variation is slow in $t$ and $j$ , except eventually at the points where $\left|g_{1}\ast\psi_{j}(t)\right|$ is small.

4.3.2. The Pauli Problem

In 1933 Pauli asked his seminal work Die allgemeinen Prinzipien der Wellenmechanik [86] whether a wave function is uniquely determined by the probability densities of position and momentum. In mathematical terms, this is equivalent to the following phase retrieval problem known as the Pauli problem.

Problem 5 (Pauli problem).

Do $|f|$ and $|\hat{f}|$ determine $f\in L^{2}(\mathbb{R})$ uniquely?

Reichenbach [90] published the first counterexamples of Bargmann in 1944: Any symmetric $f$ and its flipped complex conjugated function $\overline{Rf}$ have the same modulus and absolute Fourier measurement. We will call any pair of functions which cannot be distinguished under the measurements of Problem 5 Pauli partners. If a function does not have any Pauli partners beyond the trivial ambiguity of multiplication by a unimodular constant, it is said to be Pauli unique.

In 1978 Vogt [98] (see also Corbett and Hurst [36]) exploited the relation $C\mathcal{F}=\mathcal{F}CR$ to produce infinitely many Pauli partners. Recall that $Cf:=\overline{f}$ and $Rf(x):=f(-x)$ denote the conjugation and reflection operators, respectively. If a function satisfies the symmetry relation $\overline{f(-x)}=f(x)w(x)$ with $|w(x)|=1$ and $w$ is not constant on $\{x:f(x)\neq 0\}$ , then $f$ and $\overline{Rf}$ are again Pauli partners.

Note that both counterexamples, those of Bargmann and of Vogt, Corbett and Hurst, respectively, are trivial ambiguities of the classical Fourier phase retrieval problem (Problem 3). (But not of the Pauli problem, whose only trivial ambiguity is multiplication by a unimodular constant, since translations and conjugated reflections are picked up on in general.)

Since the Pauli problem is of particular interest in quantum mechanics, it is often studied from a quantum mechanical perspective, where the position and momentum operator play a central role. We will use them in the following normalization

[TABLE]

such that $\mathcal{F}P\mathcal{F}^{-1}=Q$ .

Corbett and Hurst [36] proved the following theorem characterizing Pauli uniqueness.

Theorem 4.19.

Let $Q,P$ denote the position and momentum operator as defined in (34). Then $f\in L^{2}(\mathbb{R})$ is Pauli unique if and only if there exists a $\lambda\in\mathbb{R}$ and real-valued Borel-measurable functions $F,G$ such that $e^{iF(Q)}e^{iG(P)}f=e^{i\lambda}f$ .

Note that constant functions $F,G$ amount to multiplication by a unimodular constant, i.e., the only trivial ambiguity of the Pauli problem.

Proof.

Suppose there exists a $\lambda\in\mathbb{R}$ and real-valued Borel-measurable functions $F,G$ such that $e^{iF(Q)}e^{iG(P)}f=e^{i\lambda}f$ . Let $g:=e^{iG(P)}f=e^{-iF(Q)}e^{i\lambda}f$ . By the functional calculus for the position operator, the operator $e^{-iF(Q)}$ amounts to multiplying with the function $e^{-iF(\,.\,)}$ . Hence

[TABLE]

since $F$ is real-valued.

Due to the unitary equivalence $\mathcal{F}P\mathcal{F}^{-1}=Q$ , the operator $e^{iG(P)}$ is the multiplication operator $e^{iG(Q)}$ on the Fourier domain, i.e., $\mathcal{F}e^{iG(P)}\mathcal{F}^{-1}\varphi(\xi)=e^{iG(\xi)}\varphi(\xi)$ . Consequently

[TABLE]

which proves the necessary direction.

Conversely, assume that $f$ has Pauli partner $g$ . Then there exist real-valued Borel-measurable functions $F,G$ such that

[TABLE]

Hence

[TABLE]

and therefore $e^{iF(Q)}f=g=e^{iG(P)}f$ . ∎

Corollary 4.20.

Let $Q,P$ denote the position and momentum operator as defined in (34). Suppose $A(Q,P)$ is a self-adjoint operator such that there exists a unitary operator $U$ with $Ue^{iA}U^{*}=e^{iF(Q)}e^{iG(P)}$ . If $A\varphi=\lambda\varphi$ , then $f:=U\varphi$ is Pauli nonunique with Pauli partner $g:=e^{iG(P)}f=e^{-iF(Q)}f$ .

Corbett and Hurst [35] used this result to show that there exists a dense set of $L^{2}(\mathbb{R})$ that is Pauli nonunique, but includes both trivial and nontrivial solutions of the classical Fourier phase retrieval problem. Furthermore, they constructed uncountably many Pauli nonunique functions which are not trivial solutions of the classical problem.

For this, they considered the Hamiltonian of the quantum harmonic oscillator

[TABLE]

where $m,K>0$ are positive constants corresponding to the mass of the particle and the force constant, respectively. One can show that the self-adjoint operator $H$ satisfies

[TABLE]

where

[TABLE]

The eigenfunctions of the Hamiltonian satisfying

[TABLE]

are the Hermite functions

[TABLE]

By Corollary 4.20, the functions

[TABLE]

are Pauli nonunique with Pauli partner

[TABLE]

Note that due to the symmetry of the Hermite functions, the pairs $(f_{k},g_{k})$ are again trivial solutions of the classical Fourier phase retrieval problem. Nevertheless, this construction yields an orthonormal basis of $L^{2}(\mathbb{R})$ of Pauli nonunique functions.

To construct nontrivial solutions in the classical sense, Corbett and Hurst exploited the periodicity of the eigenvalues of $f_{k}$ . Observe that

[TABLE]

Therefore, by defining $\mathcal{H}_{b,c}:=\operatorname{span}\{f_{nb+c}:n\in\mathbb{N}\}$ where $b\geq 3$ , $0\leq c\leq b-1$ and choosing $\alpha=2\pi/b$ , we obtain for every $f\in\mathcal{H}_{b,c}$

[TABLE]

Hence any $f=\sum_{n=0}^{N}a_{n}f_{nb+c}$ has the Pauli partner $g=\sum_{n=0}^{N}a_{n}\overline{f}_{nb+c}\neq\overline{f}$ as long as at least one $a_{n}\notin\mathbb{R}$ . In particular, this construction yields uncountably many Pauli nonunique functions with Pauli partners differing not just by a trivial ambiguity in the classical sense.

Furthermore for each $b\geq 3$ , this construction yields an orthogonal decomposition of $L^{2}(\mathbb{R})=\bigoplus_{c=0}^{b-1}\mathcal{H}_{b,c}$ such that every $f\in\mathcal{H}_{b,c}$ is Pauli nonunique. By a tensor product argument, this can be generalized to higher dimensions [36].

Remark 4.21.

See also Ismagilov [63], Janssen [71], and Jaming [67] for another construction of uncountably many Pauli partners which are not trivial solutions of the classical phase retrieval problem.

Conversely, there is also a big class of functions where the Pauli problem has a unique solution. Friedman [46] proved that any nonnegative function is Pauli unique.

A generalized version of the Pauli problem was considered by Jaming [68] for the fractional Fourier transform. It is defined for $f\in L^{1}(\mathbb{R})\cap L^{2}(\mathbb{R})$ by

[TABLE]

with respect to the angle $\alpha\notin\pi\mathbb{Z}$ and where $c_{\alpha}$ is a normalization constant such that $\mathcal{F}_{\alpha}$ is an isometry on $L^{2}(\mathbb{R})$ . Note that $\mathcal{F}_{\pi/2}f=\hat{f}$ and $\mathcal{F}_{0}f=f$ , $\mathcal{F}_{\pi}f=Rf$ by a limit procedure [9].

In terms of the fractional Fourier transform, the original Pauli problem asks if a function $f\in L^{2}(\mathbb{R})$ is uniquely determined by the measurements $\{|\mathcal{F}_{0}f|,|\mathcal{F}_{\pi/2}f|\}$ . The natural generalization is the following phase retrieval problem.

Problem 6 (extended Pauli problem).

Suppose $\tau\subseteq[-\pi/2,\pi/2]$ is a given set of angles (not necessarily finite). Does the set of fractional Fourier measurements $\{|\mathcal{F}_{\alpha}f|:\alpha\in\tau\}$ uniquely determine $f\in L^{2}(\mathbb{R})$ ?

Let us first discuss the case where $\tau$ consists of only one angle. In this case, the proof of Theorem 4.6 can be generalized to the fractional Fourier transform [68]. Hence compactly supported functions in $L^{2}(\mathbb{R})$ are not uniquely determined by any single fractional Fourier measurement by a “zero-flipping” argument.

On the other hand, taking “sufficiently dense” fractional Fourier measurements guarantees uniqueness in the extended Pauli problem.

Theorem 4.22 ([68, Theorem 5.1]).

Let $f,g\in L^{2}(\mathbb{R})$ , $\tau\subseteq[-\pi/2,\pi/2]$ , and $|\mathcal{F}_{\alpha}f|=|\mathcal{F}_{\alpha}g|$ for all $\alpha\in\tau$ . Then the following hold:

(i)

If $\tau=[-\pi/2,\pi/2]$ , then there exists a constant $c\in\mathbb{C}$ with $|c|=1$ such that $f=cg$ . 2. (ii)

If $f,g$ have compact support, and $\tau$ is of positive measure or has an accumulation point $\alpha_{0}\neq 0$ , then there exists a constant $c\in\mathbb{C}$ with $|c|=1$ such that $f=cg$ . 3. (iii)

If the support of $f,g$ is included in $[-a,a]$ and $\tau:=\{\pi/2\}\cup\{\arctan a^{2}/k:k\in\mathbb{Z}\setminus\{0\}\}$ , then there exists a constant $c\in\mathbb{C}$ with $|c|=1$ such that $f=cg$ .

The proof of Theorem 4.22 relies on a relation between the fractional Fourier transform and the ambiguity function

[TABLE]

More precisely, one can show that (see [9])

[TABLE]

Therefore, knowledge of $|\mathcal{F}_{\alpha}f|$ for a particular angle $\alpha\in[-\pi/2,\pi/2]$ translates to knowing the values of the ambiguity function $Af$ on a line in the time-frequency plane. Since $Af(x,\xi)=\mathcal{F}(T_{-x/2}f\cdot\overline{T_{x/2}f})(\xi)$ , one can easily recover $f$ , up to a global phase factor, from $Af$ by taking the inverse Fourier transform (see, for example, [12, 102] or the textbook [52]).

This leads to (i) immediately. Statement (ii) requires a brief excursion into complex analysis: Due to the compact support of $f$ , its ambiguity function $Af(x,\,.\,)=\mathcal{F}(T_{-x/2}f\cdot\overline{T_{x/2}f})$ is an entire function for every fixed $x\in\mathbb{R}$ . Hence it is already uniquely determined on a set with accumulation point. For (iii) one employs the Shannon–Whittaker formula for band-limited functions, where the angles $\alpha_{k}$ are chosen precisely to correspond to the samples.

The sufficient conditions of Theorem 4.22 require at least countably many fractional Fourier measurements for uniqueness. A natural question is to ask, whether only finitely many would suffice. Jaming [68] showed that functions of a specific structure, like pulse-train signals or linear combinations of Gaussians or Hermite functions, require only one or two fractional Fourier measurements to be uniquely determined within their specific type (but not necessarily with respect to all $L^{2}$ functions).

On the other hand, Andreys and Jaming [10] showed that any finite set of angles $\tau=\{\alpha_{1},\dots,\alpha_{N}\}$ with $\cot(\alpha_{k})\in\mathbb{Q}$ for all $k=1,\dots,N$ is not sufficient for uniqueness in the generalized Pauli problem. Their result extends the methods of Janssen [71] for the classical setting.

4.3.3. Ambiguity Phase retrieval

We continue with a phase retrieval problem for the ambiguity function, which appears in radar theory [12, 102]. Recall that the ambiguity function is defined for $f\in L^{2}(\mathbb{R})$ by

[TABLE]

The (narrow band) radar ambiguity problem is now formulated as follows.

Problem 7 (Radar Ambiguity Problem).

Does the modulus the ambiguity function $|Af|$ determine $f\in L^{2}(\mathbb{R})$ uniquely?

Again, we will say that two functions $f,g\in L^{2}(\mathbb{R})$ are ambiguity partners if $|Af|=|Ag|$ .

Recall the translation, modulation, and reflection operators $T_{\tau}f(x)=f(x-\tau)$ , $M_{\omega}f(x)=e^{2\pi\omega\cdot x}f(x)$ , and $Rf(x)=f(-x)$ , respectively. Then it is easy to see from the definition the following trivial ambiguities of Problem 7.

Proposition 4.23.

Let $f\in L^{2}(\mathbb{R}^{d})$ . Then each of the following choices of $g$ yields $|Af|=|Ag|$ :

(i)

$g=cf$ * for $|c|=1$ ;* 2. (ii)

$g=T_{\tau}f$ * for $\tau\in\mathbb{R}^{d}$ ;* 3. (iii)

$g=M_{\omega}f$ * for $\omega\in\mathbb{R}^{d}$ ;* 4. (iv)

$g=Rf$ .

A first example of nontrivial ambiguity partners came from de Buda [41]. A systematic approach to studying the ambiguities of Problem 7 can be found in [67].

For compactly supported functions $f\mkern-4.0mu\in\mkern-4.0muL^{2}(\mathbb{R})$ , the ambiguity function $Af(x,\,.\,)=\mathcal{F}(T_{-x/2}f\cdot\overline{T_{x/2}f})$ is an entire function in the second variable by the Paley–Wiener theorem. Therefore $|Af(x,\xi)|=|Ag(x,\xi)|$ for $x,\xi\in\mathbb{R}$ is equivalent to

[TABLE]

Hence the “zero-flipping” that creates a lot of the ambiguities in the classical Fourier phase retrieval problem may also appear for the ambiguity function. Unfortunately, zero-flipping is not well understood for the ambiguity function. Indeed, flipping some zeros of $Af$ may not even yield an ambiguity function.

Jaming [67] characterized the ambiguities of Problem 7 excluding zero-flipping. He called two functions $f,g\in L^{2}(\mathbb{R})$ with compact support restricted ambiguity partners if $Af(x,\,.\,)$ and $Ag(x,\,.\,)$ have the same zeros in the complex plane and proved the following result.

Theorem 4.24 ([67, Theorem 4]).

Suppose $f\in L^{2}(\mathbb{R})$ is a compactly supported function and let $\Omega$ be the open set of all $x$ such that $Af(x,\,.\,)$ is not identically [math].

Then $g\in L^{2}(\mathbb{R})$ is a restricted ambiguity partner of $f$ if and only if there exists a locally constant function $\varphi$ on $\Omega$ such that, for every $t_{0},t_{1},t_{2}\in\operatorname{supp}f$ ,

[TABLE]

and

[TABLE]

for some $a,\xi\in\mathbb{R}$ and $|c|=1$ .

Bonami, Garrigós, and Jaming [23] proved a uniqueness results for Hermite functions, i.e., functions of the form $f(x):=P(x)e^{-x^{2}/2}$ , where $P$ is a polynomial. Their proofs were inspired by some preliminary results obtained in the 1970s by Bueckner [25] and de Buda [41].

Theorem 4.25 ([23, Theorem A]).

For almost all polynomials $P$ , the Hermite function $f(x):=P(x)e^{-x^{2}/2}$ has only trivial ambiguity partners.

Here “almost all” is understood in the sense of the Lebesgue measure after identifying the space of $n$ -dimensional polynomials with $\mathbb{C}^{n+1}$ .

The authors of [23] note that the “almost all” part in Theorem 4.25 may be an artifact of the proof and strongly believe that all Hermite functions have only trivial ambiguities. Furthermore, Jaming [67] conjectured that similar results hold for functions of the form $f(x)=P(x)e^{-x^{2}/2}$ with $P$ an entire function of order $\alpha<1$ , but the techniques of [23] do not apply in this case.

Conjecture 1.

(i)

If $f$ is a Hermite function, i.e., $f(x)=P(x)e^{-x^{2}/2}$ with a polynomial $P$ , then $f$ only has trivial ambiguity partners. (Bueckner [25]) 2. (ii)

If $f(x)=P(x)e^{-x^{2}/2}$ with $P$ an entire function of order $\alpha<1$ , then f has only trivial ambiguity partners. (Jaming [67])

We end this section by stating that we only considered the “narrow-band” ambiguity problem, where certain physical restrictions are assumed of the signal. If those assumptions are lifted, the physical measurements yield the wide-band ambiguity function, which is related to the wavelet transform. The “wide-band” ambiguity phase retrieval problem is even less understood. We refer the interested reader to [67] for some phase retrieval results in this direction.

4.3.4. Continuous Short-Time Fourier Transform Phase Retrieval Problem

We finally turn to the continuous short-time Fourier transform phase retrieval. Recall that the short-time Fourier transform (STFT) of $f\in L^{2}(\mathbb{R}^{d})$ with respect to the window $g\in L^{2}(\mathbb{R}^{d})$ is defined by

[TABLE]

If we fix the window $g$ , the short-time Fourier transform $V_{g}$ is a linear operator from $L^{2}(\mathbb{R}^{d})$ to $L^{2}(\mathbb{R}^{2d})$ . Consequently, a multiplication of $f$ with a unimodular constant produces the same phaseless short-time Fourier transform measurements and is therefore considered a trivial ambiguity. The problem of phase retrieval now reads as follows.

Problem 8 (short-time Fourier phase retrieval).

Suppose $f\in L^{2}(\mathbb{R}^{d})$ . Recover $f$ from $|V_{g}f|$ up to a global phase factor when $g\in L^{2}(\mathbb{R}^{d})$ is known.

Whether Problem 8 is well-posed depends on the choice of the window $g$ . Again a sufficient condition for uniqueness is given in terms of the zero set of its short-time Fourier transform $V_{g}g$ . The proof of this result is analogous to the discrete case with the following fundamental formula at its core.

Proposition 4.26.

Let $f,h,g,u\in L^{2}(\mathbb{R}^{d})$ . Then

[TABLE]

Proposition 4.26 is obtained as in the discrete setting by combining the covariance property with the orthogonality relations. The relevant properties of the short-time Fourier transform and their detailed proof can be found in [52, 53].

We can now prove the following theorem.

Theorem 4.27.

*Let $g\in L^{2}(\mathbb{R}^{d})$ with $V_{g}g(x,\xi)\neq 0$ for almost all $x,\xi\in\mathbb{R}^{d}$ . Then for any $f,h\in L^{2}(\mathbb{R}^{d})$ with $|V_{g}f|=|V_{g}h|$ there exists $\alpha\in\mathbb{R}$ such that $h=e^{i\alpha}f$ . *

More general versions of this statement can be found in [55] and [77].

Proof.

By Proposition 4.26 we obtain that

[TABLE]

and recover $V_{f}f$ almost everywhere. It is easy to see that $V_{f}f$ uniquely determines $f$ up to a phase factor by taking the inverse Fourier transform. ∎

Remark 4.28.

The fact that $V_{f}f$ uniquely determines $f$ up to a phase factor is now a standard result in time-frequency analysis. See, for example, [12, 102] or the textbook [52].

Let us mention some examples for window functions that allow phase retrieval because their short-time Fourier transform does not vanish. The obvious candidate is the Gaussian $\varphi(x)=e^{-\pi|x|^{2}}$ , whose short-time Fourier transform $V_{\varphi}\varphi$ is again a (generalized) Gaussian. A lesser known example is the one-sided exponential $g(x)=e^{-\alpha x}\chi_{[0,\infty)}$ for parameter $\alpha>0$ . Already Janssen [72] computed its short-time Fourier transform $V_{g}g=e^{-|x|(\alpha+\pi i\xi)}/(2\alpha+2\pi i\xi)$ , which clearly does not vanish. More examples can be found in the recent paper by Gröchenig, Jaming, and Malinnikova [54].

The choice of the one-dimensional Gaussian $\varphi(x)=e^{-\pi|x|^{2}}$ is special in one crucial point: it is the only window for which $V_{\varphi}f$ yields a holomorphic function after a slight modification [11]. Hence the full toolbox of complex analysis becomes available when working with a Gaussian window. This modified transform is best known as the Bargmann transform.

In the remainder, we present a result of two of the authors [55] which gives a characterization of instabilities of the short-time Fourier phase retrieval problem with Gaussian window. The work in [55] builds upon results by one of the authors and his collaborators [4], where for phaseless measurements arising from holomorphic functions it is shown that the phase can be stably recovered on so-called atolls.

By an instability we mean, roughly speaking, a signal $f$ for which there exists a signal $g$ which is very different from $f$ , but at the same time produces very similar phaseless measurements. This intuition is formalized by the local Lipschitz constant of the solution operator $|V_{\varphi}f|\mapsto f\sim e^{i\alpha}f$ .

Definition 4.29.

Let $\mathcal{A}$ be a mapping from $\mathcal{X}$ to $\mathcal{Y}$ , where $(\mathcal{X},d_{\mathcal{X}})$ and $(\mathcal{Y},d_{\mathcal{Y}})$ are metric spaces. Then the local stability constant $C_{\mathcal{A}}(f)$ of $\mathcal{A}$ at $f\in\mathcal{X}$ is defined as the smallest positive number $C$ such that

[TABLE]

Instabilities are routinely constructed by fixing a well-localized function $f_{0}$ ; then for large $\tau$ the functions

[TABLE]

yield approximately the same phaseless short-time Fourier measurements. Even more so the stability constant degenerates exponentially in $\tau$ , i.e., $C_{|V_{\varphi}|}(f_{+}^{\tau})\gtrsim e^{c\tau^{2}}$ for suitable metrics [6].

As we shall see, the stability constant for short-time Fourier phase retrieval with Gaussian window can be controlled in terms of a concept which was introduced by Cheeger in the field of Riemannian geometry [33].

Definition 4.30.

Let $\Omega\subseteq\mathbb{R}^{d}$ be open. For a continuous, nonnegative, integrable function $w$ on $\Omega$ the Cheeger constant is defined as

[TABLE]

A small Cheeger constant indicates that the domain can be partitioned into two subdomains such that the weight is rather small on the separating boundary of the two subdomains and that, at the same time both subdomains carry approximately the same amount of $L^{1}$ -energy.

In that sense the Cheeger constant captures the disconnectedness of the weight; cf. Figure 3.

Before we state the stability result, both the signal space and the measurement space have to be endowed with suitable metrics. To this end we define Feichtinger’s algebra and a family of weighted Sobolev norms.

Definition 4.31.

Feichtinger’s algebra is defined as

[TABLE]

with induced norm $\|f\|_{\mathcal{M}^{1}}:=\|V_{\varphi}f\|_{L^{1}(\mathbb{R}^{2})}.$

Definition 4.32.

For $1\leq p,q<\infty$ , $s>0$ and $F:\mathbb{R}^{2}\rightarrow\mathbb{C}$ sufficiently smooth we define

[TABLE]

The main stability result in [55] now reads as follows.

Theorem 4.33.

Let $q>2$ . Let $\mathcal{X}:=\mathcal{M}^{1}/\sim$ be endowed with the metric111 $f\sim g$ if and only if $g=e^{i\alpha}f$ for some $\alpha\in\mathbb{R}$ .

[TABLE]

and let $\mathcal{Y}:=|V_{\varphi}|(\mathcal{M}^{1})$ be endowed with the metric induced by the norm $\|\cdot\|_{\mathcal{D}_{1,q}^{4}}$ . Suppose that $f\in\mathcal{M}^{1}$ is such that $|V_{\varphi}f|$ has a global maximum at the origin. Then there exists a constant $c$ that only depends on $q$ and the quotient $\|f\|_{\mathcal{M}^{1}}/\|V_{\varphi}f\|_{L^{\infty}(\mathbb{R}^{2})}$ such that

[TABLE]

Disregarding the weak dependence of $c$ on $f$ the estimate (38) can be informally summarized as follows:

The only instabilities for short-time Fourier phase retrieval with Gaussian window are of disconnected type.

Before we give a sketch of the proof we set the stability result in relation to the general results in the abstract setting in section 2.2, where the concept of the $\sigma$ -strong complement property was introduced. In the context of short-time Fourier phase retrieval Remark 2.15 can be qualitatively understood in the following way. A function $x$ is rather unstable if it can be written as $x=f+h$ with $\|f\|_{L^{2}(\mathbb{R})},\|h\|_{L^{2}(\mathbb{R})}\asymp 1$ such that their respective short-time Fourier measurements are essentially supported on two disjoint domains. In other words the time-frequency plane can be split up into $S\subseteq\mathbb{R}^{2}$ and $\mathbb{R}^{2}\setminus S$ such that both $\|V_{\varphi}f\|_{L^{2}(S)}$ and $\|V_{\varphi}h\|_{L^{2}(\mathbb{R}^{2}\setminus S)}$ are small. If the metrics on the signal and measurement space are both induced by the respective $L^{2}$ -norm it holds that

[TABLE]

Theorem 4.33 nicely complements this result in the sense that the disconnectedness as quantified by the Cheeger constant—which to some extent resembles the lower bound in (39)—also gives an upper bound on the local stability constant.

Architecture of the proof.

Let us start with the observation that for any $f,g\in\mathcal{M}^{1}$ it holds that

[TABLE]

where $w=|V_{\varphi}f|$ .

Now suppose that we could just disregard the constraint $|c|=1$ in (40) (this can be justified with considerable effort). The Poincaré inequality tells us that there exists a constant $C_{poinc}(w)$ such that (40) can be bounded by

[TABLE]

Now spectral geometry enters the picture. Cheeger’s inequality [33] says that the Poincaré constant on a Riemannian manifold can be controlled by the reciprocal of the Cheeger constant. We would like to apply this result to the metric induced by the metric tensor $\left(w(x,y)\begin{bmatrix}1&0\\ 0&1\end{bmatrix}\right)_{(x,y)\in\mathbb{R}^{2}}$ in order to get a bound on $C_{poinc}(w)$ . However, since $w$ in our case arises from short-time Fourier measurements it generally has zeros and therefore does not qualify as a Riemannian manifold. Nevertheless a version of Cheeger’s inequality can be established, i.e.,

[TABLE]

where $h(w,\mathbb{R}^{2})$ is defined as in (37).

Next we will make use of the fact that for any $h\in L^{2}(\mathbb{R})$

[TABLE]

is an entire function (up to reflection). Thus $V_{\varphi}g/V_{\varphi}f$ is meromorphic (again up to reflection) and by applying the Cauchy–Riemann equations one elementarily computes that

[TABLE]

almost everywhere. Combining (40), (41), (42) and (44) yields that

[TABLE]

This means that we already succeeded in bounding the distance between the signals in terms of their phaseless short-time Fourier measurements. The aim, however, is to get a bound in terms of the difference of the short-time Fourier transform magnitudes. In order to obtain this, we estimate

[TABLE]

The final ingredient of the proof lies in the treatment of the logarithmic derivative $\frac{\nabla|V_{\varphi}f|}{|V_{\varphi}f|}$ . The norm of the logarithmic derivative on balls centered at the origin can essentially be controlled by the product of the volume of the ball and the number of its singularities in a ball of twice the radius, which are precisely the zeros of $V_{\varphi}f$ . Jensen’s formula relates the number of zeros of the function in (43), and therefore of $V_{\varphi}f$ , to its growth. Since the growth of the entire functions in (43) can be uniformly bounded for functions $f\in\mathcal{M}^{1}$ this argument allows us to absorb the logarithmic derivative in a lower order polynomial, which is independent of $f$ . ∎

Acknowledgments

The authors thank Martin Ehler for reading and commenting on parts of the manuscript. Furthermore, the authors highly appreciated the constructive feedback of both referees, which considerably improved this work. Finally, the last two authors graciously acknowledge the support of the Austrian Science Fund (FWF): S.K. was employed in the project P30148-N32, and M.R. was supported by the START-Project Y963-N35.

Bibliography102

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] L. Ahlfors. Complex Analysis . 1966.
2[2] E. J. Akutowicz. On the determination of the phase of a Fourier integral. I. Trans. Amer. Math. Soc. , 83:179–192, 1956.
3[3] E. J. Akutowicz. On the determination of the phase of a Fourier integral. II. Proc. Amer. Math. Soc. , 8:234–238, 1957.
4[4] R. Alaifari, I. Daubechies, P. Grohs, and R. Yin. Stable Phase Retrieval in Infinite Dimensions. Ar Xiv e-prints , Aug. 2016.
5[5] R. Alaifari and P. Grohs. Phase retrieval in the general setting of continuous frames for Banach spaces. SIAM J. Math. Anal. , 49(3):1895–1911, 2017.
6[6] R. Alaifari and P. Grohs. Gabor phase retrieval is severely ill-posed. Ar Xiv e-prints , May 2018.
7[7] B. Alexeev, A. S. Bandeira, M. Fickus, and D. G. Mixon. Phase Retrieval with Polarization. SIAM Journal on Imaging Sciences , 7(1):35–66, 2014.
8[8] W. O. Alltop. Complex sequences with low periodic correlations. IEEE Trans. Inform. Theory , 26(3):350–354, 1980.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Phase Retrieval: Uniqueness and Stability

Abstract.

1. Introduction

2. Abstract Phase Retrieval

2.1. Injectivity

Definition 2.1**.**

Theorem 2.2**.**

Proof.

Corollary 2.3**.**

Proof.

Theorem 2.4**.**

Theorem 2.5**.**

2.2. Stability

Definition 2.6**.**

Definition 2.7**.**

Definition 2.8**.**

Proposition 2.9**.**

Theorem 2.10**.**

Proof idea.

Theorem 2.11**.**

Proof.

Definition 2.12**.**

Theorem 2.13**.**

Remark 2.14**.**

Proof.

Remark 2.15**.**

Proposition 2.16**.**

Proof.

Theorem 2.17**.**

Proof.

Corollary 2.18**.**

Proof.

Remark 2.19**.**

3. Finite Dimensional Phase Retrieval

3.1. The classical Fourier Phase Retrieval Problem

Definition 3.1**.**

Problem 1** (Fourier phase retrieval, discrete).**

Remark 3.2**.**

Proposition 3.3**.**

Proof.

Definition 3.4**.**

Definition 3.5**.**

Example 3.6**.**

Theorem 3.7**.**

Proof.

Corollary 3.8**.**

Theorem 3.9** ([59]).**

Corollary 3.10**.**

Theorem 3.11** ([66]).**

3.2. Fourier phase retrieval using masks

3.2.1. Discrete Short-Time Fourier Phase Retrieval

Lemma 3.12**.**

Problem 2** (discrete short-time Fourier phase retrieval).**

Proposition 3.13**.**

Lemma 3.14** (Covariance Property).**

Proof.

Proposition 3.15** (Orthogonality Relations).**

Proof.

Proof of Proposition 3.13.

Theorem 3.16**.**

Proof.

Theorem 3.17** ([42]).**

Theorem 3.18**.**

Proof.

3.2.2. Phase retrieval with equiangular frames

Definition 3.19**.**

Example 3.20**.**

Theorem 3.21** ([13, Proposition 2.3]).**

Definition 3.22**.**

Proposition 3.23** ([13, Proposition 2.6]).**

Theorem 3.24** ([13, Theorem 3.4], special case).**

Remark 3.25**.**

Lemma 3.26** (Alltop).**

Theorem 3.27**.**

Definition 2.1.

Theorem 2.2.

Corollary 2.3.

Theorem 2.4.

Theorem 2.5.

Definition 2.6.

Definition 2.7.

Definition 2.8.

Proposition 2.9.

Theorem 2.10.

Theorem 2.11.

Definition 2.12.

Theorem 2.13.

Remark 2.14.

Remark 2.15.

Proposition 2.16.

Theorem 2.17.

Corollary 2.18.

Remark 2.19.

Definition 3.1.

Problem 1 (Fourier phase retrieval, discrete).

Remark 3.2.

Proposition 3.3.

Definition 3.4.

Definition 3.5.

Example 3.6.

Theorem 3.7.

Corollary 3.8.

Theorem 3.9 ([59]).

Corollary 3.10.

Theorem 3.11 ([66]).

Lemma 3.12.

Problem 2 (discrete short-time Fourier phase retrieval).

Proposition 3.13.

Lemma 3.14 (Covariance Property).

Proposition 3.15 (Orthogonality Relations).

Theorem 3.16.

Theorem 3.17 ([42]).

Theorem 3.18.

Definition 3.19.

Example 3.20.

Theorem 3.21 ([13, Proposition 2.3]).

Definition 3.22.

Proposition 3.23 ([13, Proposition 2.6]).

Theorem 3.24 ([13, Theorem 3.4], special case).

Remark 3.25.

Lemma 3.26 (Alltop).

Theorem 3.27.

Theorem 3.28 ([32]).

Theorem 3.29 ([30]).

Definition 4.1.

Problem 3 (Fourier phase retrieval, continuous).

Proposition 4.2.

Theorem 4.3 (Paley–Wiener).

Remark 4.4.

Definition 4.5.

Theorem 4.6.

Corollary 4.7.

Theorem 4.8 (Akutowicz–Walther–Hofstetter).

Theorem 4.9.

Theorem 4.10 ([74]).

Theorem 4.11 ([51, 38]).

Theorem 4.12 ([74]).

Theorem 4.13.

Remark 4.14.

Problem 4 (phase retrieval from holomorphic measurements).

Theorem 4.15 ([68, Theorem 3.3]).

Theorem 4.16 ([79, Theorem 2.1]).

Theorem 4.17 ([79, Corollary 2.2]).

Remark 4.18.

Problem 5 (Pauli problem).

Theorem 4.19.

Corollary 4.20.

Remark 4.21.

Problem 6 (extended Pauli problem).

Theorem 4.22 ([68, Theorem 5.1]).

Problem 7 (Radar Ambiguity Problem).

Proposition 4.23.

Theorem 4.24 ([67, Theorem 4]).

Theorem 4.25 ([23, Theorem A]).