Stability Analysis for a Class of Sparse Optimization Problems

Jialiang Xu; Yun-Bin Zhao

arXiv:1904.09637·math.OC·April 23, 2019

Stability Analysis for a Class of Sparse Optimization Problems

Jialiang Xu, Yun-Bin Zhao

PDF

Open Access

TL;DR

This paper establishes a stability result for $ ext{l}_1$-minimization in sparse optimization, generalizing previous results by introducing a new property of sensing matrices, which enhances understanding of signal recovery stability.

Contribution

The paper introduces the restricted weak range space property (RSP) of sensing matrices, generalizing previous concepts, and establishes a stability result for $ ext{l}_1$-minimization in a broad class of $ ext{l}_0$-minimization problems.

Findings

01

Introduces the restricted weak RSP of sensing matrices.

02

Establishes a generalized stability theorem for $ ext{l}_1$-minimization.

03

Includes several existing stability results as special cases.

Abstract

The sparse optimization problems arise in many areas of science and engineering, such as compressed sensing, image processing, statistical and machine learning. The $ℓ_{0}$ -minimization problem is one of such optimization problems, which is typically used to deal with signal recovery. The $ℓ_{1}$ -minimization method is one of the plausible approaches for solving the $ℓ_{0}$ -minimization problems, and thus the stability of such a numerical method is vital for signal recovery. In this paper, we establish a stability result for the $ℓ_{1}$ -minimization problems associated with a general class of $ℓ_{0}$ -minimization problems. To this goal, we introduce the concept of restricted weak range space property (RSP) of a transposed sensing matrix, which is a generalized version of the weak RSP of the transposed sensing matrix introduced in [Zhao et al., Math. Oper. Res., 44(2019),…

Equations176

\begin{array}[]{lcl}&\min\limits_{x\in R^{n}}&~{}\left\|x\right\|_{0}\\ &$s.t$.&~{}a_{1}\left\|y-Ax\right\|_{2}+a_{2}\left\|U^{T}(Ax-y)\right\|_{\infty}+a_{3}\left\|U^{T}(Ax-y)\right\|_{1}\leq\varepsilon\\ &&~{}Bx\leq b,\end{array}

\begin{array}[]{lcl}&\min\limits_{x\in R^{n}}&~{}\left\|x\right\|_{0}\\ &$s.t$.&~{}a_{1}\left\|y-Ax\right\|_{2}+a_{2}\left\|U^{T}(Ax-y)\right\|_{\infty}+a_{3}\left\|U^{T}(Ax-y)\right\|_{1}\leq\varepsilon\\ &&~{}Bx\leq b,\end{array}

ϕ (x) = U^{T} (A x - y),

ϕ (x) = U^{T} (A x - y),

x \in R^{n} min {∥ x ∥_{0} : a_{1} ∥ y - A x ∥_{2} + a_{2} ∥ ϕ (x) ∥_{\infty} + a_{3} ∥ ϕ (x) ∥_{1} \leq ε, B x \leq b} .

x \in R^{n} min {∥ x ∥_{0} : a_{1} ∥ y - A x ∥_{2} + a_{2} ∥ ϕ (x) ∥_{\infty} + a_{3} ∥ ϕ (x) ∥_{1} \leq ε, B x \leq b} .

\begin{array}[]{ll}$(C1)$~{}\min\limits_{x}\{\|x\|_{0}:~{}y=Ax\};&$(C2)$~{}\min\limits_{x}\{\|x\|_{0}:~{}\left\|y-Ax\right\|_{2}\leq\varepsilon\};\\ $(C3)$~{}\min\limits_{x}\{\|x\|_{0}:~{}\left\|U^{T}(Ax-y)\right\|_{1}\leq\varepsilon\};&$(C4)$~{}\min\limits_{x}\{\|x\|_{0}:~{}\left\|U^{T}(Ax-y)\right\|_{\infty}\leq\varepsilon\}.\end{array}

\begin{array}[]{ll}$(C1)$~{}\min\limits_{x}\{\|x\|_{0}:~{}y=Ax\};&$(C2)$~{}\min\limits_{x}\{\|x\|_{0}:~{}\left\|y-Ax\right\|_{2}\leq\varepsilon\};\\ $(C3)$~{}\min\limits_{x}\{\|x\|_{0}:~{}\left\|U^{T}(Ax-y)\right\|_{1}\leq\varepsilon\};&$(C4)$~{}\min\limits_{x}\{\|x\|_{0}:~{}\left\|U^{T}(Ax-y)\right\|_{\infty}\leq\varepsilon\}.\end{array}

x min {∥ x ∥_{1} : a_{1} ∥ y - A x ∥_{2} + a_{2} ∥ ϕ (x) ∥_{\infty} + a_{3} ∥ ϕ (x) ∥_{1} \leq ε, B x \leq b} .

x min {∥ x ∥_{1} : a_{1} ∥ y - A x ∥_{2} + a_{2} ∥ ϕ (x) ∥_{\infty} + a_{3} ∥ ϕ (x) ∥_{1} \leq ε, B x \leq b} .

\begin{array}[]{ll}$(D1)$~{}\min\limits_{x}\{\|x\|_{1}:~{}y=Ax\};&$(D2)$~{}\min\limits_{x}\{\|x\|_{1}:~{}\left\|y-Ax\right\|_{2}\leq\varepsilon\};\\ $(D3)$~{}\min\limits_{x}\{\|x\|_{1}:~{}\left\|U^{T}(Ax-y)\right\|_{1}\leq\varepsilon\};&$(D4)$~{}\min\limits_{x}\{\|x\|_{1}:~{}\left\|U^{T}(Ax-y)\right\|_{\infty}\leq\varepsilon\}.\end{array}

\begin{array}[]{ll}$(D1)$~{}\min\limits_{x}\{\|x\|_{1}:~{}y=Ax\};&$(D2)$~{}\min\limits_{x}\{\|x\|_{1}:~{}\left\|y-Ax\right\|_{2}\leq\varepsilon\};\\ $(D3)$~{}\min\limits_{x}\{\|x\|_{1}:~{}\left\|U^{T}(Ax-y)\right\|_{1}\leq\varepsilon\};&$(D4)$~{}\min\limits_{x}\{\|x\|_{1}:~{}\left\|U^{T}(Ax-y)\right\|_{\infty}\leq\varepsilon\}.\end{array}

x - x^{#}_{2} \leq C_{1} σ_{k} (x)_{1} + C_{2} ε

x - x^{#}_{2} \leq C_{1} σ_{k} (x)_{1} + C_{2} ε

σ_{k} (x)_{1} = z min {∥ x - z ∥_{1} : ∥ z ∥_{0} \leq k} .

σ_{k} (x)_{1} = z min {∥ x - z ∥_{1} : ∥ z ∥_{0} \leq k} .

\left\{\begin{array}[]{lll}\eta_{i}=1&\mathrm{if}~{}i\in J_{1},\\ \eta_{i}=-1&\mathrm{if}~{}i\in J_{2},\\ |\eta_{i}|\leq 1&\mathrm{if}~{}i\notin J_{1}\cup J_{2}.\end{array}\right.

\left\{\begin{array}[]{lll}\eta_{i}=1&\mathrm{if}~{}i\in J_{1},\\ \eta_{i}=-1&\mathrm{if}~{}i\in J_{2},\\ |\eta_{i}|\leq 1&\mathrm{if}~{}i\notin J_{1}\cup J_{2}.\end{array}\right.

\left\{\begin{array}[]{ll}\eta_{i}=1&\mathrm{if}~{}i\in J_{1},\\ \eta_{i}=-1&\mathrm{if}~{}i\in J_{2},\\ |\eta_{i}|\leq 1&\mathrm{if}~{}i\notin J_{1}\cup J_{2}.\end{array}\right.

\left\{\begin{array}[]{ll}\eta_{i}=1&\mathrm{if}~{}i\in J_{1},\\ \eta_{i}=-1&\mathrm{if}~{}i\in J_{2},\\ |\eta_{i}|\leq 1&\mathrm{if}~{}i\notin J_{1}\cup J_{2}.\end{array}\right.

\begin{array}[]{lcl}&\min\limits_{(x,r,s,\xi,v)}&\left\|x\right\|_{1}\\ &$s.t$.&a_{1}s+a_{2}\xi+a_{3}\left(e^{h}\right)^{T}v\leq\varepsilon,\\ &&r\in s\mathcal{B},~{}r=y-Ax,~{}(s,\xi,v)\geq 0,\\ &&\left\|\phi(x)\right\|_{\infty}\leq\xi,~{}\left|\phi(x)\right|\leq v,~{}Bx\leq b,\end{array}

\begin{array}[]{lcl}&\min\limits_{(x,r,s,\xi,v)}&\left\|x\right\|_{1}\\ &$s.t$.&a_{1}s+a_{2}\xi+a_{3}\left(e^{h}\right)^{T}v\leq\varepsilon,\\ &&r\in s\mathcal{B},~{}r=y-Ax,~{}(s,\xi,v)\geq 0,\\ &&\left\|\phi(x)\right\|_{\infty}\leq\xi,~{}\left|\phi(x)\right|\leq v,~{}Bx\leq b,\end{array}

B = ∥ a ∥_{2} = 1 ⋂ {z \in R^{m} : a^{T} z \leq 1} .

B = ∥ a ∥_{2} = 1 ⋂ {z \in R^{m} : a^{T} z \leq 1} .

E = {(x, s, ξ, v) : a_{1} s + a_{2} ξ + a_{3} (e^{h})^{T} v \leq ε, B x \leq b, ∥ ϕ (x) ∥_{\infty} \leq ξ, ∣ ϕ (x) ∣ \leq v, (s, ξ, v) \geq 0},

E = {(x, s, ξ, v) : a_{1} s + a_{2} ξ + a_{3} (e^{h})^{T} v \leq ε, B x \leq b, ∥ ϕ (x) ∥_{\infty} \leq ξ, ∣ ϕ (x) ∣ \leq v, (s, ξ, v) \geq 0},

Ω^{*} = {(x, r, s, ξ, v) : ∥ x ∥_{1} \leq θ^{*}, r \in s B, r = y - A x, (x, s, ξ, v) \in E},

Ω^{*} = {(x, r, s, ξ, v) : ∥ x ∥_{1} \leq θ^{*}, r \in s B, r = y - A x, (x, s, ξ, v) \in E},

Ω_{P} = {(x, r, s, ξ, v) : ∥ x ∥_{1} \leq θ^{*}, r \in s P, r = y - A x, (x, s, ξ, v) \in E} .

Ω_{P} = {(x, r, s, ξ, v) : ∥ x ∥_{1} \leq θ^{*}, r \in s P, r = y - A x, (x, s, ξ, v) \in E} .

\delta^{\mathcal{H}}(M_{1},M_{2})=\max\biggr{\{}\sup_{x\in M_{1}}\inf_{z\in M_{2}}\left\|x-z\right\|_{2},\sup_{z\in M_{2}}\inf_{x\in M_{1}}\left\|x-z\right\|_{2}\biggr{\}}.

\delta^{\mathcal{H}}(M_{1},M_{2})=\max\biggr{\{}\sup_{x\in M_{1}}\inf_{z\in M_{2}}\left\|x-z\right\|_{2},\sup_{z\in M_{2}}\inf_{x\in M_{1}}\left\|x-z\right\|_{2}\biggr{\}}.

δ^{H} (Ω^{*}, Ω_{P}) \leq ε^{'} .

δ^{H} (Ω^{*}, Ω_{P}) \leq ε^{'} .

P = {z \in R^{m} : (a^{i})^{T} z \leq 1, 1 \leq i \leq L},

P = {z \in R^{m} : (a^{i})^{T} z \leq 1, 1 \leq i \leq L},

(β^{j})^{T} z \leq 1, - (β^{j})^{T} z \leq 1, j = 1, ..., m

(β^{j})^{T} z \leq 1, - (β^{j})^{T} z \leq 1, j = 1, ..., m

P_{0} = P \cap {z \in R^{m} : (β^{j})^{T} z \leq 1, - (β^{j})^{T} z \leq 1, j = 1, ..., m} = {z \in R^{m} : (a^{i})^{T} z \leq 1, 1 \leq i \leq L; (β^{j})^{T} z \leq 1, - (β^{j})^{T} z \leq 1, j = 1, ..., m} .

P_{0} = P \cap {z \in R^{m} : (β^{j})^{T} z \leq 1, - (β^{j})^{T} z \leq 1, j = 1, ..., m} = {z \in R^{m} : (a^{i})^{T} z \leq 1, 1 \leq i \leq L; (β^{j})^{T} z \leq 1, - (β^{j})^{T} z \leq 1, j = 1, ..., m} .

T := {a^{i} : 1 \leq i \leq L} \cup {\pm β^{j} : 1 \leq j \leq m} .

T := {a^{i} : 1 \leq i \leq L} \cup {\pm β^{j} : 1 \leq j \leq m} .

δ^{H} (Ω^{*}, Ω_{P_{0}}) \leq ε^{'} .

δ^{H} (Ω^{*}, Ω_{P_{0}}) \leq ε^{'} .

P_{0} = {z \in R^{m} : (M_{P_{0}})^{T} z \leq e^{N}},

P_{0} = {z \in R^{m} : (M_{P_{0}})^{T} z \leq e^{N}},

θ_{P_{0}}^{*} : = (x, r, s, ξ, v) min {∥ x ∥_{1} : r \in s P_{0}, r = y - A x, (x, s, ξ, v) \in E} = (x, s, ξ, v) min {∥ x ∥_{1} : (M_{P_{0}})^{T} (y - A x) \leq s e^{N}, (x, s, ξ, v) \in E} .

θ_{P_{0}}^{*} : = (x, r, s, ξ, v) min {∥ x ∥_{1} : r \in s P_{0}, r = y - A x, (x, s, ξ, v) \in E} = (x, s, ξ, v) min {∥ x ∥_{1} : (M_{P_{0}})^{T} (y - A x) \leq s e^{N}, (x, s, ξ, v) \in E} .

(x, s, ξ, v) min {∥ x ∥_{1} : (M_{P_{0}})^{T} (y - A x) \leq s e^{N}, (x, s, ξ, v) \in E} .

(x, s, ξ, v) min {∥ x ∥_{1} : (M_{P_{0}})^{T} (y - A x) \leq s e^{N}, (x, s, ξ, v) \in E} .

Ω_{P_{0}}^{*} = {x \in R^{n} : ∥ x ∥_{1} \leq θ_{P_{0}}^{*}, r \in s P_{0}, r = y - A x, (x, s, ξ, v) \in E} .

Ω_{P_{0}}^{*} = {x \in R^{n} : ∥ x ∥_{1} \leq θ_{P_{0}}^{*}, r \in s P_{0}, r = y - A x, (x, s, ξ, v) \in E} .

\begin{array}[]{lcl}&\min\limits_{(x,t,s,\xi,v)}&e^{T}t\\ &$s.t$.&a_{1}s+a_{2}\xi+a_{3}\left(e^{h}\right)^{T}v\leq\varepsilon,~{}Bx\leq b,~{}|x|\leq t,\\ &&\left(M_{P_{0}}\right)^{T}(y-Ax)\leq se^{N},~{}(t,s,\xi,v)\geq 0,\\ &&\left\|\phi(x)\right\|_{\infty}\leq\xi,\left|\phi(x)\right|\leq v.\\ \end{array}

\begin{array}[]{lcl}&\min\limits_{(x,t,s,\xi,v)}&e^{T}t\\ &$s.t$.&a_{1}s+a_{2}\xi+a_{3}\left(e^{h}\right)^{T}v\leq\varepsilon,~{}Bx\leq b,~{}|x|\leq t,\\ &&\left(M_{P_{0}}\right)^{T}(y-Ax)\leq se^{N},~{}(t,s,\xi,v)\geq 0,\\ &&\left\|\phi(x)\right\|_{\infty}\leq\xi,\left|\phi(x)\right|\leq v.\\ \end{array}

\begin{array}[]{cl}\min\limits_{(x,t,s,\xi,v)}&e^{T}t\\ $s.t$.&x+t\geq 0,~{}-x+t\geq 0,\\ &-a_{1}s-a_{2}\xi-a_{3}\left(e^{h}\right)^{T}v\geq-\varepsilon,M_{P_{0}}^{T}Ax+e^{N}s\geq M_{P_{0}}^{T}y,\\ &U^{T}Ax+\xi e^{h}\geq U^{T}y,-U^{T}Ax+\xi e^{h}\geq-U^{T}y,\\ &U^{T}Ax+v\geq U^{T}y,-U^{T}Ax+v\geq-U^{T}y,\\ &~{}-Bx\geq-b,~{}(t,s,\xi,v)\geq 0.\end{array}

\begin{array}[]{cl}\min\limits_{(x,t,s,\xi,v)}&e^{T}t\\ $s.t$.&x+t\geq 0,~{}-x+t\geq 0,\\ &-a_{1}s-a_{2}\xi-a_{3}\left(e^{h}\right)^{T}v\geq-\varepsilon,M_{P_{0}}^{T}Ax+e^{N}s\geq M_{P_{0}}^{T}y,\\ &U^{T}Ax+\xi e^{h}\geq U^{T}y,-U^{T}Ax+\xi e^{h}\geq-U^{T}y,\\ &U^{T}Ax+v\geq U^{T}y,-U^{T}Ax+v\geq-U^{T}y,\\ &~{}-Bx\geq-b,~{}(t,s,\xi,v)\geq 0.\end{array}

\begin{array}[]{cl}\max\limits_{w}&-\varepsilon w_{3}+y^{T}M_{P_{0}}w_{4}+y^{T}U(w_{5}-w_{6}+w_{7}-w_{8})-b^{T}w_{9}\\ $s.t$.&w_{1}-w_{2}+A^{T}M_{P_{0}}w_{4}+A^{T}U(w_{5}-w_{6}+w_{7}-w_{8})-B^{T}w_{9}=0,\\ &w_{1}+w_{2}\leq e,\\ &-a_{1}w_{3}+(e^{N})^{T}w_{4}\leq 0,\\ &-a_{2}w_{3}+(e^{h})^{T}(w_{5}+w_{6})\leq 0,\\ &-a_{3}w_{3}e^{h}+w_{7}+w_{8}\leq 0,\\ &w_{1},w_{2}\in R^{n}_{+},~{}w_{3}\in R_{+},~{}w_{4}\in R^{N}_{+},~{}w_{5-8}\in R^{h}_{+},~{}w_{9}\in R_{+}^{l}.\end{array}

\begin{array}[]{cl}\max\limits_{w}&-\varepsilon w_{3}+y^{T}M_{P_{0}}w_{4}+y^{T}U(w_{5}-w_{6}+w_{7}-w_{8})-b^{T}w_{9}\\ $s.t$.&w_{1}-w_{2}+A^{T}M_{P_{0}}w_{4}+A^{T}U(w_{5}-w_{6}+w_{7}-w_{8})-B^{T}w_{9}=0,\\ &w_{1}+w_{2}\leq e,\\ &-a_{1}w_{3}+(e^{N})^{T}w_{4}\leq 0,\\ &-a_{2}w_{3}+(e^{h})^{T}(w_{5}+w_{6})\leq 0,\\ &-a_{3}w_{3}e^{h}+w_{7}+w_{8}\leq 0,\\ &w_{1},w_{2}\in R^{n}_{+},~{}w_{3}\in R_{+},~{}w_{4}\in R^{N}_{+},~{}w_{5-8}\in R^{h}_{+},~{}w_{9}\in R_{+}^{l}.\end{array}

\begin{array}[]{lll}\Theta=\biggr{\{}u:&-x-t\leq 0,~{}x-t\leq 0,~{}a_{1}s+a_{2}\xi+a_{3}\left(e^{h}\right)^{T}v\leq\varepsilon,\\ &-M_{P_{0}}^{T}Ax-e^{N}s\leq-M_{P_{0}}^{T}y,~{}Bx\leq b,\\ &-U^{T}Ax-\xi e^{h}\leq-U^{T}y,~{}U^{T}Ax-\xi e^{h}\leq U^{T}y,\\ &-U^{T}Ax-v\leq-U^{T}y,~{}U^{T}Ax-v\leq U^{T}y,\\ &w_{1}-w_{2}+A^{T}M_{P_{0}}w_{4}+A^{T}U(w_{5}-w_{6}+w_{7}-w_{8})-B^{T}w_{9}=0,\\ &w_{1}+w_{2}\leq e,~{}-a_{1}w_{3}+(e^{N})^{T}w_{4}\leq 0,~{}(t,s,\xi,v,w)\geq 0,\\ &-a_{2}w_{3}+(e^{h})^{T}(w_{5}+w_{6})\leq 0,~{}-a_{3}e^{h}w_{3}+w_{7}+w_{8}\leq 0,\\ &e^{T}t=-\varepsilon w_{3}+y^{T}M_{P_{0}}w_{4}+y^{T}U(w_{5}-w_{6}+w_{7}-w_{8})-b^{T}w_{9}\biggr{\}}.\end{array}

\begin{array}[]{lll}\Theta=\biggr{\{}u:&-x-t\leq 0,~{}x-t\leq 0,~{}a_{1}s+a_{2}\xi+a_{3}\left(e^{h}\right)^{T}v\leq\varepsilon,\\ &-M_{P_{0}}^{T}Ax-e^{N}s\leq-M_{P_{0}}^{T}y,~{}Bx\leq b,\\ &-U^{T}Ax-\xi e^{h}\leq-U^{T}y,~{}U^{T}Ax-\xi e^{h}\leq U^{T}y,\\ &-U^{T}Ax-v\leq-U^{T}y,~{}U^{T}Ax-v\leq U^{T}y,\\ &w_{1}-w_{2}+A^{T}M_{P_{0}}w_{4}+A^{T}U(w_{5}-w_{6}+w_{7}-w_{8})-B^{T}w_{9}=0,\\ &w_{1}+w_{2}\leq e,~{}-a_{1}w_{3}+(e^{N})^{T}w_{4}\leq 0,~{}(t,s,\xi,v,w)\geq 0,\\ &-a_{2}w_{3}+(e^{h})^{T}(w_{5}+w_{6})\leq 0,~{}-a_{3}e^{h}w_{3}+w_{7}+w_{8}\leq 0,\\ &e^{T}t=-\varepsilon w_{3}+y^{T}M_{P_{0}}w_{4}+y^{T}U(w_{5}-w_{6}+w_{7}-w_{8})-b^{T}w_{9}\biggr{\}}.\end{array}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Microwave Imaging and Scattering Analysis · Numerical methods in inverse problems

Full text

Stability Analysis for a Class of Sparse Optimization Problems

\nameJialiang Xua and Yun-Bin Zhaob Yun-Bin Zhao. Email: [email protected] a,b School of Mathematics, University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom

Abstract

The sparse optimization problems arise in many areas of science and engineering, such as compressed sensing, image processing, statistical and machine learning. The $\ell_{0}$ -minimization problem is one of such optimization problems, which is typically used to deal with signal recovery. The $\ell_{1}$ -minimization method is one of the plausible approaches for solving the $\ell_{0}$ -minimization problems, and thus the stability of such a numerical method is vital for signal recovery. In this paper, we establish a stability result for the $\ell_{1}$ -minimization problems associated with a general class of $\ell_{0}$ -minimization problems. To this goal, we introduce the concept of restricted weak range space property (RSP) of a transposed sensing matrix, which is a generalized version of the weak RSP of the transposed sensing matrix introduced in [Zhao et al., Math. Oper. Res., 44(2019), 175-193]. The stability result established in this paper includes several existing ones as special cases.

keywords:

Sparsity optimization; $\ell_{1}$ -minimization; stability; optimality condition; Hoffman theorem; restricted weak range space property.

1 Introduction

The sparsity is a useful assumption under which the sparse optimization models arise frequently in many areas in science and engineering. Let $A\in R^{m\times n}(m\ll n)$ , $B\in R^{l\times n}(l<n)$ and $U\in R^{m\times h}(m\ll h)$ be three given full-row-rank matrices. Let $y\in R^{m}$ and $b\in R^{l}$ be given vectors and $\varepsilon$ be a positive number. Consider the following sparse optimization model:

[TABLE]

where $\|x\|_{0}$ is called the ‘ $\ell_{0}$ -norm’ which counts the number of nonzero components of $x$ , and $a_{1},a_{2}$ and $a_{3}$ are given nonnegative parameters satisfying $\sum_{i=1}^{3}a_{i}=1$ . Many problems in signal and image processing (see, e.g., [6, 13, 17]) and statistical regressions [23] can be formulated as the form (1) or its special cases. In problem (1), the constraint $Bx\leq b$ is motivated by some practical applications. For instance, many signal recovery models might need to include certain constraints reflecting special structures of the target signal. For simplicity, we define

[TABLE]

and write the problem (1) as

[TABLE]

The following $\ell_{0}$ -minimization models are clearly the special cases of (1):

[TABLE]

The problem (C1) is often called the standard $\ell_{0}$ -minimization problem [17, 8, 28]. Two structured sparsity models, called the nonnegative sparsity model [8, 7, 17, 28] and the monotonic sparsity model (isotonic regression) [24, 23], are also the special cases of the model (1).

It is well known that $\ell_{1}$ -minimization is a useful method to solve the $\ell_{0}$ -minimization problem. By replacing the $\ell_{0}$ -norm with the $\ell_{1}$ -norm in problem (1), we immediately obtain the $\ell_{1}$ -minimization problem

[TABLE]

Similar to its $\ell_{0}$ counterpart, the problem (2) includes the following special cases:

[TABLE]

The problem (D2) is often called quadratically constrained basis pursuit [17, 10, 28], and it reduces to (D1) if $\varepsilon=0$ , which is called standard $\ell_{1}$ -minimization or the basis pursuit [12, 8, 19, 26, 17]. The problem (D4) is the type of Dantzig Selectors [9, 17].

From both numerical and theoretical viewpoints, it is important to know how close the solutions of $\ell_{0}$ - and $\ell_{1}$ -minimization problems are. To address this question, one needs to study the stability of $\ell_{1}$ -minimization methods. The stability of a sparse optimization method can be described as follows: For any $x\in R^{n}$ in the feasible set of a sparse optimization problem, the solution $x^{\#}$ generated by the method satisfies the following bound:

[TABLE]

where $C_{1}$ and $C_{2}$ are constants, and $\sigma_{k}(x)_{1}$ is called the error of the best $k$ -term approximation of the vector $x$ (see, e.g., [12, 17]):

[TABLE]

In this paper, we establish a stability result for the $\ell_{1}$ -minimization method (2). The stability of (D1) and (D2) has been investigated by Donoho, Candès, Tao, Romberg and others [14, 13, 6, 7, 8, 12, 25, 16, 3] under various assumptions such as the so-called restricted isometry property (RIP) of order $k$ , mutual coherence, stable null space property (NSP) of order $k$ or robust NSP of order $k$ . The RIP of order $k$ was introduced by Candès and Tao [8] to study the stability of $\ell_{1}$ -minimization. The singular-value-property-based stability analysis for (D1), (D2) and the Dantzig Selector have also been performed by Tang and Nehorai in [22].

A new and unified stability analysis for $\ell_{1}$ -minimization methods has been developed by Zhao, Jiang and Luo [29] under the assumption of weak RSP of order $k$ , which has been proven as a necessary and sufficient condition for the standard $\ell_{1}$ -minimization to be stable. The main differnece between the weak-RSP-based-analysis and existing ones lies in the constants $C_{1}$ and $C_{2}$ in (3). Specifically, the constants $C_{1}$ and $C_{2}$ in (3) are determined by the RIP or NSP constant in existing analysis [17, 3, 8]. However, in [29, 28], these constants are determined by the so-called Robinson’s constant. Motivated by the new analysis tool introduced in [29], we develop the stability result for the model (2) in this paper under the assumption of restricted weak range space property ( $\mathrm{RSP}$ ) of order $k$ (which will be introduced in next section). Our result extends the stability theorem for $\ell_{1}$ -minimization established by Zhao et al. [29, 30, 28].

This paper is organized as follows. In Section 2, we introduce the concept of restricted weak $\mathrm{RSP}$ of order $k$ . An approximation of the solution set of (2) will be discussed in Section 3. Then, in Section 4, we show the main stability result of this paper. Finally, some special cases are discussed in Section 5.

Notation

The field of real numbers is denoted by $R$ and the $n$ -dimensional Euclidean space is denoted by $R^{n}$ . Let $R^{n}_{+}$ and $R^{n}_{-}$ be the sets of nonnegative and nonpositive vectors, respectively. Unless otherwise stated, the identity matrix of suitable size is denoted by $I$ . Given a vector $u\in R^{n}$ , $|u|$ , $(u)^{+}$ and $(u)^{-}$ denote the vectors with components $|u|_{j}=|u_{j}|$ , $[(u)^{+}]_{j}=\max\{u_{j},0\}$ and $[(u)^{-}]_{j}=\min\{u_{j},0\}$ , $j=1,...,n$ , respectively. The cardinality of the set $S$ is denoted by $|S|$ and the complementary set of $S\subseteq\left\{1,...,n\right\}$ is denoted by $\bar{S}$ , i.e., $\bar{S}=\{1,...,n\}\setminus S$ . For a given vector $x\in R^{n}$ , $x_{S}$ denotes the vector supported on $S$ . $a_{i,j}$ denotes the entry of the matrix $A$ in row $i$ and column $j$ . For the set $S\subseteq\{1,...,n\}$ , $A_{S}$ denotes the submatrix of $A\in R^{m\times n}$ obtained by deleting the columns indexed by $\bar{S}$ . For a matrix $A=\left(a_{i,j}\right)$ , $|A|$ represents the absolute version of $A$ , i.e., $|A|=\left(|a_{i,j}|\right)$ . $\mathcal{R}\left(A^{T}\right)=\{A^{T}y:y\in R^{m}\}$ is the range space of $A^{T}$ . $\left\|x\right\|_{p}=\left(\sum_{i=1}^{n}\left|x_{i}\right|^{p}\right)^{1/p}$ , where $p\geq 1$ , is a norm, called the $\ell_{p}$ -norm of $x$ . $\left\|x\right\|_{\infty}=\max_{i=1}^{n}\left|x_{i}\right|$ is called the $\ell_{\infty}$ -norm of $x$ . For $1\leq p,q\leq\infty$ , $\left\|A\right\|_{p\rightarrow q}=\sup_{\left\|x\right\|_{p}\leq 1}\left\|Ax\right\|_{q}$ is the matrix norm induced by $\ell_{p}$ - and $\ell_{q}$ -norms.

2 Restricted weak range space property

The $\mathrm{RSP}$ of order $k$ of a transposed matrix was first introduced in [26, 27] to develop a necessary and sufficient condition for the uniform recovery of sparse signals via $\ell_{1}$ -minimization. Zhao et al. [29] generalised the $\mathrm{RSP}$ of order $k$ to the following weak $\mathrm{RSP}$ of order $k$ to develop a stability theory for convex optimization algorithms:

Definition 2.1 (weak $\mathrm{RSP}$ of order $k$ ).

Given a matrix $A\in R^{m\times n}$ , $A^{T}$ is said to satisfy the weak $\mathrm{RSP}$ order $k$ if for any two disjoint sets $J_{1},J_{2}\subseteq\{1,...,n\}$ satisfying $|J_{1}|+|J_{2}|\leq k$ , there exists a vector $\eta\in\mathcal{R}\left(A^{T}\right)$ such that

[TABLE]

In [29, 28], it was shown that the weak $\mathrm{RSP}$ of order $k$ is a sufficient condition for the stability of many convex optimization methods, and it is also a necessary stability condition for many optimization methods.

Different from the problems (D1)-(D4), the problem (2) is more general than these models. To investigate the stability of the problem (2), we need to extend the notion of weak RSP of order $k$ to the so-called restricted weak $\mathrm{RSP}$ of order $k$ , which is defined as follows:

Definition 2.2 (Restricted weak $\mathrm{RSP}$ of order $k$ ).

Given matrices $A\in R^{m\times n}$ and $B\in R^{l\times n}$ , the pair $\left(A^{T},B^{T}\right)$ is said to satisfy the restricted weak $\mathrm{RSP}$ of order $k$ if for any two disjoint sets $J_{1},J_{2}\subseteq\{1,...,n\}$ satisfying $|J_{1}|+|J_{2}|\leq k$ , there exists a vector $\eta\in\mathcal{R}\left(A^{T},B^{T}\right)$ such that $\eta=\left(A^{T},B^{T}\right)\left(\begin{array}[]{c}\nu\\ h\end{array}\right)$ where $\nu\in R^{m}$ , $h\in R^{l}_{-}$ and

[TABLE]

It is worth mentioning that a generalized version of the RSP of order $k$ is also used in [31] to study the exact sign recovery in 1-bit compressive sensing.

3 Approximation of (2) and its solution set

By introducing the slack variables $r$ , $s$ , $\xi$ and $v$ , the problem (2) can be rewritten as

[TABLE]

where $e^{h}$ is the vector of ones in $R^{h}$ and $\mathcal{B}$ is the unit $\ell_{2}$ -ball defined as $\mathcal{B}=\{z\in R^{m}:\left\|z\right\|_{2}\leq 1\}$ . The unit ball $\mathcal{B}$ can be also described as

[TABLE]

Denote the set ${\rm E}$ by

[TABLE]

and hence the solution set of (4) can be represented as

[TABLE]

where $\theta^{*}$ is the optimal value of (4). By replacing $\mathcal{B}$ in (6) with a polytope $P\supseteq\mathcal{B}$ , we can get the relaxation of $\Omega^{*}$ , denoted by $\Omega_{P}$ , i.e.,

[TABLE]

The polytope $\Omega_{P}$ can approximate $\Omega^{*}$ to any level of accuracy provided that $P$ is chosen suitably. Recall the Hausdorff metric of two sets $M_{1},M_{2}\subseteq R^{m}$ :

[TABLE]

Following the analysis in [29, 28] (see Lemmas 5.1, 5.2 and 5.3 in [29]), we can obtain the following lemma:

Lemma 3.1.

Let $\varepsilon$ be the given number in problem (2). Then for any $\varepsilon^{\prime}\leq\varepsilon$ , there exists a polytope approximation $P$ of $\mathcal{B}$ satisfying $P\supseteq\mathcal{B}$ and

[TABLE]

In the remainder of this paper, we fix $\varepsilon^{\prime}\in(0,\varepsilon]$ and choose the polytope $P$ such that $\Omega_{P}$ and $\Omega^{*}$ satisfy (8). The polytope $P$ can be represented as the intersection of a finite number of half spaces:

[TABLE]

where $\textbf{a}^{i},~{}1\leq i\leq L$ are some unit vectors (i.e., $\|\textbf{a}^{i}\|_{2}=1$ ), and $L$ is an integer number. By adding the $2m$ half spaces

[TABLE]

to $P$ , where $\beta^{j}$ is the $j$ th column of the $m\times m$ identity matrix, we obtain the following polytope:

[TABLE]

We define $T$ as the collection of the vectors $\textbf{a}^{i}$ and $\pm\beta^{j}$ in $P_{0}$ , that is,

[TABLE]

Clearly, $P_{0}$ still satisfies (8) in Lemma 3.1, i.e.,

[TABLE]

In the remainder of the chapter, we use the above defined polytope $P_{0}.$ Let $N=|T|,$ and let $M_{P_{0}}$ be the matrix with column vectors in $T$ . Thus $P_{0}$ can be written as

[TABLE]

where $e^{N}$ is the vector of ones in $R^{N}$ .

By replacing $\mathcal{B}$ by $P_{0}$ , we obtain the following approximation of the optimal value $\theta^{*}$ of (2):

[TABLE]

The associated approximation problem of (2) can be written as

[TABLE]

The solution set of (10) is

[TABLE]

Note that $\mathcal{B}\subseteq P_{0}$ implies that $\theta^{*}\geq\theta^{*}_{P_{0}}$ . So we can see that $\Omega^{*}_{P_{0}}\subseteq\Omega_{P_{0}}$ . By the definition of $P_{0}$ , we also have $\Omega^{*}\subseteq\Omega_{P_{0}}$ . In the next section, we prove the main result for the problem (2).

4 Main result

Introducing a variable $t$ yields the following equivalent form of (10):

[TABLE]

The solution set of (12) is given as (11). Note that the above optimization problem is equivalent to a linear programming problem. In fact, the constraint $\left\|\phi(x)\right\|_{\infty}\leq\xi$ can be rewritten as $\left|\phi(x)\right|\leq\xi e^{h},$ where $e^{h}$ is the vector of ones in $R^{h}$ . Thus the model (12) can be rewritten explicitly as the linear programming problem

[TABLE]

The dual problem of (13) is given as follows:

[TABLE]

The optimality condition yields the following lemma:

Lemma 4.1.

Denote by $u=(x,t,s,\xi,v,w)$ . Then $x^{*}$ is an optimal solution of (10) if and only if there exists a vector $u^{*}=(x^{*},t^{*},s^{*},\xi^{*},v^{*},w^{*})\in\Theta$ , where $\Theta$ is the set given as

[TABLE]

Clearly, $|x^{*}|=t^{*}$ holds for every $u^{*}\in\Theta$ . The set $\Theta$ can be written as the form

[TABLE]

where the vectors $q^{\prime}=0$ and

[TABLE]

The matrices $M^{\prime}_{1}$ and $M^{\prime}_{2}$ in (15) are given as follows:

[TABLE]

where the matrices $M_{*}$ , $M_{**}$ , $D^{1}$ , $D^{2}$ and $D^{3}$ and $\widetilde{I}$ are given as follows:

[TABLE]

In the above matrices, [math]’s are zero matrices with suitable sizes and $I$ , $I^{h}$ and $\widetilde{I}$ are the $n\times n$ , $h\times h$ and $(2n+1+N+4h+l)\times(2n+1+N+4h+l)$ identity matrices, respectively.

To prove the main stability result, we also need the next two Lemmas.

Lemma 4.2 (Hoffman [18, 21]).

Let $M_{1}\in R^{m\times n}$ and $M_{2}\in R^{l\times n}$ be two given matrices and the set $\mathcal{Q}$ be given as

[TABLE]

For any vector $x\in R^{n}$ , there exists a vector $x^{*}\in\mathcal{Q}$ satisfying

[TABLE]

where $\sigma(M_{1},M_{2})$ is a constant determined by $M_{1}$ and $M_{2}$ .

The constant $\sigma(M_{1},M_{2})$ is also called the Robinson constant. We also use the following lemma in the proof of the main result in this section.

Lemma 4.3 ([28, 30]).

Let $\pi_{S}(x)$ be the projection of $x$ into the convex set $S$ , i.e., $\pi_{S}(x)=\arg\min_{z\in S}\|x-z\|_{2}.$ Let the three convex compact sets $T_{1}$ , $T_{2}$ and $T_{3}$ satisfy that $T_{1}\subseteq T_{2}$ and $T_{3}\subseteq T_{2}.$ Then for any $x\in R^{n}$ and any $z\in T_{3}$ the following holds:

[TABLE]

We also define two types of constants. Let

[TABLE]

be a matrix with full row rank. Given three positive numbers $c,d,\widehat{d}\in[1,\infty]$ , we define the constants $\Upsilon(d,\widehat{d})$ and $\vartheta(c)$ as follows:

[TABLE]

We will use the above constants together with the specific constants $\Upsilon(1,1),\Upsilon(\infty,\infty)$ and $\vartheta(1)$ in the stability analysis of (2). The main result is given as follows.

Theorem 4.4.

Let the problem data $(U,A,B,\varepsilon,a_{1},a_{2},a_{3},b,y)$ of (2) be given, and the matrix $C\in R^{(m+l)\times n}$ be given in (18) with full row rank. Let $P_{0}$ be the polytope given in (9) satisfying (8). If $\left(A^{T},B^{T}\right)$ satisfies the restricted weak $\mathrm{RSP}$ of order $k$ , then for any $x\in R^{n}$ , there is an optimal solution $x^{*}$ of (2) satisfying the bound

[TABLE]

where $\sigma^{\prime}$ is the Robinson constant determined by $(M^{\prime}_{1},M^{\prime}_{2})$ in (16), $\Upsilon(d,\widehat{d})$ and $\vartheta(c)$ are the constants given in (19a) and (19b), and $\hat{\Upsilon}=\max\{\Upsilon(1,1),\Upsilon(\infty,\infty),\vartheta(1)\}.$ $\widehat{d},d,c,d^{\prime},c^{\prime}\in[1,+\infty]$ are five given positive numbers (allowing to be $\infty$ ) satisfying

[TABLE]

In particular, if $x$ is a feasible solution of (2), then there is an optimal solution $x^{*}$ of (2) such that

[TABLE]

Proof.

Let $x$ be any given vector in $R^{n}$ and $P_{0}$ be the fixed polytope given in (9) satisfying (8) in Lemma 3.1. We let $(t,s,\xi,v)$ satisfy that

[TABLE]

With such a choice of $(t,s,\xi,\nu)$ , we have

[TABLE]

Let $J$ be the support set of $k$ largest absolute entries of $x$ , and $J_{1}$ and $J_{2}$ be the sets such that

[TABLE]

Clearly, $|J_{1}\cup J_{2}|=|J|=|J_{1}|+|J_{2}|\leq k$ . Let $J_{3}$ be the complementary set of $J$ . Clearly, $J_{1}$ , $J_{2}$ and $J_{3}$ are disjoint. Under the assumption of restricted weak RSP of order $k$ , there exists a vector $\eta\in R\left(A^{T},B^{T}\right)$ such that $\eta=A^{T}\nu^{*}+B^{T}h^{*}$ for some $\nu^{*}\in R^{m}$ and $h^{*}\in R^{l}_{-}$ satisfying

[TABLE]

Now we construct a feasible solution $w=(w_{1},...,w_{9})$ to the dual problem (14).

Constructing ( $w_{1},w_{2}$ ). Set $w_{1}$ and $w_{2}$ as follows:

[TABLE]

Such $w_{1}$ and $w_{2}$ satisfy that

[TABLE]

Constructing ( $w_{5}$ – $w_{8}$ ). Note that $U$ is a matrix with full row rank. There must exist an invertible $m\times m$ matrix of $U$ , denoted by $U_{\mho}$ , where $\mho\subseteq\{1,...,h\}$ with $|\mho|=m$ . Denote the complementary set of $\mho$ by $\bar{\mho}=\{1,...,h\}\setminus\mho.$ Then we construct a vector $g\in R^{h}$ satisfying $g_{\mho}=U^{-1}_{\mho}\nu^{*}$ and $g_{\bar{\mho}}=0,$ which imply that

[TABLE]

Let $g^{+}$ ( $g^{-}$ ) be the vector obtained by keeping the positive (negative) components of $g$ and setting the remaining components to [math]. By using the vector $g$ , $w_{5}$ – $w_{8}$ can be constructed as follows:

[TABLE]

which implies that

[TABLE]

Constructing $w_{4}$ . Without loss of generality, we suppose that the first $m$ columns in $M_{P_{0}}$ are $\beta_{j},~{}j=1,...,m,$ and $-\beta_{j},~{}j=1,...,m$ are the second $m$ columns of $M_{P_{0}}$ . The components of $w_{4}$ can be assigned as follows:

[TABLE]

From this choice of $w_{4}$ , we can see that

[TABLE]

Constructing $w_{3}$ . Let $w_{3}=\max\left\{\left\|\nu^{*}\right\|_{1},\left\|g\right\|_{1},\left\|g\right\|_{\infty}\right\}$ . Such a choice of $w_{3}$ together with the choice of $w_{4}$ – $w_{8}$ implies that

[TABLE]

Constructing $w_{9}$ . Let $w_{9}=-h^{*}$ . Clearly, $w_{9}\geq 0$ due to $h^{*}\leq 0$ .

With the above choice of $w$ , we deduce from (26), (29), (30) and (31) that

[TABLE]

Let $\mathcal{X}$ and $\mathcal{Y}$ be defined as follows:

[TABLE]

For the vector $u=(x,t,s,\xi,\nu,w)$ where $(t,s,\xi,\nu,w)$ is constructed above, by Lemma 4.2, there exists a vector $\hat{u}\in\Theta,$ where $\Theta$ is given in Lemma 4.1 and written as (15), such that

[TABLE]

where $\sigma^{\prime}$ is the Robinson constant determined by $(M^{\prime}_{1},M^{\prime}_{2})$ given by (16). Since the vector $(x,t,s,\xi,v,w)$ satisfies (24) and (32), the inequality (33) can be simplified to

[TABLE]

In the reminder of the proof, we estimate the terms on the right-hand side of (34). Note that the vectors in $T$ are unit vectors. It is easy to see that

[TABLE]

The value of $s$ in (23) implies that $s\leq\left\|y-Ax\right\|_{2}.$ Therefore we have

[TABLE]

Due to (27), (29) and (30), we have

[TABLE]

The fact $A^{T}\nu^{*}+B^{T}h^{*}=\eta$ (due to the restricted weak RSP of order $k$ ) and the triangle inequality imply that

[TABLE]

Now we deal with the right-hand side of the above inequality. First, by using the index sets $J$ and $J_{3}$ , we have

[TABLE]

It follows from $t=|x|$ and (25) that

[TABLE]

Then we obtain

[TABLE]

By using the restricted weak $\mathrm{RSP}$ of order $k$ , we have

[TABLE]

where $C=\left[A^{T},B^{T}\right]^{T}\in R^{(m+l)\times n}$ and $\vartheta(1)$ is defined in (19b). Moreover, we have

[TABLE]

Recall that $\Upsilon(1,1)$ is determined in (19a). Then $\left\|g\right\|_{1}\leq\Upsilon(1,1)$ . Similarly, $\left\|g\right\|_{\infty}\leq\Upsilon(\infty,\infty)$ can be obtained. Due to $w_{3}=\max\left\{\left\|\nu^{*}\right\|_{1},\left\|g\right\|_{1},\left\|g\right\|_{\infty}\right\}$ , we have

[TABLE]

Let $c,d,\widehat{d}\in[1,+\infty]$ be three given positive numbers and $d,d^{\prime}$ be two given numbers satisfying (21). For the term $\left|(\phi(x))^{T}g\right|$ in (36), it follows from Hölder inequalities that

[TABLE]

Let $\Upsilon(d,\widehat{d})$ be given as (19a), i.e.,

[TABLE]

Thus we have

[TABLE]

Similarly, the following inequalities holds

[TABLE]

Due to (37), (38), (40) and (41), the inequality (36) is reduced to

[TABLE]

where $\hat{\Upsilon}=\max\{\Upsilon(1,1),\Upsilon(\infty,\infty),\vartheta(1)\}$ .

Note that $\left\|x-\hat{x}\right\|_{2}\leq\left\|u-\hat{u}\right\|_{2}.$ It follows from (34), (35) and (42) that

[TABLE]

We recall the three sets $\Omega^{*}$ , $\Omega_{P_{0}}$ and $\Omega^{*}_{P_{0}}$ , where $\Omega^{*}$ and $\Omega^{*}_{P_{0}}$ are the solution sets of (2) and (10), given as (6) and (11), respectively, and $\Omega_{P_{0}}$ is given as (7) with $P=P_{0}$ . Clearly, $\hat{x}\in\Omega^{*}_{P_{0}}$ . Let $x^{*}$ denote the projection of $x$ onto $\Omega^{*}$ , that is,

[TABLE]

Note that the three sets are compact convex sets satisfying $\Omega^{*}\subseteq\Omega_{P_{0}}$ and $\Omega_{P_{0}}^{*}\subseteq\Omega_{P_{0}}$ . Then by applying Lemma 4.3 with $T_{1}=\Omega^{*}$ , $T_{2}=\Omega_{P_{0}}$ and $T_{3}=\Omega^{*}_{P_{0}}$ , we have

[TABLE]

Since $P_{0}$ satisfies (8), it implies that

[TABLE]

Let $\hat{\Upsilon}=\max\{\Upsilon(1,1),\Upsilon(\infty,\infty),\vartheta(1)\}.$ Combination of the above inequality and (43) yields the desired results (20). If $x$ is the feasible solution of (2), then $\left\|(Bx-b)^{+}\right\|_{1}=0$ and

[TABLE]

and thus the desired error bound (22) is also obtained. ∎

Based on Theorem 4.4, the error bound for the solutions of (1) and (2) can be stated as follows.

Corollary 4.5.

For any optimal solution $x$ of (1), there is an optimal solution $x^{*}$ of (2) estimating $x$ with the error:

[TABLE]

where the constants $\varepsilon^{\prime}$ , $\hat{\Upsilon}$ , $\sigma^{\prime}$ , $\Upsilon(d,\widehat{d})$ and $\vartheta(c)$ are given as in Theorem 4.4.

5 Special cases

Firstly, by setting different values of $a_{1},a_{2}$ and $a_{3}$ , the problem $\eqref{Ps}$ can reduce to several special cases, and the corresponding stability results for these special cases can be obtained from (20) and (22) immediately. Note that if any of $a_{1},a_{2}$ and $a_{3}$ is zero, the constant $\hat{\Upsilon}=\max\{\Upsilon(1,1),\Upsilon(\infty,\infty),\vartheta(1)\}$ in (20) and (22) will be simplified as well. For example, if $a_{1}=0$ , the constant $\hat{\Upsilon}$ is reduced to $\max\{\Upsilon(1,1),\Upsilon(\infty,\infty)\}$ . The following table shows the form of the constant $\hat{\Upsilon}$ for different choices of $a_{1},a_{2}$ and $a_{3}$ .

Note that for any case with $a_{1}=0$ , we have $\Omega^{*}=\Omega_{P_{0}}=\Omega_{P_{0}}^{*}$ so that $\hat{x}=x^{*}$ where $\hat{x}\in\Omega_{P_{0}}^{*}$ and $x^{*}\in\Omega^{*}$ . Thus instead of using Lemma 4.3, the stability results can be immediately obtained from (43).

Secondly, without matrix $B$ , the problem (2) is reduced to

[TABLE]

In this case, the restricted weak $\mathrm{RSP}$ of order $k$ is reduced to the standard weak $\mathrm{RSP}$ of order $k$ , which means $A^{T}\nu^{*}=\eta$ . In fact, the upper bound of $\left|(\phi(x))^{T}g\right|$ in (39) can be improved to

[TABLE]

Then in order to obtain a tighter bound, $\Upsilon(d,\widehat{d})$ can be replaced by

[TABLE]

Thus we have $\left|(\phi(x))^{T}g\right|\leq\left\|\phi(x)\right\|_{d^{\prime}}\Upsilon^{\prime}(d)$ . Similarly, the constants $\Upsilon(1,1)$ and $\Upsilon(\infty,\infty)$ are replaced by $\Upsilon^{\prime}(1)$ and $\Upsilon^{\prime}(\infty)$ , respectively. Clearly, in this case, $\vartheta(c)=\left\|(AA^{T})^{-1}A\right\|_{\infty\rightarrow c}$ . Let $\hat{\Upsilon^{\prime}}=\max\{\Upsilon^{\prime}(1),\Upsilon^{\prime}(\infty),\vartheta(1)\}$ . Then the bound (22) is reduced to

[TABLE]

Similarly, we list the constants $\hat{\Upsilon}^{\prime}$ for different choices of $a_{i},~{}i=1,2,3$ in the following table.

Note that when $a_{1}=0$ , we have $\hat{\Upsilon}^{\prime}=\Upsilon^{\prime}(1)$ due to the fact $\left\|U_{\mho}^{-1}\left(AA^{T}\right)^{-1}A\right\|_{\infty\rightarrow 1}\geq\left\|U_{\mho}^{-1}\left(AA^{T}\right)^{-1}A\right\|_{\infty\rightarrow\infty}$ . Moreover, in this case, setting $d=1$ yields

[TABLE]

which is the bound for the following $\ell_{1}$ -minimization established by Zhao and Li [30] (see also in Zhao [28]):

[TABLE]

Last but not least, our analysis can also apply to 1-bit basis pursuit [31], which can be viewed as a special case of our model (2). The stability result for the 1-bit basis pursuit in [31] can be obtained immediately from Theorem 4.4 by setting $a_{2}=a_{3}=0$ .

6 Conclusion

In this paper, we have studied the stability issue of the $\ell_{1}$ -minimization method (2). To establish our results, we introduced the restricted weak RSP of order $k$ which is a mild assumption governing the stability of sparsity-seeking algorithms. Under this assumption, we use the classic Hoffman theorem and Lemma 4.3 to show that the $\ell_{1}$ -minimization method (2) is stable and thus the error between the solutions of the problems (1) and (2) can be measured in terms of the best $k$ term approximation and the problem data (see Theorem 4.4). The result developed in this paper can apply to a range of problems with constraints defined by $\ell_{1}$ -, $\ell_{2}$ -, and $\ell_{\infty}$ -norms.

Bibliography31

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] J. Andersson and J. O. Str o ¨ ¨ o \ddot{\mathrm{o}} mberg, On the theorem of uinform recoery of structured random matrices , IEEE Trans. Inform. Theory 60 (2014), pp. 1700–1710.
2[2] P. T. Boufounos, Greedy sparse signal reconstruction from sign measurements , Porc. 43rd Asilomar Conf. Signals, Systems and Computers, 2009, pp. 1305–1309.
3[3] J. Cahill, X. Chen, and R. Wang, The gap between the null space property and restricted isometry property , Linear Algebra Appl. 501 (2016), pp. 363–375.
4[4] T. Cai, L. Wang, and G. Xu, New bounds for restricted isometry constants , IEEE Transactions on Information Theory. 56 (2010), pp. 4388–4394.
5[5] T. Cai and A. Zhang, Sharp RIP bound for sparse signal and low-rank matrix recovery , Applied and Computational Harmonic Analysis. 35 (2013), pp. 74–93.
6[6] E. Candès, Compressive sampling , Proceedings of the International Congress of Mathematicians. 3 (2006), pp. 1433–1452.
7[7] E. Candès, J. Romberg and T. Tao, Stable signal recovery from incomplete and inaccurate measurements , Communications on Pure and Applied Mathematics 59 (2006), pp. 1207–1223.
8[8] E. Candès and T. Tao, Decoding by linear programming , IEEE Transactions on Information Theory 51 (2005), pp. 4203–4215.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Stability Analysis for a Class of Sparse Optimization Problems

Abstract

keywords:

1 Introduction

Notation

2 Restricted weak range space property

Definition 2.1** (weak RSP\mathrm{RSP}RSP of order kkk).**

Definition 2.2** (Restricted weak RSP\mathrm{RSP}RSP of order kkk).**

3 Approximation of (2) and its solution set

Lemma 3.1**.**

4 Main result

Lemma 4.1**.**

Lemma 4.2** (Hoffman [18, 21]).**

Lemma 4.3** ([28, 30]).**

Theorem 4.4**.**

Proof.

Corollary 4.5**.**

5 Special cases

6 Conclusion

Definition 2.1 (weak $\mathrm{RSP}$ of order $k$ ).

Definition 2.2 (Restricted weak $\mathrm{RSP}$ of order $k$ ).

Lemma 3.1.

Lemma 4.1.

Lemma 4.2 (Hoffman [18, 21]).

Lemma 4.3 ([28, 30]).

Theorem 4.4.

Corollary 4.5.