Location and Orientation Optimisation for Spatially Stretched Tripole   Arrays Based on Compressive Sensing

Matthew Hawes; Lyudmila Mihaylova; Wei Liu

arXiv:1702.00248·cs.IT·February 2, 2017

Location and Orientation Optimisation for Spatially Stretched Tripole Arrays Based on Compressive Sensing

Matthew Hawes, Lyudmila Mihaylova, Wei Liu

PDF

TL;DR

This paper introduces innovative compressive sensing methods to optimize both the locations and orientations of sparse, spatially stretched tripole arrays, significantly reducing the number of dipoles needed while maintaining performance.

Contribution

It formulates the joint location and orientation optimization as compressive sensing problems, a novel approach for designing sparse tripole arrays.

Findings

01

Achieved 67% reduction in dipole count compared to uniform arrays.

02

Validated the effectiveness of the proposed methods in approximating reference responses.

03

Demonstrated cost reduction in array design.

Abstract

The design of sparse spatially stretched tripole arrays is an important but also challenging task and this paper proposes for the very first time efficient solutions to this problem. Unlike for the design of traditional sparse antenna arrays, the developed approaches optimise both the dipole locations and orientations. The novelty of the paper consists in formulating these optimisation problems into a form that can be solved by the proposed compressive sensing and Bayesian compressive sensing based approaches. The performance of the developed approaches is validated and it is shown that accurate approximation of a reference response can be achieved with a 67% reduction in the number of dipoles required as compared to an equivalent uniform spatially stretched tripole array, leading to a significant reduction in the cost associated with the resulting arrays.

Tables20

Table 1. TABLE I: Dipole locations and orientations for the broadside CS-IMDSM design example.

n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$
1	0.34	4	2.86	7	5.59	10	8.57
2	1.18	5	3.79	8	6.53	11	9.48
3	2.02	6	4.64	9	7.67

Table 2. TABLE II: Dipole locations and orientations for the broadside BCS-IMDSM design example.

n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$
1	0.56	4	3.48	7	6.37	10	9.02
2	1.43	5	4.48	8	7.25	11	9.89
3	2.56	6	5.44	9	8.12

Table 3. TABLE III: Dipole locations and orientations for the broadside AIRMS design example.

n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$
1	1.50	4	4.17	6	5.80	8	7.60
2	2.30	5	5	7	6.70	9	8.47
3	3.27

Table 4. TABLE IV: Performance comparison for the broadside design examples.

	CS-	BCS-
Example	IMDSM	IMDSM	AIRMS	ULA
Aperture/ $λ$	9.11	9.33	6.97	10
$\bar{Δ d} / λ$	0.91	0.93	0.87	0.50
Number of
dipoles	11	11	9	21
( $%$ decrease)	48	48	57	0
Error	1.00	0.43	0.46	0.64
Amplitude of
closest sidelobe (dB)	-20.02	-31.47	-30.55	-26.83
Computation
time (seconds)	363.16	4.38	62.03	1.17
Number of
iterations	11	11	3	2

Table 5. TABLE V: Performance comparison for the CS-IMDSM broadside design examples.

M	101	201	301	401
Aperture/ $λ$	9.08	7.19	9.11	9.13
$\bar{Δ d} / λ$	0.91	0.90	0.91	0.91
Number of
dipoles	11	10	11	11
( $%$ decrease)	48	52	48	48
Error	1.07	1.12	1.00	1.25
Amplitude of
closest sidelobe (dB)	-14.18	-17.85	-20.02	-10.47
Computation
time (seconds)	47.46	235.94	363.16	546.89
Number of
iterations	11	10	11	11

Table 6. TABLE VI: Performance comparison for the BCS-IMDSM broadside design examples.

M	101	201	301	401
Aperture/ $λ$	9.05	9.49	9.33	8.88
$\bar{Δ d} / λ$	0.91	0.95	0.93	0.89
Number of
dipoles	11	11	11	11
( $%$ decrease)	48	48	48	48
Error	0.82	0.87	0.43	0.81
Amplitude of
closest sidelobe (dB)	-22.41	-20.56	-31.47	-19.63
Computation
time (seconds)	3.43	2.86	4.38	38.00
Number of
iterations	11	11	11	11

Table 7. TABLE VII: Performance comparison for the AIRMS broadside design examples.

M	101	201	301	401
Aperture/ $λ$	NA	6.95	6.97	6.98
$\bar{Δ d} / λ$	NA	0.87	0.87	0.87
Number of
dipoles	NA	9	9	9
( $%$ decrease)	NA	57	57	57
Error	NA	0.48	0.46	0.45
Amplitude of
closest sidelobe (dB)	NA	-24.61	-30.55	-29.88
Computation
time (seconds)	NA	34.06	62.03	99.06
Number of
iterations	NA	2	3	2

Table 8. TABLE VIII: Performance comparison for varying values of α 𝛼 \alpha .

$α$	0.35	0.65	0.65
(method)	(CS-IMDSM)	(CS-IMDSM)	(AIRMS)
Aperture/ $λ$	5.31	8.89	6.10
$\bar{Δ d} / λ$	0.88	0.89	0.87
Number of
dipoles	7	11	8
( $%$ decrease)	67	48	62
Error	1.33	0.66	0.63
Amplitude of
closest sidelobe (dB)	-16.83	-21.22	-26.26
Computation
time (seconds)	379.95	339.11	71.84
Number of
iterations	8	11	2

Table 9. TABLE IX: Dipole locations and orientations for the off-broadside ( θ M L = 60 ∘ subscript 𝜃 𝑀 𝐿 superscript 60 \theta_{ML}=60^{\circ} and ϕ M L = 90 ∘ subscript italic-ϕ 𝑀 𝐿 superscript 90 \phi_{ML}=90^{\circ} ) CS-IMDSM design example.

n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$
1	0.15	4	3.15	7	6.22	9	8.31
2	1.21	5	4.17	8	7.40	10	9.22
3	2.22	6	5.23

Table 10. TABLE X: Dipole locations and orientations for the off-broadside ( θ M L = 60 ∘ subscript 𝜃 𝑀 𝐿 superscript 60 \theta_{ML}=60^{\circ} and ϕ M L = 90 ∘ subscript italic-ϕ 𝑀 𝐿 superscript 90 \phi_{ML}=90^{\circ} ) BCS-IMDSM design example.

n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$
1	0.24	4	3.22	7	5.98	10	8.55
2	1.26	5	4.16	8	6.86	11	9.37
3	2.25	6	5.08	9	7.72

Table 11. TABLE XI: Dipole locations and orientations for the off-broadside ( θ M L = 60 ∘ subscript 𝜃 𝑀 𝐿 superscript 60 \theta_{ML}=60^{\circ} and ϕ M L = 90 ∘ subscript italic-ϕ 𝑀 𝐿 superscript 90 \phi_{ML}=90^{\circ} ) AIRMS design example.

n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$
1	0	4	3.37	7	6.17	9	8.27
2	1	5	4.27	8	7.20	10	9.70
3	2.40	6	5.20

Table 12. TABLE XII: Performance comparison for the off-broadside ( θ M L = 60 ∘ subscript 𝜃 𝑀 𝐿 superscript 60 \theta_{ML}=60^{\circ} and ϕ M L = 90 ∘ subscript italic-ϕ 𝑀 𝐿 superscript 90 \phi_{ML}=90^{\circ} ) design examples.

	CS-	BCS-
Example	IMDSM	IMDSM	AIRMS	ULA
Aperture/ $λ$	9.08	9.13	9.70	10
$\bar{Δ d} / λ$	1.01	0.91	1.08	0.50
Number of
dipoles	10	11	12	21
( $%$ decrease)	52	48	62	0
Error	1.60	1.00	1.12	0.89
Amplitude of
closest sidelobe (dB)	-13.84	-19.20	-24.15	-22.02
Computation
time (seconds)	300.07	4.88	92.36	1.26
Number of
iterations	10	11	4	2

Table 13. TABLE XIII: Performance comparison for CS-IMDSM with varying θ M L subscript 𝜃 𝑀 𝐿 \theta_{ML} .

$θ$	$10^{\circ}$	$20^{\circ}$	$30^{\circ}$	$40^{\circ}$
Aperture/ $λ$	9.61	6.92	7.43	7.06
$\bar{Δ d} / λ$	0.80	1.15	1.06	1.01
Number of
dipoles	13	7	8	8
( $%$ decrease)	38	67	62	62
Error	0.14	2.85	3.01	3.24
Amplitude of
closest sidelobe (dB)	-31.07	-16.48	-14.96	-14.38
Computation
time (seconds)	440.75	381.60	453.65	370.06
Number of
iterations	13	8	9	9
Achieved
Mainlobe	$8^{\circ}$	$20^{\circ}$	$30^{\circ}$	$40^{\circ}$
$θ$	$50^{\circ}$	$70^{\circ}$	$80^{\circ}$	$90^{\circ}$
Aperture/ $λ$	7.59	6.04	2.13	2.21
$\bar{Δ d} / λ$	1.08	1.01	1.07	1.11
Number of
dipoles	9	7	3	3
( $%$ decrease)	57	67	86	86
Error	2.25	3.92	4.57	5.70
Amplitude of
closest sidelobe (dB)	-13.77	-14.46	-7.24	-4.80
Computation
time (seconds)	361.64	417.89	215.25	330.27
Number of
iterations	8	8	4	4
Achieved
Mainlobe	$48^{\circ}$	$65^{\circ}$	$90^{\circ}$	$88^{\circ}$

Table 14. TABLE XIV: Performance comparison for BCS-IMDSM with varying θ M L subscript 𝜃 𝑀 𝐿 \theta_{ML} .

$θ_{M L}$	$10^{\circ}$	$20^{\circ}$	$30^{\circ}$	$40^{\circ}$
Aperture/ $λ$	9.40	9.36	9.07	8.54
$\bar{Δ d} / λ$	0.94	0.94	0.91	0.95
Number of
dipoles	11	11	11	10
( $%$ decrease)	48	48	48	52
Error	1.18	2.04	2.46	2.23
Amplitude of
closest sidelobe (dB)	-10.24	-12.23	-12.57	-14.66
Computation
time (seconds)	5.05	5.49	19.06	4.81
Number of
iterations	11	11	11	11
Achieved
Mainlobe	$10^{\circ}$	$20^{\circ}$	$30^{\circ}$	$40^{\circ}$
$θ$	$50^{\circ}$	$70^{\circ}$	$80^{\circ}$	$90^{\circ}$
Aperture/ $λ$	9.40	9.26	9.54	9.51
$\bar{Δ d} / λ$	0.94	0.93	0.95	0.95
Number of
dipoles	11	11	11	11
( $%$ decrease)	48	48	48	48
Error	2.20	2.35	2.77	4.47
Amplitude of
closest sidelobe (dB)	-10.86	-7.32	-7.11	-15.58
Computation
time (seconds)	7.47	5.38	5.53	8.06
Number of
iterations	11	11	11	10
Achieved
Mainlobe	$49^{\circ}$	$69^{\circ}$	$72^{\circ}$	$80^{\circ}$

Table 15. TABLE XV: Performance comparison for AIRMS with varying θ M L subscript 𝜃 𝑀 𝐿 \theta_{ML} .

$θ_{M L}$	$50^{\circ}$	$70^{\circ}$
Aperture/ $λ$	10	10
$\bar{Δ d} / λ$	1.00	1.00
Number of
dipoles	11	11
( $%$ decrease)	48	48
Error	0.97	1.03
Amplitude of
closest sidelobe (dB)	-22.19	-17.00
Computation
time (seconds)	65.12	101.58
Number of
iterations	2	3
Achieved
Mainlobe	$51^{\circ}$	$69^{\circ}$

Table 16. TABLE XVI: Performance comparison for ULA with varying θ M L subscript 𝜃 𝑀 𝐿 \theta_{ML} .

$θ_{M L}$	$10^{\circ}$	$20^{\circ}$	$30^{\circ}$	$40^{\circ}$
Error	0.52	0.43	0.59	0.90
Amplitude of
closest sidelobe (dB)	-25.71	-22.51	-22.40	-26.95
Computation
time (seconds)	3.56	1.75	1.72	1.75
Number of
iterations	2	2	2	2
Achieved
Mainlobe	$11^{\circ}$	$20^{\circ}$	32^∘	38^∘
$θ$	$50^{\circ}$	$70^{\circ}$	$80^{\circ}$	$90^{\circ}$
Error	0.86	1.41	1.70	1.77
Amplitude of
closest sidelobe (dB)	-21.55	-15.61	-18.68	-14.84
Computation
time (seconds)	1.76	1.80	2.11	1.73
Number of
iterations	2	2	2	2
Achieved
Mainlobe	$50^{\circ}$	$67^{\circ}$	$84^{\circ}$	$90^{\circ}$

Table 17. TABLE XVII: Performance comparison for the off-broadside ( θ M L = 70 ∘ subscript 𝜃 𝑀 𝐿 superscript 70 \theta_{ML}=70^{\circ} and ϕ M L = − 90 ∘ subscript italic-ϕ 𝑀 𝐿 superscript 90 \phi_{ML}=-90^{\circ} ) design examples.

	CS-	BCS-
Example	IMDSM	IMDSM	AIRMS	ULA
Aperture/ $λ$	9.69	9.30	10	10
$\bar{Δ d} / λ$	0.97	0.93	0.91	0.50
Number of
dipoles	11	11	12	21
( $%$ decrease)	48	48	43	0
Error	2.08	1.71	1.03	1.42
Amplitude of
closest sidelobe (dB)	-14.36	-16.78	-17.92	-15.73
Computation
time (seconds)	374.07	3.75	67.35	1.24
Number of
iterations	11	11	4	2

Table 18. TABLE XVIII: Dipole locations and orientations for the off-broadside ( θ M L = 70 ∘ subscript 𝜃 𝑀 𝐿 superscript 70 \theta_{ML}=70^{\circ} and ϕ M L = − 90 ∘ subscript italic-ϕ 𝑀 𝐿 superscript 90 \phi_{ML}=-90^{\circ} ) CS-IMDSM design example.

n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$
1	0.19	4	3.09	7	6.03	10	9.05
2	1.21	5	3.92	8	7.03	11	9.88
3	2.13	6	5.00	9	8.03

Table 19. TABLE XIX: Dipole locations and orientations for the off-broadside ( θ M L = 70 ∘ subscript 𝜃 𝑀 𝐿 superscript 70 \theta_{ML}=70^{\circ} and ϕ M L = − 90 ∘ subscript italic-ϕ 𝑀 𝐿 superscript 90 \phi_{ML}=-90^{\circ} ) BCS-IMDSM design example.

n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$
1	0.38	4	3.44	7	6.27	10	8.88
2	1.27	5	4.41	8	7.16	11	9.68
3	2.44	6	5.36	9	8.02

Table 20. TABLE XX: Dipole locations and orientations for the off-broadside ( θ M L = 70 ∘ subscript 𝜃 𝑀 𝐿 superscript 70 \theta_{ML}=70^{\circ} and ϕ M L = − 90 ∘ subscript italic-ϕ 𝑀 𝐿 superscript 90 \phi_{ML}=-90^{\circ} ) AIRMS design example.

n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$	n	$d_{n} / λ$
1	0	4	2.63	7	5.40	10	8.13
2	0.90	5	3.50	8	6.47	11	9.07
3	1.83	6	4.60	9	7.33	12	10

Equations117

s_{s} (θ, ϕ)

s_{s} (θ, ϕ)

s_{p} (θ, ϕ, γ, η)

s_{p} (θ, ϕ, γ, η)

=

s_{f} (θ, ϕ, γ, η) = s_{p, f} (θ, ϕ, γ, η)) s_{s} (θ, ϕ) .

s_{f} (θ, ϕ, γ, η) = s_{p, f} (θ, ϕ, γ, η)) s_{s} (θ, ϕ) .

p (θ, ϕ, γ, η) = s (θ, ϕ, γ, η)^{T} w,

p (θ, ϕ, γ, η) = s (θ, ϕ, γ, η)^{T} w,

=

=

s (θ, ϕ, γ, η)

s (θ, ϕ, γ, η)

min ∣∣ w ∣ ∣_{1} subject to ∣∣ p_{r} - S w ∣ ∣_{2} \leq α,

min ∣∣ w ∣ ∣_{1} subject to ∣∣ p_{r} - S w ∣ ∣_{2} \leq α,

p_{r}

p_{r}

=

min

min

∣ ⟨ w ⟩ ∣_{1} = m = 1 \sum 3 M ∣∣ w_{m} ∣ ∣_{2}

∣ ⟨ w ⟩ ∣_{1} = m = 1 \sum 3 M ∣∣ w_{m} ∣ ∣_{2}

q min

q min

\hat{w}

\hat{w}

\hat{c}

\hat{p}_{r}

\hat{\textbf{S}}=\left(\begin{array}[]{cc}\boldsymbol{0}&\boldsymbol{0}\\ R(\textbf{s}_{x,1})&I(\textbf{s}_{x,1})\\ -I(\textbf{s}_{x,1})&R(\textbf{s}_{x,1})\\ \boldsymbol{0}&\boldsymbol{0}\\ R(\textbf{s}_{y,1})&I(\textbf{s}_{y,1})\\ -I(\textbf{s}_{y,1})&R(\textbf{s}_{y,1})\\ \vdots&\vdots\\ R(\textbf{s}_{Z,M})&I(\textbf{s}_{Z,M})\\ -I(\textbf{s}_{Z,M})&R(\textbf{s}_{Z,M})\\ \end{array}\right)^{T},

\hat{\textbf{S}}=\left(\begin{array}[]{cc}\boldsymbol{0}&\boldsymbol{0}\\ R(\textbf{s}_{x,1})&I(\textbf{s}_{x,1})\\ -I(\textbf{s}_{x,1})&R(\textbf{s}_{x,1})\\ \boldsymbol{0}&\boldsymbol{0}\\ R(\textbf{s}_{y,1})&I(\textbf{s}_{y,1})\\ -I(\textbf{s}_{y,1})&R(\textbf{s}_{y,1})\\ \vdots&\vdots\\ R(\textbf{s}_{Z,M})&I(\textbf{s}_{Z,M})\\ -I(\textbf{s}_{Z,M})&R(\textbf{s}_{Z,M})\\ \end{array}\right)^{T},

\hat{w} min

\hat{w} min

\hat{w} min

\hat{w} min

\hat{p}_{F} - \overset{˘}{S} w_{F}^{T} = \tilde{D}_{F},

\hat{p}_{F} - \overset{˘}{S} w_{F}^{T} = \tilde{D}_{F},

w_{F}

w_{F}

P (w_{F} ∣ \hat{p}_{F}) = \frac{P ( p ^ _{F} ∣ w _{F} ) P ( w _{F} )}{P ( p ^ _{F} )} .

P (w_{F} ∣ \hat{p}_{F}) = \frac{P ( p ^ _{F} ∣ w _{F} ) P ( w _{F} )}{P ( p ^ _{F} )} .

\textbf{w}_{F}=\max\limits_{\textbf{w}_{F}}\mathcal{P}\Bigg{(}\frac{\mathcal{P}(\hat{\textbf{p}}_{F}|\textbf{w}_{F})\mathcal{P}(\textbf{w}_{F})}{\mathcal{P}(\hat{\textbf{p}}_{F})}\Bigg{)}.

\textbf{w}_{F}=\max\limits_{\textbf{w}_{F}}\mathcal{P}\Bigg{(}\frac{\mathcal{P}(\hat{\textbf{p}}_{F}|\textbf{w}_{F})\mathcal{P}(\textbf{w}_{F})}{\mathcal{P}(\hat{\textbf{p}}_{F})}\Bigg{)}.

P (w_{F}) = \int P (w_{F} ∣ \overset{˘}{a}, \overset{σ}{˘}^{2}) P (\overset{˘}{a}) P (\overset{σ}{˘}^{2}) d \overset{˘}{a} d \overset{σ}{˘}^{2},

P (w_{F}) = \int P (w_{F} ∣ \overset{˘}{a}, \overset{σ}{˘}^{2}) P (\overset{˘}{a}) P (\overset{σ}{˘}^{2}) d \overset{˘}{a} d \overset{σ}{˘}^{2},

P (w_{F} ∣ \overset{˘}{a}, \overset{σ}{˘}^{2}) = (2 π \overset{σ}{˘})^{- 3 M} m = 1 \prod 3 M \overset{a}{˘}_{m} e^{- \frac{a ˘ _{m} w _{F, m}^{2}}{2 σ ˘ ^{2}}},

P (w_{F} ∣ \overset{˘}{a}, \overset{σ}{˘}^{2}) = (2 π \overset{σ}{˘})^{- 3 M} m = 1 \prod 3 M \overset{a}{˘}_{m} e^{- \frac{a ˘ _{m} w _{F, m}^{2}}{2 σ ˘ ^{2}}},

w_{F, o pt} =

w_{F, o pt} =

\displaystyle\max\limits_{\textbf{w}_{F}}\bigg{(}\int\frac{\mathcal{P}(\textbf{w}_{F}|\breve{\textbf{a}},\breve{\sigma}^{2})\mathcal{P}(\hat{\textbf{p}}_{F}|\textbf{w}_{F})\mathcal{P}(\breve{\textbf{a}})\mathcal{P}(\breve{\sigma}^{2})}{\mathcal{P}(\hat{\textbf{p}}_{F})}d\breve{\textbf{a}}d\breve{\sigma}^{2}\Bigg{)},

\textbf{w}_{F,opt}=\max\limits_{\textbf{w}_{F}}\bigg{(}\int\mathcal{P}(\textbf{w}_{F}|\hat{\textbf{p}}_{F},\breve{\textbf{a}})\mathcal{P}(\breve{\textbf{a}}|\hat{\textbf{p}}_{F})d\breve{\textbf{a}}\Bigg{)}.

\textbf{w}_{F,opt}=\max\limits_{\textbf{w}_{F}}\bigg{(}\int\mathcal{P}(\textbf{w}_{F}|\hat{\textbf{p}}_{F},\breve{\textbf{a}})\mathcal{P}(\breve{\textbf{a}}|\hat{\textbf{p}}_{F})d\breve{\textbf{a}}\Bigg{)}.

P (w_{F} ∣ \hat{p}_{F}, \overset{˘}{a}) = \int P (w_{F} ∣ \hat{p}_{F}, \overset{˘}{a}, \overset{σ}{˘}^{2}) P (\overset{σ}{˘}^{2}) d \overset{σ}{˘}^{2}

P (w_{F} ∣ \hat{p}_{F}, \overset{˘}{a}) = \int P (w_{F} ∣ \hat{p}_{F}, \overset{˘}{a}, \overset{σ}{˘}^{2}) P (\overset{σ}{˘}^{2}) d \overset{σ}{˘}^{2}

P (w_{F} ∣ \hat{p}_{F}, \overset{˘}{a}, \overset{σ}{˘}^{2}) P (\overset{σ}{˘}^{2}) =

P (w_{F} ∣ \hat{p}_{F}, \overset{˘}{a}, \overset{σ}{˘}^{2}) P (\overset{σ}{˘}^{2}) =

\displaystyle\mathcal{P}(\textbf{w}_{F}|\hat{\textbf{p}}_{F},\breve{\textbf{a}})=\bigg{(}\int_{0}^{\infty}t^{\beta_{MT-1}+(3M/2)-1}e^{-t}dt\bigg{)}\times

\displaystyle\mathcal{P}(\textbf{w}_{F}|\hat{\textbf{p}}_{F},\breve{\textbf{a}})=\bigg{(}\int_{0}^{\infty}t^{\beta_{MT-1}+(3M/2)-1}e^{-t}dt\bigg{)}\times

\displaystyle\frac{\big{(}1+\frac{1}{2\beta_{MT-2}}(\textbf{w}_{F}-\hat{\boldsymbol{\mu}}_{F})^{T}\hat{\boldsymbol{\Sigma}}^{-1}(\textbf{w}_{F}-\hat{\boldsymbol{\mu}}_{F})\big{)}^{-(\beta_{MT-2}+(3M/2))}}{\big{(}\int_{0}^{\infty}t^{\beta_{MT-1}-1}e^{-t}dt\big{)}\big{(}2\pi\beta_{MT-2}\big{)}^{(3M/2)}\sqrt{\det(\hat{\boldsymbol{\Sigma}})}},

\hat{μ}_{F}

\hat{μ}_{F}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Location and Orientation Optimisation for Spatially Stretched Tripole Arrays Based on Compressive Sensing

Matthew Hawesa, Lyudmila Mihaylovaa and Wei Liub

a Department of Automatic Control and Systems Engineering, University of Sheffield, S1 3JD, UK

b Department of Electronic and Electrical Engineering, University of Sheffield, S1 3JD, UK

{m.hawes, l.s.mihaylova, w.liu}@sheffield.ac.uk

Abstract

The design of sparse spatially stretched tripole arrays is an important but also challenging task and this paper proposes for the very first time efficient solutions to this problem. Unlike for the design of traditional sparse antenna arrays, the developed approaches optimise both the dipole locations and orientations. The novelty of the paper consists in formulating these optimisation problems into a form that can be solved by the proposed compressive sensing and Bayesian compressive sensing based approaches. The performance of the developed approaches is validated and it is shown that accurate approximation of a reference response can be achieved with a 67 $\%$ reduction in the number of dipoles required as compared to an equivalent uniform spatially stretched tripole array, leading to a significant reduction in the cost associated with the resulting arrays.

Index Terms:

Sparse array, spatially stretched, tripole, compressive sensing, Bayesian compressive sensing.

I Introduction

I-A Related Work

For uniform linear arrays (ULAs), an adjacent antenna separation of no larger than half of the operating wavelength is used to avoid the introduction of grating lobes [1, 2]. This can become prohibitive in terms of the cost associated with the number of antennas required. Instead, sparse arrays become a desirable alternative due to the fact that the nonuniform nature of their adjacent antenna separations avoids grating lobes even when the mean adjacent antenna separation is greater than half the operating wavelength [3].

However, the sidelobe behaviour of sparse arrays is unpredictable. This means that optimisation of the antenna locations is required in order to achieve a desired beam response. Such optimisation can be achieved by stochastic optimisation methods such as genetic algorithms (GAs) [4, 5, 6], and simulated annealing (SA)[7, 8]. Difference sets and almost difference sets have also been successfully used in the design of sparse arrays, [9, 10], and merged with GAs to help give an improved performance, [11, 12]. The disadvantage of GAs, and similar design methods, is the potentially long computation time and the possibility of convergence to a non-optimal solution.

More recently, the area of compressive sensing (CS) has been explored [13], and CS-based methods have been proposed in the design of traditional sparse arrays [14, 15, 16, 17, 18, 19]. CS theory says that when certain conditions are met it is possible to recover some signals from fewer measurements than used by traditional methods [13]. It is possible to use CS to design sparse sensor arrays by obtaining a close approximation of a desired beam response using as few array elements as possible.

Further work has also shown that it is possible to improve the sparseness of a solution by considering a reweighted $l_{1}$ norm minimisation problem [17, 20, 21, 22]. The aim of these methods is to bring the minimisation of the $l_{1}$ norm of the weight coefficients closer to that of the minimisation of the $l_{0}$ norm. To do this an iterative method is required to solve a series of reweighted $l_{1}$ minimisation problems, where locations with small weight coefficients are more heavily penalised than locations with large weight coefficients.

Alternatively, the problem can be converted into a probabilistic framework (termed Bayesian compressive sensing (BCS)) [23], with some suggested advantages to BCS as compared to traditional CS based implementations. However, an important point of interest is that the problem can be solved by the relevance vector machine (RVM) optimisation framework [24], which is efficient to use as also supported by the comparisons shown in the design examples section of this paper. Additionally, using BCS can remove the need to fine tune the error limits or sparsity associated with the implementations of CS above [25]. Such approaches have been applied in the design of sparse arrays with real valued and complex valued weight coefficients [26, 27, 28], where the multi-task BCS scheme [29], is applied in the case of complex valued weight coefficients.

The methods discussed above have been implemented assuming the arrays consist of isotropic array elements. As a result, the polarisation of a signal is not taken into account when considering the performance of an array. Instead arrays based on vector sensors, [19, 30], provide a desirable alternative as they allow the measurement of both the horizontal and vertical components of the received waveform. For example, the vector sensors used could be crossed dipoles (two orthogonally orientated dipoles) [19, 31, 32, 33], or tripoles (three orthogonally orientated dipoles) [34, 35].

When tripoles are used it is possible to measure the full electromagnetic (EM) field at a given point [35]. These arrays have been applied in the area of direction and polarisation estimation [34]. Due to the close proximity of the three orthogonal dipoles that make up each tripole there can be issues with mutual coupling when implemented in practice. As a result, the concept of spatially stretched tripoles (SST) has been developed and used in the area of direction of arrival (DOA) estimation [35]. An SST is a tripole where the three orthogonal dipoles are spread over a given geometry, leading to reduced mutual coupling effects.

I-B Contributions

In this work for the first time the problem of designing sparse SST arrays (SSSTAs) is addressed. Unlike for the design of traditional sparse arrays there are now two optimisation problems to solve, i.e. finding the optimal locations and orientations for the dipoles. It is proposed to use CS and BCS based design methods that go beyond the state of the art in order to solve these problems.

As a result, it is now necessary to formulate the problem to include the fact that there are three potential dipoles at each point on the sampling grid and the signal model now includes polarisation information (requiring alterations to the CS and BCS formulations). It is possible to avoid co-located dipoles by viewing them as a special case of the minimum adjacent dipole separation not meeting a physical size constraint [17]. However, if the methods in [17] are directly applied in this case, then although there will be a minimum spacing between antenna locations, there can still be multiple dipoles at each location. Therefore it is necessary to consider co-located dipoles as breaking the size constraint. Here, the design of SSSTAs utilising the size constraint is implemented in two ways: i) An iterative minimum distance sampling method (IMDSM) with CS and BCS; ii) an altered iterative reweighted minimisation scheme (AIRMS). When integrating the CS/BCS based method with the IMDSM it is also important to account for the response due to the previously fixed dipoles when deciding what the reference response in the current iteration is.

The remainder of this paper is structured as follows: Section II gives details of the proposed design methods, including the array model being used (II-A), a review of CS and BCS (II-B and II-C) and the proposed IMDSM and reweighted design methods for SSSTAs (II-D and II-E). In Section III design examples are presented to verify the effectiveness of the proposed methods and conclusions are drawn in Section IV.

II Proposed Design Methods

II-A Array Model

Figure 1 shows an example of a linear SSSTA. $M$ possible dipole locations are spread along the y-axis with an adjacent separation of $d$ . For each possible dipole location there are three potential orientation directions, one parallel to each axis. Also shown is a signal with its direction of arrival (DOA) defined by the angles $\theta$ and $\phi$ , with $0\leq\theta\leq\pi/2$ and $-\pi/2\leq\phi\leq\pi/2$ [34, 35]. A plane-wave signal model is assumed, i.e. the signal impinges upon the array from the far field.

The spatial steering vector of the array is given by

[TABLE]

where $\lambda$ is the wavelength of interest and $\{.\}^{T}$ indicates the transpose operation. The spatial-polarization coherent vector, which contains information about a signal’s polarisation and is given by [34, 35]:

[TABLE]

where $\gamma\in[0,\pi/2]$ is the auxiliary polarization angle and $\eta\in[-\pi,\pi)$ is the polarization phase difference.

Now the array can be split into three sub-arrays, one parallel to each axis. With $f\in\{x,y,z\}$ , the steering vector of each sub-array is given by:

[TABLE]

The response of the array is given by

[TABLE]

with

[TABLE]

where $w_{1}=w_{x,1}$ is the complex weight coefficient for the dipole located at the point $m=1$ and orientated parallel to the $x$ -axis and $\{.\}^{H}$ denotes the Hermitian transpose. Note that for an SSSTA if $w_{x,1}\neq 0$ , then $w_{y,1}=w_{z,1}=0$ , as there can be only one dipole present. Similarly

[TABLE]

where $s_{x,1}(\theta,\phi,\gamma,\eta)$ is the contribution of the dipole located at the point $m=1$ to the overall steering vector parallel to the $x$ -axis.

II-B Compressive Sensing for SSSTA Design

Suppose $P_{r}(\theta,\phi,\gamma,\eta)$ is the desired beam response as a function of $\theta,\phi,\gamma$ and $\eta$ . Then the problem is to match the designed response to this desired response for the full range of $\theta,\phi,\gamma$ and $\eta$ values of interest while finding the optimised dipole locations and orientations.

First, consider Figure 1 as being a grid of potential dipole locations. Here $M$ is a large number and sparseness is then introduced by selecting the weight coefficients to give as few active dipoles as possible, or in other words as few non-zero valued weight coefficients as possible, while still giving a designed response close to the desired one. Note, a large $M$ means it is more likely that the optimal locations will appear on the grid thus allowing for a better performance. However, the tradeoff is that if $M$ is too large the efficiency of the algorithm deteriorates.

This problem is formulated as

[TABLE]

where $||\textbf{w}||_{1}$ is the $l_{1}$ norm of the weight coefficients [13], $\textbf{p}_{r}$ is the vector holding the desired beam response at the sampled angular and polarisation points of interest, S is the matrix composed of the corresponding steering vectors, and $\alpha$ places a limit on the allowed difference between the desired and the designed responses. Minimising the $l_{1}$ norm has the effect of minimising the number of dipoles used, while the constraint ensures a reasonable approximation of the ideal reference response is achieved. If the size of $\alpha$ is increased, more error can be introduced into the final response, which would be expected to allow a sparser solution to be achieved. Note, $||.||_{2}$ indicates the $l_{2}$ norm.

In detail, $\textbf{p}_{r}$ and S are respectively given by

[TABLE]

where $L$ is the number of points sampled at each dimension of the desired beam response. In this work $\textbf{p}_{r}$ is the ideal response, i.e. a value of one for the mainlobe and zeros for the other entries. Note, $L$ has to be large enough to ensure all angular and polarisation points of interest are considered.

Since the coefficients are complex valued, (14) can be reformulated as a modified $l_{1}$ norm minimisation [36]:

[TABLE]

where

[TABLE]

and $\textbf{w}_{m}=[R(w_{m}),I(w_{m})]^{T}$ for $m=1,\ldots 3M$ contains the real and imaginary components of the complex weight coefficient given by the $m^{th}$ entry in w. Here, the variable $q$ has been introduced and requires minimising. By keeping $|\langle\textbf{w}\rangle|_{1}$ less than this value the effect is to minimise the $l_{1}$ norm of all of the absolute weight coefficients.

Now decompose $q$ to $q=\sum_{m=1}^{3M}q_{m}$ , $q_{m}\in\;\;\mathbb{R}^{+}$ , to reformulate (17). Note, the upper limit on the sum is $3M$ as there are $3$ potential dipole orientations at each location.

In vector form, $q=\textbf{1}^{T}\textbf{q},$ where $\textbf{1}^{T}=[1,\cdots,1]$ and $\textbf{q}~{}=~{}[q_{1},\cdots,q_{3M}]^{T}$ . Then (17) can be rewritten as

[TABLE]

Note, a value of $q_{m}=0$ , means the second constraint in (19) ensures that the real and imaginary parts of the weight coefficient contained in $\textbf{w}_{m}$ will both be equal to zero. This allows the desired sparsity to be introduced.

Now define

[TABLE]

and

[TABLE]

where $R(.)$ is the real component and $I(.)$ is the imaginary component. Then, the final formulation is as follows

[TABLE]

Note, the values $q_{m}$ for $m=1,\ldots,3M$ are included with the weight coefficients in $\hat{\textbf{w}}$ . This is so that it is not necessary to predefine their values, instead the algorithm finds them at the same time as the optimised weight coefficients. As a result, it is necessary for the vector $\hat{\textbf{c}}$ to select the values $q_{m}$ for minimisation and the zeros are introduced into $\hat{\textbf{S}}$ to ensure the same values do not contribute to the error between the ideal reference response and the achieved response in the first constraint in (23). Finally, as the weight coefficients have been split into real and imaginary parts, the response given by the product $\hat{\textbf{S}}\hat{\textbf{w}}^{H}$ will contain the real and imaginary parts of the achieved response separately. This means the reference pattern has to be split in a similar manner giving (21).

However, unlike the $l_{0}$ norm, the $l_{1}$ norm does not penalise all non-zero valued coefficients equally. Instead, larger coefficients are penalised more heavily. To further improve the sparseness of the array and get a better approximation of the $l_{0}$ norm minimisation, large reweighting terms can be applied to the smaller weight coefficients so that they are penalised more heavily [17, 18, 20, 21, 22].

When applied to the above modified $l_{1}$ norm minimisation problem we get the following

[TABLE]

where now $\hat{\textbf{c}}=[\delta^{i}_{1},0,0,\delta^{i}_{2},0,0,\ldots,\delta^{i}_{3M},0,0]^{T}$ and $\delta^{i}_{m}~{}=~{}(|w_{m}^{i-1}|+\epsilon)^{-1}.$ Here $i$ is the current iteration, $\hat{\textbf{w}}$ holds the current estimate of the weight coefficients, $w_{m}^{i-1}$ contains the weight coefficients, from the previous iteration, for the $m^{th}$ dipole and $\epsilon$ is a small value roughly equal to the minimum desired weight coefficient. The iterative algorithm would then follow the steps below:

Set $i=0$ and find an initial estimate of the weight coefficients by solving (23). 2. 2.

$i=i+1$ , and find the reweighting terms $\delta_{m}^{i}$ . 3. 3.

Solve (24). 4. 4.

Repeat steps 2 to 3 until $||\textbf{w}^{i}||_{0}=||\textbf{w}^{i-1}||_{0}=||\textbf{w}^{i-2}||_{0}$ i.e. until the number of active locations has remained the same for three iterations. Here define $\textbf{w}^{i}=[w_{1}^{i},w_{2}^{i},\ldots,w_{3M}^{i}]^{T}$ .

The addition of the reweighting term, which is calculated using coefficients from the previous iteration, means all non-zero valued coefficients are penalised in a more uniform manner.

It is worth noting that as it stands the solutions to (23) and (24) do not strictly give an SSSTA in the result. This is because currently there is no way of guaranteeing there can only be a single dipole at a given location. In other words the proposed methods are in effect finding a sparse weight coefficient vector without considering the locations of the associated dipoles. The methods detailed in Section II-D and Section II-E can both be used to overcome this issue and ensure that there are no co-located dipoles, guaranteeing an SSSTA.

II-C Bayesian Compressive Sensing for SSSTA Design

When considering BCS for sparse array design, [26, 27, 28, 37], there are two formulations of BCS that can be used. Firstly there is a single task (ST) BCS formulation [23] which can be implemented using a RVM [24, 38]. Alternatively multi task (MT) BCS, [29], can be used when there are multiple CS measurements and the statistical relationships between them can be exploited. This could include measurements at multiple time instances, or in the case of sparse array design if multiple or complex weight coefficients have to be minimised. As a result MT-BCS is well suited to the problem being addressed and is formulated in what follows. However, the ST-BCS based design methodology for SSSTA design is provided in the appendix for the interested reader.

Firstly, consider matching the real and imaginary parts of the achieved array response to that of the ideal reference response:

[TABLE]

where $F\in\{R,I\}$ , $\tilde{\textbf{D}}_{R}$ and $\tilde{\textbf{D}}_{I}$ are zero mean Gaussian error vectors, with a variance of $\breve{\sigma}^{2}$ , $\textbf{w}_{R}=R(\textbf{w})$ , $\textbf{w}_{I}=-I(\textbf{w})$ , $\breve{\textbf{S}}=[R(\textbf{S})^{T},I(\textbf{S})^{T}]^{T}$ , $\textbf{p}_{r}=\textbf{p}_{R}+j\textbf{p}_{I}$ , $\hat{\textbf{p}}_{R}=[R(\textbf{p}_{R}),I(\textbf{p}_{R})]^{T}$ and $\hat{\textbf{p}}_{I}=[R(\textbf{p}_{I}),I(\textbf{p}_{I})]^{T}$ . The problem now is to find the solutions to solve

[TABLE]

It is known that for the likelihood function $\mathcal{P}(\hat{\textbf{p}}_{F}|\textbf{w}_{F})$ and the priors $\mathcal{P}(\textbf{w}_{F})$ and $\mathcal{P}(\hat{\textbf{p}}_{F})$ , the following applies

[TABLE]

This allows the problem to be written as

[TABLE]

The prior $\mathcal{P}(\textbf{w}_{R})$ is the same as $\mathcal{P}(\textbf{w}_{I})$ to model the relationship between the real and imaginary parts of the weight coefficients, while still enforcing sparsity. It is given by $\mathcal{P}(\textbf{w}_{F})$ and found as follows:

[TABLE]

where $\mathcal{P}(\breve{\textbf{a}})$ is the multi-task shared hyperpriors, $\breve{\textbf{a}}~{}=~{}[\breve{a}_{1},\breve{a}_{1},...,\breve{a}_{1}]^{T}$ , given by a Gamma distribution, and $\mathcal{P}(\breve{\sigma}^{2})$ is a shared Gamma hierarchial prior, where

[TABLE]

gives

[TABLE]

which after integrating over $\breve{\sigma}^{2}$ and simplifying gives:

[TABLE]

Equation (30) considers $3M$ points as there are three potential dipoles at each location.

Note,

[TABLE]

and from Bayes’ theorem

[TABLE]

From (30), the fact that a Gamma hierarchial prior is placed on $\mathcal{P}(\breve{\sigma}^{2})$ and the fact that $\mathcal{P}(\textbf{w}_{F}|\hat{\textbf{p}}_{F},\breve{\sigma}^{2})$ can be modelled as a Gaussian likelihood, then

[TABLE]

where $\beta_{MT-1}$ and $\beta_{MT-2}$ are parameters associated with the MT-BCS process chosen to encourage sparsity. In (II-C) the mean and covariance are given by:

[TABLE]

respectively, where $\hat{\textbf{A}}=\text{diag}(\breve{\textbf{a}})=\text{diag}(\breve{a}_{1},\breve{a}_{2},\ldots,\breve{a}_{3M})$ . Note, this gives a Student’s t-distribution for $\mathcal{P}(\textbf{w}_{F}|\hat{\textbf{p}}_{F},\breve{\textbf{a}})$ .

When considering the remaining term in (32) a delta function approximation can be used [27]. This is because a closed-form solution is not possible. Note,

[TABLE]

with a mode given by

[TABLE]

where

[TABLE]

As the mode of a student-t distribution is equal to its mean the resulting weight coefficients are given by [27]

[TABLE]

The final optimal weight coefficient vector is then given by

[TABLE]

Note, that as for the CS formulation discussed in the previous subsection the MT-BCS scheme detailed here is unable to guarantee an SSSTA as an outcome. This is because it is in effect finding a sparse weight coefficient vector without considering where the associated dipoles are located. As a result, it is possible that there could be multiple dipoles present at the optimised locations (optimised locations refers to the locations with one or more non-zero valued weight coefficients). This means the desired reduction in mutual coupling effects when implemented in practice will not be achieved. Instead to ensure an SSSTA the methods discussed in the following subsections should be considered.

II-D Iterative Minimum Distance Sampling Method for SSSTAs

In the above two formulations, there is no way to ensure that an SSSTA is achieved. This is due to the fact that only the weight coefficients associated with a given dipole are minimised, rather than considering if there are any co-located dipoles.

To solve this problem it is proposed to extend the idea of imposing a physical size constraint on the optimisation from [17]. However, when directly applied these methods only ensure that there is a minimum distance between the optimised antenna locations. Therefore, in this instances they could not guarantee an SSSTA as there can potentially be three dipoles at each antenna location. As a result, it is necessary to also consider the fact that co-located dipoles at a given location can also be seen as breaking the minimum separation of a physical size constraint. In this work we use the idea of the IMDSM and AIRMS algorithms proposed in [17] to ensure an SSSTA is achieved as the final solution.

Note, that the iterative nature of the IMSDM based approaches means that the relationship between $M$ or $\alpha$ and the algorithms performance becomes less predictable. Consider the fact that the value of $M$ used affects where the first dipole is located. This then defines the remaining aperture, which is again sampled using $M$ grid points. As a result the density of the sampling grid in the next iteration varies depending on where the previous dipole was placed and the value of $M$ , which in turn makes it difficult to predict how the performance will be effected by $M$ . The effects of $\alpha$ can also be hard to predict for similar reasons.

II-D1 CS Based IMDSM

To begin with, the full aperture of the array is uniformly sampled and an estimate of the weight coefficients found using (23), with the first cluster of dipoles that are too close together being merged to give the first location as shown in Figure 2. At this point if there are multiple dipoles at the merged location the least significant are discarded to leave a single dipole present. The remainder of the aperture is then uniformly sampled, ensuring that the next dipole will be at least the distance of the size constraint away. This process is then repeated until there is no room for further dipoles.

It is worth noting that this method has involved the merger of dipole locations and has the potential for some dipoles to be discarded in order to avoid co-located dipoles. As a result the weight coefficients may no longer be optimal for the given dipole locations and orientations. However, the locations and orientations can be used to efficiently implement a fixed beamformer, by minimising the sidelobe levels while keeping a unitary response for the mainlobe location. This is detailed below in Section II-D3.

II-D2 MT-BCS Based IMDSM

In essence the same iterative procedure is followed in this instance. The initial set of weight coefficients used to find the first cluster is instead found using the MT-BCS procedure detailed in Section II-C. For subsequent iterations some changes have to be made to ensure that the method of solving the problem can account for the fact that some dipole locations and orientations have been fixed and will be contributing to the overall response.

As a result, consider the following

[TABLE]

where $\check{\textbf{p}}_{R}$ and $\check{\textbf{p}}_{I}$ are found by subtracting the response due to the locations fixed in the previous iteration from the reference response in the previous iteration. Then from the remaining uniformly sampled aperture in the current iteration we construct $\check{\textbf{S}}$ and the resulting estimate of the weight coefficients are given by $\check{\textbf{w}}=\check{\textbf{w}}_{R,opt}+j\check{\textbf{w}}_{I,opt}$ . Following the MT-BCS scheme detailed in Section II-C the solution is

[TABLE]

This process is repeated, with the merging and discarding of dipoles. As a result it is again necessary to use the method for redesigning the weight coefficients detailed below.

II-D3 Fixed Beamformer Design for Given Dipole Locations and Orientations

After obtaining the dipole locations and orientations using the CS-IMDSM or BCS-IMDSM, it is necessary to re-design the coefficients of the array to provide a closer approximation to the desired responses. This is a classic fixed beamformer design problem and can be solved using the method described below, which is applicable to any arbitrary array geometry.

The redesign of the weight coefficients is achieved by minimising the sidelobe levels subject to a unitary response for the mainlobe direction. This can be formulated as

[TABLE]

where $\hat{\textbf{w}}_{mask}=[\textbf{w}_{mask},\textbf{w}_{mask}]^{T}$ and $\textbf{w}_{mask}$ is a series of 1s and 0s to ensure only the correct dipole orientations are used, $\hat{\textbf{w}}_{re}=[R(\textbf{w}_{re}),I(\textbf{w}_{re})]^{T},$ $\tilde{\textbf{S}}=\left(\begin{array}[]{cc}R(\textbf{S})&-I(\textbf{S})\\ I(\textbf{S})&R(\textbf{S})\\ \end{array}\right)$ , $\tilde{\textbf{S}}_{ML}$ only considers the mainlobe direction and $\circ$ denotes the Hadamard product.

II-E Altered Iterative Reweighted Minimisation Scheme for SSSTAs

To avoid the merging and discarding of dipoles as required for IMDSM, this work also proposes an AIRMS. Here the reweighting scheme in (24) is adapted to also penalise dipole locations that are too close together [17]. This gives the following reweighting scheme

[TABLE]

Now the iterative procedure is repeated until a solution that complies with the size constraint being enforced is obtained.

Unfortunately, this algorithm will not always guarantee a viable solution, due to the presence of $\epsilon$ in the calculation of reweighting terms. The inclusion of $\epsilon$ is required for numerical stability, but prevents a zero weight coefficient in the current iteration guaranteeing a zero weight coefficient in the next iteration. Based on the authors’ experience with different design parameters, if a solution is possible it will usually be achieved in less than $10$ iterations.

It is also hard to predict if a solution will be achieved, or the performance level achieved, based on the selection of $M$ . This is as the choice of $M$ greatly effects how likely we are to get a solution that meets the size constraint value. It may be expected that increasing $M$ should allow an improvement in the algorithms performance as it is more likely to get the optimal locations included on the sampling grid. This also makes it more likely that two or more dipoles will be located closer together than the size constraint making it harder to get a valid solution.

III Design Examples

This section provides design examples to verify the effectiveness of the proposed methods. All examples are implemented on a computer with an Intel Xeon CPU E3-1271 (3.60GHz) and 16GB of RAM.

For all of the figures that follow positive values of $\theta$ indicate the value range $\theta\in[0^{\circ},\;90^{\circ}]$ for $\phi=90^{\circ}$ , while negative values of $\theta\in[-90^{\circ},\;0^{\circ}]$ indicate an equivalent range of $\theta\in[0^{\circ},\;90^{\circ}]$ with $\phi=-90^{\circ}$ .

Here a broadside design example and two off-broadside design examples are considered to illustrate the effectiveness of the proposed design methods, when designing linear SSSTAs. Although the AIRMS does not necessarily require the weight coefficients to be redesigned, they have been here in order to allow a fairer comparison between all three design methods considered. Unless otherwise stated, the examples consider the scenario of $M=301$ with a maximum possible aperture of 10 $\lambda$ . For the design examples using MT-BCS the values of $\beta_{MT-1}$ and $\beta_{MT-2}$ are set as suggested in [29], with the value of $\sigma^{2}$ being found from the CS-IMDSM and AIRMS design examples. In this work the CS-IMDSM and AIRMS are implemented using cvx, a package for specifying and solving convex programs [39, 40].

Note, the selection of $M$ has been made to get close to the sampling density suggested in [21], while also accounting for the fact that the proposed methods have to consider three antennas at each grid point rather than a single antenna. As discussed for the proposed methods it is also hard to predict how changing $M$ will effect the performance of the algorithms (in the case of the AIRMS a solution is not even always guaranteed). Experience with different design examples suggest that $M=301$ for a $10\lambda$ aperture usually ensures a suitable solution will be achieved by at least one of the three proposed methods.

For the three examples the response from an equivalent ULA is also provided as a further comparison. To ensure optimised dipole locations and orientations for the ULAs, solve the minimisation in (LABEL:eq:redesign1) with $\textbf{w}_{mask}=[1,1,1,\ldots,1]^{T}$ to allow the three dipole orientations at each location to be considered. Then a new $\textbf{w}_{mask}$ is constructed in order to keep only the most significant dipole orientations at each location. The minimisation in (LABEL:eq:redesign1) is then resolved to give the final optimised dipole orientations and locations.

III-A Broadside Example

For the broadside design example, the mainlobe is given by $\theta_{ML}=0^{\circ}$ for $\phi_{ML}=90^{\circ}$ , with the sidelobe regions defined by $\theta_{SL}=[10^{\circ},90^{\circ}]$ for $\phi_{SL}=\pm 90^{\circ}$ and being sampled every $1^{\circ}$ . The polarisation information is given by $\gamma=45^{\circ}$ and $\eta=100^{\circ}$ . For the CS-IMDSM and AIRMS examples the value of $\alpha=0.5$ is used.

The responses for the CS-IMDSM, BCS-IMDSM and AIRMS design examples are shown in Figure 3. For all three of the proposed methods the correct mainlobe location has been achieved (whereas the ULA example gave a $1^{\circ}$ error), along with sufficient sidelobe attenuation. For completeness the resulting dipole locations are shown in Tables I, II and III, respectively, where it is clear the size constraint has been successfully enforced in all cases. Figures. 4, 5 and 6 illustrate the orientations of the dipoles for each of the three broadside examples and the ULA orientations are shown in Figure 7. Note, the dipole positions shown in the figures do not accurately reflect the true dipole locations. The true locations should instead be determined from the corresponding tables provided.

The following performance measures are summarised in Table IV: aperture length, mean adjacent dipole separation ( $\overline{\Delta{d}}$ ), number of dipoles required (also given as a $\%$ reduction as compared to an equivalent ULA), $l_{2}$ norm of the error between the desired and achieved responses ( $||\textbf{p}_{r}-\textbf{Sw}_{opt}||_{2}$ , where $\textbf{w}_{opt}$ are the optimised weight coefficients for a given method), the amplitude of the peak sidelobe closest to the mainlobe, the computation time and the number of iterations required by each method.

Firstly, as expected, it can be seen that there are reasonably small error values, suggesting that a good match to the desired response has been achieved in each case. For two of the three proposed methods the error between the designed and desired response is less than that for the ULA. This suggests a better approximation of the ideal response has been achieved, despite requiring less dipoles (48 $\%$ less for BCS-IMDSM and 57 $\%$ less for AIRMS) and the introduction of sparsity. It can also be seen that by comparing the values of $\overline{\Delta{d}}$ a comparable amount of sparseness has been introduced by each of the design methods, with the BCS-IMDSM performing slightly better (and also giving the lowest response error).

When considering the computation time it can be seen that there is a difference between the three methods. The AIRMS has given a shorter computation compared to the CS-IMDSM which is explained by the fact that it requires fewer iterations as dipoles are not placed individually. There is also a significant reduction in the computation time between the CS-IMDSM and BCS-IMDSM design examples. This would suggest that the BCS-IMDSM design method is the more computationally efficient IMDSM based design method. The authors’ experience with different design examples also suggests that this is consistently the case and that the difference increases with the problem size.

To illustrate the effects of the value of $M$ used, now consider the same design example again with the values $M=101,201$ and $M=401$ , along with the original value of $M=301$ . The performance measures for the three proposed methods are summarised in Tables V-VII.

As expected, increasing the value of $M$ has increased the computation for the three proposed design methods. This is because the design methods now consider a larger sampling grid for each iteration, which in turn means a longer computation time. However, the effect on the other performance measures used has proven to be harder to predict.

For each of the design methods varying $M$ can alter the aperture of the designed array and the dipoles required to implement it in practice. The mean adjacent dipole separation has remained reasonably constant and for the CS-IMSDM method the smallest separation has even occurred for the largest value of $M$ . However, for the design of traditional sparse arrays using CS-based methods, increasing the value of $M$ would lead to an expected increase in the mean adjacent dipole separation. This is because a denser grid will be able to give a closer approximating to the ideal locations and as a result uses less dipole in total. By looking at the error between the designed responses and the ideal response, along with the amplitudes of the closest sidelobes, it can be seen that the effect on the desirability of the designed response is similarly hard to predict in advance. The same is true when off-broadside examples are considered. So for the remainder of this broadside design example and the two off-broadside design examples that follow only the original value of $M=301$ is used.

Finally, now consider the effect of $\alpha$ on the performance of the CS-IMDSM and AIRMS for the broadside design example. Two further values of $\alpha$ will be considered, $\alpha=0.35$ and $0.65$ , respectively. The performance of the two methods for these values is summarised in Table VIII. For traditional CS based problems it would be expected to see that increasing the value of $\alpha$ would increase the amount of error allowed, thus allowing extra sparsity to be introduced. However, here we can see the iterative nature of the algorithms has made predicting the effects of $\alpha$ difficult. As a result, in what follows a single value of $\alpha$ that gives a solution for both methods will be used in the off-broadside examples to allow a fair comparison. Note, the reason why no results are shown for AIRMS with $\alpha=0.35$ is that no solution was obtained in this case.

III-B Off-Broadside Example 1

For the first off-broadside design example consider a mainlobe location of $\theta_{ML}=60^{\circ}$ for $\phi_{ML}=90^{\circ}$ , with the sidelobe regions defined as $\theta_{SL}=[0^{\circ},50^{\circ}]\bigcup[70^{\circ},90^{\circ}]$ for $\phi=90^{\circ}$ and $\theta_{SL}=[0^{\circ},90^{\circ}]$ for $\phi=-90^{\circ}$ , which are sampled every $1^{\circ}$ . The polarisation information is given by $\gamma=55^{\circ}$ and $\eta=100^{\circ}$ . The value $\alpha=0.75$ is used to place a limit on the allowed error in responses.

Figure 8 shows the resulting responses for the three design examples. The CS-IMDSM design example has the mainlobe at the correct location, while for the other two examples and the ULA comparison the mainlobe is located at $\theta=59^{\circ}$ . In all three cases sufficient sidelobe attenuation has also been achieved. Again, for completeness the resulting dipole locations and orientations are shown in Tables IX, X and XI and Figures 9, 10 and 11, respectively, where it is clear the size constraint has been successfully enforced in all three cases. The comparison ULA dipole orientations are shown in Figure 12. Note, the distances in the dipole orientation figures are again not intended to be accurate. Instead, the dipole location information should be taken from the tables provided.

Table XII compares the performance measures for the off-broadside design examples. The first thing to note is that the error in the responses has significantly been increased for all three cases. This is expected as we used a larger value of $\alpha$ and can be predicted after having looked at the three designed beam responses. It can be seen that the BCS-IMDSM has given the most accurate estimate of the desired response (as compared to the CS-IMDSM and AIRMS), but this has come at the expense of a reduced adjacent dipole separation. Again, the BCS-IMDSM has been shown to be the most computationally efficient of the proposed SSSTA design methods. Finally, although the error values show a worse approximation of the ideal response has been achieved by the proposed methods, as compared to the comparison ULA, a reasonable approximation has still been achieved despite using less dipoles and the introduction of sparsity.

For completeness, now consider how the methods perform over the full range of potential off-broadside mainlobe directions. These results are summarised in Tables XIII-XV, where the designed mainlobe locations have also been added for reference. Note, if one of the additional values of $\theta_{ML}$ is missing, it is because no solution was possible. In addition, the comparison ULA results are summarised in Table XVI.

Here, it can be seen that there are varying performance levels for the three methods, helping illustrate that the same method is not always guaranteed to perform the best. The first thing that can be seen is that the AIRMS has not managed to get a solution for the majority of the values of $\theta_{ML}$ . However, the important thing is that one of the three solutions always appears to give an acceptable approximation of the reference response, with a reduction in the number of dipoles required, for each angle of interest. Also, if desired a uniform SST array can be designed using the method details provided for the ULA comparisons. One thing that does seem constant is that the BCS-IMDSM is the most efficient of the three proposed methods. As similar patterns can be expected for the second off-broadside design example, only one value of $\theta_{ML}$ will be considered.

III-C Off-broadside Example 2

In the third design example, the mainlobe is defined by $\theta_{ML}=70^{\circ}$ and $\phi_{ML}=-90^{\circ}$ , with the sidelobe regions given by $\theta_{SL}=[0^{\circ},90^{\circ}]$ for $\phi_{SL}=90^{\circ}$ and $\theta_{SL}=[0^{\circ},60^{\circ}]\bigcup[80^{\circ},90^{\circ}]$ for $\phi_{SL}=-90^{\circ}$ . The value of $\theta$ is then sampled every $1^{\circ}$ in the sidelobe regions. Finally, consider the values $\gamma=60^{\circ},\eta=-10^{\circ}$ and $\alpha=0.8$ . This results in the responses shown in Figure 13, with the various performance measures being summarised in Table XVII.

Firstly, the mainlobe for the AIRMS example is within $1^{\circ}$ of what was desired with the other three mainlobes being within $3^{\circ}$ . As a result, it is clear that the AIRMS has achieved a mainlobe direction closer to the desired direction than the comparison ULA. Although the mainlobe for the CS-IMDSM and BCS-IMDSM are not as accurately located, they are still close enough ensuring that there is not significant suppression of signals from the desired location. In addition, they are no worse than the comparison ULA in this regard. This, along with the fact that sufficient sidelobe attenuation has been achieved, suggests that an acceptable response has been achieved by the proposed methods. However, comparing the error values shows that the CS-IMSDM and BCS-IMSDM have given an approximation of the ideal achieved response that is worse than with the ULA. Although, this is done using less dipoles (48 $\%$ less for both methods) and a larger adjacent dipole separation. We can also see that the AIRMS has given a similar reduction in the number of dipoles required (43 $\%$ rather than 48 $\%$ ), while also giving a better approximation of the ideal response than achieved by the ULA. In this instance the BCS-IMDSM has proven to have the best computational efficiency. For completeness the dipole locations and orientations are given in Tables XVIII-XX and Figures 14-17, respectively. As for the previous examples the true dipole locations should be taken from the tables provided.

III-D Discussion

This subsection presents a discussion of the main results in light of the implications for optimal parameter selection. These points can be summarised as follows:

From the broadside design example it can be seen that increasing the value of $M$ always increases the computation time as more grid points are being considered. For CS or BCS it would be reasonable to expect that increasing $M$ would improve the solution in terms of sparsity and desirability of the achieved response. The iterative nature of the algorithms makes it harder to predict the effects on error between the reference and achieved responses and the number of dipoles required. Experience suggests that $M=301$ is the best tradeoff to make. 2. 2.

The iterative nature of the algorithms has also made it difficult to predict the effects of varying the value of $\alpha$ for the CS-IMDSM and AIRMS. It is worth noting two points. Firstly a value of $\alpha=0$ would mean that the approximation of the reference pattern would have to be exact. This is unlikely to be possible when the ideal response is used. Secondly, a value of $\alpha=1$ will result in a response of all zeros and no dipoles being used, as $||[0,0,...,1,...,0,0]^{T}~{}-~{}[0,0,...,0,...,0,0]^{T}||_{2}=1$ . 3. 3.

The value of $L$ has to be large enough to consider all the angular and polarisation points of interest, as an acceptable response can not be guaranteed for the points not directly considered. Increasing $L$ further when this has been achieved adds computational complexity for no further gain in desirability of the array’s response. 4. 4.

The off broadside design examples provided indicate that one of the three methods (or alternative the method for designing a comparison uniform SST array) can be used for all off-broadside mainlobe directions of interest. However, a single method can not be guaranteed to perform best in all cases.

It is also worth considering the problem of selecting which of the three proposed methods should be used in a given situation. There are 4 criteria to be considered: guarantees of a solution, the sparsity introduced, error between the reference and designed responses and the computational efficiency. These 4 points are now considered in turn and recommendations made about which method to use.

Guarantees of a solution: The results provided show that the AIRMS was the only one not to always give a solution. This would suggests using one of the other two methods when guarantees of a solution is the overriding factor. The selection of which of the remaining two methods should be used depends on which of the remaining criteria are prioritised. 2. 2.

The sparsity introduced: The results given above indicate that the CS-IMDSM tends to give the sparsest solution (followed by the BCS-IMDSM and then AIRMS) so should be selected when this criterion is the most important. 3. 3.

Error between reference and designed responses: In terms of the amount of error between the reference and designed responses the BCS-IMDSM has been shown to give the best performance. This can be explained by the fact that the less dipoles used the more error is expected and the BCS-IMDSM method had lower levels of sparsity than the CS-IMDSM (while the AIRMS was not always guaranteed to give a solution). 4. 4.

The computational efficiency: If computational efficiency is prioritised over the other criteria the authors would suggest considering the BCS-IMDSM as the results consistently show it is the most efficient method (followed by AIRMS, when it gives a solution, and CS-IMDSM).

IV Conclusions

In this work the problem of designing sparse SSSTA has been addressed for the first time. Novel CS and BCS based approaches have been proposed to solve the problem of simultaneously optimising dipole locations and orientations, with a minimum spacing being used to avoid co-located dipoles. Design examples have been provided and show that an accurate approximation of a reference pattern can be achieved using fewer dipoles than a comparable uniform SST array (38 $\%$ -86 $\%$ reduction in the number of dipoles). This work has focused on the design of linear SSSTAs for a single signal polarisation of interest. In order to fully control a wide range of signal polarisations a planar array may be necessary. Extending the proposed approaches to this case is seen as an area for future research.

Appendix

IV-A Single Task Bayesian Compressive Sensing for Spatially Stretched Sparse Tripole Array Design

When looking to use ST-BCS to design SSSTA the problem can be considered in a similar form to what is done when designing traditional sparse arrays [26, 27, 28, 37]:

[TABLE]

where D is a zero mean Gaussian error vector. The variance of D is proportional to the limit placed on the allowed error between the desired and achieved response, i.e. $\sigma^{2}\propto\alpha$ . These complex values can be split into real and imaginary parts giving

[TABLE]

where

[TABLE]

and

[TABLE]

Now model $\hat{\textbf{p}}_{r}$ as a Gaussian likelihood

[TABLE]

The problem of finding the optimal sensor locations is then solved by maximising the a-posteriori probability $\mathcal{P}(\tilde{\textbf{w}},\sigma^{2}|\hat{\textbf{p}}_{r})$ while also enforcing a belief that the weight coefficient vector should also be sparse.

This sparse belief can be enforced by using the Gaussian hierarchial prior

[TABLE]

where $\tilde{a}_{m}$ is the hyperparameter that determines whether $\tilde{w}_{m}$ is zero-valued or not. To be able to fully evaluate (52) further definitions have to be made, i.e. the hyperpriors over $\tilde{\textbf{a}}$ and $\sigma^{2}$ , where $\tilde{\textbf{a}}=[\tilde{a}_{1},\tilde{a}_{2},\ldots,\tilde{a}_{6M}]^{T}$ . Note, there are $6M$ hyperparameters as the real and imaginary parts of the weight coefficients for the $3M$ dipoles are being considered separately. For the same reason the limit in (52) is also $6M$ . The hyperparameters are given by the following Gamma distributions:

[TABLE]

and

[TABLE]

The solution to the problem of maximising $\mathcal{P}(\tilde{\textbf{w}},\sigma^{2}|\hat{\textbf{p}}_{r})$ can now be found by following the methodology detailed for the RVM [23, 24, 38], which will be briefly summarised below.

It is known that the posterior can be written as

[TABLE]

and from (51) and (52)

[TABLE]

The posterior mean and variance are given respectively by

[TABLE]

and

[TABLE]

where $\textbf{A}=\text{diag}(\tilde{a}_{1},\tilde{a}_{2},\ldots,\tilde{a}_{6M})$ is the diagonal matrix of the $6M$ hyperparameters.

A delta function at the values of $\tilde{\textbf{a}}$ and $\sigma^{2}$ that maximise $\mathcal{P}(\tilde{\textbf{a}},\sigma^{2}|\hat{\textbf{p}}_{r})$ can be used to approximate $\mathcal{P}(\tilde{\textbf{a}},\sigma^{2}|\hat{\textbf{p}}_{r})$ (i.e. a point estimate of $\mathcal{P}(\tilde{\textbf{a}},\sigma^{2}|\hat{\textbf{p}}_{r})$ for the most probable values of $\tilde{\textbf{a}}$ and $\sigma^{2}$ ). It is also known that

[TABLE]

With uniform values of $\beta_{ST-3},\beta_{ST-4},\beta_{ST-5}$ and $\beta_{ST-6}$ , $\mathcal{P}(\tilde{\textbf{a}})$ and $\mathcal{P}(\sigma^{2})$ become constant. Therefore, maximising $\mathcal{P}(\tilde{\textbf{a}},\sigma^{2}|\hat{\textbf{p}}_{r})$ is equivalent to maximising $\mathcal{P}(\hat{\textbf{p}}_{r}|\tilde{\textbf{a}},\sigma^{2})$ . This can be solved by following a type II likelihood maximisation procedure to maximise the likelihood function given by

[TABLE]

where

[TABLE]

This allows the optimal values $\tilde{\textbf{a}}_{opt}$ and $\sigma^{2}_{opt}$ to be obtained.

The $m^{th}$ optimal weight coefficients are then given by

[TABLE]

where $\tilde{w}_{opt,m}$ is the $m^{th}$ entry of

[TABLE]

Acknowledgments

We appreciate the support of the UK Engineering and Physical Sciences Research Council (EPSRC) via the project Bayesian Tracking and Reasoning over Time (BTaRoT) grant EP/K021516/1. We would like to thank the associate editor for handling our paper and acknowledge the anonymous reviewers’ comments that have helped improve this work.

Bibliography40

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] H. L. Van Trees, Optimum Array Processing, Part IV of Detection, Estimation, and Modulation Theory . New York, U.S.A.: John Wiley & Sons, Inc., 2002.
2[2] W. Liu and S. Weiss, Wideband Beamforming: Concepts and Techniqeus . Chichester, UK: John Wiley & Sons, 2010.
3[3] P. Jarske, T. Saramaki, S. K. Mitra, and Y. Neuvo, “On properties and design of nonuniformly spaced linear arrays,” IEEE Transactions on Acoustics, Speech, and Signal Processing , vol. 36, no. 3, pp. 372 –380, 1988.
4[4] R. L. Haupt, “Thinned arrays using genetic algorithms,” IEEE Transactions on Antennas and Propagation , vol. 42, no. 7, pp. 993–999, 1994.
5[5] K.-K. Yan and Y. Lu, “Sidelobe reduction in array-pattern synthesis using genetic algorithm,” IEEE Transactions on Antennas and Propagation , vol. 45, no. 7, pp. 1117 –1122, 1997.
6[6] M. Hawes and W. Liu, “Location optimization of robust sparse antenna arrays with physical size constraint,” IEEE Antennas and Wireless Propagation Letters , vol. 11, pp. 1303 –1306, 2012.
7[7] A. Trucco and V. Murino, “Stochastic optimization of linear sparse arrays,” IEEE Journal of Oceanic Engineering , vol. 24, no. 3, pp. 291–299, 1999.
8[8] S. Repetto and A. Trucco, “Designing superdirective microphone arrays with a frequency-invariant beam pattern,” IEEE Sensors Journal , vol. 6, no. 3, pp. 737–747, 2006.