Independence-Checking Coding for OFDM Channel Training Authentication:   Protocol Design, Security, Stability, and Tradeoff Analysis

Dongyang Xu; Pinyi Ren; James A. Ritcey

arXiv:1901.07897·eess.SP·January 24, 2019

Independence-Checking Coding for OFDM Channel Training Authentication: Protocol Design, Security, Stability, and Tradeoff Analysis

Dongyang Xu, Pinyi Ren, James A. Ritcey

PDF

TL;DR

This paper introduces an innovative coding-based authentication protocol for OFDM channel training that enhances security against attacks by encoding pilot signals into diversified patterns, ensuring high accuracy and stability in channel estimation.

Contribution

The paper develops an independence-checking coding theory and a secure, stable CTA protocol that encodes pilot tones into diversified patterns, improving security and robustness in OFDM systems.

Findings

01

The ICC-CTA protocol achieves high security in pilot authentication.

02

The protocol maintains stable channel estimation under attack scenarios.

03

Optimal code rate balances security and stability effectively.

Abstract

In wireless OFDM communications systems, pilot tones, due to their publicly known and deterministic characteristic, suffer significant jamming/nulling/spoofing risks. Thus, the convectional channel training protocol using pilot tones could be attacked and paralyzed, which raises the issue of anti-attack channel training authentication (CTA), i.e., verifying the claims of identities of pilot tones and channel estimation samples. In this paper, we consider one-ring scattering scenarios with large-scale uniform linear arrays (ULA) and develop an independence-checking coding (ICC) theory to build a secure and stable CTA protocol, namely, ICC-based CTA (ICC-CTA) protocol. In this protocol, the pilot tones are not only merely randomized and inserted into subcarriers but also encoded as diversified subcarrier activation patterns (SAPs) simultaneously. Those encoded SAPs, though camouflaged by…

Tables1

Table 1. TABLE I: Summary of Notations.

Notations	Description
$N_{T}$ ; $D λ (0 \leq D \leq 1 / 2)$	Number of antennas at BS; Antenna spacing
$Δ$ ; $θ_{i}, i = 1, 2$	Angle spread at BS; Mean AoA of Bob, $i = 1$ and Ava, $i = 2$
$\bar{N}$ ; $N$ ; $(\bar{N} \geq N)$	Total available number of subcarriers within each OFDM symbol time; Length of FFT points
$N_{B}; N_{A}$ $(N_{B} \leq \bar{N}, N_{A} \leq \bar{N})$	Number of subcarriers allocated for Bob and Ava
$Ψ = {0, 1 \dots \bar{N} - 1}$	Index set of total available subcarriers
$Ψ_{B} = {i_{0}, i_{1}, \dots, i_{N_{B} - 1}}$ , $Ψ_{A} = {i_{0}, i_{1}, \dots, i_{N_{A} - 1}}$	Index set of subcarriers allocated for Bob and Ava
$x_{B}^{j} [k], j \in Ψ_{B}$ ; $x_{A}^{j} [k], j \in Ψ_{A}$	Pilot tones for Bob and Ava at the $j$ -th subcarrier and $k$ -th symbol time
$ρ_{B}$ , $ρ_{A}$ ; $ϕ_{k}$ , $φ_{k, i}$	Uplink training power for Bob and Ava; Pilot phases of Bob and Ava
$L$ ; $σ^{2}$	Number of sampled multi-path taps in baseband, Average noise power of BS
$𝐡_{B}^{i} \in ℂ^{L \times 1}$ ; $𝐡_{A}^{i} \in ℂ^{L \times 1}$	CIR vectors, respectively from Bob and Ava to the $i$ -th receive antenna of Alice
$𝐅 \in ℂ^{N \times N}$ ; $𝐅_{L}$ ; $𝐅_{L, k}$ ; $𝐅_{j}$	DFT matrix; $𝐅_{L} = \sqrt{N} 𝐅 (:, 1 : L)$ ; $k$ -row matrix of $𝐅_{L}$ ; $j$ -row matrix of $𝐅$ .
$𝐯^{i} [k] \in ℂ^{N \times 1}$ , $𝐯^{i} [k] \sim 𝒞 𝒩 (0, 𝐈_{N} σ^{2})$	AWGN vector at the $i$ -th antenna of BS within the $k$ -th symbol time
$𝐰_{j}^{i} [k] = 𝐅_{j} 𝐯^{i} [k]$ , $1 \leq j \leq N$	AWGN vector across $j$ subcarriers for $i$ -th antenna of BS within $k$ -th symbol
$σ_{B, l}^{2}$ ; $σ_{A, l}^{2}$	PDP of the $l$ -th path of Bob and Ava
$𝐲_{i} [k]$	Received signal vector at the $i$ -th subcarrier and $k$ -th OFDM symbol.
$𝒜$ ;	${ϕ : ϕ = 2 m π / C, 0 \leq m \leq C - 1}$ ; $C$ denotes the quantization resolution
$𝒫_{d} = {k_{1}, \dots, k_{d}}$ , $𝒫_{d} \subseteq Ψ$	Index set of ambiguous subcarriers under hybrid attack
$𝒫_{s} = {j_{1}, \dots, j_{s}}$ , $𝒫_{s} \subseteq Ψ, \| 𝒫_{s} \| = s$	Index set of overlapping subcarriers under hybrid attack
$𝒫_{a} = {i_{1}, \dots, i_{a}}$ , $𝒫_{a} \subseteq {1, \dots N_{T}}$ , $\| 𝒫_{a} \| = a$	Index set of the intersection of $𝒮_{1}$ with $𝒮_{2}$
$𝐑_{i} \in ℂ^{N_{T} \times N_{T}}$ ; $𝐑_{F}$	Channel covariance matrix of Bob ( $i = 1$ ) and Ava ( $i = 2$ ); $𝐑_{F} = 𝐅_{L, s}^{T} 𝐅_{L, s}^{*}$
$ρ_{i}$ ; $ρ_{f} = \min {s, L}$	Rank of $𝐑_{i}$ ; Rank of $𝐑_{F}$
$N_{1}^{d}$ ; $N_{0}^{d}$	Total number of non-zero digits in S.1 and zero digits in S.2
$N_{1, i}^{s}$ ; $N_{0, i}^{s}$ , $i = 0, 1$	Total number of nonzero digits for $𝒜_{i}$ ; Total number of zero digits for $𝒜_{i}$
$d_{i_{r}}$	Digit indicated by RS

Equations51

{x_{{\rm{B}},j}}\left[k\right]=\left\{{\begin{array}[]{*{20}{c}}{x_{\rm{B}}\left[k\right]}&{{j\in{\Psi_{\rm{B}}}}}\\ 0&{{j\notin{\Psi_{\rm{B}}}}}\end{array}}\right.,{x_{{\rm{A}},j}}\left[k\right]=\left\{{\begin{array}[]{*{20}{c}}{x_{\rm{A}}^{j}\left[k\right]}&{{j\in{\Psi_{\rm{A}}}}}\\ 0&{{j\notin{\Psi_{\rm{A}}}}}\end{array}}\right.

{x_{{\rm{B}},j}}\left[k\right]=\left\{{\begin{array}[]{*{20}{c}}{x_{\rm{B}}\left[k\right]}&{{j\in{\Psi_{\rm{B}}}}}\\ 0&{{j\notin{\Psi_{\rm{B}}}}}\end{array}}\right.,{x_{{\rm{A}},j}}\left[k\right]=\left\{{\begin{array}[]{*{20}{c}}{x_{\rm{A}}^{j}\left[k\right]}&{{j\in{\Psi_{\rm{A}}}}}\\ 0&{{j\notin{\Psi_{\rm{A}}}}}\end{array}}\right.

y^{i} [k] = H_{C, B}^{i} F^{H} x_{B} [k] + H_{C, A}^{i} F^{H} x_{A} [k] + v^{i} [k]

y^{i} [k] = H_{C, B}^{i} F^{H} x_{B} [k] + H_{C, A}^{i} F^{H} x_{A} [k] + v^{i} [k]

y^{i} [k] = diag {x_{B} [k]} F_{L} h_{B}^{i} + diag {x_{A} [k]} F_{L} h_{A}^{i} + w_{N}^{i} [k]

y^{i} [k] = diag {x_{B} [k]} F_{L} h_{B}^{i} + diag {x_{A} [k]} F_{L} h_{A}^{i} + w_{N}^{i} [k]

[R_{k}]_{m, n} = \frac{1}{2Δ L} \int_{- Δ + θ_{k}}^{Δ + θ_{k}} e^{- j 2 π D (m - n) s i n (θ)} d θ, k = 1, 2

[R_{k}]_{m, n} = \frac{1}{2Δ L} \int_{- Δ + θ_{k}}^{Δ + θ_{k}} e^{- j 2 π D (m - n) s i n (θ)} d θ, k = 1, 2

y_{PTS}^{i} [k] = F_{L} h_{B}^{i} x_{B} [k] + F_{L} h_{A}^{i} x_{B} [k] + w_{N}^{i} [k]

y_{PTS}^{i} [k] = F_{L} h_{B}^{i} x_{B} [k] + F_{L} h_{A}^{i} x_{B} [k] + w_{N}^{i} [k]

ζ_{λ_{1}^{p}} = C_{N_{T}, 3}^{- 1} i, j = 1 \sum 3 (- 1)^{i + j} 2Γ (L_{α_{1}, 1}) Γ (L_{α_{2}, 2}) G_{i, j}

ζ_{λ_{1}^{p}} = C_{N_{T}, 3}^{- 1} i, j = 1 \sum 3 (- 1)^{i + j} 2Γ (L_{α_{1}, 1}) Γ (L_{α_{2}, 2}) G_{i, j}

ζ_{λ_{3}^{p}} = C_{N_{T} 3}^{- 1} i, j = 1 \sum 3 (- 1)^{i + j} 2Γ (L_{α_{1}, 1}) Γ (L_{α_{1}, 2}) G_{i, j}^{1}

ζ_{λ_{3}^{p}} = C_{N_{T} 3}^{- 1} i, j = 1 \sum 3 (- 1)^{i + j} 2Γ (L_{α_{1}, 1}) Γ (L_{α_{1}, 2}) G_{i, j}^{1}

ζ_{λ_{1}, λ_{3}} = C_{N_{T} 3}^{- 1} i_{1}, i_{3}, j_{1}, j_{3} \sum χ Γ (L_{β_{1}, 1}) ⎩ ⎨ ⎧ l_{1} = 1 \sum L_{α_{1}, 1} - 1 \frac{G _{i, j}^{2}}{l _{1} !} ⎭ ⎬ ⎫

ζ_{λ_{1}, λ_{3}} = C_{N_{T} 3}^{- 1} i_{1}, i_{3}, j_{1}, j_{3} \sum χ Γ (L_{β_{1}, 1}) ⎩ ⎨ ⎧ l_{1} = 1 \sum L_{α_{1}, 1} - 1 \frac{G _{i, j}^{2}}{l _{1} !} ⎭ ⎬ ⎫

R_{I C C} (N_{B}, w) = lo g_{2} [\frac{N _{B} !}{( \frac{N _{B} + s}{2} ) ! ( \frac{N _{B} - s}{2} ) !}]^{1 / 1 N_{B} N_{B}}

R_{I C C} (N_{B}, w) = lo g_{2} [\frac{N _{B} !}{( \frac{N _{B} + s}{2} ) ! ( \frac{N _{B} - s}{2} ) !}]^{1 / 1 N_{B} N_{B}}

P_{I} = \frac{N _{B} ! - ( \frac{N _{B} + s}{2} ) ! ( \frac{N _{B} - s}{2} ) !}{2 ^{N_{B} + 1} ( \frac{N _{B} + s}{2} ) ! ( \frac{N _{B} - s}{2} ) !}

P_{I} = \frac{N _{B} ! - ( \frac{N _{B} + s}{2} ) ! ( \frac{N _{B} - s}{2} ) !}{2 ^{N_{B} + 1} ( \frac{N _{B} + s}{2} ) ! ( \frac{N _{B} - s}{2} ) !}

Y_{L} = X_{L} H_{L} + N_{L}

Y_{L} = X_{L} H_{L} + N_{L}

h_{B, L} = W_{B, L} Y_{L}, h_{A, L} = W_{A, L} Y_{L}

h_{B, L} = W_{B, L} Y_{L}, h_{A, L} = W_{A, L} Y_{L}

\begin{array}[]{l}{{\cal H}_{\rm{0}}}:{\widehat{\bf{h}}_{{\rm{B}},{\rm{L}}}}\to Bob,~{}{\cal{H}_{\rm{1}}}:{\widehat{\bf{h}}_{{\rm{A}},{\rm{L}}}}\to Bob\end{array}

\begin{array}[]{l}{{\cal H}_{\rm{0}}}:{\widehat{\bf{h}}_{{\rm{B}},{\rm{L}}}}\to Bob,~{}{\cal{H}_{\rm{1}}}:{\widehat{\bf{h}}_{{\rm{A}},{\rm{L}}}}\to Bob\end{array}

\frac{Δ f \buildrel Δ}{= f ( h _{B, L} ) - f ( h _{A, L} )}

\frac{Δ f \buildrel Δ}{= f ( h _{B, L} ) - f ( h _{A, L} )}

Δ f = L {ρ_{1} - Tr (R_{2} \overline{R}_{1})}

Δ f = L {ρ_{1} - Tr (R_{2} \overline{R}_{1})}

Tr (Λ_{2, p} \overline{U}_{2}^{H} \overline{U}_{1} \overline{Λ}_{1, p}) \leq j = 1 \sum a \frac{λ _{2, i_{j}}}{λ _{1, i_{j}}}

Tr (Λ_{2, p} \overline{U}_{2}^{H} \overline{U}_{1} \overline{Λ}_{1, p}) \leq j = 1 \sum a \frac{λ _{2, i_{j}}}{λ _{1, i_{j}}}

R_{F} min

R_{F} min

\overline{P}_{s} : {i_{k}, i_{k + \frac{N}{L}} \dots, i_{k + \frac{( L - 1 ) N}{L}}, k = 0, 1, \dots, \frac{N}{L} - 1}

\overline{P}_{s} : {i_{k}, i_{k + \frac{N}{L}} \dots, i_{k + \frac{( L - 1 ) N}{L}}, k = 0, 1, \dots, \frac{N}{L} - 1}

\frac{s ^{*} \buildrel Δ}{= \frac{L - 1}{L} N + 1}

\frac{s ^{*} \buildrel Δ}{= \frac{L - 1}{L} N + 1}

P_{s} (N_{B}, w, s^{*}) = \frac{κ ( N _{B} , w , s ^{*} )}{C ^{2} ( N _{B} , w , s ^{*} )}, 0 \leq P_{s} (N_{B}, w, s^{*}) \leq 1

P_{s} (N_{B}, w, s^{*}) = \frac{κ ( N _{B} , w , s ^{*} )}{C ^{2} ( N _{B} , w , s ^{*} )}, 0 \leq P_{s} (N_{B}, w, s^{*}) \leq 1

S_{T}\left({{N_{\rm{B}}},w,{s^{*}}}\right)={\left\{{{{\left({\begin{array}[]{*{20}{c}}{{N_{\rm{B}}}}\\ {{N_{\rm{B}}}-w}\end{array}}\right)}\mathord{\left/{\vphantom{{\left({\begin{array}[]{*{20}{c}}{{N_{\rm{B}}}}\\ {{N_{\rm{B}}}-w}\end{array}}\right)}{\left({\begin{array}[]{*{20}{c}}{{N_{\rm{B}}}-{s^{*}}}\\ {{N_{\rm{B}}}-w}\end{array}}\right){\rm{}}}}}\right.\kern-1.2pt}{\left({\begin{array}[]{*{20}{c}}{{N_{\rm{B}}}-{s^{*}}}\\ {{N_{\rm{B}}}-w}\end{array}}\right){\rm{}}}}}\right\}^{2}}

S_{T}\left({{N_{\rm{B}}},w,{s^{*}}}\right)={\left\{{{{\left({\begin{array}[]{*{20}{c}}{{N_{\rm{B}}}}\\ {{N_{\rm{B}}}-w}\end{array}}\right)}\mathord{\left/{\vphantom{{\left({\begin{array}[]{*{20}{c}}{{N_{\rm{B}}}}\\ {{N_{\rm{B}}}-w}\end{array}}\right)}{\left({\begin{array}[]{*{20}{c}}{{N_{\rm{B}}}-{s^{*}}}\\ {{N_{\rm{B}}}-w}\end{array}}\right){\rm{}}}}}\right.\kern-1.2pt}{\left({\begin{array}[]{*{20}{c}}{{N_{\rm{B}}}-{s^{*}}}\\ {{N_{\rm{B}}}-w}\end{array}}\right){\rm{}}}}}\right\}^{2}}

N_{B}, w max R_{I C C} (N_{B}, w)

N_{B}, w max R_{I C C} (N_{B}, w)

s . t . P_{s} (N_{B}, w, s^{*}) = 1, s^{*} = \frac{L - 1}{L} N + 1

(s^{*} + 1) (N_{B} - w) + s^{*} \leq N_{B} \leq \overline{N}, s^{*} \leq w \leq N_{B} \leq \overline{N}

(s^{*} + 1) (N_{B} - w) + s^{*} \leq N_{B} \leq \overline{N}, s^{*} \leq w \leq N_{B} \leq \overline{N}

R_{s} (N_{B}, w, s^{*}) = lo g_{2} \frac{N _{B} !}{( \frac{s ^{*} ( N _{B} + 1 )}{s ^{*} + 1} ) ! ( \frac{N _{B} - s ^{*}}{s ^{*} + 1} ) !}^{1 / 1 N_{B} N_{B}}

R_{s} (N_{B}, w, s^{*}) = lo g_{2} \frac{N _{B} !}{( \frac{s ^{*} ( N _{B} + 1 )}{s ^{*} + 1} ) ! ( \frac{N _{B} - s ^{*}}{s ^{*} + 1} ) !}^{1 / 1 N_{B} N_{B}}

R_{s} (N_{B}, w, s^{*}) \geq \frac{lo g _{2} η}{η}

R_{s} (N_{B}, w, s^{*}) \geq \frac{lo g _{2} η}{η}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Independence-Checking Coding for OFDM Channel Training Authentication: Protocol Design, Security, Stability, and Tradeoff Analysis

Dongyang Xu, Pinyi Ren, and James A. Ritcey,

Abstract

In wireless OFDM communications systems, pilot tones, due to their publicly-known and deterministic characteristic, suffer significant jamming/nulling/spoofing risks. Thus, the convectional channel training protocol using pilot tones could be attacked and paralysed, which raises the issue of anti-attack channel training authentication (CTA), that is, verifying the claims of identities of pilot tones and channel estimation samples. In this paper, we consider one-ring scattering scenarios with large-scale uniform linear arrays (ULA) and develop an independence-checking coding (ICC) theory to build a secure and stable CTA protocol, namely ICC based CTA (ICC-CTA) protocol. In this protocol, pilot tones are not merely randomized and inserted into subcarriers, but also encoded as diversified subcarrier activation patterns (SAPs) simultaneously. Those encoded SAPs, though camouflaged by malicious signals, can be identified and decoded into original pilots, and hence for high-accuracy channel impulse response (CIR) estimation. The CTA security is first characterised by the error probability of identifying legitimate CIR estimation samples. We prove that the identification error probability (IEP) is equal to zero under the continuously-distributed mean angle of arrival (AoA) and also derive a closed-form expression of IEP under the discretely-distributed case. The CTA instability is formulated as the function of probability of stably estimating CIR against all available diversified SAPs. A realistic tradeoff between the CTA security and instability under the discretely-distributed AoA is identified and an optimally-stable tradeoff problem is formulated, with the objective of optimizing the code rate to maximize security while maintaining maximum stability for ever. Solving this, we derive the closed-form expression of optimal code rate. Numerical results finally validate the resilience of proposed ICC-CTA protocol.

Index Terms:

Physical-layer authentication, anti-attack, OFDM, channel training, independence-checking coding.

I Introduction

With the evolution of air interface towards 5G, security paradigms for the protection of air interface technologies have attracted increasing attentions in wireless communications systems. Safeguarding the current standard, for instance, orthogonal frequency-division multiplexing (OFDM) or securely implementing the initiation, such as massive antenna technique, gradually come up on the agenda [1]. The common problem encountered is that the imperishable characteristic of wireless channels, such as the open and shared nature, has always been rendering those technologies vulnerable to the growing denial of service (DoS) attacks [2]. A phenomenon, if we notice, has emerged in the physical (PHY) layer that DoS attacks, with moderate size of the involved network segment and modest implementation complexity, have become increasingly common and potent [3]. As their major hacking behaviors, radio jamming (RJ) attacks have been exhibiting its astonishing destructive power on those existing [4] and emerging air interface techniques [5].

Among these RJ attacks, protocol-aware attack serves as the most effective one as the attacker could sense the specific protocols and intensify its effectiveness significantly by jamming a physical layer mechanism instead of data payload directly [6]. The typical case which frequently occurs in massive-antenna OFDM systems is that protocol-aware attackers always show a great appetite for the channel training protocol. In this protocol, frequency-domain subcarrier (FS) channels and channel impulse response (CIR) samples, are estimated to further the high-quality user experience using those estimations. The motivations for this case are twofold. On one hand, multi-antenna OFDM technique has been deployed universally in current commercial and military applications, which incurs huge interests of malicious nodes. Since the channel training protocol requires that deterministic and publicly-known pilot tones should be shared on the time-frequency resource grid (TFRG) by all parties [7], a pilot-aware attacker could sense and acquire the public pilot information, and practically behave in such a way that the regular channel training process may not be maintained as usual [8, 9, 10]. On the other hand, everyone has witnessed the introduction of massive antennas into OFDM technique which has been promoted significantly in the recent practice, such as in 3GPP new radio (NR) specifications. In this era, the precise channel training becomes very crucial to maintaining the significant multiplexing gains of target users. The bad news is that imprecise estimation samples could not only lower down those gains but also benefit others, such as the attacker, due to the high resolution of antenna arrays. What’s more, when the channel training is misguided in favour of attacker, actually without too much efforts, massive antenna arrays in OFDM systems will be well loved by the attacker.

In this context, authenticating channel training becomes very critical to the massive antenna OFDM systems since it determines the authenticity of channel estimation results. Generally, channel training launched by any certain subscriber is authenticated by default through the designated public pilot tones allocated to that subscriber [11, 12]. Applying the same pilot tones as the subscriber at the receiver to channel estimation means the exact authentication for channel training. This process is called the channel training authentication (CTA) which belongs to the field of physical-layer authentication [13]. Intrinsically, exact CTA mainly depends on the authenticity of pilot tones in a sense that the claims of identities of pilot tones should be verified. The uniqueness and non-reproducibility of pilot tones are two foremost requirements which however will no longer hold true when a pilot-aware attacker jams/nulls/spoofs those pilot tones. In practice, attacking CTA process in OFDM systems is a common phenomenon, e.g., in scenarios with tactical consideration [14] or in Long Term Evolution (LTE)-based public safety networks [15]. Those attacks, including pilot tone jamming (PTJ) attack [8], pilot tone nulling (PTN) attack [9] and pilot tone spoofing (PTS) attack [10], are very hard to eliminate once they have occurred successfully.

I-A Related Works

Much of the work related to securing CTA has been investigated thus far. How to detect the alteration to authenticity and how to protect and further maintain the high authenticity are two major branches in this area.

The first attempt for narrow-band single-carrier systems is made in [16] in which the pilot contamination (PC) attack, one type of PTS attack, was introduced and evaluated. Following [16], much of the work was studied, but limited to the detection of authenticity of pilot signals by exploiting the physical layer information, such as auxiliary training or data sequences [17, 19, 18] and some prior-known channel information [20, 21]. Different from those, authors in [22] first studied the advantage of spatial correlation in the maintenance of authenticity of pilots, and found that the natural spatial separation of massive antenna arrays can force PC attack to occur effectively only in a particular angular domain. However, we should never forget that the attacker is out of control. In this regard, PC attack actually becomes more well-directed, rather than less effective.

The first attempt for multi-subcarrier scenarios was presented by Clancy et al. [23], verifying the possibility and effectiveness of PTJ attack. Following this, PTJ attack was then studied for single-input single-output (SISO)-OFDM communications in [8] which also introduced the PTN attack and then extended it to the multiple-input multiple-output (MIMO)-OFDM system [9]. The initial attempt to resolve pilot aware attack for conventional OFDM systems was proposed in [24], that is, transforming the PTN and PTS attack into PTJ attack by randomizing the locations and values of regular pilot tones on time-frequency resource grid (TFRG). It figured out the importance that pilot tone scheduling, even being random, would also affect channel acquisition. Hinted by this, authors in [10] proposed a FS channel estimation framework under the PTS attack by exploiting pilot randomization and the independence component analysis (ICA) theory. One key problem is that the practical subcarriers are not mutually independent in the scenarios with limited channel taps, and thus ICA does not apply in this case. Most importantly, the CIR estimation is impossible. Basically, CIR is very critical to the CTA in future 5G mobile eco-systems in which measuring the multipath before designing systems is mandatory since the channel has to carry the big amount of data for our “everything wireless” applications. The knowledge of the channel response represents the aggregate values of gross physical multipath information. CIR is such a wideband channel characterization and contains all information necessary to simulate or analyze any type of radio transmission through the channel. For instance, the amplitude of channel taps could reflect the sparsity of channel in some cases and their variations could tell us the Doppler spread, coherence bandwidth, and so forth [25].

To solve those issues, our previous work in [26] proposed an independence-checking coding (ICC) method which provides high authenticity guarantee on the FS channel and CIR estimation based on randomized pilot tones. Nevertheless, the influence of randomization on CIR estimation was not evaluated and optimized, which incurs the instability of CIR estimation. In this sense, CTA not only merely requires the high security against attacks, but also strongly and necessarily calls for the high stability of CIR estimation accuracy. As far as we know, there were very few studies jointly considering the security and instability during the channel training phase.

I-B Motivations and Contributions

The hints from the above investigation further motive us to build up a secure CTA protocol for massive-antenna OFDM systems with considerations of the heterogeneity of attack modes and the instability of CIR estimation

Recall that pilot randomization serves as a commonsense technique for defending against pilot-aware attack. However, inserting randomized pilot tones on TFRG solely functions to transform the attack modes such that the attack issue will not be insolvable, rather than to resolve the issue practically. To be more specific, this brings two bottlenecks, i.e., 1) Unpredictable attack modes;

Problem 1 (Attack Model).

A pilot-aware attacker chooses on TFRG a hybrid attack mode including PTJ attack and silence cheating (SC) mode. In PTJ attack mode, two behaviors are available, i. e., wide-band pilot jamming (WB-PJ) attack [27] and partial-band pilot jamming (PB-PJ) attack [28]. In SC mode, the attacker keeps silent for cheating the legitimate node. The legitimate node can never acquire the behaviors of the attacker in advance. All of the three modes can be very effective due to the node transparency (i.e., no association or independent with each other) and should never be ignored.

**2) Irreversible pilot information. ** Randomized pilot information become irreversible in the following sense:

Problem 2.

Randomized pilot information are naturally camouflaged by random channel information. Those information, if transmitted by pilot tones for uplink channel training through wireless channels, cannot be separated and identified.

This problem inspires us to perform the protocol design for the overall channel training process. The guideline for this is presented in Fig. 1 where two key requirements are detailed as follows:

Share pilot information through encoded subcarrier activation patterns (SAPs): Selectively activate and deactivate OFDM subcarriers by transmitting pilots on subcarriers or not, and create various SAP candidates. Encode all SAPs as a binary code. Optimize the code set in such a way that arbitrary one SAP, namely, codeword, if suffering a hybrid attack in the wireless environment, are enabled to be separated and identified securely. With this preparation, pilot information is conveyed and encoded as one codeword and further expressed as a SAP. Secure pilot sharing is thus constructed between transceiver pairs. 2. 2.

Reuse subcarriers in activation to estimate channels: Generate channel estimators according to the identified pilots and apply them on the activated subcarriers for FS channel estimation. Enhance the pilot identification using the estimated FS channels. Derive CIR estimation samples from the estimated FS channels.

In this methodology, channel estimation coexists with the information coding and the two techniques influence each other. In spite of the security guarantee provided by encoded SAPs, SAP diversification also incurs the uncertainties as to the amount and distribution of subcarriers in activation, further instabilizing the CIR estimation extremely. This entanglement between security and instability motivates us to perform the protocol optimization. The main contributions of this paper are summarized as follows:

Protocol Design: First, we establish a fundamental principle for encoding arbitrary SAPs as a binary code set precisely. Following this, we develop an ICC theory to further optimize the code such that arbitrary two codewords in the code, if being superimposed on each other, can be separated and identified securely. In order to evaluate the security for this, we formulate two key performance indicators (KPIs), i.e., the separation error probability (SEP) and identification error probability (IEP). We prove that SEP is always guaranteed to be zero and also derive the analytical expression of IEP. We build up an uplink ICC based CTA (ICC-CTA) protocol in which legitimate transceiver pair encodes and decodes randomized pilot phases securely through the ICC codebook, and then performs FS channel and CIR estimation using the identified pilots. 2. 2.

Next, we discover a hidden phenomenon that when FS channel estimation is performed on the basis of this protocol, the array spatial correlation existing in the overlapping subcarriers that also carry information from both the legitimate node and the attacker can further help reduce IEP in one-ring scattering scenarios. At this point, the attacker can actually help the legitimate node to enhance the security. Interestingly, it can be proved that zero IEP cannot be achieved only when the attacker is located in the clusters with the same mean angle of arrival (AoA) as the legitimate node. This principle, in this sense, could facilitate the acquisition of the position of attacker. Theoretically when we consider the mean AoA with continuous probability distribution, the security, in theory, can be perfectly guaranteed. Practically in discretely-distributed case, we give an analytical expression of how much the security could be further improved. 3. 3.

Protocol Optimization: Finally, we identify the phenomenon of instable CIR estimation in this protocol and define the stability by the function of probability of stable CIR estimation against diversified SAPs. In the realistic scenario with discretely-distributed mean AoAs, we identify and model the tradeoff between the security and instability. Interestingly, we prove that there always exists an optimally-stable tradeoff for which the CIR estimation can always achieve its optimal stability without losing estimation precision asymptotically. Maintaining this stability, we further determine a closed-form expression of optimal code rate that maximizes the security. This code rate indicates how to flexibly configure the number of activated subcarriers under this hybrid attack such that desirable security and maximum stability of CIR estimation can be both guaranteed.

Organization: In Section II , we present an overview of pilot-aware attack on massive-antenna OFDM systems. In Section III, we introduce an ICC-CTA protocol. FS channel estimation and security enhancement are described in Section IV. Security-instability tradeoff in CIR estimation is provided in Section V. Numerical results are presented in Section VI and finally we conclude our work in Section VII.

Notations: We use boldface capital letters ${\bf{A}}$ for matrices, boldface small letters ${\bf{a}}$ for vectors , and small letters $a$ for scalars. ${{\bf{A}}^{*}}$ , ${{\bf{A}}^{\rm{T}}}$ , ${{\bf{A}}^{{H}}}$ and ${\bf{A}}\left({:,1:L}\right)$ respectively denotes the conjugate operation, the transpose, the conjugate transpose and the first $L$ columns of matrix ${\bf{A}}$ . $\left\|{\cdot}\right\|$ denotes the Euclidean norm of a vector or a matrix. $\left|{\cdot}\right|$ is the cardinality of a set. ${\mathbb{E}}\left\{\cdot\right\}$ is the expectation operator. $\otimes$ denotes the Kronecker product operator. ${\rm{diag}}\left\{{\bf{a}}\right\}$ stands for the diagonal matrix with the elements of column vector $\bf{a}$ on its diagonal.

II Overview of Pilot-Aware Attack on Massive-Antenna OFDM Systems

We in this section outline a fundamental overview of CTA issue under pilot aware attack, from a mathematical point of view. This refers to the basic system model, signal model, and channel estimation model. Finally, the pilot randomization technique is described and most importantly, we identify its potential challenges in resolving the attack.

II-A System Description

We consider a synchronous large-scale multiple-input single-output (MISO)-OFDM system with a $N_{\rm T}\gg 1$ -antenna base station (named as Alice) and a single-antenna legitimate user (named as Bob). As shown in Fig. 2, the based station (BS) is equipped with a $D\lambda$ -spacing directive uniform linear array (ULA) and placed at the origin along the $y$ -axis to serve a 120-degree sector that is centered around the $x$ -axis ( $\alpha=0$ ). We assume no energy is received for angles $\alpha\notin\left[{-\frac{\pi}{3},\frac{{\pi}}{3}}\right]$ . The summary of notations is given in Table I.

For a typical cellular configuration, the channel from Bob to Alice is a correlated random vector with covariance matrix that depends on the scattering geometry. Assuming a macro-cellular tower-mounted BS with no significant local scattering, the propagation between Bob and Alice is characterized by the local scattering around Bob, resulting in the well-known one-ring model [22]. For OFDM systems with frequency-selective channels, the wide-band configuration is more realistic. Here, we consider the wide-band one-ring scattering model in which Bob is surrounded by local scatterers within $\left[{{\theta_{1}}-{\Delta},{\theta_{1}}+{\Delta}}\right]$ [22, 29].This will contribute to the following mathematical characterisation of the advantage of spatial correlation in security provision as an explicit result, rather than a complex and unintuitive implication.

We consider pilot tone based uplink channel training process on time-frequency domain with $\overline{N}$ available subcarriers at each OFDM symbol time. In principle, subcarriers indexed by ${\Psi_{\rm{B}}}$ are employed for pilot tone insertion and the following channel estimation. Those pilot tones, known as reference signals in LTE-A and/or beyond, are deterministic and publicly-known in TFRG. Each transceiver, by sharing those tones, can deduce the FS channels and further estimate the CIR samples. Therefore a single-antenna malicious node (named as Ava) could disturb this training process by jamming/spoofing/nulling those pilot tones. We denote the set of victim subcarriers by ${\Psi_{\rm{A}}}$ and make the following assumption:

Assumption 1.

Ava is surrounded by local scatterers within $\left[{{\theta_{2}}-{\Delta},{\theta_{2}}+{\Delta}}\right]$ and always has common or overlapping AoA intervals with Bob, this is, $\left[{{\theta_{2}}-\Delta,{\theta_{2}}+\Delta}\right]\cap\left[{{\theta_{1}}-\Delta,{\theta_{1}}+\Delta}\right]\neq\emptyset$

This assumption is supported by the scenario where a common large scattering body (e.g., a large building) could create a set of angles common to all nodes in the system. In this case, the angular spread of BS is broad and the overlapping of AoA intervals is inevitable. The result is that the channel covariance eigenspaces of Bob and Eva are coupled and the attack is hard to eliminate through angular separation [22].

Assumption 2.

We consider the multiple-cluster scenario. Two types of the distribution model of ${\theta_{i}}$ , $i=1,2$ are considered, including the continuous probability distribution (CPD) [22] and the discrete probability distribution (DPD) [30], for instance, discrete uniform distribution with the support of interval length $K$ .

II-B Receiving Signal Model

In this subsection, we introduce the receiving signal model at Alice. To begin with, we will give the concept of pilot insertion pattern (PIP) which indicates the way of inserting pilot tones across subcarriers and OFDM symbols.

Assumption 3 (Frequency-domain PIP).

We in this paper assume $x_{{\rm{B}}}^{i}\left[k\right]={x_{{\rm{B}}}}\left[k\right]=\sqrt{{\rho_{{\rm{B}}}}}{e^{j{\phi_{k}}}}$ , ${{i\in{\Psi_{\rm{B}}}}}$ for low overhead consideration and theoretical analysis. Alternatively, we can superimpose ${x_{{\rm{B}}}}\left[k\right]$ onto a dedicated pilot sequence optimized under a non-security oriented scenario and utilize this new pilot for training. At this point, $\phi_{k}$ can be an additional phase difference for security consideration. We do not impose the phase constraint on the PIP strategies of Ava, that is, ${x^{i}_{\rm{A}}}\left[k\right]=\sqrt{{\rho_{\rm{A}}}}{e^{j{{\varphi}_{k,i}}}},{{i\in{\Psi_{\rm{A}}}}}$ .

Let us proceed to the basic OFDM procedure. First, the frequency-domain pilot signals of Bob and Ava over $N$ subcarriers are respectively stacked as $N$ by $1$ vectors ${{\bf{x}}_{\rm{B}}}\left[k\right]=\left[{{x_{{\rm{B}},j}}\left[k\right]}\right]_{{{j\in{\Psi}}}}^{\rm{T}}$ and ${{\bf{x}}_{\rm{A}}}\left[k\right]=\left[{{x_{{\rm{A}},j}}\left[k\right]}\right]_{{{j\in{\Psi}}}}^{\rm{T}}$ . Here there exist:

[TABLE]

Assume that the length of cyclic prefix is larger than $L$ . The parallel streams, i.e., ${{\bf{x}}_{{\rm{B}}}}\left[k\right]$ and ${{\bf{x}}_{{\rm{A}}}}\left[k\right]$ , are modulated with inverse fast Fourier transform (IFFT). After removing the cyclic prefix at the $i$ -th receive antenna and $k$ -th OFDM symbol time, Alice derive the time-domain $N$ by $1$ vector ${{\bf{y}}^{i}}\left[k\right]$ as:

[TABLE]

where ${\bf{H}}_{{\rm{C,B}}}^{i}$ and ${\bf{H}}_{{\rm{C,A}}}^{i}$ are $N\times N$ circulant matrices for which the first column of ${\bf{H}}_{{\rm{C,B}}}^{i}$ and ${\bf{H}}_{{\rm{C,A}}}^{i}$ are respectively given by ${\left[{\begin{array}[]{*{20}{c}}{{\bf{h}}_{\rm{B}}^{{i^{\rm{T}}}}}&{{{\bf{0}}_{1\times\left({N-L}\right)}}}\end{array}}\right]^{\rm{T}}}$ and ${\left[{\begin{array}[]{*{20}{c}}{{\bf{h}}_{\rm{A}}^{{i^{\rm{T}}}}}&{{{\bf{0}}_{1\times\left({N-L}\right)}}}\end{array}}\right]^{\rm{T}}}$ . Here, ${\bf{h}}_{\rm{A}}^{i}$ is assumed to be independent with ${\bf{h}}_{\rm{B}}^{i}$ . Taking fast Fourier transform (FFT), Alice finally derives the frequency-domain $N$ by $1$ signal vector at the $i$ -th receive antenna and $k$ -th OFDM symbol time as

[TABLE]

Throughout this paper, we assume that the CIRs belonging to different paths at each antenna exhibit spatially uncorrelated Rayleigh fading. Without loss of generality, each path has the uniform and normalized power delay profile (PDP) satisfying $\sum\limits_{l=1}^{L}{\sigma_{{\rm B},l}^{2}}=1,\sum\limits_{l=1}^{L}{\sigma_{{\rm A},l}^{2}}=1$ [31]. For each path, CIRs of different antennas are spatially correlated. With the one-ring scattering model, the correlation between channel coefficients of antennas $1\leq m,n\leq N_{\rm T}$ , $\forall l$ is defined by [22, 29]:

[TABLE]

Here, ${{\bf{R}}_{{k}}},k=1,2$ are symmetric positive semi-definite matrices. Note that ${{\bf{R}}_{{2}}}$ is unknown for Alice and Bob while ${{\bf{R}}_{{1}}}$ is known by Alice.

II-C * Channel Estimation Model*

For the PTS attack, Ava could learn the pilot tones employed by Bob in advance and impersonate Bob by utilizing the same pilot tone learned. There exists ${\Psi_{\rm{B}}}\cup{\Psi_{\rm{A}}}={\Psi_{\rm{B}}}$ and $x_{{\rm{A}}}^{i}\left[k\right]={x_{{\rm{B}}}}\left[k\right],{i\in{\Psi_{\rm{A}}}}$ . Signals in Eq. (3) can be rewritten as:

[TABLE]

Finally, a least square (LS) based channel estimation is formulated by the equation ${\widehat{{\bf{h}}}_{con}^{i}}={\bf{h}}_{\rm{B}}^{i}+{\bf{h}}_{\rm{A}}^{i}+{\left({{{\bf{F}}_{\rm{L}}}}\right)^{+}}\frac{{x_{\rm{B}}^{\rm{H}}\left[k\right]}}{{{{\left|{x_{\rm{B}}^{\rm{H}}\left[k\right]}\right|}^{2}}}}{{\bf{w}}^{i}_{N}}\left[k\right]$ where $\left({{{\bf{F}}_{\rm{L}}}}\right)^{+}$ is the Moore-Penrose pseudoinverse of ${{{\bf{F}}_{\rm{L}}}}$ . We see that the estimation of ${\bf{h}}_{\rm{B}}^{i}$ is contaminated by ${\bf{h}}_{\rm{A}}^{i}$ with a noise bias when a PTS attack happens. As to the characterisation of PTN attack and PTJ attack, we can refer to the mathematical interpretation in [26].

II-D Influence of Pilot Randomization on Pilot-Aware Attack

Pilot randomization can avoid the pilot aware attack without imposing any prior information on the pilot design. The common method is to randomly select phase candidates. Each of the phase candidates is mapped by default into a unique quantized sample, chosen from the set ${\cal A}$ . Since phase information only provides the security guarantee as shown in Assumption 3, thus without the need of huge overheads, we make the following assumptions:

Assumption 4 (Time-domain PIP).

During two adjacent OFDM symbol time, such as, $k_{i},k_{i+1}$ , $i\geq 0$ two pilot phases ${\phi_{{k_{i}}}}$ and ${\phi_{{k_{i+1}}}}$ are kept with fixed phase difference, that is, ${\phi_{{k_{i+1}}}}-{\phi_{{k_{i}}}}=\overline{\phi}$ , for reducing the authentication overheads. Here, ${\phi_{{k_{i+1}}}}$ and ${\phi_{{k_{i}}}}$ are both random but $\overline{\phi}$ are deterministic and publicly known.

Institutively, how the value $C$ increases affects the performance of anti-attack technique. This technique also brings up the subject of Problem 2.

III ICC-CTA Protocol

As shown in the Fig. 1, this section presents the principles of pilot conveying, separation and identification.

III-A Pilot Conveying on Code-Frequency Domain

Basically, the more phases supported in $\cal A$ , the higher coding diversity is required, and thus the more available SAPs should be created. Theoretically, this requires a delicately-designed binary code and practically depends on how to activate and deactivate subcarriers as the code indicates. This operation will inevitably induce a concurrence of activated and deactivated subcarriers, and therefore detecting the number of signals coexisting on one subcarrier is a necessary work before coding.

To achieve this goal, we will employ the technique of eigenvalue ratio based detection (ERD) proposed in [32]. Here we consider three symbol time and a $3\times N_{\rm T}$ receiving signal matrix, denoted by ${{\bf{Y}}_{\rm{D}}}$ , is created for detection. Given the normalized covariance matrix defined by $\widehat{\bf{R}}=\frac{1}{{{\sigma^{2}}}}{\bf{Y}}_{\rm D}{{\bf{Y}_{\rm D}}^{\rm{H}}}$ , we define its ordered eigenvalues by ${\lambda_{{1}}}>{\lambda_{2}}>{\lambda_{3}}>0$ and construct the test statistics by $T=\frac{{{\lambda_{1}}}}{{{\lambda_{3}}}}\mathop{\gtrless}\limits_{{{{\overline{\cal H}}_{0}}}}^{{{{{{\cal H}}_{0}}}}}\gamma$ where $\gamma$ denotes the decision threshold. The hypothesis ${{{\cal H}}_{0}}$ means that there exist signals and ${{\overline{\cal H}}_{0}}$ means the opposite.

III-A1 Construction of Code Frequency Domain

Given the threshold $\gamma$ , the cumulative distribution function (CDF) of $T$ , denoted by $F\left(\gamma\right)$ , can be expressed by $F\left(\gamma\right)=1-{P_{f}}=\Phi\left\{{\frac{{{\zeta_{{\lambda_{3}}}}\gamma-{\zeta_{{\lambda_{1}}}}}}{{{\xi_{{\lambda_{1}}}}{\xi_{{\lambda_{3}}}}\chi\left(\gamma\right)}}}\right\},\chi\left(\gamma\right)=\sqrt{\frac{{{\gamma^{2}}}}{{\xi_{{\lambda_{1}}}^{2}}}-\frac{{2\rho\gamma}}{{{\xi_{{\lambda_{1}}}}{\xi_{{\lambda_{3}}}}}}+\frac{1}{{\xi_{{\lambda_{3}}}^{2}}}}$ where $\rho={{\left({{\zeta_{{\lambda_{1}},{\lambda_{3}}}}-{\zeta_{{\lambda_{1}}}}{\zeta_{{\lambda_{3}}}}}\right)}\mathord{\left/{\vphantom{{\left({{\zeta_{{\lambda_{1}},{\lambda_{3}}}}-{\zeta_{{\lambda_{1}}}}{\zeta_{{\lambda_{3}}}}}\right)}{{\xi_{{\lambda_{1}}}}{\xi_{{\lambda_{3}}}}}}}\right.\kern-1.2pt}{{\xi_{{\lambda_{1}}}}{\xi_{{\lambda_{3}}}}}}$ [32] . Here $\Phi\left\{\cdot\right\}$ denotes CDF of a standard Gaussian random variable. In order to measure how many antennas are required on each subcarrier to achieve a certain ${P_{f}}$ , a decision threshold function $\gamma\buildrel\Delta\over{=}f\left({{N_{\rm{T}}},{P_{f}}}\right)$ is derived, with $f\left({{N_{\rm{T}}},{P_{f}}}\right)=\frac{{{\zeta_{{\lambda_{1}}}}{\zeta_{{\lambda_{3}}}}-\tau^{2}{\rho}{\xi_{{\lambda_{1}}}}{\xi_{{\lambda_{3}}}}+{\tau}\sqrt{{\delta}-2{\rho}{\xi_{{\lambda_{1}}}}{\xi_{{\lambda_{3}}}}{\zeta_{{\lambda_{1}}}}{\zeta_{{\lambda_{3}}}}}}}{{\zeta_{{\lambda_{3}}}^{2}-\tau^{2}\xi_{{\lambda_{3}}}^{2}}}$ where ${\delta}=\zeta_{{\lambda_{1}}}^{2}\xi_{{\lambda_{3}}}^{2}+\zeta_{{\lambda_{3}}}^{2}\xi_{{\lambda_{1}}}^{2}+\left({\rho^{2}-1}\right)\tau^{2}\xi_{{\lambda_{1}}}^{2}\xi_{{\lambda_{3}}}^{2}$ , $\tau={\Phi^{-1}}\left\{1-{P_{f}}\right\}$ . Here $\zeta_{\lambda_{i}^{p}}={\mathbb{E}}\left({\lambda_{i}^{p}}\right),i=1,3,p=1,2$ , $\zeta_{\lambda_{1}\lambda_{3}}={\mathbb{E}}\left({\lambda_{1}\lambda_{3}}\right)$ and $\xi_{{\lambda_{i}}}^{2}={\mathbb{E}}\left({\lambda_{i}^{2}}\right)-{\left[{{\mathbb{E}}\left({{\lambda_{i}}}\right)}\right]^{2}},i=1,3$ . The related parameters can be shown as follows:

[TABLE]

where there exists $G_{i,j}=\sum\limits_{{l_{1}}=1}^{{L_{{\alpha_{1}},1}}-1}{\sum\limits_{{l_{2}}=1}^{{L_{{\alpha_{2}},2}}-1}{\frac{{\Gamma\left({{l_{1}}+{l_{2}}+{p_{i,j}}-1}\right)}}{{{l_{1}}!{l_{2}}!{3^{{l_{1}}+{l_{2}}+{p_{i,j}}-1}}}}}}-\sum\limits_{{l_{1}}=1}^{{L_{{\alpha_{1}},1}}-1}{\frac{{\Gamma\left({{l_{1}}+{p_{i,j}}-1}\right)}}{{{l_{1}}!{2^{{l_{1}}+{p_{i,j}}-1}}}}}-\sum\limits_{{l_{2}}=1}^{{L_{{\alpha_{2}},2}}-1}{\frac{{\Gamma\left({{l_{2}}+{p_{i,j}}-1}\right)}}{{{l_{2}}!{2^{{l_{2}}+{p_{i,j}}-1}}}}+\Gamma\left({{p_{i,j}}-1}\right)}.$

[TABLE]

where $G_{i,j}^{1}=\sum\limits_{{l_{1}}=1}^{{L_{{\alpha_{1}},1}}-1}{\sum\limits_{{l_{2}}=1}^{{L_{{\alpha_{2}},2}}-1}{\frac{{\Gamma\left({{l_{1}}+{l_{2}}+{p_{i,j}}-1}\right)}}{{{l_{1}}!{l_{2}}!{3^{{l_{1}}+{l_{2}}+{p_{i,j}}-1}}}}}}$ .

[TABLE]

where $\chi={\left({-1}\right)^{{i_{1}}+{i_{3}}+{j_{1}}+{j_{3}}}}$ and we have $G_{i,j}^{2}=\sum\limits_{{l_{1}}=1}^{{L_{{\alpha_{1}},1}}-1}{\frac{{\Gamma\left({{l_{1}}+p_{i,j}^{3}+1}\right)}}{{{2^{{l_{1}}+p_{i,j}^{3}+1}}}}}\left\{{\Gamma\left({p_{i,j}^{1}+1}\right)-\sum\limits_{t=0}^{p_{i,j}^{3}+{l_{1}}}{\frac{{{2^{t}}\Gamma\left({t+p_{i,j}^{1}+1}\right)}}{{t!{3^{t+p_{i,j}^{1}+1}}}}}}\right\}+\sum\limits_{{l_{2}}=1}^{{L_{{\alpha_{2}},2}}-1}{\Gamma\left({p_{i,j}^{3}+1}\right)}\left\{{\frac{{\Gamma\left({{l_{1}}+p_{i,j}^{1}+1}\right)}}{{{2^{{l_{1}}+p_{i,j}^{1}+1}}}}-\sum\limits_{t=0}^{p_{i,j}^{3}}{\frac{{\Gamma\left({{l_{1}}+t+p_{i,j}^{1}+1}\right)}}{{t!{3^{{l_{1}}+t+p_{i,j}^{1}+1}}}}}}\right\}$ . For the parameters therein, there exist ${C_{{N_{\rm{T}}},3}}=2\prod\limits_{i=1}^{3}{\left({{N_{\rm{T}}}-i}\right)!}$ , ${p_{i,j}}={N_{\rm{T}}}+p+i+j-3,p_{i,j}^{1}={N_{\rm{T}}}+p+{i_{1}}+{j_{1}}-5,p_{i,j}^{3}={N_{\rm{T}}}+p+{i_{3}}+{j_{3}}-5$ , ${\alpha_{1}}=2,{\alpha_{2}}=1$ , ${L_{{\alpha_{k}},k}}=\left\{{\begin{array}[]{*{20}{c}}{{N_{\rm{T}}}-4+k+{\alpha_{k}}}&{{\alpha_{k}}<i,k<j}\\ {{N_{\rm{T}}}-2+k+{\alpha_{k}}}&{{\alpha_{k}}\geq i,k\geq j}\\ {{N_{\rm{T}}}-3+k+{\alpha_{k}}}&{otherwise}\end{array}}\right.$ and ${L_{{\beta_{k}},k}}=\left\{{\begin{array}[]{*{20}{c}}{{N_{\rm{T}}}-4+k+{\beta_{k}}}&{{\beta_{k}}<,k<}\\ {{N_{\rm{T}}}-3+k+{\beta_{k}}}&{<{\beta_{k}}<\bar{i},k<,or,<{\beta_{k}}<\bar{j},k<}\\ {{N_{\rm{T}}}-1+k+{\beta_{k}}}&{<{\beta_{k}}<\bar{i},k>\bar{j},or,<{\beta_{k}}<\bar{j},k>\bar{i}}\\ {{N_{\rm{T}}}+k+{\beta_{k}}}&{{\beta_{k}}>\bar{i},k>\bar{j}}\\ {{N_{\rm{T}}}-2+k+{\beta_{k}}}&{otherwise}\end{array}}\right.$ . ${\mathop{\rm sgn}}(\cdot)$ is the Signum function and $\Gamma\left({\cdot}\right)$ is the upper incomplete Gamma function.

Using the expression of $\gamma$ , we establish a single-subcarrier encoding (SSE) principle to encode the number of detected signals into binary digits, i.e, 0 or 1.

Definition 1 (SSE Principle).

One subcarrier can be precisely encoded if, for any $\varepsilon>0$ , there exists a positive number $\gamma\left(\varepsilon\right)$ such that, for all $\gamma\geq\gamma\left(\varepsilon\right)$ , ${P_{f}}$ is smaller than $\varepsilon$ .

Based on the Definition 1, we can encode the $m$ -th subcarrier as a binary digit ${s_{m}}$ according to ${s_{m}}=\left\{{\begin{array}[]{*{20}{c}}1&\rm{{{\cal H}_{0}}~{}is~{}true}\\ 0&{otherwise}\end{array}}\right.$ . We should note that $f\left({{N_{\rm{T}}},{P_{f}}}\right)$ is a monotone decreasing function of two independent variables, i.e., ${N_{\rm{T}}}$ and ${P_{f}}$ . For a given probability constraint ${\varepsilon}^{*}$ , we could always expect a lower bound $\gamma\left({{\varepsilon^{*}}}\right)$ such that $\gamma\left({{\varepsilon^{*}}}\right)=f\left({{N_{\rm{T}}},{\varepsilon^{*}}}\right)$ is satisfied. Under this equation, we could flexibly configure ${N_{\rm{T}}}$ and $\gamma\left({{\varepsilon^{*}}}\right)$ to make ${\varepsilon}^{*}$ approach zero [32]. We also find that the value of $\gamma$ achieving zero- ${P_{f}}$ is decreased with the increase of ${N_{\rm{T}}}$ .

To verify this, we consider three OFDM symbols and flexible configuration of $N_{\rm T}$ , such as, from $32$ to $256$ . We simulate ${P_{f}}$ against various $\gamma$ in Fig. 3. As we can see, the required decision threshold $\gamma$ is decreased with the increase of the number of antennas. This fact also further verifies the feasibility of Definition 1. For example, we can find a desirable point at $\gamma=1.5,N_{\rm T}=256$ where $P_{f}$ is equal to zero, thus facilitating perfect binary coding for each kind of SAPs.

Based on the formulated binary digits for subcarriers in detection, we denote a set of binary code vectors by ${{\cal S}}$ with ${{\cal S}}=\left\{{\left.{\bf{s}}\right|{s_{m}}\in\left\{{0,1}\right\},1\leq m\leq{{L_{s}}}}\right\}$ where ${L_{s}}$ denotes the maximum length of the code. Then, a code frequency domain could be constructed as a set of pairs $\left({{{\bf{s}}},b}\right)$ with ${\bf{s}}\subset{{\cal S}}$ and $1\leq b\leq N_{\rm B}$ where $b$ is an integer representing the subcarrier index of appearance of the code. This is shown in Fig. 4.

III-A2 Binary Codebook Matrix

On the formulated code-frequency domain, we group the binary digits and construct the binary code by presenting a binary codebook as follows:

Definition 2.

Given a $N_{\rm B}\times C$ binary matrix ${\bf C}$ with each element satisfying ${c_{i,j}}\in{\bf{s}}\subset{{\cal S}}$ , we denote the $i$ -th column of ${\bf C}$ by ${\bf c}_{i}$ with ${{\bf{c}}_{i}}={\left[{\begin{array}[]{*{20}{c}}{{c_{1,i}}}&\cdots&{{c_{N_{\rm B},i}}}\end{array}}\right]^{\rm{T}}}$ . We call $\bf C$ a binary codebook matrix and ${{\bf{c}}_{i}}$ a codeword of $\bf C$ of length $N_{\rm B}$ .

The codebook size is equal to the quantization resolution of phases in the set $\cal A$ . Based on this codebook matrix, a mapping from pilot phases, to codewords and further to SAPs is developed in Fig. 4 for pilot conveying.

Pilot conveying provides the basis for pilot separation and identification which also means the codeword separation and identification. Therefore, the performance of CTA becomes totally dependent on the property of binary codebook.

III-B Pilot Separation and Identification Via ICC

In this subsection, we present the ICC theory to optimize the previous binary codebook. Its crucial feature is to create the “difference” by checking the independence of channels experienced by different parties. In what follows, we will introduce the ICC theory by formulating its encoding/decoding principle.

III-B1 Encoding Principle

Based on the Definition 2, we further have the following definition:

Definition 3.

A $N_{\rm B}\times C$ binary matrix $\bf C$ is called a ICC- $\left({N_{\rm B},s}\right)$ code of length $N_{\rm B}$ and order $s$ , if for any column set $\cal Q$ such that $\left|\cal Q\right|=2$ , there exist at least a set $\cal S$ of $s$ rows such that ${c_{i,j}}=1,\forall i,j,{i\in\cal S},j\in{\cal Q}$ .

For this principle, any two codewords in $\bf C$ must superimpose with each other on at least $s$ non-zero digits.

Remark 1.

Basically, $s,s\geq 1$ denotes the discriminatory feature we have created. This feature intrinsically can be seen as a characteristic that there always exist more nonzero digits than zero digits. Returning to the subcarriers, $s$ means the available number of overlapping subcarriers for channel estimation. The overlapping of subcarriers means the coexistence of signals from two nodes on the same subcarrier and same OFDM symbol time.

Theorem 1.

The weight of ICC- $\left({N_{\rm B},s}\right)$ code of length $N_{\rm B}$ and order $s$ satisfies $w=\frac{{N_{\rm B}+s}}{2}$ with $N_{\rm B}\geq s$ . $w$ is an integer smaller than $N_{\rm B}$ .

Proof.

See proof in Appendix VIII-A ∎

Here and in the following sections, we assume the ratio of two integer is always kept to be an integer without loss of generality. Based on the theorem, we can derive the number of codewords or namely the columns in $\bf C$ , by a binomial coefficient $C=\left({\begin{array}[]{*{20}{c}}{{N_{\rm B}}}\\ {\frac{{{N_{\rm B}}+s}}{2}}\end{array}}\right)$ . Then we have the following proposition about the code rate:

Proposition 1.

The code rate of ICC- $\left({N_{\rm B},s}\right)$ code, defined by ${R_{ICC}}=\frac{{{{\log}_{2}}\left(C\right)}}{{{N_{\rm B}}}}$ , is calculated as:

[TABLE]

III-B2 Decoding Procedure

Despite the fact that the encoding principle provides the discriminatory feature of ICC, Alice has to construct a decoding principle according with this feature to perform codeword separation and identification

Considering the hybrid attack environment, Alice could recognize three types of results on the $i$ -th subcarrier $i\in\left[{1,{N_{\rm B}}}\right]$ : **Case 1:**None of Bob and Ava transmits signals. Case 2: Bob and Ava both transmit signals. Case 3: One unknown node (Bob or Ava) transmits signals. Obviously, Alice can identify the behaviors in the first two cases but this cannot work well in Case 3 due to the ambiguity of superposition operation of signals on subcarriers. For simplicity, we define the subcarriers in Case 1 and Case 2 as the deterministic subcarriers while those in Case 3 are defined as the ambiguous subcarriers. The related decoding principle is depicted in Fig. 5.

Now that we have explored the principle of ICC method in theory, we ought to look at its performance evaluation.

Proposition 2.

SEP, defined by error probability of separating two right codewords from the observed codeword, is zero.

It is sufficiently feasible that the distance between Bob and Ava can guarantee that their channels fade independently with each other. The inner product of high-dimensional receiving signals on different subcarriers is therefore always precisely measured under massive antennas, providing the perfect differential decoding and thus perfect pilot separation in Fig. 5.

Theorem 2.

IEP, defined by the error probability of identifying Bob’s codeword from the two separated codewords, is given by

[TABLE]

Proof.

See proof in Appendix VIII-B. ∎

The overall pilot conveying, separation and identification can be seen in part of Fig. 6.

IV FS Channel Estimation and Security Enhancement

In this section, we continue our design work for the ICC-CTA protocol architecture and focus on the FS channel estimation. Two questions will be answered further:

Question 1.

How to estimate FS channels based on the identified pilots?

Question 2.

Is it possible to improve the security performance of ICC theory by further digging the properties of estimated FS channels ?

IV-A FS Channel Estimation

It is well-known that LS estimator is a natural choice when there is no attack. In this subsection, we only consider the FS channel estimation under PTJ attack shown in the attack model in Introduction part.

In principle, performing linear channel estimation requires specifying the receiving signal model and linear decorrelating estimator (LDE) that weights on the receiving signals for channel estimation.

Let us consider the construction of LDE. Basically, Alice examines the decoded pilots which can be, 1) successfully identified; ( no identification error) or 2) confusing (identification error happens). We in this section consider the latter and forget the case without identification error. In this way, the estimator to be designed naturally apply to the case without identification error. Within two OFDM symbol time, i.e., ${k_{0}}$ and ${k_{1}}$ , Alice could collect two confusing pilot vectors defined by ${\bf{x}}_{{\rm{L,1}}}$ and ${\bf{x}}_{{\rm{L,2}}}$ where ${\bf{x}}_{{\rm{L,1}}}{\rm{=}}{\left[{\begin{array}[]{*{20}{c}}{{x_{\rm{B}}}\left[{{k_{0}}}\right]}&{{x_{\rm{B}}}\left[{{k_{1}}}\right]}\end{array}}\right]^{\rm{T}}}$ and ${\bf{x}}_{{\rm{L,2}}}{\rm{=}}{\left[{\begin{array}[]{*{20}{c}}{{x_{\rm{A}}}\left[{{k_{0}}}\right]}&{{x_{\rm{A}}}\left[{{k_{1}}}\right]}\end{array}}\right]^{\rm{T}}}$ . The notation of ${{x_{\rm{B}}}\left[{{k}}\right]}$ can be found in Assumption 3. Here the confusing case happens when Ava keeps the same frequency-domain and time-domain PIP as Bob, which is proved in Remark 2. Then we use the notation of ${{x_{\rm{A}}}\left[{{k}}\right]}$ with the only difference, that is, different value with ${{x_{\rm{B}}}\left[{{k}}\right]}$ .

Then we consider the receiving signal model for which two facts involved should be clarified:

Fact 1.

1) The phenomenon that arbitrary two codewords within ICC- $\left({N_{\rm B},s}\right)$ must overlap at least on $s$ code digits does not mean that the total number of overlapping subcarriers always keeps stable and constant; 2) The superimposed signals on those overlapping subcarriers could be employed for channel estimation and security enhancement whereas the subcarrier on which only one signal exists can be utilized for, but limited to channel estimation.

In order to formulate the receiving signal, we choose two OFDM symbol time, i.e., ${k_{0}}$ and ${k_{1}}$ , and $s,s\geq 1$ randomly-overlapping subcarriers. The randomness here means the random frequency positions of subcarriers. The signals received are stacked as the ${2\times{N_{\rm T}s}}$ matrix ${{\bf{Y}}_{\rm L}}$ , equal to

[TABLE]

where the ${2\times 2}$ matrix ${{\bf{X}}_{{\rm L}}}$ satisfies ${{\bf{X}}_{\rm L}}=\left[{\begin{array}[]{*{20}{c}}{\bf{x}}_{{\rm{L,1}}}&{\bf{x}}_{{\rm{L,2}}}\end{array}}\right]$ . The integrated ${2\times{N_{\rm T}s}}$ channel matrix ${{\bf{H}}_{{{{\rm L}}}}}$ satisfies ${{\bf{H}}_{{{{\rm L}}}}}={\left[{\begin{array}[]{*{20}{c}}{{\bf{h}}_{{\rm{B}},{\rm L}}^{\rm{T}}}&{{\bf{h}}_{{\rm{A}},{\rm L}}^{\rm{T}}}\end{array}}\right]^{\rm{T}}}$ where ${{\bf{h}}_{{\rm{B}},{\rm L}}}=\left[{\begin{array}[]{*{20}{c}}{{{\left({{{\bf{F}}_{{\rm{L}},s}}{\bf{h}}_{\rm{B}}^{1}}\right)}^{\rm{T}}}}&{,\ldots,}&{{{\left({{{\bf{F}}_{{\rm{L}},s}}{\bf{h}}_{\rm{B}}^{{N_{\rm{T}}}}}\right)}^{\rm{T}}}}\end{array}}\right]$ and ${{\bf{h}}_{{\rm{A}},{\rm L}}}=\left[{\begin{array}[]{*{20}{c}}{{{\left({{{\bf{F}}_{{\rm{L}},s}}{\bf{h}}_{\rm{A}}^{1}}\right)}^{\rm{T}}}}&{,\ldots,}&{{{\left({{{\bf{F}}_{{\rm{L}},s}}{\bf{h}}_{\rm{A}}^{{N_{\rm{T}}}}}\right)}^{\rm{T}}}}\end{array}}\right]$ . ${{\bf{F}}_{{\rm{L}},s}}$ is the $s$ -row matrix for which each index of $s$ rows belongs to the set ${\cal P}_{s}$ . ${\bf{N}}_{\rm L}$ represents the ${2\times{N_{\rm T}s}}$ noise matrix with ${{\bf{N}}_{\rm L}}={\left[{\begin{array}[]{*{20}{c}}{{\bf{w}}_{\rm L}^{\rm{T}}\left[{{k_{0}}}\right]}&{{\bf{w}}_{\rm L}^{\rm{T}}\left[{{k_{1}}}\right]}\end{array}}\right]^{\rm{T}}}$ where ${{\bf{w}}_{\rm L}}\left[k\right]=\left[{\begin{array}[]{*{20}{c}}{{{\bf{w}}_{s}^{{1^{\rm{T}}}}}\left[k\right]}&{,\ldots,}&{{{\bf{w}}_{s}^{{N_{\rm{T}}}^{\rm{T}}}}\left[k\right]}\end{array}}\right]$ for $k=k_{0},k_{1}$ .

Remark 2.

Since the specific values of elements in ${\cal P}_{s}$ are randomly distributed between 1 and $N$ , the ${{\bf{F}}_{{\rm{L}},s}}$ is no longer a semi-unitary matrix.

We formulate the sample covariance matrix by ${{\bf{C}}_{{{\bf{Y}}_{\rm L}}}}=\frac{1}{{{N_{\rm{T}}}s}}{{\bf{Y}}_{\rm L}}{\bf{Y}}_{\rm L}^{\rm{H}}$ and then could derive the asymptotically-optimal linear minimum mean square error (LMMSE) estimators as ${{\bf{W}}_{{\rm{B}},{\rm{L}}}}={T_{\rm{B}}}{\bf{x}}_{{\rm{L,1}}}^{\rm{H}}{\bf{C}}_{{{\bf{Y}}_{\rm{L}}}}^{-1}$ and ${{\bf{W}}_{{\rm{A}},{\rm{L}}}}={T_{\rm{A}}}{\bf{x}}_{{\rm{L,2}}}^{\rm{H}}{\bf{C}}_{{{\bf{Y}}_{\rm{L}}}}^{-1},$ where ${T_{\rm{B}}}\buildrel\Delta\over{=}\frac{{{\rm{Tr}}\left({{{\bf{R}}_{{1}}}}\right){\rm{Tr}}\left({{{\bf{R}}_{\rm{F}}}}\right)}}{{{N_{\rm{T}}}s}}$ and ${T_{\rm{A}}}\buildrel\Delta\over{=}\frac{{{\rm{Tr}}\left({{{\bf{R}}_{2}}}\right){\rm{Tr}}\left({{{\bf{R}}_{\rm{F}}}}\right)}}{{{N_{\rm{T}}}s}}$ . Here, there exists ${\rm{Tr}}\left({{{\bf{R}}_{{1}}}}\right)={\rm{Tr}}\left({{{\bf{R}}_{{2}}}}\right)=N_{\rm T}$ and therefore we could define ${T_{\rm{B}}}={T_{\rm{A}}}=T$ .

Finally, the estimated versions of FS channels are respectively derived as

[TABLE]

The normalized mean square error (NMSE) for the two estimations are respectively defined by $\varepsilon_{\rm{B}}^{2}=\frac{{{\mathbb{E}}\left\{{{{\left\|{{{\widehat{\bf{h}}}_{\rm{B,L}}}-{{\bf{h}}_{\rm{B,L}}}}\right\|}^{2}}}\right\}}}{{{N_{\rm{T}}}s}},\varepsilon_{\rm{A}}^{2}=\frac{{{\mathbb{E}}\left\{{{{\left\|{{{\widehat{\bf{h}}}_{\rm{A,L}}}-{{\bf{h}}_{\rm{A,L}}}}\right\|}^{2}}}\right\}}}{{{N_{\rm{T}}}s}}$ . Furthermore, the relationship between the ideal channels with estimated versions can be given by ${{\bf{h}}_{\rm{B,L}}}={\widehat{\bf{h}}_{\rm{B,L}}}+{\varepsilon_{\rm{B}}}{\bf{h}}$ and ${{\bf{h}}_{\rm{A,L}}}={\widehat{\bf{h}}_{\rm{A,L}}}+{\varepsilon_{\rm{A}}}{\bf{h}}^{{}^{\prime}}$ where ${\varepsilon_{\rm{B}}}{\bf{h}}$ is uncorrelated with ${{\bf{h}}_{\rm{B,L}}}$ and ${\varepsilon_{\rm{A}}}{\bf{h}}^{{}^{\prime}}$ is uncorrelated with ${{\bf{h}}_{\rm{A,L}}}$ . Here, the entries of ${\bf{h}}$ and ${\bf{h}}^{{}^{\prime}}$ are i.i.d zero-mean complex Gaussian vectors with each element having unity variance.

Proposition 3.

In the large-scale array regime, there exists $\varepsilon_{\rm{B}}^{2}=\varepsilon_{\rm{A}}^{2}$ at high SNR .

Proof:

See proof in Appendix VIII-C ∎

Remark 3.

When no identification error happens, Alice only utilizes the identified pilots of Bob to derive ${\bf{x}}_{{\rm{L,1}}}$ and finally gets ${\widehat{\bf{h}}_{{\rm{B}},{\rm{L}}}}$ .

IV-B Security Enhancement: Exploiting Spatial Correlation

We are now ready to answer Question 2. Security enhancement in this section means reducing IEP further. To this end, we should focus on the case where Bob gets two confusing pilots, i.e, ${\bf{x}}_{{\rm{L,1}}}$ and ${\bf{x}}_{{\rm{L,2}}}$ and two confusing estimated channels, i.e., ${\widehat{\bf{h}}_{{\rm{B}},{\rm{L}}}}$ and ${\widehat{\bf{h}}_{{\rm{A}},{\rm{L}}}}$ . Even in this case, the identification error will occur only when Ava keeps the same frequency-domain and time-domain PIP as Bob, which is proved in Remark 2. In this section, we will reduce the probability of this happening in an independent dimension, i.e., the angular domain.

IV-B1 Angular Domain Identification

Basically, the process of identification can be modelled as a decision process between two hypotheses:

[TABLE]

For the sake of simplicity, we define several useful eigenvalue decompositions, including ${{\bf{R}}_{i}}={{\bf{U}}_{i}}{{\bf{\Lambda}}_{i}}{\bf{U}}_{i}^{\rm{H}}$ , ${\overline{\bf{R}}_{i}}={{\bf{U}}_{i}}{\overline{\bf{\Lambda}}_{i}}{\bf{U}}_{i}^{\rm{H}}$ , ${{\bf{R}}_{\rm{F}}}={{\bf{V}}_{\rm{f}}}{{\bf{\Sigma}}_{\rm{f}}}{\bf{V}}_{\rm{f}}^{\rm{H}}$ and ${\overline{\bf{R}}_{\rm{F}}}={{\bf{V}}_{\rm{f}}}{\overline{\bf{\Sigma}}_{\rm{f}}}{\bf{V}}_{\rm{f}}^{\rm{H}}$ . Here, ${\bf{U}}_{i}$ and ${{\bf{V}}_{\rm{f}}}$ denote the eigenvector matrices and eigenvalue matrices satisfy ${{\bf{\Lambda}}_{i}}={\rm{diag}}\left\{{{{\left[{\begin{array}[]{*{20}{c}}{{\lambda_{i,1}}}&\cdots&{{\lambda_{i,{\rho_{i}}}}}&0&\cdots&0\end{array}}\right]}^{\rm{T}}}}\right\}$ , ${\overline{\bf{\Lambda}}_{i}}={\rm{diag}}\left\{{{{\left[{\begin{array}[]{*{20}{c}}{\lambda_{i,1}^{-1}}&\cdots&{\lambda_{i,{\rho_{i}}}^{-1}}&0&\cdots&0\end{array}}\right]}^{\rm{T}}}}\right\}$ , ${{\bf{\Sigma}}_{\rm{f}}}={\rm{diag}}\left\{{{{\left[{\begin{array}[]{*{20}{c}}{{\lambda_{{\rm{f}},1}}}&\cdots&{{\lambda_{{\rm{f}},{\rho_{\rm{f}}}}}}&0&\cdots&0\end{array}}\right]}^{\rm{T}}}}\right\}$ , ${\overline{\bf{\Sigma}}_{\rm{f}}}={\rm{diag}}\left\{{{{\left[{\begin{array}[]{*{20}{c}}{\lambda_{{\rm{f}},1}^{-1}}&\cdots&{\lambda_{{\rm{f}},{\rho_{\rm{f}}}}^{-1}}&0&\cdots&0\end{array}}\right]}^{\rm{T}}}}\right\}$ .

We build up an error decision function as

[TABLE]

where $f\left({\bf{r}}\right)={\bf{r}}\left({{\overline{\bf{R}}_{1}}\otimes{\overline{\bf{R}}_{\rm{F}}}}\right){{\bf{r}}^{\rm{H}}}$ . Then we have the following theorem to identify two hypotheses.

Theorem 3.

When ${N_{\rm{T}}}\to\infty$ , the error decision function can be simplified as:

[TABLE]

Proof:

See proof in Appendix VIII-D ∎

The further simplification of above equation requires exploiting the relationship between ${{\bf{R}}_{1}}$ and ${{\bf{R}}_{2}}$ . Backing to the Eq. (15), we know that the trace function satisfies ${\rm{Tr}}\left({{{\bf{R}}_{2}}{{\overline{\bf{R}}}_{1}}}\right)\leq{\rm{Tr}}\left({{{\bf{\Lambda}}_{2}}{\bf{U}}_{2}^{\rm{H}}{{\bf{U}}_{1}}{{\overline{\bf{\Lambda}}}_{1}}}\right)={\rm{Tr}}\left({{{\bf{\Lambda}}_{2,{\rm{p}}}}\overline{\bf{U}}_{2}^{\rm{H}}{{\overline{\bf{U}}}_{1}}{{\overline{\bf{\Lambda}}}_{1,{\rm{p}}}}}\right)$ where ${{\bf{\Lambda}}_{i,{\rm{p}}}}$ and ${\overline{\bf{\Lambda}}_{i,{\rm{p}}}}$ are respectively defined by ${{\bf{\Lambda}}_{i,{\rm{p}}}}={\rm{diag}}\left\{{{{\left[{\begin{array}[]{*{20}{c}}{{\lambda_{i,1}}}&\cdots&{{\lambda_{i,{\rho_{i}}}}}\end{array}}\right]}^{\rm{T}}}}\right\}$ and ${\overline{\bf{\Lambda}}_{i,{\rm{p}}}}={\rm{diag}}\left\{{{{\left[{\begin{array}[]{*{20}{c}}{\lambda_{i,1}^{-1}}&\cdots&{\lambda_{i,{\rho_{i}}}^{-1}}\end{array}}\right]}^{\rm{T}}}}\right\}$ . The $N_{{\rm T}}\times{\rho_{{i}}}$ matrix ${{{\overline{\bf{U}}}_{i}}}$ denotes the tall unitary matrix of channel covariance eigenvectors ${{\bf{U}}_{i}}$ . As discussed in [22], ${\overline{\bf{U}}_{2}^{\rm{H}}{{\overline{\bf{U}}}_{1}}}$ can be approximated using ${\bf{F}}_{{{\cal S}_{2}}}^{\rm{H}}{{\bf{F}}_{{{\cal S}_{1}}}}$ . We define ${{\cal S}_{1}}\cap{{\cal S}_{2}}={{\cal S}_{3}}$ where ${{\cal S}_{i}}$ denotes the support of ${S}_{i}\left(x\right)$ , a uniformly-bounded absolutely-integrable function satisfying ${S_{i}}\left(x\right)=\frac{1}{{2\Delta}}\sum\limits_{0\in\left[{D\sin\left({{\theta_{i}}-\Delta}\right)+x,D\sin\left({{\theta_{i}}+\Delta}\right)+x}\right]}{\frac{1}{{\sqrt{{D^{2}}-{x^{2}}}}}}$ , over $x\in\left[{-\frac{1}{2},\frac{1}{2}}\right]$ . There exists ${{\bf{F}}_{{{\cal S}_{i}}}}=\left({{{\bf{f}}_{n}}:n\in{{\cal J}_{{{\cal S}_{i}}}}}\right)$ where ${{\cal J}_{{{\cal S}_{i}}}}=\left\{{n,\left[{{n\mathord{\left/{\vphantom{n{{N_{\rm{T}}}}}}\right.\kern-1.2pt}{{N_{\rm{T}}}}}}\right]\in{{\cal S}_{i}},n=0,\ldots,{N_{\rm{T}}}-1}\right\}$ . We then discuss the influence of ${{\cal S}_{3}}$ on ${\rm{Tr}}\left({{{\bf{R}}_{2}}{{\overline{\bf{R}}}_{1}}}\right)$ . When ${{\cal S}_{3}}=\emptyset$ , we can have ${\rm{Tr}}\left({{{\bf{R}}_{2}}{{\overline{\bf{R}}}_{1}}}\right)=0$ . When ${{\cal S}_{3}}\neq\emptyset$ , we assume ${{\cal S}_{3}}={\cal P}_{a}$ and have

[TABLE]

This is because the eigenvectors labeled by the indexes out of the interacted set ${{\cal S}_{3}}$ are mutually orthogonal [22].

Theorem 4.

When ${N_{\rm{T}}}\to\infty$ , there always exists $\sum\limits_{j=1}^{a}{\frac{{{\lambda_{2,{i_{j}}}}}}{{{\lambda_{1,{i_{j}}}}}}}=a$ . If ${\theta_{1}}\neq{\theta_{2}}$ , there must exist $a<{\rho_{1}}$ and $\Delta f>0$ . Otherwise if ${\theta_{1}}={\theta_{2}}$ , there must exist $a={\rho_{1}}$ and $\Delta f=0$ .

Proof:

See proof in Appendix VIII-E. ∎

Thus far, we can know that Ava is restricted on a line lying the center of clusters surrounding Bob, otherwise, its attack is invalidated, which shows another potential of angular domain identification in countering attack.

IV-B2 Combine Angular Domain with Code Domain to Enhance Security

Since the pilot identification breaks down iff ${\theta_{1}}={\theta_{2}}$ , we have the following theorem:

Theorem 5.

Under the assumption of mean AoA obeying CPD, the IEP ${{P}_{\rm{I}}}$ is equal to zero. Under the assumption of mean AoA obeying DPD, for instance, uniform distribution with interval length $K$ , the IEP ${{P}_{\rm{I}}}$ is updated to be $\frac{{{P_{\rm{I}}}}}{K}$ .

The proof is institutive since we consider two independent dimensions, that is, angular domain and code domain, to reduce IEP. The IEP is lowered to ${{P}_{\rm{I}}}$ by using coding approach and further reduced to $\frac{{{P_{\rm{I}}}}}{K}$ by exploiting angular domain identification. In this sense, the security provided on the code domain by the ICC-CTA protocol is enhanced at the same time by fully exploiting the angular domain. Finally, we give the overall process of channel estimation and security enhancement in Algorithm 4.

Remark 4.

We aim to evaluate the influence of different PIP principles of Ava on Theorem 4. We need to stress that the key lies in the following two aspects. On one hand, Ava selects different frequency-domain PIP principles with Bob. It adopts different phases across its own activated subcarriers in order to protect its own correlation property from being exploited by Alice. In this case, the original DFT submatrix in ${\bf H}_{\rm L}$ of Eq. (11) is now represented by ${\widetilde{\bf{F}}_{{\rm{L}},{{s}}}}$ with ${\widetilde{\bf{F}}_{{\rm{L}},{{s}}}}={\bf{\Psi}}{{\bf{F}}_{{\rm{L}},{{s}}}}$ . Here, ${\bf{\Psi}}=diag\left\{{{{\left[{\begin{array}[]{*{20}{c}}{{e^{j{\beta_{1}}}}}&\cdots&{{e^{j{\beta_{s}}}}}\end{array}}\right]}^{\rm{T}}}}\right\}$ represents the strategies of Ava across subcarriers on which ${\beta_{i}},i=1,\ldots,{{{s}}}$ are random. As we can see, there exists ${\widetilde{\bf{R}}_{\rm{F}}}=\widetilde{\bf{F}}_{{\rm{L}},{{s}}}^{\rm{T}}\widetilde{\bf{F}}_{{\rm{L}},{{s}}}^{*}={{\bf{R}}_{\rm{F}}}$ . This does not affect the value of function $f$ and thus not violate the Theorem 4. On the other hand, we examine the case where Ava adopts different time-domain PIP principles with Bob. In this case, LLE vector derived by Bob is not optimal for Ava’s channel estimation since the final pilot vectors demapped from Ava’s SAPs are actually wrong for channel estimation. The elements of ${\widehat{\bf{h}}_{{\rm{A}},{\rm{L}}}}$ in Eq. (12) are further imposed on significant estimation error. Thus Bob acquires very large $\varepsilon_{\rm{A}}^{2}$ , compared with $\varepsilon_{\rm{B}}^{2}$ derived under asymptotically-optimal LMMSE estimation. Finally, the value of $\Delta f$ must be much larger than zero, which does not violate the Theorem 4. Actually, this can guarantee perfect security even ${\theta_{1}}={\theta_{2}}$ .

In summary, those PIP principles different with Bob’s strategy can benefit Alice and are not prudent for Ava.

V Security-Instability Tradeoff in CIR Estimation

Security advantages originate from the diversified SAPs using ICC- $\left({N_{\rm B},s}\right)$ . However, various superimposed modes of SAPs (SSAPs) affect the stability of CIR estimation significantly as those subcarriers in activation are utilized for estimating CIR samples from estimated FS channels. To begin with, we show when and why this instability could occur and then gradually wean ourselves from the constraint of instability to find a tradeoff between the security and instability in CIR estimations. Finally, we present an optimal code rate under which a sufficiently-stable estimation performance is secured.

V-A Essence of Unstable CIR Estimation: Random SSAPs

Recall that each pilot phase in use has been mapped to one unique SAP and thus randomized pilots mean random SSAPs. When random SAPs from Bob and Ava are superimposed in wireless environment, Alice will observe two typical SSAPs which both incur unstable performance. This can be seen in Fig. 7. The key question is: How to evaluate and reduce the influence of the instability resulting from random SSAPs on CIR estimation ?

To answer this question, let us focus on the mathematical expression of CIR estimation. The CIR generally satisfies the equation ${{\bf{h}}_{{\rm{B}},{\rm{L}}}}\buildrel\Delta\over{=}{{\bf{g}}_{{\rm{B}},{\rm{L}}}}\left({{\bf{R}}_{\rm{1}}^{{1\mathord{\left/{\vphantom{12}}\right.\kern-1.2pt}2}}\otimes{\bf{F}}_{{\rm{L}},{{s}}}^{\rm{T}}}\right)$ where ${{\bf{g}}_{{\rm{B}},{\rm{L}}}}$ is the integrated $1\times N_{\rm T}L$ CIR vector of i.i.d. ${\cal C}{\cal N}\left({0,{1}}\right)$ random variables. Given ${\bf{R}}_{1}$ and ${{\bf{h}}_{{\rm{B}},{\rm{L}}}}$ , the estimation of ${{\bf{g}}_{{\rm{B}},{\rm{L}}}}$ , denoted by ${\widehat{\bf{g}}_{{\rm{B}},{\rm{L}}}}$ , will fluctuate under various forms of ${\bf{F}}_{{\rm{L}},{{s}}}^{\rm{T}}$ . Note that the structure of ${\bf{F}}_{{\rm{L}},{{s}}}^{\rm{T}}$ is determined by the number $s$ and the frequency positions of overlapping subcarriers. Therefore, the key factor influencing the stability of ${\widehat{\bf{g}}_{{\rm{B}},{\rm{L}}}}$ is ${\cal P}_{s}$ .

Specifically, we examine Fig. 8 (a). When $s<L$ , the CIR estimation from ${{\bf{h}}_{{\rm{B}},{\rm{L}}}}$ is under-determined with low estimation precision. We turn to consider $s\geq L$ in Fig. 8 (b) where we could always find a non-underdetermined recovery model.

Nevertheless, the fluctuation of $s$ will directly influence the estimation stability. Particularly, the random value of those elements in ${\cal P}_{s}$ will cause unequally-spaced overlapping subcarriers which continue to cause instability and limited estimation precision. To show this mathematically, we begin by giving the CIR estimation ${\widehat{\bf{g}}_{{\rm{B}},{\rm{L}}}}$ as ${\widehat{\bf{g}}_{{\rm{B}},{\rm{L}}}}={\widehat{\bf{h}}_{{\rm{B}},{\rm{L}}}}\left\{{\overline{\bf{R}}_{\rm{1}}^{{{1}\mathord{\left/{\vphantom{{1}2}}\right.\kern-1.2pt}2}}\otimes\left({{\bf{F}}_{{\rm{L}},{{s}}}^{\rm{*}}{\bf{R}}_{\rm{F}}^{-1}}\right)}\right\}$ . By using ${{\bf{h}}_{\rm{B,L}}}={\widehat{\bf{h}}_{\rm{B,L}}}+{\varepsilon_{\rm{B}}}{\bf{h}}$ , we then expand the equation into ${\widehat{\bf{g}}_{{\rm{B}},{\rm{L}}}}=L{{\bf{g}}_{{\rm{B}},{\rm{L}}}}\left\{{\left({{\bf{R}}_{\rm{1}}^{{1\mathord{\left/{\vphantom{12}}\right.\kern-1.2pt}2}}\overline{\bf{R}}_{\rm{1}}^{{{1}\mathord{\left/{\vphantom{{1}2}}\right.\kern-1.2pt}2}}}\right)}\right\}-{{\varepsilon_{\rm{B}}}}{\bf{h}}\left\{{\overline{\bf{R}}_{\rm{1}}^{{{1}\mathord{\left/{\vphantom{{1}2}}\right.\kern-1.2pt}2}}\otimes\left({{\bf{F}}_{{\rm{L}},{{s}}}^{\rm{*}}{\bf{R}}_{\rm{F}}^{-1}}\right)}\right\}$ . Given the correlation matrix $\bf R_{1}$ and ${{\varepsilon_{\rm{B}}}}$ , the minimization of $\overline{\varepsilon}_{\rm{B}}^{2}$ defined by the equation $\overline{\varepsilon}_{\rm{B}}^{2}={{{\mathbb{E}}\left\{{{{\left\|{{{\widehat{\bf{g}}}_{{\rm{B}},{\rm{L}}}}-{{\bf{g}}_{{\rm{B}},{\rm{L}}}}}\right\|}^{2}}}\right\}}\mathord{\left/{\vphantom{{E\left\{{{{\left\|{{{\widehat{\bf{g}}}_{{\rm{B}},{\rm{L}}}}-{{\bf{g}}_{{\rm{B}},{\rm{L}}}}}\right\|}^{2}}}\right\}}{{N_{\rm{T}}}L}}}\right.\kern-1.2pt}{{N_{\rm{T}}}L}}$ , is equivalent to:

[TABLE]

For this optimization problem, the minimization is achieved iff ${{{\bf{R}}_{\rm{F}}}}$ has the identical eigenvalues, and thus the overlapping subcarriers are equally spaced, satisfying

[TABLE]

The total number of subcarriers within the interval that extends from the first overlapping position to the last one can be derived as:

[TABLE]

Hinted by this, we know how any mismatch between the indices of ${{\cal P}}_{s}$ with those of ${\overline{\cal P}}_{s}$ could increase the estimation error and instability.

Based on above observations, we define the condition of being stable (CS) for CIR estimation as follows:

Definition 4 (CS).

The overlapping subcarriers are equally spaced and meet the number constraint, that is , $s\geq L$ and $s\geq s^{*}$

Returning to examine the previous SSAPs in Fig. 7, we can know that SAPs are diversified, completely under the direction of ICC- $\left({N_{\rm B},s}\right),s\geq 1$ code. Basically, the instability originates from the random use of codewords and the constraint of $N_{\rm B}$ and $w$ in ICC- $\left({N_{\rm B},s}\right),s\geq 1$ code. Therefore. any mechanism for reduction of instability must reconsider the code design. In this design process, we must deal with the relationship between security and instability.

V-B Security-Instability Tradeoff

To begin with, we identify and define the instability by the following metirc:

Definition 5.

The KPI indicating the instability of CIR estimation using ICC- $\left({N_{\rm B},s}\right)$ code is defined by $S_{T}\left({{N_{\rm{B}}},w,{s^{*}}}\right)={1\mathord{\left/{\vphantom{1{{P_{\rm{s}}}\left({{N_{\rm{B}}},w,{s^{*}}}\right)}}}\right.\kern-1.2pt}{{P_{\rm{s}}}\left({{N_{\rm{B}}},w,{s^{*}}}\right)}}$ with

[TABLE]

where ${{C^{2}}\left({{N_{\rm{B}}},w,s^{*}}\right)}$ denotes the total possibilities of codeword pair for which each codeword represents the one choice from one node, i.e. Bob or Ava. ${\kappa\left({{N_{\rm{B}}},w,s^{*}}\right)}$ denotes the number of codeword pairs that satisfy CS when they overlap with each other.

In this definition, we should note that ${\kappa\left({{N_{\rm{B}}},w,s^{*}}\right)}$ relies on a fundamental fact:

Fact 2.

1) The number of zero digits in each codeword determines how frequency CS can be broken down; 2) Those zero digits, with uniform spacing, incur the most severe interference on CIR estimation accuracy.

This fact also determines why the instability of CIR estimation could occur. We define the Optimal Stability (OS) condition by:

Definition 6.

There always exists ${P_{\rm{s}}}\left({{N_{\rm{B}}},w,s^{*}}\right)=1$ under arbitrary SSAPs.

V-B1 Low- $N_{\rm B}$ scenario

Without loss of generality, we consider the low- $N_{\rm B}$ scenario where $N_{\rm B}$ is equal to $s^{*}$ . Obviously, CS is satisfied when ${{\cal P}}_{s}$ is equal to the set ${\overline{\cal P}}_{s}$ . In this case, we derive the expression of instability, defined by

[TABLE]

with $s^{*}\leq w\leq N_{\rm B}\leq\overline{N}$ .

Based on this equation, we could characterize the relationship between the security (defined by $S_{E}$ equal to ${1\mathord{\left/{\vphantom{1{{P_{\rm{I}}}}}}\right.\kern-1.2pt}{{P_{\rm{I}}}}}$ ) and instability (i.e., $S_{T}$ ) as a fundamental tradeoff existing in the whole uplink training process:

Fact 3 (A Realistic Tradeoff).

*The lower code rate brings the lower instability (Eq. (21)); However, the lower code rate causes the higher security (Theorem 2 and Theorem 5). ***

Remark 5.

For a mean AoA model with CPD, the tradeoff does not exist since $P_{\rm I}$ is always zero and thus independent with the stability of CIR estimation. However, this is not realistic since the mean AoA is discretely distributed in practical scenarios with limited clusters. In this sense, the security-stability tradeoff is necessary and inevitable.

The drawback of low- $N_{\rm B}$ configuration is that there is no security when Alice expects to achieve OS condition and thus $w$ should be equal to $N_{\rm B}$ according to Eq. (21). In other words, the tradeoff under OS condition cannot provide desirable security guarantee when $N_{\rm B}$ is low. See the example in Fig. 8 (c).

We always expect to maximize the lower bound of security by jointly optimizing $N_{\rm B}$ and $w$ . This object motives us to turn to large- $N_{\rm B}$ case.

V-B2 High- $N_{\rm B}$ scenario and Optimally-Stable Tradeoff

In this part, we aim to determine the optimal ${R_{s}}$ such that the security is maximized while the OS condition is satisfied. Maximizing security means maximizing the code rate since the security is a monotonic increasing function of code rate ${R_{s}}$ . The optimization problem, also namely Optimally-Stable Tradeoff problem, can be formulated by:

[TABLE]

Before solving this problem, we need to fully understand ${P_{\rm{s}}}\left({{N_{\rm{B}}},w,s^{*}}\right)=1$ under high- ${N_{\rm{B}}}$ . According to Fact 2 and Fig. 8 (d), we have the following propositon:

Proposition 4.

OS condition is satisfied iff the number of adjacent non-zero digits between any adjacent zero digits is at least equal to $s^{*}$ when zero digits are equally spaced for each of ICC codeword. We say this is named as the $s^{*}$ -OS condition.

Inspired by this, we should optimize ${N_{\rm B}}$ and $w$ such that the non-zero digits are constrained to create the $s^{*}$ -OS condition. Under $s^{*}$ -OS condition, ${N_{\rm B}}$ should always satisfy

[TABLE]

The weight $w$ of ICC- $\left({N_{\rm B},s}\right)$ should therefore satisfy $\frac{{{s^{*}}}}{{{s^{*}}+1}}\left({{N_{\rm{B}}}+1}\right)\leq w\leq{N_{\rm{B}}}\leq\overline{N}$ . Especially, when $w$ is equal to ${N_{\rm{B}}}$ , we have ${s^{*}}=w$ . This corresponds to the low- $N_{\rm{B}}$ case.

In this way, the $s^{*}$ -OS condition is represented by the Eq. (23). And the maximization operation should be constrained by this equation.

Theorem 6.

The optimal code rate maximizing the security while maintaining the $s^{*}$ -OS condition can be calculated by

[TABLE]

The weight and order of optimally-stable code satisfy $w=\frac{{{s^{*}}}}{{{s^{*}}+1}}\left({{N_{\rm{B}}}+1}\right)$ and $s=\frac{{\left({{s^{*}}-1}\right){N_{\rm{B}}}+2{s^{*}}}}{{{s^{*}}+1}}$ .

Proof.

See proof in Appendix VIII-F. ∎

By exploiting the property that there exists $\left({\begin{array}[]{*{20}{c}}n\\ k\end{array}}\right)\geq{{{n^{k}}}\mathord{\left/{\vphantom{{{n^{k}}}{{k^{k}}}}}\right.\kern-1.2pt}{{k^{k}}}}$ for all values of $n$ and $k$ , the lower bound approximation of optimally-stable ICC- $\left({N_{\rm B},s}\right)$ code can be given by:

[TABLE]

with $\eta{\rm{=}}\frac{{\left({L-1}\right)N+2L}}{{\left({L-1}\right)N+L}}\frac{{{N_{\rm{B}}}}}{{{N_{\rm{B}}}+1}}$ .

VI Numerical Results

In this section, numerical simulations are presented to evaluate above-mentioned techniques during the CTA process.

VI-A * Numerical Verification for Theorem 4*

We confirm the feasibility of Theorem 4 in Fig. 9 (a) where the strength of $\Delta f$ is plotted against $\theta_{i},i=1,2$ by configuring $N_{\rm T}=100$ and $K=5$ . To be more specific, the examples of $\Delta f$ are derived from the estimated FS channels and the correlation model in Eq. (4). $\theta_{i},i=1,2$ are assumed to lie within the set $\left\{{-\frac{\pi}{4},-\frac{\pi}{7},0,-\frac{\pi}{7},-\frac{\pi}{4}}\right\}$ . As we can see, the identification error happens when $\Delta f=0$ , that is, $\theta_{1}=\theta_{2}$ . In this sense, we verified the feasibility of Theorem 4 and could envision that the IEP is zero under the assumption of the mean AoA with CPD.

VI-B Security-Instability Tradeoff Curve

In this subsection, we focus on the trade-off related results. We evaluate in Fig. 9 (b) the fluctuation of NMSE employing ICC- $\left({N_{\rm B},s}\right)$ code under various SSAPs, and then show how the security-instability tradeoff is developed in Fig. 9 (c).

In Fig. 9 (b), we take the cumulative distribution function (CDF) of NMSE as the evaluation matric. The simulation is averaged over 100 runs, each of which perform 1000 channel average. We further consider that $N_{\rm B}=128$ are provided and at most $s=L=6$ subcarriers overlap for channel estimation. As a benchmark for measuring the instability, we simulate the ideal case where six overlapping subcarriers are always right selected. As we can see, the CDF of NMSE under this ideal case is always stable. However, in practice, ICC- $\left({128,6}\right)$ code causes an undesirable status where the phenomenon of less-overlapping and unequally-spaced subcarriers occurs inevitably. This induces significant fluctuations of NMSE. As a consequence, we present in Fig. 9 (c) the possibility of tradeoff between the security and instability by using parameters $P_{\rm{I}}*10^{2}$ and $P_{\rm{s}}^{\frac{1}{4}}$ . We consider $N_{\rm B}=s^{*}=\frac{{L-1}}{L}N+1$ where the FFT points $N$ is set to be either 16 or 32 while $L$ and $K$ are respectively fixed to be 4 and 10. As we can see, there exists a tradeoff curve on which the security has to be sacrificed to maintain a certain level of stability.

VI-C * Security Under Optimally-Stable Tradeoff*

For this part, we should note that the IEP is zero under the assumption of mean AoA obeying CPD. We consider the DPD model for the sake of practical analysis, and further simulate the IEP performance corresponding to the optimally-stable tradeoff in Fig. 10 (a). In this figure, the 3D plot of IEP is sketched versus $N_{\rm B}$ and $s^{*}$ . We consider $s^{*}$ to be from 4 to 12 and $K$ to be 20. $k$ , related to $N_{\rm B}$ , satisfies $N_{\rm B}+1=\left({s^{*}+1}\right)k$ . As we can see, IEP decreases with the increase of $N_{\rm B}$ and $s^{*}$ . On one hand, the initial value of $s^{*}$ determines how fast the IEP can decrease and what is the minimum value IEP can achieve. For example, IEP decreases faster with the increase of $s^{*}$ , and $P_{\rm I}$ achieves as low as $10^{-3}$ at $k=15$ when $s^{*}$ is equal to 12. In this case, the number of occupied subcarriers is required to be $N_{\rm B}=195$ . On the other hand, the initial value of $N_{\rm B}$ also determines the tendency for the variable $s^{*}$ to be reduced. Specially, at a large $N_{\rm B}$ , a decreasing function $P_{\rm I}$ of $s^{*}$ , at least within the interval $\left[{4,12}\right]$ , can be created.

VI-D * Code Rate Under Optimally-Stable Tradeoff*

In Fig. 10 (b), we evaluate the code rate under the optimally-stable tradeoff. Before that, we consider the Eq. (9) for comparison and sketch the curve of maximum code rate under $s=1$ over $k$ . On this reference curve, the code rate increases and gradually approach 1 with the increase of $k$ . As to the optimally-stable tradeoff, we simulate the curves of code rate shown in Eq. (24) over $s^{*}$ from 4 to 7. As we can see, the code rate in this case is reduced compared with that without tradeoff consideration. With the increase of $s^{*}$ , we have to get less code rate. For example, the code rate under $s^{*}=7,k=21$ and thus $N_{\rm B}=167$ is equal to 0.5083, which means the rate loss of 0.4205 (almost 45 percent) is caused by the tradeoff at this point.

VI-E CIR Estimation Under Optimally-Stable Tradeoff

Finally, we stimulate the performance of stable CIR estimation in Fig. 10 (c) where the NMSE is presented versus SNR of Bob under different number of antennas. $L$ and $N_{\rm B}$ are respectively configured to be 6 and 256. Here, we consider the estimation using Eq. (12) and assume perfect identification for attacks. The performance under this type of estimator is not influenced by the specific value of $\rho_{\rm A}$ due to the subspace projection property. We configure $\rho_{\rm A}=\rho_{\rm B}$ and do not consider the case where there is no attack since in this case LS estimator is a natural choice. For the simplicity of comparison, we only present the channel estimation under PTS attack because the estimation error floor under PTN and PTJ attack can be easily understood to be very high. The binned scheme proposed in [24] is simulated as an another comparison scheme. As we can see, PTS attack causes a high-NMSE floor on CIR estimation for Bob. This phenomenon can also be seen in the binned scheme. However, the estimation in our proposed framework breaks down this floor and its NMSE gradually decreases with the increase of transmitting antennas. Also, we consider perfect MMSE to be a performance benchmark for which perfect pilot tones, including Ava’s pilot tones, are assumed to be known by Alice. We find that the NMSE brought in our scheme gradually approaches the level under perfect MMSE with the increase of antennas. That’s because the asymptotically-optimal estimator highly relies on the statistical covariance matrix which is determined by the number of antennas.

VII Conclusions

This paper investigated the issue of pilot-aware attack on the uplink CTA in large-scale MISO-OFDM systems. We proposed a secure ICC-CTA protocol in which pilot tones, usually exposed in public, are now enabled to be shared between legitimate transceiver pair, with high security under hybrid attack environment. Theoretically, we discovered an critical fact that this architecture could exhibit a perfect security if the CPD model of mean AoA was considered. In practical scenarios with the DPD model of mean AoA, this architecture was required to make tradeoff between the security and stability of CIR estimation. We showed that given a suitable code rate, stable CIR estimation could be always maintained under a high security. We conclude this paper by pointing out some interesting topics for future work. As one interesting direction, more delicate optimization on the tradeoff could be further researched such that the code rate under optimally-stable tradeoff could be higher. The extension to solving the issue of pilot contamination in massive MIMO systems could be another interesting direction since the pilot phases guaranteed by our scheme can be superimposed onto the traditional optimized pilots and thus control even avoid pilot contamination in multi-cell scenarios with only three OFDM symbol time.

VIII Appendix

VIII-A Proof of Theorem 1

Since codewords in this constant-weight code are constrained to be with same and fixed length, the number of overlapping digits achieves its minimum only when the zero digits of each codeword are fully occupied. In this case, the remanent digits, i.e., the overlapping digits, account for ${2w-N_{\rm B}}$ which should be equal to $s$ and less than $w$ . Therefore, we can prove the theorem.

VIII-B Proof of Theorem 2

Considering the hybrid attack, we know that there exists the possibility of $2^{N_{\rm B}}$ codewords to appear. Two interpreted codewords derived under ${\cal A}_{1}$ and ${\cal A}_{0}$ , if satisfying $N_{1}^{\rm d}+N_{1,1}^{\rm s}=N_{1}^{\rm d}+N_{1,0}^{\rm s}$ , will confuse Alice. In this case, each assumption is decided with the probability of $0.5$ . The possible number of codewords that satisfy this condition is equal to ${{\frac{{{N_{\rm B}}!}}{{\left({\frac{{{N_{\rm B}}+s}}{2}}\right)!\left({\frac{{{N_{\rm B}}-s}}{2}}\right)!}}}}$ . One exception is when the codeword of Ava is identical to that of Bob. In this case, the codeword can be uniquely identified. Finally, there exists the possibility of ${{\frac{{{N_{\rm B}}!}}{{\left({\frac{{{N_{\rm B}}+s}}{2}}\right)!\left({\frac{{{N_{\rm B}}-s}}{2}}\right)!}}}-1}$ codewords that could cause identification errors. Then the ultimate IEP can be proved.

VIII-C Proof of Proposition 3

Taking Bob for example, we can derive the estimation error as $\varepsilon_{\rm{B}}^{2}=T\left({1-T{\bf{x}}_{{\rm{L}},{\rm{1}}}^{\rm{H}}{\bf{C}}_{{{\bf{Y}}_{\rm{L}}}}^{-1}{{\bf{x}}_{{\rm{L}},{\rm{1}}}}}\right)$ . Now let us focus on the term ${{\bf{C}}_{{{\bf{Y}}_{\rm L}}}}$ . We can express ${{\bf{h}}_{{\rm{B}},{\rm{L}}}}$ as ${{\bf{g}}_{{\rm{B}},{\rm{L}}}}\left({{\bf{R}}_{\rm{1}}^{{1\mathord{\left/{\vphantom{12}}\right.\kern-1.2pt}2}}\otimes{\bf{F}}_{{\rm{L}},{{s}}}^{\rm{T}}}\right)$ where ${{\bf{g}}_{{\rm{B}},{\rm{L}}}}$ is the integrated $1\times N_{\rm T}L$ CIR vector of i.i.d. ${\cal C}{\cal N}\left({0,{1}}\right)$ random variables. Based on the Lemma B.26 in [33], ${{\bf{C}}_{{{\bf{Y}}_{\rm L}}}}$ is then transformed into ${{\bf{C}}_{{{\bf{Y}}_{\rm L}}}}\xlongrightarrow[{N_{\rm T}}\to\infty]{\rm{a.s.}}\frac{1}{{{N_{\rm{T}}}s}}{{\bf{X}}_{{\rm{L}}}}{{\bf{R}}_{\rm C}}{\bf{X}}_{{\rm{L}}}^{\rm{H}}+{\sigma^{2}}{{\bf{I}}_{2}}$ . Here, the $2\times 2$ matrix ${{\bf{R}}_{\rm C}}$ satisfies ${{\bf{R}}_{\rm{C}}}={\rm{diag}}\left\{{{{\left[{\begin{array}[]{*{20}{c}}{{\rm{Tr}}\left({{{\bf{R}}_{1}}}\right){\rm{Tr}}\left({{{\bf{R}}_{\rm{F}}}}\right)}&{{\rm{Tr}}\left({{{\bf{R}}_{2}}}\right){\rm{Tr}}\left({{{\bf{R}}_{\rm{F}}}}\right)}\end{array}}\right]}^{\rm{T}}}}\right\}$ . Therefore, we can derive $\varepsilon_{\rm{B}}^{2}=T\left\{{1-{\bf{x}}_{{\rm{L}},{\rm{1}}}^{\rm{H}}{{\left({{{\bf{X}}_{\rm{L}}}{\bf{X}}_{\rm{L}}^{\rm{H}}}\right)}^{-1}}{{\bf{x}}_{{\rm{L}},{\rm{1}}}}}\right\}$ at high SNR region. In the same way, we can derive $\varepsilon_{\rm{A}}^{2}=T\left\{{1-{\bf{x}}_{{\rm{L}},{\rm{2}}}^{\rm{H}}{{\left({{{\bf{X}}_{\rm{L}}}{\bf{X}}_{\rm{L}}^{\rm{H}}}\right)}^{-1}}{{\bf{x}}_{{\rm{L}},{\rm{2}}}}}\right\}$ . After calculating the matrix inverse and performing matrix multiplication, we can finally verify $\varepsilon_{\rm{B}}^{2}=\varepsilon_{\rm{A}}^{2}$ . This completes the proof.

VIII-D Proof of Theorem 3

Thanks to ${\widehat{\bf{h}}_{\rm{B,L}}}={{\bf{h}}_{\rm{B,L}}}-\varepsilon_{{\rm{B}}}{\bf{h}}$ , the measure $f\left({{{\widehat{\bf{h}}}_{{\rm{B}},{\rm{L}}}}}\right)$ can be expressed as the equation satisfying ${f\left({{{\widehat{\bf{h}}}_{{\rm{B}},{\rm{L}}}}}\right)=\left({{{\bf{h}}_{{\rm{B}},{\rm{L}}}}-{\varepsilon_{\rm{B}}}{\bf{h}}}\right)\left({{{\overline{\bf{R}}}_{1}}\otimes{{\overline{\bf{R}}}_{\rm{F}}}}\right){{\left({{{\bf{h}}_{{\rm{B}},{\rm{L}}}}-{\varepsilon_{\rm{B}}}{\bf{h}}}\right)}^{\rm{H}}}}$ . This equation can be expanded into $f\left({{{\widehat{\bf{h}}}_{{\rm{B}},{\rm{L}}}}}\right)={f_{1}}-2{f_{2}}+{f_{3}}$ with ${f_{1}}={{\bf{h}}_{{\rm{B}},{\rm{L}}}}\left({{{\overline{\bf{R}}}_{1}}\otimes{{\overline{\bf{R}}}_{\rm{F}}}}\right){\bf{h}}_{{\rm{B}},{\rm{L}}}^{\rm{H}}$ , ${f_{2}}={\varepsilon_{\rm{B}}}{{\bf{h}}_{{\rm{B}},{\rm{L}}}}\left({{{\overline{\bf{R}}}_{1}}\otimes{{\overline{\bf{R}}}_{\rm{F}}}}\right){\bf{h}}$ and ${f_{3}}=\varepsilon_{\rm{B}}^{2}{\bf{h}}\left({{{\overline{\bf{R}}}_{1}}\otimes{{\overline{\bf{R}}}_{\rm{F}}}}\right){\bf{h}}$ . By using the Lemma B.26 in [33] for each term, we can have $\frac{{f\left({{{\widehat{\bf{h}}}_{{\rm{B}},{\rm{L}}}}}\right)}}{{{N_{\rm{T}}}s}}\xlongrightarrow[{N_{\rm T}}\to\infty]{\rm{a.s.}}\frac{{{\rho_{1}}L{\rm{+}}\varepsilon_{\rm{B}}^{2}{\rm{Tr}}\left({{{\overline{\bf{R}}}_{1}}\otimes{{\overline{\bf{R}}}_{\rm{F}}}}\right)}}{{{N_{\rm{T}}}s}}$ . In the same way, we can obtain the relationship $\frac{{f\left({{{\widehat{\bf{h}}}_{{\rm{A}},{\rm{L}}}}}\right)}}{{{N_{\rm{T}}}s}}\xlongrightarrow[{N_{\rm T}}\to\infty]{\rm{a.s.}}\frac{{L{\rm{Tr}}\left({{{\bf{R}}_{2}}{{\overline{\bf{R}}}_{1}}}\right){\rm{+}}\varepsilon_{\rm{A}}^{2}{\rm{Tr}}\left({{{\overline{\bf{R}}}_{1}}\otimes{{\overline{\bf{R}}}_{\rm{F}}}}\right)}}{{{N_{\rm{T}}}s}}$ . As indicated in Proposition 3, there exists $\varepsilon_{\rm{B}}^{2}=\varepsilon_{\rm{A}}^{2}$ . By comparing the two simplified results of $f\left({{{\widehat{\bf{h}}}_{{\rm{B}},{\rm{L}}}}}\right)$ and $f\left({{{\widehat{\bf{h}}}_{{\rm{A}},{\rm{L}}}}}\right)$ , we can complete the proof.

VIII-E Proof of Theorem 4

First, we will prove $\sum\limits_{j=1}^{a}{\frac{{{\lambda_{2,{i_{j}}}}}}{{{\lambda_{1,{i_{j}}}}}}}=a$ . As shown in [22], the empirical CDF of eigenvalues of ${\bf{R}}_{i}$ can be asymptotically approximated by the samples from $\left\{{{S_{i}}\left({\left[{{n\mathord{\left/{\vphantom{n{{N_{\rm{T}}}}}}\right.\kern-1.2pt}{{N_{\rm{T}}}}}}\right]}\right),n=0,\ldots,{N_{\rm{T}}}-1}\right\}$ . Therefore, the eigenvalues of different individuals, if overlapping at the same location, e.g., $n$ , can be approximated with the same eigenvalue. In this case, the ratio of two eigenvalues at the same location is one and therefore, we can prove $\sum\limits_{j=1}^{a}{\frac{{{\lambda_{2,{i_{j}}}}}}{{{\lambda_{1,{i_{j}}}}}}}=a$ for $a$ overlapping positions. Then we prove that there must $a<{\rho_{1}}$ . Examining $\left[{{\theta_{2}}-{\Delta},{\theta_{2}}+{\Delta}}\right]$ and $\left[{{\theta_{1}}-{\Delta},{\theta_{1}}+{\Delta}}\right]$ , we found that if ${\theta_{1}}\neq{\theta_{2}}$ is satisfied, there must exist $a<{\rho_{1}}$ since $\left[{{\theta_{2}}-{\Delta},{\theta_{2}}+{\Delta}}\right]$ must have non-empty intersection with $\left[{{\theta_{1}}-{\Delta},{\theta_{1}}+{\Delta}}\right]$ . In this case, the number of elements in ${{\cal S}_{3}}$ is reduced to be smaller than that ${\rho_{1}}$ . Now we turn to the case ${\theta_{1}}={\theta_{2}}$ in which we easily have ${{\bf{R}}_{1}}={{\bf{R}}_{2}}$ and therefore the theorem is proved.

VIII-F Proof of Theorem 6

Let us determine the value of minimum of $w$ . From Eq. (23), we know that there exists $w\geq{s^{*}}$ and $w\geq\frac{{{s^{*}}}}{{{s^{*}}+1}}\left({{N_{\rm{B}}}+1}\right)$ . Since ${N_{\rm{B}}}\geq{s^{*}}$ , we can acquire $w=\frac{{{s^{*}}}}{{{s^{*}}+1}}\left({{N_{\rm{B}}}+1}\right)$ as the minimum of $w$ . Note that it satisfies $w\geq\frac{{{N_{\rm B}}+1}}{2}$ for $s^{*}>1$ . In this case, the value of $C$ will decrease with the increase of $w$ . Thus the maximum code rate, i.e. maximum security, can be achieved at this weight. Moreover, according to the Theorem 1, we can know there exists $w=\frac{{{N_{\rm{B}}}+s}}{2}$ for an ICC- $\left({N_{\rm B},s}\right)$ code and therefore we can derive the relationship between $s$ and $s^{*}$ . The theorem is finally proved.

Bibliography33

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] T. E. Bogale and L. B. Le, “Massive MIMO and mm Wave for 5G wireless Het Net: Potentials and challenges," IEEE Veh. Technol. Mag. , vol. 11, no. 1, pp. 64-75, Feb. 2016.
2[2] Q. Yan, H. Zeng, T. Jiang, M. Li, W. Lou, and Y. T. Hou, “Jamming resilient communication using MIMO interference cancellation," IEEE Trans. Inf. Forensics Security , vol. 11, no. 7, pp. 1486-1499, Jul. 2016.
3[3] H. Rahbari, M. Krunz, and L. Lazos, “Swift jamming attack on frequency offset estimation: The Achilles Heel of OFDM systems," IEEE Trans. Mobile Comput. , vol. 15, no. 5, pp. 1264-1278, May 2016.
4[4] C. Shahriar, M. La Pan, M. Lichtman, T. C. Clancy, R. Mc Gwier, R. Tandon, S. Sodagari, and J. H. Reed, “PHY-Layer resiliency in OFDM communications: A tutorial," IEEE Commun. Surveys Tuts. , vol. 17, no. 1, pp. 292-314, Aug. 2015.
5[5] H. Pirzadeh, S. M. Razavizadeh, and E. Bjornson, “Subverting massive MIMO by smart jamming," IEEE Wireless Commun. Lett. , vol. 5, no. 1, pp. 20-23, Feb. 2016.
6[6] M. Lichtman, J. D. Poston, S. Amuru, C. Shahriar, T. C. Clancy, R. M. Buehrer, and J. H. Reed, “A communications jamming taxonomy," IEEE Security Privacy , vol. 14, no. 1, pp. 47-54, Jan. 2016.
7[7] M. Lichtman, R. P. Jover, M. Labib, R. Rao, V. Marojevic, and J. H. Reed, “LTE/LTE-A jamming, spoofing, and sniffing: Threat assessment and mitigation," IEEE Commun. Mag. , vol. 54, no. 4, pp. 54-61, Apr. 2016.
8[8] T. C. Clancy, “Efficient OFDM denial: Pilot jamming and pilot nulling," in Proc. IEEE Int. Conf. Commun. (ICC) , June 2011, pp. 1-5.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Independence-Checking Coding for OFDM Channel Training Authentication: Protocol Design, Security, Stability, and Tradeoff Analysis

Abstract

Index Terms:

I Introduction

I-A Related Works

I-B Motivations and Contributions

Problem 1** (Attack Model).**

Problem 2**.**

II Overview of Pilot-Aware Attack on Massive-Antenna OFDM Systems

II-A System Description

Assumption 1**.**

Assumption 2**.**

II-B Receiving Signal Model

Assumption 3** (Frequency-domain PIP).**

II-C * Channel Estimation Model*

II-D Influence of Pilot Randomization on Pilot-Aware Attack

Assumption 4** (Time-domain PIP).**

III ICC-CTA Protocol

III-A Pilot Conveying on Code-Frequency Domain

III-A1 Construction of Code Frequency Domain

Definition 1** (SSE Principle).**

III-A2 Binary Codebook Matrix

Definition 2**.**

III-B Pilot Separation and Identification Via ICC

III-B1 Encoding Principle

Definition 3**.**

Remark 1**.**

Theorem 1**.**

Proof.

Proposition 1**.**

III-B2 Decoding Procedure

Proposition 2**.**

Theorem 2**.**

Proof.

IV FS Channel Estimation and Security Enhancement

Question 1**.**

Question 2**.**

IV-A FS Channel Estimation

Fact 1**.**

Remark 2**.**

Proposition 3**.**

Proof:

Remark 3**.**

IV-B Security Enhancement: Exploiting Spatial Correlation

IV-B1 Angular Domain Identification

Theorem 3**.**

Proof:

Theorem 4**.**

Proof:

IV-B2 Combine Angular Domain with Code Domain to Enhance Security

Theorem 5**.**

Remark 4**.**

V Security-Instability Tradeoff in CIR Estimation

V-A Essence of Unstable CIR Estimation: Random SSAPs

Definition 4** (CS).**

V-B *Security-Instability Tradeoff *

Definition 5**.**

Fact 2**.**

Definition 6**.**

V-B1 Low-NBN_{\rm B}NB​ scenario

Fact 3** (A Realistic Tradeoff).**

Remark 5**.**

V-B2 High-NBN_{\rm B}NB​ scenario and Optimally-Stable Tradeoff

Proposition 4**.**

Theorem 6**.**

Proof.

VI Numerical Results

VI-A * Numerical Verification for Theorem 4*

VI-B Security-Instability Tradeoff Curve

VI-C * Security Under Optimally-Stable Tradeoff*

VI-D * Code Rate Under Optimally-Stable Tradeoff*

VI-E CIR Estimation Under Optimally-Stable Tradeoff

VII Conclusions

VIII Appendix

Problem 1 (Attack Model).

Problem 2.

Assumption 1.

Assumption 2.

Assumption 3 (Frequency-domain PIP).

Assumption 4 (Time-domain PIP).

Definition 1 (SSE Principle).

Definition 2.

Definition 3.

Remark 1.

Theorem 1.

Proposition 1.

Proposition 2.

Theorem 2.

Question 1.

Question 2.

Fact 1.

Remark 2.

Proposition 3.

Remark 3.

Theorem 3.

Theorem 4.

Theorem 5.

Remark 4.

Definition 4 (CS).

V-B Security-Instability Tradeoff

Definition 5.

Fact 2.

Definition 6.

V-B1 Low- $N_{\rm B}$ scenario

Fact 3 (A Realistic Tradeoff).

Remark 5.

V-B2 High- $N_{\rm B}$ scenario and Optimally-Stable Tradeoff

Proposition 4.

Theorem 6.