On Distance Properties of Convolutional Polar Codes

Ruslan Morozov; Peter Trifonov

arXiv:1901.06341·cs.IT·July 2, 2020

On Distance Properties of Convolutional Polar Codes

Ruslan Morozov, Peter Trifonov

PDF

TL;DR

This paper establishes a lower bound on the minimum distance of convolutional polar codes, introduces a new subcode construction with improved decoding performance, and compares its decoding complexity and error probability favorably to Arikan polar codes.

Contribution

It provides a novel lower bound on the minimum distance and proposes convolutional polar subcodes with enhanced decoding efficiency and error performance.

Findings

01

Lower bound on minimum distance derived

02

Convolutional polar subcodes outperform Arikan polar subcodes in decoding error probability

03

Decoding complexity of convolutional polar subcodes is lower for large list sizes

Abstract

A lower bound on minimum distance of convolutional polar codes is provided. The bound is obtained from the minimum weight of generalized cosets of the codes generated by bottom rows of the polarizing matrix. Moreover, a construction of convolutional polar subcodes is proposed, which provides improved performance under successive cancellation list decoding. For sufficiently large list size, the decoding complexity of convolutional polar subcodes appears to be lower compared to Arikan polar subcodes with the same performance. The error probability of successive cancellation list decoding of convolutional polar subcodes is lower than that of Arikan polar subcodes with the same list size.

Equations80

\displaystyle\left\{u_{0}^{n-1}G^{(n)}\big{|}u_{\mathcal{I}}\in\mathbb{F}^{k},u_{\mathcal{F}}=\mathbf{0}\right\},\mathcal{I}\subseteq[n],|\mathcal{I}|=k,

\displaystyle\left\{u_{0}^{n-1}G^{(n)}\big{|}u_{\mathcal{I}}\in\mathbb{F}^{k},u_{\mathcal{F}}=\mathbf{0}\right\},\mathcal{I}\subseteq[n],|\mathcal{I}|=k,

W_{n}^{(φ)} (u_{0}^{φ} ∣ y_{0}^{n - 1}) = u_{φ + 1}^{n - 1} \in F^{n - φ - 1} \sum W^{n} (u_{0}^{n - 1} G^{(n)} ∣ y_{0}^{n - 1}),

W_{n}^{(φ)} (u_{0}^{φ} ∣ y_{0}^{n - 1}) = u_{φ + 1}^{n - 1} \in F^{n - φ - 1} \sum W^{n} (u_{0}^{n - 1} G^{(n)} ∣ y_{0}^{n - 1}),

\overset{u}{^}_{φ} = ⎩ ⎨ ⎧ 0, ar g u_{φ} \in F max W_{n}^{(φ)} (\overset{u}{^}_{0}^{φ - 1}, u_{φ} ∣ y_{0}^{n - 1}), φ \in F φ \in / F .

\overset{u}{^}_{φ} = ⎩ ⎨ ⎧ 0, ar g u_{φ} \in F max W_{n}^{(φ)} (\overset{u}{^}_{0}^{φ - 1}, u_{φ} ∣ y_{0}^{n - 1}), φ \in F φ \in / F .

Q^{(n)} = (X^{(n)} Q^{(n /2)}, Z^{(n)} Q^{(n /2)}),

Q^{(n)} = (X^{(n)} Q^{(n /2)}, Z^{(n)} Q^{(n /2)}),

X_{i, j}^{(l)} = {1, 0, if 2 j \leq i \leq 2 j + 2 otherwise

X_{i, j}^{(l)} = {1, 0, if 2 j \leq i \leq 2 j + 2 otherwise

Z_{i, j}^{(l)} = {1, 0, if 2 j < i \leq 2 j + 2 otherwise

\displaystyle W^{(2\psi)}_{n}(u_{0}^{2\psi}|y)=\sum_{w}W^{(\psi)}_{n/2}\left((u_{0}^{2\psi},w)X^{(2\psi+2)}\big{|}y^{\prime}\right)

\displaystyle W^{(2\psi)}_{n}(u_{0}^{2\psi}|y)=\sum_{w}W^{(\psi)}_{n/2}\left((u_{0}^{2\psi},w)X^{(2\psi+2)}\big{|}y^{\prime}\right)

\times W_{n /2}^{(ψ)} ((u_{0}^{2 ψ}, w) Z^{(2 ψ + 2)} ∣ y^{''})

\displaystyle W^{(2\psi+1)}_{n}(u_{0}^{2\psi+1}|y)=\!\!\!\sum_{u_{2\psi+2},w}\!\!W^{(\psi+1)}_{n/2}\!\!\left((u_{0}^{2\psi+2},w)X^{(2\psi+4)}\big{|}y^{\prime}\right)

\displaystyle\times W^{(\psi+1)}_{n/2}\!\left((u_{0}^{2\psi+2},w)Z^{(2\psi+4)}\big{|}y^{\prime\prime}\right)

\displaystyle W^{(n-1)}_{n}(u_{0}^{n-1}|y)=W^{(n/2-1)}_{n/2}\left(u_{0}^{n-1}X^{(n)}\big{|}y^{\prime}\right)

\displaystyle\times W^{(n/2-1)}_{n/2}\left(u_{0}^{n-1}Z^{(n)}\big{|}y^{\prime\prime}\right)

⟨ b^{(0)}, \dots, b^{(l - 1)} ⟩ = {i = 0 \sum l - 1 a_{i} b^{(i)} ∣ a_{0}^{l - 1} \in F^{l}} .

⟨ b^{(0)}, \dots, b^{(l - 1)} ⟩ = {i = 0 \sum l - 1 a_{i} b^{(i)} ∣ a_{0}^{l - 1} \in F^{l}} .

C_{n}^{(φ)} (p) = {u_{0}^{n - 1} G^{(n)} ∣ u_{0}^{φ - 1} = 0 \land p \vbox \scalebox 0.5 ∙ u_{φ}^{φ + j - 1} = 1},

C_{n}^{(φ)} (p) = {u_{0}^{n - 1} G^{(n)} ∣ u_{0}^{φ - 1} = 0 \land p \vbox \scalebox 0.5 ∙ u_{φ}^{φ + j - 1} = 1},

d_{n}^{(φ)} = c \in C_{n}^{(φ)} (1) min wt (c) .

d_{n}^{(φ)} = c \in C_{n}^{(φ)} (1) min wt (c) .

d \geq φ \in I min d_{n}^{(φ)} .

d \geq φ \in I min d_{n}^{(φ)} .

\displaystyle\mathcal{U}=\left\{u_{\varphi}^{n-1}+a_{0}^{k-1}\big{|}a_{0}^{k-1}\in\operatorname{cs}^{\perp}(\hat{G})\right\},

\displaystyle\mathcal{U}=\left\{u_{\varphi}^{n-1}+a_{0}^{k-1}\big{|}a_{0}^{k-1}\in\operatorname{cs}^{\perp}(\hat{G})\right\},

\displaystyle\left\{p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$\displaystyle\bullet$}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$\textstyle\bullet$}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$\scriptstyle\bullet$}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$\scriptscriptstyle\bullet$}}}}}(u_{\varphi}^{n-1}+a_{0}^{k-1})\;\big{|}\;a_{0}^{k-1}\in\operatorname{cs}^{\perp}(\hat{G})\right\}.

\displaystyle\left\{p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$\displaystyle\bullet$}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$\textstyle\bullet$}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$\scriptstyle\bullet$}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$\scriptscriptstyle\bullet$}}}}}(u_{\varphi}^{n-1}+a_{0}^{k-1})\;\big{|}\;a_{0}^{k-1}\in\operatorname{cs}^{\perp}(\hat{G})\right\}.

χ_{n}^{(φ, j)} (E)

χ_{n}^{(φ, j)} (E)

ξ_{n}^{(φ, j)} (s)

\displaystyle\chi_{n}^{(\varphi,k+h)}(\mathcal{E})=\left\{(p,q)\;\big{|}\;p\in\chi_{n}^{(\varphi,k)}(\mathcal{E}),q\in\mathbb{F}^{h}\right\}.

\displaystyle\chi_{n}^{(\varphi,k+h)}(\mathcal{E})=\left\{(p,q)\;\big{|}\;p\in\chi_{n}^{(\varphi,k)}(\mathcal{E}),q\in\mathbb{F}^{h}\right\}.

ξ_{2}^{(0, 2)} (⟨ 01 ⟩) = {{0}}, ξ_{2}^{(0, 2)} (⟨ 10 ⟩) = \emptyset, ξ_{2}^{(0, 2)} (⟨ 11 ⟩) = {{1}},

ξ_{2}^{(0, 2)} (⟨ 01 ⟩) = {{0}}, ξ_{2}^{(0, 2)} (⟨ 10 ⟩) = \emptyset, ξ_{2}^{(0, 2)} (⟨ 11 ⟩) = {{1}},

ξ_{2}^{(0, 2)} (⟨ ⟩) = {{0, 1}}, ξ_{2}^{(0, 2)} (F^{2}) = {\emptyset} .

χ_{2}^{(0, 2)} (\emptyset) = F^{2}, χ_{2}^{(0, 2)} ({0}) = ⟨ 01 ⟩,

χ_{2}^{(0, 2)} (\emptyset) = F^{2}, χ_{2}^{(0, 2)} ({0}) = ⟨ 01 ⟩,

χ_{2}^{(0, 2)} ({1}) = ⟨ 11 ⟩, χ_{2}^{(0, 2)} ({0, 1}) = ⟨ ⟩ .

δ_{n}^{(φ, j)} (s) = E \in ξ_{n}^{(φ, j)} (s) min ∣ E ∣,

δ_{n}^{(φ, j)} (s) = E \in ξ_{n}^{(φ, j)} (s) min ∣ E ∣,

c \in C_{n}^{(φ)} (p) min wt (c) = s \in S_{j} : p \in / s min δ_{n}^{(φ, j)} (s) .

c \in C_{n}^{(φ)} (p) min wt (c) = s \in S_{j} : p \in / s min δ_{n}^{(φ, j)} (s) .

B

B

= {E ∣ p \in / χ_{n}^{(φ, j)} (E)} .

d^{(\varphi)}_{n}=\min\left\{\delta^{(\varphi,j)}_{n}(s)\big{|}s\in\mathbb{S}_{j}:(1,\mathbf{0}^{j-1})\notin s\right\}.

d^{(\varphi)}_{n}=\min\left\{\delta^{(\varphi,j)}_{n}(s)\big{|}s\in\mathbb{S}_{j}:(1,\mathbf{0}^{j-1})\notin s\right\}.

T_{o} (i, j) =

T_{o} (i, j) =

(p_{0}^{2}, 0, 0)^{T} = X_{\overline{[1]}, *}^{(6)} p^{' T} + Z_{\overline{[1]}, *}^{(6)} p^{'' T}}

T_{e} (i, j) =

(p_{0}^{2}, 0)^{T} = X_{\overline{[2]}, *}^{(6)} p^{' T} + Z_{\overline{[2]}, *}^{(6)} p^{'' T}} .

\displaystyle\Delta^{(2\psi+1)}_{n,l}=\min_{i,j}\!\left\{\Delta^{(\psi)}_{n/2,i}+\Delta^{(\psi)}_{n/2,j}\big{|}\mathbf{T}_{o}(i,j)=\mathcal{T}_{l}\right\},

\displaystyle\Delta^{(2\psi+1)}_{n,l}=\min_{i,j}\!\left\{\Delta^{(\psi)}_{n/2,i}+\Delta^{(\psi)}_{n/2,j}\big{|}\mathbf{T}_{o}(i,j)=\mathcal{T}_{l}\right\},

\displaystyle\Delta^{(2\psi)}_{n,l}=\min_{i,j}\left\{\Delta^{(\psi-1)}_{n/2,i}+\Delta^{(\psi-1)}_{n/2,j}\big{|}\mathbf{T}_{e}(i,j)=\mathcal{T}_{l}\right\}.

δ_{1}^{(0, 1)} (⟨ ⟩) = 1, δ_{1}^{(0, 1)} (⟨ 1 ⟩) = 0.

δ_{1}^{(0, 1)} (⟨ ⟩) = 1, δ_{1}^{(0, 1)} (⟨ 1 ⟩) = 0.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

On Distance Properties of Convolutional Polar Codes

Ruslan Morozov, , Peter Trifonov The authors are with the Saint Petersburg Polytechnic University, Russia. E-mail: {rmorozov, petert}@dcn.icc.spbstu.ru

Abstract

A lower bound on minimum distance of convolutional polar codes is provided. The bound is obtained from the minimum weight of generalized cosets of the codes generated by bottom rows of the polarizing matrix. Moreover, a construction of convolutional polar subcodes is proposed, which provides improved performance under successive cancellation list decoding. For sufficiently large list size, the decoding complexity of convolutional polar subcodes appears to be lower compared to Arikan polar subcodes with the same performance. The error probability of successive cancellation list decoding of convolutional polar subcodes is lower than that of Arikan polar subcodes with the same list size.

Index Terms:

Convolutional polar codes, polar codes, successive cancellation decoding, list decoding, polar subcodes.

I Introduction

In this paper we consider codes that were firstly introduced as branching-MERA codes [1] and then as convolutional polar codes (CvPCs) [2] by A. J. Ferris, C. Hirche and D. Poulin. These codes were shown to provide substantially better performance under successive cancellation (SC) decoding compared to classical polar codes [3]. In [2] both open-boundary and periodic-boundary CvPCs are presented, in this paper by CvPCs we always mean open-boundary CvPCs. In [4] the efficient min-sum implementation of SC decoding is presented for CvPCs, which requires one to perform only comparisons and additions and can be easily extended to the case of SC list (SCL) decoding. Other implementations of SCL decoding for CvPCs are presented in [5, 6].

Classical polar codes provide quite poor performance under SCL decoding due to very low minimum distance, which scales as $O(\sqrt{n})$ [7]. Although the minimum distance of a polar code can be found simply, the problem of computing minimum distance of an arbitrary linear code is NP-complete. However, for moderate-length codes minimum distance can be obtained by method presented in [8].

The generator matrix of a CvPC consists of rows of $n\times n$ non-singular matrix $Q^{(n)}$ , called convolutional polarizing transformation (CvPT). In this paper we derive a tight lower bound on the minimum distance of CvPCs, based on computing the minimum weight of a coset, given by the $i$ -th row of CvPT, of a linear code, generated by the last $n-i-1$ rows of CvPT. The weight enumerator polynomial of such coset can be expressed as $A_{i}(x)-A_{i+1}(x)$ , where $A_{i}(x)$ is a weight spectrum of code generated by the last $n-i$ rows of matrix $Q^{(n)}$ . In the case of polar codes, an efficient method for approximate enumerator evaluation is available [9]. However, for convolutional polar codes there are no methods for evaluation of coset enumerator.

The minimum distance of CvPCs appears to be of the same order as in the case of classical polar codes. However, by generalizing the construction of randomized polar subcodes [10] to the case of CvPC, we obtain convolutional polar subcodes (CvPSs) with reduced error coefficient, which provide superior performance under SCL decoding, compared to polar subcodes.

The paper is organized as follows. In Section II we introduce representation of linear block codes, which is natural for the cases of Arikan and convolutional polar codes. The concepts of generalized cosets and recoverable vectors are introduced in Section III and are used to obtain a lower bound on the minimum distance of linear block codes. An efficient algorithm for computing the lower bound in the case of CvPC is provided in Section IV. This algorithm is aimed to explore some properties of low-weight codewords of CvPC. These properties are used for a construction of convolutional polar subcodes, which is proposed in Section V. The performance of the proposed code construction is presented in Section VI.

II Background

II-A Notations

The following notations are used throughout the paper. $\mathbb{F}$ denotes the Galois field of two elements. For integer $n$ we denote $[n]=\{0,1,\ldots n-1\}$ . For vector $a$ symbol $a_{b}^{c}=(a_{b},a_{b+1},\ldots,a_{c})$ . For two vectors $a$ and $b$ we denote their concatenation by $(a,b)$ . For $m\times\ n$ matrix $A$ and sets $\mathcal{X}\subseteq[m]$ , $\mathcal{Y}\subseteq[n],$ by $A_{\mathcal{X},\mathcal{Y}}$ we denote the submatrix of $A$ with rows with indices from set $\mathcal{X}$ and columns with indices from set $\mathcal{Y}$ , indexing of rows and columns starts with zero. Similar notations are applied to vectors as well. If $\mathcal{X}=*$ or $\mathcal{Y}=*$ , this means that all rows or all columns of the original matrix are in the submatrix. Furthermore, $A_{\overline{\mathcal{X}},\overline{\mathcal{Y}}}$ denotes submatrix of $A$ consisting of rows and columns with indices that are not in $\mathcal{X}$ and $\mathcal{Y}$ , respectively. The vector of $i$ zeroes is denoted by $\mathbf{0}^{i}$ , or just by $\mathbf{0}$ if $i$ is clear from the context.

II-B A Representation of a Linear Block Code and Successive Cancellation Decoding

Consider binary linear block code in the form

[TABLE]

where $G^{(n)}$ is an $n\times n$ non-singular binary matrix, $\mathcal{I}$ is called information set and $\mathcal{F}=[n]\setminus\mathcal{I}$ is called frozen set. The generator matrix of such code is $G^{(n)}_{\mathcal{I},*}$ . Note that any $(n,k)$ linear code with generator matrix $G$ can be expressed as in (1) with $G^{(n)}$ , such that $G=G^{(n)}_{\mathcal{I},*}$ for some $\mathcal{I}\subseteq[n]$ . For example, classical polar codes [3] have $G^{(n)}=F^{\otimes m}$ for $n=2^{m}$ .

For such code representation, the successive cancellation (SC) decoding method can be defined. Consider transmission of codeword $c_{0}^{n-1}=u_{0}^{n-1}G^{(n)}$ through binary-input memoryless channel $\mathcal{W}:\mathbb{F}\to\mathcal{Y}$ . Let $y_{0}^{n-1}$ be the output of this channel. After demodulation, the probabilities $W(c_{i}|y_{i})=\mathcal{W}(y_{i}|c_{i})/\left(\mathcal{W}(y_{i}|0)+\mathcal{W}(y_{i}|1)\right)$ for $c_{i}\in\mathbb{F}$ are provided to the decoding algorithm. Given the prior hard decisions $\hat{u}_{0}\ldots\hat{u}_{\varphi-1}$ , at phase $\varphi$ the SC decoding algorithm calculates probabilities $W^{(\varphi)}_{n}(\hat{u}_{0}^{\varphi-1},u_{\varphi}|y_{0}^{n-1})$ , defined as

[TABLE]

where $W^{n}(c_{0}^{n-1}|y_{0}^{n-1})=\prod_{i=0}^{n-1}W(c_{i}|y_{i})$ . The channels $W^{(\varphi)}_{n}:\mathcal{Y}\to\mathbb{F}^{\varphi+1}$ are called bit subchannels. Then, the hard decision on $u_{\varphi}$ is made by

[TABLE]

The SC decoding can be defined for any linear code, if an efficient method for computing $W^{(\varphi)}_{n}(u_{0}^{\varphi}|y_{0}^{n-1})$ is available. However, SC decoding can provide reasonable performance only for codes with $G^{(n)}$ , such that the capacities of bit subchannels $W^{(\varphi)}_{n}$ polarize, i.e. converge to [math] or $1$ with $n\to\infty$ .

II-C Convolutional Polar Codes

Convolutional polar codes [2] (CvPCs) are a family of linear block codes, for which $G^{(n)}$ , $n=2^{m}$ , is equal to the matrix of convolutional polarizing transformation (CvPT) $Q^{(n)}$ , such that

[TABLE]

where $Q^{(1)}=(1)$ , $X^{(l)}$ and $Z^{(l)}$ are $l\times l/2$ matrices, defined for even $l$ as

[TABLE]

For example, $X^{(4)}=\begin{pmatrix}1110\\ 0011\end{pmatrix}^{T}$ , $Z^{(4)}=\begin{pmatrix}0110\\ 0001\end{pmatrix}^{T}$ . Expansion (3) corresponds to one layer of CvPT. In Fig. 1, the $m$ -th layer of CvPT is a mapping of vector $u_{0}^{n-1}$ to vectors $x_{0}^{n/2-1}=u_{0}^{n-1}X^{(n)}$ and $z_{0}^{n/2-1}=u_{0}^{n-1}Z^{(n)}$ .

It is shown in [4] that for $n=2^{m}$ , $\varphi\in[n]$ , the value of $W^{(\varphi)}_{n}(u_{0}^{\varphi}|y_{0}^{n-1})$ for CvPT can be recursively computed as

[TABLE]

for $0\leq\psi<n/2-1$ , where $y=y_{0}^{n-1}$ , and $y^{\prime}=y_{0}^{n/2-1}$ , $y^{\prime\prime}=y_{n/2}^{n-1}$ are subvectors of $y$ . These formulae are the same as in [4] under permutation of the output vector $y$ by the bit-reversal permutation, which is omitted from the definition (3) of CvPT for the sake of simplicity.

III A Lower Bound on The Minimum Distance of Linear Codes

III-A Basic Definitions

Let $\mathbb{S}_{n}$ be the set of all linear subspaces of $\mathbb{F}^{n}$ .

Denote $a_{0}^{l-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}b_{0}^{l-1}=\sum_{i=0}^{l-1}a_{i}b_{i}$ , where $a_{i},b_{i}\in\mathbb{F}$ . For vectors $b^{(0)},\dots,b^{(l-1)}\in\mathbb{F}^{t}$ , denote by $\langle b^{(0)},\dots,b^{(l-1)}\rangle$ the linear subspace of $\mathbb{F}^{t}$ with basis vectors $b^{(i)}$ , i.e.

[TABLE]

A sum over an empty set is assumed to be equal to zero, which implies $\left\langle{}\right\rangle=\left\{\mathbf{0}^{t}\right\}$ , where $t$ is clear from the context. By abuse of notation, we write $x_{0}x_{1}\dots x_{t-1}$ for $x_{i}\in\mathbb{F}$ to denote a vector $(x_{0},x_{1},\dots,x_{t-1})\in\mathbb{F}^{t}$ .

Example 1.

It can be seen that $\mathbb{S}_{2}=\left\{\left\langle{}\right\rangle,\left\langle{10}\right\rangle,\left\langle{01}\right\rangle,\left\langle{11}\right\rangle,\left\langle{10,01}\right\rangle\right\}$ , and $|\mathbb{S}_{3}|=16$ .

III-B Outline of the Approach

Consider a code $\mathcal{P}$ in the form (1) with $\mathcal{F}=[\varphi]$ , i.e. the set of vectors $(\mathbf{0}^{\varphi},u_{\varphi}^{n-1})G^{(n)}$ . Code $\mathcal{P}$ can be split in two sets corresponding to each value of $u_{\varphi}$ . Namely, $\mathcal{P}=\mathcal{P}_{0}\cup\mathcal{P}_{1}$ , where $\mathcal{P}_{a}$ consists of all codewords of the form $(\mathbf{0}^{\varphi},a,u_{\varphi+1}^{n-1})G^{(n)}$ . These subsets are equal to the subsets, which probabilities are computed at the $\varphi$ -th phase of the SC decoding algorithm by (2), provided that the estimated symbols $\hat{u}_{0}^{\varphi-1}$ are zero. Since we are interested in distance properties of the code, we can assume that $\hat{u}_{0}^{\varphi-1}=\mathbf{0}^{\varphi}$ .

Let $d_{n}^{(\varphi)}$ be the distance between $\mathcal{P}_{0}$ and $\mathcal{P}_{1}$ , i.e. $d_{n}^{(\varphi)}=\min_{\dot{c}\in\mathcal{P}_{0},\ddot{c}\in\mathcal{P}_{1}}\mathbf{wt}(\dot{c}+\ddot{c})$ . Consider $\dot{c}$ and $\ddot{c}$ , for such the minimum is achieved, i.e., $\dot{c}=(\mathbf{0}^{\varphi},0,\dot{u}_{\varphi+1}^{n-1})G^{(n)}$ , $\ddot{c}=(\mathbf{0}^{\varphi},1,\ddot{u}_{\varphi+1}^{n-1})G^{(n)}$ , such that $d_{n}^{(\varphi)}=\mathbf{wt}(\dot{c}+\ddot{c})=\mathbf{wt}(\tilde{c})$ . Note that $\tilde{c}=(\mathbf{0}^{\varphi},1,\dot{u}_{\varphi+1}^{n-1}+\ddot{u}_{\varphi+1}^{n-1})G^{(n)}$ corresponds to value $u_{\varphi}=1$ , so $\tilde{c}\in\mathcal{P}_{1}$ . Hence, $d^{(\varphi)}_{n}$ is equal to the weight of a minimum-weight codeword from $\mathcal{P}_{1}$ . In general, we can say that if $\hat{u}_{0}^{\varphi-1}=u_{0}^{\varphi-1}$ , i.e., all previous symbols are estimated correctly, then the probability of erroneous estimation of $u_{\varphi}$ in the case of transmission over sufficiently good binary memoryless channel is mainly defined by $d^{(\varphi)}_{n}=\min_{c\in\mathcal{P}_{1}}\mathbf{wt}(c)$ .

In section III-C we consider the partition of $\mathcal{P}$ in two sets $\mathcal{P}^{\prime}_{0}$ and $\mathcal{P}^{\prime}_{1}$ not by the value of $u_{\varphi}$ , but by the value of some linear combination $p_{0}^{j-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{\varphi}^{\varphi+j-1}$ of symbols $u_{\varphi}^{\varphi+j-1}$ . Thus, set $\mathcal{P}^{\prime}_{a}$ , $a\in\mathbb{F}$ consists of all codewords $(\mathbf{0}^{\varphi},u_{\varphi}^{n-1})G^{(n)}$ satisfying $p_{0}^{j-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{\varphi}^{\varphi+j-1}=a$ .

In section III-D we consider transmission of codewords through binary erasure channel (BEC) $W:\mathbb{F}\to\mathbb{F}\cup\left\{\epsilon\right\}$ , defined as $W(x|x)=1-p_{\epsilon}$ , $W(\epsilon|x)=p_{\epsilon}$ , where $p_{\epsilon}$ is the erasure probability. We consider mapping of the set of erased symbols $\mathcal{E}\subseteq[n]$ to the set of all linear combinations of symbols $u_{\varphi}^{\varphi+j-1}$ , which can be recovered by the receiver by given $c_{\overline{\mathcal{E}}}=(c_{i})_{i\notin\mathcal{E}}$ . Thus, we consider a set $s\subseteq\ \mathbb{F}^{j}$ of all vectors $p_{0}^{j-1}\in\mathbb{F}^{j}$ , such that the value of corresponding linear combination $p_{0}^{j-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{\varphi}^{\varphi+j-1}$ can be recovered by receiver after erasure configuration $\mathcal{E}$ . It appears that $s\in\mathbb{S}_{j}$ , i.e. $s$ is a linear subspace of $\mathbb{F}^{j}$ .

In section III-E, we prove that the minimum weight of vector from $\mathcal{P}^{\prime}_{1}$ (i.e., the distance between $\mathcal{P}^{\prime}_{0}$ and $\mathcal{P}^{\prime}_{1}$ ) is equal to the minimum number of erasures, such that corresponding subspace $s\in\mathbb{S}_{j}$ of coefficients of recoverable linear combinations does not include the linear combination with coefficients $p_{0}^{j-1}$ .

These results are combined to derive the algorithm for computing $d_{n}^{(\varphi)}$ in the case of CvPC, which leads to the lower bound on minimum distance of CvPC and the construction of CvPS. Furthermore, we believe that the introduced concepts and their properties can be used for other $G^{(n)}$ that have recursive structure.

III-C Minimum Weight of Cosets and the Minimum Distance

Definition 1.

Given an $n\times n$ non-singular matrix $G^{(n)}$ , for a vector $p\in\mathbb{F}^{j}$ define a generalized coset $\mathcal{C}^{(\varphi)}_{n}(p)$ as

[TABLE]

Remark 1.

In the case of $j>n-\varphi$ , we assume in (9) that $u_{l}=0$ for $l\geq n$ .

We define the weight of the $\varphi$ -th bit subchannel $W_{n}^{(\varphi)}$ as

[TABLE]

Observe that for all $j>0$ one has $\mathcal{C}^{(\varphi)}_{n}(p)=\mathcal{C}^{(\varphi)}_{n}(p,\mathbf{0}^{j})$ , which implies $d^{(\varphi)}_{n}=\displaystyle\min_{c\in\mathcal{C}^{(\varphi)}_{n}(1,\mathbf{0}^{j})}\mathbf{wt}(c)$ .

Lemma 1.

If a linear code with minimum distance $d$ is generated by rows of $G^{(n)}$ with indices from $\mathcal{I}\subseteq[n]$ , then

[TABLE]

Proof.

Consider the minimum-weight codeword $c_{0}^{n-1}=u_{0}^{n-1}G^{(n)}$ , $\mathbf{wt}(c_{0}^{n-1})=d$ . Let $\psi$ be the first position of non-zero element in $u_{0}^{n-1}$ . Thus, $\psi\in\mathcal{I}$ , $u_{\psi}=1$ , $u_{0}^{\psi-1}=\mathbf{0}$ , which implies $c_{0}^{n-1}\in\mathcal{C}^{(\psi)}_{n}(1)$ and $d=\mathbf{wt}(c_{0}^{n-1})\geq d^{(\psi)}_{n}\geq\min_{\varphi\in\mathcal{I}}d^{(\varphi)}_{n}$ . ∎

This bound is valid for any linear block code represented in the form of (1). However, the evaluation of $d^{(\varphi)}_{n}$ is not a simple problem for an arbitrary $G^{(n)}$ .

III-D Recoverable and erased vectors

Consider transmission of a codeword $c_{0}^{n-1}=u_{0}^{n-1}G^{(n)}$ of a code with frozen set $\mathcal{F}=[\varphi]$ , $u_{0}^{\varphi-1}=\mathbf{0}$ and dimension $k=n-\varphi$ over BEC.

The set of erased positions $\mathcal{E}\subseteq[n]$ is called an erasure configuration. When erasure configuration $\mathcal{E}$ occurs, the values $c_{\overline{\mathcal{E}}}=u_{\varphi}^{n-1}\hat{G}$ are available for the receiver, where $\hat{G}=G^{(n)}_{\overline{[\varphi]},\overline{\mathcal{E}}}$ is $k\times r$ submatrix of $G^{(n)}$ without rows from $[\varphi]$ and without columns from $\mathcal{E}$ , $r=n-|\mathcal{E}|$ . Denote by $\mathcal{U}$ the set of all $\hat{u}_{\varphi}^{n-1}$ such that $\hat{u}_{\varphi}^{n-1}\hat{G}=c_{\overline{\mathcal{E}}}$ . One can see that

[TABLE]

where for set of vectors $\mathcal{A}\subseteq\mathbb{F}^{t}$ , by $\mathcal{A}^{\perp}\subseteq\mathbb{F}^{t}$ we denote the set of vectors $x_{0}^{t-1}:\forall y_{0}^{t-1}\in\mathcal{A}:x_{0}^{t-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}y_{0}^{t-1}=0$ , and $\operatorname{cs}(A)$ is the column space of matrix $A$ . The value $u_{\varphi}^{n-1}$ can be unambiguously recovered by the receiver after erasure configuration $\mathcal{E}$ iff $|\mathcal{U}|=1$ , i.e. $\mathcal{U}=\left\{u_{\varphi}^{n-1}\right\}$ .

More generally, consider the recoverability of the value of a linear combination $p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{\varphi}^{n-1}$ after erasure configuration $\mathcal{E}$ . The set of values of $p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}\hat{u}_{\varphi}^{n-1}$ for all $\hat{u}_{\varphi}^{n-1}\in\mathcal{U}$ is given by

[TABLE]

We say that vector $p_{0}^{k-1}$ is $(\mathcal{E},\varphi)$ -recoverable, if the corresponding linear combination $p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{\varphi}^{n-1}$ can be recovered unambiguously for given $c_{\overline{\mathcal{E}}}$ , i.e., the set (12) contains only the correct value $p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{\varphi}^{n-1}$ . Expanding the brackets in (12), one can see that $p_{0}^{k-1}$ is $(\mathcal{E},\varphi)$ -recoverable iff $\forall a_{0}^{k-1}\in\operatorname{cs}^{\perp}(\hat{G}):p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}a_{0}^{k-1}=0$ , which leads to $p_{0}^{k-1}\in{\operatorname{cs}^{\perp}}^{\perp}(\hat{G})=\operatorname{cs}(\hat{G})$ . Thus, the set of $(\mathcal{E},\varphi)$ -recoverable vectors is a linear space, which is equal to $\operatorname{cs}(\hat{G})\in\mathbb{S}_{k}$ .

Definition 2.

Let $s\in\mathbb{S}_{j}$ be the space of all $p_{0}^{j-1}$ , such that $(p_{0}^{j-1},\mathbf{0}^{k-j})$ is $(\mathcal{E},\varphi)$ -recoverable. In this case, $s$ is called a $(\mathcal{E},\varphi,j)$ -space and is denoted by $\chi^{(\varphi,j)}_{n}(\mathcal{E})$ , and $\mathcal{E}$ is called an $(s,\varphi,j)$ -configuration. The set of $(s,\varphi,j)$ -configurations is denoted by $\xi^{(\varphi,j)}_{n}(s)$ . Thus,

[TABLE]

If $\mathcal{A}$ is a set, denote by $2^{\mathcal{A}}$ the set of all subsets of $\mathcal{A}$ . Thus, function $\chi_{n}^{(\varphi,j)}:2^{[n]}\to\mathbb{S}_{j}$ , maps an erasure configuration, which is a subset of $[n]$ , to a linear subspace of $\mathbb{F}^{j}$ , and $\xi^{(\varphi,j)}_{n}$ returns the inverse image of $\chi_{n}^{(\varphi,j)}$ . Note that $\chi_{n}^{(\varphi,j)}$ is not injective, so $\xi^{(\varphi,j)}_{n}:\mathbb{S}_{j}\to 2^{2^{[n]}}$ .

In words, $\chi_{n}^{(\varphi,j)}(\mathcal{E})$ defines the set of vectors $p_{0}^{j-1}$ , for which the value of linear combination $p_{0}^{j-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{\varphi}^{\varphi+j-1}$ can be recovered after erasure configuration $\mathcal{E}$ , provided that $u_{0}^{\varphi-1}=\mathbf{0}$ . Conversely, $\xi^{(\varphi,j)}_{n}(s)$ defines the set of erasure configurations, after which the linear combination $p_{0}^{j-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{\varphi}^{\varphi+j-1}$ can be deduced by the receiver if and only if $p\in s$ .

Remark 2.

Let $j>k$ , i.e. $j=k+h$ for some $h>0$ . In this case, the conditional part of definition (13) is inconsistent. We extend the definition as follows. In Remark 1 we assume that symbols $u_{n+h}$ for $h\geq 0$ are equal to zero. Hence, these symbols are always perfectly known for the receiver, so any $\mathcal{E}$ does not erase any symbol $u_{n+h}$ . Observe that any vector from $\mathbb{F}^{j}\setminus\chi^{(\varphi,j)}_{n}(\mathcal{E})$ must be not $(\mathcal{E},\varphi)$ -recoverable, so for any $\mathcal{E}$ and $q_{0}^{h-1}\in\mathbb{F}^{h}$ , we must include vector $(\mathbf{0}^{k},q_{0}^{h-1})$ in the set $\chi^{(\varphi,k+h)}_{n}(\mathcal{E})$ . This leads to

[TABLE]

Similarly, we assume that $\xi^{(\varphi,k+h)}_{n}(s)=\emptyset$ for all $s$ which do not contain $(\mathbf{0}^{k},q)$ for some $q\in\mathbb{F}^{h}$ .

Example 2.

Consider $(s,0,2)$ -configurations for the case of $n=2$ , $c_{0}^{1}=u_{0}^{1}Q^{(2)}=(u_{0}+u_{1},u_{1})$ . For erasure configuration $\mathcal{E}=\{0\}$ , the only non-zero vector which is $(\mathcal{E},0)$ -recoverable is $p=(0,1)$ . That is, if symbol $c_{0}$ is erased, one can recover unambiguously only $u_{1}=c_{1}$ . This means that $\{0\}\in\xi^{(0,2)}_{2}(\left\langle{01}\right\rangle)$ . All $(s,0,2)$ -configurations are

[TABLE]

That is, there are no erasure configurations, such that only $\langle 10\rangle$ (i.e. symbol $u_{0}$ ) is unambiguously recoverable, and the whole vector $u_{0}^{1}$ can be unambiguously recovered only if there are no erasures. For the same case, the $(\mathcal{E},0,2)$ -spaces are

[TABLE]

Example 3.

Consider the case of $\varphi=2$ , $j=2$ , $n=4$ and $c_{0}^{3}=u_{0}^{3}Q^{(4)}=(u_{0}+u_{1}+u_{3},u_{2}+u_{3},u_{1}+u_{2}+u_{3},u_{3})$ . Since $\varphi=2$ implies $u_{0}^{1}=\mathbf{0}$ , one has $c_{0}=c_{3}=u_{3}$ , $c_{1}=c_{2}=u_{2}+u_{3}$ and one can restore $u_{3}$ by $c_{0}$ or $c_{3}$ . Thus, $\xi^{(2,2)}_{4}(\left\langle{01}\right\rangle)=\left\{\left\{1,2\right\},\left\{0,1,2\right\},\left\{1,2,3\right\}\right\}$ .

III-E Coset minimum weight and erasure configurations

For a subspace $s\in\mathbb{S}_{j}$ , we denote the minimal cardinality of $(s,\varphi,j)$ -configuration as

[TABLE]

assuming that the minimum over the empty set is $+\infty$ .

Theorem 1.

Let $\varphi\in[n]$ and $j>0$ . For any $p\in\mathbb{F}^{j}$ ,

[TABLE]

Proof.

Denote $\mathcal{A}=\left\{\operatorname{supp}(c)\big{|}c\in\mathcal{C}^{(\varphi)}_{n}(p)\right\}$ ,

[TABLE]

Then the theorem can be reformulated as $\min_{\Omega\in\mathcal{A}}|\Omega|=\min_{\mathcal{E}\in\mathcal{B}}|\mathcal{E}|.$

If $\Omega\in\mathcal{A}$ , then there exists $u_{\varphi}^{n-1}$ , such that $p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{\varphi}^{\varphi+j-1}=1$ and $\Omega=\operatorname{supp}(c_{0}^{n-1})$ for $c_{0}^{n-1}=(\mathbf{0}^{\varphi},u_{\varphi}^{n-1})G^{(n)}$ . In this case $c_{\overline{\Omega}}=\mathbf{0}$ and the all-zero value $\hat{u}_{\varphi}^{n-1}=\mathbf{0}$ also belongs to set (11) of possible values of $u_{\varphi}^{n-1}$ for the given $c_{\overline{\Omega}}$ , but $p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}\hat{u}_{\varphi}^{\varphi+j-1}=0$ . Thus, the value of $p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{\varphi}^{\varphi+j-1}$ is not recoverable after erasure configuration $\Omega$ , which implies $p\notin\chi^{(\varphi,j)}_{n}(\Omega)\implies\Omega\in\mathcal{B}$ . So, $\Omega\in\mathcal{A}\implies\Omega\in\mathcal{B}$ and $\displaystyle\min_{\Omega\in\mathcal{A}}|\Omega|\geq\min_{\mathcal{E}\in\mathcal{B}}|\mathcal{E}|$ .

If $\mathcal{E}\in\mathcal{B}$ , then $p\notin\chi^{(\varphi,j)}_{n}(\mathcal{E})$ , which by Definition 2 implies $(p,\mathbf{0}^{k-j})\notin\operatorname{cs}(\hat{G})$ and $\exists a_{0}^{k-1}\in\operatorname{cs}^{\perp}(\hat{G}):(p,\mathbf{0}^{k-j})~{}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}~{}a_{0}^{k-1}=~{}1$ , which implies $p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}a_{0}^{j-1}=1$ . Denote $\hat{c}_{0}^{n-1}=(\mathbf{0}^{\varphi},a_{0}^{k-1})G^{(n)}$ . Since $p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}a_{0}^{j-1}=1$ , by Definition 1 one has $\hat{c}_{0}^{n-1}\in\mathcal{C}^{(\varphi)}_{n}(p)$ , and therefore $\operatorname{supp}(\hat{c})\in\mathcal{A}$ . On the other hand, $\hat{c}_{\overline{\mathcal{E}}}=a_{0}^{k-1}\hat{G}=\mathbf{0}$ , which means $\operatorname{supp}(\hat{c})\subseteq\mathcal{E}$ . So, $\forall\mathcal{E}\in\mathcal{B}\;\;\exists\Omega\in\mathcal{A}:\Omega\subseteq\mathcal{E}$ , hence, $\displaystyle\min_{\Omega\in\mathcal{A}}|\Omega|\leq\min_{\mathcal{E}\in\mathcal{B}}|\mathcal{E}|$ . ∎

Corollary 1.

For any $j>0:$

[TABLE]

IV Bound on Minimum Distance of Convolutional Polar Codes

The structure of the convolutional polarizing transformation $Q^{(n)}$ , $n=2^{m}$ , enables one to compute easily $\delta^{(\varphi,j)}_{n}(s)$ , defined in (16), for $j=3$ . By computing values of $\delta^{(\varphi,3)}_{n}(s)$ , one can obtain values of $d^{(\varphi)}_{n}$ by Corollary 1 and lower bound on minimum distance by Lemma 1.

Consider transmission of $c_{0}^{n-1}=u_{0}^{n-1}Q^{(n)}$ , such that $u_{0}^{\varphi-1}=\mathbf{0}$ , through BEC and let the erasure configuration be $\mathcal{E}$ . The intuition behind recursive computing of $\delta^{(\varphi,3)}_{n}(s)$ is as follows.

Consider the case of $\varphi=2\psi+1<n-1$ . Denote $x_{0}^{n/2-1}=u_{0}^{n-1}X^{(n)}$ , $z_{0}^{n/2-1}=u_{0}^{n-1}Z^{(n)}$ , $\mathcal{E}^{\prime}=\mathcal{E}\cap[\frac{n}{2}]$ , $\mathcal{E}^{\prime\prime}=\left\{i\geq 0|i+\frac{n}{2}\in\mathcal{E}\right\}$ . Recall that $\chi^{(2\psi+1,3)}_{n}(\mathcal{E})$ is the set of all $p_{0}^{2}$ , such that the value of $p_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{2\psi+1}^{2\psi+3}$ can be deduced from $c_{0}^{n-1}$ after erasure configuration $\mathcal{E}$ . Similarly, $\chi^{(\psi,3)}_{n/2}(\mathcal{E}^{\prime})$ and $\chi_{n/2}^{(\psi,3)}(\mathcal{E}^{\prime\prime})$ are the sets of $q_{0}^{2}$ and $r_{0}^{2}$ , s.t. $q_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}x_{\psi}^{\psi+2}$ and $r_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}z_{\psi}^{\psi+2}$ are recoverable from $c_{0}^{n/2-1}$ and $c_{n/2}^{n-1}$ after erasure configurations $\mathcal{E}^{\prime}$ and $\mathcal{E}^{\prime\prime}$ , under assumption $x_{0}^{\psi-1}=\mathbf{0}$ and $z_{0}^{\psi-1}=\mathbf{0}$ , respectively. By (4)–(5) one obtains $x_{i}=u_{2i}+u_{2i+1}+u_{2i+2}$ and $z_{i}=u_{2i+1}+u_{2i+2}$ for $i<\frac{n}{2}-1$ , which, together with $u_{0}^{2\psi}=\mathbf{0}$ , implies $x_{0}^{\psi-1}=z_{0}^{\psi-1}=\mathbf{0}$ , so the above assumption holds. Furthermore, since $u_{0}^{n-1}$ was processed by the $m$ -th layer of CvPT before the transmission, the value of elements of $u_{2\psi+1}^{2\psi+3}$ , as well as the value of any linear combination $p_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{2\psi+1}^{2\psi+3}$ , can be deduced only from known linear combinations of elements of $x_{\psi}^{n-1}$ and $z_{\psi}^{n-1}$ . However, for any $x_{\psi+3}^{n/2-1}$ , $z_{\psi+3}^{n/2-1}$ and $u_{2\psi+1}^{2\psi+3}$ , one can find $u_{2\psi+4}^{n-1}$ , such that $(\mathbf{0}^{2\psi+1},u_{2\psi+1}^{n-1})=\left(\mathbf{0}^{\psi},x_{\psi}^{n/2-1},\mathbf{0}^{\psi},z_{\psi}^{n/2-1}\right)Q^{(n)}$ as follows: set $u_{2i+2}$ to $x_{i+1}+z_{i+1}$ for $i=\frac{n}{2}-2,\ldots,\psi+1$ , set $u_{n-1}$ to $z_{n/2-1}$ , and set $u_{2i+1}$ to $z_{i}+u_{2i+2}$ for $i=\frac{n}{2}-2,\ldots,\psi+2$ . So, for any $p\in\mathbb{F}^{3}$ , even complete knowledge of $x_{\psi+3}^{n/2-1}$ and $z_{\psi+3}^{n/2-1}$ does not provide the value $p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{2\psi+1}^{2\psi+3}$ . Thus, recoverable linear combinations $q_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}x_{\psi}^{\psi+2}$ and $r_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}z_{\psi}^{\psi+2}$ contain all information about recoverable linear combinations $p_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{2\psi+1}^{2\psi+3}$ , and therefore $\chi^{(2\psi+1,3)}_{n}(\mathcal{E})$ can be uniquely deduced from given $\chi^{(\psi,3)}_{n/2}(\mathcal{E}^{\prime})$ and $\chi^{(\psi,3)}_{n/2}(\mathcal{E}^{\prime\prime})$ . The similar consideration for $\varphi=2\psi+2$ leads to the fact that $\chi^{(2\psi+2,3)}_{n}(\mathcal{E})$ can also be deduced from $\chi^{(\psi,3)}_{n/2}(\mathcal{E}^{\prime})$ and $\chi^{(\psi,3)}_{n/2}(\mathcal{E}^{\prime\prime})$ .

Let $\psi=\left\lfloor{\frac{\varphi-1}{2}}\right\rfloor$ , $\mathbb{S}_{3}=\{\mathcal{T}_{i}\}_{i=0}^{15}$ . For any $l\in[16]$ , consider $(\mathcal{T}_{l},\varphi,3)$ -erasure configuration $\mathcal{E}$ for which the minimum in (16) is achieved, i.e. $\chi^{(\varphi,3)}_{n}(\mathcal{E})=\mathcal{T}_{l}$ and $|\mathcal{E}|=\delta_{n}^{(\varphi,3)}(\mathcal{T}_{l})$ . Obviously, $|\mathcal{E}|=|\mathcal{E}^{\prime}|+|\mathcal{E}^{\prime\prime}|$ . Let $\chi^{(\psi,3)}_{n/2}(\mathcal{E}^{\prime})=\mathcal{T}_{i}$ , $\chi^{(\psi,3)}_{n/2}(\mathcal{E}^{\prime\prime})=\mathcal{T}_{j}$ . Then, $\mathcal{E}^{\prime}$ and $\mathcal{E}^{\prime\prime}$ are also the minimum-weight $(\mathcal{T}_{i},\psi,3)$ - and $(\mathcal{T}_{j},\psi,3)$ - erasure configurations, respectively, i.e. $|\mathcal{E}^{\prime}|=\delta_{n/2}^{(\psi,3)}(\mathcal{T}_{i})$ , and $|\mathcal{E}^{\prime\prime}|=\delta_{n/2}^{(\psi,3)}(\mathcal{T}_{j})$ . We know that $\mathcal{T}_{l}$ can be deduced from $\mathcal{T}_{i}$ and $\mathcal{T}_{j}$ , i.e., for each $\varphi$ and $n$ there is a function $\mathbf{T}^{(\varphi)}_{n}(i,j)$ , which returns $\mathcal{T}_{l}$ for given $i$ and $j$ , and for considered minimum-weight $\mathcal{E}$ , $\mathcal{E}^{\prime}$ , $\mathcal{E}^{\prime\prime}$ one can obtain $\delta^{(\varphi,3)}_{n}(\mathbf{T}^{(\varphi)}_{n}(i,j))=\delta^{(\psi,3)}_{n/2}(\mathcal{T}_{i})+\delta^{(\psi,3)}_{n/2}(\mathcal{T}_{j})$ .

It appears that $\mathbf{T}^{(\varphi)}_{n}=\mathbf{T}^{(\varphi^{\prime})}_{n^{\prime}}$ if $\varphi\equiv\varphi^{\prime}\mod 2$ , i.e., there are only two different functions $\mathbf{T}^{(\varphi)}_{n}$ : one for odd $\varphi$ and another one for even $\varphi$ . They are defined as $\mathbf{T}_{o},\mathbf{T}_{e}:[16]\times[16]\to\mathbb{S}_{3}$ , such that

[TABLE]

The above consideration form the following theorem.

Theorem 2.

Denote $\Delta^{(\varphi)}_{n,l}=\delta^{(\varphi,3)}_{n}(\mathcal{T}_{l})$ for $l\in[16]$ , $n=2^{m}$ . Then, for a CvPT , for $0\leq\psi<\frac{n}{2}:$

[TABLE]

The base of the recursion is

[TABLE]

Remark 3.

Note that formulae (19)–(20) include the cases of $\Delta^{(n-2)}_{n,l}=\delta^{(n-2,3)}_{n}(\mathcal{T}_{l})$ and $\Delta^{(n-1)}_{n,l}=\delta^{(n-1,3)}_{n}(\mathcal{T}_{l})$ . They can be obtained according to the assumption in Remark 2 as follows. For $s\in\mathbb{S}_{i+h}$ , denote the set of tails of length $i$ by $s|_{i}=\left\{p_{0}^{i-1}\;\big{|}\;p_{0}^{i+h-1}\in s\right\}$ . We assume that any erasure configuration does not erase $u_{n-1+h}$ for any $h>0$ , i.e.

[TABLE]

The same assumption is applied for computing the values of $\Delta^{(0)}_{1,l}=\delta^{(0,3)}_{1}(\mathcal{T}_{l})$ from the values $\delta^{(0,1)}_{1}(s)$ for $s\in\mathbb{S}_{1}$ that are given by the base (21) of the recursion. This assumption, though not natural since symbols $u_{n+h}$ , $h\geq 0$ do not exist, allows one to employ the unified formulae (19)–(20) for the cases of $\varphi>n-3$ .

Remark 4.

Formula (20) in the case of $\Delta^{(0)}_{n,l}$ leads to computing $\Delta^{(-1)}_{n/2,i}=\delta^{(-1,3)}_{n/2}(\mathcal{T}_{i})$ , which is formally equal, for a given $\mathcal{T}_{i}$ , to the minimum weight of an erasure configuration which erases values $p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \displaystyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \textstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptstyle\bullet $}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{$ \scriptscriptstyle\bullet $}}}}}u_{-1}^{2}$ for and only for $p\in\mathcal{T}_{i}$ . For the symbols $u_{-i}$ , $i>0$ , we do not employ the same assumption as in Remark 3. If one assumes that symbols with negative indices are always known and employs functions $\mathbf{T}_{o}$ and $\mathbf{T}_{e}$ , one would obtain that input symbols on the current layer of convolutional polarizing transformation $u_{-2}$ , $u_{-1}$ , and input symbol $x_{-1}=u_{-2}+u_{-1}+u_{0}$ on the next layer are always known, which implies that $u_{0}$ is always known. This would result in incorrect value of $\Delta_{n,l}^{(0)}$ . Thus, we assume that $u_{-i}$ for $i>0$ are always erased, which leads to

[TABLE]

Proof.

The proof is in the Appendix. ∎

The values $d^{(i)}_{n}$ for the case of CvPC can be computed with Algorithm 1. The three-dimensional array $\tau$ of subspaces of $\mathbb{F}^{3}$ is initialized in lines 1–1, such that $\tau[0][i][j]=\mathbf{T}_{e}(i,j)$ and $\tau[1][i][j]=\mathbf{T}_{o}(i,j)$ . The values $\Delta^{(0)}_{1,*}$ are computed in lines 1–1. Function M1Cluster, presented in Algorithm 2, is called to obtain $\Delta^{(-1)}_{n,*}$ for $n=1$ and $n=2^{\lambda}$ , respectively, in lines 1 and 1.

The values of $\Delta^{(\varphi)}_{2^{\lambda},*}$ for $-1\leq\varphi<2^{\lambda}$ are computed by Theorem 2 in lines 1–1 and stored in array $C^{\prime}$ , using values of $\Delta^{(\psi)}_{2^{\lambda-1},*}$ for $-1\leq\psi<2^{\lambda-1}$ , which are stored in array $C$ . The values $d_{n}^{(i)}$ are obtained as $d[i],i\in[n]$ .

The asymptotic complexity of the Algorithm 1 is defined by the complexity of the main loop 1–1. The complexity of the $\lambda$ -th iteration of the loop is defined by the complexity of the loop in lines 1–1, which consists of $2^{\lambda}$ iterations, each of them has complexity $O(1)$ . Thus, the overall asymptotic complexity is $\sum_{\lambda=1}^{\log n}O(2^{\lambda})=O(n)$ .

In Fig. 2 the lower bound on minimum distance, computed by (10), for CvPCs of lengths $64$ , $1024$ , $16384$ is presented. The codes are obtained via the Monte-Carlo method by minimization of the $E_{b}/N_{0}$ needed to achieve the SC decoding error probability $10^{-3}$ . For comparison, we also report the results for Arikan polar codes, which are optimized in the same way. One can see that CvPCs can have lower, equal or higher minimum distance, compared to Arikan polar codes.

Unlike the case of Arikan polarizing transformation $A^{(n)}$ , the weight of the $i$ -th row of CvPT $Q^{(n)}$ is not necessarily equal to $d^{(i)}_{n}$ . Thus, the bound (10) is not exact at least for codes with $\mathcal{I}=\{i\}$ . In general, it is not known, for which cases the bound is exact. However, by employing the low-weight codeword search algorithm presented in [8], we verified that the bound is exact for CvPCs with $m=5,\dots,13$ , rates $\frac{1}{20},\dots\frac{19}{20}$ and target FER of SC decoding $10^{-2}$ , $10^{-3}$ , $10^{-4}$ , $10^{-5}$ , and $10^{-6}$ .

V Convolutional Polar Subcodes

In general, the SC decoding algorithm for polar-like codes does not provide ML decoding. The Tal-Vardy list decoding algorithm [11] for polar codes can be immediately extended to the case of CvPC using the techniques presented in [4]. With sufficiently large list size $L$ the SCL algorithm delivers near-ML decoding. The SCL decoding error probability of convolutional polar codes is lower than that of classical polar codes, but still can be improved by extending the construction of randomized polar subcodes [10] to the case of convolutional polarizing transformation.

By Lemma 1, any codeword $c_{0}^{n-1}=u_{0}^{n-1}G^{(n)}$ of weight $d$ corresponds to vector $u_{0}^{n-1}$ with at least one symbol $u_{i}=1,i\in\mathcal{I}$ , such that $d^{(i)}_{n}\leq d$ . In the case of polar codes, $d^{(i)}_{n}$ is equal to the weight of the $i$ -th row of $A^{(n)}$ . In the case of CvPCs one can obtain $d^{(i)}_{n}$ by Algorithm 1.

A code construction, which has low SCL decoding error probability, was proposed in [10] for the case of classical polar codes as polar subcodes. Polar subcodes are obtained as a generalization of polar codes, where some symbols $u_{\varphi},\varphi\in\mathcal{D}$ , called dynamic frozen symbols, are not set to zero, but to linear combinations of previous symbols $u_{i},i<\varphi$ . This approach can be immediately extended to the case of convolutional polarizing transformation. Namely, the dynamic freezing constraints should be constructed, so that they involve all non-frozen symbols $u_{i}$ with the smallest $d_{n}^{(i)}$ , but the indices of dynamic frozen symbols $i\in\mathcal{D}$ should be as small as possible, so that the SCL decoding algorithm can process these constraints at the earliest possible phases, minimizing thus the probability of a correct path being killed.

This results in the following code construction algorithm:

Construct $(n,k+f)$ convolutional polar code, i.e. assign $u_{\mathcal{S}}=\mathbf{0}$ for the static frozen set $\mathcal{S}\subset[n]$ of worst $n-k-f$ bit subchannels. 2. 2.

Choose dynamic frozen set $\mathcal{D}\subseteq[n]\setminus\mathcal{S}$ as the set of $f$ indices of minimum-weight bit subchannels with the largest indices, that are not static frozen. Set

[TABLE]

where the frozen set $\mathcal{F}=\mathcal{S}\cup\mathcal{D}$ consists of indices of static frozen or dynamic frozen symbols, and $V_{i,j}$ are distributed uniformly over $\mathbb{F}$ .

The set $\mathcal{I}$ for a convolutional polar code optimized for SC decoding can be chosen either by evolution of erasure probabilities proposed in [2], or by Monte-Carlo simulations of genie-aided SC decoder. Due to lack of analysis techniques for the list SC decoding algorithm, the optimal value of $f$ should be determined by simulations.

Another component of the construction introduced in [10] is type-B dynamic freezing constraints, which are imposed on the symbols transmitted over the least reliable yet unfrozen subchannels. These constraints speed up error propagation for incorrect paths in the list SC algorithm, so that the probabilities (2) of these paths decrease quickly, reducing thus the probability of a correct path being killed. However, simulations of moderate-length CvPS show that type-B dynamic frozen symbols do not provide any noticeable gain in the case of CvPS.

VI Performance of Convolutional Polar Subcodes

In Fig. 3 the performance of $(1024,512)$ CvPS, polar code and polar subcode is presented for $f=10$ for the case of AWGN channel. The polar code and the polar subcode are constructed for AWGN channel with $E_{b}/N_{0}=2$ dB using Gaussian approximation of density evolution [12], and the CvPS is constructed for the same channel using Monte-Carlo simulations for subchannels qualities. One can see that the CvPS outperforms randomized polar subcodes [10], CvPC [2] and CvPC concatenated with CRC-10.

In Fig. 4 the performance of a $(4096,2048)$ CvPS with $f=12$ type-A dynamic frozen symbols is presented. Transmission of BPSK-modulated symbols over AWGN channel with $E_{b}/N_{0}=1.25$ dB is considered. The decoding algorithm is the SCL decoding with different values of list size that are shown in the x-axis. The performance of CvPSs is compared to that of a polar subcode with $f=12$ type-A dynamic frozen symbols and $52$ type-B dynamic frozen symbols. One can see that the CvPS under SCL decoding with the same list size outperforms classical polar subcodes. The smaller list size can be used to achieve the same FER, which allows less sophisticated hardware implementation.

In Fig. 5 the complexity (the number of operations) of SCL decoding, based on the expressions derived in [4], of the described above codes is compared for list size $L=1\dots 64$ for the CvPS and $L=1\dots 1024$ for the polar subcode. The complexity is obtained as the number of additions and comparisons of LLRs. The complexity of SC decoding for CvPS is approximately $46.5n\log n$ , as shown in [4]. The complexity of SC decoding of polar codes is $n\log n$ . However, as was shown in [2], CvPT induces stronger polarization than Arikan polarizing transformation, so the smaller list size is needed to achieve the same FER. This leads to the smaller complexity needed to achieve FER less than $6\cdot 10^{-4}$ in the case of CvPS, because achieving this FER requires list size $L=352$ for polar subcodes and only $L=28$ for CvPS. Furthermore, for a large list size the SCL decoding is near-ML, and for sufficiently good channel FER of ML-decoding is mainly defined by the minimum distance and the error coefficient. Dynamic frozen symbols decrease the error coefficient and may even increase the minimum distance of a CvPS. In Fig. 2 one can see that the minimum distance of CvPS is higher than that of CvPC.

VII Conclusions

In this paper a tight lower bound on minimum distance of convolutional polar codes is provided. Furthermore, a generalization of the randomized construction of polar subcodes to the case of convolutional polarizing transformation is proposed. Simulations show that the proposed code construction has lower frame error rate under SCL decoding [4] compared to polar subcodes with the same list size. The complexity for achieving the same FER with convolutional polar subcodes can be lower than in the case of polar subcodes [10] based on Arikan polarizing transformation.

Proof of Theorem 2. For erasure configuration $\mathcal{E}\subseteq[n]$ , denote $\mathcal{E}^{\prime}=\mathcal{E}\cap[n/2]$ and $\mathcal{E}^{\prime\prime}=\left\{j-n/2\mid j\in\mathcal{E}\setminus[n/2]\right\}$ . We now consider the case of $\varphi=2\psi+1$ and prove (19).

Note that $u_{0}^{2\psi}=\mathbf{0}^{2\psi+1}$ implies $x_{0}^{\psi-1}=z_{0}^{\psi-1}=\mathbf{0}^{\psi}$ . By (3) one obtains

[TABLE]

where $\hat{Q}=Q^{(n)}_{\overline{[2\psi+1]},\overline{\mathcal{E}}}$ , $\hat{Q}^{\prime}=Q^{(n/2)}_{\overline{[\psi]},\overline{\mathcal{E}^{\prime}}}$ , $\hat{Q}^{\prime\prime}=Q^{(n/2)}_{\overline{[\psi]},\overline{\mathcal{E}^{\prime\prime}}}$ , $\hat{X}=X^{(n)}_{\overline{[2\psi+1]},\overline{[\psi]}}$ , $\hat{Z}=Z^{(n)}_{\overline{[2\psi+1]},\overline{[\psi]}}$ . By (13), $p_{0}^{2}\in\chi^{(\varphi,3)}_{n}(\mathcal{E})$ iff there exists $q$ :

[TABLE]

where $q=(q^{\prime},q^{\prime\prime})$ , $k=n-\varphi$ , which implies, in particular,

[TABLE]

Denote $a=q^{\prime}\hat{Q}^{\prime T}$ , $b=q^{\prime\prime}\hat{Q}^{\prime\prime T}$ . Thus, $a\in\operatorname{cs}(\hat{Q}^{\prime})$ , $b\in\operatorname{cs}(\hat{Q}^{\prime\prime})$ . Then (22) implies $a\hat{X}_{\overline{[3]},*}^{T}=b\hat{Z}^{T}_{\overline{[3]},*}$ , so from (4)–(5) one obtains

[TABLE]

which leads to the system of equations

[TABLE]

It is easy to see that (23) implies $a_{i}=b_{i}=0$ for $i\geq 3$ . Let $k^{\prime}=n/2-\psi$ . By above consideration, for any $p\in\mathbb{F}^{3}$ one has $(p,\mathbf{0}^{k-3})\in\operatorname{cs}(\hat{Q})$ iff there exists $p^{\prime},p^{\prime\prime}\in\mathbb{F}^{3}$ , s.t. $(p^{\prime},\mathbf{0}^{k^{\prime}-3})\in\operatorname{cs}(\hat{Q}^{\prime})$ , $(p^{\prime\prime},\mathbf{0}^{k^{\prime}-3})\in\operatorname{cs}(\hat{Q}^{\prime\prime})$ , and

[TABLE]

Note that two last elements of vector in the left-hand side equals [math], and two last rows in the right hand size of (24) are identical, so last rows of these matrices can be removed. The resulting matrices are equal to $X^{(6)}_{\overline{[1]},*}$ and $Z^{(6)}_{\overline{[1]},*}$ , respectively. Recalling (13), one obtains that $\chi_{n}^{(2\psi+1,3)}(\mathcal{E})$ consists of all $p_{0}^{2}$ , for which there exist ${p^{\prime}}\in\chi_{n/2}^{(\psi,3)}(\mathcal{E}^{\prime})$ , $p^{\prime\prime}\in\chi_{n/2}^{(\psi,3)}(\mathcal{E}^{\prime\prime})$ :

[TABLE]

Observe that (25) is equivalent to the equation in the right-hand side of (17). Obviously, $|\mathcal{E}|=|\mathcal{E}^{\prime}|+|\mathcal{E}^{\prime\prime}|$ and the minimal cardinality of $(\mathcal{T}_{l},2\psi+1,3)$ -configuration $|\mathcal{E}|$ for each $\mathcal{T}_{l}\in\mathbb{S}_{3}$ can be found exactly as it is stated in (19).

Equality (20) can be proved similarly.

Bibliography12

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. J. Ferris and D. Poulin, “Branching MERA codes: a natural extension of polar codes,” Co RR , vol. abs/1312.4575, 2013. [Online]. Available: http://arxiv.org/abs/1312.4575
2[2] A. J. Ferris, C. Hirche, and D. Poulin, “Convolutional polar codes,” Co RR , vol. abs/1704.00715, 2017. [Online]. Available: http://arxiv.org/abs/1704.00715
3[3] E. Arikan, “Channel polarization: A method for constructing capacity-achieving codes for symmetric binary-input memoryless channels,” IEEE Transactions on Information Theory , vol. 55, no. 7, pp. 3051–3073, July 2009.
4[4] R. Morozov and P. Trifonov, “Efficient SC decoding of convolutional polar codes,” in 2018 International Symposium on Information Theory and its Applications (ISITA 2018) , Singapore, Singapore, Oct. 2018.
5[5] H. Saber, Y. Ge, R. Zhang, W. Shi, and W. Tong, “Convolutional polar codes: LLR-based successive cancellation decoder and list decoding performance,” in 2018 IEEE International Symposium on Information Theory (ISIT) , June 2018, pp. 1480–1484.
6[6] T. Prinz and P. Yuan, “Successive cancellation list decoding of BMERA codes with application to higher-order modulation,” in 2018 International Symposium on Turbo Codes and Iterative Information Processing (ITW) , December 2018.
7[7] N. Hussami, S. B. Korada, and R. Urbanke, “Performance of polar codes for channel and source coding,” in Proceedings of IEEE International Symposium on Information Theory , 2009, pp. 1488–1492.
8[8] A. Canteaut and F. Chabaud, “A new algorithm for finding minimum-weight words in a linear code: Application to Mc Eliece’s cryptosystem and to narrow-sense BCH codes of length 511 511 511 ,” IEEE Transactions on Information Theory , vol. 44, no. 1, pp. 367–378, January 1998.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

On Distance Properties of Convolutional Polar Codes

Abstract

Index Terms:

I Introduction

II Background

II-A Notations

II-B A Representation of a Linear Block Code and Successive Cancellation Decoding

II-C Convolutional Polar Codes

III A Lower Bound on The Minimum Distance of Linear Codes

III-A Basic Definitions

Example 1**.**

III-B Outline of the Approach

III-C Minimum Weight of Cosets and the Minimum Distance

Definition 1**.**

Remark 1**.**

Lemma 1**.**

Proof.

III-D Recoverable and erased vectors

Definition 2**.**

Remark 2**.**

Example 2**.**

Example 3**.**

III-E Coset minimum weight and erasure configurations

Theorem 1**.**

Proof.

Corollary 1**.**

IV Bound on Minimum Distance of Convolutional Polar Codes

Theorem 2**.**

Remark 3**.**

Remark 4**.**

Proof.

V Convolutional Polar Subcodes

VI Performance of Convolutional Polar Subcodes

VII Conclusions

Example 1.

Definition 1.

Remark 1.

Lemma 1.

Definition 2.

Remark 2.

Example 2.

Example 3.

Theorem 1.

Corollary 1.

Theorem 2.

Remark 3.

Remark 4.