On Distance Properties of Convolutional Polar Codes
Ruslan Morozov, Peter Trifonov

TL;DR
This paper establishes a lower bound on the minimum distance of convolutional polar codes, introduces a new subcode construction with improved decoding performance, and compares its decoding complexity and error probability favorably to Arikan polar codes.
Contribution
It provides a novel lower bound on the minimum distance and proposes convolutional polar subcodes with enhanced decoding efficiency and error performance.
Findings
Lower bound on minimum distance derived
Convolutional polar subcodes outperform Arikan polar subcodes in decoding error probability
Decoding complexity of convolutional polar subcodes is lower for large list sizes
Abstract
A lower bound on minimum distance of convolutional polar codes is provided. The bound is obtained from the minimum weight of generalized cosets of the codes generated by bottom rows of the polarizing matrix. Moreover, a construction of convolutional polar subcodes is proposed, which provides improved performance under successive cancellation list decoding. For sufficiently large list size, the decoding complexity of convolutional polar subcodes appears to be lower compared to Arikan polar subcodes with the same performance. The error probability of successive cancellation list decoding of convolutional polar subcodes is lower than that of Arikan polar subcodes with the same list size.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On Distance Properties of Convolutional Polar Codes
Ruslan Morozov, , Peter Trifonov The authors are with the Saint Petersburg Polytechnic University, Russia. E-mail: {rmorozov, petert}@dcn.icc.spbstu.ru
Abstract
A lower bound on minimum distance of convolutional polar codes is provided. The bound is obtained from the minimum weight of generalized cosets of the codes generated by bottom rows of the polarizing matrix. Moreover, a construction of convolutional polar subcodes is proposed, which provides improved performance under successive cancellation list decoding. For sufficiently large list size, the decoding complexity of convolutional polar subcodes appears to be lower compared to Arikan polar subcodes with the same performance. The error probability of successive cancellation list decoding of convolutional polar subcodes is lower than that of Arikan polar subcodes with the same list size.
Index Terms:
Convolutional polar codes, polar codes, successive cancellation decoding, list decoding, polar subcodes.
I Introduction
In this paper we consider codes that were firstly introduced as branching-MERA codes [1] and then as convolutional polar codes (CvPCs) [2] by A. J. Ferris, C. Hirche and D. Poulin. These codes were shown to provide substantially better performance under successive cancellation (SC) decoding compared to classical polar codes [3]. In [2] both open-boundary and periodic-boundary CvPCs are presented, in this paper by CvPCs we always mean open-boundary CvPCs. In [4] the efficient min-sum implementation of SC decoding is presented for CvPCs, which requires one to perform only comparisons and additions and can be easily extended to the case of SC list (SCL) decoding. Other implementations of SCL decoding for CvPCs are presented in [5, 6].
Classical polar codes provide quite poor performance under SCL decoding due to very low minimum distance, which scales as [7]. Although the minimum distance of a polar code can be found simply, the problem of computing minimum distance of an arbitrary linear code is NP-complete. However, for moderate-length codes minimum distance can be obtained by method presented in [8].
The generator matrix of a CvPC consists of rows of non-singular matrix , called convolutional polarizing transformation (CvPT). In this paper we derive a tight lower bound on the minimum distance of CvPCs, based on computing the minimum weight of a coset, given by the -th row of CvPT, of a linear code, generated by the last rows of CvPT. The weight enumerator polynomial of such coset can be expressed as , where is a weight spectrum of code generated by the last rows of matrix . In the case of polar codes, an efficient method for approximate enumerator evaluation is available [9]. However, for convolutional polar codes there are no methods for evaluation of coset enumerator.
The minimum distance of CvPCs appears to be of the same order as in the case of classical polar codes. However, by generalizing the construction of randomized polar subcodes [10] to the case of CvPC, we obtain convolutional polar subcodes (CvPSs) with reduced error coefficient, which provide superior performance under SCL decoding, compared to polar subcodes.
The paper is organized as follows. In Section II we introduce representation of linear block codes, which is natural for the cases of Arikan and convolutional polar codes. The concepts of generalized cosets and recoverable vectors are introduced in Section III and are used to obtain a lower bound on the minimum distance of linear block codes. An efficient algorithm for computing the lower bound in the case of CvPC is provided in Section IV. This algorithm is aimed to explore some properties of low-weight codewords of CvPC. These properties are used for a construction of convolutional polar subcodes, which is proposed in Section V. The performance of the proposed code construction is presented in Section VI.
II Background
II-A Notations
The following notations are used throughout the paper. denotes the Galois field of two elements. For integer we denote . For vector symbol . For two vectors and we denote their concatenation by . For matrix and sets , by we denote the submatrix of with rows with indices from set and columns with indices from set , indexing of rows and columns starts with zero. Similar notations are applied to vectors as well. If or , this means that all rows or all columns of the original matrix are in the submatrix. Furthermore, denotes submatrix of consisting of rows and columns with indices that are not in and , respectively. The vector of zeroes is denoted by , or just by if is clear from the context.
II-B A Representation of a Linear Block Code and Successive Cancellation Decoding
Consider binary linear block code in the form
[TABLE]
where is an non-singular binary matrix, is called information set and is called frozen set. The generator matrix of such code is . Note that any linear code with generator matrix can be expressed as in (1) with , such that for some . For example, classical polar codes [3] have for .
For such code representation, the successive cancellation (SC) decoding method can be defined. Consider transmission of codeword through binary-input memoryless channel . Let be the output of this channel. After demodulation, the probabilities for are provided to the decoding algorithm. Given the prior hard decisions , at phase the SC decoding algorithm calculates probabilities , defined as
[TABLE]
where . The channels are called bit subchannels. Then, the hard decision on is made by
[TABLE]
The SC decoding can be defined for any linear code, if an efficient method for computing is available. However, SC decoding can provide reasonable performance only for codes with , such that the capacities of bit subchannels polarize, i.e. converge to [math] or with .
II-C Convolutional Polar Codes
Convolutional polar codes [2] (CvPCs) are a family of linear block codes, for which , , is equal to the matrix of convolutional polarizing transformation (CvPT) , such that
[TABLE]
where , and are matrices, defined for even as
[TABLE]
For example, , . Expansion (3) corresponds to one layer of CvPT. In Fig. 1, the -th layer of CvPT is a mapping of vector to vectors and .
It is shown in [4] that for , , the value of for CvPT can be recursively computed as
[TABLE]
for , where , and , are subvectors of . These formulae are the same as in [4] under permutation of the output vector by the bit-reversal permutation, which is omitted from the definition (3) of CvPT for the sake of simplicity.
III A Lower Bound on The Minimum Distance of Linear Codes
III-A Basic Definitions
Let be the set of all linear subspaces of .
Denote a_{0}^{l-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}b_{0}^{l-1}=\sum_{i=0}^{l-1}a_{i}b_{i}, where . For vectors , denote by the linear subspace of with basis vectors , i.e.
[TABLE]
A sum over an empty set is assumed to be equal to zero, which implies , where is clear from the context. By abuse of notation, we write for to denote a vector .
Example 1**.**
It can be seen that , and .
III-B Outline of the Approach
Consider a code in the form (1) with , i.e. the set of vectors . Code can be split in two sets corresponding to each value of . Namely, , where consists of all codewords of the form . These subsets are equal to the subsets, which probabilities are computed at the -th phase of the SC decoding algorithm by (2), provided that the estimated symbols are zero. Since we are interested in distance properties of the code, we can assume that .
Let be the distance between and , i.e. . Consider and , for such the minimum is achieved, i.e., , , such that . Note that corresponds to value , so . Hence, is equal to the weight of a minimum-weight codeword from . In general, we can say that if , i.e., all previous symbols are estimated correctly, then the probability of erroneous estimation of in the case of transmission over sufficiently good binary memoryless channel is mainly defined by .
In section III-C we consider the partition of in two sets and not by the value of , but by the value of some linear combination p_{0}^{j-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{\varphi}^{\varphi+j-1} of symbols . Thus, set , consists of all codewords satisfying p_{0}^{j-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{\varphi}^{\varphi+j-1}=a.
In section III-D we consider transmission of codewords through binary erasure channel (BEC) , defined as , , where is the erasure probability. We consider mapping of the set of erased symbols to the set of all linear combinations of symbols , which can be recovered by the receiver by given . Thus, we consider a set of all vectors , such that the value of corresponding linear combination p_{0}^{j-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{\varphi}^{\varphi+j-1} can be recovered by receiver after erasure configuration . It appears that , i.e. is a linear subspace of .
In section III-E, we prove that the minimum weight of vector from (i.e., the distance between and ) is equal to the minimum number of erasures, such that corresponding subspace of coefficients of recoverable linear combinations does not include the linear combination with coefficients .
These results are combined to derive the algorithm for computing in the case of CvPC, which leads to the lower bound on minimum distance of CvPC and the construction of CvPS. Furthermore, we believe that the introduced concepts and their properties can be used for other that have recursive structure.
III-C Minimum Weight of Cosets and the Minimum Distance
Definition 1**.**
Given an non-singular matrix , for a vector define a generalized coset as
[TABLE]
Remark 1**.**
In the case of , we assume in (9) that for .
We define the weight of the -th bit subchannel as
[TABLE]
Observe that for all one has , which implies .
Lemma 1**.**
If a linear code with minimum distance is generated by rows of with indices from , then
[TABLE]
Proof.
Consider the minimum-weight codeword , . Let be the first position of non-zero element in . Thus, , , , which implies and . ∎
This bound is valid for any linear block code represented in the form of (1). However, the evaluation of is not a simple problem for an arbitrary .
III-D Recoverable and erased vectors
Consider transmission of a codeword of a code with frozen set , and dimension over BEC.
The set of erased positions is called an erasure configuration. When erasure configuration occurs, the values are available for the receiver, where is submatrix of without rows from and without columns from , . Denote by the set of all such that . One can see that
[TABLE]
where for set of vectors , by we denote the set of vectors x_{0}^{t-1}:\forall y_{0}^{t-1}\in\mathcal{A}:x_{0}^{t-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}y_{0}^{t-1}=0, and is the column space of matrix . The value can be unambiguously recovered by the receiver after erasure configuration iff , i.e. .
More generally, consider the recoverability of the value of a linear combination p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{\varphi}^{n-1} after erasure configuration . The set of values of p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}\hat{u}_{\varphi}^{n-1} for all is given by
[TABLE]
We say that vector is -recoverable, if the corresponding linear combination p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{\varphi}^{n-1} can be recovered unambiguously for given , i.e., the set (12) contains only the correct value p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{\varphi}^{n-1}. Expanding the brackets in (12), one can see that is -recoverable iff \forall a_{0}^{k-1}\in\operatorname{cs}^{\perp}(\hat{G}):p_{0}^{k-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}a_{0}^{k-1}=0, which leads to . Thus, the set of -recoverable vectors is a linear space, which is equal to .
Definition 2**.**
Let be the space of all , such that is -recoverable. In this case, is called a -space and is denoted by , and is called an -configuration. The set of -configurations is denoted by . Thus,
[TABLE]
If is a set, denote by the set of all subsets of . Thus, function , maps an erasure configuration, which is a subset of , to a linear subspace of , and returns the inverse image of . Note that is not injective, so .
In words, defines the set of vectors , for which the value of linear combination p_{0}^{j-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{\varphi}^{\varphi+j-1} can be recovered after erasure configuration , provided that . Conversely, defines the set of erasure configurations, after which the linear combination p_{0}^{j-1}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{\varphi}^{\varphi+j-1} can be deduced by the receiver if and only if .
Remark 2**.**
Let , i.e. for some . In this case, the conditional part of definition (13) is inconsistent. We extend the definition as follows. In Remark 1 we assume that symbols for are equal to zero. Hence, these symbols are always perfectly known for the receiver, so any does not erase any symbol . Observe that any vector from must be not -recoverable, so for any and , we must include vector in the set . This leads to
[TABLE]
Similarly, we assume that for all which do not contain for some .
Example 2**.**
Consider -configurations for the case of , . For erasure configuration , the only non-zero vector which is -recoverable is . That is, if symbol is erased, one can recover unambiguously only . This means that . All -configurations are
[TABLE]
That is, there are no erasure configurations, such that only (i.e. symbol ) is unambiguously recoverable, and the whole vector can be unambiguously recovered only if there are no erasures. For the same case, the -spaces are
[TABLE]
Example 3**.**
Consider the case of , , and . Since implies , one has , and one can restore by or . Thus, .
III-E Coset minimum weight and erasure configurations
For a subspace , we denote the minimal cardinality of -configuration as
[TABLE]
assuming that the minimum over the empty set is .
Theorem 1**.**
Let and . For any ,
[TABLE]
Proof.
Denote \mathcal{A}=\left\{\operatorname{supp}(c)\big{|}c\in\mathcal{C}^{(\varphi)}_{n}(p)\right\},
[TABLE]
Then the theorem can be reformulated as
If , then there exists , such that p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{\varphi}^{\varphi+j-1}=1 and for . In this case and the all-zero value also belongs to set (11) of possible values of for the given , but p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}\hat{u}_{\varphi}^{\varphi+j-1}=0. Thus, the value of p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{\varphi}^{\varphi+j-1} is not recoverable after erasure configuration , which implies . So, and .
If , then , which by Definition 2 implies and \exists a_{0}^{k-1}\in\operatorname{cs}^{\perp}(\hat{G}):(p,\mathbf{0}^{k-j})~{}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}~{}a_{0}^{k-1}=~{}1, which implies p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}a_{0}^{j-1}=1. Denote . Since p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}a_{0}^{j-1}=1, by Definition 1 one has , and therefore . On the other hand, , which means . So, , hence, . ∎
Corollary 1**.**
For any
[TABLE]
IV Bound on Minimum Distance of Convolutional Polar Codes
The structure of the convolutional polarizing transformation , , enables one to compute easily , defined in (16), for . By computing values of , one can obtain values of by Corollary 1 and lower bound on minimum distance by Lemma 1.
Consider transmission of , such that , through BEC and let the erasure configuration be . The intuition behind recursive computing of is as follows.
Consider the case of . Denote , , , . Recall that is the set of all , such that the value of p_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{2\psi+1}^{2\psi+3} can be deduced from after erasure configuration . Similarly, and are the sets of and , s.t. q_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}x_{\psi}^{\psi+2} and r_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}z_{\psi}^{\psi+2} are recoverable from and after erasure configurations and , under assumption and , respectively. By (4)–(5) one obtains and for , which, together with , implies , so the above assumption holds. Furthermore, since was processed by the -th layer of CvPT before the transmission, the value of elements of , as well as the value of any linear combination p_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{2\psi+1}^{2\psi+3}, can be deduced only from known linear combinations of elements of and . However, for any , and , one can find , such that as follows: set to for , set to , and set to for . So, for any , even complete knowledge of and does not provide the value p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{2\psi+1}^{2\psi+3}. Thus, recoverable linear combinations q_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}x_{\psi}^{\psi+2} and r_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}z_{\psi}^{\psi+2} contain all information about recoverable linear combinations p_{0}^{2}\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{2\psi+1}^{2\psi+3}, and therefore can be uniquely deduced from given and . The similar consideration for leads to the fact that can also be deduced from and .
Let , . For any , consider -erasure configuration for which the minimum in (16) is achieved, i.e. and . Obviously, . Let , . Then, and are also the minimum-weight - and - erasure configurations, respectively, i.e. , and . We know that can be deduced from and , i.e., for each and there is a function , which returns for given and , and for considered minimum-weight , , one can obtain .
It appears that if , i.e., there are only two different functions : one for odd and another one for even . They are defined as , such that
[TABLE]
The above consideration form the following theorem.
Theorem 2**.**
Denote for , . Then, for a CvPT , for
[TABLE]
The base of the recursion is
[TABLE]
Remark 3**.**
Note that formulae (19)–(20) include the cases of and . They can be obtained according to the assumption in Remark 2 as follows. For , denote the set of tails of length by s|_{i}=\left\{p_{0}^{i-1}\;\big{|}\;p_{0}^{i+h-1}\in s\right\}. We assume that any erasure configuration does not erase for any , i.e.
[TABLE]
The same assumption is applied for computing the values of from the values for that are given by the base (21) of the recursion. This assumption, though not natural since symbols , do not exist, allows one to employ the unified formulae (19)–(20) for the cases of .
Remark 4**.**
Formula (20) in the case of leads to computing , which is formally equal, for a given , to the minimum weight of an erasure configuration which erases values p\mathchoice{\mathbin{\vbox{\hbox{\scalebox{0.5}{\displaystyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\textstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptstyle\bullet}}}}}{\mathbin{\vbox{\hbox{\scalebox{0.5}{\scriptscriptstyle\bullet}}}}}u_{-1}^{2} for and only for . For the symbols , , we do not employ the same assumption as in Remark 3. If one assumes that symbols with negative indices are always known and employs functions and , one would obtain that input symbols on the current layer of convolutional polarizing transformation , , and input symbol on the next layer are always known, which implies that is always known. This would result in incorrect value of . Thus, we assume that for are always erased, which leads to
[TABLE]
Proof.
The proof is in the Appendix. ∎
The values for the case of CvPC can be computed with Algorithm 1. The three-dimensional array of subspaces of is initialized in lines 1–1, such that and . The values are computed in lines 1–1. Function M1Cluster, presented in Algorithm 2, is called to obtain for and , respectively, in lines 1 and 1.
The values of for are computed by Theorem 2 in lines 1–1 and stored in array , using values of for , which are stored in array . The values are obtained as .
The asymptotic complexity of the Algorithm 1 is defined by the complexity of the main loop 1–1. The complexity of the -th iteration of the loop is defined by the complexity of the loop in lines 1–1, which consists of iterations, each of them has complexity . Thus, the overall asymptotic complexity is .
In Fig. 2 the lower bound on minimum distance, computed by (10), for CvPCs of lengths , , is presented. The codes are obtained via the Monte-Carlo method by minimization of the needed to achieve the SC decoding error probability . For comparison, we also report the results for Arikan polar codes, which are optimized in the same way. One can see that CvPCs can have lower, equal or higher minimum distance, compared to Arikan polar codes.
Unlike the case of Arikan polarizing transformation , the weight of the -th row of CvPT is not necessarily equal to . Thus, the bound (10) is not exact at least for codes with . In general, it is not known, for which cases the bound is exact. However, by employing the low-weight codeword search algorithm presented in [8], we verified that the bound is exact for CvPCs with , rates and target FER of SC decoding , , , , and .
V Convolutional Polar Subcodes
In general, the SC decoding algorithm for polar-like codes does not provide ML decoding. The Tal-Vardy list decoding algorithm [11] for polar codes can be immediately extended to the case of CvPC using the techniques presented in [4]. With sufficiently large list size the SCL algorithm delivers near-ML decoding. The SCL decoding error probability of convolutional polar codes is lower than that of classical polar codes, but still can be improved by extending the construction of randomized polar subcodes [10] to the case of convolutional polarizing transformation.
By Lemma 1, any codeword of weight corresponds to vector with at least one symbol , such that . In the case of polar codes, is equal to the weight of the -th row of . In the case of CvPCs one can obtain by Algorithm 1.
A code construction, which has low SCL decoding error probability, was proposed in [10] for the case of classical polar codes as polar subcodes. Polar subcodes are obtained as a generalization of polar codes, where some symbols , called dynamic frozen symbols, are not set to zero, but to linear combinations of previous symbols . This approach can be immediately extended to the case of convolutional polarizing transformation. Namely, the dynamic freezing constraints should be constructed, so that they involve all non-frozen symbols with the smallest , but the indices of dynamic frozen symbols should be as small as possible, so that the SCL decoding algorithm can process these constraints at the earliest possible phases, minimizing thus the probability of a correct path being killed.
This results in the following code construction algorithm:
Construct convolutional polar code, i.e. assign for the static frozen set of worst bit subchannels. 2. 2.
Choose dynamic frozen set as the set of indices of minimum-weight bit subchannels with the largest indices, that are not static frozen. Set
[TABLE]
where the frozen set consists of indices of static frozen or dynamic frozen symbols, and are distributed uniformly over .
The set for a convolutional polar code optimized for SC decoding can be chosen either by evolution of erasure probabilities proposed in [2], or by Monte-Carlo simulations of genie-aided SC decoder. Due to lack of analysis techniques for the list SC decoding algorithm, the optimal value of should be determined by simulations.
Another component of the construction introduced in [10] is type-B dynamic freezing constraints, which are imposed on the symbols transmitted over the least reliable yet unfrozen subchannels. These constraints speed up error propagation for incorrect paths in the list SC algorithm, so that the probabilities (2) of these paths decrease quickly, reducing thus the probability of a correct path being killed. However, simulations of moderate-length CvPS show that type-B dynamic frozen symbols do not provide any noticeable gain in the case of CvPS.
VI Performance of Convolutional Polar Subcodes
In Fig. 3 the performance of CvPS, polar code and polar subcode is presented for for the case of AWGN channel. The polar code and the polar subcode are constructed for AWGN channel with dB using Gaussian approximation of density evolution [12], and the CvPS is constructed for the same channel using Monte-Carlo simulations for subchannels qualities. One can see that the CvPS outperforms randomized polar subcodes [10], CvPC [2] and CvPC concatenated with CRC-10.
In Fig. 4 the performance of a CvPS with type-A dynamic frozen symbols is presented. Transmission of BPSK-modulated symbols over AWGN channel with dB is considered. The decoding algorithm is the SCL decoding with different values of list size that are shown in the x-axis. The performance of CvPSs is compared to that of a polar subcode with type-A dynamic frozen symbols and type-B dynamic frozen symbols. One can see that the CvPS under SCL decoding with the same list size outperforms classical polar subcodes. The smaller list size can be used to achieve the same FER, which allows less sophisticated hardware implementation.
In Fig. 5 the complexity (the number of operations) of SCL decoding, based on the expressions derived in [4], of the described above codes is compared for list size for the CvPS and for the polar subcode. The complexity is obtained as the number of additions and comparisons of LLRs. The complexity of SC decoding for CvPS is approximately , as shown in [4]. The complexity of SC decoding of polar codes is . However, as was shown in [2], CvPT induces stronger polarization than Arikan polarizing transformation, so the smaller list size is needed to achieve the same FER. This leads to the smaller complexity needed to achieve FER less than in the case of CvPS, because achieving this FER requires list size for polar subcodes and only for CvPS. Furthermore, for a large list size the SCL decoding is near-ML, and for sufficiently good channel FER of ML-decoding is mainly defined by the minimum distance and the error coefficient. Dynamic frozen symbols decrease the error coefficient and may even increase the minimum distance of a CvPS. In Fig. 2 one can see that the minimum distance of CvPS is higher than that of CvPC.
VII Conclusions
In this paper a tight lower bound on minimum distance of convolutional polar codes is provided. Furthermore, a generalization of the randomized construction of polar subcodes to the case of convolutional polarizing transformation is proposed. Simulations show that the proposed code construction has lower frame error rate under SCL decoding [4] compared to polar subcodes with the same list size. The complexity for achieving the same FER with convolutional polar subcodes can be lower than in the case of polar subcodes [10] based on Arikan polarizing transformation.
Proof of Theorem 2. For erasure configuration , denote and . We now consider the case of and prove (19).
Note that implies . By (3) one obtains
[TABLE]
where , , , , . By (13), iff there exists :
[TABLE]
where , , which implies, in particular,
[TABLE]
Denote , . Thus, , . Then (22) implies , so from (4)–(5) one obtains
[TABLE]
which leads to the system of equations
[TABLE]
It is easy to see that (23) implies for . Let . By above consideration, for any one has iff there exists , s.t. , , and
[TABLE]
Note that two last elements of vector in the left-hand side equals [math], and two last rows in the right hand size of (24) are identical, so last rows of these matrices can be removed. The resulting matrices are equal to and , respectively. Recalling (13), one obtains that consists of all , for which there exist , :
[TABLE]
Observe that (25) is equivalent to the equation in the right-hand side of (17). Obviously, and the minimal cardinality of -configuration for each can be found exactly as it is stated in (19).
Equality (20) can be proved similarly.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. J. Ferris and D. Poulin, “Branching MERA codes: a natural extension of polar codes,” Co RR , vol. abs/1312.4575, 2013. [Online]. Available: http://arxiv.org/abs/1312.4575
- 2[2] A. J. Ferris, C. Hirche, and D. Poulin, “Convolutional polar codes,” Co RR , vol. abs/1704.00715, 2017. [Online]. Available: http://arxiv.org/abs/1704.00715
- 3[3] E. Arikan, “Channel polarization: A method for constructing capacity-achieving codes for symmetric binary-input memoryless channels,” IEEE Transactions on Information Theory , vol. 55, no. 7, pp. 3051–3073, July 2009.
- 4[4] R. Morozov and P. Trifonov, “Efficient SC decoding of convolutional polar codes,” in 2018 International Symposium on Information Theory and its Applications (ISITA 2018) , Singapore, Singapore, Oct. 2018.
- 5[5] H. Saber, Y. Ge, R. Zhang, W. Shi, and W. Tong, “Convolutional polar codes: LLR-based successive cancellation decoder and list decoding performance,” in 2018 IEEE International Symposium on Information Theory (ISIT) , June 2018, pp. 1480–1484.
- 6[6] T. Prinz and P. Yuan, “Successive cancellation list decoding of BMERA codes with application to higher-order modulation,” in 2018 International Symposium on Turbo Codes and Iterative Information Processing (ITW) , December 2018.
- 7[7] N. Hussami, S. B. Korada, and R. Urbanke, “Performance of polar codes for channel and source coding,” in Proceedings of IEEE International Symposium on Information Theory , 2009, pp. 1488–1492.
- 8[8] A. Canteaut and F. Chabaud, “A new algorithm for finding minimum-weight words in a linear code: Application to Mc Eliece’s cryptosystem and to narrow-sense BCH codes of length 511 511 511 ,” IEEE Transactions on Information Theory , vol. 44, no. 1, pp. 367–378, January 1998.
