Polar Coding for the Binary Erasure Channel with Deletions
Eldho K. Thomas, Vincent Y. F. Tan, Alexander Vardy, Mehul, Motani

TL;DR
This paper explores the use of polar codes for binary erasure channels with deletions, proposing a list decoding algorithm with redundancy optimization, achieving high probability message recovery with manageable complexity.
Contribution
It introduces a polar coding scheme for deletion channels, including a list decoding algorithm with redundancy optimization and complexity analysis.
Findings
Decoding complexity is $O(N^2\log N)$.
High probability of message recovery as code length increases.
List size can be reduced to one in simulations.
Abstract
We study the application of polar codes in deletion channels by analyzing the cascade of a binary erasure channel (BEC) and a deletion channel. We show how polar codes can be used effectively on a BEC with a single deletion, and propose a list decoding algorithm with a cyclic redundancy check for this case. The decoding complexity is , where is the blocklength of the code. An important contribution is an optimization of the amount of redundancy added to minimize the overall error probability. Our theoretical results are corroborated by numerical simulations which show that the list size can be reduced to one and the original message can be recovered with high probability as the length of the code grows.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsError Correcting Code Techniques · DNA and Biological Computing · Advanced biosensing and bioanalysis techniques
Polar Coding for the Binary Erasure Channel with Deletions
Eldho K. Thomas, Vincent Y. F. Tan, Senior Member, IEEE, Alexander Vardy, Fellow, IEEE, and
Mehul Motani, Senior Member, IEEE E. K. Thomas is with the Institute of Computer Science, University of Tartu, Estonia, 51014 (email: [email protected]). V. Y. F. Tan and M. Motani are with the Department of Electrical & Computer Engineering, National University of Singapore, Singapore 117583 (emails: [email protected], [email protected]). A. Vardy is with Department of Electrical & Computer Engineering, University of California San Diego, La Jolla, CA 92093, USA and the School of Physical & Mathematical Sciences, Nanyang Technological University, Singapore 637371 (email: [email protected]). This work is partially funded by a Singapore Ministry of Education (MoE) Tier 2 grant (R-263-000-B61-112).
Abstract
We study the application of polar codes in deletion channels by analyzing the cascade of a binary erasure channel (BEC) and a deletion channel. We show how polar codes can be used effectively on a BEC with a single deletion, and propose a list decoding algorithm with a cyclic redundancy check for this case. The decoding complexity is , where is the blocklength of the code. An important contribution is an optimization of the amount of redundancy added to minimize the overall error probability. Our theoretical results are corroborated by numerical simulations which show that the list size can be reduced to one and the original message can be recovered with high probability as the length of the code grows.
Index Terms:
Polar codes, deletions, binary erasure channel, cascade, list decoding, cyclic redundancy check, candidate set
I Introduction
Polar codes, invented by Arıkan [1], are the first provably capacity-achieving codes with low encoding and decoding complexity. Arıkan’s presentation of polar codes includes a successive cancellation decoding algorithm, which generally does not perform as well as the state-of-the-art error-correcting codes at finite block lengths [2]. To improve the performance of polar codes, Tal and Vardy [3] devised a list decoding algorithm. The initial work of Arıkan considers binary symmetric memoryless channels. There have been attempts to study polar codes for other channels, e.g., the AWGN channel [4]. However, there are not many constructions of polar codes for channels with memory. See [5] and references therein.
The deletion channel is a canonical example of a non-stationary, non-ergodic channel with memory. It deletes symbols arbitrarily and the positions of the deletions are unknown to the receiver. A survey by Mitzenmacher [6] discusses the major developments in the understanding of deletion channels in greater detail. To date, the Shannon capacity of deletion channels, in general, remains unknown. However, there have been attempts to find upper and lower bounds on the capacity of deletion channels [7, 8].
Our motivation is partly the work of Dolecek and Anantharam [9], in which the run length properties of Reed-Muller (RM) codes were exploited to correct a certain number of substitutions together with a single deletion; our work involves correcting erasures rather than substitiutions. RM codes and polar codes have similar algebraic structures and therefore polar codes are also potential candidates for correcting single deletions. However, they cannot be used directly on deletion channels since the polarization of a channel with memory has not been well-studied. Developing polarization techniques for deletion channels is beyond the scope of this study. Instead, motivated by decoders that are possibly defective and delete symbols arbitrarily, we consider polar codes over a binary erasure channel (BEC) and an adversarial version of the deletion channel with one deletion, and provide a list decoding algorithm to successfully recover the original message with high probability111In this letter, we use the term w.h.p. to mean with probability tending to as the blocklength of the code tends to infinity. (w.h.p.). Unlike RM codes, polar codes do not have rich run length properties. Instead, we use the successive cancellation algorithm [1] for decoding. In addition, we provide a detailed analysis of the error probability, which was lacking in [9]. Channel cascades were studied previously in [10] but our model has not been previously considered in the literature. We argue that the capacity of the cascade can be achieved; in constrast, [9] does not discuss capacity issues.
II Preliminaries
II-A Polar Codes
We consider polar codes of length constructed recursively from the kernel . Given an information vector (message) where , a codeword is generated using the relation where is the -th Kronecker product of and is a bit-reversal permutation matrix, defined explicitly in [1]. The vector is transmitted through independent copies of a binary discrete memoryless channel (BDMC) with transition probabilities and capacity . As grows, the individual channels start polarizing. That is, a subset of the channels tend to noise-free channels and others tend to completely noisy channels. The fraction of noise-free channels tends to the capacity . The polarization behavior suggests using the noise-free channels to transmit information bits, while setting the inputs to the noisy channels to values that are known a priori to the decoder (i.e., the frozen bits). That is, a message vector consists of information bits and frozen bits (often set to zero) where of size is the information set and is the set of frozen bits. This scheme achieves capacity [1]. Denote the channel output by and the -th synthesized subchannel with input and output by for . The transition probability matrix is defined as
[TABLE]
where and is the codeword corresponding to the message . The encoding complexity of polar coding is [1].
II-B Successive Cancellation Decoding
Arıkan [1] proposed a successive cancellation (SC) decoding scheme for polar codes. Given and the estimates of , the SC algorithm estimates . The following logarithmic likelihood ratios (LLR) are used to estimate each for :
[TABLE]
The estimate of an unfrozen bit is determined by the signs of the LLRs, i.e., if and otherwise. It is known that polar codes with SC decoding achieve capacity with decoding complexity of [1].
II-C Adversarial Deletion Channel
We suppose that bits are sent over a channel and exactly bits are deleted. We call this a -deletion channel. That is, for bits sent, the decoder only receives bits after deletions and the positions of deletions are not known to the receiver. Note that this is not the probabilistic deletion channel in which each symbol is independently deleted with some fixed probability [8].
III Problem Setting and Model
Consider the 1-deletion channel ( in the definition in Section II-C), where exactly one bit is deleted. We suppose that where . A message vector is encoded using the polar encoder and is sent across uses of a BEC , each with erasure probability . The output vector is passed through a 1-deletion channel . We denote this cascade of and as and call this a BEC-1-Deletion Cascade. This model is shown in Fig. 1. The output of is denoted as . Note that permits erasures and a single deletion. That is, a message is sent across and a vector is received. A decoder is designed in such a way that w.h.p., a list (of linear size in ) containing an estimate of the original message is returned.
IV Coding for the BEC-1-Deletion Cascade
IV-A Reconstruction of the BEC Output
A message is sent over a BEC-1-Deletion cascade using a polar encoder described in Section II-A and is received. In order to decode , we use the SC algorithm (refer to Section II-B). Since the position of the deletion is unknown, we first identify a set of vectors, called the candidate set, which contains as a sub-sequence. A naïve algorithm to construct the candidate set would be to insert in the locations before and after each symbol of . We then apply the SC algorithm to each vector in the candidate set.
For example, suppose and the received vector is . Then the following set includes all vectors which contain the subsequence :
[TABLE]
The size of this set can be further reduced if we notice that inserting at positions is enough to identify all possible messages those can output after a single deletion. This is because of the following: Suppose the -th symbol is deleted from . Instead of inserting 0 or 1 at position , we insert an erasure symbol . Since a polar code correcting (where ) erasures also corrects erasures w.h.p., under the SC decoding algorithm, this new length- vector decodes to the correct message w.h.p. no matter which symbol was at position . We state this observation formally:
Proposition 1**.**
Suppose is sent over a BEC-1-Deletion cascade . (See Fig. 1.) The size of the candidate set (constructed above) is where is the number of erasures present in the received string .
Proof:
The candidate set is
[TABLE]
where is the received string. Suppose that the -th symbol of is . Inserting another before the -th symbol forms vector . This vector repeats if we insert again after the the -th symbol . Therefore, considering non-erasure bits of and inserting exactly one erasure symbol at positions before and after these non-erasure bits produces unique vectors in the candidate set . Since the number of erasure symbols is , the total number of vectors in is . ∎
We remark that as , by the law of large numbers and hence where is the erasure probability of the BEC.
IV-B List Decoding
After the construction of the set , the problem reduces to the decoding of each vector in using the SC algorithm. Since , we get a list of messages of size at most at the end of the whole decoding procedure.
Let denote the SC decoding of , and define
[TABLE]
as the list of messages returned by the set where is the information set.
Since we insert the erasure symbol at each of the possible positions (including the deleted position), the original message sent belongs to w.h.p. Arıkan [1] proved that the probability of error vanishes asymptotically for polar codes over any BDMC. A more precise estimate was provided by Arıkan and Telatar [11] who showed that for any , for sufficiently large block lengths . Therefore, under SC decoding, vectors in return all possible messages that can produce the string under a single (adversarial) deletion.
IV-C Recovering the Correct Message from the List via Cyclic Redundancy Check (CRC)
Naturally, there can be multiple that belong to the list and it may not be easy to single out the original message. However, by applying a simple pre-coding technique using an -bit CRC (or a code having an parity check matrix) [12, 3], the original message can be detected from the list, albeit with some additional probability of error. We describe how to recover the correct message w.h.p. here.
Recall that we have frozen bits that we usually set to zero. Instead of setting all of them to zero, we set frozen bits to zero, where is a small number we optimize in Section IV-D. These bits will contain the -bit CRC value of the unfrozen bits (or simply the parity bits). To generate a -bit CRC, we select a polynomial of degree , called a CRC polynomial, having coefficients. We then divide the message (by treating it as a binary polynomial) by this CRC polynomial to generate a remainder of degree at most , with total number of coefficients . We append these coefficients at the end of the -bit message to generate a -bit vector. To verify that the correct message is received, we perform the polynomial division again to check if the remainder is zero. For more details on the choice of CRC polynomials, please refer to [13]. We send these bits across the cascade. This new encoding is a slight variation the original polar coding scheme [1]. Also, note that the original information rate is preserved. However, the rate of the polar code is slightly increased to .
To summarize, we encode the message of length into a length vector having redundancy where . Then we apply the polar coding scheme for the codebook . This will result in a polar code of length and size where only the subset carries information that we wish to transmit. The codeword corresponding to the original message is then passed through the BEC-1-Deletion channel and outputs a vector . After constructing the set by inserting at each possible positions, we apply the SC algorithm on . However, not all of these resulting vectors in carry information. We can check this using the initial -bit CRC (or the parity check matrix). All vectors which fail under the CRC check are removed and we then select the message with the maximum likelihood from the list.
IV-D Analysis and Optimization of the Overall Error Probability
Suppose denotes the parity check matrix with rows that is being used for adding parity to the bit message. Then the set of messages that carries any information can be identified as
[TABLE]
where is the modified version of (1) according to the new polar coding scheme defined as
[TABLE]
and where is the set of parity bits ( is the set of frozen bits). If the rows of are chosen uniformly and independently from , the probability that a vector is in is
[TABLE]
where . That is, a message in is wrongly identified as the original message with probability . However, the true message sent satisfies the parity-check condition . Therefore, by the union bound, the total probability that an incorrect message is returned is upper bounded as
[TABLE]
where is the probability of error of the SC decoding algorithm and for a single deletion.
To maintain that (that is, as the block length grows, converges to ) and the upper bound on in (2) is minimized, we have to choose carefully.
For a single deletion, the size of the candidate set and hence w.h.p. From Hassani et al. [14], the rate-dependent error probability of the polar code for the BEC with rate is
[TABLE]
where , is the complementary Gaussian cumulative distribution function, and is the capacity of the channel cascade.
From (2),
[TABLE]
It can be verified easily that the first term in the square parentheses in (3) is decreasing and the second term with is increasing in . To optimize the upper bound in (3), we set the exponents of two terms to be equal (neglecting the insignificant term), i.e.,
[TABLE]
where we used the fact that .
Now we find an expression for in terms of the backoff from capacity. To transmit the code at a rate close to the capacity, for a small constant , assume that where since a polar code over the BEC 1-deletion cascade achieves the capacity of the BEC; this is a simple consequence of [15, Problem 3.14] and the fact that the list size is polynomial. Then the rate for large enough. Therefore,
[TABLE]
Let . Since , . Then and hence . Since decays as as , . Then . Therefore, the optimal value of the number of parity bits is
[TABLE]
This is a rate-dependent choice of (through ) that simultaneously ensures that and the upper bound on in (2) is minimized.
IV-E Finite Number of Deletions
Now consider the cascade of a BEC and a -deletion channel where is finite. This model can be analyzed using the same techniques presented here. The only difference is the size of the candidate set . By using the same arguments as in the -deletion case, we construct by inserting erasure symbols at positions and . Therefore, the list size . Since the models are similar, a CRC construction and error probability analysis for the BEC--Deletion cascade similar to that presented in Sections IV-C and IV-D respectively can be performed. In addition, we see that even if the list size is d=o\big{(}\frac{N}{\log N}\big{)}, the capacity of the BEC is achieved because is still subexponential.
IV-F Complexity of the Decoding Algorithm
The encoding complexity of the BEC-1-Deletion cascade is same as that for standard polar codes, i.e., . However, the SC decoding algorithm has to be applied to all vectors in the candidate set of size (cf. Prop. 1). Thus, the complexity of the decoding algorithm of the BEC-1-Deletion cascade is and that for the BEC--Deletion cascade is . Although the complexity of the decoding algorithm increases by for each additional deletion, it can still be performed in polynomial time.
V Simulation Results
In this section, we demonstrate the utility of the proposed algorithm by performing numerical simulations. The simulations are carried out in Matlab using code provided in [16] with the following parameters.222The Matlab code to reproduce the simulations is provided at https://www.ece.nus.edu.sg/stfpage/vtan/commL_code.zip. Let vary from to . The erasure probability of the BEC is . Thus, the capacity of the cascade is . We consider three different code rates: and . We fix and the -bit CRC polynomial is chosen according to [13]. The error probability is computed by averaging over independent runs.
We encode a random length- message using a -bit CRC polynomial so that the input of the encoder is a length input vector and the output is an -bit vector. This vector is then transmitted through a BEC-1-deletion cascade and received a length- vector. The CRC list decoder then computes a list of possible messages given the channel output. Fig. 2 shows that, with a suitable choice of the number of CRC bits and CRC polynomials, as grows, the list is of size and contains only the original message w.h.p.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] E. Arıkan, “Channel polarization: A method for constructing capacity-achieving codes for symmetric binary-input memoryless channels”, IEEE Trans. Inform. Theory , vol. 55, no. 7, pp. 3051-3073, Jul 2009.
- 2[2] S. H. Hassani, K. Alishahi and R. L. Urbanke, “Finite-Length scaling for polar codes”, IEEE Trans. Inform. Theory , vol. 60, no. 10, pp. 5875-5898, Oct 2014.
- 3[3] I. Tal and A. Vardy, “List decoding of polar codes”, IEEE Trans. Inform. Theory , vol. 61, no. 5, pp. 2213-2226, May 2015.
- 4[4] E. Abbe and A. Barron, “Polar coding schemes for the AWGN channel”, Proceedings of the ISIT , 2011, pp. 194-198.
- 5[5] R. Wang, J. Honda, H. Yamamoto and R. Liu, “Construction of polar codes for channels with memory”, Proceedings of the Fall ITW , Jeju Island, South Korea, 2015, pp. 187-191.
- 6[6] M. Mitzenmacher, “A survey of results for deletion channels and related synchronization channels”, Probability Surveys , Vol. 6, pp 1-33, 2009.
- 7[7] R. Venkataramanan, S. Tatikonda, and K. Ramchandran, “Achievable rates for channels with deletions and insertions”, IEEE Trans. Inform. Theory , vol. 59, no. 11, pp. 6990-7013, Nov 2013.
- 8[8] S. Diggavi, M. Mitzenmacher and H. D. Pfister, “Capacity upper bounds for the deletion channel”, Proceedings of the ISIT , 2007, pp. 1716-1720.
