Asymptotic Average Multiplicity of Structures within Different Categories of Trapping Sets, Absorbing Sets and Stopping Sets in Random Regular and Irregular LDPC Code Ensembles
Ali Dehghan, Amir H. Banihashemi

TL;DR
This paper analyzes the asymptotic average counts of various trapping set structures in random LDPC code ensembles, revealing their growth behavior and providing cycle-based estimates that are accurate even for finite-length codes.
Contribution
It characterizes the asymptotic behavior of different trapping set types in LDPC ensembles and offers cycle-based estimates for structures with a single cycle.
Findings
Average number of structures tends to infinity, a positive constant, or zero depending on cycle count.
Cycle-based estimates are accurate for finite-length codes.
Growth behavior depends on the structure's cycle content.
Abstract
The performance of low-density parity-check (LDPC) codes in the error floor region is closely related to some combinatorial structures of the code's Tanner graph, collectively referred to as {\it trapping sets (TSs)}. In this paper, we study the asymptotic average number of different types of trapping sets such as {\em elementary TSs (ETS)}, {\em leafless ETSs (LETS)}, {\em absorbing sets (ABS)}, {\em elementary ABSs (EABS)}, and {\em stopping sets (SS)}, in random variable-regular and irregular LDPC code ensembles. We demonstrate that, regardless of the type of the TS, as the code's length tends to infinity, the average number of a given structure tends to infinity, to a positive constant, or to zero, if the structure contains no cycle, only one cycle, or more than one cycle, respectively. For the case where the structure contains a single cycle, we obtain an estimate of the expected…
| TS Type | |||||
|---|---|---|---|---|---|
| ETSs | 0 | ||||
| LETSs | 0 | – | |||
| EASs | 0 | – | – | ||
| SSs | 0 | – | – | ||
| 1 | 2 | 7 | 30 | |
| 1 | 3 | 15 | 91 |
| Code | |||||||||
| 816 | 1008 | 4000 | 20000 | 50000 | 4000 | 8000 | 10000 | 20000 | |
| 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | |
| 3 | 3 | 3 | 3 | 3 | 4 | 4 | 4 | 4 |
| Code | Expected value | |||||
| (3,3) | 132 | 165 | 171 | 161 | 178 | 166.6 |
| (4,4) | 1491 | 1252 | 1219 | 1260 | 1268 | 1250 |
| (5,5) | 9169 | 10019 | 9935 | 10046 | 10231 | 10000 |
| (4,2) | 3 | 6 | 1 | 0 | 0 | 0 |
| (5,3) | 90 | 100 | 21 | 2 | 1 | 0 |
| (6,2) | 2 | 5 | 0 | 0 | 0 | 0 |
| (6,4) | 2463 | 1885 | 476 | 95 | 52 | 0 |
| (7,3) | 110 | 116 | 10 | 0 | 1 | 0 |
| (7,5) | 33406 | 29736 | 7661 | 1540 | 640 | 0 |
| (8,2) | 1 | 3 | 0 | 0 | 0 | 0 |
| (8,4) | 4199 | 2961 | 183 | 4 | 1 | 0 |
| (9,3) | 195 | 169 | 4 | 0 | 0 | 0 |
| (9,5) | 84378 | 63787 | 4001 | 167 | 21 | 0 |
| (10,2) | 15 | 4 | 0 | 0 | 0 | 0 |
| (10,4) | 7965 | 4869 | 74 | 1 | 0 | 0 |
| (11,3) | 290 | 219 | 1 | 0 | 0 | 0 |
| (11,5) | 211273 | 134236 | 2278 | 8 | 2 | 0 |
| (12,2) | 15 | 6 | 0 | 0 | 0 | 0 |
| (12,4) | 17838 | 9041 | 36 | 0 | 0 | 0 |
| Code | Expected value | ||||
| (3,6) | 1563 | 1620 | 1598 | 1531 | 1543 |
| (4,8) | 24269 | 24107 | 24241 | 24368 | 24310 |
| (5,10) | 402513 | 406289 | 406754 | 407743 | 408410 |
| (4,6) | 91 | 55 | 33 | 17 | 0 |
| (5,6) | 2 | 2 | 1 | 0 | 0 |
| (5,8) | 4640 | 2303 | 1910 | 934 | 0 |
| (6,8) | 588 | 196 | 107 | 22 | 0 |
| (6,10) | 185544 | 96525 | 76621 | 37075 | 0 |
| (7,8) | 53 | 10 | 7 | 0 | 0 |
| (7,10) | 39794 | 10698 | 6649 | 1682 | 0 |
| (8,8) | 5 | 0 | 0 | 0 | 0 |
| (8,10) | 7006 | 1116 | 567 | 60 | 0 |
| (9,8) | 1 | 0 | 0 | 0 | 0 |
| (9,10) | 1145 | 101 | 53 | 0 | 0 |
| (10,10) | 185 | 13 | 4 | 0 | 0 |
| Multiplicity of LETSs | Block Length | Expected values | ||||
| for | 500 | 1000 | 4000 | 10000 | 20000 | from Equation (58) |
| 26 | 22 | 18 | 22 | 31 | 22 | |
| 145 | 146 | 127 | 148 | 123 | 136 | |
| 312 | 254 | 281 | 269 | 257 | 272 | |
| 168 | 176 | 191 | 165 | 189 | 181 | |
| LETS classes | Block Length | Expected values | ||||
|---|---|---|---|---|---|---|
| for | 500 | 1000 | 4000 | 10000 | 20000 | from Equation (58) |
| 5 | 4 | 0 | 1 | 0 | 0 | |
| 130 | 102 | 72 | 95 | 93 | 87 | |
| 841 | 726 | 735 | 731 | 713 | 699 | |
| 2240 | 2122 | 2122 | 2075 | 2090 | 2098 | |
| 2884 | 2830 | 2743 | 2741 | 2871 | 2797 | |
| 1273 | 1389 | 1384 | 1412 | 1402 | 1398 | |
| LETS classes | Block Length | Expected values | ||||
|---|---|---|---|---|---|---|
| for | 500 | 1000 | 4000 | 10000 | 20000 | from Equation (58) |
| 12 | 2 | 1 | 0 | 0 | 0 | |
| 143 | 58 | 11 | 6 | 1 | 0 | |
| 1055 | 654 | 436 | 420 | 374 | 359 | |
| 5549 | 4471 | 3649 | 3638 | 3657 | 3598 | |
| 16731 | 15904 | 14657 | 14480 | 14412 | 14392 | |
| 29850 | 29089 | 29248 | 28661 | 28693 | 28780 | |
| 27041 | 28434 | 28390 | 28795 | 28909 | 28777 | |
| 9753 | 11108 | 11172 | 11637 | 11584 | 11509 | |
| LETS classes | Block Length | Expected values | ||||
|---|---|---|---|---|---|---|
| for | 500 | 1000 | 4000 | 10000 | 20000 | from Equation (58) |
| 1 | 0 | 0 | 0 | 0 | 0 | |
| 10 | 0 | 0 | 0 | 0 | 0 | |
| 179 | 53 | 9 | 5 | 4 | 0 | |
| 1771 | 717 | 132 | 75 | 18 | 0 | |
| 11080 | 5840 | 2474 | 2015 | 1785 | 1542 | |
| 48495 | 32305 | 21896 | 19933 | 19267 | 18507 | |
| 142776 | 116436 | 99340 | 94744 | 93575 | 92527 | |
| 282304 | 265229 | 254057 | 247637 | 247219 | 246710 | |
| 352158 | 364261 | 371165 | 369248 | 368437 | 370022 | |
| 247006 | 276543 | 288721 | 295189 | 295669 | 295983 | |
| 71956 | 89281 | 93021 | 98685 | 98889 | 98649 | |
| ETS class | Expected value of (33) | |||||
|---|---|---|---|---|---|---|
| 132 | 165 | 171 | 161 | 178 | 166.6 | |
| 3459 | 3700 | 3777 | 3675 | 3938 | 3750 | |
| 67559 | 70783 | 72468 | 71446 | 75626 | 72500 |
| ETS class | Expected Value | |||||
|---|---|---|---|---|---|---|
| 3 | 6 | 1 | 0 | 0 | 0 | |
| 120 | 160 | 31 | 2 | 1 | 0 | |
| 2 | 5 | 0 | 0 | 0 | 0 | |
| 4152 | 4080 | 916 | 125 | 67 | 0 | |
| 130 | 163 | 10 | 0 | 1 | 0 | |
| 98465 | 97852 | 23550 | 3886 | 1899 | 0 |
| ETS class | Expected value of (38) | ||||
|---|---|---|---|---|---|
| 1563 | 1620 | 1598 | 1531 | 1543 | |
| 89467 | 91891 | 91186 | 88578 | 89137 | |
| 4232030 | 4345623 | 4325691 | 4236433 | 4265616 |
| ETS | Block Length | Expected values | ||||
|---|---|---|---|---|---|---|
| classes | 500 | 1000 | 4000 | 10000 | 20000 | from Theorem 10 |
| 26 | 22 | 18 | 22 | 31 | 22.7 | |
| 341 | 287 | 216 | 272 | 304 | 291.5 | |
| 3979 | 3158 | 2340 | 2759 | 2883 | 3244 | |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Asymptotic Average Multiplicity of Structures within Different Categories of Trapping Sets, Absorbing Sets and Stopping Sets in Random Regular and Irregular LDPC Code Ensembles
Ali Dehghan, and Amir H. Banihashemi Part of this work is accepted for presentation at ISIT 2018, Vail, Colorado, USA, June 2018.
Abstract
The performance of low-density parity-check (LDPC) codes in the error floor region is closely related to some substructures of the code’s Tanner graph, collectively referred to as trapping sets (TSs). In this paper, we study the asymptotic average number of different types of trapping sets such as elementary TSs (ETS), leafless ETSs (LETS), absorbing sets (ABS), elementary ABSs (EABS), and stopping sets (SS), in random variable-regular and irregular LDPC code ensembles. We demonstrate that, regardless of the type of the TS, as the code’s length tends to infinity, the average number of a given structure tends to infinity, to a positive constant, or to zero, if the structure contains no cycle, only one cycle, or more than one cycle, respectively. For the case where the structure contains a single cycle, we derive the asymptotic expected multiplicity of the structure by counting the average number of its constituent cycles and all the possible ways that the structure can be constructed from the cycle. This, in general, involves computing the expected number of cycles of a certain length with a certain given combination of node degrees, or computing the expected number of cycles of a certain length expanded to the desired structure by the connection of trees to its nodes. The asymptotic results obtained in this work, which are independent of the block length and only depend on the code’s degree distributions, are shown to be accurate even for finite-length codes.
Index Terms: Low-density parity-check (LDPC) codes, random LDPC codes, trapping sets (TS), elementary trapping sets (ETS), leafless elementary trapping sets (LETS), absorbing sets (ABS), elementary absorbing sets (EABS), stopping sets (SS).
I introduction
The performance of low-density parity-check (LDPC) codes in the error floor region is closely related to some substructures of the code’s Tanner graph, here referred to collectively as trapping sets (TS). A trapping set is often identified by its number of variable nodes , and the number of unsatisfied check nodes in its subgraph.111The term “unsatisfied” is used for check nodes that are connected to an odd number of variable nodes in the trapping set. This term (in the case of binary LDPC codes) implies that if the variable nodes in the trapping set are set to one, and all the other variable nodes in the Tanner graph are set to zero, then these check nodes will be unsatisfied, i.e., the modulo-2 addition of the variables connected to the check node is non-zero. In this case, the set is said to belong to the class. For a given LDPC code, the harmful trapping sets would depend on the degree distributions of the code, the channel model, the decoding algorithm, the quantization scheme, and the structure of the code’s Tanner graph. Some categorizations of TSs include stopping sets (SS), elementary TSs (ETS), leafless ETSs (LETS), absorbing sets (ABS), and elementary ABSs (EABS). Stopping sets are known to be the problematic structures of the belief propagation algorithm over the binary erasure channel (BEC) [1]. Leafless ETSs are relevant for soft-decision iterative decoding of LDPC codes over the additive white Gaussian noise (AWGN) channel and have been widely studied in the literature, see, e.g., [2], [3], [4], and the references therein. In fact, LETS structures appear to form an overwhelming majority of dominant trapping sets of variable-regular LDPC codes over the AWGN channel [4]. The broader category of ETSs are harmful sets for irregular codes over the AWGN channel [2], [5]. Absorbing sets are the fixed points of bit-flipping decoding algorithms [6], [7], and are also shown to be relevant in the context of quantized decoders over the AWGN channel [7].
Regardless of the type of the TS, the problem of counting and/or enumerating trapping sets in a given code is a hard problem. It was shown in [8] that for a given , finding an TS with the smallest is an NP-hard problem. Also, finding an ETS with the smallest for a given is a hard problem [8]. Very recently, similar hardness results were proved for LETSs and EABSs [9]. Furthermore, it was proved in [10] that for a given Tanner graph and a positive integer , determining whether has a stopping set of size is NP-complete. Despite the intrinsic difficulty of such combinatorial problems, many advances are made on characterization of trapping sets and the development of efficient search algorithms to find them [11], [12], [13], [14], [15], [2], [16], [3], [17], [4], [5].
More related to the work presented in this paper, asymptotic results on the distribution of trapping sets in different ensembles of LDPC codes were established in [18], [19], [20], [21], [22]. The distribution of stopping sets in Tanner graph ensembles was first studied in [18]. In [19], using tedious combinatorial methods, Orlitsky et al. studied the asymptotic average distribution of stopping sets and the stopping number (the size of the smallest nonempty stopping set in a code) in regular and irregular LDPC code ensembles, as the code’s block length tends to infinity. In particular, they proved that for almost all codes with smallest variable degree greater than two, stopping number increases linearly with . Milenkovic et al. [20] used random matrix enumeration techniques and large deviations theory to study the asymptotic distribution of SSs and TSs of size constant and linear in , in random regular and irregular LDPC code ensembles. Specifically, for regular ensembles with variable degree , they demonstrated that as tends to infinity, for constant and values, the average number of ETSs in the class tends to , where is a constant with respect to , but depends on , , and . Abu-Surra et al. [21] studied the ensemble stopping set and trapping set enumerators for protograph-based LDPC codes both in finite-length and asymptotic regimes. The focus in [21] was to determine whether or not the typical relative smallest size of trapping or stopping sets grows linearly with the block length. Using a technique similar to that of [20], Amiri et al. [22] studied the asymptotic distribution of ABSs and fully ABSs (FABSs) of size linear in , in random regular LDPC code ensembles. They also derived simplified formulas for enumerating the asymptotic average number of EABSs, and FEABSs in regular LDPC code ensembles with variable degrees and .
In this paper, we focus on the case where and are constant values (with respect to ). This case is of particular practical interest, as in practice, the size of the most harmful trapping sets may not necessarily increase with the code’s block length. For the scenario of constant and values, the limited existing results on the asymptotic average number of trapping sets, as discussed in the previous paragraph, are consistent with the results of our analysis. In addition, for this scenario, our analysis both extends and deepens the results of the literature. For example, while the results in [20] are limited to TSs and ETSs, in this work, we also cover LETSs. As an example of how the results of this work are more accurate and provide a deeper understanding about the TS structures, we consider the only result of [20] on ETSs with constant and values. This result, which is limited to biregular ensembles, indicate that the asymptotic expected number of ETSs is equal to , for some constant . For the interesting case where , this result implies that, on average, there are a constant (non-zero) number of ETSs within the class under consideration. In this work, we determine the exact value of this constant as a function of the graph’s degree distributions and the values of and . In addition, we obtain similar results for variable-regular and irregular ensembles.
Another major difference in comparison with the existing literature [19], [20], [22], is that rather than focusing on a class of trapping sets, we focus on individual non-isomorphic structures within a class. We show that such structures, although in the same class, may demonstrate completely different asymptotic behavior. For example, the average number of one structure may tend to a constant as tends to infinity, while that of another structure, within the same class, may tend to infinity. Or the asymptotic average number of one structure may be zero, while that of another structure may be a non-zero constant. If one only considers the behavior of the whole class, in the former case, the average number tends to infinity and in the latter, to a constant. The granularity of our analysis is important since it is known that, in general, different non-isomorphic trapping sets in the same class can differ in their harmfulness [23]. It is thus important to know the average number of individual non-isomorphic TS structures within a code ensemble. Finally, although the analysis in [19], [20], [22], may be conceptually easy to understand, the derivations are tedious and the final formulas are often complicated. In contrast, at the core of our analysis is a simple, yet general, asymptotic result that the expected number of an structure is equal to ,222We use the notation , if for sufficiently large values of , we have , for some positive and values. where and are the number of vertices (nodes) and the number of edges of , respectively. This result is then easily translated to an asymptotic result on the average multiplicity of the structure based on the number of cycles in the structure. For the number of cycles equal to zero, one, or more than one, the average multiplicity tends to infinity, a non-zero constant, or zero, respectively.
In an ensemble of LDPC codes, for given and values, we say that the class has a consistent behavior in , if all the structures within the class demonstrate the same asymptotic behavior in , i.e., the average number of every non-isomorphic structure in the class tends to the same value (zero, infinity or a non-zero constant) in , as tends to infinity. Otherwise, we say that the class has an inconsistent behavior in . As an example of our results, we prove that in random variable-regular LDPC codes, every class of ETSs and every class of LETSs has a consistent behavior. In irregular LDPC codes, however, classes of ETSs and LETSs have inconsistent behavior, in general.
In Table I, we have summarized our results on the asymptotic average number of TSs in variable-regular ensembles of LDPC codes with variable degree , for different types of trapping sets, and for different values of , and . As we will discuss later, the asymptotic expected multiplicity of trapping sets depends on the value of in relation to . This can be seen in Table I through the distinction of different columns. In Table I, the notation denotes the asymptotic average number of cycles of length in the Tanner graphs of the codes. The entry “–” in the table is used to indicate that it is impossible to have trapping sets of a particular type in the corresponding classes. In Table I, the value is used as an asymptotic approximation for the average number of TS structures that are isomorphic to a simple cycle of length . The value was computed (approximated) recently in [24] for random regular (irregular) Tanner graphs.
As another contribution of this work, as related to variable-regular graphs (Table I), we focus on structures which are not simple cycles but contain only a single (simple) cycle. These are the remaining structures (in addition to simple cycles) whose asymptotic expected multiplicity, we have proved to be a non-zero constant. For such structures, we first characterize them as a simple cycle appended by a number of trees of different sizes rooted at the variable nodes of the cycle. We then use this characterization to count the number of such structures on average in the asymptotic regime. The counting problem in this case is formulated recursively, and has a solution which is a generalization of Catalan numbers [25]. These results correspond to the entry in Table I.
We also investigate the asymptotic expected multiplicity of different types of TSs in irregular graphs, in cases where such expected values are non-zero and finite. In particular, to compute the asymptotic average multiplicity of LETS, ABS and SS structures in irregular graphs, we generalize the results of [24] to count the number of cycles of different lengths with different combinations of node degrees within irregular ensembles of bipartite graphs. Moreover, to compute the asymptotic average multiplicity of ETS structures, we generalize the results derived for regular graphs (recursively counting the single cycles appended by trees) to irregular graphs.
The organization of the rest of the paper is as follows: In Section II, we present some definitions and notations. This is followed in Section III by our main result on the asymptotic average number of an arbitrary local structure in the Tanner graph of random LDPC codes. In this section, we also apply the general result to different types of trapping sets including ETSs, LETSs, ABSs, EABSs and SSs, for both variable-regular and irregular LDPC code ensembles. In Section IV, we compute the expected number of SS, ABS and LETS structures whose asymptotic average multiplicity is constant in irregular LDPC codes. Next, In Section V, we compute the expected number of ETS structures whose asymptotic average multiplicity is constant in biregular, variable-regular and irregular ensembles. Section VI is devoted to numerical results. The paper is concluded with some remarks in Section VII.
II Definitions and notations
For a graph , we denote the node set and the edge set of by and , respectively. In this work, we consider graphs with no loop or parallel edges, where a loop is defined as an edge that connects a node to itself. For and , notations and are used to denote the neighbor set of , and the set of nodes of which has a neighbor in , respectively. A path of length in is a sequence of distinct nodes in , such that , for . A cycle of length is a sequence of distinct nodes in such that form a path of length , and . We may refer to a path or a cycle by the set of their nodes, or by the set of their edges, or by both. A chordless or simple cycle in a graph is a cycle such that no two nodes of the cycle are connected by an edge that does not itself belong to the cycle.
A graph is called bipartite, if the node set can be partitioned into two disjoint subsets and (i.e., ), such that every edge in connects a node from to a node from . The Tanner graph of a low-density parity-check (LDPC) code is a bipartite graph, in which and are referred to as variable nodes and check nodes, respectively. Parameters and in this case are used to denote and , respectively. Parameter is the length of the code, and the code rate satisfies .
The number of edges incident to a node is called the degree of , and is denoted by or . Also, we denote the maximum degree and the minimum degree of by and , respectively. A bipartite graph is called biregular, if all the nodes on the same side of the given bipartition have the same degree, i.e., if all the nodes in have the same degree and all the nodes in have the same degree . It is clear that, for a biregular graph, . A bipartite graph that is not biregular is called irregular. A Tanner graph is called variable-regular with variable degree , if for each variable node , . Also, a -regular Tanner graph is a variable-regular graph with variable degree , in which for each check node , . The variable and check node degree distributions of irregular LDPC codes are represented by polynomials and , where and denote the fraction of the edges in the Tanner graph connected to degree- variable and check nodes, respectively, and , , and are the minimum and maximum variable and check node degrees in the Tanner graph, respectively.
In a Tanner graph , for a subset of , the induced subgraph of in , denoted by , is the graph with the set of nodes , and the set of edges . The set of check nodes with odd and even degrees in are denoted by and , respectively. Also, the terms unsatisfied check nodes and satisfied check nodes are used to refer to the check nodes in and , respectively. Throughout this paper, the size of an induced subgraph is defined to be the number of its variable nodes (i.e., ).
For a given Tanner graph , a set , is said to be an * trapping set (TS)* if and . Alternatively, set is said to belong to the class of TSs. A stopping set (SS) is a TS for which the degree of each check node in is at least two. An elementary trapping set (ETS) is a TS for which all the check nodes in have degree one or two. A leafless ETS (LETS) is an ETS for which each variable node in is connected to at least two satisfied check nodes in . An absorbing set (ABS) is a TS for which all the variable nodes in are connected to more nodes in than in . Also, an elementary absorbing set (EABS) is an ABS for which all the check nodes in have degree one or two. A fully absorbing set (FABS) is an ABS with the extra constraint that each variable node in is connected to strictly fewer nodes in than in . Also, an elementary fully absorbing set (EFABS) is an FABS such that each check node in has degree one or two.
In the asymptotic analysis presented in this work, for a given ensemble of LDPC codes (Tanner graphs), identified by certain degree distributions and the block length , we consider the case where tends to infinity. In such an asymptotic scenario, we say that a structure is local in a Tanner graph , if the definition of depends on a constant number (with respect to ) of nodes in . In particular, as we are interested in subgraphs induced in by a set of variable nodes , if such subgraphs are local, we refer to them as being locally induced subgraphs. For example, if one considers a constant positive integer , and Tanner graphs whose maximum variable degree is a constant in , then ETSs, LETSs, ABSs, EABSs, and SSs are all local structures. (Note that such sets can only exist for , and thus is also a constant in .) On the other hand, the definition of FABSs depends on all the variable nodes in the Tanner graph, and thus FABSs are not local structures. In this work, our focus is on TSs with and being constants in . We also consider Tanner graphs whose maximum variable and check node degrees are constant. Thus, in this work, all ETSs, LETSs, ABSs, EABSs and SSs are local structures. We note that the locality constraint is one of the main assumptions required for the proof of our results including Theorem 1 in the next section.
We conclude this section with the definition of -permutations that will be used in the proof of our main result. The -permutations of objects are the different ordered arrangements of -element subsets of the objects. The number of such permutations is equal to .
III Asymptotic Average Number of an Arbitrary Local Structure in LDPC Code (Tanner Graph) Ensembles
III-A Main Result
In this subsection, we find a lower and an upper bound on the average number of a locally induced subgraph of a random Tanner graph with a given degree distribution, in the asymptotic regime where the size of the graph tends to infinity.
Theorem 1**.**
Let be a fixed natural number, such that , and , and that . Consider the probability space of all Tanner graphs with node set , where , and , . Suppose that the degree sets and are selected according to the distributions and , respectively, and that the graphs in are selected uniformly at random. For , denote by the number of copies of a subgraph , induced by a constant number of variable nodes, in . Then, in the asymptotic regime of , we have
[TABLE]
where is the expected value of , and and are the number of nodes and edges of , respectively.
Proof.
First, we introduce the method that we use to construct a random Tanner graph. The method is the same as the one used in [24]. Suppose that the variable nodes are labeled as , and check nodes as . For each node with degree , we assign a bin that contains cells, and consider a random perfect matching (bijection) to pair the cells on one side of the graph to the cells on the other side. The bipartite graphs constructed by connecting any two paired cells with an edge are called configurations. There are configurations, where is the number of edges in the graph. In the rest of the proof, we assume that configurations are selected uniformly at random. From each configuration, we construct a Tanner graph such that if there is an edge between two cells, then we place an edge between the corresponding nodes (bins) in the Tanner graph. The resulted Tanner graphs can thus be considered as images of the configurations that are obtained from the random perfect matchings. Following the notation in [24], we denote the ensemble of bipartite graphs so constructed by , and note that the graphs in can have parallel edges. We also note that a uniform distribution over the configurations induces a non-uniform distribution over . The next step in the construction of is to remove all the graphs with parallel edges from . It is now easy to see that a uniform distribution over the configurations, with the condition that images with parallel edges are rejected, induces a uniform distribution over . The reason is that corresponding to each graph in , we have the same number of configurations.
It was proved in [24] that rather than working in the probability space , one can work in , and that if a property holds true asymptotically almost surely (a.a.s.) for , it also holds true a.a.s. for . (See the proof of Theorem 1 in [24].) We adopt the same approach here, and in addition, instead of directly working in , which has a non-uniform distribution, we perform the calculations in the space of configurations (with uniform distribution).
Consider a bipartite structure of interest , such that , , and . We say there is a copy of in a configuration corresponding to a Tanner graph , if there is a set of edges in the configuration, whose image in corresponds to a subgraph which is isomorphic to . We denote the number of copies of in a configuration by . Now considering that, given a set of edges, there are configurations containing those edges, and that configurations are selected uniformly at random, we have
[TABLE]
where, the last equation is valid asymptotically since is a constant in , and thus in the number of edges of the Tanner graph. In the following, we derive upper and lower bounds on , and subsequently on .
Upper bound on : To form a copy of the structure , we choose an -permutation of the bins from and an -permutation of the bins from (note that some of the permutations may result in the same copy of . By considering all possible -permutations and -permutations, we obtain an upper bound on ). Next, after fixing the -permutation and the -permutation, for each , , for the bin in the -permutation, we select cells in order. We note that if the number of cells in the bin is not equal to , then a copy of cannot be formed (by definition, the degree of a variable node in a subgraph induced by a set of variable nodes must be equal to the degree of that variable node in the Tanner graph). On the check side, for each , , for the bin in the -permutation, we select cells in order. If the number of cells in the bin is less than , then a copy of cannot be formed. (Note that by considering all possible orderings for cells and cells, we find an upper bound on , since some of those orderings may result in the same copy of ). The number of possible choices of the cells on the variable side is thus upper bounded by , that can be further bounded from above by . Similarly, the number of choices of the cells on the check side is upper bounded by . We therefore have
[TABLE]
Lower bound on : If any of the variable degrees of is missing in the variable node degree distribution of the Tanner graph , then, no copy of can exist in . Otherwise, assume that there are different variable degrees , in , and denote the number of variable nodes with degree by . We thus have . Denote by the set of variable nodes in with degree . By the construction of the ensemble, we have , for some constant values . On the check side, denote the set of all check nodes with degree at least equal to the largest check degree in by . Here also, , for some constants and . It is then easy to establish the following lower bound on :
[TABLE]
where . Using the linear lower bounds on , and , followed by the well-known lower bound , we obtain
[TABLE]
where is a constant, and we have used .
The proof is then completed by combining (3) and (5) with (2), and noting that , where is the minimum variable degree in . ∎
Theorem 1 shows that depending on the relative values of and , the expected number of a structure can tend to zero, infinity or a non-zero constant, as tends to infinity. The following lemma establishes a connection between the number of cycles in , and the value of .
Lemma 1**.**
Consider a graph with the set of nodes , and the set of edges . If , then does not contain any cycle. Else, if , then contains at least two cycles, and if , then contains only one (simple) cycle.
Based on Theorem 1 and Lemma 1, we have the following corollary.
Corollary 1**.**
Consider a random ensemble of Tanner graphs and a given subgraph induced by a constant number of variable nodes in such Tanner graphs. Depending on whether contains at least two cycles, only one cycle, or no cycle, the average number of structure in the ensemble tends to zero, to a positive constant, or to infinity, as the size of the graphs (length of the codes) tends to infinity.
In the following subsections, we apply the results of this subsection to different categories of trapping sets, i.e., ETSs, LETSs, ABSs, EABSs and SSs, respectively.
III-B Elementary Trapping Sets (ETS)
In this subsection, based on the general result of Theorem 1, we study the asymptotic behavior of ETSs in both regular and irregular LDPC code ensembles.
The following results show that every class of ETSs has a consistent behavior in variable-regular LDPC codes, and that the behavior is fully determined by the ratio and the variable degree .
Proposition 1**.**
Consider an ETS in a variable-regular Tanner graph with variable degree . We then have
[TABLE]
Proof.
Suppose that is induced in a Tanner graph by the set of variable nodes . By counting the number of edges in from the variable side, we have . We also have
[TABLE]
where the second equality follows from the fact that all the satisfied and unsatisfied check nodes have degree two and one, in the ETS, respectively. The proof is then completed by subtracting from (7). ∎
The next theorem is resulted from Theorem 1 and Proposition 1, and describes the asymptotic expected number of ETSs in different classes of variable-regular LDPC code ensembles.
Theorem 2**.**
Consider a random variable-regular LDPC code with variable degree and length . Denote by the number of elementary trapping sets in . Then, for being a constant in , in the asymptotic regime where , we have:
[TABLE]
Thus, depending on whether , , or , the expected value tends to infinity, zero, or a positive constant value in .
Example 1**.**
Figure 1 shows three ETS structures in variable-regular Tanner graphs with , each satisfying one of the conditions in Theorem 2. (Variable nodes, satisfied and unsatisfied check nodes are shown by full circles, empty squares and full squares, respectively.)
In Section V, we study the case of in more details, and derive closed-form formulas to calculate the expected multiplicity of ETSs that satisfy in biregular and variable-regular Tanner graphs.
Unlike variable-regular Tanner graphs, in irregular Tanner graphs, classes of ETSs demonstrate an inconsistent behavior, i.e., in general, in an class, one can find at least two structures whose expected numbers tend to different values (infinity, zero or a non-zero constant), as . This is explained in the following example for the class.
Example 2**.**
Figure 2 shows three ETS structures, all in the class, but each with a different asymptotic behavior. While the asymptotic expected values of the leftmost and the rightmost structures are infinity and zero, respectively, for the middle structure, the asymptotic expected value is a positive constant (see, Corollary 1).
In Section V, we further discuss the ETSs whose average multiplicity tends to a constant in irregular graphs asymptotically and derive formulas to compute such expected values.
III-C Leafless Elementary Trapping Sets (LETS)
A LETS is a special case of an ETS, and thus the general results presented in Proposition 1 and Theorem 2 are also applicable to LETS structures of random variable-regular Tanner graphs. For LETSs, however, from the three scenarios of , , and , only the last two can happen. This is proved in the following lemma.
Lemma 2**.**
For any LETS structure in the class of a variable-regular LDPC code with variable degree , we have .
Proof.
Consider an LETS structure induced by the set of variable nodes . Counting the number of edges in the subgraph from the two sides of the graph, we have . Since is a LETS structure, each variable node is connected to at least 2 satisfied check nodes. This implies , which together with the previous equation complete the proof. ∎
Theorem 3**.**
Consider a random variable-regular LDPC code with variable degree and length . Denote by the number of leafless elementary trapping sets in . Then, for being a constant in , in the asymptotic regime where , we have:
[TABLE]
Thus, depending on whether , or , the expected value tends to zero, or a positive constant value in .
Based on Theorem 3, it is clear that, in the asymptotic regime, only the LETS classes with are non-empty. All the structures in such classes contain only one cycle, and we thus have the following result.
Corollary 2**.**
*In random variable-regular LDPC codes with variable degree and length , as , the only non-empty classes of local LETS structures are those with . For each such class, all structures within the class correspond to a simple (chordless) cycle of length 2a. *
For irregular LDPC codes, the following result is in parallel with Lemma 2.
Lemma 3**.**
For any LETS structure in an irregular LDPC code, we have .
Proof.
To prove the lemma, we show that any LETS structure has a cycle. In , each variable node is connected to at least two satisfied check nodes. On the other hand, the degree of each satisfied check node is two. Now, if we consider the subgraph formed by variable nodes, satisfied check nodes and the edges in between, we have a graph with minimum degree two. It is well-known that a graph with the minimum degree two has at least one cycle. ∎
Based on Lemma 3, for irregular LDPC codes also, the asymptotic expected number of LETS structures is either zero or a constant non-zero value. The classes of LETS structures in irregular codes also demonstrate an inconsistent behavior, and among all the structures within an class, only those that correspond to simple cycles of length have an asymptotically non-zero expected value for their multiplicity. We thus have the following result.
Proposition 2**.**
Consider random irregular LDPC codes with , and length . As , for a constant value of , the sum of the expected number of LETS structures in all the classes, for different values of , tends to the expected value of the number of simple cycles of length in the code.
The cycle structure of random regular and irregular Tanner graphs was studied in [24]. In particular, it was shown in [24] that the asymptotic expected value of the number of cycles of length in irregular graphs with variable degrees and check node degrees , as the size of the graph tends to infinity, is approximated by
[TABLE]
where is the number of edges in the graph. This result for variable-regular graphs with variable degree reduces to
[TABLE]
and for biregular graphs to
[TABLE]
Considering that in the asymptotic regime of , by Corollary 1, the expected number of cycles with chords tends to zero, one can use the above approximations for chordless cycles, and use them along with Corollary 2, to obtain asymptotic estimates on the average number of LETS structures in different classes of regular graphs.
For irregular graphs, the result of Proposition 2 together with the approximation (8) can be used to estimate the sum of LETS structures for a given and different values. In Section IV, we fine tune our analysis for irregular graphs and compute the expected number of LETS structures for any given values of and for which the expected multiplicity tends to a constant.
III-D Absorbing Sets (ABS)
The following result relates the number of nodes and edges of an ABS.
Lemma 4**.**
For any ABS structure within a Tanner graph with , we have .
Proof.
Let be the ABS induced in the Tanner graph by the set of variable nodes . By the definition of an ABS, each variable node in is connected to more nodes in than in . Each node in has thus at least two neighbors in . On the other hand, the degree of each node in within is at least two. Thus, if we consider the subgraph of containing the nodes in and , and the edges in between, we obtain a graph with minimum degree two. Such a graph, thus, has a cycle. This completes the proof. ∎
Based on Theorem 1 and Lemma 4, it is clear that the average number of any ABS structure tends to either zero or a positive constant (not to infinity) as the block length tends to infinity. The following theorem distinguishes between the two cases depending on the variable degrees of the Tanner graph.
Theorem 4**.**
Consider random Tanner graphs with variable node degree distribution . If , then all the classes of local ABSs have zero multiplicity asymptotically. Otherwise, if , then all the local classes of ABSs with and , have zero multiplicity asymptotically. In this case (), the structures in the local class with asymptotically non-zero multiplicity all correspond to simple cycles of length consisting only of degree-3 variable nodes. Finally, if , then only the local ABS structures whose variable degrees are only or can have non-zero multiplicity asymptotically. In the classes with a given and different values of , such structures are all simple cycles of length .
Proof.
If , then each variable node in the ABS must be connected to at least three satisfied check nodes. Now, consider the subgraph that consists of the variable nodes of the ABS, its satisfied check nodes and the edges in between. This subgraph has minimum degree two, and a node of degree at least three, and thus contains at least two cycles. This means that the ABS itself contains at least two cycles and thus, based on Corollary 1, its multiplicity is zero asymptotically. The same argument applies to any ABS structure that has at least one node with degree greater than or equal to four (for the cases with or ).
If , thus, all ABS structures have asymptotically zero multiplicity, except those whose variable nodes, all, have degree three. In this case, by the definition of an ABS, we must have , as each degree-3 variable node must be connected to at least two satisfied check nodes. Now consider the case where and . For this case, we consider two scenarios and show that for both scenarios the ABS structures will have more than one cycle and thus their multiplicity is zero asymptotically: (1) all unsatisfied check nodes have degree one, (2) at least one unsatisfied check node has degree or larger. In the first scenario, since , there must exist a variable node in the ABS that has no connection to unsatisfied check nodes. Node is thus connected to three satisfied check nodes, and with the same argument presented before, the ABS will have more than one cycle. In the second scenario, consider the subgraph of the ABS consisting of variable nodes, satisfied check nodes and the unsatisfied check node plus all the edges in between. This subgraph has minimum degree and a node with degree at least , and thus has more than one cycle. Therefore, for the case of , the only classes of ABSs with asymptotically non-zero multiplicity are classes. The only elements within the class whose multiplicity is non-zero asymptotically are simple cycles consisting of only degree-3 variable nodes, where each variable node is connected to one unsatisfied check node of degree one, and the degree of all satisfied check nodes are two.
For graphs with , the proof of the statement of the proposition is similar. In this case, the only structures whose average multiplicity tends to a constant are those with variable degrees only or , that contain only a simple cycle. ∎
From Theorem 4, it can be seen that any class of ABSs has a consistent behavior in any ensemble of variable-regular or irregular LDPC codes with . The same applies to any class of ABSs with in any ensemble of LDPC codes with . One can, however, easily provide examples where two ABS structures within the same class have different asymptotic behavior in an ensemble with . This is demonstrated in Fig. 3 for the class. While the structure in Fig. 3(a) has only one cycle and thus has an average multiplicity tending to a non-zero constant, the structure in Fig. 3(b) contains more than one cycle and thus its multiplicity is zero asymptotically. For ensembles with , it can be seen that, in general, an class of ABSs with and , has an inconsistent behavior (see Fig. 4, for two structures with different asymptotic behaviors in the class).
In addition to the asymptotic results of Theorem 4, the following finite-length result applies to ABSs of variable-regular LDPC codes.
Lemma 5**.**
There is no ABS structure with , in a variable-regular Tanner graph with variable degree .
Proof.
By the definition of an ABS, each variable node is connected to at least satisfied check nodes. The number of edges in the induced subgraph connected to unsatisfied check nodes is thus at most or . ∎
Corollary 3**.**
There is no ABS structure with , in a variable-regular Tanner graph with variable degree .
In Section IV, we complement the results of Theorem 4, by calculating the asymptotic expected multiplicity of ABSs for irregular codes with and , in classes with (see Corollaries 5 and 6).
III-E Elementary Absorbing Sets (EABS)
Elementary ABSs are a special case of ABSs. All the results presented in the previous subsection are thus applicable to EABSs as well. On the other hand, EABSs, for codes with , are a special case of LETSs. Therefore, the results presented in Subsection III-C are also applicable to EABSs. In fact, for the variable-regular graphs with or , or irregular graphs with variable degrees only and , the sets of EABSs and LETSs are identical.
In the asymptotic regime of , for variable-regular Tanner graphs with , using Theorem 4, one can see that all EABS classes are empty. For the cases of and , based of Corollary 2, the only non-empty classes (asymptotically) are those with , whose members are simple cycles of length . The average multiplicity of such classes can then be approximated by (9) or (10).
In general, for an irregular Tanner graph, the following result shows that among ABS structures, only those that are elementary can possibly have non–zero multiplicity in the asymptotic regime of . This implies that for graphs with , the asymptotic results presented in Theorem 4, related to ABS structures with constant average multiplicity, applies directly to EABSs.
Proposition 3**.**
Any non-elementary local ABS structure in a Tanner graph with contains more than one cycle. The multiplicity of in a random Tanner graph thus tends to zero as the size of the graph tends to infinity.
Proof.
The non-elementary ABS structure has either () a satisfied check node with degree or larger, or () an unsatisfied check node of degree or larger. In addition, each variable node in is connected to at least two satisfied check nodes. For Case (), consider the subgraph of consisting of all the variable nodes in , all the satisfied check nodes and the edges in between. This subgraph has minimum degree and has a node with degree or larger. It thus contains more than one cycle, and so does the ABS structure , itself. For Case (), consider the subgraph of containing all the variable nodes, all the satisfied check nodes and the unsatisfied check node with degree or larger, as well as all the edges in between. This subgraph has minimum degree and has a node with degree or larger, and thus contains more than one cycle. ∎
III-F Stopping sets (SS)
In stopping sets, each check node is connected to at least two variable nodes. For Tanner graphs with , therefore, any SS structure has a minimum degree at least , and as a result, contains at least one cycle. If we, however, further limit the degree distribution of Tanner graphs to , any SS structure will have a minimum degree of two and at least one node with degree or larger. This implies that the SS structure will contain more than one cycle.
Lemma 6**.**
Any SS structure in a Tanner graph with contains at least one cycle, i.e., . For Tanner graphs with , however, any SS structure contains more than one cycle, i.e., .
It is easy to see that, in general, the only SS structures that contain only one cycle are those with all variable nodes having degree . For such stopping sets, we have the following result.
Lemma 7**.**
Consider a stopping set that belongs to the class, and in which all the variable nodes have degree . We then have . Moreover, among all such SS structures, only those with can contain only one cycle. Those in other classes all have more than one cycle.
Proof.
Since in an SS, each check node has degree at least , the number of check nodes in the SS (with all variable nodes having degree ) must be less than or equal to the number of variable nodes and thus . If , however, since the degree of an odd-degree check node in the SS is at least , then there will be edges connected to odd-degree check nodes. This, alone, will be larger than the total number of edges connected to variable nodes. Thus, we must have . For the second part of the lemma, we notice that any SS with has at least one check node of degree or larger, and thus contains more than one cycle. ∎
Based on Lemma 7, the only SS structures with only one cycle are codewords (). In such structures, the number of variable and check nodes is the same, and all variable nodes and check nodes have degree . In fact, these structures are all simple cycles of length formed by degree-2 variable nodes.
Theorem 5**.**
Any class of local stopping sets has a consistent behavior in any ensemble of LDPC codes with , with the asymptotic multiplicity equal to zero. Moreover, the same result applies to LDPC codes with and stopping set classes with . In ensembles with , classes, in general, however, do not demonstrate a consistent behavior. The only stopping set structures in these classes whose average multiplicity tends to a non-zero constant by increasing the block length are simple cycles of length consisting of degree- variable nodes. All the remaining structures have an asymptotic multiplicity of zero.
Fig. 5 shows two stopping sets, both having only variable nodes with degree , and both in the class, but with different asymptotic behavior.
In the following section, to complement the result of Theorem 5, in Corollary 4, we compute the asymptotic expected multiplicity of stopping sets in the class.
IV Asymptotic Expected Multiplicity of Stopping Set, ABS and LETS Structures in Irregular Bipartite Graphs
The key results that enable us to find the asymptotic expected value of stopping set, ABS and LETS structures in irregular graphs are the following theorem, and its generalization in Theorem 7, that compute the expected number of cycles of a given length with a given combination of variable and check node degrees.
Theorem 6**.**
Consider the probability space of all bipartite graphs with variable and check node degree distributions and , respectively, where the graphs in are selected uniformly at random. Also, assume that , , and that and are fixed natural numbers, and let the number of variable nodes and the number of check nodes tend to infinity. Let and be sequences of non-negative integers such that and . For given vectors , and , denote by the number of cycles of length in such that each cycle has variable nodes with degree , for each , and check nodes with degree , for each . Then, for any fixed even value of , we have
[TABLE]
Proof.
We use the same model and notations as described in the proof of Theorem 1, where bipartite graphs are images of configurations. Rather than working in the probability space to find the expected value, we then work in the probability space corresponding to the configurations. Using an approach similar to the one used in [24], one can show that the expected values in the two spaces are asymptotically equal.
For a configuration, we define an ()-cycle to be a set of edges, like , connecting distinct bins, like . The connections are such that for each , the edge , connects a cell in bin to a cell in bin , where , and the two cells in each bin , connected to the two edges and , are distinct (). Also, for the bins corresponding to variable nodes, for each , there are bins, each with cells, and for the bins corresponding to check nodes, for each , there are bins, each with cells. We now compute the number of ()-cycles, , in a configuration.
For a bin with cells, we have choices for the two cells that are at the end points of the edges connected to the bin. Hence, in order to choose all the cells on both sides of the graph, we have
[TABLE]
choices, where and are the sets of variable and check nodes with degree , respectively. To count the number of ()-cycles in a configuration, we also need to consider different orderings of the bins on each side of the graph. This results in
[TABLE]
where the division by is for counting each cycle in the above process times. We thus have
[TABLE]
In the following, we simplify (13) in a number of steps. First,
[TABLE]
Also, for any two integers , where tends to infinity and is a constant number, we have
[TABLE]
[TABLE]
Moreover, by the definition of degree distributions, for each , we have:
[TABLE]
Consequently, by (17) and (16), we have
[TABLE]
Using in (18), we further obtain
[TABLE]
By counting the number of edges from the variable node side of the graph, and by (17), we have
[TABLE]
By substituting (19) and (20), for both sides of the graph, in (13), we obtain (11).
∎
Remark 1**.**
*For a -regular Tanner graph, the result of Theorem 6, obtained by replacing and in (11), reduces to (10) obtained in [24]. *
In the context of finding the asymptotic multiplicity of SS, ABS and LETS structures with only one simple cycle, we are particularly concerned about the degrees of variable nodes involved in the cycle, as these degrees determine the number of unsatisfied check nodes, , in the trapping set. The degrees of the check nodes in the cycle however, are of no consequence to the structure of the trapping set. In the following, we address this new counting problem by generalizing the result of Theorem 6 to the cases where some of the variable or check nodes can have degrees that are freely chosen from the degree distributions.
Theorem 7**.**
Consider the random ensemble of irregular bipartite graphs , described in Theorem 6, and let and be the set of variable and check node degrees satisfying , for . Let and be sequences of nonnegative integers such that and . For vectors , , and , denote by the number of cycles of length in such that each cycle has at least variable nodes with degree , for each , and at least check nodes with degree , for each .333The cycle thus has variable nodes and check nodes, whose degrees are freely selected from the degree distributions. Then, as the size of the graph tends to infinity, for any fixed even value of , we have
[TABLE]
where the approximation is an asymptotic upper bound within the fixed multiplicative factor of from the exact value, with Specht’s ratio defined by , for , and ( is Euler’s constant), and .
Proof.
Similar to the proofs of Theorems 1 and 6, here also, we work in the random ensemble of configurations. For each , , let be the set of variable nodes with degree . Also, for each , , let be the set of check nodes with degree . To form an ()-cycle, for each , , one needs to choose bins from . Next, from the remaining unchosen nodes in , one needs to choose bins. Similar selection process will need to be performed on the check node side of the graph. Since the size of the graph tends to infinity and since and thus all and values are constants, in the selection of variable nodes and check nodes, we can assume that the number of choices is asymptotically equal to the case where these nodes are selected from the whole set of variable and check nodes, respectively. Therefore, by taking similar steps as those taken to derive (12), we have the following equation for the number of ()-cycles in a configuration:
[TABLE]
where and . By substituting (18) in (22), we obtain
[TABLE]
In the following, we derive asymptotic upper and lower bounds on (23) that differ only in a multiplicative factor. For this, we first use Maclaurin’s inequality (see [26], pp 117-119), as described below. Let be positive real numbers, and for , define the averages as follows:
[TABLE]
where the summation is over all distinct sets of indices. Maclaurin’s inequality then states:
[TABLE]
Using Maclaurin’s inequality, we thus have
[TABLE]
On the other hand [27],
[TABLE]
where , with and equal to the maximum and minimum values of numbers , and Specht’s ratio is defined by
[TABLE]
in which is Euler’s number.
Combining (24) and (25), we have
[TABLE]
Now, let . Focusing on the upper bound in (27), we then have
[TABLE]
Similarly, we can establish the following asymptotic lower bound:
[TABLE]
Now, by applying (28) and (29) to (23) for both sides of the graph, we obtain the asymptotic upper and lower bounds on . In particular, the asymptotic value of the upper bound for (23), being used to obtain the approximate value of , is calculated as follows:
[TABLE]
On the other hand, by (20), we have
[TABLE]
This together with (30) and result in (7).
The asymptotic lower bound on is equal to the upper bound of (7) multiplied by , proving the claim about the accuracy of the approximation. ∎
Remark 2**.**
In Theorem 7, consider the case where . In this case, we are interested in cycles of length without any constraint on the node degrees. One can see that in this case, Equation (7) reduces to (8), obtained in [24].
Corollary 4**.**
For the ensemble of irregular bipartite graphs discussed in Theorem 7, let . Then, the expected multiplicity of local stopping sets is given by
[TABLE]
Proof.
Based on Theorem 5, we need to find the asymptotic expected number of simple cycles of length that consist of variable nodes of degree . This can be estimated from Theorem 7 by counting the number of cycles of length that consist of degree- variable nodes and check nodes with free degrees, i.e., and , and noting that in the asymptotic regime, all such cycles are simple. ∎
Corollary 5**.**
For an ensemble of irregular graphs with , the asymptotic average multiplicity of ABSs is given by
[TABLE]
Proof.
Based on Theorem 4, the only ABS structures in the class whose asymptotic expected multiplicity is non-zero, are simple cycles consisting only of degree- variable nodes. We can thus estimate the number of such cycles using Theorem 7 by counting cycles of length that consist of degree- variable nodes, and check nodes of free degree, i.e., and , and the fact that in the asymptotic regime, all such cycles are simple. ∎
Corollary 6**.**
For an ensemble of irregular graphs with , the asymptotic average multiplicity of local ABSs for an arbitrary value of and different values of , is given by
[TABLE]
Proof.
Based on Theorem 4, the only ABS structures in the class whose asymptotic expected multiplicity is non-zero, are simple cycles consisting only of degree- and degree- variable nodes. We can thus estimate the number of such cycles using Theorem 7 by counting cycles of length that consist of degree- and degree- variable nodes, and check nodes of free degree, i.e., , and the fact that in the asymptotic regime, all such cycles are simple. The result of the corollary then follows from Equation (7) after some simplifications. ∎
Based on the discussions in Subsection III-C, we know that the only LETS structures that have finite non-zero multiplicity asymptotically are those with only a simple cycle. In Theorem 7, we derived an asymptotic approximation for the expected number of such cycles with different lengths and with different combination of variable degrees. The following proposition, whose proof is straight forward, along with the results of Theorem 7 can be used to estimate the asymptotic multiplicity of LETS structures in different classes for irregular codes.
Proposition 4**.**
Consider an ensemble of irregular LDPC codes, and suppose that constants and are such that the asymptotic expected multiplicity of LETSs in is a finite non-zero value. Then, the asymptotic average multiplicity of LETSs in is equal to the asymptotic expected number of simple cycles of length in whose variable node degrees add up to .
The following example explains how the combination of Theorem 7 and Proposition 4 can be used to estimate the multiplicity of LETS structures in different classes for irregular codes.
Example 3**.**
Consider the ensemble of irregular LDPC codes with degree distributions and , where , and are constant integers. For any fixed value of , any class of LETSs with the value of in the range has a non-zero finite expected value asymptotically. For each such class, with given and values, we can uniquely determine the number of variable nodes of each degree, and , that exist in simple cycles belonging to the LETS class. For this, one needs to solve the following system of linear equations: and , to find the number of variable nodes of degrees and in the cycle. Solving these equations results in and . Using Theorem 7, we can then estimate the asymptotic expected multiplicity of LETSs by
[TABLE]
V Asymptotic Expected Multiplicity of ETS Structures in Regular and Irregular Bipartite Graphs
The only trapping sets with finite non-zero asymptotic expected multiplicity are those which contain a single (simple) cycle. One can thus see that in the asymptotic regime, ETS structures with finite non-zero expected multiplicity consist of a simple cycle and a collection of trees stemming from the variable nodes of that cycle. Each such tree is rooted at the corresponding variable node and contains check nodes of degrees two and one, where degree- check nodes form all the leaves of the tree. The problem of computing the multiplicity of ETSs with variable nodes can thus be broken into two subproblems: (i) calculating the multiplicity of simple cycles of different length () with different variable degrees (if the graph is irregular), and (ii) calculation of the number of possible ways that a collection of trees with variable nodes (in addition to the roots) can be attached to the variable nodes of the cycle. To solve the first subproblem, we use the results of [24] and those obtained in Section IV. To tackle the second subproblem, we formulate the problem as a recursive counting problem whose solution is a generalization of Catalan numbers [25].
To formulate Subproblem (ii), we consider a simple cycle of length , where . In this case, the number of variable nodes within the ETS but outside is . For a given value of , we partition into non-negative integers , where each , satisfies , and corresponds to one of the variable nodes of . The integer , in fact, is used to denote the number of nodes outside that are in the tree rooted at the -th variable node of . We thus have the constraint . We further use the notation , to denote the multiplicity of ’s with . Thus, we have , and . We use the notation for any sequence on non-negative integers that satisfy these two equations. For example, .
In the following, we first consider the case of biregular graphs and then generalize the results to variable-regular and irregular cases.
V-A Biregular Bipartite Graphs
Theorem 8**.**
Consider the ensemble of -regular graphs in which the number of nodes tends to infinity. Given fixed values of , and , for , we have
[TABLE]
and for a constant , we have
[TABLE]
where the asymptotic approximation is an upper bound within a multiplicative factor of from the exact value.
Proof.
We first note that based on Theorem 2, the only classes of ETSs with finite non-zero asymptotic expected multiplicity are those with . These classes, which consist of all the structures that contain a single (simple) cycle, are the focus of this theorem. In the asymptotic regime, any ETS contains a simple cycle whose neighborhood within any constant radius is a forest with probability one. The structure of this forest follows the degree distribution of the ensemble, i.e., any tree rooted at one of the variable nodes of is connected to check nodes and each such check node is connected to other variable nodes, which in turn are each connected to other check nodes and so on. The ETS , in addition to , consists of some trees, each rooted at one of the variable nodes of . Consider a variable node of , which is the root of a tree within . The tree is a subgraph of the neighborhood tree of rooted at . We refer to the collection of trees such as , which are part of the ETS , and each are rooted at one of the variable nodes of , as the forest of . Each tree in this forest, is called a basic tree. The number of variable nodes other than the root in a basic tree is called the variable-size of the tree or “v-size,” in brief. Variable-size zero for a basic tree implies that variable node is connected to check nodes of degree- within the ETS. Basic trees have the following properties: (i) the root has degree , (ii) the degree of check nodes is either two or one, (iii) the degree of all the variable nodes in the tree, except the root, is , and (iv) the leaves of the tree are all check nodes (with degree one).
We use the notation to denote the number of possible basic trees with constant v-size initiated from an arbitrary variable node of an arbitrary simple cycle within a Tanner graph from the ensemble of -regular graphs in the asymptotic regime. One can then see that
[TABLE]
In (35), the outer summation is to count simple cycles of different length , and the inner summation is responsible for counting all the possible ways that the variable nodes that are not part of the cycle can be partitioned into subsets of different size . The multinomial coefficient describes the number of ways that the variable nodes of the simple cycle can be connected to basic trees of different v-sizes.
To calculate , we note that any of the variable nodes in a basic tree has a check node of degree as its parent. We thus have
[TABLE]
where denotes the number of possible rooted trees with nodes such that in each tree, the root has degree at most , and all the other nodes have degree at most . The term in (36) accounts for the fact that at each non-leaf check node of a basic tree, one has choices to select the variable node child.
By replacing (36) in (35), we obtain
[TABLE]
Taking the expected value of the above expression and replacing in the expected value with (10), we have
[TABLE]
By the definition of , we have
[TABLE]
where is the number of rooted trees with nodes such that the root has degree at most and each other node has degree at most . To calculate , we can then use the following recursion
[TABLE]
with base cases . The solution to the recursion (40) is a generalization of Catalan numbers [25], and is given by
[TABLE]
For the case , by (39), we have , and thus,
[TABLE]
By replacing with (42) in (38), we obtain (33).
To derive (34), we first establish an upper and a lower bound on in terms of . For the upper bound, it is clear that
[TABLE]
For the lower bound, by the definition of , we have
[TABLE]
On the other hand, for each in the above summation, the following inequality holds:
[TABLE]
Combining (45) with (44), we obtain
[TABLE]
This together with results in
[TABLE]
By replacing with the upper bound of (43) in (38), and using (41) for , we obtain the asymptotic upper bound of (34).
To obtain the asymptotic lower bound that determines the accuracy of Estimate (34), we apply the lower bound of (47) on to (38). This results in the following lower bound
[TABLE]
This lower bound can be further bounded from below by the following equation:
[TABLE]
To obtain (48), we have used the following sequence of inequalities:
[TABLE]
∎
Remark 3**.**
We note that (34) is, in general, less accurate than (33), in the sense that while (33) is asymptotically equal to the true expected value , Equation (34) provides only an asymptotic upper bound for . It should, however, be noted that in (38), we present an asymptotic value for even when . To use (38), one needs to calculate the exact value of using (39) and (41). This can be easily performed for the cases where and are relatively small. In Table II, we have listed the values of for some small values of and .
V-B Variable-regular bipartite graphs
Theorem 9**.**
Consider the random ensemble of variable-regular bipartite graphs with variable degree and check node degree distribution , corresponding to the set of check node degrees , and let the number of nodes in the graphs tend to infinity. In , for fixed values , , and , we have
[TABLE]
where is the average check node degree, and the approximation is an asymptotic upper bound within the fixed multiplicative factor of from the exact value, with being Specht’s ratio, defined in Theorem 7, and .
Proof.
Using similar discussions as those in the proof of Theorem 8, we obtain the following equation:
[TABLE]
in which denotes the number of possible basic trees of v–size in the ensemble that are rooted at the th variable node out of variables nodes (of the cycle) whose basic trees all have v-size . The terms are random variables due to the randomness in the degree of check nodes participating in the basic trees. For different values of , these random variables are independent and identically distributed, and have the following expected value:
[TABLE]
where was defined in the proof of Theorem 8. To derive (51), we have used the fact that each check node that is a parent of one of the variable nodes in a basic tree of v-size has a degree selected randomly and independently from the distribution . Taking the expected value of (50), replacing (51) for in the expected value, and considering the fact that random variables are independent for different values of and , we obtain
[TABLE]
By replacing with (9), with the upper bound of (43), and using (41) for in (52), we derive (9). To prove the accuracy claimed in the theorem, we derive an asymptotic lower bound which is equal to the upper bound of (9) multiplied by . The term is due to lower bounding with the exact same steps as those taken in the proof of Theorem 8. The term is due to lower bounding with times the asymptotic upper bound of (9), as explained in Theorem 1 of [24], and then taking the smallest value of , by setting . ∎
V-C Irregular bipartite graphs
In irregular bipartite graphs, for a given value of , the ETSs whose asymptotic multiplicity is a non-zero constant on average can have a variety of values. In Proposition 2, we estimated the sum of all such ETSs over different values of . In this subsection, we derive asymptotic estimates for the expected number of ETSs in a given class. For a given value of , we start from the smallest value, i.e., . This corresponds to ETSs that consist of a simple cycle and some trees connected to the variable nodes of that simple cycle, and all the variable nodes in the ETS have degree . This class is of particular interest since it is well-known that among ETSs with the same value of , those with the smallest value are most harmful.
Theorem 10**.**
Consider the random ensemble of irregular bipartite graphs with variable and check node degree distributions and , respectively, where corresponds to the set of check node degrees . Let the number of nodes in the graphs of tend to infinity. In , for fixed values , , and , we have
[TABLE]
where , is the number of edges in the graph , and the approximation is an asymptotic upper bound within the fixed multiplicative factor of from the exact value, with Specht’s ratio defined in Theorem 7, and .
Proof.
Using similar discussions as those in the proof of Theorem 8, we obtain the following equation:
[TABLE]
where is the number of cycles of length in which the degree of each variable node is , and denotes the number of basic trees of v-size , whose roots have degree and whose other variable nodes all have degree , in the ensemble. The index is used to identify the root of the basic tree out of variables nodes (of the cycle) whose basic trees all have v-size .
Considering that each variable node in a basic tree has degree with probability given below:
[TABLE]
one can see that
[TABLE]
where is defined in the proof of Theorem 9, with its expected value calculated in (51). We thus have
[TABLE]
Taking the expected value of (54), and replacing with (55) in the expected value, we obtain
[TABLE]
To calculate the expected value , we use the result of Theorem 7 and obtain
[TABLE]
By replacing with (57), with the upper bound of (43), and using (41) for in (56), we derive (10).
The proof for the accuracy of the estimate follows the same steps as those used in the proof of Theorem 9 for the accuracy of Estimate (9). ∎
Remark 4**.**
One can use (56) along with (57) and the exact values of (see Table II), to obtain a more accurate estimate compared to (10).
In an irregular ensemble, for a given value of , ETSs in classes with in the range of have finite non-zero asymptotic expected multiplicity. One can use an approach similar to that of Theorem 10 to estimate such expected values. The derivations, however, are in general tedious, as the calculations involve ETSs that are in the same class but consist of different combinations of variable nodes with different degrees.
VI Numerical results
In this section, we present some numerical results in relation to the theoretical results presented in previous sections. In particular, we provide the multiplicities of different trapping set structures within randomly constructed LDPC codes and compare the results to the average values predicted by the asymptotic analysis. We first focus on LETS structures, and use the exhaustive search algorithms of [4] and [5] to find the multiplicity of LETS structures within different classes.
For the first experiment, we consider random biregular and LDPC codes with different block lengths as listed in Table III. The multiplicities of LETS structures in different classes with , for the codes, and with , for the codes, are listed in Tables IV and V, respectively. In both tables, we have also listed the asymptotic average number of LETS structures within each class obtained based on Corollary 2. For and codes, the asymptotic average value is non-zero only for classes with and , respectively. For these cases, the asymptotic average multiplicity of and classes is approximated by (10) as and , respectively.
The results of Tables IV and V show that the non-zero expected values provide a good approximation for the multiplicity of LETSs in and classes, for and codes, respectively, even at relatively short block lengths. The results also show that the multiplicity of other classes decrease with increasing (and tend asymptotically to zero). The rate of decrease, however, depends on the class and is faster for some classes than others.
In the second experiment, we consider irregular LDPC codes with degree distributions , and , and construct random codes of block lengths and . All the codes have rate and girth . To obtain the asymptotic expected number of LETS structures for this ensemble, we use the results presented in Section IV. For any given , we note that the asymptotic expected multiplicity is only non-zero for values in the range of . We also note that the degree distribution of this ensemble matches the one discussed in Example 32. Using the notations of Example 32, for the ensemble under consideration, we have , , , and . Replacing these in (32), we obtain
[TABLE]
Using (58), for , we obtain the results listed in the last column of Tables VI–IX, respectively. In each table, the results for different values of in the range of , are provided. As explained before, the asymptotic multiplicities are non-zero only for . In each table, we have also given the actual multiplicities of different LETSs for the five random codes. The comparison shows a good match between the theoretical asymptotic value and the finite-length numerical results even at relatively short block lengths.
As the next experiment, we consider the ensemble of biregular LDPC codes and find the asymptotic average multiplicity of ETSs for different values of . For a given , the ETS class is the only class with finite non-zero asymptotic multiplicity for this ensemble. We use (33) of Theorem 8 for the derivations, and assume that the girth .
For , since , in (33), we have . The outer summation therefore has only one term corresponding to . This implies that the inner summation has only one term that counts the expected number of cycles of length , which by (10) is equal to , asymptotically. We thus have
[TABLE]
For , the outer summation in (33) has two terms corresponding to . For the case of , the inner summation is responsible for counting the expected number of chordless -cycles, which by (10) is equal to , asymptotically. For the case of , there is only one sequence of values that satisfies , and that is and . For these values, the term inside the double-summation of (33) is calculated as:
[TABLE]
By adding the results for and , we obtain
[TABLE]
For , there are three terms in the outer summation involving the following calculations: (a) ; The inner summation counts the asymptotic expected number of chordless cycles of length , which is . (b) ; In this case, there is only one sequence , that satisfies , and that is . For this sequence, we have
[TABLE]
(c) ; In this case, we have two possible sequences of , satisfying , and they are , and . These sequences have the following contributions, respectively:
[TABLE]
[TABLE]
By adding the results of Case (a), (60) for Case (b), and (61) and (62), for Case (c), we obtain
[TABLE]
The asymptotic expected values of (33), just derived, are listed along with the multiplicity of ETSs in Codes in Table X. The comparison shows a good match between the two values across the range of block lengths. For the sake of completeness, for , we present the non-zero multiplicities of the other ETS classes within the range , , in Table XI. As expected, by increasing the block length the multiplicities decrease (and asymptotically tend to zero).
As another example to compare the asymptotic theoretical results of this paper for ETSs with those of finite-length random codes, we consider the -regular ensemble from which the codes in Table III are selected. Based on the results of Theorem 8, the asymptotic expected multiplicity of ETS classes for this ensemble can be approximated by (34), or more accurately by (38). (This is the only class of ETSs with non-zero asymptotic expected multiplicity for different values of .) In the following, we use (38) to estimate the multiplicity of ETSs for .
For , , and there is only one term corresponding to , and in the double-summation of (38). This term counts the asymptotic average number of cycles of length through (10), and is equal to . We thus have
[TABLE]
For , there are two terms in the outer summation: (a) ; In this case, the only term in the inner summation corresponds to , and is equal to the asymptotic average number of chordless cycles of length , obtained from (10) to be . (b) ; In this case there is only one sequence that satisfies . The contribution of this sequence in (38), considering that from Table II, is given by
[TABLE]
By adding the results for and , we obtain
[TABLE]
For , there are three terms in the outer summation of (38): (a) ; For this case, we have the contribution of , which counts the asymptotic expectted number of chordless -cycles. (b) ; In this case there is only one sequence satisfying , i.e., , , with the following contribution:
[TABLE]
(c) ; In this case, we have two possible sequences , satisfying . They are , and , with contributions:
[TABLE]
and
[TABLE]
respectively. (Note that from Table III, we have .) By adding the results of Case (a), (63), (64) and (65), we obtain
[TABLE]
The theoretical results, just derived, are listed in Table XII along with multiplicities of ETSs in classes of random codes. One can see a good match for different block lengths.
We end this section with deriving approximations for the asymptotic expected multiplicity of ETSs for , within the irregular ensemble with the following degree distributions: , and . This is the same ensemble we discussed earlier in this section for LETS distributions. We use the results of Theorem 10, and in particular Equations (56) and (57) to derive the expected values. (See Remark 4.) We assume , and note that for this ensemble , and thus Equation (56) can be used to find for different values of . For a given value of , the ETS class has the smallest value among all the classes with non-zero asymptotic expected multiplicity for this ensemble.
For ETSs, we have , and the only term in the double-summation of (56) is , which is calculated by (57) as . We thus have
[TABLE]
For ETSs, there are two term in the outer summation of (56): (a) ; For this case, the inner summation is , which can be calculated by (57) as . (b) ; In this case there is only one sequence , that satisfies . The contribution of this sequence in the inner summation of (56) is
[TABLE]
By substituting , , , and , in the above equation, we obtain the value . By adding the results of Cases (a) and (b), we have
[TABLE]
For ETSs, the outer summation of (56) has three terms: (a) ; For this case, the inner summation is , which by (57) has a value . (b) ; In this case, there is only one sequence , that satisfies . The contribution of this sequence is
[TABLE]
(c) ; In this case we have two sequences , that satisfy , and they are , and . For the first sequence the contribution is
[TABLE]
For the second sequence, the contribution is
[TABLE]
By adding the result of Case (a) with that of Case (b), i.e., (66), and those of Case (c), (67) and (68), we obtain
[TABLE]
The above results along with the multiplicity of corresponding ETS classes in randomly selected LDPC codes of different block length from the ensemble are listed in Table XIII.
VII Conclusion
In this paper, we studied the asymptotic behavior of local structures within the Tanner graph of randomly constructed regular and irregular LDPC codes, as the code’s block length tends to infinity. Examples of such structures are different categories of trapping sets, such as stopping sets, elementary trapping sets, or absorbing sets, that are harmful in the error floor region of LDPC codes. We derived a simple asymptotic relationship for the expected number of such structures based on the difference between the number of nodes and the number of edges of the structure. This was then related to the number of cycles in the structure, where we demonstrated that depending on the structure having, zero, one, or more than one cycle, the asymptotic expected value is infinity, a constant non-zero value, or zero, respectively. This general result was then applied to different categories of trapping sets to derive more specific results on the asymptotic expected values of different structures and classes of structures. In particular, for the case where the asymptotic expected value of a structure is a non-zero constant, we derived the exact value or approximations for the constant value in terms of the girth and the degree distributions of the ensemble. For these derivations, we used the existing asymptotic results of [24] on the average number of cycles. We also extended the results of [24] to count the number of cycles with specific combination of variable and check node degrees. Moreover, to count the number of elementary trapping sets, we formulated the counting problem recursively, and solved it using a generalization of Catalan numbers. We also demonstrated through numerical results that the asymptotic results derived in this paper are rather accurate even at finite block lengths.
An important aspect of this work was to demonstrate that different (non-isomorphic) structures within the same class of trapping sets can, in general, behave differently, in the asymptotic regime of infinite block length. This was not investigated in previous studies where the focus had been on the asymptotic behavior of the whole class rather than its individual members. In addition, to the best of our knowledge, the results presented here, are the most general in the literature, in the sense that the framework developed in this work is applicable to any local substructure of a Tanner graph. Previous works were limited to specific categories of substructures such as stopping sets, elementary trapping sets or absorbing sets. In fact, the results presented here for leafless elementary trapping sets, which are the vast majority of dominant trapping sets over the AWGN channel and the BSC, are the first of their kind. We also believe that the results presented in this work on the cases where the asymptotic expected multiplicity of a structure is a non-zero constant are the first of their kind, in the sense that, while the fact that the asymptotic average multiplicity of a certain class of structures tends to a constant was reported in the existing literature, the exact value of that constant was not determined. In this work, the exact value or an estimate for such constants is provided as a function of girth and degree distributions of the ensemble.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. Di, D. Proietti, I. E. Telatar, T. J. Richardson, and R. L. Urbanke, “Finite-length analysis of low-density parity-check codes on the binary erasure channel,” IEEE Trans. Inf. Theory , vol. 48, no. 6, pp. 1570–1579, Jun. 2002.
- 2[2] M. Karimi and A. H. Banihashemi, “Efficient algorithm for finding dominant trapping sets of LDPC codes,” IEEE Trans. Inform. Theory , vol. 58, no. 11, pp. 6942–6958, Nov. 2012.
- 3[3] M. Karimi and A. H. Banihashemi, “On characterization of elementary trapping sets of variable-regular LDPC codes,” IEEE Trans. Inf. Theory , vol. 60, no. 9, pp. 5188–5203, Sept. 2014.
- 4[4] Y. Hashemi and A. H. Banihashemi, “New characterization and efficient exhaustive search algorithm for leafless elementary trapping sets of variable-regular LDPC codes,” IEEE Trans. Inform. Theory , vol. 62, no. 12, pp. 6713 – 6736, Dec. 2016.
- 5[5] Y. Hashemi and A. H. Banihashemi, “Characterization and efficient exhaustive search algorithm for elementary trapping sets of irregular LDPC codes,” IEEE Trans. Inf. Theory , vol. 64, no. 5, pp. 3411–3430, May 2018.
- 6[6] H. Xiao and A. H. Banihashemi, “Estimation of bit and frame error rates of finite-length low-density parity-check codes on binary symmetric channels,” IEEE Trans. Commun. , vol. 55, no. 12, pp. 2234–2239, Dec. 2007.
- 7[7] L. Dolecek, Z. Zhang, V. Anantharam, M. J. Wainwright, and B. Nikolic, “Analysis of absorbing sets and fully absorbing sets of array-based LDPC codes,” IEEE Trans. Inf. Theory , vol. 56, no. 1, pp. 181–201, Jan. 2010.
- 8[8] A. Mc Gregor and O. Milenkovic, “On the hardness of approximating stopping and trapping sets,” IEEE Trans. Inform. Theory , vol. 56, no. 4, pp. 1640–1650, Apr. 2010.
