TL;DR
This paper develops a novel, accurate approximation framework for the probability of successful data recovery in sparse Random Linear Network Coding, enhancing reliability analysis for future smart city broadcast systems.
Contribution
It introduces a Stein--Chen based performance framework that provides tight probability approximations applicable to various system parameters, surpassing existing bounds.
Findings
Approximation closely matches Monte Carlo simulations
Significant improvement over existing performance bounds
Applicable to diverse system configurations
Abstract
Ultra-reliable Point-to-Multipoint (PtM) communications are expected to become pivotal in networks offering future dependable services for smart cities. In this regard, sparse Random Linear Network Coding (RLNC) techniques have been widely employed to provide an efficient way to improve the reliability of broadcast and multicast data streams. This paper addresses the pressing concern of providing a tight approximation to the probability of a user recovering a data stream protected by this kind of coding technique. In particular, by exploiting the Stein--Chen method, we provide a novel and general performance framework applicable to any combination of system and service parameters, such as finite field sizes, lengths of the data stream and level of sparsity. The deviation of the proposed approximation from Monte Carlo simulations is negligible, improving significantly on the state of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Reliability of Broadcast Communications Under
Sparse Random Linear Network Coding
Suzie Brown, Oliver Johnson and Andrea Tassi
This work is partially supported by the University of Bristol Faculty of Science, the School of Mathematics and the VENTURER Project, which is supported by Innovate UK under Grant Number 102202. S. Brown and O. Johnson are with the School of Mathematics, University of Bristol, UK (e-mail: [email protected], [email protected]). A. Tassi is with the Department of Electrical and Electronic Engineering, University of Bristol, UK (e-mail: [email protected]).
Abstract
Ultra-reliable Point-to-Multipoint (PtM) communications are expected to become pivotal in networks offering future dependable services for smart cities. In this regard, sparse Random Linear Network Coding (RLNC) techniques have been widely employed to provide an efficient way to improve the reliability of broadcast and multicast data streams. This paper addresses the pressing concern of providing a tight approximation to the probability of a user recovering a data stream protected by this kind of coding technique. In particular, by exploiting the Stein–Chen method, we provide a novel and general performance framework applicable to any combination of system and service parameters, such as finite field sizes, lengths of the data stream and level of sparsity. The deviation of the proposed approximation from Monte Carlo simulations is negligible, improving significantly on the state of the art performance bounds.
Index Terms:
Sparse random network coding, broadcast communication, multicast communications, Stein–Chen method.
I Introduction
In next-generation networks, reliable broadcast communication is expected to be critical. In particular, this holds true in future networks of self-driving vehicles where road-side base stations (BSs) will broadcast live sensor data [1]. For example in the 5G-PPP’s “bird’s eye” use case, live 3D Light Detection and Ranging (LiDAR) scans of vehicles engaging a traffic junction are broadcast to the incoming vehicles – enabling them to take an informed decision on how to safely drive through the junction [1]. In this kind of network, a key performance indicator is the user delivery probability, defined as the probability of a user successfully recovering the transmitted data stream.
Generally, modern communication systems enhance the reliability of Point-to-Multipoint (PtM) data streams by employing Application Level-Forward Error Correction (AL-FEC) techniques, which are usually based on Luby Transform (LT) or Raptor codes [2, 3]. These kinds of codes only operate to their capacity if large block lengths are employed, which could be a problem in the presence of delay-sensitive services [2]. For this reason, in our system model reliability of PtM data streams is ensured via the Random Linear Network Coding (RLNC) approach [4].
The RLNC approach requires the BS to split each PtM data stream into source packets, which form a source message. A sequence of coded packets is obtained in a rateless fashion by linearly combining the source packets. A user recovers the PtM data stream as soon as it collects linearly independent coded packets [4, 5]. A drawback of the RLNC approach is the computational complexity of the decoding phase, which is a function of and the finite field size considered during the encoding phase [6, 7]. Tassi et al. [8] observed that this complexity can be significantly reduced by adopting a sparse implementation of the RLNC approach, where the number of non-zero elements in the encoding matrix is smaller. However, as the encoding matrix becomes sparser, the number of coded packet transmissions needed by a user to recover the source message is likely to increase. To date, an exact expression for the user delivery probability as a function of the sparsity and number of coded packet transmissions is still unknown.
The key contribution of this paper is a tight approximation to the user delivery probability in a system where broadcast source messages are protected by sparse RLNC (see Section III). Our approximation is valid for any finite field size, sparsity level, and data stream length. As shown in Section IV, the deviation of the proposed model from simulation results is negligible. Our approximation enables service providers to increase the sparsity level (reducing the complexity of the decoding phase) while ensuring a target user delivery probability.
The lack of an exact performance model for sparse RLNC implementations is caused by the lack of an accurate expression for the probability of a sparse random matrix, generated over a finite field, being full rank [9, 8, 6]. Garrido et al. [6] proposed models based on absorbing Markov chains to characterize the user performance, in a particular implementation of sparse RLNC where the number of source packets employed to generate each coded packet is fixed. This assumption significantly simplifies the performance modeling issue, yet [6] mostly relies on Monte Carlo simulation to estimate, via a regression technique, the statistical correlation between the rows of full rank sparse matrices.
Unlike [6], this paper refers to a more general sparse RLNC scheme where a source packet participates in the generation of a coded packet with probability , for . With regards to this general sparse RLNC formulation, Tassi et al. [8] proposed the first performance model valid for any finite field size, data stream length and probability . In particular, the probability bound proved in [10, Theorem 6.3] allowed the authors [8] to derive a tractable but not tight performance bound. More recently, the theoretical framework proposed by A. Khan et al. [11] extended [10, Theorem 6.3] and can be directly used to upper- and lower-bound the user delivery probability. However, again often these bounds are not tight.
In this paper, we address the limitations of the previous studies and provide the following contributions:
- •
We propose an accurate expression for the user delivery probability suitable for general sparse RLNC formulations and applicable to any combination of system parameters, which overcomes the lack of generality of the model proposed in [12] and [6]. In particular, with regards to [6], a new set of Monte Carlo simulations are required to re-derive a performance model as the field size, the number of source packets defining a source message or the number of source packets involved in the generation of each coded packet changes.
- •
Unlike most recent works [8, 11] which build upon [10, Theorem 6.3], we approximate the user delivery probability by employing a novel mathematical framework based on the Stein-Chein method. In fact, for and , [10, Theorem 6.3] notoriously is not a tight lower-bound to the probability of a sparse random matrix being full rank [8], with a subsequent impact on the estimation of the user delivery probability.
- •
Regardless of the field size and level of sparsity of the encoding matrix, our approximation of the user delivery probability is very close to simulated values. On the other hand, the state of the art upper- and lower-bound to the user delivery probability proposed in [11] significantly deviate from Monte Carlo simulations for a binary field, with the lower-bound performing better than the upper-bound. For larger field sizes, both our approximation and the upper-bound as per [11] to the user delivery probability tightly follow simulation results but, in this case, the lower-bound as per [11] significantly deviates from our Monte Carlo simulations. As such, unlike our approximation, neither the upper-bound nor lower-bound consistently give a tight approximation of the user delivery probability, regardless of the field size.
The rest of the paper is organized as follows. Section II presents the considered system model. Section III discusses the proposed performance characterization model for sparse RLNC implementations and states our novel approximate result in Theorem III.1. The accuracy of the proposed performance model is considered using Monte Carlo simulation in Section IV. Finally, in Section V, we draw our conclusions.
II System Model
We consider a system model where one transmitter broadcasts a stream of coded packets to multiple receiving nodes, over a channel with packet error probability equal to . We assume that the transmission time of a coded packet is equal to one time step, and that the time needed to transmit coded packets is equal to time steps.
We say that a source message consists of source packets where consists of elements of a finite field of size . A coded packet is also formed by elements from and is defined as where is referred to as a coding coefficient. Provided that coded packets have been broadcast by the transmitter, the input to the broadcast channel can be expressed in matrix notation as . The matrix is defined by elements , i.e., . Coding coefficients are chosen at random over , in an identical and independent fashion according to the following probability law [8]:
[TABLE]
where . The greater the value of , the more likely that a coding coefficient is equal to [math], so we observe that the average number of source packets actively participating in the generation of a coded packet is a function of . The ‘classic’ RLNC scheme refers to equal to [8] (so the coding coefficients are uniform on ), ‘sparse’ RLNC schemes are characterized by .
Let be the set of coded packets that have been successfully received by a user, for . At the receiving end, each user populates a decoding matrix with the columns of associated with the coded packets that have been successfully received. Finally, relation holds. The source message is recovered as soon as becomes full rank and hence, contains a invertible matrix.
III Performance Analysis
Based on [4], we observe that the probability of a user to recover a source message, i.e., the user delivery probability, as a function of can be expressed as follows:
[TABLE]
where is the probability of a decoding matrix being full rank, as a function of . In the case of classic RLNC, it is known that \mathrm{R}_{K,n}(p)_{\big{|}_{p=1/q}}=\prod_{t=0}^{K-1}\left[1-\frac{1}{q^{n-t}}\right] exactly [4]. For sparse RLNC schemes, an exact expression for is still unknown but, as proposed in [11], it can be approximated by means of the following lower-bound
[TABLE]
and upper-bound
[TABLE]
where and . From [10], it follows that and are the lower- and upper-bound to the probability, for , of a being non-full rank, respectively. Finally, we write for the probability that any set of rows of a matrix sums to the zero vector in , which can be expressed (directly following from [9, Eq. (5)]), for and , as:
[TABLE]
Both (3) and (4) are based on and , which essentially account for the event that some sets of rows form a non-full rank matrix. Overall, and give a notoriously not tight approximation of . This holds true especially for and [8]. In addition, the right-hand terms in the minimization of (3) and in the maximization of (4) represent the probability of having any set of rows that linearly combined sums to the zero vector and the probability of having any groups of rows that are equal to the zero vector, respectively. In both cases, these events are significantly different to the event that some submatrix in is not full rank – thus impacting on the tightness of (3) and (4). In the remainder of this section, we address this issue by providing a novel expression for , which tightly approximates the user delivery probability across a large range of system parameters.
III-A Proposed Performance Model for Sparse RLNC
Therefore, we consider the key research question: Given a decoding matrix , formed according to the probability model (1), what is the probability that has rank ? We remark that for , the source message cannot be recovered, i.e., is equal to [math]. In the remainder of this section, we focus on the case where and we wish to know whether the rows of form a linearly independent set, i.e., the rank of is . We give the following definition.
Definition III.1
Write for a set of labels. For each , we regard as a subset of the set of indices composed of items.
It is immediate to prove that the following remark holds.
Remark III.1
Matrix is full rank if and only if no linear combinations of any sets of rows indexed by a sums to the zero vector over the field . For , we can consider the collection of events
[TABLE]
where we write for the -th row of , and where addition is understood to be over . We know that is full rank if and only if none of the events occur for any .
Example III.1
Consider the following matrix when :
[TABLE]
In this case, rows 1 and 4 are identical, so occurs. Further, rows 2,3 and 5 sum to zero over , so also occurs. In addition, since both these sets of rows sum to zero, their union must also sum to zero, so also occurs.
Example III.1 illustrates why it is not sufficiently accurate to estimate the full-rank probability of by considering the expected number of events which occur, using the expression for the probability of each individual . This approach ignores the fact that such events are positively correlated. In general, given disjoint sets such that occur, then will occur for each of the sets formed as unions of the
Our proposed performance framework builds upon a different set of statistical events, defined as follows.
Definition III.2
Let be defined as follows
[TABLE]
which is the event that the rows indexed by sum to the zero vector in but that no collection of rows indexed by a proper subset of sums to the zero vector.
In general, from Definition III.2, we have the following remark.
Remark III.2
Matrix is full rank if and only if none of the events occurs, for . This choice of events significantly mitigates the impact of the correlation among events observed in Example III.1. There, and both occur (since no subset of them sums to zero), however does not. This enables us to derive a tighter approximation of .
The proposed derivation of involves two approximation steps: (i) We approximate the probability of event happening for any set consisting of a given number of items, and (ii) Since results based on the Stein-Chen method [13, 14] show the sum of approximately independent zero–one variables with small probability of being one is close to Poisson, we approximate with a negative exponential function. Firstly, we consider the following quantities:
Definition III.3
For each :
For each such that the set has cardinality , the event has the same probability of happening, defined as
[TABLE] 2. 2.
We define a further quantity recursively as follows:
[TABLE]
where (since taking gives an empty sum) .
Lemma III.1
Term (defined in (8)) can be approximated as (defined in (9)).
Proof:
See Appendix A. ∎
We observe that an obvious way in which can fail to have full rank is that a particular row is identically zero. Indeed, considering this event gives an upper bound on , which for certain parameter values can be reasonably tight. For this reason, we condition out these events as follows:
[TABLE]
where we can write the first term of (10) directly as .
From Lemma III.1, we prove the following result.
Theorem III.1
We approximate the second term of (10) as
[TABLE]
where
[TABLE]
so we approximate as follows:
[TABLE]
Proof:
See Appendix A. ∎
The most computationally intensive part of calculating (13) is the derivation of , which requires operations. However, since the expression of is independent of , it has to be computed only once to approximate . In addition, for and , the average time needed to compute111Tests performed by running our benchmark code on one core of an Intel Xeon CPU E5-2650v4 operated at . (normalized by ) is equal to \text{,}\mathrm{s}, $7.7\cdot 10^{-3}$\text{\,}\mathrm{s}, \text{,}\mathrm{s} and $4.3\cdot 10^{-1}$\text{\,}\mathrm{s}, respectively. It is also key to note that Theorem III.1 allows us to decouple the impact that any submatrix of has on the approximation of as in (13), for . As the following remark explains, this allows us to further approximate (13) by reducing the number of summation terms defining and hence, reducing the computational complexity of the approximation (13).
Remark III.3
Consider the set of all the submatrices of , then approximates the probability that at least one of these submatrices is not full rank, assuming has no zero rows. For this reason, the approximation of given in (13) can be further approximated by referring to those submatrices of composed by up to rows, for . As such, with define the -th approximation order of (13) as follows:
[TABLE]
Let us consider the following approximation order optimization (AOO) problem222For the sake of compactness, with a slight abuse of notation, we say that is always equal to , for any .:
[TABLE]
where function is defined as . The solution to the AOO problem represents the smallest-order approximation of (13) associated with a target error value or such that is smaller than , for .
Remark III.4
From (12), it follows that term is a non-decreasing function of , i.e., relation holds. As such, for any given , and , the error function attains only one maximum, for . For this reason, the AOO problem can be solved iteratively evaluating , for , until and or is smaller than or equal to .
Remark III.5
From Remark III.2, is full rank if and only if none of the events occurs, for , and . However, for non-binary fields, the aforementioned statement captures a subsets of events when a random matrix is full rank. As such, we propose to use (13) to approximate , for .
IV Analytical Results
This section compares the approximation we proposed in Theorem III.1 against the approximation (3) and (4). Both our simulator and the implementation of the proposed theoretical framework are available online [15].
Fig. 1 shows the relationship that exists between the order of the approximation as in (14) and the number of received coded packets in , for , a source message composed by packets and . From (14), we remark that, for a given value of , is a non-increasing function of . This is directly related to the fact that small approximation orders account for submatrices of composed of a reduced number of rows. This can be intuitively explained by considering the extreme case where is large compared to . In this case, the probability of being full rank can be approximated by the probability of having a set of (non-zero) rows of where no rows are identical – this corresponds to the case where is set equal to .
The aforementioned facts are confirmed by Fig. 1. For instance, for , the value of drops from () to () to remain almost unchanged for . In particular, by solving the AOO problem for , and , we obtain an optimal value of equal to as per Remark III.3. We also observe that the value of appears to sharply decrease as increases, which makes computationally convenient to approximate with . For instance, Fig. 1 shows that the error function takes values smaller than or equal to for and – thus making it pointless to approximate with an heuristic order equal to or greater than . In the remainder of this section, to highlight the accuracy of our approximation, we will refer to a value of or .
Fig. 2 compares the user delivery probability for and and . We compare the value for implied by (2), substituting the approximations to given by (3), (4) and our proposal in (14) to the probability estimated by Monte Carlo simulations. Results are given as a function of , which represents the transmission overhead, i.e., the number of coded packets in excess of that are transmitted. Assuming that the time needed to transmit each coded packet is fixed and equal to one time slot, the goodput of the system can be immediately expressed as the bit length of the source message divided by the time duration of time slots. For concreteness, we considered a value of packet error probability , which is the maximum transport block error probability regarded as acceptable in a Long Term Evolution-Advanced (LTE-A) system [4]. In particular, in the case where , Fig. 2a shows that the maximum gap between our proposed approximation (14) and simulation results is equal to , which occurs for . Fig. 2b refers to the case when and shows that the gap between (14) and simulation results is negligible.
In contrast, both Fig. 2a and 2b show that approximating using the state of the art (3) and (4) leads to significantly deviate from the simulation results. For instance, for , and , the absolute deviation can be up to and , in the case of (3) and (4), respectively. In general, the maximum Mean Squared Error (MSE) between simulations and our proposed approximation (14) is experienced for and and it is equal to . That is smaller than the corresponding MSEs between simulation and approximations (3) and (4), which are equal to and , respectively (between and times smaller). In addition, Fig. 2b shows that, for , our proposal overlaps simulation results while the considered alternatives significantly deviate. In this case, the MSE of our approximation is between times (for ) and times () smaller than in the case of (3) and, between times () and times (for ) smaller than in the case of (4).
Fig. 3 shows that for , our approximation either overlaps (for ) or marginally diverges from simulation results (). In the latter case, we observe that the (absolute gap) never exceeds and the maximum MSE is equal to . Similar behavior is also exhibited by the case where is approximated as in (3) or (4) and . However, as increases to , the approximation based on (3) significantly deviates from simulation results even of a quantity larger than ( and ).
Generally, from Figs. 2 and 3 we observe that, for binary fields (with the only exception of and ), the approximation based on (3) is tighter than that based on (4). However, the exact opposite holds as both and increase. Our proposed approximation avoids this issue. In fact, our solution tightly approximates simulation results, for all the cases considered. These conclusions are also confirmed by Fig. 4, which shows the probability as a function of , for , .
As an immediate application of a tighter approximation of the user delivery probability, we can accurately estimate the average transmission overhead needed for a user to recover a source message (that is ). In particular, Fig. 5 shows the average transmission overhead as a function of , for and and, and . Direct proof of the quality of the proposed approximation is given by the fact that the deviation between theoretical and simulation results is negligible across the whole range of parameters – the maximum gap between theory and simulations is equal to and occurs for and .
V Conclusion
This paper presented a novel approximated performance model for a sparse RLNC implementation. The proposed model exploits the Stein-Chen method to derive a tight approximation to the probability of a user recovering a source message. Analytical results show that the Mean Squared Error (MSE) between our approximation and simulation results, for and , never exceeds and , respectively. On the other hand, the state of the art bounds are not always tight. For instance, when our proposal is between and times closer in MSE to simulations.
Appendix A
Proof:
By symmetry it is enough to consider a subset of rows of the form , where . The key is to fix one row (say row ) and to consider the smallest set of rows containing row which sums to zero. Consider , a subset of with , and say that event occurs when both: (i) the rows of with indices in add to zero and, (ii) rows with indices in add to zero, but no subset of these rows add to zero, i.e. event occurs see (7). By considering (i) and (ii) together, occurs when the rows in both the sets and (but no subset of ) add to zero. In other words equals . Since rows in are statistically independent, for each of size , the event occurs with probability
[TABLE]
Furthermore, since for any we observe:
[TABLE]
This relation holds because (i) we assume events are approximately disjoint (ii) there are possible sets of size containing 1. In Example III.1, if then occurs as previously discussed, indeed so does with . This concludes the proof. ∎
Proof:
We write for the collection of indices such that and define the random variable
[TABLE]
where indicator function equals if a particular event has occurred, or [math] otherwise. Observe that {\mathbb{P}}(\mbox{\mathbf{M} is full rank}\;|\;\{\mbox{\mathbf{M} has no zero rows}\})={\mathbb{P}}(W=0). By (8), if has elements, independence of the rows means that the probability
[TABLE]
Hence, by counting sets of different sizes in , we it follows that . Further, is the sum of a large number of zero–one variables, each of which equals one with small probability, and where each random variable in the sum is independent of a large proportion of the other terms. These are the conditions under which is close to Poisson, as shown by the Stein–Chen method [13, 14]. Approximation (13) follows because [13, Theorem 1] means that . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] “5G-PPP White Paper on Automotive Vertical Sector,” 5G Infrastructure Public Private Partnership, Tech. Rep., Oct. 2015. [Online]. Available: https://5g-ppp.eu/wp-content/uploads/2014/02/5G-PPP-White-Paper-on-Automotive-Vertical-Sectors.pdf
- 2[2] E. Magli, M. Wang, P. Frossard, and A. Markopoulou, “Network Coding Meets Multimedia: A Review,” IEEE Trans. Multimedia , vol. 15, no. 5, pp. 1195–1212, 2013.
- 3[3] P. Wang, G. Mao, Z. Lin, M. Ding, W. Liang, X. Ge, and Z. Lin, “Performance Analysis of Raptor Codes Under Maximum Likelihood Decoding,” IEEE Trans. Commun. , vol. 64, no. 3, pp. 906–917, Mar. 2016.
- 4[4] A. Tassi, I. Chatzigeorgiou, and D. Vukobratović, “Resource-Allocation Frameworks for Network-Coded Layered Multimedia Multicast Services,” IEEE J. Sel. Areas Commun. , vol. 33, no. 2, pp. 141–155, Feb. 2015.
- 5[5] E. Tsimbalo, A. Tassi, and R. J. Piechocki, “Reliability of Multicast under Random Linear Network Coding,” Oct. 2017. [Online]. Available: http://arxiv.org/abs/1709.05477
- 6[6] P. Garrido, D. E. Lucani, and R. Agüero, “Markov Chain Model for the Decoding Probability of Sparse Network Coding,” IEEE Trans. Commun. , vol. 65, no. 4, pp. 1675–1685, Apr. 2017.
- 7[7] D. Andrén, L. Hellstrøm, and K. Markstrøm, “On the Complexity of Matrix Reduction Over Finite Fields,” Advances in Applied Mathematics , vol. 39, no. 4, pp. 428–452, 2007.
- 8[8] A. Tassi, I. Chatzigeorgiou, and D. E. Lucani, “Analysis and Optimization of Sparse Random Linear Network Coding for Reliable Multicast Services,” IEEE Trans. Commun. , vol. 64, no. 1, pp. 285–299, Jan. 2016.
