Computation of the spectrum of $\text{dc}^2$-balanced codes
Kees A. Schouhamer Immink, Kui Cai

TL;DR
This paper develops an approximation method for analyzing the spectrum of dc2-balanced codes using the central limit theorem, achieving high accuracy for codeword length 256.
Contribution
It introduces a cubic approximation for the auto-correlation function of dc2-balanced codes and quantifies the spectral approximation error.
Findings
Auto-correlation function approximated by a cubic function
Spectrum approximation error less than 0.04 dB for n=256
Method provides accurate spectral analysis of dc2-balanced codes
Abstract
We apply the central limit theorem for deriving approximations to the auto-correlation function and power density function (spectrum) of second-order spectral null (dc2-balanced) codes.We show that the auto-correlation function of dc2-balanced codes can be accurately approximated by a cubic function. We show that the difference between the approximate and exact spectrum is less than 0.04 dB for codeword length n = 256.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptical Wireless Communication Technologies · graph theory and CDMA systems · Error Correcting Code Techniques
Computation of the spectrum of -balanced codes
††thanks: Kees A. Schouhamer Immink is with Turing Machines Inc, Willemskade 15d, 3016 DK Rotterdam, The Netherlands. E-mail: [email protected]. ††thanks: Kui Cai is with Singapore University of Technology and Design (SUTD), 8 Somapah Rd, 487372, Singapore. E-mail: [email protected]. ††thanks: This work is supported by Singapore Agency of Science and Technology (A*Star) PSF research grant and SUTD-ZJU grant ZJURP1500102
Kees A. Schouhamer Immink and Kui Cai
Abstract
We apply the central limit theorem for deriving approximations to the auto-correlation function and power density function (spectrum) of second-order spectral null (-balanced) codes. We show that the auto-correlation function of -balanced codes can be accurately approximated by a cubic function. We show that the difference between the approximate and exact spectrum is less than 0.04 dB for codeword length .
I Introduction
Spectral null, or dc-balanced, codes have been applied in cable transmission [1], [2], magnetic recording [3], and optical recording systems [4], [5]. Spectral null codes have recently been advocated in visible light communications (VLC) systems, where light intensity of solid-state light sources, mostly LEDs, are varied [6]. It is desirable that the intensity variation of the light is invisible to the users, that is, annoying flicker should be mitigated [7]. This requirement implies that the spectrum of the modulated signal should not contain low-frequency components. Light sources are usually connected to the AC power grid, and therefore generate interference components at 50, 60 Hz, or the higher harmonics. Rejection of these interfering components can easily be accomplished by high-pass filtering, but in order not to degrade the wanted communication signal by this filtering, low-frequency components should be absent in the modulated signal. Three types of dc-balanced codes, the Manchester code (bi-phase), a 4B6B code, and a 8B10B code, have been adopted in VLC standard IEEE 802.15.7-2011 [8] for flicker mitigation and dimming control [6], [9].
Higher-order spectral null codes, such as -balanced codes, offer a greater rejection of the low-frequency components than regular dc-balanced codes [10]. Constructions of higher-order spectral null codes have been presented in for example [11], [12], [13], [14], [15], [16]. Spectral properties of higher-order spectral null codes have been published for small values of the codeword length [10]. For larger values of , Immink and Cai [17] have presented simple expressions for approximating the auto-correlation function and spectrum of higher-order spectral null codes. We apply statistical arguments for deriving improved approximations to the auto-correlation function and spectrum of -balanced codes.
Section II commences with background on -balanced codes. In Section III, we derive an approximation to the auto-correlation function and spectrum of -balanced codes for asymptotically large values of the codeword length by counting -balanced codewords using the central limit theorem. Approximations for asymptotically large will be discussed in Section III-C. In Section IV, we appraise the spectral performance of -balanced codes. Section V shows our conclusions.
II Background on -balanced Block Codes
Let the -bit codeword \mbox{\boldmathx}=(x_{1},x_{2},\ldots,x_{n}) over the binary symbol alphabet =, be a member of a codebook . The encoder emits codewords from randomly and independently (i.i.d.). The auto-correlation function, , of a sequence of codeword symbols is given by [18], [19], [20]
[TABLE]
where denotes the cardinality of and , , is the bipolar representation of . If both and its inverse \bar{}\mbox{\boldmathx} are members of , then the power spectral density (psd), in short spectrum, versus frequency of the emitted symbol sequence is
[TABLE]
A regular ‘full-set’ dc-balanced block code comprises all possible codewords that have equal numbers of 0’s and 1’s ( even). Franklin and Pierce [2] showed that the spectrum of a full-set dc-balanced block code has a null at the zero frequency, that is, . -balanced spectral null codes are dc-balanced codes that satisfy a second condition, namely
[TABLE]
where denotes the second derivative of at . Note that the above frequency domain conditions imply, see (2), that
[TABLE]
A codeword, , is -balanced if it satisfies [10], [21]
[TABLE]
A block code comprising a full set of -balanced codewords, denoted by , is defined by
[TABLE]
The set is empty if [10]. Let \mbox{\boldmathx}\in S_{2} then its reverse \mbox{\boldmathx}_{r}=(x_{n},\ldots,x_{1})\in S_{2}, since for a \mbox{\boldmathx}\in S_{2}
[TABLE]
A useful metric of the low-frequency spectral content, denoted by , called Low Frequency Spectral Weight (LFSW) [15], is the first non-zero coefficient of the Taylor expansion of (2), that is,
[TABLE]
We derive from (2) that
[TABLE]
The number of -balanced codewords, denoted by , for asymptotically large , equals [16], [22]
[TABLE]
In the range we have found experimentally that a better approximation is found by applying a small correction term, namely
[TABLE]
We consider here the spectral properties of full-set block codes, that is, denotes the set of all possible words, , that satisfy condition (5). Finding an expression of the spectral properties of a full-set for large values of is an open problem as the computation requires the evaluation of (1) for each \mbox{\boldmathx}\in S_{2} [12]. In the next section, our main contribution, we address an alternative method, which is based on statistical analysis, which gives a simple and good approximation to the spectrum.
III Auto-correlation function
Let be a codeword in , and let and , , , be two (different) index positions in the codeword . Then, we obtain for the average correlation, denoted by , between the symbols at positions and averaged over all codewords \mbox{\boldmathx}\in S_{2},
[TABLE]
where denotes the number of -balanced codewords that satisfy condition . Then, using (1) and (1), we find the auto-correlation function
[TABLE]
For reasons of symmetry, we have
[TABLE]
By using the central limit theorem, we compute below an approximation to the number of -balanced codewords that have a ‘1’ at positions and , , for asymptotically large values of .
III-A Counting of codewords using the central limit theorem
The number of -balanced codewords, , , that is required for computing the auto-correlation function using (13) and (14), can be computed using generating functions. For very large , however, this rapidly becomes an impractically cumbersome exercise, and an efficient alternative method is considered a desideratum.
To that end, we exploit the central limit theorem by regarding the integer variables as i.i.d. binary random variables whose numerical outcomes ‘0’ or ‘1’ are equally likely. We define the stochastic variables and by
[TABLE]
and
[TABLE]
where , .
The central limit theorem [23], Chapter 8, states that for asymptotically large the distribution of the stochastic variables and , which are obtained by summing a large number, , of independent stochastic variables, approaches a two-dimensional Gaussian distribution.
Let denote the expected value operator for all possible codewords in . Let the parameters and denote the average of and , and let and denote the variance of and . The parameter denotes the linear correlation coefficient between the random variables and . Then the bi-variate Gaussian distribution, denoted by , is given by
[TABLE]
where
[TABLE]
[TABLE]
and
[TABLE]
We have
[TABLE]
[TABLE]
and . We may find after a routine computation using (20) that
[TABLE]
and similarly
[TABLE]
The variances , , and the correlation coefficient can be found without too much difficulty:
[TABLE]
[TABLE]
and
[TABLE]
The total number of -sequences with , equals , so that for asymptotically large , the number of -sequences versus and , denoted by , can be approximated by
[TABLE]
A -balanced codeword satisfies, by definition, the conditions, see (5), and . Then, is found after substituting and into (24). We find
[TABLE]
[TABLE]
After combining (10), (14), and (25), we obtain
[TABLE]
In order to reduce the clerical work and offer more insight, we define the four (real) variables
[TABLE]
[TABLE]
[TABLE]
and
[TABLE]
With some effort we find the expressions
[TABLE]
and
[TABLE]
We finally obtain
[TABLE]
where we can easily verify, since and \mbox{\boldmathx}_{r}\in S_{2}, see (7), that
[TABLE]
The auto-correlation function, , is found using (13). In the next subsection, we show results of computations.
III-B Results of computations
By invoking (13) and (30) we are now able to compute an estimate of the auto-correlation function . Figure 1 shows results of computations for , 64, and 128. As a comparison we plotted the exact auto-correlation function of a full set of -balanced sequences, denoted by , which was computed using an enumeration technique and generating functions [10].
The accuracy of the approximate auto-correlation function, , cannot easily be determined from Figure 1 for the larger values of . Figure 2, Curve ‘without correction’, shows , the difference between the two auto-correlation functions versus for the selected and 256. We notice that the difference between the two functions decreases with increasing and . For in the whole range the difference, , is less than .
Although the difference, , is relatively small, especially for larger , see Figure 2, the ‘checks’, see (4), and , which accumulate the small error differences, are not necessarily satisfied. We have observed that with increasing that is converging to zero (as it should), while is not. As a result, the spectra, computed using do not satisfy the spectral conditions (3).
We propose to add a small correction term to so that both ‘checks’, and , are satisfied. We add to the correction term , where the (real) parameters, and , are chosen such that and . Define
[TABLE]
and
[TABLE]
then we find two linear equations with two unknowns, and , namely
[TABLE]
and
[TABLE]
After solving the above system, where we substitute the well-known expressions for , , we obtain
[TABLE]
and
[TABLE]
For example, for , we find that and . So that and . The result of the correction can be seen in Figure 2, curves ‘with correction’, for and 256. We notice in the range a significant improvement in the accuracy of the estimate of the auto-correlation function.
III-C Further approximations for asymptotically large
With (30) we can straightforwardly compute the auto-correlation function and spectrum . In this section, we attempt to approximate for asymptotically large , which might offer more insight in the trade-offs between redundancy and spectral properties. We apply to (30) the well-known series approximations
[TABLE]
and
[TABLE]
We have experimented with the various options available for trading accuracy versus simplicity of the expression, and propose
[TABLE]
Using (13), we obtain
[TABLE]
Then, after deleting the smallest terms, we obtain the simple cubic function
[TABLE]
which can be rewritten as
[TABLE]
where . The checks (4) for the above yield
[TABLE]
and
[TABLE]
In order to satisfy both checks (4), we add to the correction term , and define
[TABLE]
where after using (32) and (33), we obtain
[TABLE]
and
[TABLE]
Note that and are relatively small terms in (38) for asymptotically large . Figure 3 shows the difference between exact and estimated auto-correlation function versus for .
As a final proof of the pudding, we compare the (exact) spectrum of full set codewords versus the spectrum, denoted by , which is computed using the above approximated auto-correlation function . The difference, (dB), between the spectrum, , computed using , and the exact spectrum, , of full set codewords, which was computed using generating functions, is plotted in Figure 4. We may observe that the difference between the two spectra is very small, less than 0.05 dB for and less than 0.03 dB for .
The LFSW metric, is, using (9),
[TABLE]
Table I shows for selected values of , where as a comparison we have listed the LFSW of full set -balanced codes, denoted by . We may notice that for the difference between and is less than half a percent.
III-D Comparison with prior art
In [17], it is postulated that the auto-correlation of -balanced spectral null codes, denoted by , can be modelled by the simple parabola’s equation
[TABLE]
where the (real) parameters and are given by
[TABLE]
and
[TABLE]
It has been shown in [17] that the parabola’s equation (40) is an accurate approximation to the exact correlation function of full-set -balanced spectral null codes. Figure 3, Curve (b), shows the difference between exact and estimated auto-correlation function versus for . We notice that the newly developed , Curve (a), is almost an order more accurate than (40) presented in the prior art. Figure 5, Curve b, shows that the quotient of the exact spectrum and the one based on prior art (40), , is for less than 0.7 dB, and also here we notice that the newly developed theory is more than an order more accurate.
IV Appraisal of spectral performance
A system designer is usually confronted with a restricted redundancy budget, so that with a given redundancy the designer searches for a balanced code that offers the best rejection of low-frequency components. In this section, we compare the spectral performance of regular dc-balanced codes with that of -balanced codes. We start with a summary of properties of dc-balanced codes.
IV-A Codes with a first-order spectral null
Let the codeword length of a regular full-set dc-balanced code be denoted by , even. Each codeword has an equal number of 0’s and 1’s, so that the number of available dc-balanced codewords, denoted by , is simply [24]
[TABLE]
The auto-correlation function, , and the spectrum, , of dc-balanced codes is [2]
[TABLE]
and
[TABLE]
At the very low-frequency end, we have[15]
[TABLE]
where
[TABLE]
IV-B Performance comparison
We compare the spectral content of dc-balanced versus that of -balanced codes, where we assume that both types of codes have the same redundancy. Let and denote the maximum information rate of a -balanced code or dc-balanced of length and , respectively, then we have, using (10),
[TABLE]
and, using (41),
[TABLE]
Table II shows a few examples of the codewords length and for which dc-balanced and -balanced codes have equal redundancy, respectively, that is, . In the range shown in Table II, the codeword length of a -balanced code is approximately a factor of 4.5 larger than the codeword length of a dc-balanced code for achieving the same rate .
Figure 6 shows three examples of spectrum pairs of dc-balanced and -balanced codes with the same redundancy versus frequency for a) , b) , and c) , see also Table II. We may notice the points of intersection of the spectra of dc-balanced and -balanced codes. A further perusal of the diagram reveals that the points of intersection are at around -20 dB, which implies that -balanced codes are to be preferred when a low-frequency spectral suppression is required better than around -20 dB. Additional computations show that this ‘20 dB rule’ applies to all codes with a rate larger than 0.75.
V Conclusions
By applying the central limit theorem, we have derived an approximate expression for the auto-correlation function and spectrum of full-set -balanced codes for asymptotically large values of the codeword length . We have shown that the auto-correlation function of -balanced codes can be accurately approximated by a simple cubic function. We have compared the approximate spectrum with the exact spectrum of full set -balanced codes. We have shown that the difference between the approximated and exact spectrum is less than 0.04 dB for .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] K. W. Cattermole, “Principles of Digital Line Coding,” Int. Journal of Electronics, vol. 55, pp. 3-33, July 1983.
- 2[2] J. N. Franklin and J. R. Pierce, “Spectra and Efficiency of Binary Codes without DC,” IEEE Trans. Commun., vol. COM-20, pp. 1182-1184, Dec. 1972.
- 3[3] Y. Ng, K. Cai, K. S. Chan, M. R. Elidrissi, M. Y. Lin, Z. Yuan, C .L. Ong, and S. Ang, “Signal Processing for Dedicated Servo Recording System,” IEEE Trans. Magn., vol. 51, no. 10, Oct. 2015.
- 4[4] K. Cai, K. A. S. Immink, M. Zhang, and R. Zhao, “Design of Spectrum Shaping Codes for High-Density Data Storage,” Trans. on Consumer Electronics , vol. CE-63, pp. 477-482, Nov. 2017.
- 5[5] K. A. S. Immink, “Spectral Null Codes,” IEEE Trans. Magn., vol. MAG-26, no. 2, pp. 1130-1135, March 1990.
- 6[6] A. R. Ndjiongue, H. C. Ferreira, and T. M. N. Ngatched, Visible Light Communications (VLC) Technology , Wiley Encyclopedia of Electrical and Electronics Engineering, 2015.
- 7[7] M. Oh, “A Flicker Mitigation Modulation Scheme for Visible Light Communications”, 2013 15th International Conference on Advanced Communications Technology (ICACT), Pyeong Chang, South Korea, Jan. 2013.
- 8[8] S. Rajagopal, R. D. Roberts, and S-K. Lim, “IEEE 802.15.7 Visible Light Communication: Modulation Schemes and Dimming Support,” IEEE Communications Magazine, vol. 50, No.3, pp. 72-82, March 2012.
