Randomness Evaluation with the Discrete Fourier Transform Test Based on Exact Analysis of the Reference Distribution
Hiroki Okada, Ken Umeno

TL;DR
This paper derives a precise mathematical reference distribution for the DFT test in NIST SP 800-22, improving its reliability and sensitivity for evaluating randomness in cryptographic applications.
Contribution
It provides an exact chi-squared distribution for the power spectrum in the DFT test, replacing the previous numerical estimation method.
Findings
The proposed test is more reliable than the existing DFT test.
The new test is more sensitive in detecting non-random sequences.
Experimental results confirm the improved performance of the proposed method.
Abstract
In this paper, we study the problems in the discrete Fourier transform (DFT) test included in NIST SP 800-22 released by the National Institute of Standards and Technology (NIST), which is a collection of tests for evaluating both physical and pseudo-random number generators for cryptographic applications. The most crucial problem in the DFT test is that its reference distribution of the test statistic is not derived mathematically but rather numerically estimated, the DFT test for randomness is based on a pseudo-random number generator (PRNG). Therefore, the present DFT test should not be used unless the reference distribution is mathematically derived. Here, we prove that a power spectrum, which is a component of the test statistic, follows a chi-squared distribution with 2 degrees of freedom. Based on this fact, we propose a test whose reference distribution of the test statistic is…
| Null hypothesis | is | ||
|---|---|---|---|
| = “generator is ideal” | True | False | |
| Judgment of | Reject | False Positive | True Positive |
| (Type I error) | |||
| Fail to reject | True Negative | False Negative | |
| (Type II error) | |||
| Test | ||||||
|---|---|---|---|---|---|---|
| Passing rate | ||||||
| 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | |
| 0.5 | 0.9 | 0.8 | 1.0 | 0.0 | 0.0 | |
| 0.7 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 0.9 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 0.9 | 0.9 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 0.7 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 1.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 0.9 | 1.0 | 0.9 | 1.0 | 0.0 | 0.0 | |
| 0.7 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 0.8 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 0.9 | 1.0 | 0.9 | 1.0 | 0.0 | 0.0 | |
| 0.9 | 1.0 | 0.9 | 1.0 | 0.0 | 0.0 | |
| 1.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 1.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 0.9 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 0.9 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 1.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 0.9 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 1.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 1.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 1.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 1.0 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 0.9 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| 0.8 | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | |
| Test | ||||||
| Passing rate | ||||||
| AES-CTR | 0.952 | 0.995 | 0.996 | 1.000 | 0.988 | 1.000 |
| Mersenne-Twister | 0.948 | 0.996 | 0.993 | 1.000 | 0.991 | 1.000 |
| Xorshift | 0.947 | 0.989 | 0.996 | 1.000 | 0.986 | 1.000 |
| VSC 2.0 | 0.952 | 0.998 | 0.994 | 1.000 | 0.988 | 1.000 |
| LCG | 0.952 | 0.995 | 0.995 | 0.998 | 0.984 | 0.999 |
| Micali-Schnorr | 0.975 | 0.993 | 1.000 | 1.000 | 0.994 | 1.000 |
| QCG-I | 0.955 | 0.994 | 0.997 | 1.000 | 0.697 | 0.991 |
| QCG-II | 0.954 | 0.993 | 0.993 | 0.998 | 0.000 | 0.000 |
| CCG | 0.667 | 0.900 | 0.911 | 0.995 | 0.000 | 0.000 |
| Test | |||
|---|---|---|---|
| Reliability | low | high enough | high enough |
| Sensitivity | high | low | definitely high |
| Parameter | |||
|---|---|---|---|
| 100,000 | 100,000 | 4,000 | |
| 1,000 | 1,000 | 25,000 | |
| Number of -s |
| small | large | |
|---|---|---|
| Second-level test | Accurate | Erroneous |
| Distribution of | Erroneous | Accurate |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Randomness Evaluation with the Discrete Fourier Transform Test
Based on Exact Analysis of the Reference Distribution
Hiroki Okada and Ken Umeno
Abstract
In this paper, we study the problems in the discrete Fourier transform (DFT) test included in NIST SP 800-22 released by the National Institute of Standards and Technology (NIST), which is a collection of tests for evaluating both physical and pseudo-random number generators for cryptographic applications. The most crucial problem in the DFT test is that its reference distribution of the test statistic is not derived mathematically but rather numerically estimated; the DFT test for randomness is based on a pseudo-random number generator (PRNG). Therefore, the present DFT test should not be used unless the reference distribution is mathematically derived. Here, we prove that a power spectrum, which is a component of the test statistic, follows a chi-squared distribution with 2 degrees of freedom. Based on this fact, we propose a test whose reference distribution of the test statistic is mathematically derived. Furthermore, the results of testing non-random sequences and several PRNGs showed that the proposed test is more reliable and definitely more sensitive than the present DFT test.
H. Okada and K. Umeno are with the Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Kyoto, JAPAN.
e-mail: [email protected], [email protected]
Keywords: Computer security, random sequences, statistical analysis
1 Introduction
Random numbers are used in many types of applications, such as cryptography, numerical simulations, and so on. However, it is not easy to generate “truly” random number sequences. Pseudo-random number generators (PRNGs) generate the sequences by iterating some recurrence relation; therefore, the sequences are theoretically not “truly” random. The binary “truly” random sequence is defined as the sequence in which each element has a probability of exactly of being “0” or “1” and in which the elements are statistically independent of each other. It is also difficult to ascertain if the sequence is truly random; therefore, the randomness of the sequences is evaluated statistically.
NIST SP 800-22 [1, 2] is one of the famous statistical test suites for randomness that was used for selecting the Advanced Encryption Standard (AES) algorithm. NIST SP 800-22 consists of fifteen tests, and every test is hypothesis testing, where the hypothesis is that the input sequence is truly random; if the hypothesis is not rejected in all the tests, it is implied that the input sequences are random. Among the tests included in NIST SP 800-22, the DFT test is of the greatest concern to us. This test detects periodic features of a random number sequence; input sequences are discrete Fourier transformed, and the test statistic is composed of the Fourier coefficients. In 2003, Kim et al. [3, 4] reported that the DFT test and the Lempel-Ziv test in the original NIST SP 800-22 [1] have crucial theoretical problems. Regarding the DFT test, it is reported that the test statistic does not follow the expected reference distribution because of the problem that the DFT test regards Fourier coefficients as independent stochastic variables although they are not. Kim et al. numerically estimated the distribution of the test statistic with pseudo-random numbers generated with a PRNG and proposed a new DFT test with the estimated distribution. In 2005, Hamano [5] theoretically scrutinized the distribution of the Fourier coefficients in the original DFT test. However, he could not derive the theoretical distribution of the test statistic, but he did make the problems in the DFT test clearer. In 2005, because of these reports, in NIST SP 800-22 version 1.7, the Lempel-Ziv test was deleted, and the DFT test was revised according to the report of Kim et al. The DFT test has not subsequently been revised. In 2012, Pareschi et al. [6] reviewed three tests included in NIST SP 800-22, and they also numerically estimated the distribution of the test statistic. Consequently, they reported that the distribution estimated by Kim et al. is not sufficiently accurate. As stated above, several researchers have attempted to revise the DFT test. However, the distribution of the test statistic has still not been derived theoretically but rather numerically estimated.
In this paper, we review the problems in the DFT test, and we prove three facts, which are important for analyzing the reference distribution of the test statistic: Under the assumption that the input sequence is an ideal random number sequence, when ,
- •
The asymptotic distributions of both and are the standard normal distribution () when .
- •
When is sufficiently large, and are statistically independent of each other.
- •
The asymptotic distribution of is a chi-squared distribution with 2 degrees of freedom when .
Here, is an -bit binary sequence, is the -th discrete Fourier coefficient of , and and are the real and imaginary parts of , and they are defined in , and in Section 2, respectively. There is no information about these factors in NIST SP800-22, and, to the best of our knowledge, no researchers who have studied the DFT test have ever provided rigorous proofs. These factors are necessary for analyzing the reference distribution of the test statistic. Furthermore, we propose a new DFT test based on the fact that is the asymptotic distribution of . By comparing the results of several PRNGs, we show that our test is more reliable and definitely more sensitive than the present DFT test.
2 Discrete Fourier Transform Test
In this section, we explain the procedure of the original DFT test (), released in 2001 [1], before the revision in 2005 [2]. We also explain the problems reported by several researchers [4, 5]. The focus of this test is the peak heights in the discrete Fourier transform of the sequence. The purpose of this test is to detect periodic features in the tested sequence that would indicate a deviation from the assumption of randomness. The intention is to detect whether the number of peaks exceeding the 95 % threshold is significantly different than 5 %.
2.1 The procedure of the original DFT test
The zeros and ones of the input sequence are converted to values of and to create the sequence , where . For simplicity, let be even. 2. 2)
Apply a discrete Fourier transform (DFT) to to produce Fourier coefficients . The Fourier coefficient and its real and imaginary parts and are defined as follows:
[TABLE] 3. 3)
Compute , where
[TABLE]
Because , are discarded. 4. 4)
Compute a threshold value . The 95% values are supposed to be .
According to SP800-22, is considered to follow , and is defined by the following equation.
[TABLE]
Several researchers [4, 5] reported that this was incorrect, and it was accordingly revised as in the DFT test in the revised NIST SP800-22 [2]. 5. 5)
Count
[TABLE]
If are mutually independent, then under the assumption of randomness, can be considered to follow , where is the binomial distribution.
According to the central limit theorem, when is sufficiently large, the approximation to is given by the normal distribution . Therefore, when is sufficiently large, under the assumption of randomness,
[TABLE] 6. 6)
Compute a test static
[TABLE]
When is sufficiently large, under the assumption of randomness, the test statistic can be considered to follow 7. 7)
Compute -; .
If , then conclude that the sequence is non-random, where is a significance level of the DFT test. NIST recommends [2]. Therefore, we also define . If , conclude that the sequence is random. 8. 8)
Perform 1) to 7) for sample sequences ; -s are computed. 9. 9)
(Second-level test I: Proportion of sequences passing a test)
Count the number of sample sequences for which - and define it as . Then, under the assumption of randomness, follows , which approximates when is sufficiently large. Therefore, the proportion of sequences passing a test () approximately follows . The range of acceptable is determined using the significance interval defined as
[TABLE]
If the proportion falls outside of this interval, there is evidence that the data are non-random. 10. 10)
(Second-level test II: Uniform distribution of -s)
Uniformity may also be determined by applying a test and determining a - corresponding to the goodness-of-fit distributional test on the -s obtained for an arbitrary statistical test (i.e., the - of the -s). This is performed by computing
[TABLE]
where is the number of -s in sub-interval . A - is calculated such that
[TABLE]
where igamc is the complementary incomplete gamma function. If
[TABLE]
the sequences can be considered to be uniformly distributed, where is the significance level for . 11. 11)
If the set of -s passes both 9) and 10), the physical or pseudo-random number generators that generated the input sequences are concluded to be ideal.
2.2 The fundamental problems of the original and present DFT tests
Kim et al. [4] and Hamano [5] reported the following:
- •
The test statistic does not follow ;
- •
does not follow .
Furthermore, Kim et al., using Secure Hash Generator (G-SHA1) [2] as a PRNG, estimated that
[TABLE]
and was revised according to this report of Kim et al. [2]; the present DFT test, denoted as , has not been revised since then. Therefore, the reference distribution of the test statistic of is not mathematically derived. Furthermore, Pareschi et al. reported that the numerical estimation is not sufficiently accurate; they numerically estimated that
[TABLE]
Moreover, Pareschi et al. proposed that the DFT test with this test statistic () is more reliable. (The definition of the reliability of a test is discussed in Section 5.) Therefore, it can be considered that still has errors. First, and are performed based on a PRNG, whose randomness should be evaluated with a randomness test; they cannot be used unless the reference distribution is mathematically derived.
As stated in step 5) in Section 2.1, are considered to be mutually independent. However, are not mutually independent, and this problem is expected to be the main factor for why does not follow [4, 5]. Furthermore, before considering this problem, it is also necessary to ensure that follows . Although is considered to follow in step 4) in Section 2.1, there is no information about this in SP800-22, and no researchers studying the DFT test have ever provided rigorous proofs to the best of our knowledge. We provide a proof for the DFT test in Section 3.
3 The asymptotic distribution of
In this section, we analyze the asymptotic distribution of . From the definition of in (1),
[TABLE]
When ,
[TABLE]
Under the assumption that is an ideal random number sequence, and are mutually independent, and . Therefore, as a consequence of the central limit theorem, when is sufficiently large, follows , and follows a chi-squared distribution with 1 degree of freedom . Thus, does not follow .
In the following, we consider the case when . Here, follows if the following is true:
- •
Both and follow .
- •
and are mutually independent.
In the following 2 subsections, we prove the following Theorem 1, Theorem 2 and Theorem 3:
[TABLE]
From the definition of , Theorem 3 can be proven by combing Theorem 1 and Theorem 2.
3.1 Proof of Theorem 1: The asymptotic distribution of
In this subsection, we prove Theorem 1. Hamano [5] showed that the average, variance, skewness, and kurtosis of and are the same. However, it cannot be proven that is the asymptotic distribution of based only on these factors.
is expressed as , where . Under the assumption that is an ideal random number sequence, the characteristic function of denoted by is expressed as follows:
[TABLE]
where
[TABLE]
[TABLE]
Using the Taylor expansion about a point , we obtain
[TABLE]
[TABLE]
Since
[TABLE]
[TABLE]
Thus, is the asymptotic distribution of . Likewise, it can be proven that is the asymptotic distribution of .
3.2 Proof of Theorem 2: Statistical independence of and
In this subsection, we prove Theorem 2. Let us define a 2-dimensional stochastic variable as the following equation:
[TABLE]
Under the assumption that is an ideal random number sequence, the characteristic function of denoted by \psi(\mbox{\boldmatht}) is expressed as follows:
[TABLE]
where
[TABLE]
Therefore,
[TABLE]
Using the Taylor expansion about a point \mbox{\boldmatht}=\mbox{\boldmath0}, we obtain
[TABLE]
Since
[TABLE]
we obtain
[TABLE]
Therefore, when is sufficiently large, the joint probability distribution function is described as follows:
[TABLE]
As we proved before, is the asymptotic distribution of both and . Thus, when is sufficiently large, the probability distribution functions of and are and , respectively. Therefore, when is sufficiently large, the following equation is obtained:
[TABLE]
This means that and are mutually independent when is sufficiently large.
4 The proposed DFT test
In Section 3, we proved Theorem 3, stating that follows when is sufficiently large. Therefore, if are mutually independent, we can consider that follows . However, are not mutually independent. Therefore, it is necessary to mathematically analyze the distribution of the test statistic under the condition that are not mutually independent. Hamano [5] attempted to mathematically derive the distribution of the set , but he could not do so, and we also could not derive this distribution. However, we rigorously proved that the asymptotic distribution of is , and we develop the new DFT test () based on this fact. The reference distribution of the test statistic of is mathematically derived, whereas that of is estimated with a PRNG. We explain the test statistic of in the next subsection.
4.1 The procedure of the proposed DFT test
In the standard approach in NIST SP800-22, each sequence is analyzed; thus, sequences give -s. However, generates (: length of a sequence) -s. Therefore, more -s are generated since is generally larger than . Since the number of -s should not be too large (see Section 5.3), before conducting , it is necessary to adjust the length of the sequences and make them into more sets of short sequences (see also Table 5), assuming that the set input sequences are continuously generated by an RNG. Therefore, is theoretically not appropriate for the isolated set of sequences.
The procedure of the proposed DFT test is described as follows:
The zeros and ones of the -length input sequence are converted to values of and to create the sequence , where . For simplicity, let be even. 2. 2)
Apply a discrete Fourier transform (DFT) to each to produce Fourier coefficients . The Fourier coefficient and its real and imaginary parts and are defined as follows:
[TABLE] 3. 3)
For all , perform the Kolmogorov-Smirnov (KS) test [8, 9] on the empirical cumulative distribution function of defined as based on the difference from and compute the - . Here, the KS statistic and are defined as follows.
[TABLE]
where is the cumulative distribution function of the Kolmogorov-Smirnov distribution:
[TABLE]
Note that -s are computed in this step, while the computes -s. 4. 4)
Perform the second-level tests I and II defined in the original DFT test (see Section 2.1-9, 2.1-10). If the set of -s passes both second-level tests I and II, the physical or pseudo-random number generator that generated the input sequences is concluded to be ideal.
5 Experiments
In this section, we explain the experiments that we performed and the conclusions derived from their results. In these experiments, we compare the reliability and sensitivity of and . The reliability of tests means a low probability of false positives (type I error) (see Table 1), and the sensitivity of tests means a low probability of false negatives (type II error). Now, the null hypothesis of the tests () is that the “generator is ideal”. Therefore, a false positive (type I error) means an erroneous identification of an ideal generator as not random, and a false negative (type II error) means an erroneous identification of a generator that is not ideal as random. Comparing the probability of type I error and type II error, we can conclude which test is better.
For simplicity, in this experiment, we modify the significance interval of the second-level test I defined in (4) as follows:
[TABLE]
With this modified significance interval, the significance level of the second-level test I () is modified to be .
5.1 Experiment 1: Test results for periodic sequences
In this experiment, we compare the sensitivity of and . Sensitivity means a low false negative rate (low probability of type I error), i.e., high true positive rate. Here, we compare the true positive rate of each test result.
[TABLE]
Now, we define an -length input sequence as
[TABLE]
where
[TABLE]
[TABLE]
We purposely create non-random (periodic) sequences from the -length sequence using the method described as follows:
[TABLE]
Therefore,
[TABLE]
We can clearly state this sequence is a non-random sequence. Therefore, if the test does not reject the (=null hypothesis: “generator is random”), then it is a false negative (type II error).
For each , we use sets of an -length () input sequence generated by the Mersenne Twister algorithm [10] and covert them to non-random -length sequences . Table 5 in Section 5.3 shows the parameters and for each test. In Section 5.3, we explain why the parameters and for are different from the other tests. Note that is the same. Table 2, Fig. 1 and Fig. 2 show the passing rate , which is defined as follows:
[TABLE]
Because we know that is non-random, we know that =FALSE, and the passing rate means a false negative rate in this experiment. Now, the significance levels of second-level tests I and II are and (defined in (5)), respectively. Therefore, the significance intervals defined in Eq. of and are described as follows:
[TABLE]
[TABLE]
Therefore, if or , we can conclude that the true positive rate is high, and we can conclude that the test is sensitive.
As shown in Table 2, Fig. 1 and Fig. 2, and of are all , whereas of and are not as low. From this table and the figures, we can conclude that is more sensitive than the other tests.
5.2 Experiment 2: Test results for existing pseudo-random number generators
We use sets of an -length () input sequence generated by
- •
AES Counter Mode (AES-CTR) [11],
- •
Mersenne Twister [10],
- •
Xorshift random number generator [12],
- •
Vector Stream Cipher 2.0 (VSC 2.0) [13],
- •
Linear congruential generator (LCG) [2],
- •
Cubic congruential generator (CCG) [2],
- •
Quadratic congruential generator I (QCG-I) [2],
- •
Quadratic congruential generator II (QCG-II) [2],
- •
Micali-Schnorr random bit generator [2].
VSC 2.0 is a stream cipher based on chaos theory, which was proposed by A. Iwasaki and K. Umeno [13]. We test these PRNGs using both the DFT and MS-DFT tests, and we compare the results. The parameter sets of and are the same as Table 5 in Section 5.3.
Now, the significance levels of second-level tests I and II are and , respectively, and in this experiment, 1000 -length sequences generated by each PRNG are tested. Table 3, Fig. 3 and Fig. 4 show the passing rate , defined as follows:
[TABLE]
Now, the significance intervals (99%) of passing rates and are described as,
[TABLE]
[TABLE]
respectively.
In this experiment, for each PRNG is defined as follows:
- •
is TRUE (considered as random): AES-CTR, Mersenne-Twister, Xorshift, VSC 2.0, LCG (Define them as “good PRNGs”).
Because these PRNGs pass all the tests included in NIST SP800-22 [2, 13], we consider them as random in this experiment.
- •
is FALSE (considered as non-random): Micali-Schnorr random bit generator, QCG-I, QCG-II, CCG (Define them as “bad PRNGs”).
Because these PRNGs are rejected by several tests included in NIST SP800-22 [2], we consider them as non-random in this experiment.
Under the assumption that this definition of is appropriate, let us consider the sensitivity and reliability of , and . As shown in Fig. 4, it is difficult to compare the reliability from the figure. This is because is very small, whereas the number of sets of input sequences is . Therefore, in this experiment, we focus on Fig. 3 and derive the conclusion of this experiment as follows.
- •
Reliability; of “good PRNGs” (AES-CTR, Mersenne-Twister, Xorshift, VSC 2.0, and LCG).
If the of “good PRNGs” lies inside its significance interval, we can conclude that the reliability of the test is sufficiently high.
As shown in Fig. 3, the of “good PRNGs” of and lies inside its significance interval, whereas that of is lower than the threshold. Therefore, we can conclude that the reliabilities of and are sufficiently high. Moreover, we can conclude that the reliability of is low.
- •
Sensitivity; the of “bad PRNGs” (Micali- Schnorr random bit generator, QCG-I, QCG-II, and CCG).
If the of “bad PRNGs” lies lower than the threshold, we can conclude that the sensitivity of the test is the highest.
As shown in Fig. 3, except for the Micali-Schnorr random bit generator, the of “bad PRNGs” of are definitely lower than the other tests. The of are also low, but not as low as , and the of are higher than the of . Therefore, we can conclude that the reliability of is definitely high, that of is high, and that of is low.
These conclusions from the aforementioned experiment are summarized in Table 4. We can conclude that is more reliable and definitely more sensitive than .
5.3 Appropriate selection of and
As shown in Table 5, the parameters and of are different from the other tests. NIST SP800-22 recommends and [2] (in experiments 1 and 2, we defined and for and to avoid excessive computation because we need and of -length sequences, respectively). However, as we stated in Step 3) in Section 4.1, in the procedure of , -s are generated, whereas and generate -s.
Pareschi et al. reported that the number of -s should not be too large because for extremely large numbers of -s, the second-level tests always fail [15, 16]. Pareschi et al. recommended that, in the case that , for the frequency test included in NIST SP800-22, the number of -s should be smaller than . Therefore, in , should not be too large (in , should not be too large). However, as we proved in Theorem 3, is the asymptotic distribution of . Therefore, should be as large as possible. Thus, in , a selection of the parameter is a trade-off between the error of the second-level test and the error of the distribution of (as shown in Table 6). Considering this trade-off, we defined the value of as shown in Table 5. The appropriate selection of and in still needs to be analyzed more specifically.
6 Conclusion
In this paper, we have considered the DFT test included in the NIST SP800-22 statistical test suite for random number sequences. The most crucial problem in the present DFT test (denoted as ) is that the reference distribution of its test statistic is not mathematically derived but is rather obtained by numerical estimation with a pseudo-random number generator; the basis of the test for randomness itself is based on a pseudo-random number generator. Therefore, cannot be used unless the reference distribution is mathematically derived.
We proved that the asymptotic distribution of the power spectrum is , and based on this fact, we proposed a new DFT test denoted as , whose distribution of the test statistic is mathematically derived.
Furthermore, although appropriate selection of the parameters and for still need to be analyzed more specifically, the results of testing non-random sequences and several pseudo-random number generators showed that is more reliable and definitely more sensitive than , which is the current standard DFT test.
Biography
Hiroki Okada
received his BSc degree in informatics from the Kyoto University, Japan, in Mar. 2014. He received his MSc degree in informatics from the Department of Applied Mathematics & Physics, Graduate School of Informatics Kyoto University, Japan, in Mar. 2016. He joined KDDI Corp. in Apr. 2016.
Ken Umeno
received his BSc degree in electronic communication from Waseda University, Japan, in 1990. He received his MSc and PhD degrees in physics from the University of Tokyo, Japan, in 1992 and 1995, respectively. From 1998 until he joined Kyoto University as a Professor in 2012, he worked for Japan’s Ministry of Posts and Telecommunications in its Communications Research Laboratory (currently the National Institute of Information and Communications Technology). From 2004 to 2012, he was CEO and President of ChaosWare, Inc. He received the LSI IP Award in 2003 and the Telecom-System Awards in 2003 and in 2008. He holds 46 registered Japanese patents, 23 registered United States patents and more than 5 international patents in the fields of telecommunications, security, and financial engineering. His research interests include ergodic theory, statistical computing, coding theory, chaos theory, information security, and GNSS based earthquake prediction.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Juan Soto, et al. , Special Publication 800-22, NIST, 2001
- 2[2] Special Publication 800-22 Revision 1a, NIST, 2010. http://csrc.nist.gov/publications/nistpubs/800-22-rev 1a/SP 800-22rev 1a.pdf
- 3[3] S. Kim, K. Umeno, and A. Hasegawa. “On the NIST statistical test suite for randomness,” IEICE Technical Report, ISEC 2003-87, Dec. 2003.
- 4[4] S. J. Kim, K. Umeno, and A. Hasegawa, “Corrections of the NIST Statistical Test Suite for Randomness,” Cryptology e Print Archive, Tech. Rep. 2004/018 , 2004.
- 5[5] K. Hamano, “The distribution of the spectrum for the discrete Fourier transform test included in SP 800-22,” IEICE Trans. Fundamentals, vol. E 88-A, no. 1, pp. 67-73, 2005.
- 6[6] Pareschi, F., Rovatti, R., & Setti, G. “On Statistical Tests for Randomness included in the NIST SP 800-22 test suite and based on the Binomial Distribution,” IEEE Transactions on Information Forensics and Security 7.2, pp. 491-505, 2012.
- 7[7] K, Hirose. “An inquiry report about test for pseudo random number generators - on the Discrete Fourier Transform test included in NIST SP 800-22”, 2005. http://www.cryptrec.go.jp/estimation/rep_ID 0212.pdf
- 8[8] M. A. Stephens. : Tests based on EDF statistics. In: D’Agostino, R.B. and Stephens, M.A., eds.: Goodness-of-Fit Techniques. Marcel Dekker, New York, 1986.
