On the Mathematical Validity of the Higuchi Method
Lukas Liehr, Peter Massopust

TL;DR
This paper rigorously analyzes the Higuchi method for estimating fractal dimensions, confirming its accuracy for certain functions and highlighting its potential instability under perturbations.
Contribution
It provides a precise mathematical formulation of the Higuchi method and evaluates its validity and robustness for different classes of functions.
Findings
Accurately estimates the dimension for non-fractal functions
Provides a geometrical interpretation of the algorithm
Shows the method can be unstable under perturbations
Abstract
In this paper, we discuss the Higuchi algorithm which serves as a widely used estimator for the box-counting dimension of the graph of a bounded function . We formulate the method in a mathematically precise way and show that it yields the correct dimension for a class of non-fractal functions. Furthermore, it will be shown that the algorithm follows a geometrical approach and therefore gives a reasonable estimate of the fractal dimension of a fractal function. We conclude the paper by discussing the robustness of the method and show that it can be highly unstable under perturbations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the Mathematical Validity of the Higuchi Method
Lukas Liehr
Peter Massopust∗
Centre of Mathematics
Technical University of Munich
Boltzmannstrasse 3
85748 Garching b. Munich
Germany
Abstract.
In this paper, we discuss the Higuchi algorithm which serves as a widely used estimator for the box-counting dimension of the graph of a bounded function . We formulate the method in a mathematically precise way and show that it yields the correct dimension for a class of non-fractal functions. Furthermore, it will be shown that the algorithm follows a geometrical approach and therefore gives a reasonable estimate of the fractal dimension of a fractal function. We conclude the paper by discussing the robustness of the method and show that it can be highly unstable under perturbations.
Key words and phrases:
Higuchi method, box-counting dimension, Weierstrass function, fractal function, total variation
1991 Mathematics Subject Classification:
28A80, 37L30
∗Corresponding author
1. Introduction and Preliminaries
In the following, we provide a mathematical investigation of the Higuchi method [7], an algorithm which aims at approximating the box-counting dimension of the graph of a bounded real valued function arising from a (measured) time series. For numerous applications, the graphs of these functions exhibit fractal characteristics and the Higuchi method is therefore used in various areas of science where such functions appear. It has been applied to such diverse subjects as analyzing heart rate variability [6], characterizing primary waves in seismograms [5], analyzing changes in the electroencephalogram in Alzheimer’s disease [2], and digital images [1]. In the original article, the validity of the algorithm was based on a number of numerical simulations and not on an exact mathematical validation.
This article presents mathematically precise conditions of when and for what type of functions the Higuchi method gives the correct value for the box-counting dimension of its graph. Of particular importance in this context are also the robustness and the behavior of the algorithm under perturbation of the data. The latter reflects imprecise measurements and the existence of measurement errors. We show that the Higuchi method as proposed in [7] is neither robust nor stable under perturbations of the data.
This paper uses the following notation and terminology. The collection of all continuous real-valued functions will be denoted by . Such functions will be regarded as models for continuous signals measured over time with values . A function is called bounded if there exists a nonempty interval such that , for all .
In many applications one is interested in characterizing the irregularity of the graph of a bounded function :
[TABLE]
A widely used measurement for the irregularity of is given by the box-counting dimension of its graph .
Definition 1.1** (Box-counting dimension).**
Suppose be a nonempty bounded subset of and . Denote by the smallest number of boxes of side length equal to needed to cover . The box-counting dimension of is defined to be
[TABLE]
provided the limit exists.
For more details, we refer the interested reader to, for instance, [4, 3, 8]. It follows from the above definition that any nonempty bounded subset of satisfies .
We note that if , the graph of is bounded and thus . The case where , i.e., when is a so-called space-filling or Peano curve will not be considered here.
2. The Higuchi Fractal Dimension
In this section, we review the definition of the Higuchi method as introduced in [7] and derive some immediate consequences. Without loss of generalization, we restrict ourselves to the unit interval .
For a given bounded function and , we define a finite time series by
[TABLE]
The time series represents samples of obtained by a uniform partitioning of in subintervals.
The Higuchi method uses the time series and a parameter with the property (here, denotes the ceiling function) to compute a fractal dimension, which we refer to as the Higuchi fractal dimension (HFD). This approach is originally stated in [7]. We start by formulating the method in a mathematically precise way.
For this purpose, we first present an algorithmic description of the Higuchi method based on the original article [7].
Algorithm 2.1** (Higuchi fractal dimension).**
*.
The Higuchi algorithm uses samples of and then forms “lengths” , , . Each length is the average over values ,
[TABLE]
and the values are computed via
[TABLE]
For notational simplicity and the mathematical analysis of the terms appearing in , we set
[TABLE]
and
[TABLE]
We use the notation to emphasize that this value can be regarded as an approximation for the total variation of on the interval . The relation between the total variation of and the Higuchi method will be discussed below in Section 3.
The algorithm then collects all indices with into an index set and defines a data set
[TABLE]
The slope of the best fitting affine function through (in the least-square sense) is defined to be the Higuchi fractal dimension of with parameters and : .
If the index set is empty or contains just one element then is defined to be . We emphasize that in general we cannot guarantee that the values differ from zero (the simplest examples are constant functions where for all ). Therefore, the definition of the index set is necessary.
Recall that for a given data set the slope of an affine function that is the best least squares fit of the data can be computed via
[TABLE]
where and are the mean values of and , respectively. For every samples of one has to specify a . Moreover, we need since otherwise the values are not well defined for all and all .
Definition 2.2** (Admissible Input).**
We call a pair admissible for the Higuchi method if
[TABLE]
The notation is used to denote the Higuchi fractal dimension of with parameters and . The following observations follow immediately from the definition of the algorithm.
Proposition 2.3**.**
Let be a bounded function and an admissible pair.
- (1)
If f is constant then for all and therefore . 2. (2)
If for all with , then . 3. (3)
If is affine then for every admissible choice of , i.e.,
[TABLE]
Proof.
(1) If for all and some then for all and for all . Therefore, . Hence, which implies .
(2) Assume for . Then there exists a such that for all . Since , there exist with . Without loss of generality we may assume that . It follows that
[TABLE]
Hence, any line through two consecutive points in has slope . Therefore, a linear regression line through all points in has slope as well. All in all, we obtain .
(3) If is constant then and therefore . If is not constant then there exists an and a such that . Consequently,
[TABLE]
which implies
[TABLE]
Therefore, and the assertion follows from (1). ∎
The third part of Proposition 2.3 shows that the normalization factor cannot be dropped without modifications since otherwise the computed dimension of an affine function could already differ from 1. But for an affine function the graph is a straight line and therefore the simplest example of a geometric object with box-counting dimension 1. We discuss the geometric meaning of the constants in more detail in Section 4.
3. Relation to Functions of Bounded Variation
In the original paper by Higuchi [7] and many other papers where the Higuchi method is applied to data sets, the values are identified as the normalized lengths of a curve generated by the sub-time series
[TABLE]
with . In fact, the sum
[TABLE]
does not measure the Euclidean length of the curve but we can regard it as an approximation for the total variation of . Let us recall the definition of the total variation of a function .
Definition 3.1** (Total variation).**
Let be a function and with . For a partition of with , define
[TABLE]
and
[TABLE]
We call the mesh size of . Further set
[TABLE]
where the supremum is taken over all partitions of . Then is called the total variation of . If then is said to have bounded variation on and we define
[TABLE]
As an input for the Higuchi algorithm one usually uses non-constant functions since otherwise for all and therefore there is no non-horizontal regression line through the data set . We can express this condition in terms of the total variation of .
Lemma 3.2**.**
For a function , if and only if is constant on .
Proof.
The statements follow directly from the definition of . ∎
It is known that if then . See, for instance, [4] for a proof.
In the following, we examine whether or not the Higuchi algorithm yields the correct dimension of graph() if is continuous and of bounded variation. To do so we first show a relation between the values defined in Algorithm 2.1 and the variation of . This relation is based on the following result.
Theorem 3.3**.**
Let and let be a sequence of partitions of such that as . Then
[TABLE]
Proof.
See the Appendix. ∎
Proposition 3.4**.**
Let and be defined as in the Higuchi algorithm (with fixed). Then
[TABLE]
Proof.
Define partitions
[TABLE]
Since it follows from Theorem 3.3 that
[TABLE]
Define and . Then
[TABLE]
Set . Since and as , we have by the continuity of . Consequently,
[TABLE]
The preceding proposition yields the next result.
Theorem 3.5**.**
Let . Then
[TABLE]
In other words, converges to if we choose an admissible sequence in such a way that is fixed and goes to infinity.
Proof.
Since , we only have to consider the values and . We have
[TABLE]
If is constant then the result follows from Proposition 2.3. If is not constant then by Lemma 3.2. Furthermore, we have by Proposition 3.4 and . Therefore, there exists an such that
[TABLE]
The slope of a linear regression line through
[TABLE]
is given by
[TABLE]
The fraction on the right-hand side goes to zero as . But is exactly and the result follows. ∎
Example 3.6**.**
The function satisfies . Figure 2 shows the convergence of to the value .
4. The Geometric Idea Behind the Higuchi Method
Consider squares of the form
[TABLE]
where and . We call a collection of such squares a -mesh of . For a nonempty bounded subset , we define the quantity to represent the number of -mesh squares that intersect , i.e.,
[TABLE]
Using -meshes, one obtains an equivalent definition for the box-counting dimension which turns out to be more convenient for a numerical approach.
Theorem 4.1**.**
Let be nonempty and bounded and let be the number of intersecting -mesh squares as defined in 3. Then
[TABLE]
provided this limit exists.
Proof.
See, for instance, [4], pp. 41–43. ∎
For and the area of an intersecting -mesh is given by
[TABLE]
Inserting (5) into (4), we can write the box-counting dimension of as
[TABLE]
At this point the question arises, what is a good approximation of the area when is the graph of a bounded function ?
To this end, we start by taking numbers with and such that . Set . For subintervals of of length an approximation of an intersecting -mesh of graph is given by
[TABLE]
Now for the whole graph of , the sum
[TABLE]
approximates the area . This approach is shown in Figure 3. The bigger and the smaller , the better the approximation.
In (7) we started on the very left of with [math] and took steps to provide a value close to . We could also start at and approximate by summands. It is reasonable to compute approximations of with starting points
[TABLE]
A starting point greater than would not be meaningful since otherwise a whole subinterval of length would be missing. Thus, we obtain values of the form
[TABLE]
which can be written as
[TABLE]
where is defined exactly as in the Higuchi method. Note that the definition of does not consider the graph of defined on the intervals
[TABLE]
Set and . To obtain an even better approximation, we take these two subintervals into account and proceed in the following way.
The area of an intersecting -mesh of is approximately equal to . For a fractal function , i.e., a function whose graphs exhibits fractal characteristics such as self-referentiality [9], the corresponding area of graph() when defined on whole is approximately
[TABLE]
where is chosen in such a way that
[TABLE]
where denotes the length of an interval. The interval has length which yields
[TABLE]
Hence, , where is the normalization constant as defined in the Higuchi algorithm [7]. Finally, we obtain values
[TABLE]
and their mean
[TABLE]
yields
[TABLE]
Using (6) in combination with a linear regression line, we obtain
[TABLE]
where is the slope of a regression line through data points
[TABLE]
This method yields a highly geometric approach for a numerical computation of the box-counting dimension. In fact, it is equivalent to the Higuchi method.
Theorem 4.2**.**
Let be the slope of the regression line through the data (9). Then .
Proof.
Using formula (2) for the slope obtained by the method of least squares, we have
[TABLE]
with
[TABLE]
Define further
[TABLE]
where is defined as in the Higuchi algorithm,
[TABLE]
It follows that
[TABLE]
as well as
[TABLE]
A simple computation yields
[TABLE]
Inserting (11) and (13) into (10) gives
[TABLE]
Setting , we see that is exactly equal to the slope of a regression line through
[TABLE]
and therefore . Consequently,
[TABLE]
which proves the assertion. ∎
Example 4.3** (Weierstrass function).**
Define by
[TABLE]
where and are fixed. The function is continuous, nowhere differentiable, and it is known that . (See, for instance, [4], p. 162.) The sequence provides a sequence of approximations of . Figure 4 represents the values of this sequence if we choose and fixed and .
5. Stability Under Perturbations
Let be a bounded function, an admissible input, and the corresponding time series given by
[TABLE]
Let be the lengths computed by the Higuchi algorithm. Assume that there is a such that . This means and the regression line through
[TABLE]
does not take the index into account. Recall that is defined by
[TABLE]
Thus, means that
[TABLE]
This implies that for every the values
[TABLE]
lie on a line. Suppose we perturb a value by a small constant . For simplicity, we assume . Define a new time series via
[TABLE]
Denote by , , , , , and the corresponding values in the Higuchi algorithm with respect to the perturbed time series. Observe that the values
[TABLE]
lie on a line if any only if . For the perturbation by destroys the collinearity. If follows that for and . Hence,
[TABLE]
which implies that . On the other hand, for all which were originally in the index set , i.e., for which , we have
[TABLE]
with
[TABLE]
It follows that for sufficiently small, and . Therefore, the new data set consists of all points with and in addition of at least one new point which is
[TABLE]
Whilst the lengths , , stay nearly untouched, the new point can have a significant influence on the slope of the new regression line. This follows from the fact that
[TABLE]
Example 5.1**.**
Let be admissible, , and . Define to be a continuous interpolation function such that for all ,
[TABLE]
It follows form the preceding discussion that Figure 5 shows an example with , , and :
[TABLE]
where a linear spline through the interpolation points. Note that the slope of the regression line of the perturbed time series is approximately , a nonsensical result as the box-counting dimension of must be .
Example 5.2**.**
Let be admissible and with Consider a function with the property that
[TABLE]
Assume that is odd, and . Then is even if and only if is odd. Therefore,
[TABLE]
This implies
[TABLE]
For even , and the values are either even (for even) or odd (for odd). Hence, . Hence,
[TABLE]
and consequently
[TABLE]
By Proposition 2.3 we obtain
Corollary 5.3**.**
For every admissible input there exists a bounded function with
[TABLE]
Further can be chosen to be of bounded variation. In this case:
[TABLE]
Proof.
Following the construction of the preceding example, we can choose to be an interpolation function through a finite set of interpolation points. In particular, we can choose as a function of bounded variation such as a linear spline. Then
[TABLE]
but . ∎
We continue with the function from Example 5.2. A perturbation of by a small value yields
[TABLE]
So the index set which previously consisted of all odd values of now changes to . Hence, is maximal. Furthermore,
[TABLE]
for all even . Figure 6 displays the corresponding regression lines.
Appendix: Proof of Theorem 3.3
Theorem 3.3.* Let and let be a sequence of partitions of such that as . Then*
[TABLE]
Proof.
Let and let be a partition of such that
[TABLE]
Let , The uniform continuity of on the compact set yields the existence of a such that
[TABLE]
Furthermore, can be chosen in such a way that any partition of with contains at least two points in every subinterval . Let be such a partition. Define
[TABLE]
then contains at least two points. Set and . Let be the partition Since is a refinement of , we have
[TABLE]
By the definition of and inequality (16), it follows that
[TABLE]
Hence,
[TABLE]
and the result follows. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. Ahammer. Higuchi method of digital images. P Lo S ONE , 6(9), 2011.
- 2[2] A. H. Husseen Al-Nuaimi, E. Jammeh, L. Sun, and E. Ifeachor. Complexity measures for quantifying changes in electroencephalogram in alzheimer’s disease, 2018.
- 3[3] K. Falconer. The Geometry of Fractal Sets . Cambridge University Press, 1992.
- 4[4] K. Falconer. Fractal Geometry: Mathematical Foundations and Applications . Wiley, 2003.
- 5[5] G. Gálvez-Coyt, A. Muñoz-Diosdado, J. A. Peralta, and F. Angula-Brown. Parameters of higuchi’s method to characterize primary waves in some seismograms from the mexican subduction zone. Acta Geophysica , 60(3):910–927, 2012.
- 6[6] R.S. Gomolka, S. Kampusch, E. Kaniusas, F. Türk, J.C. Széles, and W. Klonowski. Higuchi fractal dimension of heart rate variability during percutaneous auricular vagus nerve stimulation in healthy and diabetic subjects. Frontiers in Physiology , 9(1162), 2018.
- 7[7] T. Higuchi. Approach to an irregular time series on the basis of the fractal theory. Physica D , 31(2):277–283, 1988.
- 8[8] P. Massopust. Interpolation and Approximation with Splines and Fractals . Oxford University Press, 2010.
