Entropy numbers of finite dimensional mixed-norm balls and function space embeddings with small mixed smoothness
Sebastian Mayer, Tino Ullrich

TL;DR
This paper derives precise bounds for entropy numbers of finite-dimensional mixed-norm balls and applies these results to determine optimal asymptotic rates for embeddings of certain function spaces with small mixed smoothness, resolving an open problem.
Contribution
It provides the first matching bounds for entropy numbers of mixed-norm embeddings and establishes optimal dimension-free rates for Besov and Triebel-Lizorkin space embeddings with small mixed smoothness.
Findings
Matching bounds for entropy numbers of mixed-norm embeddings
Optimal asymptotic rates for function space embeddings
Resolution of an open problem in the literature
Abstract
We study the embedding and prove matching bounds for the entropy numbers provided that and . Based on this finding, we establish optimal dimension-free asymptotic rates for the entropy numbers of embeddings of Besov and Triebel-Lizorkin spaces of small dominating mixed smoothness which settles an open question in the literature. Both results rely on a novel covering construction recently found by Edmunds and Netrusov.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Entropy numbers of finite dimensional mixed-norm balls and function space embeddings with small mixed smoothness
Sebastian Mayera and Tino Ullrich*b,*111Corresponding author: [email protected]
a Fraunhofer SCAI, Schloss Birlinghoven, Sankt Augustin, Germany
b TU Chemnitz, Fakultät für Mathematik, 09107 Chemnitz
Abstract
We study the embedding and prove matching bounds for the entropy numbers provided that and . Based on this finding, we establish optimal dimension-free asymptotic rates for the entropy numbers of embeddings of Besov and Triebel-Lizorkin spaces of small dominating mixed smoothness, which gives a complete answer to Open Problem 6.4 in [8]. Both results rely on a novel covering construction recently found by Edmunds and Netrusov [10].
1 Introduction
Entropy numbers quantify the degree of compactness of a set, i.e., how well the set can be approximated by a finite set. Given a compact set in a quasi-Banach space , the -th entropy number is defined to be the smallest radius such that can be covered with copies of the ball , i.e.,
[TABLE]
The concept of entropy numbers can be easily extended to operators. Given a compact operator , where and are quasi-Banach spaces, the -th entropy number of the operator is defined to be
[TABLE]
If the spaces are clear from the context, we will abbreviate by .
Entropy numbers (or the inverse concept of metric entropy) belong to the fundamental concepts of approximation theory. They appear in various approximation problems, e.g., in the estimation of the decay of operator eigenvalues [4, 11, 20], in the estimation of learning rates for machine learning problems [38, 42], or in bounding -numbers like approximation, Gelfand, or Kolmogorov numbers from below [4, 16]. We note that Gelfand numbers find application in the recent field of compressive sensing [6, 13, 16] and Information Based Complexity in general. Entropy numbers are also closely connected to small ball problems in probability theory [21, 24]. For further applications and basic properties, we refer to the monographs [5, 28], and the recent survey [8, Chapter 6].
The subject of this paper is to improve estimates for entropy numbers of embeddings between function spaces of dominating mixed smoothness
[TABLE]
where is a bounded domain, , and . The case stands for the scale of Besov spaces of dominating mixed smoothness, while refers to the scale of Triebel-Lizorkin spaces, which includes classical and Sobolev spaces of mixed smoothness. That is why (1) also includes the classical embeddings
[TABLE]
if . Function space embeddings of this type play a crucial role in hyperbolic cross approximation [8]. Entropy numbers of such embeddings have been the subject of intense study, see [41], [8, Chapt. 6] and the recent papers by A.S. Romanyuk [34, 33, 35] and V.N. Temlyakov [39]. Note that there is a number of deep open problems connected to the case , which reach out to probability and discrepancy theory [8, 2.6, 6.4].
Typically, one observes asymptotic decays of the form
[TABLE]
where . This behavior is also well-known for -numbers of these embeddings like approximation, Gelfand, or Kolmogorov numbers, see [8] and the references therein. Although the main rate is the same as in the univariate case, the dimension still appears in the logarithmic term. We show that the logarithmic term completely disappears in regimes of small smoothness
[TABLE]
That is, we establish sharp purely polynomial asymptotic bounds of the form
[TABLE]
which depends on the underlying dimension only in the constant. This settles several open questions stated in the literature [8, 41], see Section 5, and makes the framework highly relevant for high-dimensional approximation.
A key ingredient in the proof of (3) is a counterpart of Schütt’s theorem for the entropy numbers of the embedding
[TABLE]
where and . We prove matching bounds for all parameter constellations. A particularly relevant case for the purpose of this paper is the situation where and
[TABLE]
Here, we have the surprising behavior
[TABLE]
Note that this relation is not a trivial extension of the classical Schütt result [37], which reads as
[TABLE]
for the norm--embedding , where . In fact, using trivial embeddings would give an additional -term in the third case of (4). The absence of this -term makes (4) interesting and useful as we will see below.
For and , it requires only trivial and standard volumetric arguments to establish matching bounds for the entropy numbers . The middle range is much more involved. In general, it is far from straightforward to generalize the proof ideas from (Schütt) to . Fortunately, the crucial work has already been done in a recent work by Edmunds and Netrusov [10]. They prove a general abstract version of Schütt’s theorem for operators between vector-valued sequence spaces. It remains for us to turn these general, abstract bounds into explicit estimates for the entropy numbers . Unfortunately, the paper [10] is written very concisely, which makes it difficult to follow the arguments at several points. Hence, we decided to provide some additional, explanatory material. We hope that Section 3 helps a broader readership to appreciate the powerful ideas in [10], in particular, a novel covering construction based on dyadic grids.
Outline.
The paper is organized as follows. In Section 2, we recapitulate basics definitions and results including entropy numbers and Schütt’s theorem. Afterwards, in Section 3, we discuss the generalization of Schütt’s theorem by [10]. In Section 4, we show consequences of this result, including matching bounds for the entropy numbers . Finally, we improve upper bounds for the entropy numbers of Besov and Triebel-Lizorkin embeddings in regimes of small smoothness in Section 5.
Notation.
As usual denotes the natural numbers, , denotes the integers, the real numbers, the positive real numbers, and ℂ the complex numbers. For we denote . We write for the natural logarithm. denotes the set of all -matrices with real entries and denotes the Euclidean space. Vectors are usually denoted with . For and , we use the quasi-norm with the usual modification in the case . If is a (quasi-)normed space, then denotes its unit ball and the (quasi-)norm of an element in is denoted by . If is a Banach space, then we denote its dual by . We will frequently use the quasi-norm constant, i.e., the smallest constant satisfying
[TABLE]
For a given we say that is a -norm if
[TABLE]
As is well known, any quasi-normed space can be equipped with an equivalent -norm (for a certain , see [2, 32]). If is a continuous operator we write and for its operator (quasi-)norm. The notation indicates that the identity operator is continuous. For two non-negative sequences we write if there exists a constant such that for all . We will write if and . If is a set of parameters, then we write if there exists a constant depending only on such that for all .
Let . For , the -dimensional mixed space is defined as the space of all matrices equipped with the mixed (quasi-)norm
[TABLE]
with the usual modification that the corresponding sum is replaced by a maximum in the case that either or . We always refer to the -space supported on as the outer space and to the -space supported on as the inner space. For any and we define as the matrix for , for .
2 Entropy numbers and Schütt’s theorem
Let us recall basic notions and properties concerning entropy numbers. Let be a subset of a quasi-Banach space . Given , an -covering is a set of points such that
[TABLE]
An -packing is a set of points such that for pairwise different . The covering number is the smallest such that there exists an -covering of , while the packing number is the largest such that there exists an -packing of . It is easy to see that
[TABLE]
The metric entropy is defined to be
[TABLE]
see Remark 4 for the relation of metric entropy to other notions of entropy.
The -th entropy number can be redefined as
[TABLE]
It is easy to see that the sequence of entropy numbers is decaying, i.e., . Moreover, the set is compact in if and only if .
Let denote an operator mapping between two quasi-Banach spaces and . Recall from the introduction that the operator’s entropy numbers are given by
[TABLE]
Clearly, we have
[TABLE]
If are both operators from to , and is a -normed space, then the entropy numbers of the sum can be estimated as follows
[TABLE]
Moreover, if and then
[TABLE]
In particular, this gives
[TABLE]
For further general properties of entropy numbers and basic estimates, we refer the reader to the monographs [5, 25, 29]. For remarks on the history of entropy number research, see [5, 42].
In the concrete situation where and for , the entropy numbers of the embedding are completely understood in terms of their decay in and . This central result is often referred to as Schütt’s theorem. For its history and references, see Remark 3. We only state the interesting case here.
Theorem 1** (Schütt’s theorem).**
For and , we have
[TABLE]
The constants in the estimates do neither depend on nor on .
Remark 2**.**
Note that as long as because for different .
Remark 3**.**
In 1984, Schütt [37] gave a proof for the general case of symmetric Banach spaces, which implies Theorem 1 if . In the range , we upper bound was first proved for all by Edmunds and Triebel [11] in 1996 by covering the unit ball using suitable sparse vectors. Edmunds and Netrusov [9, Thm. 2] generalized this covering construction in 1998 to arbitrary quasi-Banach spaces. In the same paper, Edmunds and Netrusov also proved matching lower bounds for general quasi-Banach spaces [9, Thm. 2]. Kühn [22] also proved the lower bound for with in 2001. Both [9, Thm. 2] and [22] rely on the very same idea to pack the unit ball with sparse vectors and use the fundamental combinatorial fact discussed in Remark 12 (ii) below. In 2000, Guédon and Litvak [15, Thm. 6] provided an alternative proof of Theorem 1 that relies completely on interpolation arguments and improved the constants in the upper bound.
Remark 4**.**
The concept of metric entropy for compact sets has been introduced independently by Kolmogorov [18] and Pontrjagin and Schnirelmann [31]. It should not be confused with the metric entropy of a dynamical system, which also has been introduced by Kolmogorov [19]. The latter entropy is also called Kolmogorov-Sinai entropy or measure-theoretic entropy. However, these two notions of metric entropy are related [1]. There is also a deep connection between Kolmogorov-Sinai entropy and the notions of information entropy and thermodynamic entropy [3].
3 Edmunds-Netrusov revisited
In addition to Schütt’s theorem, the main tool that we employ in this work is a powerful result by Edmunds and Netrusov [10]. They prove a generalization of Schütt’s theorem for vector-valued sequence spaces. Let us restate the part of their result that is relevant for us.
Theorem 5** (Theorems 3.1 and 3.2 in [10]).**
Let such that , and let and be -normed quasi-Banach spaces. For such that , let
[TABLE]
and
[TABLE]
For , we have the following.
- (i)
If , then
[TABLE] 2. (ii)
If , then there are absolute constants such that
[TABLE]
Theorem 5 states abstract lower and upper bounds that are “matching” in the sense that both have the same functional form. At first glance, this functional form is not obvious to expect and not easy to interpret. In addition, we found it difficult to follow the arguments in [10] at several points due to its succinct style of presentation. We thus believe that it is of value to review their key arguments and to provide some additional material that makes Theorem 5 more comprehensible. This is the subject of the remainder of this section. The reader who is only interested in applications of Theorem 5 may proceed directly to Section 4.
Remark 6**.**
Theorems 3.1 and 3.2 in [10] are only stated for . However, these theorems also hold true for . First note that in the latter case, we have
[TABLE]
Now for , Theorem 5 has been proved in [27, Thm. 4.3]. For , the lower bound in Theorem 5 is a consequence of [27, Thm. 4.3] in combination with arguments analogous to Remark 12; the upper bound is trivial.
3.1 A special case to begin with
If it is clear that one simply has to take -fold Cartesian products of the optimal covering and packing of in to obtain the bounds
[TABLE]
In any other case, simple Cartesian products will not be good enough.
The special case of equal inner spaces also allows for a rather straightforward solution if the dimension of the inner space is finite. For an easier understanding of the contribution in [10], see Theorem 5 above, we find it instructive to give a direct proof of this special case and point out its limitations. Indeed, a straightforward generalization of the well-known Edmunds-Triebel covering construction [11] based on volume arguments will do the job to establish the optimal upper bound. Recall that the essence of this covering construction is a result from best -term approximation, sometimes referred to as Stechkin’s inequality, see [8, Sect. 7.4], which yields a -covering of in using only -sparse vectors. We simply have to extend this approach to row-sparse matrices. To improve readability, we will omit some technical details in the following proof.
Proposition 7**.**
Let and be (quasi-)normed with . Let further and . Then, for ,
[TABLE]
Proof.
The first case is trivial. The last case follows from volumetric arguments using the recent findings in Section [17, Sect. 3.2]. By these we know that
[TABLE]
and for accordingly. For we use the standard volume argument to obtain
[TABLE]
For the second case let . Clearly, we have that
[TABLE]
where . When we replace the rows with the largest -(quasi-)norm by [math] in , then the resulting matrix has a -(quasi-)norm of at most , which follows from a well-known relation for best -term approximation in . Hence, if we wish to cover the set by balls of radius , it suffices to take care of the largest components of the vectors in . That is, we take a suitable covering of in and append zero rows to every matrix of the covering. A similar volumetric argument as above in (8), (9) tells us that
[TABLE]
so that we obtain a covering of with cardinality .
Combining the coverings for all possible index sets yields an -covering of in , where , with cardinality
[TABLE]
Now, given , we choose
[TABLE]
such that
[TABLE]
is assured. Consequently, we obtain the upper bound
[TABLE]
∎
Remark 8**.**
One way to obtain the matching lower bound in the case is to generalize the proof idea underlying Schütt’s theorem (Theorem 1) in the case that . However, the standard combinatorial lemma is not sufficient here. A suitable packing to do this generalization has already been considered in [6, Prop. 5.3]. See also Remark 12 below.
3.2 The covering construction by Edmunds and Netrusov
The generalized Edmunds-Triebel covering is optimal for finite dimensional , see Proposition 7 in the previous section. In the general situation, where is compactly embedded into , it seems that the volumetric arguments underlying (11) are too coarse to obtain sharp estimates (at least in the finite dimensional situation). The main contribution of [10] is a covering construction which resolves this shortcoming by not using volumetric arguments at all. In particular, and do not have to be finite dimensional. We give a detailed recapitulation of their idea in this section. For some comments concerning the lower bound in Theorem 5, see Remark 12 at the end of this section.
The covering in [10] works in the very general situation where we are given quasi-Banach spaces and , see Proposition 10 below. The basic idea is to cover the unit ball by cuboids
[TABLE]
where and is exponential in (think of each cuboid as an anisotropically rescaled version of ). The crux is to find suitable vectors such that an optimal covering can be reached by covering the cuboid using a product of optimal coverings of ,…,. Edmunds and Netrusov [10] had the idea to consider vectors that form a dyadic grid derived from the simplex
[TABLE]
The dyadic grid is constructed with the help of the following mapping. Let
[TABLE]
and for , put
[TABLE]
This mapping leads to a finite grid with the following properties.
Lemma 9** (Simplification of Lemma 2.2 in [10]).**
For , let . The set has the following properties.
- (i)
For all , there is such that for all . 2. (ii)
For all , we have . 3. (iii)
For all , we have for each . 4. (iv)
We have .
Proof.
Given , let . We clearly have and for all indices . Further
[TABLE]
which is a crucial property to estimate the cardinality of the set . Let
[TABLE]
Clearly, . Varying over all elements in the simplex, can be any of the subsets of . Fixing , there are at most possibilities for . Fixing up to , there are at most possibilities for . Hence, in total the set may contain at most
[TABLE]
many elements. ∎
The dyadic grid according to Lemma 9 allows to establish the following upper bound on entropy numbers.
Proposition 10** (Reformulation of Lemma 2.3 in [10]).**
Let and be quasi-Banach spaces, let , and let such that . Then, we have
[TABLE]
Proof.
Consider the transformed grid
[TABLE]
By Lemma 9 (i), we have
[TABLE]
where is the cuboid defined in (12).
Let be given by . For each
[TABLE]
let be a -covering. Then, for every , there is such that and
[TABLE]
By construction of the set , we have and
[TABLE]
Finally, note that the product has cardinality
[TABLE]
which, in combination with , implies the desired result. ∎
Proposition 10 is not the complete final answer. For , we have to modify the proof of Proposition 7. We sketch the proof and refer to the proof of [10, Thm 3.1] for technical details.
Proposition 11**.**
Let . Then, we have
[TABLE]
where is defined in Theorem 5.
Proof sketch.
Let . It is clear that, analogously to (10), we have
[TABLE]
Similar as in Proposition 7, we can use a covering for to construct a covering for . Consider now and let be a minimal -covering of in . Let . Then, for every , there is such that
[TABLE]
where the second term on the right-hand side follows from the best -term approximation result already used in Proposition 7. Consequently, we have
[TABLE]
In contrast to Proposition 7, volumetric arguments would now give a suboptimal estimate for the entropy numbers . In this general situation, it requires Proposition 10 with and to get the proper estimate. Concretely, since , we have
[TABLE]
which leads in combination with Proposition 10 and (14) to an upper bound of the form
[TABLE]
The usual arguments show that it is optimal to choose of the order . ∎
Remark 12**.**
We close this section with some remarks concerning the lower bound in Theorem 5. Its proof relies on two surprisingly simple observations, see [10] for details.
(i) Let be a maximal -packing of in . Using the Gilbert-Varshamov bound, which is well-known in coding theory [14, 40], we know that contains elements of mutual distance , where . This leads to the lower bound
[TABLE]
see [27, p. 68] and [10, Lem. 2.6] for a more general formulation. Given , we have to make a good choice for the dimension to maximize the lower bound. Choose for some to obtain
[TABLE]
If , we conclude
[TABLE]
If , then guarantees and thus
[TABLE]
(ii) Choose a vector such that
[TABLE]
We construct a packing by building row-sparse matrices, where the nonzero rows contain copies of and the row support sets are chosen according to the following combinatorial fact that is well-known in various disciplines of mathematics, see. e.g., [13, Lemma 10.12], [22], [12] or [30, Prop. 2.21, p. 219]. Given such that , there exist subsets of , where
[TABLE]
such that each subset has cardinality and
[TABLE]
This leads to the lower bound
[TABLE]
In view of the packing construction that we have mentioned in Remark 8 it is somewhat surprising that it is not necessary to combine the combinatorics of the two observations in order to obtain the optimal abstract bound in Theorem 5. An explanation is given in [27, Rem. 4.13, p. 69].
4 Consequences of the Edmunds-Netrusov result
We discuss some consequences of Theorem 5. Let us begin with considering the entropy numbers
[TABLE]
We have the following matching bounds.
Theorem 13**.**
Let and . Then, we have
[TABLE]
For , we have the following case distinctions.
- (i)
Let .
- (i.a)
In the special case , we have
[TABLE] 2. (i.b)
If and , then
[TABLE] 3. (i.c)
If and , then
[TABLE] 2. (ii)
Let . Then, we have
[TABLE]
Proof.
For and , it requires only standard volumetric arguments, see [27, Appendix A] for details. Let us also refer to [7, Lemma 3], where this case has been already considered. Let and be as defined in Theorem 5. Moreover, throughout the proof, we write for ,
[TABLE]
Ad (i.a). Since , it follows from Theorem 1 that for and consequently that and for all . Now, for , we have that for , so the sequence is bounded from above by a monotonically increasing sequence. For , we have
[TABLE]
Since decays faster in than increases, we conclude that for , the sequence is “essentially monotonically decreasing”. To be more precise attains at its maximum, where the factor depends only on and . Hence, the maximum of can be bounded from above by a constant times the maximum of and therefore by . Using analogous arguments for , we conclude that and
[TABLE]
for .
Ad (i.b). Consider now and . For , we have in consequence of Theorem 1, that for and
[TABLE]
Since , the sequence is bounded from above and below up to a constant by a monotonically increasing sequence and consequently, the maximum is attained at such that . Since , we further have
[TABLE]
For we find as before that and for , we have the estimate
[TABLE]
Ad (i.c). Consider now . For , we find since the sequence is bounded from below and above by a sequence that increases monotonically in . If , then
[TABLE]
and
[TABLE]
Finally, if , then .
Ad (ii). For , we observe that
[TABLE]
since the term is decaying in at least as fast as is growing. Hence,
[TABLE]
Next, we consider . Since , we find
[TABLE]
where we have used in the last estimate. At the same time, since , we also have and thus
[TABLE]
Finally, for it is easy to see that
[TABLE]
∎
Remark 14**.**
The upper bound for in Theorem 13 also follows from [7, Lem. 3]. The upper bound in Theorem 13 (ii) has also been proved in [41, Lem 3.16] for the range . The proof there uses the following covering construction, which appeared first in [23, Proof of Prop. 4] to our knowledge. Let and be (quasi-)Banach spaces and . The covering rests on the idea to split the ball into subsets of matrices with non-increasing rows,
[TABLE]
where the union is taken over all permutations of . This leads to the upper bound
[TABLE]
for . If
[TABLE]
with , and we chooses for some such that
[TABLE]
then (16) is strong enough to obtain the upper bound in Theorem 13 (ii), provided
[TABLE]
∎
Now we increase the level of abstraction and consider mixed norms of higher order. Let, for , the weighted spaces and be given by
[TABLE]
with , and . The dimensions and are non-decreasing natural numbers satisfying . These spaces are used as “inner spaces” in the way that
[TABLE]
Note that for with , , , the norm is given by
[TABLE]
We are interested in the behavior of the entropy numbers
[TABLE]
in the special situation .
Proposition 15**.**
Let and . Let further and , as above. Then we have for all and
[TABLE]
Proof.
We use Theorem 5, in particular the upper bound in Proposition 10. Since we obtain
[TABLE]
Let us evaluate the first . With Theorem 13, (i.b), (i.c) we have
[TABLE]
Because of the maximum is attained for , which leads to
[TABLE]
Let us discuss the second . Using again Proposition 10 we obtain
[TABLE]
Due to our assumption the exponent for is positive in both cases. Since we may replace by to increase the right-hand side. This leads to
[TABLE]
∎
We are now aiming for a similar relation for small .
Proposition 16**.**
Let and . Then we have for the estimate
[TABLE]
Proof.
Again we use Theorem 5, in particular the upper bound in Proposition 10. This gives
[TABLE]
where we used once again Theorem 13, (i.b). Clearly, we get
[TABLE]
Since the function is bounded on we conclude with
[TABLE]
∎
5 Polynomial decay of entropy numbers for
multivariate function space embeddings
We come to the main subject of this paper, improved upper bounds for entropy numbers of function space embeddings (1) in regimes of small mixed smoothness.
5.1 Function spaces of dominating mixed smoothness
Besov and Triebel-Lizorkin spaces of mixed smoothness are typically defined via a dyadic decomposition on the Fourier side. Let be the standard tensorized dyadic decomposition of unity, see [36] and [41]. We further denote by the space of tempered distributions and by the space of distributions (dual space of , which represents the space of test functions on the bounded domain ). The Besov space of dominating mixed smoothness with smoothness parameter and integrability parameters is given by
[TABLE]
with the usual modification in the case . The Triebel-Lizorkin space of dominating mixed smoothness is given by
[TABLE]
The latter scale of spaces contains the classical spaces and Sobolev spaces with dominating mixed smoothness if and , namely we have and for . Note that we also have for all and . Though we have the embedding
[TABLE]
for and , see [36, Chapt. 2], the embedding (20) is never compact. Hence, the entropy numbers of embeddings between function spaces defined on the whole do not converge to zero. We restrict our considerations to spaces on bounded domains . Let be an arbitrary bounded domain in . Then, we define for as
[TABLE]
and its (quasi-)norm is given by . The embedding (20) transfers to the bounded domain and is compact such that the entropy numbers decay and converge to zero.
5.2 Sequence spaces
The key to establishing the decay rate of entropy numbers for the embedding (1) is a discretization technique which has been developed over the years by several authors beginning with Maiorov [26]. Later, when wavelet isomorphisms have been established, this technique has been refined by Lemarie, Meyer, Triebel and many others. In [41, Thm. 2.10] Vybíral gave the necessary modifications to deal with the above defined spaces in detail. The main advantage of this approach is to transfer questions for function space embeddings to certain sequence spaces.
Using sufficiently smooth wavelets with sufficiently many vanishing moments (and the notation from [41]) the mapping
[TABLE]
represents a sequence spaces isomorphism between and
[TABLE]
respectively, with the usual modification in the case . Here we denote for and
[TABLE]
and
[TABLE]
Further, denotes the characteristic function of . Consider the sequence spaces
[TABLE]
with (quasi-)norms given by
[TABLE]
Let us also define the following building blocks for fixed
[TABLE]
Clearly, for we have
[TABLE]
and . Consider
[TABLE]
for such that the embedding is compact. Defining the building blocks
[TABLE]
we have . Of course, the identity
[TABLE]
holds true, where denotes the corresponding embedding operator on the respective block (24). Although these operators have the same mapping properties we use different notations to formally distinguish between them. If we also have, for a finite index set , that
[TABLE]
where and , which means and in the notation of (17). In particular, we have .
5.3 Entropy numbers
As a consequence of the boundedness of certain restriction and extension operators, see [41, 4.5], the investigation of entropy numbers of Besov space embeddings can be shifted to the sequences spaces side. We formulate our first result in the framework of sequence spaces, which improves the upper bound. More specifically, we prove that the lower bound in (36) is sharp in the case that , which also includes the limiting case . What is known in this direction is summarized in Remark 20 below.
Proposition 17**.**
Let be a bounded domain and , such that
[TABLE]
Then we have
[TABLE]
Proof.
The lower bound follows by [41, Thm. 3.18]. The upper bound is the actual contribution. We argue as follows.
Step 1. Put and fix , where is large enough (depending on ). We decompose the identity operator as follows
[TABLE]
where and . With an eye on Proposition 10 this means in particular that and (for large enough). Using (5) we obtain
[TABLE]
Step 2. We estimate the first summand. By (25) this breaks down to the entropy numbers
[TABLE]
with chosen as after (25) and denotes the range for . Putting
[TABLE]
and in (17) we may apply Proposition 15 . Since we may apply Proposition 15 to obtain
[TABLE]
Note that, due to Proposition 15, we only used that . To estimate the first summand in (27) it is not needed that .
Step 3. Let us care for the second summand in (27). Clearly, it can be reduced to (28) with spaces defined analogously, but with running this time in the range
[TABLE]
Hence, we have . We apply Proposition 16 to end up with (29). Note, that we have used here only that , or, equivalently, .
Step 4. Finally, we deal with the third summand in (27). Clearly, we have
[TABLE]
This gives
[TABLE]
This concludes the proof. ∎
In the next theorem we consider the situation where a Besov type sequence space compactly embeds into a Triebel-Lizorkin type sequence space. This setting is particularly important, since it leads to results with target space .
Proposition 18**.**
Let be a bounded domain and , , , such that
[TABLE]
Then we have
[TABLE]
Proof.
Again, the lower bounds follow from [41, Thm. 3.18].
Step 1. In case we use the commutative diagram in Figure 1.
[TABLE]
Hence, we may use Proposition 17 and obtain
[TABLE]
Step 2. Now we consider . After decomposing the identity operator in an analogous way as in (26) and (27) we use the commutative diagrams in Figure 2 for the first and second summand, respectively. In fact, for the first summand in (27) we obtain by (7)
[TABLE]
Note, that the identity operator is bounded since is a bounded domain.
Furthermore, the entropy numbers
[TABLE]
can be estimated by the same reasoning as in Step 2 of the proof of Proposition 17. Note that may be smaller than . However, this is not important for the argument (based on Proposition 15). It remains to consider the second summand in (27). Here we use the right diagram in Figure 2 and obtain
[TABLE]
We continue to estimate the appearing entropy numbers as in Step 3 of the proof of Proposition 17. Note that might be larger than . However, for the argument, we only need . This concludes the proof. ∎
Let us finally consider the situation, where a Triebel-Lizorkin type sequence space compactly embeds into a Besov type sequence space.
Proposition 19**.**
Let , , , such that
[TABLE]
and let be a bounded domain. Then we have
[TABLE]
Proof.
The lower bound follows from [41, Thm. 3.18].
Step 1. For the upper bound in the case we may use the commutative diagram in Figure 3 below to decompose the identity operator. Afterwards, we use (6) to reduce everything to the situation in Proposition 17.
Step 2. In case we argue analogously to Step 2 of Proposition 18. This time we use the decompositions in Figure 4 for the first and second summand in (27), respectively. ∎
Unfortunately, we were not able to find a corresponding result for the situation. So, this remains an open problem.
Remark 20**.**
To clarify the contribution of this paper, let us briefly recapitulate the known results and open questions which motivated this work. For several results and historical remarks on the subject we refer to [8] and the references therein. In particular, Vybíral [41, Thm. 4.9] proved for and in the case of small smoothness
[TABLE]
that there is for any a number such that
[TABLE]
The result is a direct consequence of the bound for (the case of “large smoothness”), saying that
[TABLE]
In fact, the entropy numbers in (36) can be bounded from above by
[TABLE]
if . Now choose such that , which, together with (37) and replaced by , implies (36). ∎
The propositions proved above allow to improve a number of existing results for the entropy numbers of the embedding
[TABLE]
Theorem 21**.**
Let be a bounded domain and but . Let , and such that
[TABLE]
In addition, we assume if and if , respectively. Then it holds
[TABLE]
Proof.
The result is a direct consequence of Propositions 17, 18, 19 and the machinery described in the proof of [41, Thm. 4.11]. ∎
As a corollary of Theorem 21, we obtain the following result, which settles Open Problem 6.4 in [8].
Corollary 22**.**
Let be as above. Let further , and . Then we have
[TABLE]
Proof.
Identifying in the case , the result is a direct consequence of Theorem 21. ∎
With the final corollary below (from Theorem 21) we close some more gaps in [41, Thm. 4.18 (ii), (iii)].
Corollary 23**.**
Let be as above. We have the following sharp bounds for entropy numbers.
(i)* Let and . Then, we have*
[TABLE]
(ii)* Let and . Then, we have*
[TABLE]
(iii)* Let , and . Then, we have*
[TABLE]
Remark 24**.**
Entropy numbers of mixed smoothness Sobolev-Besov embeddings into , where , recently gained significant interest, see [39] and [34, 33, 35]. There are some fundamental open problems connected with , see [8, 2.6, 6.4, 6.5]. Interestingly, when choosing the third index small enough in Corollaries 22, 23 we get rid of the logarithm.
Acknowledgment.
The authors would like to thank Dinh Dũng, Thomas Kühn, Van Kien Nguyen, Winfried Sickel and Vladimir N. Temlyakov for several discussions on the topic. They would also like to thank two anonymous referees for their valuable comments. T.U. and S.M. would like to acknowledge support by the DFG Ul-403/2-1 and by the Fraunhofer Cluster of Excellence “Cognitive Internet Technologies”.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Akashi. An operator theoretical characterization of ε 𝜀 \varepsilon -entropy in Gaussian processes. Kodai Mathem. Journ. , 9(1):58–67, 1986.
- 2[2] T. Aoki. Locally bounded linear topological spaces. Proceedings of the Imperial Academy , 18(10):588–594, 1942.
- 3[3] P. Billingsley. Ergodic theory and information . John Wiley & Sons, Inc., New York-London-Sydney, 1965.
- 4[4] B. Carl. Entropy numbers, s 𝑠 s -numbers, and eigenvalue problems. J. Funct. Anal. , 41(3):290–306, 1981.
- 5[5] B. Carl and I. Stephani. Entropy, compactness and the approximation of operators . Cambrige Univ. Press, Cambridge, 1990.
- 6[6] S. Dirksen and T. Ullrich. Gelfand numbers related to structured sparsity and Besov space embeddings with small mixed smoothness. J. Complexity , 48:69–102, 2018.
- 7[7] D. Dũng. Non-linear approximations using sets of finite cardinality or finite pseudo-dimension. volume 17, pages 467–492. 2001. 3rd Conference of the Foundations of Computational Mathematics (Oxford, 1999).
- 8[8] D. Dũng, V. N. Temlyakov, and T. Ullrich. Hyperbolic Cross Approximation . Advanced Courses in Mathematics. CRM Barcelona. Birkhäuser/Springer, 2018.
