Quasi-Monte Carlo integration for twice differentiable functions over a   triangle

Takashi Goda; Kosuke Suzuki; Takehito Yoshiki

arXiv:1701.08562·math.NA·December 9, 2019

Quasi-Monte Carlo integration for twice differentiable functions over a triangle

Takashi Goda, Kosuke Suzuki, Takehito Yoshiki

PDF

TL;DR

This paper develops a quasi-Monte Carlo method for integrating twice differentiable functions over a triangle, achieving near-optimal error bounds with explicit point sequence constructions.

Contribution

It provides an explicit construction of point sequences for QMC integration over a triangle with near-optimal error bounds, extending previous methods.

Findings

01

Achieves integration error of order N^{-1} (log N)^3

02

Includes a construction by Basu and Owen (2015) as a special case

03

Proves the bounds are nearly optimal given known lower bounds

Abstract

We study quasi-Monte Carlo integration for twice differentiable functions defined over a triangle. We provide an explicit construction of infinite sequences of points including one by Basu and Owen (2015) as a special case, which achieves the integration error of order $N^{- 1} (lo g N)^{3}$ for any $N \geq 2$ . Since a lower bound of order $N^{- 1}$ on the integration error holds for any linear quadrature rule, the upper bound we obtain is best possible apart from the $lo g N$ factor. The major ingredient in our proof of the upper bound is the dyadic Walsh analysis of twice differentiable functions over a triangle under a suitable recursive partitioning.

Equations320

I (f) = \frac{1}{∣ T ∣} \int_{T} f (x) d x,

I (f) = \frac{1}{∣ T ∣} \int_{T} f (x) d x,

I (f; P_{N}, W_{N}) = n = 0 \sum N - 1 w_{n} f (x_{n}),

I (f; P_{N}, W_{N}) = n = 0 \sum N - 1 w_{n} f (x_{n}),

I (f; P_{N}) = \frac{1}{N} n = 0 \sum N - 1 f (x_{n}) .

I (f; P_{N}) = \frac{1}{N} n = 0 \sum N - 1 f (x_{n}) .

∥ f ∥_{C^{2} (T)} := 0 \leq δ_{1} + δ_{2} \leq 2 max \frac{\partial ^{δ_{1} + δ_{2}} f}{\partial x _{1}^{δ_{1}} \partial x _{2}^{δ_{2}}}_{L^{\infty} (T)},

∥ f ∥_{C^{2} (T)} := 0 \leq δ_{1} + δ_{2} \leq 2 max \frac{\partial ^{δ_{1} + δ_{2}} f}{\partial x _{1}^{δ_{1}} \partial x _{2}^{δ_{2}}}_{L^{\infty} (T)},

e^{wor} (C^{2} (T); P_{N}) = f \in C^{2} (T) ∥ f ∥_{C^{2} (T)} \leq 1 sup ∣ I (f; P_{N}) - I (f) ∣.

e^{wor} (C^{2} (T); P_{N}) = f \in C^{2} (T) ∥ f ∥_{C^{2} (T)} \leq 1 sup ∣ I (f; P_{N}) - I (f) ∣.

\frac{1}{N} n = 0 \sum N - 1 f \circ g (x_{n}),

\frac{1}{N} n = 0 \sum N - 1 f \circ g (x_{n}),

e^{wor} (C^{2} (T); P_{N}) \leq C \frac{( lo g _{2} N ) ^{3}}{N},

e^{wor} (C^{2} (T); P_{N}) \leq C \frac{( lo g _{2} N ) ^{3}}{N},

e^{wor} (C^{2} (T); P_{2^{m}}) \leq C \frac{m ^{2}}{2 ^{m}},

e^{wor} (C^{2} (T); P_{2^{m}}) \leq C \frac{m ^{2}}{2 ^{m}},

e^{wor} (C^{2} (T); P_{N}, W_{N}) := f \in C^{2} (T) ∥ f ∥_{C^{2} (T)} \leq 1 sup ∣ I (f; P_{N}, W_{N}) - I (f) ∣ \geq \frac{c}{N},

e^{wor} (C^{2} (T); P_{N}, W_{N}) := f \in C^{2} (T) ∥ f ∥_{C^{2} (T)} \leq 1 sup ∣ I (f; P_{N}, W_{N}) - I (f) ∣ \geq \frac{c}{N},

△ (A, B, C) := {w_{1} A + w_{2} B + w_{3} C ∣ w_{1}, w_{2}, w_{3} \geq 0, w_{1} + w_{2} + w_{3} = 1},

△ (A, B, C) := {w_{1} A + w_{2} B + w_{3} C ∣ w_{1}, w_{2}, w_{3} \geq 0, w_{1} + w_{2} + w_{3} = 1},

T^{(1)} (ξ_{11}, ξ_{12}) = ⎩ ⎨ ⎧ △ (\frac{B + C}{2}, \frac{C + A}{2}, \frac{A + B}{2}) △ (A, \frac{A + B}{2}, \frac{A + C}{2}) △ (\frac{B + A}{2}, B, \frac{B + C}{2}) △ (\frac{C + A}{2}, \frac{C + B}{2}, C) (ξ_{11}, ξ_{12}) = (0, 0), (ξ_{11}, ξ_{12}) = (1, 0), (ξ_{11}, ξ_{12}) = (0, 1), (ξ_{11}, ξ_{12}) = (1, 1) .

T^{(1)} (ξ_{11}, ξ_{12}) = ⎩ ⎨ ⎧ △ (\frac{B + C}{2}, \frac{C + A}{2}, \frac{A + B}{2}) △ (A, \frac{A + B}{2}, \frac{A + C}{2}) △ (\frac{B + A}{2}, B, \frac{B + C}{2}) △ (\frac{C + A}{2}, \frac{C + B}{2}, C) (ξ_{11}, ξ_{12}) = (0, 0), (ξ_{11}, ξ_{12}) = (1, 0), (ξ_{11}, ξ_{12}) = (0, 1), (ξ_{11}, ξ_{12}) = (1, 1) .

T^{(2)} (ξ_{11} ξ_{21} ξ_{12} ξ_{22}) = (T^{(1)} (ξ_{11}, ξ_{12}))^{(1)} (ξ_{21}, ξ_{22}) .

T^{(2)} (ξ_{11} ξ_{21} ξ_{12} ξ_{22}) = (T^{(1)} (ξ_{11}, ξ_{12}))^{(1)} (ξ_{21}, ξ_{22}) .

T^{(n)} ξ_{11} ⋮ ξ_{n 1} ξ_{12} ⋮ ξ_{n 2}

T^{(n)} ξ_{11} ⋮ ξ_{n 1} ξ_{12} ⋮ ξ_{n 2}

= (T^{(1)} \dots (T^{(1)} (ξ_{11}, ξ_{12}))^{(1)} \dots)^{(1)} (ξ_{n 1}, ξ_{n 2}) .

T^{(i)} ξ_{11} ⋮ ξ_{n 1} ξ_{12} ⋮ ξ_{n 2} = T^{(i)} ξ_{11} ⋮ ξ_{i 1} ξ_{12} ⋮ ξ_{i 2} .

T^{(i)} ξ_{11} ⋮ ξ_{n 1} ξ_{12} ⋮ ξ_{n 2} = T^{(i)} ξ_{11} ⋮ ξ_{i 1} ξ_{12} ⋮ ξ_{i 2} .

ϕ^{(n)} : X \in F_{2}^{n \times 2} \to the center of the subregion T^{(n)} (X),

ϕ^{(n)} : X \in F_{2}^{n \times 2} \to the center of the subregion T^{(n)} (X),

ϕ^{(i)} ξ_{11} ⋮ ξ_{n 1} ξ_{12} ⋮ ξ_{n 2} = ϕ^{(i)} ξ_{11} ⋮ ξ_{i 1} ξ_{12} ⋮ ξ_{i 2} .

ϕ^{(i)} ξ_{11} ⋮ ξ_{n 1} ξ_{12} ⋮ ξ_{n 2} = ϕ^{(i)} ξ_{11} ⋮ ξ_{i 1} ξ_{12} ⋮ ξ_{i 2} .

η_{i} (X) := (- 1)^{∣ {1 \leq a \leq i - 1 ∣ (ξ_{a 1}, ξ_{a 2}) = (0, 0)} ∣},

η_{i} (X) := (- 1)^{∣ {1 \leq a \leq i - 1 ∣ (ξ_{a 1}, ξ_{a 2}) = (0, 0)} ∣},

ϕ^{(i)} (X) = j = 1 \sum i \frac{η _{j} ( X )}{2 ^{j}} e (ξ_{j 1}, ξ_{j 2}) .

ϕ^{(i)} (X) = j = 1 \sum i \frac{η _{j} ( X )}{2 ^{j}} e (ξ_{j 1}, ξ_{j 2}) .

ϕ_{X}^{(i)} (σ) := ϕ^{(i)} (X) + \frac{η _{i + 1} ( X )}{2 ^{i}} e (σ) .

ϕ_{X}^{(i)} (σ) := ϕ^{(i)} (X) + \frac{η _{i + 1} ( X )}{2 ^{i}} e (σ) .

T^{(i)} (X) = ϕ^{(i)} (X) + \frac{η _{i + 1} ( X )}{2 ^{i}} T .

T^{(i)} (X) = ϕ^{(i)} (X) + \frac{η _{i + 1} ( X )}{2 ^{i}} T .

(ξ_{1 j}^{(h)}, ξ_{2 j}^{(h)}, \dots, ξ_{nj}^{(h)})^{⊤} = C_{j} \cdot (η_{0}, η_{1}, \dots, η_{m - 1})^{⊤},

(ξ_{1 j}^{(h)}, ξ_{2 j}^{(h)}, \dots, ξ_{nj}^{(h)})^{⊤} = C_{j} \cdot (η_{0}, η_{1}, \dots, η_{m - 1})^{⊤},

ψ_{n} ξ_{1} ⋮ ξ_{n} := \frac{ξ _{1}}{2} + \frac{ξ _{2}}{2 ^{2}} + \dots + \frac{ξ _{n}}{2 ^{n}},

ψ_{n} ξ_{1} ⋮ ξ_{n} := \frac{ξ _{1}}{2} + \frac{ξ _{2}}{2 ^{2}} + \dots + \frac{ξ _{n}}{2 ^{n}},

(ξ_{1 j}^{(h)}, ξ_{2 j}^{(h)}, \dots)^{⊤} = C_{j} \cdot (η_{0}, η_{1}, \dots, η_{a - 1}, 0, 0, \dots)^{⊤},

(ξ_{1 j}^{(h)}, ξ_{2 j}^{(h)}, \dots)^{⊤} = C_{j} \cdot (η_{0}, η_{1}, \dots, η_{a - 1}, 0, 0, \dots)^{⊤},

S_{T} = {ϕ^{(ν (h))} (X (h)) ∣ h \in N_{0}},

S_{T} = {ϕ^{(ν (h))} (X (h)) ∣ h \in N_{0}},

P^{⊥} := {K = (κ_{ij}) \in F_{2}^{n \times 2} ∣ C_{1}^{⊤} κ_{11} ⋮ κ_{n 1} \oplus C_{2}^{⊤} κ_{12} ⋮ κ_{n 2} = 0 \in F_{2}^{m}} .

P^{⊥} := {K = (κ_{ij}) \in F_{2}^{n \times 2} ∣ C_{1}^{⊤} κ_{11} ⋮ κ_{n 1} \oplus C_{2}^{⊤} κ_{12} ⋮ κ_{n 2} = 0 \in F_{2}^{m}} .

μ_{1} (k) := max {i \in N ∣ κ_{i} \neq = 0},

μ_{1} (k) := max {i \in N ∣ κ_{i} \neq = 0},

μ_{1} (K) := μ_{1} (k_{1}) + μ_{1} (k_{2}) .

μ_{1} (K) := μ_{1} (k_{1}) + μ_{1} (k_{2}) .

(κ_{1}, \dots, κ_{n})^{⊤} \to (κ_{1}, \dots, κ_{n}, 0, 0, \dots)^{⊤},

(κ_{1}, \dots, κ_{n})^{⊤} \to (κ_{1}, \dots, κ_{n}, 0, 0, \dots)^{⊤},

μ_{1} (P^{⊥}) := K \in P^{⊥} ∖ {0} min μ_{1} (K),

μ_{1} (P^{⊥}) := K \in P^{⊥} ∖ {0} min μ_{1} (K),

μ_{1} (P^{⊥}) \geq m - t + 1,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Quasi-Monte Carlo integration for twice differentiable functions over a triangle††thanks:

The research of T. Goda was supported by JSPS Grant-in-Aid for Young Scientists No.15K20964. The research of K. Suzuki and T. Yoshiki was partially supported under the Australian Research Councils Discovery Projects funding scheme (project number DP150101770). The research of K. Suzuki was partially supported under CREST, JST.

Takashi Goda, Kosuke Suzuki, Takehito Yoshiki Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan ([email protected])Graduate School of Science, Hiroshima University, 1-3-1 Kagamiyama, Higashihiroshima 739-8526, Japan ([email protected])Research Support Department, University Management Division, Osaka City University, 3-3-138 Sugimoto, Sumiyoshi-ku, Osaka-shi, 558-8585 Japan. ([email protected])

Abstract

We study quasi-Monte Carlo integration for twice differentiable functions defined over a triangle. We provide an explicit construction of infinite sequences of points including one by Basu and Owen (2015) as a special case, which achieves the integration error of order $N^{-1}(\log N)^{3}$ for any $N\geq 2$ . Since a lower bound of order $N^{-1}$ on the integration error holds for any linear quadrature rule, the upper bound we obtain is best possible apart from the $\log N$ factor. The major ingredient in our proof of the upper bound is the dyadic Walsh analysis of twice differentiable functions over a triangle under a suitable recursive partitioning.

Keywords: Quasi-Monte Carlo, digital nets and sequences, numerical integration on triangle, dyadic Walsh analysis

MSC classifications: Primary, 42C10, 65D32; Secondary, 41A55, 65C05, 65D30

1 Introduction

In this paper we study numerical integration of twice differentiable functions defined over a triangle $T\subset\mathbb{R}^{2}$ . For an integrable function $f\colon T\to\mathbb{R}$ , we denote the true normalized integral of $f$ by

[TABLE]

where $|T|$ denotes the Lebesgue measure of $T$ . As an approximation of $I(f)$ , we consider a linear algorithm of the form

[TABLE]

for an $N$ -element point set $P_{N}=\{\boldsymbol{x}_{0},\ldots,\boldsymbol{x}_{N-1}\}\subset T$ and a set of real-valued weights $W_{N}=\{w_{0},\ldots,w_{N-1}\}$ . In particular, a quasi-Monte Carlo (QMC) integration is an equal-weight quadrature rule where the weights sum up to 1, i.e., a linear algorithm with the special choice $w_{n}=1/N$ for all $n$ . Therefore, $I(f)$ is simply approximated by

[TABLE]

If an infinite sequence of points $\mathcal{S}=\{\boldsymbol{x}_{n}\in T\mid n\geq 0\}$ is given, the first $N$ elements of $\mathcal{S}$ are used as $P_{N}$ .

We define the norm in $C^{2}(T)$ by

[TABLE]

and study the worst-case absolute error over the unit ball of $C^{2}(T)$ , i.e.,

[TABLE]

Thus an obvious goal in this context is to construct a good point set or sequence in $T$ such that the quantity $e^{\mathrm{wor}}(C^{2}(T);P_{N})$ is small either for some $N$ or uniformly for all $N\geq 2$ .

The theory of QMC integration has been developed in depth with the particular focus on approximating the integral of functions defined over the unit cube $[0,1]^{s}$ , see for instance [6, 9, 12]. In fact, not much attention has been paid to QMC integration over non-cubical domains until recently. We have to point out, however, that many practical problems are not necessarily given by quadrature over the unit cube. So far, the most standard approach to QMC integration over a non-cubical domain $\Omega$ is to find a uniformity-preserving transformation $g\colon[0,1]^{s}\to\Omega$ and then to approximate the normalized integral of $f\colon\Omega\to\mathbb{R}$ by

[TABLE]

for $\boldsymbol{x}_{0},\ldots,\boldsymbol{x}_{N-1}\in[0,1]^{s}$ . In the literature, Fang and Wang [7] introduced several transformations from the unit cube to the ball, sphere, and simplex. Pillards and Cools [10] studied 5 different transformations from the unit cube to the simplex. More recently, Basu and Owen [3] gave sufficient conditions on $g$ so that $f\circ g$ is either of bounded variation or satisfies additional smoothness conditions.

Instead of applying a uniformity-preserving transformation, more direct and explicit constructions of point sets and sequences in a triangular domain $T$ have been introduced recently by Basu and Owen [2]. One is based on the van der Corput sequence in base 4 in conjunction with a recursive partitioning of $T$ . The other is given by a rotation of an integer lattice through an angle whose tangent is badly approximable. A discrepancy measure derived in [4] was employed as a quality criterion of these constructions, and it was shown that the latter one attains a lower discrepancy. Nonetheless, the former one is of practical importance since it is extensible and can be randomized.

In this paper, motivated by the first construction of Basu and Owen, we study QMC integration for smooth functions in $C^{2}(T)$ . In particular, we give an explicit construction of infinite sequences of points including one by Basu and Owen as a special case, and prove that our quadrature rule achieves the worst-case error of order $N^{-1}(\log N)^{3}$ for any $N\geq 2$ in $C^{2}(T)$ . The main result of this paper can be summarized as follows:

Theorem 1.

For a triangle $T\subset\mathbb{R}^{2}$ , we can explicitly construct an infinite sequence $\mathcal{S}$ of points in $T$ for which there exists a constant $C>0$ such that

[TABLE]

for all $N\geq 2$ , and in particular,

[TABLE]

for all $m\in\mathbb{N}$ .

Roughly speaking, our approach for the proof of Theorem 1 is to exploit the decay of the Walsh coefficients for $f\in C^{2}(T)$ under a suitable recursive partitioning of $T$ . By following the essentially same argument using bump functions as in [1], see also [5, Section 2.7], we see that a lower bound of order $N^{-1}$ on the worst-case error holds for any linear algorithm in $C^{2}(T)$ . Namely, there exists a constant $c>0$ such that

[TABLE]

holds for any choice of $P_{N}$ and $W_{N}$ . Thus the upper bound we obtain is best possible apart from the $\log N$ factor.

The rest of this paper is organized as follows. In the next section we present an explicit construction of infinite sequences of points in $T$ . We prove an upper bound on the worst-case error for our quadrature rule in Section 3, whereas the result on the decay of the Walsh coefficients for $f\in C^{2}(T)$ , which is necessary for the proof of an upper bound, is shown later in Section 4.

Throughout this paper we use the following notations. Let $\mathbb{Z}$ be the set of integers, $\mathbb{N}$ the set of positive integers, and $\mathbb{N}_{0}:=\mathbb{N}\cup\{0\}$ . We denote the two-element field by $\mathbb{F}_{2}$ , which is identified with the set $\{0,1\}\subset\mathbb{Z}$ equipped with addition and multiplication modulo 2. The addition operation in $\mathbb{F}_{2}$ is denoted by $\oplus$ , and in case of vectors or matrices over $\mathbb{F}_{2}$ , $\oplus$ is applied componentwise. Further we denote a triangular domain with vertices $A,B,C\in\mathbb{R}^{2}$ by

[TABLE]

and the diameter of a set $S\subset\mathbb{R}^{2}$ by $d(S)$ . Without loss of generality, we assume that the center of a triangle $T$ is located at the origin in $\mathbb{R}^{2}$ , i.e., if the center of $T$ is not located at the origin, it suffices to shift the whole domain $T$ .

2 Explicit construction

2.1 Recursive partitioning

In a similar way to [2, Section 3], here we introduce a recursive partitioning of a triangle $T=\triangle(A,B,C)$ . We first partition the triangle into 4 congruent subtriangles, to each of which a pair $(\xi_{11},\xi_{12})\in\mathbb{F}_{2}^{2}$ is assigned with $(0,0)$ in the center. Then we partition each subtriangle into 4 congruent sub-subtriangles, to each of which a pair $(\xi_{21},\xi_{22})\in\mathbb{F}_{2}^{2}$ is assigned again with $(0,0)$ in the center. Hence every sub-subtriangle can be now identified with a set of pairs $(\xi_{11},\xi_{12})$ and $(\xi_{21},\xi_{22})$ . This is illustrated in Figure 1. It is obvious that this recursive partitioning of the triangle defines the mapping from $\mathbb{F}_{2}^{\mathbb{N}\times 2}$ to $T$ , which is surjective but not injective. Moreover, for a matrix $X=(\xi_{ij})\in\mathbb{F}_{2}^{\mathbb{N}\times 2}$ with $\xi_{ij}=0$ for all $i>n$ , the first $n$ rows of $X$ determines which of $4^{n}$ congruent subregions the matrix $X$ is mapped within, and from the condition that the pair $(0,0)$ is always assigned in the center, we see that the matrix $X$ is mapped to the center of the corresponding subregion.

We now describe our recursive partitioning more precisely. The subtriangle of $T=\triangle(A,B,C)$ for a pair $(\xi_{11},\xi_{12})\in\mathbb{F}_{2}^{2}$ is defined by

[TABLE]

Then the sub-subtriangle for a set of pairs $(\xi_{11},\xi_{12})$ and $(\xi_{21},\xi_{22})$ is defined by

[TABLE]

In this way, the subregion for a matrix $X=(\xi_{ij})_{1\leq i\leq n,j=1,2}\in\mathbb{F}_{2}^{n\times 2}$ with some $n\in\mathbb{N}$ is defined recursively by

[TABLE]

For simplicity of notation, as long as $1\leq i\leq n$ , we write

[TABLE]

Moreover we define the mapping $\phi^{(n)}\colon\mathbb{F}_{2}^{n\times 2}\to T$ by

[TABLE]

and, again for simplicity of notation, as long as $1\leq i\leq n$ we write

[TABLE]

Regarding the map $\phi^{(n)}$ we have the following lemma. Since the result can be easily proved by induction on $i$ , we omit the proof.

Lemma 1.

For a matrix $X=(\xi_{ij})_{1\leq i\leq n,j=1,2}\in\mathbb{F}_{2}^{n\times 2}$ , we define $\eta_{i}(X)\in\{\pm 1\}$ by $\eta_{1}(X):=1$ and

[TABLE]

for $2\leq i\leq n+1$ . Let $T=\triangle(\boldsymbol{e}(1,0),\boldsymbol{e}(0,1),\boldsymbol{e}(1,1))$ with $\boldsymbol{e}(1,0),\boldsymbol{e}(0,1),\boldsymbol{e}(1,1)\in\mathbb{R}^{2}$ and $\boldsymbol{e}(0,0)=(\boldsymbol{e}(1,0)+\boldsymbol{e}(0,1)+\boldsymbol{e}(1,1))/3=\boldsymbol{0}$ . Then the following holds true:

For $1\leq i\leq n$ , we have

[TABLE] 2. 2.

For $\boldsymbol{\sigma}\in\mathbb{F}_{2}^{2}\setminus\{(0,0)\}$ , define $\phi_{X}^{(i)}(\boldsymbol{\sigma})\in\mathbb{R}^{2}$ by

[TABLE]

Then we have $T^{(i)}(X)=\triangle(\phi_{X}^{(i)}(1,0),\phi_{X}^{(i)}(0,1),\phi_{X}^{(i)}(1,1))$ . In particular

[TABLE]

2.2 Generating infinite sequences of points in a triangle

We describe how to generate an infinite sequence of points in a triangle $T$ . For this purpose, we first introduce the definition of digital nets over $\mathbb{F}_{2}$ for the two-dimensional case.

Definition 1.

For $m,n\in\mathbb{N}$ with $n\geq m$ , let $C_{1},C_{2}\in\mathbb{F}_{2}^{n\times m}$ . For an integer $0\leq h<2^{m}$ , denote the dyadic expansion of $h$ by $h=\eta_{0}+\eta_{1}2+\cdots+\eta_{m-1}2^{m-1}$ . Define the matrix $X(h)=(\xi^{(h)}_{ij})_{1\leq i\leq n,j=1,2}\in\mathbb{F}_{2}^{n\times 2}$ by

[TABLE]

for $j=1,2$ . Then we call the subset $P=\{X(h)\mid 0\leq h<2^{m}\}\subset\mathbb{F}_{2}^{n\times 2}$ a (two-dimensional) digital net over $\mathbb{F}_{2}$ with generating matrices $C_{1},C_{2}$ .

Remark 1.

By using the map $\psi_{n}\colon\mathbb{F}_{2}^{n}\to[0,1)$ defined by

[TABLE]

for $(\xi_{1},\ldots,\xi_{n})^{\top}\in\mathbb{F}_{2}^{n}$ , digital nets over $\mathbb{F}_{2}$ are usually defined as point sets in $[0,1)^{2}$ by $\psi_{n}(P):=\{\psi_{n}(X(h))\mid 0\leq h<2^{m}\}\subset[0,1)^{2}$ , where $\psi_{n}$ is applied columnwise. Here we see that the integer $n$ denotes the precision of points. In this paper, it is more reasonable to define digital nets over $\mathbb{F}_{2}$ as subsets in $\mathbb{F}_{2}^{n\times 2}$ instead of point sets in $[0,1)^{2}$ .

The above definition can be extended to digital sequences over $\mathbb{F}_{2}$ .

Definition 2.

Let $C_{1},C_{2}\in\mathbb{F}_{2}^{\mathbb{N}\times\mathbb{N}}$ . For each $C_{j}=(c_{kl}^{(j)})_{k,l\in\mathbb{N}}$ , we assume $c_{kl}^{(j)}=0$ for all sufficiently large $k$ . For $h\in\mathbb{N}_{0}$ , denote the dyadic expansion of $h$ by $h=\eta_{0}+\eta_{1}2+\cdots+\eta_{a-1}2^{a-1}$ . Define the matrix $X(h)=(\xi^{(h)}_{ij})_{i\in\mathbb{N},j=1,2}\in\mathbb{F}_{2}^{\mathbb{N}\times 2}$ by

[TABLE]

for $j=1,2$ . Then we call the infinite sequence $\mathcal{S}=\{X(h)\mid h\in\mathbb{N}_{0}\}\subset\mathbb{F}_{2}^{\mathbb{N}\times 2}$ a (two-dimensional) digital sequence over $\mathbb{F}_{2}$ with generating matrices $C_{1},C_{2}$ .

Remark 2.

It follows from the assumption $c_{kl}^{(j)}=0$ for all sufficiently large $k$ that, for each $h\in\mathbb{N}_{0}$ , there exists a unique $\nu(h)\in\mathbb{N}_{0}$ such that $\xi^{(h)}_{\nu(h)+1,j}=\xi^{(h)}_{\nu(h)+2,j}=\cdots=0$ for both $j=1,2$ . Furthermore, for $m\in\mathbb{N}$ , the first $2^{m}$ elements of $\mathcal{S}$ can be regarded as a digital net over $\mathbb{F}_{2}$ with generating matrices $C_{1}^{n\times m},C_{2}^{n\times m}$ for some $n\geq m$ , where we denote by $C_{j}^{n\times m}$ the left upper $n\times m$ sub-matrix of $C_{j}$ .

Now we are ready to present how to generate an infinite sequence of points in a triangle $T$ .

Definition 3.

Let $\mathcal{S}\subset\mathbb{F}_{2}^{\mathbb{N}\times 2}$ be a digital sequence over $\mathbb{F}_{2}$ with generating matrices $C_{1},C_{2}$ . Then an infinite sequence of points in $T$ is given by

[TABLE]

where the function $\nu\colon\mathbb{N}_{0}\to\mathbb{N}_{0}$ is given as in Remark 2.

It is clear from this definition that our infinite sequence of points is determined by generating matrices $C_{1},C_{2}$ . Thus we need some quality measure for generating matrices to make an explicit construction of $\mathcal{S}_{T}$ possible, which is discussed in the next subsection.

2.3 Dual net and a new weight function

We first recall the notion of dual net.

Definition 4.

For $m,n\in\mathbb{N}$ with $n\geq m$ , let $P\subset\mathbb{F}_{2}^{n\times 2}$ a digital net over $\mathbb{F}_{2}$ with generating matrices $C_{1},C_{2}\in\mathbb{F}_{2}^{n\times m}$ . The dual net of $P$ is defined by

[TABLE]

Remark 3.

Let $\mathcal{S}\subset\mathbb{F}_{2}^{\mathbb{N}\times 2}$ be a digital sequence over $\mathbb{F}_{2}$ . As mentioned in Remark 2, the first $2^{m}$ elements of $\mathcal{S}$ are a digital net over $\mathbb{F}_{2}$ with generating matrices $C_{1}^{n\times m},C_{2}^{n\times m}$ for some $n\geq m$ . Thus the above definition of the dual net still applies to such an initial finite segment of $\mathcal{S}$ .

The following weight function, introduced in [8] and [11], is well known.

Definition 5.

For $k=(\kappa_{1},\kappa_{2},\ldots)^{\top}\in\mathbb{F}_{2}^{\mathbb{N}}\setminus\{\boldsymbol{0}\}$ , where all but only a finite number of $\kappa_{i}$ are 0, we define

[TABLE]

and $\mu_{1}(\boldsymbol{0}):=0$ . In case of a matrix $K=(k_{1},k_{2})\in\mathbb{F}_{2}^{\mathbb{N}\times 2}$ with $k_{1},k_{2}\in\mathbb{F}_{2}^{\mathbb{N}}$ , where all but only a finite number of elements in $k_{1},k_{2}$ are 0, we define

[TABLE]

If $k$ is an element in $\mathbb{F}_{2}^{n}$ for some $n\in\mathbb{N}$ , by considering an injection

[TABLE]

we use the same symbol $\mu_{1}$ to define the weight function for such $k$ . A similar abuse of notation is also done in case of a matrix $K\in\mathbb{F}_{2}^{n\times 2}$ for finite $n$ .

For a digital net $P\subset\mathbb{F}_{2}^{n\times 2}$ , we define the so-called minimum weight by

[TABLE]

which has been often used as a quality measure of generating matrices for QMC integration over the unit cube. If a digital net $P$ satisfies

[TABLE]

for some $0\leq t\leq m$ , we call $P$ a digital $(t,m,2)$ -nets over $\mathbb{F}_{2}$ . Furthermore, for a digital sequence $\mathcal{S}\subset\mathbb{F}_{2}^{\mathbb{N}\times 2}$ , if there exists a non-negative integer $t$ such that the first $2^{m}$ elements of $\mathcal{S}$ are a digital $(t,m,2)$ -net over $\mathbb{F}_{2}$ for any $m>t$ , we call $\mathcal{S}$ a digital $(t,2)$ -sequence over $\mathbb{F}_{2}$ .

Remark 4.

Any digital net satisfies the above inequality for $t=m$ . In practice, we prefer a larger value of $\mu_{1}(P^{\perp})$ and thus equivalently a smaller value of $t$ , and $t=0$ is best possible. We refer to **[6, 9]** for several explicit constructions of digital nets and sequences with small $t$ -value. 2. 2.

In what follows, we restrict ourselves to digital $(t,2)$ -sequences with upper triangular generating matrices $C_{1},C_{2}$ which satisfy $c_{kl}^{(j)}=0$ for $k>l$ . Explicit constructions of digital sequences by Sobol **[13]** and Tezuka **[14]** hold this property. Besides, by allowing the situation $t>m$ , the first $2^{m}$ elements of a digital $(t,2)$ -sequence can be regarded as a digital $(t,m,2)$ -net for any $m\in\mathbb{N}$ .

Now we introduce a new weight function which suits our purpose.

Definition 6.

Let $n\in\mathbb{N}\cup\{\infty\}$ . For a matrix $K=(k_{1},k_{2})\in\mathbb{F}_{2}^{n\times 2}$ with $k_{1},k_{2}\in\mathbb{F}_{2}^{n}$ , where all but only a finite number of elements in $k_{1},k_{2}$ are 0 if $n=\infty$ , we define

[TABLE]

We can define the weight function $v$ equivalently as follows: For a matrix $K=(\kappa_{ij})\in\mathbb{F}_{2}^{n\times 2}\setminus\{\boldsymbol{0}\}$ , where all but only a finite number of $\kappa_{ij}$ are 0 if $n=\infty$ , define

[TABLE]

and $v(\boldsymbol{0}):=0$ .

Similarly to $\mu_{1}(P^{\perp})$ , we define the minimum weight of a digital net $P$ by

[TABLE]

Here we prefer a digital net $P$ with a large value of $v(P^{\perp})$ . In the following lemma, we show that a digital $(t,m,2)$ -net with small $t$ is exactly what we want.

Lemma 2.

Let $P\subset\mathbb{F}_{2}^{n\times 2}$ be a digital $(t,m,2)$ -net over $\mathbb{F}_{2}$ . Then we have

[TABLE]

Moreover let $\mathcal{S}=\{X(h)\mid h\in\mathbb{N}_{0}\}\subset\mathbb{F}_{2}^{\mathbb{N}\times 2}$ be a digital $(t,2)$ -sequence over $\mathbb{F}_{2}$ . Then for any $m>t$ , we have

[TABLE]

Proof.

Let $K=(k_{1},k_{2})\in\mathbb{F}_{2}^{\mathbb{N}\times 2}$ with $k_{1},k_{2}\in\mathbb{F}_{2}^{\mathbb{N}}$ , where all but only a finite number of elements in $k_{1},k_{2}$ are 0. From the definitions of $\mu_{1}$ and $v$ , we have

[TABLE]

which gives

[TABLE]

This proves the first statement. The second statement directly follows from the definition of a digital $(t,2)$ -sequence. ∎

Hence our explicit construction of an infinite sequence of points in $T$ is to use a digital $(t,2)$ -sequence over $\mathbb{F}_{2}$ with upper triangular generating matrices which is mapped to $T$ according to Definition 3. In the next section, we prove that such an infinite sequence of points in $T$ achieves the almost optimal order of convergence for smooth functions in $C^{2}(T)$ .

Before going into the proof of an error bound, we provide another explicit construction inspired by the first construction due to Basu and Owen [2]. Let $C_{1},C_{2}\in\mathbb{F}_{2}^{\mathbb{N}\times\mathbb{N}}$ be given by

[TABLE]

For $h\in\mathbb{N}_{0}$ with finite dyadic expansion $h=\eta_{0}+\eta_{1}2+\cdots$ , we have

[TABLE]

Thus for even $m$ , it is obvious that the first $2^{m}$ elements of $\mathcal{S}$ generated by these matrices are given by

[TABLE]

Considering the image of the map $\phi^{(m/2)}\colon\mathbb{F}_{2}^{(m/2)\times 2}\to T$ , we can easily check that the point set in $T$ obtained in this way is the same as that of Basu and Owen. This implies that our construction scheme includes their explicit construction as a special case.

Moreover it is easy to show that the first $2^{m}$ elements of $\mathcal{S}$ are actually a digital $(\lceil m/2\rceil,m,2)$ -net over $\mathbb{F}_{2}$ . It can be seen from Lemma 2 that the minimum weight for $v$ is bounded below by $(m+1)/4$ , which can be improved as follows. Since the result follows from direct calculation, we omit the proof.

Lemma 3.

Let $C_{1},C_{2}\in\mathbb{F}_{2}^{\mathbb{N}\times\mathbb{N}}$ be given by (2). For $m\in\mathbb{N}$ , let $P$ be a digital net over $\mathbb{F}_{2}$ with generating matrices $C_{1}^{m\times m},C_{2}^{m\times m}$ . Then we have

[TABLE]

3 Upper bound

Here we prove an upper bound on the worst-case error for our quadrature rule in $C^{2}(T)$ by using the result later shown in Section 4.

3.1 Discretized function on a triangle

Definition 7.

For an integrable function $f\colon T\to\mathbb{R}$ and $n\in\mathbb{N}$ , we define the $n$ -th discretized function $F_{n}\colon\mathbb{F}_{2}^{n\times 2}\to\mathbb{R}$ by

[TABLE]

for $X\in\mathbb{F}_{2}^{n\times 2}$ .

Obviously we have

[TABLE]

Moreover it can be shown that $F_{n}$ approximates $f$ well.

Lemma 4.

Let $X\in\mathbb{F}_{2}^{n\times 2}$ and $\boldsymbol{y}\in T^{(n)}(X)$ . For any $f\in C^{2}(T)$ , we have

[TABLE]

Proof.

From Definition 7 we have

[TABLE]

Let us fix $\boldsymbol{y}=(y_{1},y_{2}),\boldsymbol{z}=(z_{1},z_{2})\in T^{(n)}(X)$ and consider the line segment $Z=\{\boldsymbol{y}+s(\boldsymbol{z}-\boldsymbol{y})\mid 0\leq s\leq 1\}$ . Since $T^{(n)}(X)$ is a triangle, and thus is convex, the set $Z$ is included in $T^{(n)}(X)$ . Hence we have

[TABLE]

which completes the proof. ∎

3.2 Walsh functions and coefficients

In order to exploit the smoothness of functions in $C^{2}(T)$ , we shall conduct a discrete Walsh-Fourier analysis of the discretized function $F_{n}$ defined on $\mathbb{F}_{2}^{n\times 2}$ later in Section 4. Right now we just introduce the definition of Walsh functions and briefly review some basic facts so as to make the proof of the main result in the next subsection accessible.

First the Walsh functions are defined as follows.

Definition 8.

Let $n\in\mathbb{N}$ be fixed. For a matrix $K=(\kappa_{ij})\in\mathbb{F}_{2}^{n\times 2}$ , the $K$ -th Walsh function $\mathrm{wal}_{k}\colon\mathbb{F}_{2}^{n\times 2}\to\{\pm 1\}$ is defined by

[TABLE]

for $X=(\xi_{ij})\in\mathbb{F}_{2}^{n\times 2}$ .

For a function $F\colon\mathbb{F}_{2}^{n\times 2}\to\mathbb{R}$ , we have the following Walsh expansion:

[TABLE]

where $\hat{F}(K)$ denotes the $K$ -th Walsh coefficient defined by

[TABLE]

The following character property holds between a digital net over $\mathbb{F}_{2}$ and Walsh functions, see for instance [5, Lemmas 4.2 & 4.5] for the proof.

Lemma 5.

Let $P\subset\mathbb{F}_{2}^{n\times 2}$ be a digital net over $\mathbb{F}_{2}$ . Then we have

[TABLE]

Using this lemma, for any $\sigma\in\mathbb{F}_{2}^{n\times 2}$ we have

[TABLE]

where the second equality stems from the fact

[TABLE]

for any $X,Y\in\mathbb{F}_{2}^{n\times 2}$ .

3.3 Proof of the main result

In order to prove Theorem 1, it suffices to show:

Theorem 2.

Let $\mathcal{S}\in\mathbb{F}_{2}^{\mathbb{N}\times 2}$ be either a digital $(t,2)$ -sequence with upper triangular generating matrices or a digital sequence with generating matrices given by (2), and let $\mathcal{S}_{T}\subset T$ be constructed according to Definition 3. Denote the first $N$ elements of $\mathcal{S}_{T}$ by $P_{N}$ . For any $f\in C^{2}(T)$ , the following holds true:

For all $N\geq 2$ , we have

[TABLE] 2. 2.

For all $m\in\mathbb{N}$ , we have

[TABLE]

Proof.

We only prove the case where $\mathcal{S}$ is a digital $(t,2)$ -sequence with upper triangular generating matrices. The case where $\mathcal{S}$ is a digital sequence with generating matrices given by (2) can be shown in exactly the same way.

We denote the dyadic expansion of $N$ by $N=2^{a_{1}}+\cdots+2^{a_{r}}$ , where $a_{1}>\cdots>a_{r}\geq 0$ . We split the first $N$ elements of $\mathcal{S}$ , denoted by $\{X(h)=(\xi^{(h)}_{ij})\in\mathbb{F}_{2}^{\mathbb{N}\times 2}\mid 0\leq h<N\}$ , into $r$ non-overlapping subsets

[TABLE]

It is the well-known property of a digital sequence that each subset $P^{(l)}$ is given by digitally shifting a digital net $\{X(h)\mid 0\leq h<2^{a_{l}}\}$ , see for instance [6, Proof of Theorem 4.84]. That is, there exists $\sigma_{l}\in\mathbb{F}_{2}^{\mathbb{N}\times 2}$ such that

[TABLE]

Let $n=\lceil\log_{2}N\rceil$ . Due to the property of upper triangular matrices, all of the elements in $P^{(1)},\ldots,P^{(r)}$ and $\sigma_{1},\ldots,\sigma_{r}$ can have at most the first $n$ rows different from $\boldsymbol{0}\in\mathbb{F}_{2}^{2}$ . Thus we obtain

[TABLE]

For each $l=1,\ldots,r$ , we write

[TABLE]

which is a digital $(t,a_{l},2)$ -net with generating matrices $C_{1}^{n\times a_{l}},C_{2}^{n\times a_{l}}$ , see the second item of Remark 4. By using (3), (4), and Lemma 4 we have

[TABLE]

Let $D=\max(2\sqrt{2}d(T),4(d(T))^{2})$ . Applying the result obtained in Lemma 10, we have

[TABLE]

The sum on the right-hand side is bounded by

[TABLE]

where we write $L(w)=\{K\in\mathbb{F}_{2}^{n\times 2}\mid v(K)\leq w\}$ , which is a linear subspace of $\mathbb{F}_{2}^{n\times 2}$ . The following obvious inclusions

[TABLE]

induces the injective map

[TABLE]

Therefore we have

[TABLE]

It follows from the fact $Q_{l}^{\perp}\cap L(w)=\{\boldsymbol{0}\}$ for $w<v(Q_{l}^{\perp})$ that

[TABLE]

and thus $|Q_{l}^{\perp}\cap L(w)|\leq 2^{2(w-v(Q_{l}^{\perp})+1)}$ for $w\geq v(Q_{l}^{\perp})$ . Now we obtain

[TABLE]

Using this bound and Lemma 2, the summand in (5) can be bounded by

[TABLE]

Plugging this bound into (5), we have

[TABLE]

Hence the result for the first item follows. The second item follows easily by considering the case $N=2^{m}$ , for which we have $r=1$ and $n=m$ . ∎

4 Walsh analysis on a triangle

In this section, we give a bound on the Walsh coefficient $\hat{F}_{n}(K)$ for the $n$ -th discretized function $F_{n}\colon\mathbb{F}_{2}^{n\times 2}\to\mathbb{R}$ for $f\in C^{2}(T)$ . We first present a formula between the Walsh coefficient $\hat{F}_{n}(K)$ and the so-called dyadic differences in Lemma 8. Here the dyadic differences are defined in Definition 10. Note that the concept of the dyadic differences is originally introduced in [15], while we need to change the definition slightly so as to suit our purpose, i.e, QMC integration over a triangular domain. Converting the dyadic differences into the usual derivatives, we have a bound on the Walsh coefficient $\hat{F}_{n}(K)$ for $f\in C^{2}(T)$ in Lemma 10.

4.1 Definitions and basic results

Here we introduce some more definitions and show some basic but necessary results related to them. For $1\leq i\leq n$ , $\boldsymbol{\kappa}\in\mathbb{F}_{2}^{2}$ and $X=(\boldsymbol{\xi}_{i})_{i=1}^{n}\in\mathbb{F}_{2}^{n\times 2}$ with $\boldsymbol{\xi}_{i}\in\mathbb{F}_{2}^{2}$ , we define the operation

[TABLE]

Moreover, for $\boldsymbol{\kappa},\boldsymbol{\kappa}^{\prime}\in\mathbb{F}_{2}^{2}$ , we define

[TABLE]

Regarding the group operation $\oplus_{i}$ , we have the following.

Lemma 6.

Let $\boldsymbol{\kappa}\in\mathbb{F}_{2}^{2}\setminus\{\boldsymbol{0}\}$ , $X=(\boldsymbol{\xi}_{i})_{i=1}^{n}\in\mathbb{F}_{2}^{n\times 2}$ , and $1\leq i\leq n$ .

For $\boldsymbol{\xi}_{i}\not\in\{\boldsymbol{0},\boldsymbol{\kappa}\}$ ,

[TABLE] 2. 2.

For $\boldsymbol{\xi}_{i}\in\{\boldsymbol{0},\boldsymbol{\kappa}\}$ ,

[TABLE]

Proof.

Let us consider the first item. It follows from Lemma 1 that

[TABLE]

where we write $X\oplus_{i}\boldsymbol{\kappa}=((X\oplus_{i}\boldsymbol{\kappa})_{j})_{j=1}^{n}$ . For $\boldsymbol{\xi}_{i}\not\in\{\boldsymbol{0},\boldsymbol{\kappa}\}$ with $\boldsymbol{\kappa}\neq\boldsymbol{0}$ , we have $\boldsymbol{\xi}_{i}\oplus\boldsymbol{\kappa}\neq\boldsymbol{0}$ , which implies $\eta_{i+1}(X\oplus_{i}\boldsymbol{\kappa})=\eta_{i+1}(X)$ . Thus, by the definition of $X\oplus_{i}\boldsymbol{\kappa}$ , we have

[TABLE]

and $\eta_{j}(X\oplus_{i}\boldsymbol{\kappa})=\eta_{j}(X)$ for $1\leq j\leq n+1$ . Using these facts, we have

[TABLE]

Hence we have the result.

Let us move on to the second item. For $\boldsymbol{\xi}_{i}\in\{\boldsymbol{0},\boldsymbol{\kappa}\}$ with $\boldsymbol{\kappa}\neq\boldsymbol{0}$ , we have $\{\boldsymbol{\xi}_{i},\boldsymbol{\xi}_{i}\oplus\boldsymbol{\kappa}\}=\{(0,0),\boldsymbol{\kappa}\}$ , which implies $\eta_{i+1}(X\oplus_{i}\boldsymbol{\kappa})=-\eta_{i+1}(X)$ . Thus, by the definition of $X\oplus_{i}\boldsymbol{\kappa}$ , we have (7) and

[TABLE]

Using these equalities we have

[TABLE]

Hence we have the result. ∎

Let $X=(\boldsymbol{\xi}_{i})_{i=1}^{n}\in\mathbb{F}_{2}^{n\times 2}$ , $\boldsymbol{\kappa}\in\mathbb{F}_{2}^{2}\setminus\{\boldsymbol{0}\}$ and $1\leq i\leq n$ . By abuse of notation, we define the map $\cdot\oplus_{i}\boldsymbol{\kappa}|_{T^{(n)}(X)}$ also for a real vector $\boldsymbol{y}\in T^{(n)}(X)$ by

[TABLE]

As long as there is no risk of confusion, we simply denote it as $\boldsymbol{y}\oplus_{i}\boldsymbol{\kappa}$ . By comparing this definition with the results of Lemma 6, it is straightforward to see that the image of the restriction of the map $\cdot\oplus_{i}\boldsymbol{\kappa}$ to $T^{(n)}(X)$ is $T^{(n)}(X\oplus_{i}\boldsymbol{\kappa})$ . By the definition of $\boldsymbol{y}\oplus_{i}\boldsymbol{\kappa}$ , this map is isometric, and thus, is a $C^{1}$ function. This map has the following relationship with the group operator $\oplus_{i}$ .

Lemma 7.

For any $X\in\mathbb{F}_{2}^{n\times 2}$ , the following holds true:

For $1\leq i\leq n$ , $\boldsymbol{\kappa}\in\mathbb{F}_{2}^{2}$ , and $f\in L^{1}(T^{(n)}(X\oplus_{i}\boldsymbol{\kappa}))$ , we have

[TABLE] 2. 2.

For $1\leq i,i^{\prime}\leq n$ , $\boldsymbol{\kappa},\boldsymbol{\kappa}^{\prime}\in\mathbb{F}_{2}^{2}$ , and $f\in L^{1}(T^{(n)}(X\oplus_{i}\boldsymbol{\kappa}\oplus_{i^{\prime}}\boldsymbol{\kappa}^{\prime}))$ , we have

[TABLE] 3. 3.

For $\boldsymbol{y}\in T^{(n)}(X)$ and $\boldsymbol{\kappa}\in\mathbb{F}_{2}^{2}$ , we have

[TABLE]

and

[TABLE]

Proof.

Let us consider the first item. Since $\cdot\oplus_{i}\boldsymbol{\kappa}\colon T^{(n)}(X)\to T^{(n)}(X\oplus_{i}\boldsymbol{\kappa})$ is isometric, using the change of variables $\boldsymbol{z}=\boldsymbol{y}\oplus_{i}\boldsymbol{\kappa}$ , we have $\,\mathrm{d}\boldsymbol{z}=\,\mathrm{d}\boldsymbol{y}$ . Thus the result follows.

The second item follows from applying the first item twice.

Finally let us consider the third item. Since $\boldsymbol{y}\in T^{(n)}(X)$ , we also have $\boldsymbol{y}\in T^{(i^{\prime})}(X)\subset T^{(i)}(X)$ . As above, we have $\boldsymbol{y}\oplus_{i^{\prime}}\boldsymbol{\kappa}\in T^{(n)}(X\oplus_{i^{\prime}}\boldsymbol{\kappa})\subset T^{(i^{\prime})}(X\oplus_{i^{\prime}}\boldsymbol{\kappa})$ for $\boldsymbol{y}\in T^{(n)}(X)$ . Since the subregion $T^{(i)}(X\oplus_{i^{\prime}}\boldsymbol{\kappa})$ with $i<i^{\prime}$ does not depend on $\boldsymbol{\kappa}$ and is identical to $T^{(i)}(X)$ , we have

[TABLE]

Since we now know that $\boldsymbol{y},\boldsymbol{y}\oplus_{i}\boldsymbol{\kappa}\in T^{(i-1)}(X)$ for $\boldsymbol{y}\in T^{(n)}(X)$ , it follows that

[TABLE]

which completes the proof. ∎

Furthermore, we need the following maps $\sigma,p_{1},p_{2}$ all from $\mathbb{F}_{2}^{2}$ to $\mathbb{F}_{2}^{2}$ :

[TABLE]

For $\boldsymbol{\kappa}\in\mathbb{F}_{2}^{2}$ , let $P(\boldsymbol{\kappa})=\{p_{1}(\boldsymbol{\kappa}),p_{2}(\boldsymbol{\kappa})\}$ and $N(\boldsymbol{\kappa})=\mathbb{F}_{2}^{2}\setminus P(\boldsymbol{\kappa})$ . It is then trivial to see $P(\boldsymbol{\kappa})\cap N(\boldsymbol{\kappa})=\emptyset$ and $P(\boldsymbol{\kappa})\cup N(\boldsymbol{\kappa})=\mathbb{F}_{2}^{2}$ for any $\boldsymbol{\kappa}\in\mathbb{F}_{2}^{2}$ .

Now fix $K=(\boldsymbol{\kappa}_{i})_{i=1}^{n}\in\mathbb{F}_{2}^{n\times 2}$ with $\boldsymbol{\kappa}_{i}\in\mathbb{F}_{2}^{2}$ . We divide the set $\mathbb{F}_{2}^{n\times 2}$ into some mutually exclusive subsets:

[TABLE]

for $w=1,\ldots,v(K)-1$ . The following properties obviously hold:

[TABLE]

4.2 Bounds on Walsh coefficients

Using the division of $\mathbb{F}_{2}^{n\times 2}$ by $R_{w}(K)$ introduced in the previous section, we consider separating the $K$ -th Walsh coefficient $\hat{F}(K)$ of $F\colon\mathbb{F}_{2}^{n\times 2}\to\mathbb{R}$ into the following values $R_{w}\hat{F}(K)$ .

Definition 9.

Let $F\colon\mathbb{F}_{2}^{n\times 2}\to\mathbb{R}$ . For $K\in\mathbb{F}_{2}^{n\times 2}$ and $0\leq w\leq v(K)-1$ , we define the Walsh coefficient of $F$ on the subset $R_{w}(K)$ :

[TABLE]

Note that it is obvious to see

[TABLE]

Thus, in order to obtain an upper bound on $\hat{F}(K)$ , it suffices to show an upper bound on each $R_{w}\hat{F}(K)$ . For this goal, we first introduce the concept of dyadic differences.

Definition 10.

For a function $F\colon\mathbb{F}_{2}^{n\times 2}\to\mathbb{R}$ , the $i$ -th dyadic difference for $K=(\boldsymbol{\kappa}_{i})_{i=1}^{n}\in\mathbb{F}_{2}^{n\times 2}$ is defined by

[TABLE]

for $i=1,\ldots,n$ .

We now show the following key equalities on $R_{w}\hat{F}(K)$ and dyadic differences.

Lemma 8.

Let $F\colon\mathbb{F}_{2}^{n\times 2}\to\mathbb{R}$ be a function. For $K=(\boldsymbol{\kappa}_{i})_{i=1}^{n}\in\mathbb{F}_{2}^{n\times 2}\setminus\{\boldsymbol{0}\}$ , the following holds true:

For $0\leq w\leq v(K)-1$ , we have

[TABLE] 2. 2.

For $1\leq w\leq v(K)-1$ , we have

[TABLE] 3. 3.

For $1\leq w\leq v(K)-1$ , we have

[TABLE]

Proof.

Let us consider the first and second items. Denote

[TABLE]

We first show that we have

[TABLE]

For $1\leq i\leq n$ with $i\neq p$ , the $i$ -th components of $X\oplus_{p}\sigma(\boldsymbol{\kappa}_{p})$ and $X$ are same, and the $p$ -th component of $X\oplus_{p}\sigma(\boldsymbol{\kappa}_{p})$ is $\boldsymbol{\xi}_{p}\oplus\sigma(\boldsymbol{\kappa}_{p})$ for $X=(\boldsymbol{\xi}_{i})_{i=1}^{n}$ . Thus we only need to show that $\boldsymbol{\xi}_{p}\oplus\sigma(\boldsymbol{\kappa}_{p})$ belongs to the $p$ -th component of $R_{w}(K)$ . For $p=v(K)$ , it obviously holds since the $p$ -th component of $R_{w}(K)$ is $\mathbb{F}_{2}^{2}$ . For $p=w$ , the $p$ -th component of $R_{w}(K)$ is $P(\boldsymbol{\kappa}_{w})$ , and thus from the property

[TABLE]

we see that $\boldsymbol{\xi}_{p}\oplus\sigma(\boldsymbol{\kappa}_{p})$ belongs to the $p$ -th component of $R_{w}(K)$ .

Using the equality (9) and from the property of Walsh functions, we have

[TABLE]

Then it follows that

[TABLE]

which completes the proof of the second item by putting $p=w$ . Let us consider the case $p=v(K)$ . From the definition of $v$ , we have $\boldsymbol{\kappa}_{v(K)}\neq 0$ and thus $\mathrm{wal}_{\boldsymbol{\kappa}_{v(K)}}(\sigma(\boldsymbol{\kappa}_{v(K)}))=-1$ . Hence we have the result for the first item.

From the result for the first item, to which the result for the second item is applied with $F$ replaced by $d_{K}^{(v(K))}F$ , we have

[TABLE]

Hence the result for the third item follows. ∎

Converting the dyadic differences to the usual derivatives, we shall get a bound on $R_{w}\hat{F}(K)$ where $F$ denotes the $n$ -th discretized function of $f\in C^{2}(T)$ . As a preparation we need the following lemma.

Lemma 9.

Let $\boldsymbol{y},\boldsymbol{z}_{1},\boldsymbol{z}_{2}\in\mathbb{R}^{2}$ with $\boldsymbol{y},\boldsymbol{y}+\boldsymbol{z}_{1},\boldsymbol{y}+\boldsymbol{z}_{2},\boldsymbol{y}+\boldsymbol{z}_{1}+\boldsymbol{z}_{2}\in T$ . For $f\in C^{2}(T)$ , we have

[TABLE]

and

[TABLE]

Proof.

Since $T$ is convex, we have $\{\boldsymbol{y}+s\boldsymbol{z}_{1}+t\boldsymbol{z}_{2}\mid 0\leq s,t\leq 1\}\subset T$ . Following a similar argument as in the proof of Lemma 4, we can get the first inequality of this lemma. Thus let us focus on the second one. Again in a similar way as in the proof of Lemma 4, we have for $\boldsymbol{z}_{1}=(z_{11},z_{12})$ , $\boldsymbol{z}_{2}=(z_{21},z_{22})$

[TABLE]

The summand in the last expression for a given $i$ is bounded by

[TABLE]

from which the second inequality of this lemma obviously follows. ∎

Eventually we arrive at showing upper bounds on $R_{w}\hat{F}(K)$ and $\hat{F}$ .

Lemma 10.

Let $f\in C^{2}(T)$ be a function and $F_{n}:\mathbb{F}_{2}^{n\times 2}\to\mathbb{R}$ be its $n$ -th discretized function. For any $K\in\mathbb{F}_{2}^{n\times 2}$ , we have

[TABLE]

Proof.

First we recall that $\mathrm{wal}_{\boldsymbol{\kappa}_{v(K)}}(\sigma(\boldsymbol{\kappa}_{v(K)}))=-1$ holds since $\boldsymbol{\kappa}_{v(K)}\neq 0$ . Thus for any $X\in\mathbb{F}_{2}^{n\times 2}$ we have

[TABLE]

We use this equality without any notice.

We now show a bound on $R_{0}\hat{F}_{n}(K)$ . From the first item of Lemma 8 and the triangle inequality, we have

[TABLE]

From the obvious fact $|T^{(n)}(X\oplus_{v}\sigma(\boldsymbol{\kappa}_{v(K)}))|=|T^{(n)}(X)|$ and the first item of Lemma 7, we have

[TABLE]

where we use the result in Lemma 9 with $\boldsymbol{z}_{1}=\boldsymbol{y}\oplus_{v(K)}\sigma(\boldsymbol{\kappa}_{v(K)})-\boldsymbol{y}$ in the second inequality, and then the third item of Lemma 7 in the last inequality. Thus we obtain a bound on $R_{0}\hat{F}_{n}(K)$ :

[TABLE]

Next we show a bound on $R_{w}\hat{F}_{n}(K)$ for $1\leq w\leq v(K)-1$ . From the third item of Lemma 8 and the triangle inequality, we have

[TABLE]

From the second item of Lemma 7, we have

[TABLE]

In what follows, we continue with further arguments separately for the cases $\boldsymbol{\kappa}_{w}=\boldsymbol{0}$ and $\boldsymbol{\kappa}_{w}\neq\boldsymbol{0}$ .

Let us consider the case $\boldsymbol{\kappa}_{w}\neq\boldsymbol{0}$ . In this case we have $\mathrm{wal}_{\boldsymbol{\kappa}_{w}}(\sigma(\boldsymbol{\kappa}_{w}))=-1$ . Thus we obtain

[TABLE]

It is easy to see by definition that $\boldsymbol{0},\sigma(\boldsymbol{\kappa}_{w})\not\in P(\boldsymbol{\kappa}_{w})$ for $\boldsymbol{\kappa}_{w}\neq\boldsymbol{0}$ . Since $X=(\boldsymbol{\xi}_{i})_{i=1}^{n}\in R_{w}(K)$ implies

[TABLE]

we have $\boldsymbol{\xi}_{w}\neq\boldsymbol{0},\sigma(\boldsymbol{\kappa}_{w})$ . Further it follows from the third item of Lemma 7 that $\boldsymbol{y}\oplus_{v(K)}\sigma(\boldsymbol{\kappa}_{v(K)})\in T^{(w)}(X)$ . Thus we obtain

[TABLE]

and

[TABLE]

Comparing these equalities gives

[TABLE]

from which we see that $\boldsymbol{y},\boldsymbol{y}\oplus_{w}\sigma(\boldsymbol{\kappa}_{w}),(\boldsymbol{y}\oplus_{v(K)}\sigma(\boldsymbol{\kappa}_{v(K)}))\oplus_{w}\sigma(\boldsymbol{\kappa}_{w}),\boldsymbol{y}\oplus_{v(K)}\sigma(\boldsymbol{\kappa}_{v(K)})$ form a parallelogram. By using the result in Lemma 9 with $\boldsymbol{z}_{1}=\boldsymbol{y}\oplus_{w}\sigma(\boldsymbol{\kappa}_{w})-\boldsymbol{y}$ and $\boldsymbol{z}_{2}=\boldsymbol{y}\oplus_{v(K)}\sigma(\boldsymbol{\kappa}_{v(K)})-\boldsymbol{y}$ and then the third item of Lemma 7 again, we have

[TABLE]

Thus we obtain a bound on $R_{w}\hat{F}_{n}(K)$ :

[TABLE]

Let us move onto the case $\boldsymbol{\kappa}_{w}=\boldsymbol{0}$ . In this case we have $\mathrm{wal}_{\boldsymbol{\kappa}_{w}}(\sigma(\boldsymbol{\kappa}_{w}))=1$ . Thus we obtain

[TABLE]

Since $\boldsymbol{\kappa}_{w}=\boldsymbol{0}$ , we see that

[TABLE]

Thus, from (8) we obtain

[TABLE]

and

[TABLE]

Comparing these equalities gives

[TABLE]

from which we see that $\boldsymbol{y},\boldsymbol{y}\oplus_{v(K)}\sigma(\boldsymbol{\kappa}_{v(K)}),\boldsymbol{y}\oplus_{w}\sigma(\boldsymbol{\kappa}_{w}),(\boldsymbol{y}\oplus_{v(K)}\sigma(\boldsymbol{\kappa}_{v(K)}))\oplus_{w}\sigma(\boldsymbol{\kappa}_{w})$ form a parallelogram. (Note that the points $\boldsymbol{y}$ and $(\boldsymbol{y}\oplus_{v(K)}\sigma(\boldsymbol{\kappa}_{v(K)}))\oplus_{w}\sigma(\boldsymbol{\kappa}_{w})$ form not the diagonal but the edge of the parallelogram unlike in the case of $\boldsymbol{\kappa}_{w}\neq\boldsymbol{0}$ .) By using the result in Lemma 9 with $\boldsymbol{z}_{1}=(\boldsymbol{y}\oplus_{v(K)}\sigma(\boldsymbol{\kappa}_{v(K)}))\oplus_{w}\sigma(\boldsymbol{\kappa}_{w})-\boldsymbol{y}$ and $\boldsymbol{z}_{2}=\boldsymbol{y}\oplus_{v(K)}\sigma(\boldsymbol{\kappa}_{v(K)})-\boldsymbol{y}$ and then the third item of Lemma 7 again together with the triangle inequality, we have

[TABLE]

Thus we obtain a bound on $R_{w}\hat{F}_{n}(K)$ :

[TABLE]

Finally a bound on $\hat{F}_{n}(K)$ is given by

[TABLE]

Hence we complete the proof of this lemma. ∎

Bibliography15

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] N. S. Bakhvalov, Approximate computation of multiple integrals (in Russian), Vestnik Moskov. Univ. Ser. Mat. Meh. Astr. Fiz. Him. 4 (1959), 3–18.
2[2] K. Basu, A. B. Owen, Low discrepancy constructions in the triangle, SIAM J. Numer. Anal. 53 (2015), 743–761.
3[3] K. Basu, A. B. Owen, Transformations and Hardy-Krause variation, SIAM J. Numer. Anal. 54 (2016), 1946–1966.
4[4] L. Brandolini, L. Colzani, G. Gigante, G. Travaglini, A Koksma–Hlawka inequality for simplices, in: Trends in Harmonic Analysis, Springer, New York, 2013, pp. 33–46.
5[5] J. Dick, A. Hinrichs and F. Pillichshammer, Proof techniques in quasi-Monte Carlo theory, J. Complexity 31 (2015), 327–371.
6[6] J. Dick, F. Pillichshammer, Digital Nets and Sequences: Discrepancy Theory and Quasi-Monte Carlo Integration , Cambridge University Press, Cambridge, 2010.
7[7] K.-T. Fang, Y. Wang, Number-Theoretic Methods in Statistics , Chapman & Hall, London, 1994.
8[8] H. Niederreiter, Low-discrepancy point sets, Monatsh. Math., 102 (1986), 155–167.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Quasi-Monte Carlo integration for twice differentiable functions over a triangle††thanks:

Abstract

1 Introduction

Theorem 1**.**

2 Explicit construction

2.1 Recursive partitioning

Lemma 1**.**

2.2 Generating infinite sequences of points in a triangle

Definition 1**.**

Remark 1**.**

Definition 2**.**

Remark 2**.**

Definition 3**.**

2.3 Dual net and a new weight function

Definition 4**.**

Remark 3**.**

Definition 5**.**

Remark 4**.**

Definition 6**.**

Lemma 2**.**

Proof.

Lemma 3**.**

3 Upper bound

3.1 Discretized function on a triangle

Definition 7**.**

Lemma 4**.**

Proof.

3.2 Walsh functions and coefficients

Definition 8**.**

Lemma 5**.**

3.3 Proof of the main result

Theorem 2**.**

Proof.

4 Walsh analysis on a triangle

4.1 Definitions and basic results

Lemma 6**.**

Proof.

Lemma 7**.**

Proof.

4.2 Bounds on Walsh coefficients

Definition 9**.**

Definition 10**.**

Lemma 8**.**

Proof.

Lemma 9**.**

Proof.

Lemma 10**.**

Proof.

Theorem 1.

Lemma 1.

Definition 1.

Remark 1.

Definition 2.

Remark 2.

Definition 3.

Definition 4.

Remark 3.

Definition 5.

Remark 4.

Definition 6.

Lemma 2.

Lemma 3.

Definition 7.

Lemma 4.

Definition 8.

Lemma 5.

Theorem 2.

Lemma 6.

Lemma 7.

Definition 9.

Definition 10.

Lemma 8.

Lemma 9.

Lemma 10.