Testing Graphs against an Unknown Distribution

Lior Gishboliner; Asaf Shapira

arXiv:1905.09903·math.CO·September 29, 2021

TL;DR

This paper characterizes which graph properties remain testable when the vertex distribution is unknown and arbitrary, extending classical graph property testing to a more general and realistic setting.

Contribution

The paper provides a complete characterization of testable graph properties under unknown vertex distributions, including a new removal lemma for vertex-weighted graphs.

Findings

1. Characterization of testable properties under unknown distributions
2. A new removal lemma for vertex-weighted graphs
3. Extension of classical graph testing models

Abstract

The area of graph property testing seeks to understand the relation between the global properties of a graph and its local statistics. In the classical model, the local statistics of a graph is defined relative to a uniform distribution over the graph's vertex set. A graph property $P$ is said to be testable if the local statistics of a graph can allow one to distinguish between graphs satisfying $P$ and those that are far from satisfying it. Goldreich recently introduced a generalization of this model in which one endows the vertex set of the input graph with an arbitrary and unknown distribution, and asked which of the properties that can be tested in the classical model can also be tested in this more general setting. We completely resolve this problem by giving a (surprisingly "clean") characterization of these properties. To this end, we prove a removal lemma…

Equations

$$\mathbb{E}\left[\sum_{W\in\mathcal{P}}\sum_{\{x,y\}\in\binom{W}{2}}\mathcal{D}(x)\mathcal{D}(y)\right]=\sum_{\{x,y\}\in\binom{U}{2}}\mathcal{D}(x)\mathcal{D}(y)\cdot\frac{1}{k}<\frac{1}{2}\cdot\frac{1}{k}<\eta,$$

$$\mathcal{D}_{U^{\prime}}(u)=\frac{\mathcal{D}(u)}{\mathcal{D}(U^{\prime})}\leq\frac{\frac{1}{2a}}{1-\frac{1}{a}}=\frac{1}{2(a-1)}.$$

$$\mathcal{D}(U_{i})=\mathcal{D}_{U^{\prime}}(U_{i})\cdot\mathcal{D}(U^{\prime})\geq\frac{1}{2(a-1)}\cdot\mathcal{D}(U^{\prime})\geq\frac{1}{2(a-1)}\cdot\left(1-\frac{1}{a}\right)=\frac{1}{2a}$$

$$d(X^{+},Y)>\frac{1}{\mathcal{D}(X^{+})\mathcal{D}(Y)}\cdot\mathcal{D}(X^{+})\mathcal{D}(Y)\cdot(d+\varepsilon)=d+\varepsilon.$$

$$\delta=\delta(h,\eta)=\min\left\{\frac{1}{4(h-1)},\frac{\eta}{2},\frac{1}{2}\cdot\left(\frac{\eta}{2}\right)^{h-1}\cdot\delta(h-1,\eta/2)\right\}.$$

$$\sum_{(u_{2},\dots,u_{h})\in\mathcal{U}^{\prime}}\prod_{i=2}^{h}\mathcal{D}(u_{i})\geq\delta(h-1,\eta/2)\cdot\prod_{i=2}^{h}\mathcal{D}(U_{i}^{\prime})\geq\delta(h-1,\eta/2)\cdot(\eta/2)^{h-1}\cdot\prod_{i=2}^{h}\mathcal{D}(U_{i})\geq 2\delta\prod_{i=2}^{h}\mathcal{D}(U_{i}).$$

$$\sum_{(u_{1},\dots,u_{h})\in\mathcal{U}}\prod_{i=1}^{h}\mathcal{D}(u_{i})\geq\sum_{u_{1}\in U_{1}^{\prime}}\mathcal{D}(u_{1})\cdot 2\delta\prod_{i=2}^{h}\mathcal{D}(U_{i})=\mathcal{D}(U_{1}^{\prime})\cdot 2\delta\prod_{i=2}^{h}\mathcal{D}(U_{i})\geq\delta\prod_{i=1}^{h}\mathcal{D}(U_{i}),$$

$$\zeta=\zeta_{\ref{lem:Turan_Ramsey}}(t,\delta)=\frac{1}{4a^{2}\cdot T_{\ref{lem:reg}}(\varepsilon,a)}.$$

$$\mathcal{D}(Q_{i})\geq\frac{\mathcal{D}(P_{i})}{3r|\mathcal{Q}|}\geq\frac{\varepsilon}{3|\mathcal{P}^{\prime}|^{2}|\mathcal{Q}|}\geq\frac{\varepsilon}{3|\mathcal{Q}|^{3}}\geq\frac{\varepsilon}{3s^{3}}=\frac{1}{S},$$

$$\mathbb{E}\left[\sum_{1\leq i<j\leq r}\mathcal{D}(P_{i})\mathcal{D}(P_{j})\cdot\left|d(Q_{i},Q_{j})-d(P_{i},P_{j})\right|\right]=\sum_{1\leq i<j\leq r}\;\sum_{Q_{i}^{\prime}\in\mathcal{Q}_{i},\,Q_{j}^{\prime}\in\mathcal{Q}_{j}}\mathcal{D}(Q_{i}^{\prime})\mathcal{D}(Q_{j}^{\prime})\cdot\left|d(Q_{i}^{\prime},Q_{j}^{\prime})-d(P_{i},P_{j})\right|\leq\frac{\varepsilon}{3},$$

$$\mathcal{E}_{s}(r)=\min\left\{\frac{\varepsilon}{2},\frac{1}{\Psi(s+r)}\right\}.$$

$$S^{\prime}(s)=S_{\ref{lem:representatives}}\left(\mathcal{E}_{s},2^{s}\cdot\lceil 1/\varepsilon\rceil\right),\qquad S^{\prime\prime}(s)=\max\left\{s,\frac{2S^{\prime}(s)}{\varepsilon}\cdot\Psi(s+S^{\prime}(s))\right\}.$$

$$S=S_{\ref{lem:iterations}}(\Psi,\varepsilon)=s_{\lceil 2/\varepsilon\rceil}.$$

$$r\leq S_{\ref{lem:representatives}}\left(\mathcal{E}_{s},2^{s}\cdot\lceil 1/\varepsilon\rceil\right)=S^{\prime}(s).$$

$$\mathcal{D}(Q_{i})=\mathcal{D}_{Y^{\prime}}(Q_{i})\cdot\mathcal{D}(Y^{\prime})\geq\mathcal{D}_{Y^{\prime}}(Q_{i})\cdot\frac{\varepsilon}{2}\geq\frac{\varepsilon}{2S_{\ref{lem:representatives}}\left(\mathcal{E}_{s},2^{s}\cdot\lceil 1/\varepsilon\rceil\right)}=\frac{\varepsilon}{2S^{\prime}(s)}\geq\frac{1}{S^{\prime\prime}(s)}\geq\frac{1}{S^{\prime\prime}(s_{\lceil 2/\varepsilon\rceil-1})}=\frac{1}{s_{\lceil 2/\varepsilon\rceil}}=\frac{1}{S},$$

$$\frac{1}{S^{\prime\prime}(s)}\leq\frac{1}{\Psi(s+r)}\cdot\mathcal{D}(Q_{i})\leq\frac{1}{\Psi(|X|+r)}\cdot\mathcal{D}(Q_{i}),$$

$$\zeta:\mathbb{N}\to(0,1),\qquad\zeta(m)=\zeta_{\ref{lem:Turan_Ramsey}}\left(\Psi(m),\frac{1}{\Psi(m)}\right),$$

$$\Psi^{\prime}:\mathbb{N}\to\mathbb{N},\qquad\Psi^{\prime}(m)=\frac{2\Psi(m)}{\zeta(m)}.$$

$$S=S_{\ref{lem:reg_main}}(\Psi,\varepsilon):=\frac{S_{\ref{lem:iterations}}(\Psi^{\prime},\varepsilon)}{\zeta\left(S_{\ref{lem:iterations}}(\Psi^{\prime},\varepsilon)\right)}\geq S_{\ref{lem:iterations}}(\Psi^{\prime},\varepsilon).$$

$$\mathcal{D}(u)<\frac{1}{\Psi^{\prime}(m)}\cdot\mathcal{D}(Q_{i})<\frac{\zeta(m)}{\Psi(m)}\cdot\mathcal{D}(Q_{i})\leq\zeta(m)\cdot\mathcal{D}(Q_{i})$$

$$\mathcal{D}(Q_{i,k})\geq\zeta(m)\cdot\mathcal{D}(Q_{i})=\zeta(|X|+r)\cdot\mathcal{D}(Q_{i})\geq\zeta(|X|+r)\cdot\frac{1}{S_{\ref{lem:iterations}}(\Psi^{\prime},\varepsilon)}\geq\frac{\zeta\left(S_{\ref{lem:iterations}}(\Psi^{\prime},\varepsilon)\right)}{S_{\ref{lem:iterations}}(\Psi^{\prime},\varepsilon)}=\frac{1}{S},$$

$$\Psi_{\mathcal{F}}(m)=\max_{K\in\mathcal{F}_{m}}\;\min_{F\in\mathcal{F}:\,F\to K}|V(F)|.$$

$$\Psi(m)=\max\left\{\frac{8}{\varepsilon},\Psi_{\mathcal{F}}(m),\frac{1}{\delta_{\ref{lem:counting}}\left(\Psi_{\mathcal{F}}(m),\frac{\varepsilon}{8}\right)}\right\},$$

$$s=s_{\mathcal{P}}(\varepsilon):=\frac{2S^{S+1}}{\delta_{\ref{lem:counting}}\left(S,\frac{\varepsilon}{8}\right)}.$$

$$\sum_{1\leq i<j\leq r}\mathcal{D}(P_{i})\mathcal{D}(P_{j})\cdot\left(\left|d(Q_{i},Q_{j})-d(P_{i},P_{j})\right|+\frac{\varepsilon}{4}\right)\leq\frac{\varepsilon}{4}+\sum_{1\leq i<j\leq r}\mathcal{D}(P_{i})\mathcal{D}(P_{j})\cdot\left|d(Q_{i},Q_{j})-d(P_{i},P_{j})\right|\leq\frac{\varepsilon}{2}.$$

$$\sum_{(u_{i,k})_{i,k}\in\mathcal{U}}\prod_{i=1}^{r}\prod_{k=1}^{f_{i}}\mathcal{D}(u_{i,k})\geq\delta_{\ref{lem:counting}}\left(h,\frac{\varepsilon}{8}\right)\cdot\prod_{i=1}^{r}\prod_{k=1}^{f_{i}}\mathcal{D}(U_{i,k})\geq\delta_{\ref{lem:counting}}\left(\Psi_{\mathcal{F}}(m),\frac{\varepsilon}{8}\right)\cdot S^{-|W|},$$

$$\delta_{\ref{lem:counting}}\left(\Psi_{\mathcal{F}}(m),\frac{\varepsilon}{8}\right)\cdot S^{-|X|-|W|}.$$

$$1-\left(1-\delta_{\ref{lem:counting}}\left(S,\frac{\varepsilon}{8}\right)\cdot S^{-S}\right)^{s/S}=1-\left(1-\delta_{\ref{lem:counting}}\left(S,\frac{\varepsilon}{8}\right)\cdot S^{-S}\right)^{\frac{2S^{S}}{\delta_{\ref{lem:counting}}\left(S,\frac{\varepsilon}{8}\right)}}\geq 1-e^{-2}\geq\frac{2}{3},$$

Full text

Testing Graphs against an Unknown Distribution (footnote 1)

Footnote 1: A preliminary version of this paper has appeared in the Proceedings of STOC ’19.

Lior Gishboliner

School of Mathematics, Tel Aviv University, Tel Aviv 69978, Israel. Email: [email protected]. Supported in part by ERC Starting Grant 633509.

Asaf Shapira

School of Mathematics, Tel Aviv University, Tel Aviv 69978, Israel. Email: asafico $@$ tau.ac.il. Supported in part by ISF Grant 1028/16 and ERC Starting Grant 633509.

Abstract

The area of graph property testing seeks to understand the relation between the global properties of a graph and its local statistics. In the classical model, the local statistics of a graph are defined relative to a uniform distribution over the graph’s vertex set. A graph property ${\cal P}$ is said to be testable if the local statistics of a graph allow one to distinguish between graphs satisfying ${\cal P}$ and those that are far from satisfying it.

Goldreich recently introduced a generalization of this model in which one endows the vertex set of the input graph with an arbitrary and unknown distribution, and asked which of the properties that can be tested in the classical model can also be tested in this more general setting. We completely resolve this problem by giving a (surprisingly “clean”) characterization of these properties. To this end, we prove a removal lemma for vertex-weighted graphs, which is of independent interest.

1 Introduction

1.1 Background and the main result

Property testers are fast randomized algorithms whose goal is to distinguish (with high probability, say, $2/3$) between objects satisfying some fixed property ${\cal P}$ and those that are $\varepsilon$-far from satisfying it. Here, $\varepsilon$-far means that an $\varepsilon$-fraction of the input object should be modified in order to obtain an object satisfying ${\cal P}$. The study of such problems originated in the seminal papers of Rubinfeld and Sudan [28], Blum, Luby and Rubinfeld [9], and Goldreich, Goldwasser and Ron [20]. Problems of this nature have been studied in so many areas that it would be impossible to survey them here. Instead, the reader is referred to the recent monograph [18] for more background and references. While this area studies questions in theoretical computer science, it has several strong connections with central problems in extremal combinatorics, most notably to the regularity method and the removal lemma; see Subsection 1.2.

The classical property testing model assumes that one can uniformly sample entries of the input. In distribution-free testing, one assumes that the input is endowed with some arbitrary and unknown distribution ${\cal D}$ , which also affects the way one defines the distance to satisfying a property. As discussed in [19], one motivation for this model is that it can handle settings in which one cannot produce uniformly distributed entries from the input. Another motivation is that the distribution ${\cal D}$ can assign higher weight/importance to parts of the input which we want to have higher impact on the distance to satisfying the given property. Until very recently, problems of this type were studied almost exclusively in the setting of testing properties of functions, see [10, 11, 15, 17, 24]. Let us mention that distribution-free testing is similar in spirit to the celebrated PAC learning model of Valiant [31], see also the discussion in [27].

Our investigation here concerns a distribution-free variant of the adjacency matrix model, also known as the dense graph model. The adjacency matrix model was first defined and studied in [20], where the area of property testing was first introduced. This model has been extensively studied in the past two decades, see Chapter $8$ of [18]. For a selected (but certainly not comprehensive) list of works on the dense graph model of property testing, see [2, 21, 23].

Instead of defining the adjacency matrix model of [20], let us directly define its distribution-free variant, which was introduced recently by Goldreich [19]. Since the distribution in this model is over the input’s vertices, it is called the Vertex-Distribution-Free (VDF) model (footnote 2). The input to the algorithm is a graph $G$ and some arbitrary and unknown distribution ${\cal D}$ on $V(G)$. We will thus usually refer to the input as the pair $(G,{\cal D})$. For a pair of graphs $G_{1},G_{2}$ on the same vertex-set $V$, and for a distribution $\mathcal{D}$ on $V$, the (edit) distance between $G_{1}$ and $G_{2}$ with respect to $\mathcal{D}$ is defined as $\sum_{\{x,y\}\in E(G_{1})\triangle E(G_{2})}{\mathcal{D}(x)\mathcal{D}(y)}$. We say that $(G,{\cal D})$ is $\varepsilon$-far from satisfying a graph property (footnote 3) ${\cal P}$ if for every $G^{\prime}\in{\cal P}$, the distance between $G$ and $G^{\prime}$ with respect to $\mathcal{D}$ is at least $\varepsilon$. A tester for a graph property $\mathcal{P}$ is an algorithm that receives as input a pair $(G,\mathcal{D})$ and a proximity parameter $\varepsilon$, and distinguishes with high probability (say $\frac{2}{3}$) between the case that $G$ satisfies $\mathcal{P}$ and the case that $(G,\mathcal{D})$ is $\varepsilon$-far from $\mathcal{P}$. The algorithm has access to a device that produces random vertices from $G$ distributed according to ${\cal D}$. The only (footnote 4) other way the algorithm can access $G$ is by performing “edge queries” of the form “is $(u,v)$ an edge of $G$?”.
We say that property ${\cal P}$ is testable in the VDF model if there is a function $q(\varepsilon)$ and a tester for $\mathcal{P}$ that always performs a total number of at most $q(\varepsilon)$ vertex samples and edge queries to the input. We stress again that ${\cal D}$ is unknown to the tester, so (in particular) $q$ should be independent of ${\cal D}$. The function $q$ is sometimes referred to as the sample (or query) complexity of the tester. A tester has 1-sided error if it always accepts an input satisfying ${\cal P}$. Otherwise it has 2-sided error.

Footnote 2: Goldreich suggested to study variants of this model in other settings (such as bounded degree graphs [22]) as well. For brevity, we will use the term “VDF model” to refer to the “VDF variant of the adjacency matrix model”.

Footnote 3: A graph property is simply a family of graphs closed under isomorphism.

Footnote 4: Note that the algorithm does not receive $|V(G)|$ as part of the input.
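As an illustration of the distance just defined, the following is a minimal Python sketch that sums $\mathcal{D}(x)\mathcal{D}(y)$ over the symmetric difference of two edge sets. The function name `weighted_distance`, the dictionary representation of $\mathcal{D}$, and the three-vertex example are assumptions made for this sketch and do not come from the paper.

```python
from fractions import Fraction

def weighted_distance(edges1, edges2, weights):
    """Distance between two graphs on the same vertex set with respect to D.

    edges1, edges2: iterables of 2-element vertex pairs (undirected edges).
    weights: dict mapping each vertex v to its probability D(v).
    """
    e1 = {frozenset(e) for e in edges1}
    e2 = {frozenset(e) for e in edges2}
    total = 0
    # Sum D(x) * D(y) over pairs that are an edge in exactly one of the graphs.
    for pair in e1 ^ e2:
        x, y = tuple(pair)
        total += weights[x] * weights[y]
    return total

# Hypothetical example: a triangle versus a single edge on vertices a, b, c.
D = {"a": Fraction(1, 2), "b": Fraction(1, 4), "c": Fraction(1, 4)}
triangle = [("a", "b"), ("b", "c"), ("a", "c")]
single_edge = [("a", "b")]
dist = weighted_distance(triangle, single_edge, D)  # D(b)D(c) + D(a)D(c)
```

Here the two graphs differ on the pairs $\{b,c\}$ and $\{a,c\}$, so the distance is $\frac{1}{16}+\frac{1}{8}=\frac{3}{16}$; changing the weights changes the distance even though the edit set stays the same.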

Suppose we assume that in the VDF model, the distribution ${\cal D}$ is restricted to be the uniform distribution; in particular, the distance between $n$-vertex graphs $G,G^{\prime}$ (on the same vertex-set) is $|E(G)\triangle E(G^{\prime})|/n^{2}$, and $G$ is $\varepsilon$-far from ${\cal P}$ if one needs to change at least $\varepsilon n^{2}$ edges to turn $G$ into a graph satisfying ${\cal P}$. In this paper we will refer to this model as the standard model. This model is “basically” equivalent to the adjacency matrix model, which was introduced in [20]. We refer the reader to [19] for a discussion on the subtle differences between the adjacency matrix model and the above defined standard model (footnote 5).

Footnote 5: Just as an example, in [20] the tester “knows” $|V(G)|$ while in the VDF model (and thus also in the standard model) it does not.

A very elegant result proved in [19] states that if ${\cal P}$ is testable in the VDF model then it is testable in the standard model with one-sided error. A natural follow-up question, raised by Goldreich in [19], asks whether every property which is testable with one-sided error in the standard model is also testable in the VDF model. A characterization of the properties testable with one-sided error in the standard model was given in [5], where it was shown that these are precisely the semi-hereditary properties (see [5] for the definition of this term). We show (see Proposition 4.2) that if $\mathcal{P}$ is testable in the VDF model then $\mathcal{P}$ is hereditary (footnote 6). Since there are properties which are semi-hereditary but not hereditary, this implies a negative answer to Goldreich’s question. Thus, it is natural to ask the following revised version of Goldreich’s question:

Footnote 6: A graph property is hereditary if it is closed under removal of vertices.

Problem 1.1.

Are all hereditary graph properties testable in the VDF model?

It might be natural to guess (footnote 7) that every hereditary property is testable in the VDF model, the justification being that all lemmas that were used in [5] should also hold for weighted graphs. As it turns out, this is indeed the case. However, putting all these lemmas together does not seem to work in the VDF model. As our main result, Theorem 1 below, shows, it is no coincidence that the proof technique of [5] does not carry over as is to the weighted setting.

Footnote 7: This was at least our initial guess.

We start with an important definition. Let us say that a graph property $\mathcal{P}$ is extendable if for every graph $G$ satisfying $\mathcal{P}$ there is a graph $G^{\prime}$ on $|V(G)|+1$ vertices which satisfies $\mathcal{P}$ and contains $G$ as an induced subgraph. In other words, $\mathcal{P}$ is extendable if whenever $G$ is a graph satisfying $\mathcal{P}$ and $v$ is a “new” vertex (i.e. $v\notin V(G)$ ), one can connect $v$ to $V(G)$ in such a way that this larger graph will also satisfy $\mathcal{P}$ . Note that if $\mathcal{P}$ is extendable then in fact for every graph $G\in\mathcal{P}$ and for every $n>|V(G)|$ , there is an $n$ -vertex graph satisfying $\mathcal{P}$ which contains $G$ as an induced subgraph. Our main result in this paper is the following:

Theorem 1.

A graph property is testable in the VDF model if and only if it is hereditary and extendable.

It is interesting to compare the above (rather) simple characterization of the properties that are testable in the VDF model, with the (very) complicated characterization of [2] of the properties that are testable in the standard model.

Let us mention some immediate consequences of Theorem 1. Since a graph cannot contain both an isolated vertex and a vertex connected to all other vertices, we infer that for every fixed $H$ the (hereditary) property of being induced $H$ -free is extendable. We thus infer that:

Corollary 2.

The property of being induced $H$-free is testable in the VDF model for every fixed $H$.

It is also clear that the property of being $H$ -free is extendable if and only if $H$ has no isolated vertices. We thus infer that:

Corollary 3.

The property of being $H$-free is testable in the VDF model if and only if $H$ has no isolated vertices.

It is easy to see that most (natural) hereditary graph properties are extendable, so Theorem 1 immediately implies that they are all testable in the VDF model. These include the properties of being Perfect, Interval, Chordal and $k$-Colorable. In the other direction, Theorem 1 implies that if $H$ has an isolated vertex then $H$-freeness is not testable in the VDF model. If one is interested in a more “natural” non-extendable hereditary property, then it is not hard to see that another such example is the property ${\cal P}$ of being induced $\{A,B\}$-free, where $A$ (resp. $B$) is the graph obtained from the $2$-edge path $P_{2}$ by adding a new vertex which is adjacent to all $3$ vertices of $P_{2}$ (resp. not adjacent to any vertex of $P_{2}$). It is easy to see that $C_{5}$ satisfies ${\cal P}$ but cannot be extended: no graph on $6$ vertices that satisfies ${\cal P}$ contains $C_{5}$ as an induced subgraph. It was proved in [19] that the properties of being Hamiltonian, Eulerian and Connected are not testable in the VDF model. Those three results follow immediately from our Theorem 1 since these properties are not hereditary.
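The $C_{5}$ example above can be checked by brute force. The following Python sketch enumerates all $2^{5}$ ways of attaching a new vertex to $C_{5}$ and tests each resulting graph for an induced copy of $A$ or $B$. All function names are hypothetical, and the check relies on two elementary observations: $A$ is the only $4$-vertex graph with $5$ edges, and $B$ is the only $4$-vertex graph whose $2$ edges share an endpoint.

```python
from itertools import combinations

def induced_edges(vertices, adj):
    """Edge set (as frozensets) of the subgraph induced by `vertices`."""
    return {frozenset((a, b)) for a, b in combinations(vertices, 2) if b in adj[a]}

def is_A(quad, adj):
    # A is K4 minus one edge: the only graph with 5 edges on 4 vertices.
    return len(induced_edges(quad, adj)) == 5

def is_B(quad, adj):
    # B is a 2-edge path plus an isolated vertex: exactly 2 edges sharing a vertex.
    edges = list(induced_edges(quad, adj))
    return len(edges) == 2 and len(edges[0] & edges[1]) == 1

def satisfies_P(adj):
    """True if the graph contains no induced copy of A or B."""
    return not any(is_A(q, adj) or is_B(q, adj) for q in combinations(list(adj), 4))

def extend(adj, new_vertex, neighbours):
    """Return a copy of the graph with `new_vertex` joined to `neighbours`."""
    bigger = {v: set(nbrs) for v, nbrs in adj.items()}
    bigger[new_vertex] = set(neighbours)
    for u in neighbours:
        bigger[u].add(new_vertex)
    return bigger

c5 = {i: {(i - 1) % 5, (i + 1) % 5} for i in range(5)}

# Every possible neighbourhood of the new vertex "v" inside V(C5).
extensions = [
    extend(c5, "v", nbrs)
    for size in range(6)
    for nbrs in combinations(range(5), size)
]
c5_is_in_P = satisfies_P(c5)
some_extension_in_P = any(satisfies_P(g) for g in extensions)
```

Under these assumptions `c5_is_in_P` is true while `some_extension_in_P` is false, matching the claim that $C_{5}$ satisfies ${\cal P}$ but has no one-vertex extension in ${\cal P}$.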

1.2 The combinatorial interpretation of Theorem 1

Let us discuss the combinatorial implications of Theorem 1 and its relation to other results in the area of extremal combinatorics. The famous triangle removal lemma of Ruzsa and Szemerédi [29] states that if a graph $G$ is $\varepsilon$ -far from being triangle free (with respect to the uniform distribution), then a (uniform) sample of $s(\varepsilon)$ vertices from $G$ contains a triangle with probability at least $\frac{2}{3}$ . We refer the reader to [13] for more background on this lemma and its variants. The result of [5] mentioned above, can be thought of as a generalization of this lemma to arbitrary hereditary properties. It can be stated as saying that for every hereditary graph property $\mathcal{P}$ there is a function $s_{\mathcal{P}}:(0,1)\rightarrow\mathbb{N}$ such that the following holds for every $\varepsilon>0$ . If a graph $G$ is $\varepsilon$ -far from $\mathcal{P}$ (with respect to the uniform distribution) then a (uniform) sample of $s_{\mathcal{P}}(\varepsilon)$ vertices from $G$ induces a graph not satisfying $\mathcal{P}$ with probability at least $2/3$ .

To prove (the “if” direction of) Theorem 1, we will actually prove the following combinatorial statement, which can be thought of as a vertex-weighted version of the graph removal lemma.

Theorem 4.

For every hereditary and extendable graph property $\mathcal{P}$ there is a function $s_{\mathcal{P}}:(0,1)\rightarrow\mathbb{N}$ such that the following holds for every $\varepsilon>0$ and for every vertex-weighted graph $(G,\mathcal{D})$ which is $\varepsilon$ -far from $\mathcal{P}$ . Let $u_{1},\dots,u_{s}$ , $s=s_{\mathcal{P}}(\varepsilon)$ , be a sequence of random vertices of $G$ , sampled according to $\mathcal{D}$ and independently. Then $G[\{u_{1},\dots,u_{s}\}]$ does not satisfy $\mathcal{P}$ with probability at least $\frac{2}{3}$ .
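The sampling procedure in Theorem 4 can be sketched as follows: draw $s$ vertices, query all pairs among them, and check whether the induced subgraph satisfies the property. This Python sketch is only an illustration; the function names, the callable-based interface, and the use of triangle-freeness as the property are assumptions, and the sample size `s` is a plain parameter rather than the function $s_{\mathcal{P}}(\varepsilon)$ from the theorem. The cyclic samplers are deterministic stand-ins for the random vertex device, used so that the example is reproducible.

```python
from itertools import combinations, cycle

def sample_and_check(sample_vertex, has_edge, satisfies_property, s):
    """Accept iff the subgraph induced by s sampled vertices satisfies the property.

    sample_vertex: zero-argument callable returning a vertex (drawn from D).
    has_edge: callable (u, v) -> bool answering a single edge query.
    satisfies_property: callable (vertices, edges) -> bool.
    s: number of vertex samples (hypothetical parameter of this sketch).
    """
    sampled = {sample_vertex() for _ in range(s)}  # the set {u_1, ..., u_s}
    edges = {frozenset((u, v)) for u, v in combinations(sampled, 2) if has_edge(u, v)}
    return satisfies_property(sampled, edges)

def triangle_free(vertices, edges):
    """True if no three of the given vertices are pairwise adjacent."""
    return not any(
        all(frozenset(p) in edges for p in combinations(t, 2))
        for t in combinations(vertices, 3)
    )

# A triangle on {0, 1, 2}: every pair of distinct vertices is an edge.
k3_source = cycle([0, 1, 2])
accepts_triangle = sample_and_check(
    lambda: next(k3_source), lambda u, v: u != v, triangle_free, s=6
)

# The 5-cycle, which is triangle-free.
c5_source = cycle(range(5))
accepts_c5 = sample_and_check(
    lambda: next(c5_source), lambda u, v: (u - v) % 5 in (1, 4), triangle_free, s=10
)
```

Because the property is hereditary, an input that satisfies it is accepted for every possible sample, which is the one-sided error behaviour; the theorem is what guarantees rejection with probability at least $\frac{2}{3}$ on inputs that are $\varepsilon$-far.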

The following similar-looking result (footnote 8) was (implicitly) proved by Austin and Tao [7] and Lovász and Szegedy [26].

Footnote 8: We note that the results of [7] and [26] are more general. The authors of [26] actually prove that the conclusion of Theorem 5 holds for all graphons. The authors of [7] prove extensions of Theorem 5 in several directions, including a version for uniform hypergraphs, and a strengthening in which the notion of testability is replaced with the stronger notion of repairability.

Theorem 5 ([7, 26]).

For every hereditary graph property $\mathcal{P}$ there is a function $s_{\mathcal{P}}:(0,1)\rightarrow\mathbb{N}$ such that the following holds for every $\varepsilon>0$ and for every vertex-weighted graph $(G,\mathcal{D})$ which is $\varepsilon$ -far from $\mathcal{P}$ . Let $u_{1},\dots,u_{s}$ , $s=s_{\mathcal{P}}(\varepsilon)$ , be a sequence of random vertices of $G$ , sampled according to $\mathcal{D}$ and independently. Construct a graph $S$ on $[s]$ by letting $\{i,j\}\in E(S)$ if and only if $\{u_{i},u_{j}\}\in E(G)$ . Then $S$ does not satisfy $\mathcal{P}$ with probability at least $\frac{2}{3}$ .

Note that Theorem 5 holds for all hereditary properties, while Theorem 4 only holds for hereditary properties which are extendable. Observe that the graph $S$ in Theorem 5 is a blowup of the graph $G[U]$, where $U=\{u_{1},\dots,u_{s}\}$. Thus, the difference between Theorems 4 and 5 is that Theorem 5 only guarantees that a blowup of $G[U]$ does not satisfy $\mathcal{P}$ w.h.p., while Theorem 4 guarantees the stronger assertion that $G[U]$ itself does not satisfy $\mathcal{P}$ w.h.p. This is an important difference: while Theorem 4 immediately implies the existence of a VDF-tester for every hereditary and extendable property $\mathcal{P}$ (see Subsection 3.3), we do not know of any way of using Theorem 5 to prove the existence of such a tester. One natural candidate for a tester derived from Theorem 5 would be the algorithm which rejects if and only if the graph $S$ (defined in Theorem 5) does not satisfy $\mathcal{P}$. It turns out, however, that this algorithm often fails to be a valid tester (footnote 9).

Footnote 9: For example, if $\mathcal{P}$ is $C_{5}$-freeness then this tester will reject w.h.p. if the input graph is a triangle with uniform vertex distribution (as the graph $S$ will typically contain the 2-blowup of a triangle, and thus contain a copy of $C_{5}$), even though this input graph clearly satisfies $\mathcal{P}$.
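The gap between $G[U]$ and the graph $S$ of Theorem 5 can be seen on a tiny instance. The Python sketch below builds $S$ from one fixed sample sequence drawn from a triangle and searches both $S$ and $G[U]$ for a (not necessarily induced) $5$-cycle. The helper names and the particular sample sequence are assumptions made for this illustration.

```python
from itertools import permutations

def sample_graph_S(sample, has_edge):
    """Graph S on positions 0..s-1: {i, j} is an edge iff {u_i, u_j} is an edge of G."""
    s = len(sample)
    return {
        frozenset((i, j))
        for i in range(s)
        for j in range(i + 1, s)
        if sample[i] != sample[j] and has_edge(sample[i], sample[j])
    }

def contains_cycle(vertices, edges, length):
    """True if the graph has a (not necessarily induced) cycle of the given length."""
    for tup in permutations(vertices, length):
        if all(frozenset((tup[k], tup[(k + 1) % length])) in edges for k in range(length)):
            return True
    return False

def triangle_edge(u, v):
    # G is a triangle on {0, 1, 2}: any two distinct vertices are adjacent.
    return u != v

sample = [0, 1, 2, 0, 1]  # one possible sequence u_1, ..., u_5 with repetitions
S_edges = sample_graph_S(sample, triangle_edge)

U = set(sample)  # the underlying vertex set {0, 1, 2}
GU_edges = {frozenset((u, v)) for u in U for v in U if u < v and triangle_edge(u, v)}

S_has_c5 = contains_cycle(range(len(sample)), S_edges, 5)
GU_has_c5 = contains_cycle(U, GU_edges, 5)
```

In this instance `S_has_c5` is true (the positions $0,1,2,3,4$ in order already form a $5$-cycle in $S$), while `GU_has_c5` is false because $G[U]$ has only three vertices.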

It is worth noting that Theorem 5 can be deduced from the “unweighted” case, i.e. the result of [5], via a simple argument, see Lemma 5.5 and the discussion following it. On the other hand, the proof of Theorem 4 requires several new ideas on top of those used in [5].

1.3 Variants of the VDF model

The proof of the “only if” part of Theorem 1, showing that if $\mathcal{P}$ is either non-extendable or non-hereditary then $\mathcal{P}$ is not testable in the VDF model, relies on allowing the input graph to have only $O(1)$ vertices (where the constant is independent of $\varepsilon$); on excluding $|V(G)|$ from the input fed to the tester; and on having distributions $\mathcal{D}$ that assign to some vertices weight $\Theta(1)$ and to some vertices weight $o(1/|V(G)|)$. This raises the natural question of what happens if we only require the tester to work on sufficiently large graphs; or if the tester receives $|V(G)|$ as part of the input; or if we forbid $\mathcal{D}$ from assigning very low or very high weights (as above). As the following four theorems show, each of these variations has a dramatic effect on the model, since it then allows all hereditary properties to be testable.

We start with the setting in which the input graph is guaranteed to be large enough. In a revised version of [19], Goldreich asked whether every hereditary property $\mathcal{P}$ is testable (in the VDF model) on graphs of order at least $M=M_{\mathcal{P}}$ , for $M$ which is independent of $\varepsilon$ . As we show in Proposition 5.2, this turns out to be false. On the positive side, we show that under the stronger assumption that the input size is at least $M_{\mathcal{P}}(\varepsilon)$ (where $M_{\mathcal{P}}:(0,1)\rightarrow\mathbb{N}$ is a function dependent on $\mathcal{P}$ ), all hereditary properties are testable.

Theorem 6.

Under the promise that $|V(G)|\gg 1$ , every hereditary property is testable with one-sided error in the VDF model.

D. Ron (personal communication) asked what happens if we allow testers to receive $|V(G)|$ (i.e., the number of vertices in the input graph) as part of the input (footnote 10). Our following theorem answers this question.

Footnote 10: We note that in the VDF model as defined in [19], the number of vertices in the input graph is not known to the tester.

Theorem 7.

If testers can receive $|V(G)|$ as part of the input, then every hereditary property is testable with one-sided error in the VDF model.

Finally, we consider settings in which restrictions are posed on the weights that the distribution $\mathcal{D}$ can assign.

Theorem 8.

Under the promise that $\max_{v\in V(G)}{\mathcal{D}}(v)=o(1)$ , every hereditary property is testable with one-sided error in the VDF model.

Theorem 9.

Under the promise that $\min_{v\in V(G)}{\mathcal{D}}(v)=\Omega\left(1/|V(G)|\right)$ , every hereditary property is testable with one-sided error in the VDF model.

We note that the implied constant in the $\Omega$ -notation in Theorem 9 is allowed to depend on $\varepsilon$ . We refer the reader to Section 5 for the precise statements of Theorems 6–9. Let us mention that the proofs of Theorems 6, 7 and 9 rely on reductions to our main result in this paper, Theorem 1. The proof of Theorem 8 proceeds by a reduction to the standard model (i.e. to the result of [5]). As part of this proof, we solve another problem raised in [19].

1.4 Paper overview

The rest of the paper is organized as follows. Section 2 is devoted to proving vertex-weighted analogues of several lemmas that were used in prior works (most notably regularity and counting lemmas, and corollaries thereof). Some more routine parts of these proofs are deferred to the appendix. In Section 3 we prove the “if” direction of Theorem 1 (i.e. Theorem 4). This is by far the most challenging (and interesting) part of this paper. The main step towards proving Theorem 1 is establishing Lemma 3.1, which is the key lemma of this paper. For the reader’s convenience, we give in Subsection 3.1 an overview of the key ideas of the proof. As the proofs in Section 2 are somewhat routine, we encourage readers who are familiar with the regularity method to skip Section 2 (at least on their first read), and go directly to Section 3.

The “only if” direction of Theorem 1 is proved in Section 4. In Section 5 we prove Theorems 6, 7, 8 and 9. We also raise two additional problems related to the VDF model; one is to what extent one can extend the results of Theorems 6–9 beyond hereditary properties, and the other asks if the sample complexity in the VDF model is the same as in the standard model (for properties that are testable in the VDF model), see Subsection 5.3. Along the way we resolve another open problem raised in [19] (see Lemma 5.5). Throughout the paper, when we say that a function is increasing/decreasing we mean weakly increasing/decreasing (i.e. non-decreasing/non-increasing).

2 Preliminary Lemmas

In this section we introduce vertex-weighted analogues of some key tools of the regularity method, most notably Szemerédi’s regularity lemma [30], the strong regularity lemma [1], and the counting lemma, as well as some standard corollaries thereof. We also prove some other auxiliary lemmas needed for the proof of Theorem 1.

We start with two simple lemmas regarding probability distributions (footnote 11) on a finite set. Given a distribution $\mathcal{D}$ on a set $U$ and a subset $W\subseteq U$, we use the notation $\mathcal{D}(W):=\sum_{w\in W}{\mathcal{D}(w)}$, and call $\mathcal{D}(W)$ the weight of $W$. We denote by $\mathcal{D}_{W}$ the distribution $\mathcal{D}$ conditioned on $W$, namely $\mathcal{D}_{W}(w)=\frac{\mathcal{D}(w)}{\mathcal{D}(W)}$ for every $w\in W$.

Footnote 11: Throughout the paper, we will simply write “distribution” to mean “probability distribution”.

Lemma 2.1.

For every set $U$, for every $\eta\in(0,1)$ and for every distribution $\mathcal{D}$ on $U$, there is a partition $\mathcal{P}$ of $U$ into $\lceil 1/\eta\rceil$ parts such that $\sum_{W\in\mathcal{P}}{\sum_{\{x,y\}\in\binom{W}{2}}}{\mathcal{D}(x)\mathcal{D}(y)}\leq\eta$.

Proof. Let $\mathcal{P}$ be a random partition of $U$ into $k:=\lceil 1/\eta\rceil$ parts, where each element is assigned to one of the parts uniformly at random and independently of all other elements. Then for every pair of distinct elements $x,y\in U$ , the probability that $x$ and $y$ belong to the same part is exactly $\frac{1}{k}$ . By linearity of expectation we have

$$\mathbb{E}\left[\sum_{W\in\mathcal{P}}{\sum_{\{x,y\}\in\binom{W}{2}}}{\mathcal{D}(x)\mathcal{D}(y)}\right]=\sum_{\{x,y\}\in\binom{U}{2}}{\mathcal{D}(x)\mathcal{D}(y)}\cdot\frac{1}{k}<\frac{1}{2}\cdot\frac{1}{k}<\eta,$$

so there is a choice of $\mathcal{P}$ with the required property.
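The probabilistic argument above can be tried out numerically. The following is a minimal sketch, not part of the paper: the function names `within_part_weight` and `find_partition` are hypothetical, and the resampling loop is just one simple way to turn the expectation bound into an explicit partition (it terminates with probability 1 because the expectation is below $\eta$).

```python
import math
import random
from fractions import Fraction

def within_part_weight(D, parts):
    """Sum of D(x)*D(y) over unordered pairs {x, y} lying in a common part."""
    total = Fraction(0)
    for part in parts:
        s1 = sum((D[u] for u in part), Fraction(0))       # D(W)
        s2 = sum((D[u] ** 2 for u in part), Fraction(0))  # sum of squared weights
        total += (s1 * s1 - s2) / 2                       # pairs inside W
    return total

def find_partition(D, eta, seed=0):
    """Resample a uniform random partition into k = ceil(1/eta) parts
    until the within-part pair weight is at most eta."""
    rng = random.Random(seed)
    k = math.ceil(1 / eta)
    while True:
        parts = [[] for _ in range(k)]
        for u in D:
            parts[rng.randrange(k)].append(u)  # independent uniform assignment
        if within_part_weight(D, parts) <= eta:
            return parts

# Example: uniform distribution on 6 elements, eta = 1/3 (so k = 3 parts).
D = {u: Fraction(1, 6) for u in range(6)}
parts = find_partition(D, Fraction(1, 3))
```

Exact rational arithmetic (`Fraction`) is used so that the comparison with $\eta$ is not affected by rounding.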

Lemma 2.2.

Let $a>0$ be an integer, let $U$ be a finite set and let $\mathcal{D}$ be a distribution on $U$ such that $\mathcal{D}(u)\leq\frac{1}{2a}$ for every $u\in U$ . Then there is a partition $U=U_{1}\cup\dots\cup U_{a}$ such that $\mathcal{D}(U_{i})\geq\frac{1}{2a}$ for every $1\leq i\leq a$ .

Proof. The proof is by induction on $a$ . The base case $a=1$ is trivial, so we assume from now on that $a\geq 2$ . Let $U_{1}\subseteq U$ be a set of minimal size satisfying $\mathcal{D}(U_{1})\geq\frac{1}{2a}$ . Then $\mathcal{D}(U_{1})\leq\frac{1}{a}$ , because otherwise we could remove an arbitrary element of $U_{1}$ (whose weight by assumption is at most $\frac{1}{2a}$ ) and thus get a proper subset of $U_{1}$ having weight at least $\frac{1}{2a}$ , in contradiction to the minimality of $U_{1}$ . Now set $U^{\prime}:=U\setminus U_{1}$ , noting that $\mathcal{D}(U^{\prime})\geq 1-\frac{1}{a}$ . Then every $u\in U^{\prime}$ satisfies

$$\mathcal{D}_{U^{\prime}}(u)=\frac{\mathcal{D}(u)}{\mathcal{D}(U^{\prime})}\leq\frac{\frac{1}{2a}}{1-\frac{1}{a}}=\frac{1}{2(a-1)}.$$

So by the induction hypothesis for $(U^{\prime},\mathcal{D}_{U^{\prime}})$ , there is a partition $U^{\prime}=U_{2}\cup\dots\cup U_{a}$ such that

$$\mathcal{D}(U_{i})=\mathcal{D}(U^{\prime})\cdot\mathcal{D}_{U^{\prime}}(U_{i})\geq\left(1-\frac{1}{a}\right)\cdot\frac{1}{2(a-1)}=\frac{1}{2a}$$

for every $2\leq i\leq a$ . This completes the proof.
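The induction above unrolls into a simple greedy procedure. The sketch below is an illustration under stated assumptions rather than the paper's construction: the name `balanced_partition` is hypothetical, and instead of a set of minimal size it grows each part element by element until the (conditional) threshold is reached, which yields the same two bounds used in the proof.

```python
from fractions import Fraction

def balanced_partition(D, a):
    """Split the support of D into a parts, each of weight at least 1/(2a).

    D maps elements to Fraction weights summing to 1, each at most 1/(2a).
    """
    assert all(w <= Fraction(1, 2 * a) for w in D.values())
    remaining = list(D)
    total = Fraction(1)          # weight of the elements not yet assigned
    parts = []
    for k in range(a, 1, -1):    # k = number of parts still to be formed
        threshold = total / (2 * k)   # 1/(2k) under the conditional distribution
        part, weight = [], Fraction(0)
        while weight < threshold:     # stops with weight < 2 * threshold
            u = remaining.pop()
            part.append(u)
            weight += D[u]
        parts.append(part)
        total -= weight
    parts.append(remaining)      # the last part takes everything that is left
    return parts

# Example: uniform distribution on 10 elements, a = 3.
D = {u: Fraction(1, 10) for u in range(10)}
parts = balanced_partition(D, 3)
```

Each part stops growing as soon as it reaches the threshold, so its weight stays below twice the threshold, mirroring the bound $\mathcal{D}(U_{1})\leq\frac{1}{a}$ in the proof.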

We consider vertex-weighted graphs, i.e. pairs $(G,\mathcal{D})$ such that $G$ is a graph and $\mathcal{D}$ is a distribution on $V(G)$ . For a set $X\subseteq V(G)$ , the subgraph of $(G,\mathcal{D})$ induced by $X$ is defined to be $(G[X],\mathcal{D}_{X})$ , where $\mathcal{D}_{X}$ is the distribution $\mathcal{D}$ conditioned on $X$ . The weight of an edge/non-edge $\{x,y\}$ (with respect to $\mathcal{D}$ ) is defined as $\mathcal{D}(x)\mathcal{D}(y)$ . For a pair of disjoint sets $X,Y\subseteq V(G)$ with $\mathcal{D}(X),\mathcal{D}(Y)>0$ , the density of $(X,Y)$ is denoted by $d(X,Y)$ and defined to be $d(X,Y)=\frac{1}{\mathcal{D}(X)\mathcal{D}(Y)}\sum_{(x,y)\in E(X,Y)}{\mathcal{D}(x)\mathcal{D}(y)}$ , where $E(X,Y)$ is the set of edges with one endpoint in $X$ and one endpoint in $Y$ . If $\mathcal{D}(X)=0$ or $\mathcal{D}(Y)=0$ then define $d(X,Y)=0$ . A pair of disjoint vertex-sets $(X,Y)$ is called $\varepsilon$ -regular if for every $X^{\prime}\subseteq X$ and $Y^{\prime}\subseteq Y$ with $\mathcal{D}(X^{\prime})\geq\varepsilon\mathcal{D}(X)$ and $\mathcal{D}(Y^{\prime})\geq\varepsilon\mathcal{D}(Y)$ , it holds that $|d(X^{\prime},Y^{\prime})-d(X,Y)|\leq\varepsilon$ . The following lemma describes some basic properties of $\varepsilon$ -regular pairs.

Lemma 2.3.

Let $(G,\mathcal{D})$ be a vertex-weighted graph, and let $X,Y\subseteq V(G)$ be disjoint vertex-sets such that $\mathcal{D}(X),\mathcal{D}(Y)>0$ , and such that the pair $(X,Y)$ is $\varepsilon$ -regular with density $d$ . Then the following holds.

1. For every $\alpha\geq\varepsilon$ and $X^{\prime}\subseteq X$ , $Y^{\prime}\subseteq Y$ with $\mathcal{D}(X^{\prime})\geq\alpha\mathcal{D}(X)$ and $\mathcal{D}(Y^{\prime})\geq\alpha\mathcal{D}(Y)$ , the pair $(X^{\prime},Y^{\prime})$ has density at least $d-\varepsilon$ and at most $d+\varepsilon$ , and is $\varepsilon^{\prime}$ -regular with $\varepsilon^{\prime}=\max\{\varepsilon/\alpha,2\varepsilon\}$ .

2. The set of vertices $x\in X$ which satisfy $|d(x,Y)-d|>\varepsilon$ has weight less than $2\varepsilon\cdot\mathcal{D}(X)$ .
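To make the definitions of weighted density and $\varepsilon$ -regularity concrete, here is a small brute-force sketch. It is only an illustration: the helper names (`weight`, `density`, `is_regular`) and the representation of edges as a set of `frozenset` pairs are assumptions of this example, and the regularity check enumerates all subsets, so it is feasible only for tiny vertex sets.

```python
from fractions import Fraction
from itertools import combinations

def weight(D, S):
    """D(S): total weight of the vertices in S."""
    return sum((D[v] for v in S), Fraction(0))

def density(D, edges, X, Y):
    """d(X, Y): weight of edges between X and Y over D(X)*D(Y); 0 if a side has weight 0."""
    wx, wy = weight(D, X), weight(D, Y)
    if wx == 0 or wy == 0:
        return Fraction(0)
    e = sum((D[x] * D[y] for x in X for y in Y
             if frozenset((x, y)) in edges), Fraction(0))
    return e / (wx * wy)

def nonempty_subsets(S):
    S = list(S)
    for r in range(1, len(S) + 1):
        yield from combinations(S, r)

def is_regular(D, edges, X, Y, eps):
    """Brute-force check of eps-regularity of the pair (X, Y)."""
    d = density(D, edges, X, Y)
    wx, wy = weight(D, X), weight(D, Y)
    for Xp in nonempty_subsets(X):
        if weight(D, Xp) < eps * wx:
            continue
        for Yp in nonempty_subsets(Y):
            if weight(D, Yp) < eps * wy:
                continue
            if abs(density(D, edges, Xp, Yp) - d) > eps:
                return False
    return True

# Example: uniform weights on 4 vertices, X = {0, 1}, Y = {2, 3}.
D = {v: Fraction(1, 4) for v in range(4)}
matching = {frozenset((0, 2)), frozenset((1, 3))}
complete = {frozenset((x, y)) for x in (0, 1) for y in (2, 3)}
```

In this example the perfect matching has density $\frac{1}{2}$ but is not $\frac{1}{4}$ -regular (the sub-pair $(\{0\},\{2\})$ has density $1$ ), whereas the complete bipartite pair is regular for every $\varepsilon$ .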

Proof. Starting with Item 1, let $X^{\prime}\subseteq X$ and $Y^{\prime}\subseteq Y$ be such that $\mathcal{D}(X^{\prime})\geq\alpha\mathcal{D}(X)$ and $\mathcal{D}(Y^{\prime})\geq\alpha\mathcal{D}(Y)$ . Since $\alpha\geq\varepsilon$ , the $\varepsilon$ -regularity of $(X,Y)$ implies that $d-\varepsilon\leq d(X^{\prime},Y^{\prime})\leq d+\varepsilon$ . Now let us show that $(X^{\prime},Y^{\prime})$ is $\varepsilon^{\prime}$ -regular with $\varepsilon^{\prime}=\max\{\varepsilon/\alpha,2\varepsilon\}$ . Let $X^{\prime\prime}\subseteq X^{\prime}$ and $Y^{\prime\prime}\subseteq Y^{\prime}$ be such that $\mathcal{D}(X^{\prime\prime})\geq\varepsilon^{\prime}\mathcal{D}(X^{\prime})$ and $\mathcal{D}(Y^{\prime\prime})\geq\varepsilon^{\prime}\mathcal{D}(Y^{\prime})$ . Then $\mathcal{D}(X^{\prime\prime})\geq\frac{\varepsilon}{\alpha}\mathcal{D}(X^{\prime})\geq\varepsilon\mathcal{D}(X)$ and similarly $\mathcal{D}(Y^{\prime\prime})\geq\varepsilon\mathcal{D}(Y)$ . So by the $\varepsilon$ -regularity of $(X,Y)$ we have $|d(X^{\prime\prime},Y^{\prime\prime})-d(X,Y)|\leq\varepsilon$ and hence $|d(X^{\prime\prime},Y^{\prime\prime})-d(X^{\prime},Y^{\prime})|\leq 2\varepsilon\leq\varepsilon^{\prime}$ , as required.

We now prove Item 2. Let $X^{+}$ (resp. $X^{-}$ ) be the set of all $x\in X$ satisfying $d(x,Y)>d+\varepsilon$ (resp. $d(x,Y)<d-\varepsilon$ ). We have

$$d(X^{+},Y)=\frac{1}{\mathcal{D}(X^{+})}\sum_{x\in X^{+}}{\mathcal{D}(x)\cdot d(x,Y)}>d+\varepsilon.$$

So unless $\mathcal{D}(X^{+})<\varepsilon\mathcal{D}(X)$ , we get a contradiction to the $\varepsilon$ -regularity of $(X,Y)$ . Similarly, we must have $\mathcal{D}(X^{-})<\varepsilon\mathcal{D}(X)$ . The assertion follows.

The following is a vertex-weighted counting lemma.

Lemma 2.4 (Counting lemma for vertex-weighted graphs).

For every integer $h\geq 2$ and $\eta\in(0,1)$ there is $\delta=\delta_{2.4}(h,\eta)$ such that the following holds. Let $H$ be a graph on $[h]$ and let $U_{1},\dots,U_{h}$ be pairwise-disjoint vertex-sets in a vertex-weighted graph $(G,\mathcal{D})$ , such that the following holds.

1. For every $1\leq i<j\leq h$ , if $\{i,j\}\in E(H)$ then $d(U_{i},U_{j})\geq\eta$ , and if $\{i,j\}\notin E(H)$ then $d(U_{i},U_{j})\leq 1-\eta$ .

2. For every $1\leq i<j\leq h$ , the pair $(U_{i},U_{j})$ is $\delta$ -regular.

Let $\mathcal{U}$ be the set of all $(u_{1},\dots,u_{h})\in U_{1}\times\dots\times U_{h}$ such that $u_{1},\dots,u_{h}$ induce a copy of $H$ in which $u_{i}$ plays the role of $i$ for every $1\leq i\leq h$ . Then $\sum_{(u_{1},\dots,u_{h})\in\mathcal{U}}{\prod_{i=1}^{h}{\mathcal{D}(u_{i})}}\geq\delta\prod_{i=1}^{h}{\mathcal{D}(U_{i})}$ .

Proof. If $\mathcal{D}(U_{i})=0$ for some $1\leq i\leq h$ then there is nothing to prove, so suppose that $\mathcal{D}(U_{i})>0$ for every $1\leq i\leq h$ . The proof is by induction on $h$ . The base case $h=2$ trivially holds with $\delta=\delta(2,\eta)=\eta$ . So from now on we assume that $h\geq 3$ , and set

[TABLE]

For each $2\leq i\leq h$ , let $W_{i}$ be the set of all vertices $u_{1}\in U_{1}$ for which $|d(u_{1},U_{i})-d(U_{1},U_{i})|>\delta$ . By Item 2 of Lemma 2.3, we have $\mathcal{D}(W_{i})<2\delta\cdot\mathcal{D}(U_{1})$ . Hence, the set $U^{\prime}_{1}:=U_{1}\setminus\bigcup_{i=2}^{h}{W_{i}}$ satisfies $\mathcal{D}(U^{\prime}_{1})>\mathcal{D}(U_{1})-(h-1)\cdot 2\delta\cdot\mathcal{D}(U_{1})\geq\frac{1}{2}\mathcal{D}(U_{1})$ , where in the last inequality we used our choice of $\delta$ . Now fix any $u_{1}\in U^{\prime}_{1}$ . We define sets $U^{\prime}_{2},\dots,U^{\prime}_{h}$ as follows: for $2\leq i\leq h$ , if $\{1,i\}\in E(H)$ then set $U^{\prime}_{i}=N_{U_{i}}(u_{1})$ , and if $\{1,i\}\notin E(H)$ then set $U^{\prime}_{i}=U_{i}\setminus N_{U_{i}}(u_{1})$ . By using Item 1 and the fact that $u_{1}\in U^{\prime}_{1}$ , we get that $\mathcal{D}(U^{\prime}_{i})\geq(\eta-\delta)\mathcal{D}(U_{i})\geq\frac{\eta}{2}\cdot\mathcal{D}(U_{i})$ for every $2\leq i\leq h$ . By Item 1 of Lemma 2.3, and by Conditions 1-2 of the current lemma, we get that for every $2\leq i<j\leq h$ , the pair $(U^{\prime}_{i},U^{\prime}_{j})$ is $\delta^{\prime}$ -regular with $\delta^{\prime}=2\delta/\eta\leq\delta(h-1,\eta/2)$ , and that if $\{i,j\}\in E(H)$ then $d(U^{\prime}_{i},U^{\prime}_{j})\geq\eta-\delta\geq\eta/2$ and if $\{i,j\}\notin E(H)$ then $d(U^{\prime}_{i},U^{\prime}_{j})\leq 1-\eta+\delta\leq 1-\frac{\eta}{2}$ .

We now see that the sets $U^{\prime}_{2},\dots,U^{\prime}_{h}$ satisfy the requirements of the lemma with respect to the graph $H^{\prime}=H[\{2,\dots,h\}]$ and with $\frac{\eta}{2}$ in place of $\eta$ . Let $\mathcal{U}^{\prime}$ be the set of all $(u_{2},\dots,u_{h})\in U^{\prime}_{2}\times\dots\times U^{\prime}_{h}$ such that $u_{2},\dots,u_{h}$ induce a copy of $H^{\prime}$ with $u_{i}$ playing the role of $i$ for every $2\leq i\leq h$ . By the induction hypothesis, we have

[TABLE]

For every $(u_{2},\dots,u_{h})\in\mathcal{U}^{\prime}$ , the tuple $(u_{1},\dots,u_{h})$ induces a copy of $H$ with $u_{i}$ playing the role of $i$ for every $1\leq i\leq h$ . Hence, for every $(u_{2},\dots,u_{h})\in\mathcal{U}^{\prime}$ we have $(u_{1},\dots,u_{h})\in\mathcal{U}$ (where $\mathcal{U}$ is defined in the statement of the lemma). Since this is true for every $u_{1}\in U^{\prime}_{1}$ , we get that

[TABLE]

as required.

A partition $\mathcal{P}=\{V_{1},\dots,V_{r}\}$ of the vertex-set of a vertex-weighted graph $(G,\mathcal{D})$ is called $\varepsilon$ -regular if the sum of $\mathcal{D}(V_{i})\mathcal{D}(V_{j})$ over all pairs $1\leq i<j\leq r$ for which $(V_{i},V_{j})$ is not $\varepsilon$ -regular, is at most $\varepsilon$ . We now state vertex-weighted versions of Szemerédi’s regularity lemma [30] and of the strong regularity lemma [1]. (We note that a weighted version of Szemerédi’s regularity lemma, where both vertex-weights and edge-weights are allowed, was proved in [14], but only under the assumption that all vertex-weights are $o(1)$ ; hence that lemma is unsuitable in our setting.) The proofs of these lemmas appear in the appendix.

Lemma 2.5 (Szemerédi’s regularity lemma for vertex-weighted graphs).

For every $\varepsilon\in(0,1)$ and $m\geq 0$ there is $T=T_{2.5}(\varepsilon,m)$ such that for every vertex-weighted graph $(G,\mathcal{D})$ and for every partition $\mathcal{P}_{0}$ of $V(G)$ of size not larger than $m$ , there is an $\varepsilon$ -regular partition $\mathcal{P}$ of $V(G)$ which has at most $T$ parts and refines $\mathcal{P}_{0}$ .

Lemma 2.6 (Strong regularity lemma for vertex-weighted graphs).

For every function $\mathcal{E}:\mathbb{N}\rightarrow(0,1)$ and for every integer $m$ , there is $S=S_{2.6}(\mathcal{E},m)$ such that for every vertex-weighted graph $(G,\mathcal{D})$ and for every partition $\mathcal{P}_{0}$ of $V(G)$ of size at most $m$ , there is a refinement $\mathcal{P}$ of $\mathcal{P}_{0}$ , and a refinement $\mathcal{Q}$ of $\mathcal{P}$ , such that the following holds.

1. $|\mathcal{Q}|\leq S$ .

2. The partition $\mathcal{Q}$ is $\mathcal{E}(|\mathcal{P}|)$ -regular.

3. $\sum_{P_{1},P_{2}\in\mathcal{P}}\sum_{Q_{1}\subseteq P_{1},Q_{2}\subseteq P_{2}}{\mathcal{D}(Q_{1})\mathcal{D}(Q_{2})\cdot|d(Q_{1},Q_{2})-d(P_{1},P_{2})|}\leq\mathcal{E}(0)$ . Here the outer sum is over all unordered pairs of distinct $P_{1},P_{2}\in\mathcal{P}$ , and the inner sum is over all $Q_{1},Q_{2}\in\mathcal{Q}$ such that $Q_{i}\subseteq P_{i}$ for $i=1,2$ .

Our last two lemmas are vertex-weighted analogues of well-known corollaries to Szemerédi’s regularity lemma and the strong regularity lemma, respectively. The “unweighted” versions of these corollaries were used in [5] in order to prove that every hereditary property is testable in the standard model.

Lemma 2.7.

For every integer $t\geq 1$ and for every $\delta>0$ there is $\zeta=\zeta_{2.7}(t,\delta)>0$ , such that the following holds. Let $(G,\mathcal{D})$ be a vertex-weighted graph such that every vertex in $G$ has weight less than $\zeta$ . Then there are pairwise-disjoint vertex-sets $Q_{1},\dots,Q_{t}\subseteq V(G)$ with the following properties.

1. $\mathcal{D}(Q_{i})\geq\zeta$ for every $1\leq i\leq t$ .

2. $(Q_{i},Q_{j})$ is $\delta$ -regular for every $1\leq i<j\leq t$ .

3. Either all pairs $(Q_{i},Q_{j})$ have density at least $\frac{1}{2}$ , or all pairs $(Q_{i},Q_{j})$ have density less than $\frac{1}{2}$ .

Proof. Setting $a=4^{t}$ and $\varepsilon=\frac{\delta}{4a^{4}}$ , we will prove the lemma with

[TABLE]

Let $(G,\mathcal{D})$ be a vertex-weighted graph satisfying $\mathcal{D}(v)<\zeta$ for every $v\in V(G)$ . Apply Lemma 2.2 with $U=V(G)$ , with the distribution $\mathcal{D}$ , and with $a$ as defined above. Lemma 2.2 supplies a partition $V(G)=U_{1}\cup\dots\cup U_{a}$ such that $\mathcal{D}(U_{i})\geq\frac{1}{2a}$ for every $1\leq i\leq a$ . Now apply Lemma 2.5 to $(G,\mathcal{D})$ with parameter $\varepsilon$ and with the partition $\mathcal{P}_{0}:=\{U_{1},\dots,U_{a}\}$ , to obtain an $\varepsilon$ -regular partition $\mathcal{P}$ which refines $\mathcal{P}_{0}$ . For each $1\leq i\leq a$ , put $\mathcal{P}_{i}=\{P\in\mathcal{P}:P\subseteq U_{i}\}$ , and sample $P_{i}\in\mathcal{P}_{i}$ with probability proportional to the weight of the parts, i.e. $P_{i}=P$ with probability $\frac{\mathcal{D}(P)}{\mathcal{D}(U_{i})}$ for every $P\in\mathcal{P}_{i}$ . We claim that with positive probability, $\mathcal{D}(P_{i})\geq\zeta$ for every $1\leq i\leq a$ , and all pairs $(P_{i},P_{j})$ are $\delta$ -regular. For every $1\leq i\leq a$ , the probability that $\mathcal{D}(P_{i})<\zeta$ is less than $\frac{\zeta\cdot|\mathcal{P}|}{\mathcal{D}(U_{i})}\leq\frac{\zeta\cdot T_{2.5}(\varepsilon,a)}{1/2a}\leq\frac{1}{2a}$ , where in the first inequality we used the guarantees of Lemma 2.5. By the union bound, with probability at least $\frac{1}{2}$ we have $\mathcal{D}(P_{i})\geq\zeta$ for every $1\leq i\leq a$ . Next, observe that since $\mathcal{P}$ is $\varepsilon$ -regular and as $\varepsilon\leq\delta$ , the probability that $(P_{i},P_{j})$ is not $\delta$ -regular (for some specific $1\leq i<j\leq a$ ) is at most $\frac{\varepsilon}{\mathcal{D}(U_{i})\mathcal{D}(U_{j})}\leq 4a^{2}\varepsilon\leq\frac{1}{a^{2}}$ . So by taking the union bound over all pairs $1\leq i<j\leq a$ , we get that with probability at least $1-\binom{a}{2}\cdot\frac{1}{a^{2}}>\frac{1}{2}$ , all pairs $(P_{i},P_{j})$ are $\delta$ -regular. This proves our assertion.

We thus showed that there is a choice of $P_{1},\dots,P_{a}$ such that $\mathcal{D}(P_{i})\geq\zeta$ for every $1\leq i\leq a$ and such that $(P_{i},P_{j})$ is $\delta$ -regular for every $1\leq i<j\leq a$ . Now consider an auxiliary graph on $[a]$ in which $\{i,j\}$ is an edge if $d(P_{i},P_{j})\geq\frac{1}{2}$ and $\{i,j\}$ is a non-edge if $d(P_{i},P_{j})<\frac{1}{2}$ . As $a=4^{t}$ , a well-known bound on Ramsey numbers implies that this graph contains either a clique or an independent set $\{i_{1},\dots,i_{t}\}$ of size $t$ . Then $Q_{1}=P_{i_{1}},\dots,Q_{t}=P_{i_{t}}$ satisfy the requirements of the lemma.
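The Ramsey step at the end of this proof can be made explicit. The paper only cites the bound; the sketch below uses the standard greedy argument for it (repeatedly pick a vertex and keep the larger of its neighbourhood and non-neighbourhood), and the function name `homogeneous_set` and the `adj` callback are hypothetical.

```python
def homogeneous_set(n, adj, t):
    """Find a clique or independent set of size t in a graph on range(n).

    adj(u, v) is a symmetric adjacency predicate; n >= 4**t is assumed.
    Returns (kind, vertices) with kind in {"clique", "independent"}.
    """
    candidates = list(range(n))
    picked = []  # pairs (vertex, colour); colour says how v relates to all later picks
    while candidates and len(picked) < 2 * t - 1:
        v = candidates.pop()
        nbrs = [u for u in candidates if adj(v, u)]
        non = [u for u in candidates if not adj(v, u)]
        if len(nbrs) >= len(non):
            picked.append((v, True))    # v is adjacent to every remaining candidate
            candidates = nbrs
        else:
            picked.append((v, False))   # v is non-adjacent to every remaining candidate
            candidates = non
    for colour in (True, False):
        group = [v for v, c in picked if c == colour]
        if len(group) >= t:             # pigeonhole among the 2t - 1 picks
            return ("clique" if colour else "independent", group[:t])
    raise ValueError("n is too small for this greedy argument")

# Example: 64 = 4**3 vertices, adjacency given by the parity of the Hamming distance.
adj = lambda u, v: bin(u ^ v).count("1") % 2 == 1
kind, vertices = homogeneous_set(64, adj, 3)
```

In the setting of the proof, `adj(i, j)` would test whether $d(P_{i},P_{j})\geq\frac{1}{2}$ .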

Lemma 2.8.

For every function $\mathcal{E}:\mathbb{N}\rightarrow(0,1)$ and for every integer $m$ , there is $S=S_{2.8}(\mathcal{E},m)>0$ such that for every vertex-weighted graph $(G,\mathcal{D})$ and for every partition $\mathcal{P}_{0}$ of $V(G)$ having size at most $m$ , there is a partition $\mathcal{P}=\{P_{0},P_{1},\dots,P_{r}\}$ of $V(G)$ and vertex-sets $Q_{i}\subseteq P_{i}$ for $1\leq i\leq r$ , such that the following holds:

1. $\mathcal{D}(P_{0})<\mathcal{E}(0)$ .

2. For every $1\leq i\leq r$ , $P_{i}$ is contained in some part of $\mathcal{P}_{0}$ .

3. $\mathcal{D}(Q_{i})\geq 1/S$ for every $1\leq i\leq r$ . In particular, $r\leq S$ .

4. For every $1\leq i<j\leq r$ , the pair $(Q_{i},Q_{j})$ is $\mathcal{E}(r)$ -regular.

5. $\sum_{1\leq i<j\leq r}{\mathcal{D}(P_{i})\mathcal{D}(P_{j})\cdot|d(Q_{i},Q_{j})-d(P_{i},P_{j})|}\leq\mathcal{E}(0)$ .

Proof. We may and will assume $\mathcal{E}$ is monotone decreasing (indeed, we can replace $\mathcal{E}$ with $\mathcal{E}^{\prime}(r)=\min_{s\leq r}{\mathcal{E}(s)}$ , which is clearly monotone decreasing). For convenience, put $\varepsilon=\mathcal{E}(0)$ . Let $\mathcal{E}^{\prime}:\mathbb{N}\rightarrow(0,1)$ be the function $\mathcal{E}^{\prime}(r)=\min\left\{\mathcal{E}(r),\frac{\varepsilon^{2}}{2r^{4}},\frac{\varepsilon}{3}\right\}$ . We will show that one can choose $S=S_{2.8}(\mathcal{E},m):=\frac{3s^{3}}{\varepsilon}$ , where $s:=S_{2.6}(\mathcal{E}^{\prime},m)$ . Apply Lemma 2.6 to $(G,\mathcal{D})$ with parameter $\mathcal{E}^{\prime}$ and with the given partition $\mathcal{P}_{0}$ , to obtain partitions $\mathcal{P}^{\prime}$ and $\mathcal{Q}$ such that $\mathcal{P}^{\prime}$ refines $\mathcal{P}_{0}$ , $\mathcal{Q}$ refines $\mathcal{P}^{\prime}$ , and Items 1-3 in Lemma 2.6 hold. Let $P_{0}$ be the union of all parts of $\mathcal{P}^{\prime}$ of weight less than $\varepsilon/|\mathcal{P}^{\prime}|$ , and let $P_{1},\dots,P_{r}$ be the parts of $\mathcal{P}^{\prime}$ of weight at least $\varepsilon/|\mathcal{P}^{\prime}|$ . Then we have $\mathcal{D}(P_{0})<|\mathcal{P}^{\prime}|\cdot\varepsilon/|\mathcal{P}^{\prime}|=\varepsilon$ , establishing Item 1. Now set $\mathcal{P}=\{P_{0},P_{1},\dots,P_{r}\}$ . It is evident that Item 2 holds.

For each $1\leq i\leq r$ , denote $\mathcal{Q}_{i}=\{Q\in\mathcal{Q}:Q\subseteq P_{i}\}$ , and sample $Q_{i}\in\mathcal{Q}_{i}$ with probability proportional to the weight of the parts; in other words, for each $Q\in\mathcal{Q}_{i}$ , the probability that $Q_{i}=Q$ is $\frac{\mathcal{D}(Q)}{\mathcal{D}(P_{i})}$ . We will show that with positive probability, $Q_{1},\dots,Q_{r}$ satisfy Items 3-5. For each $1\leq i\leq r$ , the probability that $\mathcal{D}(Q_{i})<\frac{\mathcal{D}(P_{i})}{3r|\mathcal{Q}|}$ is less than $|\mathcal{Q}|\cdot\frac{1}{3r|\mathcal{Q}|}=\frac{1}{3r}$ . By the union bound, the probability that there is $1\leq i\leq r$ for which $\mathcal{D}(Q_{i})<\frac{\mathcal{D}(P_{i})}{3r|\mathcal{Q}|}$ is less than $\frac{1}{3}$ . So with probability larger than $\frac{2}{3}$ , for every $1\leq i\leq r$ we have

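$$\mathcal{D}(Q_{i})\geq\frac{\mathcal{D}(P_{i})}{3r|\mathcal{Q}|}\geq\frac{\varepsilon}{3r|\mathcal{P}^{\prime}||\mathcal{Q}|}\geq\frac{\varepsilon}{3s^{3}}=\frac{1}{S},$$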

where the last inequality is due to our choice of $\mathcal{Q}$ via Lemma 2.6.

We now prove that Item 4 holds with probability greater than $\frac{2}{3}$ . Fix any $1\leq i<j\leq r$ . Since $\mathcal{Q}$ is $\varepsilon^{\prime}$ -regular with $\varepsilon^{\prime}=\mathcal{E}^{\prime}(|\mathcal{P}^{\prime}|)\leq\min\left\{\mathcal{E}(|\mathcal{P}^{\prime}|),\frac{\varepsilon^{2}}{2|\mathcal{P}^{\prime}|^{4}}\right\}$ , and since $\mathcal{E}(|\mathcal{P}^{\prime}|)\leq\mathcal{E}(r)$ (by the monotonicity of $\mathcal{E}$ ), the probability that the pair $(Q_{i},Q_{j})$ is not $\mathcal{E}(r)$ -regular is at most $\frac{\varepsilon^{2}/(2|\mathcal{P}^{\prime}|^{4})}{\mathcal{D}(P_{i})\mathcal{D}(P_{j})}\leq\frac{1}{2}|\mathcal{P}^{\prime}|^{-2}\leq\frac{1}{2}r^{-2}$ , where the first inequality holds because $\mathcal{D}(P_{i}),\mathcal{D}(P_{j})\geq\varepsilon/|\mathcal{P}^{\prime}|$ . By the union bound over all pairs $1\leq i<j\leq r$ , the probability that there is $1\leq i<j\leq r$ for which $(Q_{i},Q_{j})$ is not $\mathcal{E}(r)$ -regular is at most $\binom{r}{2}\cdot\frac{1}{2}r^{-2}<\frac{1}{3}$ .

It remains to show that Item 5 holds with probability at least $\frac{2}{3}$ . Observe that

[TABLE]

where in the inequality we used Item 3 of Lemma 2.6, our choice of $\mathcal{E}^{\prime}$ , and the fact that $P_{1},\dots,P_{r}\in\mathcal{P}^{\prime}$ . So by Markov’s inequality, the probability that Item 5 fails is at most $\frac{1}{3}$ , as required.

3 The Main Proof

In this section we prove the “if” direction of Theorem 1. In Subsection 3.1 we give a high-level overview of the main obstacle one needs to overcome in proving Theorem 1, and the main idea behind the way we overcome it. In Subsection 3.2 we state and prove Lemma 3.1, which constitutes the main ingredient in the proof of Theorem 1. Finally, we prove (the “if” direction of) Theorem 1 in Subsection 3.3.

3.1 Proof overview

The main difficulty:

Suppose ${\cal P}$ is an extendable hereditary graph property. We are given a graph $G$ and a distribution ${\cal D}$ so that $G$ is $\varepsilon$ -far from ${\cal P}$ with respect to ${\cal D}$ . Our goal is to show that a sample of $O(1)$ vertices from $G$ finds with high probability (whp) an induced subgraph $F$ of $G$ which does not satisfy ${\cal P}$ . (Throughout this subsection, $\Omega(1)$ and $O(1)$ mean positive quantities that depend only on $\varepsilon$ and not on $n$ or ${\cal D}$ .) There are two ways one can try to tackle this problem. First, one can take a blowup $G^{\prime}$ of $G$ , in which a vertex is replaced by a cluster of vertices whose size is proportional to the vertex’s weight under ${\cal D}$ , and thus (try to) “reduce” the problem to the non-weighted case. While this approach can allow one to handle some properties (indeed, this is the approach used in [19]), it seems that the main bottleneck is that a copy of $F$ in $G^{\prime}$ does not correspond necessarily to a copy of $F$ in $G$ , since $F$ might contain several of the vertices that replaced a vertex of $G$ . Moreover, if this vertex $v$ has weight $\Omega(1)$ then even a sample of size $O(1)$ will very likely contain several of the vertices of $G^{\prime}$ that replaced $v$ .
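The last remark can be quantified with a short calculation. The sketch below is illustrative only: the function name `prob_at_least_two_hits` and the symbols `w` (the weight of the heavy vertex $v$ ) and `q` (the sample size) are assumptions of this example. It computes the exact probability that `q` independent draws from ${\cal D}$ hit $v$ at least twice, which is the event that a sample from the blowup contains several clones of $v$ (ignoring the negligible chance of drawing the same clone twice).

```python
from fractions import Fraction

def prob_at_least_two_hits(w, q):
    """Probability that q independent draws hit a fixed vertex of weight w at least twice."""
    miss_all = (1 - w) ** q                    # v is never drawn
    hit_once = q * w * (1 - w) ** (q - 1)      # v is drawn exactly once
    return 1 - miss_all - hit_once

# Example: a vertex of weight 1/2 and samples of size 2 and 10.
p2 = prob_at_least_two_hits(Fraction(1, 2), 2)
p10 = prob_at_least_two_hits(Fraction(1, 2), 10)
```

For a fixed weight `w` this probability tends to $1$ as `q` grows, so a constant-size sample already collides on $v$ with probability close to $1$ .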

A second approach would be to just reprove the result of [5], while replacing the regularity lemmas used there with regularity lemmas for vertex-weighted graphs. While such lemmas are indeed not hard to prove (see e.g. Lemmas 2.4-2.8), the main problem is again vertices of high weight. Now the issue is that clusters of the regular partition might contain only a single vertex of high weight, a situation in which one would not be able to embed graphs $F$ that need to use more than one vertex from the same cluster.

The key new idea:

The main idea is then to prove a lemma that allows one to partition $G$ into three sets $X,Y,Z$ with the following properties: $(i)$ $Z$ will have total weight at most $\varepsilon/2$ , $(ii)$ all vertices in $X$ will have weight at least $\Omega(1)$ , $(iii)$ $Y$ will have a highly regular Szemerédi partition, that is, there will be a partition of the vertices of $Y$ into sets $P_{1},\ldots,P_{r}$ so that the bipartite graphs between all pairs $(P_{i},P_{j})$ are pseudo-random (or regular in the sense of the regularity lemma), $(iv)$ each of the clusters $P_{i}$ will have “enough” vertices, and $(v)$ for each $x\in X$ and set $P_{i}$ , either $x$ will be connected to all vertices of $P_{i}$ or to none of them. We will now see how a partition with properties $(i)$ – $(v)$ can allow one to test $\mathcal{P}$ . Let us note that the actual structure we will use is much more complicated than is described in the above five properties (cf. Lemma 3.1), and that in the present discussion we intentionally oversimplify some technical aspects in order to highlight our main new idea. For example, we will not actually be able to guarantee that all pairs $(P_{i},P_{j})$ are pseudo-random (or that the measure of pseudo-randomness of these pairs is sufficient for our purposes); instead, as is common in this type of proofs, we will have “representative sets” $Q_{i}\subseteq P_{i}$ such that all pairs $(Q_{i},Q_{j})$ are pseudo-random and most have roughly the same density as $(P_{i},P_{j})$ .

We first claim that $G[X\cup Y]$ (i.e. the graph induced by $X\cup Y$ ) is $\varepsilon/2$ -far from satisfying ${\cal P}$ . Indeed, if this is not the case, then we can first turn the graph induced by these sets into a graph satisfying ${\cal P}$ by making changes of total weight less than $\varepsilon/2$ , and then use the fact that ${\cal P}$ is extendable and the fact that the total weight of $Z$ is at most $\varepsilon/2$ in order to reconnect the vertices of $Z$ to $X\cup Y$ (and amongst themselves) so that the resulting graph will be in ${\cal P}$ . The total weight of edges we thus change is less than $\varepsilon$ , a contradiction.

We now examine the partition $P_{1},\ldots,P_{r}$ of $Y$ and perform a “cleaning” procedure analogous to the one performed in applications of the regularity lemma. By this we mean that we make (only!) within $Y$ changes of total weight less than $\varepsilon/2$ so that if after these changes the set $Y$ contains an induced copy of some (bounded-size) graph $F$ , then in the original graph, a sample of $O(1)$ vertices from $Y$ finds one such copy with high probability (whp). Here we will also rely on property $(iv)$ of the partition. The fact that $G[X\cup Y]$ is $\varepsilon/2$ -far from satisfying ${\cal P}$ and that we made changes of total weight less than $\varepsilon/2$ when cleaning $Y$ , means that $G[X\cup Y]$ (after the cleaning) indeed has an induced copy of a graph $F$ that does not satisfy ${\cal P}$ . We now claim that a sample of size $O(1)$ from $G$ (before the cleaning) finds a copy of $F$ whp. First, since the total weight of $Z$ is small, then sampling from $G$ is (effectively) like sampling from $G[X\cup Y]$ . Let now $F_{X}$ (resp. $F_{Y}$ ) be the subgraph of $F$ induced by $X$ (resp. $Y$ ). By the above discussion, a sample of size $O(1)$ finds a copy of $F_{Y}$ whp. Now, and this is the first crucial point, property $(v)$ mentioned above guarantees that the vertices of $X$ which form the copy of $F_{X}$ , form a copy of $F$ with every set of vertices in $Y$ which forms a copy of $F_{Y}$ . Now, and this is the second crucial point, property $(ii)$ above guarantees that a sample of $O(1)$ vertices finds the copy of $F_{X}$ contained in $X$ whp. (By “the” we mean that $X$ might contain only a single copy of $F_{X}$ , but this copy has to be of weight $\Omega(1)$ . This is in sharp contrast to the situation within $Y$ , where each copy of $F_{Y}$ might have very small weight, but the total weight of such copies must be $\Omega(1)$ .) Altogether, the algorithm finds an induced copy of $F$ using $O(1)$ queries.
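The second crucial point rests on a routine union bound, sketched here for concreteness; it is not taken from the paper. The symbols are assumptions of this sketch: $h$ is the number of vertices of $F_{X}$ , $c>0$ is a lower bound on the weight of every vertex of $X$ (the $\Omega(1)$ of property $(ii)$ ), and $q$ is the number of independent samples drawn from ${\cal D}$ .

```latex
\Pr\bigl[\text{some vertex of the copy of } F_{X} \text{ is not sampled}\bigr]
  \;\leq\; h\,(1-c)^{q}
  \;\leq\; h\,e^{-cq}
  \;\leq\; \frac{1}{3}
  \qquad\text{whenever}\qquad q \;\geq\; \frac{\ln(3h)}{c}.
```

Since $h$ and $c$ depend only on $\varepsilon$ , so does this bound on $q$ .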

The new regularity lemma:

As it turns out, one cannot hope to partition $G$ as described in the first paragraph above, and instead we will have to define a partition with a much more complicated set of features. This is stated in Lemma 3.1 in the next subsection. One of the main difficulties is making sure that parts $P_{i}$ of the partition of $Y$ will not contain only few (or even a single) vertices of high weight (i.e. we want to guarantee property $(iv)$ stated above). This is done by making sure that the weight of the vertices in $Y$ is very small compared to the weight of the parts $P_{1},\dots,P_{r}$ . This in itself is challenging, because at the same time we need to have many parts $P_{i}$ in order to satisfy property $(v)$ above. The proof of Lemma 3.1 will use some of the lemmas of Section 2, most notably Lemma 2.8, which we will need to iterate (at least implicitly) in order to find the sought-after partition in the statement of Lemma 3.1.

3.2 The Key Lemma

In this subsection we state and prove Lemma 3.1, which is the main ingredient in the proof of the “if” direction of Theorem 1.

Lemma 3.1.

For every function $\Psi:\mathbb{N}\rightarrow\mathbb{N}$ and $\varepsilon>0$ there is $S=S_{3.1}(\Psi,\varepsilon)>0$ such that for every vertex-weighted graph $(G,\mathcal{D})$ there is a partition $V(G)=X\cup Y\cup Z$ , a partition $\mathcal{P}=\{P_{1},\dots,P_{r}\}$ of $Y$ , vertex-sets $Q_{i}\subseteq P_{i}$ , and pairwise-disjoint vertex-sets $Q_{i,1},\dots,Q_{i,t}\subseteq Q_{i}$ , where $t=\Psi(|X|+r)$ , such that the following holds:

1. $\mathcal{D}(Z)<\varepsilon$ .

2. Every vertex in $X$ has weight at least $1/S$ .

3. For every $x\in X$ and for every $1\leq i\leq r$ , either $x$ is adjacent to all vertices of $P_{i}$ , or to none of the vertices of $P_{i}$ .

4. $\sum_{1\leq i\leq r}\sum_{\{x,y\}\in\binom{P_{i}}{2}}{\mathcal{D}(x)\mathcal{D}(y)}\leq\varepsilon$ .

5. $\sum_{1\leq i<j\leq r}{{\mathcal{D}(P_{i})\mathcal{D}(P_{j})\cdot|d(Q_{i},Q_{j})-d(P_{i},P_{j})|}}\leq\varepsilon$ .

6. For every $1\leq i\leq r$ , all pairs $(Q_{i,k},Q_{i,\ell})$ are $\frac{1}{\Psi(|X|+r)}$ -regular, and either all pairs $(Q_{i,k},Q_{i,\ell})$ have density at least $\frac{1}{2}$ , or all pairs $(Q_{i,k},Q_{i,\ell})$ have density less than $\frac{1}{2}$ .

7. For every $1\leq i<j\leq r$ and $1\leq k,\ell\leq t$ , the pair $(Q_{i,k},Q_{j,{\ell}})$ is $\frac{1}{\Psi(|X|+r)}$ -regular and $|d(Q_{i,k},Q_{j,\ell})-d(Q_{i},Q_{j})|\leq\frac{1}{\Psi(|X|+r)}$ .

8. For every $1\leq i\leq r$ and $1\leq k\leq t$ , $\mathcal{D}(Q_{i,k})\geq 1/S$ .

Note that Items 2 and 8 in Lemma 3.1 together imply that $|X|+rt\leq S$ . The following lemma constitutes the main part of the proof of Lemma 3.1. After proving Lemma 3.2, we deduce Lemma 3.1 from Lemmas 3.2 and 2.7.

Lemma 3.2.

For every function $\Psi:\mathbb{N}\rightarrow\mathbb{N}$ and $\varepsilon>0$ there is $S=S_{3.2}(\Psi,\varepsilon)>0$ such that for every vertex-weighted graph $(G,\mathcal{D})$ there is a partition $V(G)=X\cup Y\cup Z$ , a partition $\mathcal{P}=\{P_{1},\dots,P_{r}\}$ of $Y$ and vertex-sets $Q_{i}\subseteq P_{i}$ (for $1\leq i\leq r$ ) such that Items 1-5 in Lemma 3.1 hold (with respect to $S=S_{3.2}(\Psi,\varepsilon)$ ), and such that the following two conditions are satisfied.

(a) For every $1\leq i<j\leq r$ , the pair $(Q_{i},Q_{j})$ is $\frac{1}{\Psi(|X|+r)}$ -regular.

(b) For every $1\leq i\leq r$ the following holds: $\mathcal{D}(Q_{i})\geq 1/S$ , and all vertices in $Q_{i}$ have weight less than $\frac{1}{\Psi(|X|+r)}\cdot\mathcal{D}(Q_{i})$ .

Proof. We may and will assume that the function $\Psi$ is monotone increasing (to guarantee this, we can simply replace $\Psi$ with the function $\Psi^{\prime}(s):=\max\{\Psi(0),\dots,\Psi(s)\}$ ), and that the function $S_{2.8}(\mathcal{E},m)$ , whose existence is guaranteed by Lemma 2.8, is monotone decreasing in $\mathcal{E}$ and monotone increasing in $m$ . Here, being monotone decreasing in $\mathcal{E}$ means that if a pair of functions $\mathcal{E}_{1},\mathcal{E}_{2}:\mathbb{N}\rightarrow(0,1)$ satisfy $\mathcal{E}_{1}(r)\leq\mathcal{E}_{2}(r)$ for every $r\in\mathbb{N}$ , then $S_{2.8}(\mathcal{E}_{1},m)\geq S_{2.8}(\mathcal{E}_{2},m)$ for every $m$ . For each $s\in\mathbb{N}$ , define the function $\mathcal{E}_{s}:\mathbb{N}\rightarrow(0,1)$ by

[TABLE]

Now define the functions $S^{\prime},S^{\prime\prime}:\mathbb{N}\rightarrow\mathbb{N}$ by setting:

[TABLE]

Note that $S^{\prime\prime}(s)\geq s$ for every $s\in\mathbb{N}$ , and that $S^{\prime}$ and $S^{\prime\prime}$ are monotone increasing. We define a monotone increasing sequence $s_{1},s_{2},\dots$ as follows: $s_{1}=1$ , and for each $i\geq 2$ , $s_{i}=S^{\prime\prime}(s_{i-1})$ . We will show that the lemma holds with

[TABLE]

Let $(G,\mathcal{D})$ be a vertex-weighted graph. We iteratively define a sequence of pairwise-disjoint vertex-sets ${X_{1},X_{2},\dots}\subseteq V(G)$ as follows: let $X_{1}$ be the set of all vertices of $G$ of weight at least $1/s_{1}$ ; for each $i\geq 2$ , let $X_{i}$ be the set of all vertices in $V(G)\setminus(X_{1}\cup\dots\cup X_{i-1})$ having weight at least $1/s_{i}$ . Since $X_{1},X_{2},\dots$ are pairwise-disjoint, there must be $1\leq i\leq\lceil 2/\varepsilon\rceil$ for which $\mathcal{D}(X_{i})\leq\varepsilon/2$ . We now set $Z^{\prime}=X_{i}$ , $X=X_{1}\cup\dots\cup X_{i-1}$ and $Y^{\prime}=V(G)\setminus(X\cup Z^{\prime})=V(G)\setminus(X_{1}\cup\dots\cup X_{i})$ . Note that $\mathcal{D}(Z^{\prime})\leq\varepsilon/2$ . Setting $s:=s_{i-1}\leq s_{\lceil 2/\varepsilon\rceil-1}\leq S$ , note that every vertex in $X$ has weight at least $\frac{1}{s}$ (so in particular $|X|\leq s$ ), while every vertex in $Y^{\prime}$ has weight less than $\frac{1}{s_{i}}=\frac{1}{S^{\prime\prime}(s)}$ .
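The layered construction of $X$ , $Z^{\prime}$ and $Y^{\prime}$ can be sketched in a few lines. This is an illustration under stated assumptions: the function name `split_by_thresholds` is hypothetical, and `s_seq` is an arbitrary increasing list of integers standing in for $s_{1},s_{2},\dots$ (the actual sequence is defined through $S^{\prime\prime}$ , which is not computed here).

```python
from fractions import Fraction

def split_by_thresholds(D, s_seq, eps):
    """Return (X, Z_prime, Y_prime) for vertex weights D.

    Layer i collects the not-yet-used vertices of weight at least 1/s_seq[i];
    the first layer of weight at most eps/2 becomes Z_prime.
    """
    used = set()
    layers = []
    for s in s_seq:
        layer = [v for v in D if v not in used and D[v] >= Fraction(1, s)]
        if sum((D[v] for v in layer), Fraction(0)) <= eps / 2:
            X = [v for lay in layers for v in lay]            # all earlier layers
            Y_prime = [v for v in D if v not in used and v not in layer]
            return X, layer, Y_prime
        layers.append(layer)
        used.update(layer)
    # Disjoint layers of weight > eps/2 cannot number more than 2/eps.
    raise ValueError("s_seq too short: at least ceil(2/eps) thresholds are needed")

# Example: one heavy vertex and six light ones, eps = 1/2.
D = {0: Fraction(1, 2), 1: Fraction(1, 8), 2: Fraction(1, 8),
     3: Fraction(1, 16), 4: Fraction(1, 16), 5: Fraction(1, 16), 6: Fraction(1, 16)}
X, Z_prime, Y_prime = split_by_thresholds(D, [2, 4, 8, 16], Fraction(1, 2))
```

In the example the first layer is $\{0\}$ (weight $\frac{1}{2}>\frac{\varepsilon}{2}$ ), and the second layer is empty, so the procedure stops with $X=\{0\}$ , $Z^{\prime}=\emptyset$ and all remaining vertices in $Y^{\prime}$ .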

If $\mathcal{D}(Y^{\prime})<\frac{\varepsilon}{2}$ then $\mathcal{D}(Y^{\prime}\cup Z^{\prime})<\varepsilon$ , so the assertion of the lemma holds for $Y=\emptyset$ and $Z=Z^{\prime}\cup Y^{\prime}$ , and we are done. So we may and will assume from now on that $\mathcal{D}(Y^{\prime})\geq\frac{\varepsilon}{2}$ . Let $\mathcal{P}^{\prime}_{0}$ be a partition of $Y^{\prime}$ into $\lceil 1/\varepsilon\rceil$ parts such that $\sum_{P\in\mathcal{P}^{\prime}_{0}}{\sum_{\{x,y\}\in\binom{P}{2}}{\mathcal{D}(x)\mathcal{D}(y)}}\leq\varepsilon$ , as guaranteed by Lemma 2.1. For every $x\in X$ , consider the partition $\mathcal{P}_{x}:=\{N_{Y^{\prime}}(x),Y^{\prime}\setminus N_{Y^{\prime}}(x)\}$ of $Y^{\prime}$ . Let $\mathcal{P}_{0}$ be the common refinement of the partitions $\mathcal{P}^{\prime}_{0}$ and $(\mathcal{P}_{x})_{x\in X}$ . Then for every $x\in X$ and $P\in\mathcal{P}_{0}$ , either $x$ is adjacent to every vertex of $P$ , or $x$ is not adjacent to any vertex of $P$ . Moreover, we have $|\mathcal{P}_{0}|\leq 2^{|X|}\cdot\lceil 1/\varepsilon\rceil\leq 2^{s}\cdot\lceil 1/\varepsilon\rceil$ .
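The common refinement $\mathcal{P}_{0}$ used above amounts to grouping the vertices of $Y^{\prime}$ by their part in $\mathcal{P}^{\prime}_{0}$ together with their adjacency pattern to $X$ . A minimal sketch, with a hypothetical function name `refine_by_neighbourhoods` and an adjacency predicate `adjacent` supplied by the caller:

```python
def refine_by_neighbourhoods(parts, X, adjacent):
    """Common refinement of `parts` with the partitions {N(x), complement} for x in X.

    Two vertices stay together iff they lie in the same original part and
    have the same adjacency pattern to the vertices of X.
    """
    groups = {}
    for idx, part in enumerate(parts):
        for y in part:
            key = (idx, tuple(adjacent(x, y) for x in X))
            groups.setdefault(key, []).append(y)
    return list(groups.values())

# Example: X = {0, 1}, Y' = {2, 3, 4, 5} split into two parts.
edges = {(0, 2), (0, 3), (1, 4)}
adjacent = lambda x, y: (x, y) in edges
refined = refine_by_neighbourhoods([[2, 3], [4, 5]], [0, 1], adjacent)
```

Each original part splits into at most $2^{|X|}$ pieces, which gives the bound $|\mathcal{P}_{0}|\leq 2^{|X|}\cdot\lceil 1/\varepsilon\rceil$ stated above.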

Now apply Lemma 2.8 to $(G[Y^{\prime}],\mathcal{D}_{Y^{\prime}})$ with parameters $\mathcal{E}_{s}$ and $m=2^{s}\cdot\lceil 1/\varepsilon\rceil$ , and with the partition $\mathcal{P}_{0}$ (noting that $|\mathcal{P}_{0}|\leq m$ ), to obtain a partition $\mathcal{P}=\{P_{0},P_{1},\dots,P_{r}\}$ of $Y^{\prime}$ and vertex-sets $Q_{i}\subseteq P_{i}$ (for $1\leq i\leq r$ ), with the properties stated in that lemma. Note that in particular we have

[TABLE]

Set $Z=Z^{\prime}\cup P_{0}$ and $Y=Y^{\prime}\setminus P_{0}$ , noting that $\mathcal{D}(P_{0})<\mathcal{E}_{s}(0)\leq\frac{\varepsilon}{2}$ , and hence $\mathcal{D}(Z)=\mathcal{D}(Z^{\prime})+\mathcal{D}(P_{0})<\varepsilon$ , as required by Item 1 in Lemma 3.1. Items 3 and 4 in Lemma 3.1 hold because each of the sets $P_{1},\dots,P_{r}$ is contained in some part of $\mathcal{P}_{0}$ , and hence also in some part of $\mathcal{P}^{\prime}_{0}$ . Item 2 of Lemma 3.1 was already verified above, and Item 5 of Lemma 3.1 is guaranteed by Lemma 2.8. Item (a) holds because Lemma 2.8 guarantees that all pairs $(Q_{i},Q_{j})$ are $\mathcal{E}_{s}(r)$ -regular, and because $\mathcal{E}_{s}(r)\leq\frac{1}{\Psi(s+r)}\leq\frac{1}{\Psi(|X|+r)}$ (here we used our choice of $\mathcal{E}_{s}$ , the fact that $|X|\leq s$ , and the monotonicity of $\Psi$ ). It remains to prove Item (b). For each $1\leq i\leq r$ , we have

[TABLE]

where in the second inequality we used the guarantees of Lemma 2.8, and later we used our choice of $S^{\prime}$ and $S^{\prime\prime}$ , the monotonicity of $S^{\prime\prime}$ , and the fact that $s\leq s_{\lceil 2/\varepsilon\rceil-1}$ . Next, fix $1\leq i\leq r$ and recall that all vertices in $Q_{i}\subseteq Y\subseteq Y^{\prime}$ have weight less than

[TABLE]

where in the first inequality we used our choice of $S^{\prime\prime}$ , in the last two inequalities we used the monotonicity of $\Psi$ , and in the second inequality we also used (1) and an intermediate step in (2). This shows that $\mathcal{D}(u)<\frac{1}{\Psi(|X|+r)}\cdot\mathcal{D}(Q_{i})$ for every $1\leq i\leq r$ and $u\in Q_{i}$ , as required.

[Proof of Lemma 3.1] Define the functions

[TABLE]

and

[TABLE]

We may and will assume that the function $\zeta_{2.7}(t,\delta)$ is monotone decreasing in $t$ and monotone increasing in $\delta$ . This assumption implies that the function $\zeta$ defined above is monotone decreasing. We prove the lemma with

[TABLE]

Let $(G,\mathcal{D})$ be a vertex-weighted graph. Apply Lemma 3.2 to $(G,\mathcal{D})$ with parameters $\Psi^{\prime}$ and $\varepsilon$ , to obtain a partition $V(G)=X\cup Y\cup Z$ , a partition $\mathcal{P}=\{P_{1},\dots,P_{r}\}$ of $Y$ , and subsets $Q_{i}\subseteq P_{i}$ (for $1\leq i\leq r$ ) such that Items 1-5 of Lemma 3.1 hold (with respect to $S_{3.2}(\Psi^{\prime},\varepsilon)$ ), and so do Items (a) and (b) of Lemma 3.2.

Let us now prove that Items 6-8 (in Lemma 3.1) hold. It will be convenient to put $m:=|X|+r$ . By Item (b) in Lemma 3.2 and by our choice of $\Psi^{\prime}$ , we have

[TABLE]

for every $1\leq i\leq r$ and $u\in Q_{i}$ . Recalling our choice of $\zeta$ , we see that Lemma 2.7 is applicable to $(G[Q_{i}],\mathcal{D}_{Q_{i}})$ with parameters $t=\Psi(m)=\Psi(|X|+r)$ and $\delta=\frac{1}{\Psi(m)}=\frac{1}{\Psi(|X|+r)}$ . Applying Lemma 2.7 with this input, we obtain pairwise-disjoint vertex-sets $Q_{i,1},\dots,Q_{i,t}\subseteq Q_{i}$ satisfying the properties stated in that lemma. The guarantees of Lemma 2.7 immediately establish Item 6, and also imply that for every $1\leq k\leq t$ we have

[TABLE]

where in the second and third inequalities we used the fact that $|X|+r,\frac{1}{\mathcal{D}(Q_{i})}\leq S_{3.2}(\Psi^{\prime},\varepsilon)$ , as guaranteed by Item 2 of Lemma 3.1 and Item (b) of Lemma 3.2; in the third inequality we also used the monotonicity of $\zeta$ . This establishes Item 8. It remains to prove Item 7. By Item (a) of Lemma 3.2, the pair $(Q_{i},Q_{j})$ is $\frac{1}{\Psi^{\prime}(m)}$ -regular for every $1\leq i<j\leq r$ . Fix any $1\leq k,\ell\leq t$ . Recalling that $\frac{1}{\Psi^{\prime}(m)}=\frac{\zeta(m)}{2\Psi(m)}$ and that $\mathcal{D}(Q_{i,k})\geq\zeta(m)\cdot\mathcal{D}(Q_{i}),\,\mathcal{D}(Q_{j,\ell})\geq\zeta(m)\cdot\mathcal{D}(Q_{j})$ , we apply Item 1 of Lemma 2.3 to $Q_{i},Q_{j},Q_{i,k},Q_{j,\ell}$ with parameter $\alpha=\zeta(m)$ , to conclude that $|d(Q_{i,k},Q_{j,\ell})-d(Q_{i},Q_{j})|\leq\frac{1}{\Psi^{\prime}(m)}\leq\frac{1}{\Psi(m)}=\frac{1}{\Psi(|X|+r)}$ , and that the pair $(Q_{i,k},Q_{j,\ell})$ is $\frac{1}{\Psi(|X|+r)}$ -regular, as required.

3.3 Proof of the Main Result

In this subsection we prove (the “if” direction of) Theorem 1. For a hereditary and extendable graph property $\mathcal{P}$ , our tester for $\mathcal{P}$ will work as follows: given an input $(G,\mathcal{D})$ and a proximity parameter $\varepsilon$ , the tester samples a sequence of vertices $u_{1},\dots,u_{s}\in V(G)$ independently and with distribution $\mathcal{D}$ , where $s=s_{\mathcal{P}}(\varepsilon)$ is as in Theorem 4; the tester then accepts if and only if $G[\{u_{1},\dots,u_{s}\}]$ satisfies $\mathcal{P}$ . Since $\mathcal{P}$ is hereditary, this tester accepts with probability $1$ if the input graph satisfies $\mathcal{P}$ . In the other direction, Theorem 4 immediately implies that if the input $(G,\mathcal{D})$ is $\varepsilon$ -far from $\mathcal{P}$ then the tester rejects with probability at least $\frac{2}{3}$ . So we see that the “if” direction of Theorem 1 follows from Theorem 4.
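The tester described above is a plain sample-and-check procedure. A minimal sketch follows, under several assumptions: the property is supplied as a predicate `satisfies`, the sample size `s` is passed in directly (the value $s_{\mathcal{P}}(\varepsilon)$ from Theorem 4 is not computed here), and `triangle_free` is included only as an example of a hereditary and extendable predicate. All function names are hypothetical.

```python
import random
from itertools import combinations

def triangle_free(vertices, edges):
    """Example hereditary predicate; `edges` is a set of ordered pairs."""
    adj = lambda u, v: (u, v) in edges or (v, u) in edges
    return not any(adj(a, b) and adj(b, c) and adj(a, c)
                   for a, b, c in combinations(vertices, 3))

def sample_and_check_tester(vertices, weights, has_edge, satisfies, s, rng=random):
    # Draw s vertices independently according to D (repetitions allowed).
    sample = rng.choices(vertices, weights=weights, k=s)
    distinct = list(dict.fromkeys(sample))  # the vertex set {u_1, ..., u_s}
    # Query only pairs of sampled vertices.
    edges = {(u, v) for u, v in combinations(distinct, 2) if has_edge(u, v)}
    # Accept if and only if the induced subgraph satisfies the property.
    return satisfies(distinct, edges)
```

Because the predicate is hereditary, an input that satisfies it is accepted on every run, which is the one-sided error guarantee; the rejection probability for far inputs depends entirely on choosing `s` as in Theorem 4.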

From now on our goal is to prove Theorem 4. We start by introducing variants of some definitions from [5]. An embedding scheme is a complete graph $K$ with a vertex partition $A_{K}\cup B_{K}$ , such that every vertex in $B_{K}$ is colored black or white, every edge with an endpoint in $A_{K}$ is colored black or white, and every edge contained in $B_{K}$ is colored black, white or grey. Note that one of $A_{K},B_{K}$ may be empty; that the vertices of $A_{K}$ are not colored; and that the edges with at least one endpoint in $A_{K}$ cannot be colored grey. An embedding from a graph $F$ to an embedding scheme $K$ is a map $\varphi:V(F)\rightarrow V(K)$ such that the following holds:

1. For every $a\in A_{K}$ we have $|\varphi^{-1}(a)|\leq 1$ .

2. For every $b\in B_{K}$ , if $b$ is colored black then $\varphi^{-1}(b)$ induces a complete graph, and if $b$ is colored white then $\varphi^{-1}(b)$ induces an empty graph.

3. For every $\{x,y\}\in\binom{V(K)}{2}$ , if $\{x,y\}$ is colored black then the bipartite graph between $\varphi^{-1}(x)$ and $\varphi^{-1}(y)$ is complete, and if $\{x,y\}$ is colored white then the bipartite graph between $\varphi^{-1}(x)$ and $\varphi^{-1}(y)$ is empty (note that there are no restrictions in the case that $\{x,y\}$ is colored grey).
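The three conditions can be checked pair by pair. The sketch below assumes a particular (hypothetical) encoding of an embedding scheme: `A` is the set $A_{K}$, `vertex_color` maps each vertex of $B_{K}$ to `"black"` or `"white"`, and `edge_color` maps each unordered pair of scheme vertices (as a `frozenset`) to `"black"`, `"white"` or `"grey"`.

```python
from itertools import combinations

def is_embedding(F_vertices, F_edges, phi, A, vertex_color, edge_color):
    """F_edges: set of frozensets {u, v}; phi: dict from V(F) to V(K)."""
    adjacent = lambda u, v: frozenset((u, v)) in F_edges
    for u, v in combinations(F_vertices, 2):
        x, y = phi[u], phi[v]
        if x == y:
            if x in A:
                return False            # Condition 1: at most one preimage per a in A_K
            want = vertex_color[x]      # Condition 2: clique or independent set
        else:
            want = edge_color[frozenset((x, y))]  # Condition 3
        if want == "black" and not adjacent(u, v):
            return False
        if want == "white" and adjacent(u, v):
            return False
        # "grey" imposes no restriction
    return True
```

For example, the path $u - v - w$ embeds into the scheme with $A_{K}=\{a\}$, a white vertex $b$ and a black edge $\{a,b\}$ by sending $v$ to $a$ and both $u,w$ to $b$.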

Note that Condition 3 implies that for every $a\in A_{K}$ and $x\in V(K)\setminus\{a\}$ , the bipartite graph between $\varphi^{-1}(a)$ and $\varphi^{-1}(x)$ is either complete or empty. We use the notation $F\rightarrow K$ to mean that there is an embedding from $F$ to $K$ . For a graph-family $\mathcal{F}$ and an integer $m$ , let $\mathcal{F}_{m}$ be the family of all embedding schemes $K$ on at most $m$ vertices, such that there is an embedding from some $F\in\mathcal{F}$ to $K$ . We now introduce a variant of the function $\Psi_{\mathcal{F}}$ defined in [5].

Definition 3.3.

For a graph-family $\mathcal{F}$ and an integer $m$ for which $\mathcal{F}_{m}\neq\emptyset$ , define

[TABLE]

If $\mathcal{F}_{m}=\emptyset$ then define $\Psi_{\mathcal{F}}(m)=0$ .

We are now ready to prove Theorem 4 (and thus also the “if” direction of Theorem 1).

[Proof of Theorem 4] Let $\mathcal{P}$ be a hereditary and extendable graph property. Let $\mathcal{F}=\mathcal{F}(\mathcal{P})$ be the family of graphs which do not satisfy $\mathcal{P}$ . Fix $\varepsilon\in(0,1)$ , and let $\Psi:\mathbb{N}\rightarrow\mathbb{N}$ be the function

[TABLE]

where $\Psi_{\mathcal{F}}$ is defined in Definition 3.3. We may and will assume that the function $\delta_{2.4}(h,\eta)$ is monotone decreasing in $h$ and monotone increasing in $\eta$ . Set $S:=S_{3.1}(\Psi,\frac{\varepsilon}{4})$ . We prove the theorem with

[TABLE]

Let $(G,\mathcal{D})$ be a vertex-weighted graph which is $\varepsilon$ -far from $\mathcal{P}$ . Apply Lemma 3.1 to $(G,\mathcal{D})$ with parameter $\frac{\varepsilon}{4}$ and with $\Psi$ as above, to obtain a partition $V(G)=X\cup Y\cup Z$ , a partition $\{P_{1},\dots,P_{r}\}$ of $Y$ , subsets $Q_{i}\subseteq P_{i}$ (for $1\leq i\leq r$ ), and pairwise-disjoint subsets $Q_{i,1},\dots,Q_{i,t}\subseteq Q_{i}$ , such that $t=\Psi(|X|+r)$ and Items 1-8 in Lemma 3.1 hold.

We claim that $G$ is $\frac{3\varepsilon}{4}$ -far from any graph $G^{\prime}$ on $V(G)$ which satisfies $G^{\prime}[X\cup Y]\in\mathcal{P}$ . So suppose by contradiction that there is a graph $G^{\prime}$ on $V(G)$ such that $G^{\prime}[X\cup Y]$ satisfies $\mathcal{P}$ and such that $G^{\prime}$ is $\frac{3\varepsilon}{4}$ -close to $G$ . Since $\mathcal{P}$ is extendable, there is a graph $G^{\prime\prime}$ on $V(G)=V(G^{\prime})$ such that $G^{\prime\prime}[X\cup Y]=G^{\prime}[X\cup Y]$ and such that $G^{\prime\prime}$ satisfies $\mathcal{P}$ . In order to turn $G^{\prime}$ into $G^{\prime\prime}$ , we only need to add/delete edges which are incident to vertices of $Z$ . Therefore, the total weight of edge-changes needed to turn $G^{\prime}$ into $G^{\prime\prime}$ is at most $\mathcal{D}(Z)<\frac{\varepsilon}{4}$ , as guaranteed by Item 1 of Lemma 3.1. So we see that $G$ can be turned into $G^{\prime\prime}$ , which satisfies $\mathcal{P}$ , by adding/deleting edges whose total weight is less than $\frac{3\varepsilon}{4}+\frac{\varepsilon}{4}=\varepsilon$ , in contradiction to the assumption that $(G,\mathcal{D})$ is $\varepsilon$ -far from $\mathcal{P}$ .

We thus proved that $G$ is $\frac{3\varepsilon}{4}$ -far from any graph $G^{\prime}$ satisfying $G^{\prime}[X\cup Y]\in\mathcal{P}$ . Now, let $G^{\prime}$ be the graph obtained from $G$ by doing the following changes:

1. For every $1\leq i\leq r$ , if $d(Q_{i,k},Q_{i,\ell})\geq\frac{1}{2}$ for every $1\leq k<\ell\leq t$ then turn $P_{i}$ into a clique, and if $d(Q_{i,k},Q_{i,\ell})<\frac{1}{2}$ for every $1\leq k<\ell\leq t$ , then turn $P_{i}$ into an independent set. By Item 6 in Lemma 3.1, one of these options has to hold. The total weight of edge-changes needed in this item is at most $\frac{\varepsilon}{4}$ by Item 4 of Lemma 3.1.

2. For every $1\leq i<j\leq r$ , if $d(Q_{i},Q_{j})>1-\frac{\varepsilon}{4}$ then add all edges between $P_{i}$ and $P_{j}$ , and if $d(Q_{i},Q_{j})<\frac{\varepsilon}{4}$ then remove all edges between $P_{i}$ and $P_{j}$ (note that if $\frac{\varepsilon}{4}\leq d(Q_{i},Q_{j})\leq 1-\frac{\varepsilon}{4}$ then no changes are made in the bipartite graph between $P_{i}$ and $P_{j}$ ). The total weight of edge-changes needed in this item is less than $\frac{\varepsilon}{2}$ by Item 5 of Lemma 3.1. Indeed, observe that the total weight of changes between $P_{i},P_{j}$ is less than $\mathcal{D}(P_{i})\mathcal{D}(P_{j})\cdot\left(|d(Q_{i},Q_{j})-d(P_{i},P_{j})|+\frac{\varepsilon}{4}\right)$ by the triangle inequality. Hence, the total weight of changes is less than

[TABLE]

Note that no edge with an endpoint in $X$ was added/deleted in Items 1-2, so $G^{\prime}$ and $G$ agree on all edges that are incident to vertices of $X$ .
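The decision rule of Item 2 depends only on the density $d(Q_{i},Q_{j})$ and on $\varepsilon$. The sketch below is illustrative; it assumes that the weighted density of a pair $(A,B)$ is the total weight $\mathcal{D}(a)\mathcal{D}(b)$ of the edges between $A$ and $B$ divided by $\mathcal{D}(A)\mathcal{D}(B)$ (the definition is given earlier in the paper and is not restated in this section), and both function names are hypothetical.

```python
def weighted_density(A, B, D, has_edge):
    """Assumed definition: edge weight between A and B, normalised by D(A) * D(B)."""
    mass = sum(D[a] for a in A) * sum(D[b] for b in B)
    hit = sum(D[a] * D[b] for a in A for b in B if has_edge(a, b))
    return hit / mass

def cleaning_rule(density, eps):
    """What Item 2 does to the bipartite graph between P_i and P_j."""
    if density > 1 - eps / 4:
        return "complete"   # add all edges between P_i and P_j
    if density < eps / 4:
        return "empty"      # remove all edges between P_i and P_j
    return "unchanged"      # intermediate density: leave the pair as is
```

Pairs left unchanged are exactly those that later receive the colour grey in the embedding scheme, unless they already happen to be complete or empty.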

We see that the total weight of edge-changes made in Items 1-2 is less than $\frac{3\varepsilon}{4}$ . So $G^{\prime}[X\cup Y]$ cannot satisfy $\mathcal{P}$ , implying that $G^{\prime}[X\cup Y]\in\mathcal{F}$ . Note that by definition (see Items 1-2 above), the graph $G^{\prime}$ has the following properties:

(a) For every $1\leq i\leq r$ , $P_{i}$ is either a clique or an independent set in $G^{\prime}$ . Moreover, if $P_{i}$ is a clique in $G^{\prime}$ then $d_{G}(Q_{i,k},Q_{i,\ell})\geq\frac{1}{2}$ for every $1\leq k<\ell\leq t$ , and if $P_{i}$ is an independent set in $G^{\prime}$ then $d_{G}(Q_{i,k},Q_{i,\ell})<\frac{1}{2}$ for every $1\leq k<\ell\leq t$ .

(b) For every pair $1\leq i<j\leq r$ , if there is an edge in $G^{\prime}$ between $P_{i}$ and $P_{j}$ then $d_{G}(Q_{i},Q_{j})\geq\frac{\varepsilon}{4}$ . Then by Item 7 of Lemma 3.1 we have that $d_{G}(Q_{i,k},Q_{j,\ell})\geq\frac{\varepsilon}{4}-\frac{1}{\Psi(|X|+r)}\geq\frac{\varepsilon}{8}$ for every $1\leq k,\ell\leq t$ . Analogously, if there is a non-edge in $G^{\prime}$ between $P_{i}$ and $P_{j}$ then $d_{G}(Q_{i},Q_{j})\leq 1-\frac{\varepsilon}{4}$ , which implies (by Item 7 of Lemma 3.1) that $d_{G}(Q_{i,k},Q_{j,\ell})\leq 1-\frac{\varepsilon}{4}+\frac{1}{\Psi(|X|+r)}\leq 1-\frac{\varepsilon}{8}$ for every $1\leq k,\ell\leq t$ .

Now let $K$ be the following embedding scheme: $A_{K}=X$ and $B_{K}=\{b_{1},\dots,b_{r}\}$ ; for each $1\leq i\leq r$ , vertex $b_{i}$ is colored black if $P_{i}$ is a clique in $G^{\prime}$ and white if $P_{i}$ is an independent set in $G^{\prime}$ ; for each $x,x^{\prime}\in X$ , edge $\{x,x^{\prime}\}$ is colored black if $\{x,x^{\prime}\}\in E(G)$ and white if $\{x,x^{\prime}\}\notin E(G)$ ; for each $x\in X$ , $1\leq i\leq r$ , edge $\{x,b_{i}\}$ is colored black if the bipartite graph between $x$ and $P_{i}$ is complete and white if this bipartite graph is empty (Item 3 in Lemma 3.1 implies that one of these options must hold); finally, for every $1\leq i<j\leq r$ , edge $\{b_{i},b_{j}\}$ is colored black if the bipartite graph between $P_{i}$ and $P_{j}$ is complete in $G^{\prime}$ , white if the bipartite graph between $P_{i}$ and $P_{j}$ is empty in $G^{\prime}$ , and grey otherwise.

Observe that the map $\varphi:X\cup Y\rightarrow V(K)$ which maps $x$ to itself (for every $x\in X=A_{K}$ ) and $P_{i}$ to $b_{i}$ (for every $1\leq i\leq r$ ), is an embedding from $G^{\prime}[X\cup Y]$ to $K$ . Since $|V(K)|=|X|+r$ , we have $K\in\mathcal{F}_{m}$ for $m:=|X|+r$ . By the definition of the function $\Psi_{\mathcal{F}}$ (see Definition 3.3), there is $F\in\mathcal{F}$ such that $F\rightarrow K$ and $|V(F)|\leq\Psi_{\mathcal{F}}(m)=\Psi_{\mathcal{F}}(|X|+r)\leq\Psi(|X|+r)=t$ .

Now, fixing an embedding $\rho$ from $F$ to $K$ , write $W_{i}:=\rho^{-1}(b_{i})=\{w_{i,1},\dots,w_{i,f_{i}}\}$ for $1\leq i\leq r$ . Put $W=W_{1}\cup\dots\cup W_{r}$ and $H=F[W]$ . We claim that the sets $(Q_{i,k})_{1\leq i\leq r,1\leq k\leq f_{i}}$ satisfy the requirements 1-2 in Lemma 2.4 with respect to $h=|V(F)|\leq\Psi_{\mathcal{F}}(m)$ , $\eta=\frac{\varepsilon}{8}$ and $H$ as above, in the graph $G$ . In other words, we show that one can apply Lemma 2.4 with the sets $U_{1},\dots,U_{h}$ being $(Q_{i,k})_{1\leq i\leq r,1\leq k\leq f_{i}}$ , and with $G$ as the host graph. We actually already proved that Item 1 in Lemma 2.4 holds; indeed, this follows from the fact that $F\rightarrow K$ , the definition of the embedding scheme $K$ , and Items (a)-(b) above. Item 2 of Lemma 2.4 follows from Items 6-7 of Lemma 3.1, which together imply that for every $1\leq i\leq j\leq r$ and $1\leq k\leq f_{i},1\leq\ell\leq f_{j}$ (with the exception of $(i,k)=(j,\ell)$ ), the pair $(Q_{i,k},Q_{j,\ell})$ is $\delta$ -regular with $\delta=\frac{1}{\Psi(m)}\leq\delta_{2.4}(\Psi_{\mathcal{F}}(m),\frac{\varepsilon}{8})\leq\delta_{2.4}(h,\frac{\varepsilon}{8})$ , as required.

We thus showed that Lemma 2.4 is applicable to the tuple of sets $(Q_{i,k})_{1\leq i\leq r,1\leq k\leq f_{i}}$ and the graph $H=F[W]$ (with the parameters defined above). Let $\mathcal{U}$ be the set of all tuples $(u_{i,k})_{1\leq i\leq r,1\leq k\leq f_{i}}$ , where $u_{i,k}\in Q_{i,k}$ , which induce (in $G$ ) a copy of $H=F[W]$ in which $u_{i,k}$ plays the role of $w_{i,k}$ for every $1\leq i\leq r$ and $1\leq k\leq f_{i}$ . By Lemma 2.4, we have

[TABLE]

where in the last inequality we used the guarantees of Item 8 in Lemma 3.1 and the monotonicity of the function $\delta_{2.4}$ . Observe that for every $(u_{i,k})_{i,k}\in\mathcal{U}$ , the subgraph of $G$ induced by the vertex-set $X\cup\{u_{i,k}:1\leq i\leq r,1\leq k\leq f_{i}\}$ contains an induced copy of $F$ . Indeed, this follows from the definition of $\mathcal{U}$ , the fact that $F\rightarrow K$ , and the definition of the embedding scheme $K$ . Now sample an $(|X|+|W|)$ -tuple of vertices from $G$ according to the distribution $\mathcal{D}$ and independently. Note that if every vertex in $X$ appears in the first $|X|$ vertices of the sample, and if the tuple of the last $|W|$ vertices of the sample belongs to $\mathcal{U}$ , then the subgraph induced by the sample contains an induced copy of $F$ and hence does not satisfy $\mathcal{P}$ (as $F\in\mathcal{F}$ ). The probability for this event is at least

[TABLE]

Here we used (5) and Item 2 in Lemma 3.1. Next, note that $|X|+|W|\leq|X|+rt\leq S$ , where in the last inequality we used Items 2 and 8 of Lemma 3.1. Similarly, $\Psi_{\mathcal{F}}(m)\leq t\leq S$ . So we see that a sample of $S$ random vertices induces a graph which does not satisfy $\mathcal{P}$ with probability at least $\delta_{2.4}\left(S,\frac{\varepsilon}{8}\right)\cdot S^{-S}$ . Therefore, a sample of $s=s_{\mathcal{P}}(\varepsilon)$ vertices (see (4)) induces a graph not satisfying $\mathcal{P}$ with probability at least

[TABLE]

as required. This completes the proof.

It is natural to ask about the dependence on $\varepsilon$ of the sample complexity of the tester supplied by Theorem 1. One answer is that one cannot prove any upper bound on the sample complexity which holds uniformly for all properties $\mathcal{P}$ , because it was shown in [6] that no such bound exists even in the standard model. Suppose then that one is interested only in “simple” properties such as induced $H$ -freeness (for some fixed $H$ ). In this case, it is not too hard to see that although we are iterating Lemma 2.8, which has wowzer-type (that is, iterated-tower) bounds$^{18}$ in this setting even for unweighted graphs (see [12, 25]), we are still getting “only” a wowzer-type bound. We should also point out that it might be possible to use the ideas in [12], together with those presented here, in order to get tower-type bounds on the sample complexity of testing induced $H$ -freeness in the VDF model.

$^{18}$ To be precise, we mean here that the “standard” way of establishing Lemma 2.8 (which is also the way we prove this lemma in this paper) is via the strong regularity lemma (see Lemma 2.6), which is known to only give wowzer-type bounds [12, 25]. In [12], (an unweighted variant of) Lemma 2.8 was proved without the use of the strong regularity lemma, thus giving better, tower-type, bounds. This is alluded to in the following sentence.

4 VDF-Testable Properties are Extendable and Hereditary

In this section we prove the “only if” direction of Theorem 1. The proof is divided between Propositions 4.1 and 4.2. As shown in [19], we can (and will) always assume that a VDF tester only queries the input graph on pairs of vertices which it has sampled.

Proposition 4.1.

If a graph property $\mathcal{P}$ is not extendable, then $\mathcal{P}$ is not testable in the VDF model.

[Proof] Since $\mathcal{P}$ is not extendable, there is a graph $G_{1}\in\mathcal{P}$ , such that no $(|V(G_{1})|+1)$ -vertex graph satisfying $\mathcal{P}$ contains $G_{1}$ as an induced subgraph. Let $G_{2}$ be a graph obtained from $G_{1}$ by adding a “new” vertex $v$ (and putting an arbitrary bipartite graph between $v$ and $V(G_{1})$ ), let $\mathcal{D}_{1}$ be the uniform distribution on $V(G_{1})$ , and let $\mathcal{D}_{2}$ be the distribution on $V(G_{2})$ which assigns weight $\frac{1}{|V(G_{1})|}$ to each $u\in V(G_{1})\subseteq V(G_{2})$ and weight$^{19}$ $0$ to $v$ .

$^{19}$ Evidently, if one does not wish to allow vertices of weight $0$ , then one can instead assign to $v$ a weight tending to $0$ ; or, more accurately, a weight that is small enough with respect to (the inverse of) the sample complexity of an alleged tester for $\mathcal{P}$ (in a proof by contradiction that such a tester does not exist).

It is clear that for every integer $q$ , a sample of $q$ vertices from $G_{1}$ according to $\mathcal{D}_{1}$ is indistinguishable from a sample of $q$ vertices from $G_{2}$ according to $\mathcal{D}_{2}$ . Observe that $G_{1}$ satisfies $\mathcal{P}$ while $(G_{2},\mathcal{D}_{2})$ is $\frac{1}{|V(G_{1})|^{2}}$ -far from $\mathcal{P}$ . To see that the latter statement is true, observe that by our choice of $G_{1}$ , no matter how we change the bipartite graph between $v$ and $V(G_{1})$ , we will always get a graph that does not satisfy $\mathcal{P}$ . Hence, in order to make $G_{2}$ satisfy $\mathcal{P}$ , one must change the adjacency relation between a pair of vertices from $V(G_{1})$ , whose weight (under $\mathcal{D}_{2}$ ) is $\frac{1}{|V(G_{1})|}\cdot\frac{1}{|V(G_{1})|}=\frac{1}{|V(G_{1})|^{2}}$ .

Now, the fact that $(G_{1},\mathcal{D}_{1})$ and $(G_{2},\mathcal{D}_{2})$ are indistinguishable implies that $\mathcal{P}$ is not testable$^{20}$ in the VDF model.

$^{20}$ We note that if $\mathcal{P}$ is non-extendable but hereditary, then one can easily obtain infinitely many examples showing that $\mathcal{P}$ is not testable (rather than just the one example given in the proof of Proposition 4.1). Indeed, instead of adding just one vertex to $G_{1}$ , one can add to $G_{1}$ any number $k$ of vertices (for a large $k$ ), and give these new vertices weight $o(1/k)$ , while distributing the remaining weight uniformly among the vertices of $G_{1}$ (note that such an assignment is precisely what the setting of Theorem 9 forbids). The assumption that $\mathcal{P}$ is hereditary implies that every graph obtained in this way is $\frac{1-o(1)}{|V(G_{1})|^{2}}$ -far from satisfying $\mathcal{P}$ . Also, if the weight given to the “new” vertices is small enough, then these two weighted graphs are indistinguishable by a sample of any prescribed size.

Proposition 4.2.

If a graph property $\mathcal{P}$ is not hereditary, then $\mathcal{P}$ is not testable in the VDF model.

[Proof] Since $\mathcal{P}$ is not hereditary, there is a graph $G_{1}$ and an induced subgraph $G_{2}$ of $G_{1}$ , such that $G_{1}$ satisfies $\mathcal{P}$ but $G_{2}$ does not. Let $\mathcal{D}_{2}$ be the uniform distribution on $V(G_{2})$ , and let $\mathcal{D}_{1}$ be the distribution on $V(G_{1})$ which is supported on $V(G_{2})\subseteq V(G_{1})$ and uniform when conditioned on $V(G_{2})$ , i.e. $\mathcal{D}_{1}(u)=\frac{1}{|V(G_{2})|}$ if $u\in V(G_{2})$ and $\mathcal{D}_{1}(u)=0$ if $u\in V(G_{1})\setminus V(G_{2})$ . Clearly, for every integer $q$ , a sample of $q$ vertices from $G_{1}$ according to $\mathcal{D}_{1}$ is indistinguishable from a sample of $q$ vertices from $G_{2}$ according to $\mathcal{D}_{2}$ . Also, $G_{1}$ satisfies $\mathcal{P}$ , whereas $(G_{2},\mathcal{D}_{2})$ is $\frac{1}{|V(G_{2})|^{2}}$ -far from $\mathcal{P}$ because $G_{2}\notin\mathcal{P}$ . Thus, $\mathcal{P}$ is not testable$^{21}$ in the VDF model.

$^{21}$ In analogy to Footnote 20, we note that if $\mathcal{P}$ is non-hereditary but extendable then one can obtain infinitely many examples showing that $\mathcal{P}$ is not testable (rather than just the one given in the proof of Proposition 4.2). Indeed, the extendability of $\mathcal{P}$ implies that there are arbitrarily large graphs which satisfy $\mathcal{P}$ and contain $G_{1}$ (and hence also $G_{2}$ ) as an induced subgraph. Each of these graphs (together with an appropriate distribution, as in the proof of Proposition 4.2) is a witness to the non-testability of $\mathcal{P}$ .

5 On Variations of the VDF Model and Related Problems

In the following two subsections we prove Theorems 6, 7, 8 and 9. We then consider two additional problems related to the VDF model; one problem asks if the query complexity in the VDF model is the same as in the standard model (for $\mathcal{P}$ that are testable in the VDF model), and the other asks for a characterization of the properties that are testable in variants of the VDF model (as in Theorems 6-9). We start by giving the precise definitions of the settings considered in Theorems 6-9.

The “large inputs” model

In this model, a property $\mathcal{P}$ is testable if there exists a function $M_{\mathcal{P}}:(0,1)\rightarrow\mathbb{N}$ such that for every $\varepsilon>0$ , $\mathcal{P}$ is $\varepsilon$ -testable with sample complexity depending only on $\varepsilon$ under the promise that inputs $(G,\mathcal{D})$ always satisfy $|V(G)|\geq M_{\mathcal{P}}(\varepsilon)$ .

The “size-aware” model

In this model, testers are allowed to receive, as part of the input, the number of vertices of the input graph.

The “no heavy-weights” (NHW) model

In this model, a property $\mathcal{P}$ is testable if there exists a function $c_{\mathcal{P}}:(0,1)\rightarrow(0,1)$ such that for every $\varepsilon>0$ , $\mathcal{P}$ is $\varepsilon$ -testable with sample complexity depending only on $\varepsilon$ under the promise that inputs $(G,\mathcal{D})$ always satisfy $\max_{v\in V(G)}{\mathcal{D}(v)}\leq c_{\mathcal{P}}(\varepsilon)$ .

The “no light-weights” (NLW) model

In this model, a property $\mathcal{P}$ is testable if for all $\varepsilon,\delta>0$ , $\mathcal{P}$ is $\varepsilon$ -testable with sample complexity depending only on $\varepsilon$ and $\delta$ under the promise that inputs $(G,\mathcal{D})$ always satisfy $\min_{v\in V(G)}{\mathcal{D}(v)}\geq\delta/|V(G)|$ .

Theorem 6 (resp. 7, 8, 9) then states that every hereditary property is testable in the “large inputs” (resp. “size-aware”, NHW, NLW) model$^{22}$.

$^{22}$ Note that if $\mathcal{P}$ is testable in the “large inputs” model then it is also testable in the NHW model, because by setting $c_{\mathcal{P}}(\varepsilon):=1/M_{\mathcal{P}}(\varepsilon)$ we can make sure that the input graph has at least $M_{\mathcal{P}}(\varepsilon)$ vertices. Still, we decided to include a separate proof for Theorem 8 (instead of deducing it from Theorem 6) for two reasons: one is that in the course of the proof we resolve another open question raised in [19]; and the other is that our proof of Theorem 8 shows that $\mathcal{P}$ is testable (in the NHW model) by a tester that accepts if and only if the subgraph induced by the sample satisfies $\mathcal{P}$ , whereas the tester given by the proof of Theorem 6 is not always of this form.

5.1 Proof of Theorems 6, 7 and 9

In this subsection we prove Theorems 6, 7 and 9, i.e. we show that every hereditary property is testable (with one-sided error) in the “large inputs”, “size-aware” and NLW models. Let us introduce some definitions that we will use throughout this subsection. Let $\mathcal{P}$ be a hereditary graph property. A graph $F$ is called $\mathcal{P}$ -good if for every $r\geq|V(F)|$ there is an $r$ -vertex graph which satisfies $\mathcal{P}$ and contains $F$ as an induced subgraph; this in particular implies that $F$ itself satisfies $\mathcal{P}$ . If $F$ is not $\mathcal{P}$ -good then it is called ${\mathcal{P}}$ -bad, and we denote by $r_{\mathcal{P}}(F)$ the minimal $r\geq|V(F)|$ such that there is no $r$ -vertex graph which satisfies $\mathcal{P}$ and contains $F$ as an induced subgraph. In particular, if $F$ does not satisfy $\mathcal{P}$ then it is $\mathcal{P}$ -bad and $r_{\mathcal{P}}(F)=|V(F)|$ . Note that since $\mathcal{P}$ is hereditary, if $F$ is $\mathcal{P}$ -bad then there is no graph on $r$ vertices for any $r\geq r_{\mathcal{P}}(F)$ which satisfies $\mathcal{P}$ and contains $F$ as an induced subgraph. Now let $\mathcal{H}=\mathcal{H}(\mathcal{P})$ be the property of being $\mathcal{P}$ -good. Then $\mathcal{H}\subseteq\mathcal{P}$ and $\mathcal{H}$ is hereditary, which follows from the definition of $\mathcal{P}$ -goodness and the fact that $\mathcal{P}$ is hereditary. Observe moreover that $\mathcal{H}$ is extendable. Indeed, let $G\in\mathcal{H}$ , and suppose, for the sake of contradiction, that for every $G^{\prime}$ on $|V(G)|+1$ vertices which contains $G$ as an induced subgraph, it holds that $G^{\prime}\notin\mathcal{H}$ . Then for every such $G^{\prime}$ , there is no graph on $r_{\mathcal{P}}(G^{\prime})$ vertices that satisfies $\mathcal{P}$ and contains $G^{\prime}$ as an induced subgraph. 
But this means that there is no graph on $\max_{G^{\prime}}{r_{\mathcal{P}}(G^{\prime})}$ vertices which satisfies $\mathcal{P}$ and contains $G$ as an induced subgraph, in contradiction to $G\in\mathcal{H}$ . We note also that if $\mathcal{P}$ itself is extendable then $\mathcal{H}=\mathcal{P}$ .

For an integer $s\geq 1$ , let $R_{\mathcal{P}}(s)$ be the maximum of $r_{\mathcal{P}}(F)$ over all $\mathcal{P}$ -bad graphs $F$ with at most $s$ vertices; if no such graphs exist, we set $R_{\mathcal{P}}(s)=0$ (this will not matter later on). We are now ready to prove Theorem 6, which we rephrase as follows.

Proposition 5.1.

For every hereditary property $\mathcal{P}$ there are functions $M_{\mathcal{P}},s_{\mathcal{P}}:(0,1)\rightarrow\mathbb{N}$ such that for every $\varepsilon>0$ , the property $\mathcal{P}$ is $\varepsilon$ -testable with one-sided error and sample complexity $s_{\mathcal{P}}(\varepsilon)$ under the promise that inputs $(G,\mathcal{D})$ always satisfy $|V(G)|\geq M_{\mathcal{P}}(\varepsilon)$ .

[Proof] Consider the (extendable and hereditary) property $\mathcal{H}=\mathcal{H}(\mathcal{P})$ defined above. By Theorem 4, there is a function $s_{\mathcal{H}}:(0,1)\rightarrow\mathbb{N}$ such that for every $\varepsilon>0$ and for every vertex-weighted graph $(G,\mathcal{D})$ which is $\varepsilon$ -far from $\mathcal{H}$ , a sample of $s_{\mathcal{H}}(\varepsilon)$ vertices from $G$ (taken from $\mathcal{D}$ ) induces a subgraph which does not satisfy $\mathcal{H}$ with probability at least $\frac{2}{3}$ .

Our (“large inputs”-model) tester for $\mathcal{P}$ samples $s_{\mathcal{H}}(\varepsilon)$ vertices, and accepts if and only if the subgraph induced by the sample satisfies $\mathcal{H}$ . We prove the proposition with $M=M_{\mathcal{P}}(\varepsilon):=R_{\mathcal{P}}(s_{\mathcal{H}}(\varepsilon)).$

Let $(G,\mathcal{D})$ be a vertex-weighted graph with $|V(G)|\geq M$ . Suppose first that $G$ satisfies $\mathcal{P}$ . Our goal is to show that the subgraph induced by a sample of $s_{\mathcal{H}}(\varepsilon)$ vertices, taken from $\mathcal{D}$ and independently, satisfies $\mathcal{H}$ with probability $1$ . So suppose by contradiction that $G$ contains an induced subgraph $F$ on at most $s_{\mathcal{H}}(\varepsilon)$ vertices which does not satisfy $\mathcal{H}$ . In other words, $F$ is $\mathcal{P}$ -bad. By the definition of $r_{\mathcal{P}}(F)$ , there is no graph on $r_{\mathcal{P}}(F)$ vertices which satisfies $\mathcal{P}$ and contains $F$ as an induced subgraph. As $|V(G)|\geq M=R_{\mathcal{P}}(s_{\mathcal{H}}(\varepsilon))\geq r_{\mathcal{P}}(F)$ , and as $\mathcal{P}$ is hereditary, we get that $G$ does not satisfy $\mathcal{P}$ , a contradiction.

Suppose now that $(G,\mathcal{D})$ is $\varepsilon$ -far from $\mathcal{P}$ . Then $(G,\mathcal{D})$ is also $\varepsilon$ -far from $\mathcal{H}$ , as $\mathcal{H}\subseteq\mathcal{P}$ . By our choice of $s_{\mathcal{H}}(\varepsilon)$ , a sample of $s_{\mathcal{H}}(\varepsilon)$ vertices of $G$ , taken from $\mathcal{D}$ and independently, does not satisfy $\mathcal{H}$ with probability at least $\frac{2}{3}$ . So our tester rejects $(G,\mathcal{D})$ with probability at least $\frac{2}{3}$ , as required.

It is natural to ask whether we can replace the function $M_{\mathcal{P}}(\varepsilon)$ in Proposition 5.1 by a constant depending only on $\mathcal{P}$ (and not on $\varepsilon$ ). As is shown in the following proposition, we cannot.

Proposition 5.2.

There is a hereditary property $\mathcal{P}$ such that for every $M>0$ , there is no tester for $\mathcal{P}$ in the VDF model even if we are guaranteed that the input graph has at least $M$ vertices.

[Proof]For each $k\geq 3$ , let $C_{k}^{*}$ be the graph obtained from the $k$ -cycle $C_{k}$ by adding an isolated vertex. Consider the property $\mathcal{P}=\{C_{k}^{*}:k\geq 3\}$ -freeness. Let $M>0$ . Set $G=C_{M}$ and $G^{\prime}=C_{M}^{*}$ . Let $\mathcal{D}$ be the uniform distribution on $V(G)$ , and let $\mathcal{D}^{\prime}$ be the distribution on $V(G^{\prime})$ which assigns weight $0$ to the isolated vertex in $G^{\prime}$ , and is uniform on the rest of the vertices of $G^{\prime}$ . Then $G\in\mathcal{P}$ and $(G^{\prime},\mathcal{D}^{\prime})$ is $\frac{1}{M^{2}}$ -far from $\mathcal{P}$ , but a sample (of any number of vertices) from $(G,\mathcal{D})$ is indistinguishable from a sample of the same size from $(G^{\prime},\mathcal{D}^{\prime})$ . This shows that $\mathcal{P}$ is not testable even if we require input graphs to have at least $M$ vertices.

We now move on to prove Theorem 7.

[Proof of Theorem 7] Let $\mathcal{P}$ be a hereditary graph property. Our goal is to design (and prove the correctness of) a one-sided-error tester for $\mathcal{P}$ in the VDF model, provided that the tester receives $|V(G)|$ as part of the input. Let $M_{\mathcal{P}}:(0,1)\rightarrow\mathbb{N}$ be as in Lemma 5.1. On input $\varepsilon\in(0,1)$ , $G$ and $\mathcal{D}$ (where $G$ is a graph and $\mathcal{D}$ is a distribution on $V(G)$ ), our tester works as follows:

1. If $|V(G)|\geq M_{\mathcal{P}}(\varepsilon)$ , then invoke the tester whose existence is guaranteed by Lemma 5.1, and accept if and only if this tester accepts.

2. Otherwise, i.e. if $|V(G)|<M_{\mathcal{P}}(\varepsilon)$ , then do the following: setting $M:=M_{\mathcal{P}}(\varepsilon)$ and $t:=M\log(3M)/\varepsilon$ , sample vertices $u_{1},\dots,u_{t}\in V(G)$ according to $\mathcal{D}$ and independently, and put $U:=\{u_{1},\dots,u_{t}\}$ . Accept if and only if there exists a graph on $|V(G)|$ vertices which satisfies $\mathcal{P}$ and contains $G[U]$ as an induced subgraph (in the notation introduced at the beginning of this subsection, this is the same as saying that $r_{\mathcal{P}}(G[U])>|V(G)|$ ).

Let us prove the correctness of our tester. First, Lemma 5.1 guarantees that if $|V(G)|\geq M_{\mathcal{P}}(\varepsilon)$ then the tester works correctly; namely, it accepts with probability $1$ if $G\in\mathcal{P}$ , and rejects with probability at least $\frac{2}{3}$ if $(G,\mathcal{D})$ is $\varepsilon$ -far from $\mathcal{P}$ .

So from now on we may assume that $|V(G)|<M_{\mathcal{P}}(\varepsilon)$ . Suppose first that $G\in\mathcal{P}$ . Evidently, for every $U\subseteq V(G)$ there is a graph on $|V(G)|$ vertices which satisfies $\mathcal{P}$ and contains $G[U]$ as an induced subgraph (indeed, $G$ is such a graph). Hence, the tester accepts $G$ with probability $1$ (see Item 2).

Now suppose that $(G,\mathcal{D})$ is $\varepsilon$ -far from $\mathcal{P}$ . Observe that for each $v\in V(G)$ , the probability that $v\notin U$ is

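$$(1-\mathcal{D}(v))^{t}\leq e^{-\mathcal{D}(v)\cdot t}=e^{-\mathcal{D}(v)\cdot M\log(3M)/\varepsilon},$$

which is at most $\frac{1}{3M}$ whenever $\mathcal{D}(v)\geq\varepsilon/M$ .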

By taking the union bound over all (at most $|V(G)|<M$ ) vertices $v\in V(G)$ which satisfy $\mathcal{D}(v)\geq\varepsilon/M$ , we see that the probability that there is $v\in V(G)\setminus U$ with $\mathcal{D}(v)\geq\varepsilon/M$ , is at most $\frac{1}{3}$ . Suppose that every $v\in V(G)\setminus U$ satisfies $\mathcal{D}(v)<\varepsilon/M$ (this happens with probability at least $\frac{2}{3}$ ). Then $\mathcal{D}(V(G)\setminus U)<|V(G)|\cdot\varepsilon/M<\varepsilon$ (where in the last inequality we used our assumption that $|V(G)|<M$ ). Now, if (by contradiction) there is a graph $G^{\prime}$ on $|V(G)|$ vertices which satisfies $\mathcal{P}$ and contains $G[U]$ as an induced subgraph, then one can turn $G$ into $G^{\prime}$ by only adding/deleting edges which are incident to vertices in $V(G)\setminus U$ . Since $\mathcal{D}(V(G)\setminus U)<\varepsilon$ , this stands in contradiction to the assumption that $(G,\mathcal{D})$ is $\varepsilon$ -far from $\mathcal{P}$ . We conclude that there is no such graph $G^{\prime}$ . This implies that $(G,\mathcal{D})$ is rejected with probability at least $\frac{2}{3}$ , as required.
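The sample size used in Item 2 can be sanity-checked numerically. The following Python sketch evaluates the union bound on the probability that some vertex of weight at least $\varepsilon/M$ is missed by the sample. The function names `sample_size` and `heavy_miss_bound` are illustrative and do not come from the paper, and reading $\log$ as the natural logarithm (with $t$ rounded up to an integer) is an assumption.

```python
import math

def sample_size(M: int, eps: float) -> int:
    """t = M * log(3M) / eps, rounded up (natural log is an assumption)."""
    return math.ceil(M * math.log(3 * M) / eps)

def heavy_miss_bound(weights, M: int, eps: float) -> float:
    """Union bound on Pr[some vertex v with D(v) >= eps/M is not sampled].

    A vertex v is missed by all t independent samples with
    probability (1 - D(v)) ** t.
    """
    t = sample_size(M, eps)
    return sum((1.0 - w) ** t for w in weights if w >= eps / M)

# A 4-vertex input with M = 5 (so |V(G)| < M) and eps = 0.1.
weights = [0.5, 0.3, 0.15, 0.05]
bound = heavy_miss_bound(weights, M=5, eps=0.1)
print(bound <= 1 / 3)
```

The sketch covers only the sampling step. The acceptance rule of Item 2 additionally needs a way to decide whether some graph on $|V(G)|$ vertices satisfies $\mathcal{P}$ and contains $G[U]$, which depends on the property and is not modelled here.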

Finally, we prove Theorem 9, i.e. that every hereditary property is testable in the NLW model. We restate this theorem as follows.

Proposition 5.3.

For every hereditary property $\mathcal{P}$ there is a function $t_{\mathcal{P}}:(0,1)^{2}\rightarrow\mathbb{N}$ such that for all $\varepsilon,\delta>0$ , the property $\mathcal{P}$ is $\varepsilon$ -testable with one-sided error and sample complexity $t_{\mathcal{P}}(\varepsilon,\delta)$ under the promise that inputs $(G,\mathcal{D})$ always satisfy $\min_{v\in V(G)}{\mathcal{D}(v)}\geq\delta/|V(G)|$ .

[Proof]We start by specifying the function $t_{\mathcal{P}}(\varepsilon,\delta)$ . Consider the (extendable and hereditary) property $\mathcal{H}=\mathcal{H}(\mathcal{P})$ defined above. By Theorem 4, there is a function $s_{\mathcal{H}}:(0,1)\rightarrow\mathbb{N}$ such that for every $\varepsilon>0$ and for every vertex-weighted graph $(G,\mathcal{D})$ which is $\varepsilon$ -far from $\mathcal{H}$ , a sample of $s_{\mathcal{H}}(\varepsilon)$ vertices of $G$ (taken from $\mathcal{D}$ ) induces a subgraph which does not satisfy $\mathcal{H}$ with probability at least $\frac{5}{6}$ . (Footnote 23: The statement of Theorem 4 only guarantees a success probability of $\frac{2}{3}$ , but this can clearly be amplified to $\frac{5}{6}$ by repeating the experiment $O(1)$ times.) Now set $R:=R_{\mathcal{P}}(s_{\mathcal{H}}(\varepsilon))$ and

[TABLE]

Our tester for $\mathcal{P}$ in the NLW model simply samples a sequence of $t_{\mathcal{P}}(\varepsilon,\delta)$ vertices of the input and accepts if and only if the subgraph induced by the sample satisfies $\mathcal{P}$ . Evidently, this tester accepts with probability $1$ if the input satisfies $\mathcal{P}$ . So to establish the correctness of our tester, it suffices to show that it rejects with probability at least $\frac{2}{3}$ if the input $(G,\mathcal{D})$ is $\varepsilon$ -far from $\mathcal{P}$ .

Let $\varepsilon,\delta>0$ , and let $(G,\mathcal{D})$ be a vertex-weighted graph on $n$ vertices which is $\varepsilon$ -far from $\mathcal{P}$ , and in which all vertices have weight at least $\delta/n$ . Let $u_{1},\dots,u_{t}$ be a sequence of $t=t_{\mathcal{P}}(\varepsilon,\delta)$ random vertices of $G$ , sampled according to $\mathcal{D}$ and independently, and set $U=\{u_{1},\dots,u_{t}\}$ . We need to show that with probability at least $\frac{2}{3}$ , $G[U]$ does not satisfy $\mathcal{P}$ . Suppose first that $n<2R$ . We claim that in this case we have $U=V(G)$ with probability at least $\frac{2}{3}$ (this is clearly sufficient because $G$ itself does not satisfy $\mathcal{P}$ ). For a vertex $v\in V(G)$ , the probability that $u_{i}\neq v$ for every $1\leq i\leq t$ is

[TABLE]

So by the union bound over all $n<2R$ vertices of $G$ , we see that with probability at least $\frac{2}{3}$ , $U=V(G)$ , as required.

Suppose now that $n\geq 2R$ . Our choice of $s=s_{\mathcal{H}}(\varepsilon)$ guarantees that with probability at least $\frac{5}{6}$ , the graph $F:=G[\{u_{1},\dots,u_{s}\}]$ does not satisfy $\mathcal{H}$ , meaning that it is $\mathcal{P}$ -bad. We will now show that with probability at least $\frac{5}{6}$ , we have $|U|\geq R$ . This will imply that with probability at least $\frac{2}{3}$ , $G[U]$ contains as an induced subgraph a $\mathcal{P}$ -bad graph $F$ on at most $s_{\mathcal{H}}(\varepsilon)$ vertices, and also $|U|\geq R=R_{\mathcal{P}}(s_{\mathcal{H}}(\varepsilon))\geq r_{\mathcal{P}}(F)$ . By the definition of $r_{\mathcal{P}}(F)$ , this would imply that $G[U]$ does not satisfy $\mathcal{P}$ , as required.

So from now on, our goal is to show that $|U|\geq R$ with probability at least $\frac{5}{6}$ . Fix a partition of $V(G)$ into $R$ sets $V_{1},\dots,V_{R}$ , each of size at least $\lfloor\frac{n}{R}\rfloor\geq\frac{n}{2R}$ . For each $1\leq i\leq R$ , let $A_{i}$ be the event that $U\cap V_{i}\neq\emptyset$ . Note that if $A_{i}$ occurs for every $1\leq i\leq R$ , then $|U|\geq R$ . Since $\mathcal{D}(V_{i})\geq|V_{i}|\cdot\frac{\delta}{n}\geq\frac{n}{2R}\cdot\frac{\delta}{n}=\frac{\delta}{2R}$ , the probability that $A_{i}$ does not occur is at most

[TABLE]

By the union bound, the probability that there is $1\leq i\leq R$ for which $A_{i}$ does not occur, is at most $\frac{1}{6}$ , as required. This completes the proof.
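For concreteness, here is one choice of sample size under which both union bounds in the proof go through. It is a sketch under our own assumption: the lower bound on $t$ below is not claimed to be the authors' definition of $t_{\mathcal{P}}(\varepsilon,\delta)$, and $\log$ is read as the natural logarithm.

```latex
% Assumed (illustrative) choice of t; R and s_H(eps) are as in the proof.
t \;\geq\; \max\Bigl\{ s_{\mathcal{H}}(\varepsilon),\ \tfrac{2R}{\delta}\log(6R) \Bigr\}
\quad\Longrightarrow\quad
e^{-\delta t/(2R)} \;\leq\; \frac{1}{6R}.

% Case n < 2R: every vertex has weight at least delta/n > delta/(2R).
\Pr[v \notin U] = (1-\mathcal{D}(v))^{t} \leq e^{-\delta t/n} < e^{-\delta t/(2R)} \leq \frac{1}{6R},
\qquad
\sum_{v \in V(G)} \Pr[v \notin U] < 2R \cdot \frac{1}{6R} = \frac{1}{3}.

% Case n >= 2R: each part V_i has weight at least delta/(2R).
\Pr[\overline{A_i}] = (1-\mathcal{D}(V_i))^{t} \leq e^{-\delta t/(2R)} \leq \frac{1}{6R},
\qquad
\sum_{i=1}^{R} \Pr[\overline{A_i}] \leq R \cdot \frac{1}{6R} = \frac{1}{6}.
```

The requirement $t\geq s_{\mathcal{H}}(\varepsilon)$ is there so that the first $s_{\mathcal{H}}(\varepsilon)$ sampled vertices are available for the $\mathcal{P}$ -bad subgraph $F$ .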

5.2 Proof of Theorem 8

In this subsection we prove Theorem 8, i.e. we show that every hereditary property is testable in the NHW model. Again, we rephrase as follows.

Proposition 5.4.

For every hereditary property $\mathcal{P}$ there are functions $t_{\mathcal{P}}:(0,1)\rightarrow\mathbb{N}$ and $c_{\mathcal{P}}:(0,1)\rightarrow(0,1)$ such that for every $\varepsilon>0$ , the property $\mathcal{P}$ is $\varepsilon$ -testable with one-sided error and sample complexity $t_{\mathcal{P}}(\varepsilon)$ under the promise that inputs $(G,\mathcal{D})$ always satisfy $\max_{v\in V(G)}{\mathcal{D}(v)}\leq c_{\mathcal{P}}(\varepsilon)$ .

The key idea in the proof of Proposition 5.4, which appeared in [19], is to “blow up” the vertex-weighted graph $(G,\mathcal{D})$ by replacing each vertex $v$ with a vertex-set whose size is proportional to $\mathcal{D}(v)$ , and thus obtain an (unweighted) graph $G^{\prime}$ , to which one can apply known testability results in the standard model.

Let us introduce some definitions. For a graph $G$ , say on $V(G)=\{v_{1},\dots,v_{n}\}$ , and for integers $b_{1},\dots,b_{n}\geq 0$ , a $(b_{1},\dots,b_{n})$ -blowup of $G$ is any graph admitting a vertex-partition $V_{1}\cup\dots\cup V_{n}$ such that $|V_{i}|=b_{i}$ for every $1\leq i\leq n$ , and such that the bipartite graph between $V_{i}$ and $V_{j}$ is complete if $\{v_{i},v_{j}\}\in E(G)$ and empty if $\{v_{i},v_{j}\}\notin E(G)$ . The sets $V_{1},\dots,V_{n}$ are called the blowup-sets. Note that we do not pose any restrictions on the graphs induced by the sets $V_{1},\dots,V_{n}$ ; these graphs may be arbitrary. For simplicity of presentation, we assume henceforth that all vertex-weights are rational. (Footnote 24: If one allows general (i.e. possibly irrational) weights, then it is necessary to change the definition of a $(\mathcal{D},N)$ -blowup by rounding $\mathcal{D}(v_{i})\cdot N$ to the closest integer. This results in an additive error of $\frac{n}{N}$ in the conclusion of Lemma 5.5, due to rounding. Consequently, in (the proofs of) Propositions 5.4 and 5.7 we need to consider $(\mathcal{D},N)$ -blowups with $N\rightarrow\infty$ in order to have this error term go to $0$ . We also need to replace $\varepsilon$ in several places with (say) $\frac{\varepsilon}{2}$ (or any other number smaller than $\varepsilon$ ). For example, the conclusion of Proposition 5.7 should be that $\mathcal{P}$ is testable in the VDF model by a tester having one-sided error and sample complexity $q_{\mathcal{P}}(\varepsilon/2)$ .) Now let $\mathcal{D}$ be a distribution on $V(G)=\{v_{1},\dots,v_{n}\}$ , and let $N\in\mathbb{N}$ be such that $\mathcal{D}(v_{i})\cdot N$ is an integer for every $1\leq i\leq n$ ; such an $N$ is called admissible. A $(\mathcal{D},N)$ -blowup of $G$ is a $(b_{1},\dots,b_{n})$ -blowup of $G$ with $b_{i}=\mathcal{D}(v_{i})\cdot N$ for every $1\leq i\leq n$ . 
Note that a blowup is always treated as “unweighted” (in other words, the distribution on its vertices is uniform). Goldreich [19] proved that for every graph $F$ and $\varepsilon\in(0,1)$ , if a vertex-weighted graph $(G,\mathcal{D})$ is $\varepsilon$ -far from being $F$ -free, then for every admissible $N$ , any $(\mathcal{D},N)$ -blowup of $G$ is $\frac{\varepsilon}{\binom{|V(F)|}{2}}$ -far from being $F$ -free. Goldreich further asked whether the $\binom{|V(F)|}{2}^{-1}$ -factor can be avoided. In the following lemma we show that this is indeed the case, and moreover that an analogous statement holds for every hereditary property. This lemma is also the key ingredient in the proof of Proposition 5.4.
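The blowup definitions can be made concrete with a short Python sketch. The helper names `blowup` and `dn_blowup`, the 0-based vertex labels, and the `inner` switch (which picks cliques or independent sets inside the blowup-sets, a choice the definition leaves free) are all illustrative assumptions rather than notation from the paper. Weights are exact rationals so that admissibility of $N$ can be checked exactly.

```python
from fractions import Fraction
from itertools import combinations

def blowup(n, edges, sizes, inner="independent"):
    """Return (parts, edge_set) of a (b_1,...,b_n)-blowup of a graph on 0..n-1.

    parts[i] lists the vertices of the blowup-set V_i. Pairs in different
    blowup-sets copy the adjacency of G; pairs inside one blowup-set are
    unconstrained by the definition, so `inner` picks cliques or
    independent sets.
    """
    g_edges = {frozenset(e) for e in edges}
    parts, next_id = [], 0
    for b in sizes:
        parts.append(list(range(next_id, next_id + b)))
        next_id += b
    new_edges = set()
    for i, j in combinations(range(n), 2):
        if frozenset((i, j)) in g_edges:
            new_edges.update(frozenset((x, y)) for x in parts[i] for y in parts[j])
    if inner == "clique":
        for part in parts:
            new_edges.update(frozenset(p) for p in combinations(part, 2))
    return parts, new_edges

def dn_blowup(n, edges, weights, N, inner="independent"):
    """(D, N)-blowup: b_i = D(v_i) * N, which must be an integer (N admissible)."""
    sizes = []
    for w in weights:
        b = w * N
        if b.denominator != 1:
            raise ValueError("N is not admissible for these weights")
        sizes.append(int(b))
    return blowup(n, edges, sizes, inner)

# Path v0 - v1 - v2 with weights 1/2, 1/4, 1/4; N = 4 is admissible.
weights = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 4)]
parts, new_edges = dn_blowup(3, [(0, 1), (1, 2)], weights, N=4)
print([len(p) for p in parts], len(new_edges))
```

In this example a uniformly random vertex of the blowup lands in $V_{i}$ with probability $|V_{i}|/N=\mathcal{D}(v_{i})$ , which is the property of $(\mathcal{D},N)$ -blowups used in the proofs of this subsection.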

Lemma 5.5.

Let $\mathcal{P}$ be a hereditary graph property and let $(G,\mathcal{D})$ be a vertex-weighted graph which is $\varepsilon$ -far from $\mathcal{P}$ . Then for every admissible $N$ , any $(\mathcal{D},N)$ -blowup of $G$ is $\varepsilon$ -far from $\mathcal{P}$ .

[Proof]Fix any admissible $N$ and let $G^{\prime}$ be a $(\mathcal{D},N)$ -blowup of $G$ . As above, we use $v_{1},\dots,v_{n}$ to denote the vertices of $G$ , and $V_{1},\dots,V_{n}$ to denote the corresponding blowup sets. Suppose by contradiction that there is a graph $H^{\prime}$ on $V(G^{\prime})$ that satisfies $\mathcal{P}$ and is $\varepsilon$ -close to $G^{\prime}$ . Let $H$ be the random graph defined as follows: the vertex-set of $H$ is $V(H)=V(G)=\{v_{1},\dots,v_{n}\}$ . To define the edge-set of $H$ , sample for each $1\leq i\leq n$ a vertex $u_{i}\in V_{i}$ uniformly at random, and make $\{v_{i},v_{j}\}$ an edge in $H$ if and only if $\{u_{i},u_{j}\}$ is an edge in $H^{\prime}$ (for $1\leq i<j\leq n$ ). Then $H$ satisfies $\mathcal{P}$ (with probability $1$ ) because $H$ is isomorphic to an induced subgraph of $H^{\prime}$ and $\mathcal{P}$ is hereditary. Let us compute the expected distance between $H$ and $G$ (here the distance is with respect to the distribution $\mathcal{D}$ ). For each $1\leq i<j\leq n$ , the probability that $\{v_{i},v_{j}\}\in E(G)\triangle E(H)$ is precisely

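$$\frac{\bigl|\{(x,y)\in V_{i}\times V_{j}:\{x,y\}\in E(G^{\prime})\triangle E(H^{\prime})\}\bigr|}{|V_{i}|\cdot|V_{j}|}.$$

Hence, the expected distance between $H$ and $G$ is

$$\sum_{1\leq i<j\leq n}\mathcal{D}(v_{i})\mathcal{D}(v_{j})\cdot\frac{\bigl|\{(x,y)\in V_{i}\times V_{j}:\{x,y\}\in E(G^{\prime})\triangle E(H^{\prime})\}\bigr|}{|V_{i}|\cdot|V_{j}|}=\frac{1}{N^{2}}\sum_{1\leq i<j\leq n}\bigl|\{(x,y)\in V_{i}\times V_{j}:\{x,y\}\in E(G^{\prime})\triangle E(H^{\prime})\}\bigr|\leq\frac{|E(G^{\prime})\triangle E(H^{\prime})|}{N^{2}}<\varepsilon,$$

where the equality uses $|V_{i}|=\mathcal{D}(v_{i})\cdot N$ , and the last inequality uses the assumption that $G^{\prime}$ is $\varepsilon$ -close to $H^{\prime}$ . So there is an outcome of $H$ which is $\varepsilon$ -close to $G$ ; that is, $G$ is $\varepsilon$ -close to a graph which satisfies $\mathcal{P}$ , a contradiction.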

By combining Lemma 5.5 with the result of [5] (that all hereditary properties are testable with one-sided error in the standard model), we obtain the following: for every hereditary property $\mathcal{P}$ , for every vertex-weighted graph $(G,\mathcal{D})$ which is $\varepsilon$ -far from $\mathcal{P}$ , for every admissible $N$ and for every $(\mathcal{D},N)$ -blowup $G^{\prime}$ of $G$ , it holds that $G^{\prime}$ is $\varepsilon$ -far from $\mathcal{P}$ with respect to the uniform distribution, and hence a sample of some $s=s_{\mathcal{P}}(\varepsilon)$ vertices of $G^{\prime}$ , taken uniformly and independently, induces a graph which w.h.p. does not satisfy $\mathcal{P}$ . Observe that this induced subgraph of $G^{\prime}$ has (essentially) the same distribution as the graph $S$ on $[s]$ obtained by sampling vertices $u_{1},\dots,u_{s}\in V(G)$ from $\mathcal{D}$ independently, and letting $\{i,j\}\in E(S)$ if and only if $\{u_{i},u_{j}\}\in E(G)$ (this is precisely the graph defined in Theorem 5). We thus established Theorem 5, as promised in Subsection 1.2.

As noted in Subsection 1.2, the graph $S$ defined above is a blowup of an induced subgraph of $G$ , but is not necessarily a subgraph of $G$ in itself. This is because the sequence $u_{1},\dots,u_{s}$ might contain repeated vertices. In other words, it may be the case that $G^{\prime}$ contains “forbidden subgraphs” which use several vertices from one of the blowup-sets, and thus do not correspond to “forbidden subgraphs” in $G$ . This creates an obstacle for proving Proposition 5.4, because in order to prove this proposition we need to know that a (suitably chosen) random induced subgraph of $G$ (and not just the blowup thereof) does not satisfy $\mathcal{P}$ w.h.p. To avoid this obstacle, we use the assumption that all vertices in $G$ have relatively small weight, which guarantees that it is unlikely to sample more than once from some blowup-set (or in other words, that $S$ is isomorphic to $G[\{u_{1},\dots,u_{s}\}]$ ). We note that a different way of dealing with this obstacle is to restrict ourselves to properties for which we can guarantee, by appropriately choosing the graphs inside the blowup-sets, that there would not be any minimal forbidden subgraph which uses several vertices from one of the blowup-sets, see Subsection 5.3.
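The effect of repeated vertices on the sampled graph $S$ can be seen in a tiny example. The Python sketch below builds $S$ from a sample sequence. The function name `sample_graph` is illustrative, and positions are numbered from 0 rather than from 1 as in $[s]$ .

```python
def sample_graph(edges, seq):
    """Graph S on positions 0..s-1: {i, j} is an edge iff {u_i, u_j} is an edge of G."""
    g_edges = {frozenset(e) for e in edges}
    s = len(seq)
    return {
        (i, j)
        for i in range(s)
        for j in range(i + 1, s)
        if frozenset((seq[i], seq[j])) in g_edges
    }

# G is a single edge {a, b}; the sampled sequence repeats the vertex a.
S = sample_graph([("a", "b")], ["a", "b", "a"])
print(sorted(S))
```

Here $S$ is a path on three vertices, whereas $G$ has only two vertices, so $S$ is a blowup of $G$ (with the two copies of `a` forming an independent blowup-set) but not an induced subgraph of it.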

[Proof of Proposition 5.4] We start by specifying the functions $t_{\mathcal{P}}$ and $c_{\mathcal{P}}$ . By the main result of [5], there is a function $q_{\mathcal{P}}:(0,1)\rightarrow\mathbb{N}$ such that for every $\varepsilon>0$ and for every (unweighted) graph $G$ which is $\varepsilon$ -far from $\mathcal{P}$ , a sample of $q_{\mathcal{P}}(\varepsilon)$ vertices from $G$ , taken uniformly at random and independently, induces a graph which does not satisfy $\mathcal{P}$ with probability at least $\frac{5}{6}$ . Now set $t_{\mathcal{P}}(\varepsilon):=q_{\mathcal{P}}(\varepsilon)$ and

[TABLE]

Our tester for $\mathcal{P}$ in the NHW model simply samples a sequence of $t=t_{\mathcal{P}}(\varepsilon)$ vertices of the input and accepts if and only if the subgraph induced by the sample satisfies $\mathcal{P}$ . Evidently, this tester accepts with probability $1$ if the input satisfies $\mathcal{P}$ . So to establish the correctness of our tester, it suffices to show that it rejects with probability at least $\frac{2}{3}$ if the input $(G,\mathcal{D})$ is $\varepsilon$ -far from $\mathcal{P}$ .

Let $\varepsilon>0$ and let $(G,\mathcal{D})$ be a vertex-weighted graph on $n$ vertices which is $\varepsilon$ -far from $\mathcal{P}$ , and in which all vertices have weight at most $c$ , where $c=c_{\mathcal{P}}(\varepsilon)$ . Write $V(G)=\{v_{1},\dots,v_{n}\}$ and fix an admissible $N$ , that is, a positive integer $N$ such that $\mathcal{D}(v_{i})\cdot N$ is an integer for every $1\leq i\leq n$ . Let $G^{\prime}$ be an arbitrary $(\mathcal{D},N)$ -blowup of $G$ , and denote the blowup-sets by $V_{1},\dots,V_{n}$ . By Lemma 5.5, $G^{\prime}$ is $\varepsilon$ -far from $\mathcal{P}$ . This implies that a random sequence $u_{1},\dots,u_{q}$ of $q=q_{\mathcal{P}}(\varepsilon)$ vertices of $G^{\prime}$ , sampled uniformly and independently, induces a graph which does not satisfy $\mathcal{P}$ with probability at least $\frac{5}{6}$ .

Let $\varphi:V(G^{\prime})\rightarrow V(G)$ be the map which maps all elements of $V_{i}$ to $v_{i}$ (for every $1\leq i\leq n$ ). Observe that for $u\in V(G^{\prime})$ sampled uniformly, the random vertex $\varphi(u)\in V(G)$ has the distribution $\mathcal{D}$ (because $|V_{i}|=\mathcal{D}(v_{i})\cdot N=\mathcal{D}(v_{i})\cdot|V(G^{\prime})|$ ). Furthermore, if a set $U\subseteq V(G^{\prime})$ satisfies $|V_{i}\cap U|\leq 1$ for every $1\leq i\leq n$ , then $G[\varphi(U)]$ is isomorphic to $G^{\prime}[U]$ . Let $u_{1},\dots,u_{q}$ be a random sequence of vertices of $G^{\prime}$ , sampled uniformly and independently, and set $U:=\{u_{1},\dots,u_{q}\}$ . Recall that $G^{\prime}[U]$ does not satisfy $\mathcal{P}$ with probability at least $\frac{5}{6}$ . Furthermore, the probability that $|V_{i}\cap U|\geq 2$ for some $1\leq i\leq n$ is at most

[TABLE]

We conclude that with probability at least $\frac{2}{3}$ , $G^{\prime}[U]$ does not satisfy $\mathcal{P}$ and $|V_{i}\cap U|\leq 1$ for every $1\leq i\leq n$ , implying that $G[\varphi(U)]$ does not satisfy $\mathcal{P}$ either. This completes the proof.
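The role of the weight cap can be illustrated with exact arithmetic. The Python sketch below computes the union bound $\binom{q}{2}\sum_{v}\mathcal{D}(v)^{2}\leq\binom{q}{2}\cdot\max_{v}\mathcal{D}(v)$ on the probability that two of $q$ independent samples land in the same blowup-set. The function names and the specific cap $1/(6\binom{q}{2})$ are our own assumptions, chosen so that the bound is at most $\frac{1}{6}$ ; they are not claimed to be the authors' definition of $c_{\mathcal{P}}(\varepsilon)$ .

```python
from fractions import Fraction
from math import comb

def repeat_bound(weights, q):
    """Union bound on Pr[two of q independent samples from D coincide]."""
    return comb(q, 2) * sum(w * w for w in weights)

def assumed_cap(q):
    """Hypothetical weight cap making the bound at most 1/6 (needs q >= 2)."""
    return Fraction(1, 6 * comb(q, 2))

q = 10
c = assumed_cap(q)
weights = [Fraction(1, 300)] * 300   # uniform distribution on 300 vertices
print(max(weights) <= c, repeat_bound(weights, q) <= Fraction(1, 6))
```

Since $\sum_{v}\mathcal{D}(v)^{2}\leq\max_{v}\mathcal{D}(v)$ for any distribution, a cap on the largest weight is enough to control the collision probability, whatever the number of vertices.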

It is natural to ask whether the function $c_{\mathcal{P}}(\varepsilon)$ from Proposition 5.4 needs to depend on $\varepsilon$ , namely whether the statement of Proposition 5.4 holds even if $c_{\mathcal{P}}$ is a constant function (depending only on $\mathcal{P}$ ). The proof of Proposition 5.2 shows, however, that this is not the case. In other words, allowing $c_{\mathcal{P}}(\varepsilon)$ to depend on $\varepsilon$ is unavoidable.

5.3 Testing in the VDF Model vs. Testing in the Standard Model

It is natural to ask about the relation between the sample complexity for testing a property in the VDF model and the sample complexity for testing it in the standard model. More specifically, it will be interesting to resolve the following:

Problem 5.6.

Is it true that every extendable hereditary property $\mathcal{P}$ can be tested in the VDF model with the same (or close to the same) sample complexity as in the (standard) dense graph model?

While at present we cannot answer this question, we can show that many natural properties $\mathcal{P}$ can be tested in the VDF model with (exactly) the same sample complexity as that of the (optimal) tester for $\mathcal{P}$ in the standard model, which works by sampling a certain number of vertices and accepting if and only if they induce a graph which satisfies $\mathcal{P}$ . This is explained in the following paragraph.

As mentioned in Subsection 5.2, the assumption made in Proposition 5.4 regarding the non-existence of high-weight vertices is needed in order to handle the possibility of having copies of some (forbidden) graph $F$ in $G^{\prime}$ which do not correspond to copies of $F$ in $G$ (where $G^{\prime}$ is some blowup of $G$ ). For some graph properties, however, such an assumption is not required, as we can make sure that every copy of a minimal forbidden graph in $G^{\prime}$ will correspond to such a copy in $G$ . To make this precise, we need the following definition. A family of graphs $\mathcal{F}$ is said to be blowup-avoidable if for every graph $G$ , say on $\{v_{1},\dots,v_{n}\}$ , and for every $n$ -tuple of integers $b_{1},\dots,b_{n}\geq 0$ , there is a $(b_{1},\dots,b_{n})$ -blowup $G^{\prime}$ of $G$ with blowup-sets $V_{1},\dots,V_{n}$ , such that there is no induced copy of any $F\in\mathcal{F}$ in $G^{\prime}$ which intersects some $V_{i}$ in at least $2$ vertices; in other words, for every $F\in\mathcal{F}$ , every induced copy of $F$ in $G^{\prime}$ corresponds to an induced copy of $F$ in $G$ . We say that a hereditary property $\mathcal{P}$ is blowup-avoidable if the family of minimal forbidden induced subgraphs for $\mathcal{P}$ is blowup-avoidable. We now prove the following proposition, which partially resolves Problem 5.6. The proof is similar to that of Proposition 5.4.

Proposition 5.7.

Let $\mathcal{P}$ be a hereditary graph property which is blowup-avoidable, and suppose that $\mathcal{P}$ admits a tester in the standard model, which works by sampling $q_{\mathcal{P}}(\varepsilon)$ vertices uniformly at random and independently, and accepting if and only if the subgraph induced by the sample satisfies $\mathcal{P}$ . Then $\mathcal{P}$ is testable in the VDF model by a tester having one-sided error and sample complexity $q_{\mathcal{P}}(\varepsilon)$ . (Footnote 25: Provided that the input distributions are only allowed to assign rational weights. If irrational weights are allowed, then the sample complexity (of the VDF tester for $\mathcal{P}$ ) should be slightly increased to (say) $q_{\mathcal{P}}(\varepsilon/2)$ , see Footnote 24.)

[Proof]Given an input $(G,\mathcal{D})$ , the required VDF tester for $\mathcal{P}$ samples (from $\mathcal{D}$ ) a sequence of $q_{\mathcal{P}}(\varepsilon)$ vertices, and accepts if and only if the subgraph induced by the sample satisfies $\mathcal{P}$ . Since $\mathcal{P}$ is hereditary, this tester accepts with probability $1$ if the input graph satisfies $\mathcal{P}$ . So it remains to show that if the input $(G,\mathcal{D})$ is $\varepsilon$ -far from $\mathcal{P}$ , then with probability at least $\frac{2}{3}$ , a sequence of $q_{\mathcal{P}}(\varepsilon)$ vertices of $G$ , sampled according to $\mathcal{D}$ and independently, induces a graph which does not satisfy $\mathcal{P}$ .

Let $\mathcal{F}=\mathcal{F}(\mathcal{P})$ be the family of minimal forbidden induced subgraphs for $\mathcal{P}$ . Let $(G,\mathcal{D})$ be a vertex-weighted graph on $n$ vertices, which is $\varepsilon$ -far from $\mathcal{P}$ . Write $V(G)=\{v_{1},\dots,v_{n}\}$ and fix an admissible $N$ , that is, a positive integer $N$ such that $\mathcal{D}(v_{i})\cdot N$ is an integer for every $1\leq i\leq n$ . As $\mathcal{P}$ is blowup-avoidable, there is a $(\mathcal{D},N)$ -blowup $G^{\prime}$ of $G$ with blowup-sets $V_{1},\dots,V_{n}$ , such that there is no induced copy of any $F\in\mathcal{F}$ in $G^{\prime}$ which intersects some $V_{i}$ in at least $2$ vertices. By Lemma 5.5, $G^{\prime}$ is $\varepsilon$ -far from $\mathcal{P}$ . So by our choice of $q_{\mathcal{P}}(\varepsilon)$ , with probability at least $\frac{2}{3}$ it holds that a sequence of $q_{\mathcal{P}}(\varepsilon)$ vertices of $G^{\prime}$ , sampled uniformly and independently, induces a graph which does not satisfy $\mathcal{P}$ , and hence contains an induced copy of some $F\in\mathcal{F}$ .

Let $\varphi:V(G^{\prime})\rightarrow V(G)$ be the map which maps all elements of $V_{i}$ to $v_{i}$ (for every $1\leq i\leq n$ ). Observe that for $u\in V(G^{\prime})$ sampled uniformly, the random vertex $\varphi(u)\in V(G)$ has the distribution $\mathcal{D}$ . Note that by our choice of $G^{\prime}$ , if $u_{1},\dots,u_{r}\in V(G^{\prime})$ span an induced copy of some $F\in\mathcal{F}$ (in the graph $G^{\prime}$ ), then $\varphi|_{\{u_{1},\dots,u_{r}\}}$ is injective (and hence an isomorphism), which implies that $\varphi(u_{1}),\dots,\varphi(u_{r})$ span an induced copy of $F$ in $G$ . It is now easy to see that a sequence of $q_{\mathcal{P}}(\varepsilon)$ vertices of $G$ , sampled from $\mathcal{D}$ and independently, induces a graph which does not satisfy $\mathcal{P}$ with probability at least $\frac{2}{3}$ , as required.

To demonstrate the usefulness of Proposition 5.7, observe that induced $F$ -freeness is blowup-avoidable for every $F\in\{P_{2},P_{3},C_{4}\}$ (here $P_{k}$ is the path with $k$ edges). Indeed, this is established by taking the blowup-sets (in the definition of blowup-avoidability) to be cliques. By combining Proposition 5.7 with known results for the standard model [5, 3, 16], we immediately get that induced $F$ -freeness is testable in the VDF model with sample complexity $\text{poly}(1/\varepsilon)$ if $F\in\{P_{2},P_{3}\}$ , and with sample complexity at most $2^{\text{poly}(1/\varepsilon)}$ if $F=C_{4}$ .
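The claim for $F=P_{2}$ can be checked by brute force on a small instance. The Python sketch below blows up a path with cliques as blowup-sets, lists all induced copies of $P_{2}$ (vertex triples with exactly two adjacent pairs), and checks that each copy meets three different blowup-sets. The helper names are illustrative, and a single example is of course a sanity check rather than a proof.

```python
from itertools import combinations

def clique_blowup(n, edges, sizes):
    """Blowup of a graph on 0..n-1 in which every blowup-set is a clique."""
    g_edges = {frozenset(e) for e in edges}
    part_of, vid = {}, 0
    for i, b in enumerate(sizes):
        for _ in range(b):
            part_of[vid] = i
            vid += 1
    adj = set()
    for x, y in combinations(part_of, 2):
        i, j = part_of[x], part_of[y]
        if i == j or frozenset((i, j)) in g_edges:
            adj.add(frozenset((x, y)))
    return part_of, adj

def induced_p2_copies(vertices, adj):
    """Vertex triples inducing a path with 2 edges (exactly 2 of the 3 pairs adjacent)."""
    return [
        t for t in combinations(vertices, 3)
        if sum(frozenset(p) in adj for p in combinations(t, 2)) == 2
    ]

# G = path 0 - 1 - 2 - 3, blown up with blowup-set sizes (2, 3, 1, 2).
part_of, adj = clique_blowup(4, [(0, 1), (1, 2), (2, 3)], [2, 3, 1, 2])
copies = induced_p2_copies(list(part_of), adj)
crossing = all(len({part_of[v] for v in t}) == 3 for t in copies)
print(len(copies), crossing)
```

The reason behind the check is that two vertices in the same clique blowup-set are adjacent and have the same neighbours outside it, so together with any third vertex they span either one or three edges, never two.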

We now describe another corollary of Proposition 5.7. We say that a graph property $\mathcal{P}$ is closed under blowups if for every graph $G$ satisfying $\mathcal{P}$ , every blowup of $G$ in which the blowup-sets are independent sets also satisfies $\mathcal{P}$ . We claim that if a hereditary property $\mathcal{P}$ is closed under blowups then it is also blowup-avoidable. To see this, let $\mathcal{F}$ be the set of minimal forbidden induced subgraphs for $\mathcal{P}$ , let $G$ be an $n$ -vertex graph, let $b_{1},\dots,b_{n}\geq 0$ be integers and let $G^{\prime}$ be the $(b_{1},\dots,b_{n})$ -blowup of $G$ in which the blowup-sets, $V_{1},\dots,V_{n}$ , are independent. Let $F\in\mathcal{F}$ and suppose that $G^{\prime}$ contains an induced copy of $F$ . If, by contradiction, this copy intersects some $V_{i}$ in more than one vertex, then $F$ is a blowup of some graph $F^{\prime}$ with $|V(F^{\prime})|<|V(F)|$ , where the blowup-sets are independent sets. Since $\mathcal{P}$ is closed under blowups and $F\notin\mathcal{P}$ , we must have $F^{\prime}\notin\mathcal{P}$ ; but this contradicts the fact that $F$ is a minimal forbidden induced subgraph for $\mathcal{P}$ .

So we see that the conclusion of Proposition 5.7 applies to hereditary properties which are closed under blowups. Some examples of such properties include $K_{t}$ -freeness; the property of having a homomorphism into a fixed graph $H$ (and in particular the property of being $k$ -colorable); and the property of being the blowup of a fixed graph $H$ (cf. [8]).

On the negative side, there are many natural hereditary properties which are extendable but not blowup-avoidable, such as the property of being $H$ -free for a graph $H$ which is not a clique and contains no isolated vertices. It would be interesting to resolve Problem 5.6 for these properties.

5.4 Which Properties are Testable in the Variations of the VDF Model?

It may be interesting to characterize the graph properties which are testable in each of the variations of the VDF model (defined at the beginning of Section 5).

Problem 5.8.

Which graph properties are testable in the “large inputs”/“size-aware”/NHW/NLW model?

While at the moment we are unable to resolve Problem 5.8, we can rule out some initial guesses. A first guess might be that only hereditary properties are testable in these models. This, however, turns out to be false; for example, connectivity and hamiltonicity are testable in each of these models, as implied by the following proposition.

Proposition 5.9.

Let $\mathcal{P}$ be a property such that for every $\varepsilon>0$ there is $M(\varepsilon)$ so that every vertex-weighted graph on at least $M(\varepsilon)$ vertices is $\varepsilon$ -close to $\mathcal{P}$ . Then $\mathcal{P}$ is testable in all four variations of the VDF model.

[Proof]The fact that $\mathcal{P}$ is testable in the “large inputs” (resp. NHW) model is trivial; indeed, by choosing $M_{\mathcal{P}}(\varepsilon):=M(\varepsilon)$ (resp. $c_{\mathcal{P}}(\varepsilon):=1/M(\varepsilon)$ ) we can make sure that every input graph will be $\varepsilon$ -close to $\mathcal{P}$ , so a tester that simply accepts without making any queries is a valid tester for $\mathcal{P}$ .

Let us now consider the NLW model. Given $\varepsilon,\delta>0$ and an input graph $(G,\mathcal{D})$ with all vertex-weights at least $\frac{\delta}{|V(G)|}$ , our tester for $\mathcal{P}$ works as follows: setting $M:=M(\varepsilon)$ , the tester samples $CM\log(M)/\delta$ vertices according to $\mathcal{D}$ and independently (where $C$ is some large constant); if the number of distinct vertices in the sample is at least $M$ then the tester accepts (without making any queries), and otherwise the tester accepts if and only if the subgraph induced by the sample satisfies $\mathcal{P}$ . To see that this is a valid tester, observe that if $G$ has less than $M$ vertices then w.h.p. the tester samples all the vertices, and if $G$ has at least $M$ vertices then w.h.p. there are at least $M$ distinct vertices in the sample. This can be argued similarly as in the proof of Proposition 5.3, using that all vertices have weight at least $\frac{\delta}{|V(G)|}$ ; we omit the details.

Finally, let us prove that $\mathcal{P}$ is testable in the “size-aware” model. On input $\varepsilon>0$ and $(G,\mathcal{D})$ , our tester for $\mathcal{P}$ (in the “size-aware” model) does the following: if $|V(G)|\geq M(\varepsilon)$ then the tester accepts without making any queries, and if $|V(G)|<M(\varepsilon)$ then the tester samples $t:=M\log(3M)/\varepsilon$ vertices $u_{1},\dots,u_{t}\in V(G)$ according to the distribution $\mathcal{D}$ and independently, where $M=M(\varepsilon)$ , and accepts if and only if there is a graph on $|V(G)|$ vertices which satisfies $\mathcal{P}$ and contains $G[\{u_{1},\dots,u_{t}\}]$ as an induced subgraph. The proof of correctness for this tester is similar to the proof of Theorem 7, and we leave the details to the reader.

In order to apply Proposition 5.9 to the properties of connectivity and hamiltonicity, we observe that any vertex-weighted graph $(G,\mathcal{D})$ with $|V(G)|\geq 1/\varepsilon$ is $\varepsilon$ -close to being hamiltonian (and hence also connected). To see that this holds, take a random (cyclic) ordering $v_{1},\dots,v_{n}$ of the vertices of $G$ , and observe that for every pair of distinct $u,w\in V(G)$ , the probability that there is $1\leq i\leq n$ such that $\{u,w\}=\{v_{i},v_{i+1}\}$ is $n/\binom{n}{2}=\frac{2}{n-1}$ . This implies that the expected value of $\sum_{i=1}^{n}{\mathcal{D}(v_{i})\mathcal{D}(v_{i+1})}$ is $\frac{2}{n-1}\cdot\sum_{u,w\in V(G)}{\mathcal{D}(u)\mathcal{D}(w)}=\frac{2}{n-1}\cdot\frac{1}{2}\cdot\left(1-\sum_{v\in V(G)}{\mathcal{D}(v)^{2}}\right)\leq\frac{1}{n-1}\cdot\left(1-\frac{1}{n}\right)=\frac{1}{n}$ , where the last inequality follows from Cauchy-Schwarz (and the first sum is over unordered pairs $\{u,w\}$ ). This means that we can create a hamilton cycle by adding edges of total weight at most $\frac{1}{n}\leq\varepsilon$ . 
Let us also note that for connectivity there is a simpler argument: if $(G,\mathcal{D})$ is a vertex-weighted graph with $|V(G)|\geq 1/\varepsilon$ , then there is $v\in V(G)$ with $\mathcal{D}(v)\leq\varepsilon$ , and we can make $G$ connected by connecting $v$ to all other vertices.
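The two estimates above can be checked exactly on a small example. The following sketch averages $\sum_{i}\mathcal{D}(v_{i})\mathcal{D}(v_{i+1})$ over all orderings of a 5-vertex weight vector (the particular weights are an arbitrary illustrative choice, not taken from the paper), compares the result with $\frac{1}{n-1}\left(1-\sum_{v}\mathcal{D}(v)^{2}\right)$, and also computes the cost of joining a lightest vertex to all others.

```python
from fractions import Fraction
from itertools import permutations


def expected_cycle_weight(weights):
    """Average of sum_i D(v_i) D(v_{i+1}) over all orderings (indices mod n)."""
    n = len(weights)
    total, count = Fraction(0), 0
    for order in permutations(range(n)):
        total += sum(weights[order[i]] * weights[order[(i + 1) % n]] for i in range(n))
        count += 1
    return total / count


def closed_form(weights):
    """(1/(n-1)) * (1 - sum_v D(v)^2), the value computed in the text."""
    n = len(weights)
    return Fraction(1, n - 1) * (1 - sum(w * w for w in weights))


def star_cost(weights):
    """Total weight of all pairs {v, u}, where v is a lightest vertex."""
    lightest = min(weights)
    return lightest * (1 - lightest)


# Arbitrary example distribution on n = 5 vertices (weights sum to 1).
D = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 8), Fraction(1, 16), Fraction(1, 16)]
n = len(D)
print(expected_cycle_weight(D), closed_form(D), Fraction(1, n), star_cost(D))
```

Since the average over orderings is at most $\frac{1}{n}$, some ordering has cycle weight at most $\frac{1}{n}$, which is the ordering used to add the missing hamilton-cycle edges. The enumeration is over all $n!$ orderings, so it is only practical for very small $n$.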

Note that in some of the restricted models (e.g. the NLW model), the tester given by (the proof of) Proposition 5.9 has 2-sided error. It is also not hard to see that the NLW model admits no 1-sided-error tester for, e.g., connectivity. This shows that (some of) the restricted models allow for properties which are testable with 2-sided error but not with 1-sided error (unlike the “ordinary” VDF model, where we know that every testable property can be tested with $1$ -sided error, as follows from Theorems 1 and 4; see also [19, Theorem 2.3]).

Another natural guess regarding the answer to Problem 5.8 would be that every property which is testable in the standard model is also testable in the restricted models (see [2] for a characterization of the properties testable in the standard model). This guess is ruled out by the following proposition, which describes a property which is testable in the standard model but not in the restricted models.

Proposition 5.10.

The property $\mathcal{P}$ of having edge-density at most $\frac{1}{4}$ is not testable in any of the four variants of the VDF model. (Footnote 26: The edge-density of a (possibly vertex-weighted) graph $G$ is defined as $2e(G)/|V(G)|^{2}$; in other words, the density is defined with respect to the uniform distribution on $V(G)$, and not with respect to the given distribution $\mathcal{D}$.)

[Proof] Let $G_{1}$ be the $n$-vertex graph consisting of a clique of size $\frac{n}{2}$ and $\frac{n}{2}$ isolated vertices, and let $\mathcal{D}_{1}$ be the uniform distribution on $V(G_{1})$. Let $G_{2}$ be the $n$-vertex graph consisting of a clique $X$ of size $\frac{3n}{4}$ and $\frac{n}{4}$ isolated vertices, and let $\mathcal{D}_{2}$ be the distribution on $V(G_{2})$ that assigns weight $\frac{2}{3n}$ to every vertex of $X$, and weight $\frac{2}{n}$ to every vertex of $V(G_{2})\setminus X$. Note that $(G_{1},\mathcal{D}_{1})$ and $(G_{2},\mathcal{D}_{2})$ are valid inputs in each of the variants of the VDF model (provided that $n$ is large enough), and that $G_{1}$ satisfies $\mathcal{P}$ while $(G_{2},\mathcal{D}_{2})$ is $\Omega(1)$-far from $\mathcal{P}$. On the other hand, we now show that for every $q$, a sample of $q$ vertices from $(G_{1},\mathcal{D}_{1})$ is indistinguishable from a sample of $q$ vertices from $(G_{2},\mathcal{D}_{2})$ (provided that $n$ is large enough with respect to $q$). To this end, let $U_{i}$ be a set of $q$ random vertices of $G_{i}$ sampled independently according to $\mathcal{D}_{i}$ (for $i=1,2$). Then for each $i=1,2$, the graph $G_{i}[U_{i}]$ consists of a clique and some isolated vertices. Letting $X_{i}$ be the clique in $G_{i}[U_{i}]$, we have

[TABLE]

and

[TABLE]

where in both cases, the additive term $o(1)$ accounts for the event that some vertex has been sampled more than once. So we see that $\left|\mathbb{P}[|X_{1}|=k]-\mathbb{P}[|X_{2}|=k]\right|=o(1)$. This implies that the total variation distance between the distribution of $G_{1}[U_{1}]$ and the distribution of $G_{2}[U_{2}]$ is $o(1)$. It follows that $\mathcal{P}$ is not testable in any of the four variants of the VDF model (note that knowing $n$ does not help to distinguish between $(G_{1},\mathcal{D}_{1})$ and $(G_{2},\mathcal{D}_{2})$, since these graphs have the same number of vertices).
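The arithmetic behind this construction can be verified directly. The sketch below (with the illustrative choice $n=400$, which is not from the paper) computes the edge-density $2e(G)/|V(G)|^{2}$ of both graphs and the $\mathcal{D}_{i}$-mass of each clique. Since both cliques have mass exactly $\frac{1}{2}$, each sampled vertex lands in the clique with probability $\frac{1}{2}$ in either instance, so conditioned on the $q$ samples being distinct, $|X_{i}|$ follows the same $\mathrm{Binomial}(q,\frac{1}{2})$ law for $i=1,2$.

```python
from fractions import Fraction
from math import comb


def instance(n, clique_size, clique_weight, other_weight):
    """Return (edge density, total weight, D-mass of the clique)."""
    others = n - clique_size
    density = Fraction(2 * comb(clique_size, 2), n * n)   # 2e(G)/|V(G)|^2
    total = clique_size * clique_weight + others * other_weight
    clique_mass = clique_size * clique_weight
    return density, total, clique_mass


def clique_size_law(q, p):
    """Binomial(q, p) probabilities: the law of |X_i| when the q samples are distinct."""
    return [comb(q, k) * p**k * (1 - p) ** (q - k) for k in range(q + 1)]


n = 400  # illustrative choice; any multiple of 4 works
g1 = instance(n, n // 2, Fraction(1, n), Fraction(1, n))           # (G_1, D_1)
g2 = instance(n, 3 * n // 4, Fraction(2, 3 * n), Fraction(2, n))   # (G_2, D_2)
print("densities:", g1[0], g2[0])
print("clique masses:", g1[2], g2[2])
```

The densities fall on opposite sides of $\frac{1}{4}$ while the clique masses coincide, which is exactly the gap the proof exploits: the sample statistics agree although the uniform-measure density does not.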

The proof of Proposition 5.10 can be adapted to show that other properties are also not testable in any of the variants of the VDF model. These properties include the property of having a cut with at least $\alpha n^{2}$ edges (for $0<\alpha<\frac{1}{4}$) and the property of containing a clique with at least $\alpha n$ vertices (for $0<\alpha<1$).

Acknowledgements

We are grateful to an anonymous referee for spotting a gap in the proof of Theorem 1 in a preliminary version of the paper.

6 Proof of Lemmas 2.5 and 2.6

Here we prove Lemmas 2.5 and 2.6. We start by extending some basic results about regular partitions to the vertex-weighted setting.

Lemma 6.1.

Let $X,Y$ be disjoint vertex-sets in a vertex-weighted graph $(G,\mathcal{D})$ , and let $\mathcal{P}_{X},\mathcal{P}_{Y}$ be partitions of $X,Y$ , respectively. Then

[TABLE]

and

[TABLE]

[Proof] We start with the first part of the lemma.

[TABLE]

To prove the second part, we set $\varepsilon(X^{\prime},Y^{\prime})=d(X^{\prime},Y^{\prime})-d(X,Y)$ for each $X^{\prime}\in\mathcal{P}_{X}$ , $Y^{\prime}\in\mathcal{P}_{Y}$ . Now,

[TABLE]

where in the last equality we used the first part of the lemma.

Let $(G,\mathcal{D})$ be a vertex-weighted graph, and let $\mathcal{P}=\{P_{1},\dots,P_{r}\}$ be a partition of $V(G)$. The index of $\mathcal{P}$, denoted $q(\mathcal{P})$, is defined as

[TABLE]
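The displayed statements of Lemma 6.1 are not reproduced above, so the following is only a hedged numerical illustration of the two facts about partitions that are invoked later, in the proof of Lemma 6.3: an averaging identity for densities and the inequality $\sum_{X^{\prime},Y^{\prime}}\mathcal{D}(X^{\prime})\mathcal{D}(Y^{\prime})d^{2}(X^{\prime},Y^{\prime})\geq\mathcal{D}(X)\mathcal{D}(Y)d^{2}(X,Y)$. The sketch assumes that the weighted density $d(X,Y)$ is the $\mathcal{D}$-weight of the edges between $X$ and $Y$ divided by $\mathcal{D}(X)\mathcal{D}(Y)$; this definition, the random test graph, and the chosen partitions are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import product
import random


def mass(D, S):
    """D(S): total weight of the vertex set S."""
    return sum(D[v] for v in S)


def density(D, edges, X, Y):
    """Assumed weighted density: sum of D(x)D(y) over edges xy, divided by D(X)D(Y)."""
    w = sum(D[x] * D[y] for x, y in product(X, Y) if (x, y) in edges)
    return w / (mass(D, X) * mass(D, Y))


rng = random.Random(1)
X, Y = list(range(0, 6)), list(range(6, 12))
raw = [rng.randint(1, 9) for _ in range(12)]
D = [Fraction(r, sum(raw)) for r in raw]                 # positive weights summing to 1
edges = {(x, y) for x in X for y in Y if rng.random() < 0.5}
PX, PY = [X[:2], X[2:]], [Y[:3], Y[3:5], Y[5:]]          # arbitrary partitions of X and Y

# Weighted sums of d and d^2 over the parts, and the corresponding values for (X, Y).
lhs1 = sum(mass(D, A) * mass(D, B) * density(D, edges, A, B) for A in PX for B in PY)
lhs2 = sum(mass(D, A) * mass(D, B) * density(D, edges, A, B) ** 2 for A in PX for B in PY)
base = mass(D, X) * mass(D, Y)
d = density(D, edges, X, Y)
print(lhs1 == base * d, lhs2 >= base * d**2)
```

Under the assumed definition, the first comparison is an identity (both sides count the same weighted edges), and the second is an instance of convexity of $x\mapsto x^{2}$; exact rational arithmetic avoids any floating-point doubt.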

Lemma 6.2.

For every vertex-partition $\mathcal{P}$ of a vertex-weighted graph $(G,\mathcal{D})$ , and for every refinement $\mathcal{P}^{\prime}$ of $\mathcal{P}$ , we have $q(\mathcal{P}^{\prime})\geq q(\mathcal{P})$ .

[Proof] Write $\mathcal{P}=\{P_{1},\dots,P_{r}\}$, and for each $1\leq i\leq r$ put $\mathcal{P}^{\prime}_{i}=\{P^{\prime}\in\mathcal{P}^{\prime}:P^{\prime}\subseteq P_{i}\}$. Then

[TABLE]

where in the second inequality we used the second part of Lemma 6.1.

Lemma 6.3.

Let $(G,\mathcal{D})$ be a vertex-weighted graph and let $\mathcal{P}=\{P_{1},\dots,P_{r}\}$ be a non- $\varepsilon$ -regular partition of $V(G)$ . Then there is a refinement $\mathcal{P}^{\prime}$ of $\mathcal{P}$ such that $|\mathcal{P}^{\prime}|\leq|\mathcal{P}|\cdot 2^{|\mathcal{P}|}$ and $q(\mathcal{P}^{\prime})\geq q(\mathcal{P})+\varepsilon^{5}$ .

[Proof] For each $1\leq i<j\leq r$ for which $(P_{i},P_{j})$ is not $\varepsilon$-regular, let $P_{i,j}\subseteq P_{i}$, $P_{j,i}\subseteq P_{j}$ be such that $\mathcal{D}(P_{i,j})\geq\varepsilon\mathcal{D}(P_{i}),\mathcal{D}(P_{j,i})\geq\varepsilon\mathcal{D}(P_{j})$, and $|d(P_{i,j},P_{j,i})-d(P_{i},P_{j})|>\varepsilon$. For each $1\leq i\leq r$, let $\mathcal{P}_{i}$ be the partition of $P_{i}$ formed by taking the common refinement of the partitions $\{P_{i,j},P_{i}\setminus P_{i,j}\}$, where $j$ runs over all indices for which $(P_{i},P_{j})$ is not $\varepsilon$-regular. Let $\mathcal{P}^{\prime}=\bigcup_{i=1}^{r}{\mathcal{P}_{i}}$ be the resulting refinement of $\mathcal{P}$. Then clearly $|\mathcal{P}^{\prime}|\leq|\mathcal{P}|\cdot 2^{|\mathcal{P}|}$. We now show that $q(\mathcal{P}^{\prime})\geq q(\mathcal{P})+\varepsilon^{5}$. First, observe that by Lemma 6.1, for every $1\leq i<j\leq r$ we have $\sum_{X^{\prime}\in\mathcal{P}_{i},Y^{\prime}\in\mathcal{P}_{j}}{\mathcal{D}(X^{\prime})\mathcal{D}(Y^{\prime})\cdot d^{2}(X^{\prime},Y^{\prime})}\geq\mathcal{D}(P_{i})\mathcal{D}(P_{j})\cdot d^{2}(P_{i},P_{j}).$ Next, fix any pair $1\leq i<j\leq r$ for which $(P_{i},P_{j})$ is not $\varepsilon$-regular. By Lemma 6.1 we have

[TABLE]

where in the penultimate inequality we used the first part of Lemma 6.1 to infer that

[TABLE]

Denoting by $\mathcal{N}$ the set of pairs $1\leq i<j\leq r$ for which $(P_{i},P_{j})$ is not $\varepsilon$ -regular, we see that

[TABLE]

where in the last inequality we used the assumption that $\mathcal{P}$ is not $\varepsilon$-regular.

[Proof of Lemma 2.5] For $i\geq 0$, if $\mathcal{P}_{i}$ is not $\varepsilon$-regular then we apply Lemma 6.3 to obtain a partition $\mathcal{P}_{i+1}$ which refines $\mathcal{P}_{i}$ and satisfies $|\mathcal{P}_{i+1}|\leq|\mathcal{P}_{i}|\cdot 2^{|\mathcal{P}_{i}|}$ and $q(\mathcal{P}_{i+1})\geq q(\mathcal{P}_{i})+\varepsilon^{5}$. Since the index of any partition is at most $1$, this process must end after at most $\varepsilon^{-5}$ steps. When the process ends, we have an $\varepsilon$-regular partition. Since the number of steps depends only on $\varepsilon$, the size of the resulting final partition can be upper-bounded by a function of $\varepsilon$ and $|\mathcal{P}_{0}|$, as required.

[Proof of Lemma 2.6] We may assume, without loss of generality, that $\mathcal{E}$ is monotone decreasing. Let $\mathcal{P}_{1}$ be the partition obtained by applying Lemma 2.5 with parameter $\varepsilon=\mathcal{E}(0)$ and with the partition $\mathcal{P}_{0}$. Next, for each $i\geq 1$, apply Lemma 2.5 with parameter $\mathcal{E}(|\mathcal{P}_{i}|)$ and with the partition $\mathcal{P}_{i}$ to obtain a partition $\mathcal{P}_{i+1}$ which is $\mathcal{E}(|\mathcal{P}_{i}|)$-regular and refines $\mathcal{P}_{i}$. In light of Lemma 6.2, and as the index of any partition is at most $1$, there must be some $1\leq i\leq\frac{1}{\mathcal{E}^{2}(0)}$ for which $q(\mathcal{P}_{i+1})\leq q(\mathcal{P}_{i})+\mathcal{E}^{2}(0)$. For such an $i$, set $\mathcal{P}=\mathcal{P}_{i}$ and $\mathcal{Q}=\mathcal{P}_{i+1}$. Since $|\mathcal{P}_{0}|\leq m$ and the number of steps in the process is at most $\frac{1}{\mathcal{E}^{2}(0)}$, and since the size of the partition guaranteed by Lemma 2.5 can be bounded from above by a function of the parameters of this lemma (which in our case depend only on $\mathcal{E}$ and $m$), we see that $|\mathcal{Q}|$ too can be bounded from above by a function of $\mathcal{E}$ and $m$. This proves Item 1.

Item 2 is immediate from our choice of $\mathcal{Q}$ . It remains to prove Item 3. By the definition of the index and by our choice of $\mathcal{P}$ and $\mathcal{Q}$ , we have

[TABLE]

where in the first equality we used the second part of Lemma 6.1. The above implies that

[TABLE]

and hence

[TABLE]

where the first inequality follows from Cauchy-Schwarz. This completes the proof.
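The two counting steps in the proofs of Lemmas 2.5 and 2.6 can be written out as follows. This is a sketch rather than the authors' formulation: the function $f$ and the integer $k$ are notation introduced here for illustration, and the index is assumed to take values in $[0,1]$.

```latex
% Lemma 2.5: each application of Lemma 6.3 raises the index by at least
% \varepsilon^{5}, and the index never exceeds 1, so there are at most
% \varepsilon^{-5} refinement steps. With
\[
  f(0) = |\mathcal{P}_{0}|, \qquad f(i+1) = f(i)\cdot 2^{f(i)},
\]
% the final \varepsilon-regular partition has at most
\[
  f\bigl(\lfloor \varepsilon^{-5} \rfloor\bigr)
\]
% parts, a quantity depending only on \varepsilon and |\mathcal{P}_{0}|.

% Lemma 2.6: by Lemma 6.2 the index is non-decreasing along the sequence
% \mathcal{P}_{1}, \mathcal{P}_{2}, \dots, so for every integer k \geq 1,
\[
  \sum_{i=1}^{k} \bigl( q(\mathcal{P}_{i+1}) - q(\mathcal{P}_{i}) \bigr)
  = q(\mathcal{P}_{k+1}) - q(\mathcal{P}_{1}) \leq 1 ,
\]
% hence some 1 \leq i \leq k satisfies
\[
  q(\mathcal{P}_{i+1}) - q(\mathcal{P}_{i}) \leq \frac{1}{k} ,
\]
% which is at most \mathcal{E}^{2}(0) as soon as k \geq 1/\mathcal{E}^{2}(0).
```

The recursion for $f$ grows as a tower of exponentials in $\varepsilon^{-5}$, which is the usual price of regularity-type arguments.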

Bibliography


[1] N. Alon, E. Fischer, M. Krivelevich and M. Szegedy, Efficient testing of large graphs. Combinatorica 20 (2000), 451–476.
[2] N. Alon, E. Fischer, I. Newman and A. Shapira, A combinatorial characterization of the testable graph properties: it’s all about regularity. SIAM Journal on Computing, 39(1) (2009), 143–167.
[3] N. Alon and J. Fox, Easily testable graph properties, Combin. Probab. Comput. 24 (2015), 646–657.
[4] N. Alon and A. Shapira, A characterization of easily testable induced subgraphs. Combinatorics, Probability and Computing 15 (2006), 791–805.
[5] N. Alon and A. Shapira, A characterization of the (natural) graph properties testable with one-sided error. SIAM Journal on Computing 37 (2008), 1703–1727.
[6] N. Alon and A. Shapira, Every monotone graph property is testable. SIAM Journal on Computing, 38(2) (2008), 505–522.
[7] T. Austin and T. Tao, On the testability and repair of hereditary hypergraph properties, Random Structures and Algorithms 36 (2010), 373–463.
[8] L. Avigad and O. Goldreich, Testing graph blow-up. In Studies in Complexity and Cryptography, Miscellanea on the Interplay between Randomness and Computation (2011), pp. 156–172. Springer, Berlin, Heidelberg.


Footnote 1 (on the title): A preliminary version of this paper has appeared in the Proceedings of STOC ’19.
