On the Duffin-Schaeffer conjecture

Dimitris Koukoulopoulos; James Maynard

arXiv:1907.04593·math.NT·May 5, 2020

On the Duffin-Schaeffer conjecture

Dimitris Koukoulopoulos, James Maynard

PDF

TL;DR

This paper proves that the set of real numbers approximable by fractions with a certain error rate has full measure under a divergence condition, resolving a longstanding conjecture and refining classical approximation theorems.

Contribution

It establishes the Duffin-Schaeffer conjecture for Lebesgue measure and confirms Catlin's conjecture on non-reduced solutions, advancing metric number theory.

Findings

01

Proves the Duffin-Schaeffer conjecture for full measure.

02

Confirms Catlin's conjecture on non-reduced solutions.

03

Refines Khinchin's Theorem with new approximation conditions.

Abstract

Let $ψ : N \to R_{\geq 0}$ be an arbitrary function from the positive integers to the non-negative reals. Consider the set $A$ of real numbers $α$ for which there are infinitely many reduced fractions $a / q$ such that $∣ α - a / q ∣ \leq ψ (q) / q$ . If $\sum_{q = 1}^{\infty} ψ (q) ϕ (q) / q = \infty$ , we show that $A$ has full Lebesgue measure. This answers a question of Duffin and Schaeffer. As a corollary, we also establish a conjecture due to Catlin regarding non-reduced solutions to the inequality $∣ α - a / q ∣ \leq ψ (q) / q$ , giving a refinement of Khinchin's Theorem.

Equations612

\bigg{|}\alpha-\frac{a}{q}\bigg{|}\leqslant\frac{\psi(q)}{q}.

\bigg{|}\alpha-\frac{a}{q}\bigg{|}\leqslant\frac{\psi(q)}{q}.

\begin{split}\mathcal{K}_{q}=[0,1]\cap\bigcup_{a=0}^{q}\Big{[}\frac{a-\psi(q)}{q},\frac{a+\psi(q)}{q}\Big{]},\end{split}

\begin{split}\mathcal{K}_{q}=[0,1]\cap\bigcup_{a=0}^{q}\Big{[}\frac{a-\psi(q)}{q},\frac{a+\psi(q)}{q}\Big{]},\end{split}

K = q \to \infty lim sup K_{q} .

K = q \to \infty lim sup K_{q} .

min {ψ (q), 1/2} ⩽ λ (K_{q}) ⩽ 2 min {ψ (q), 1/2} .

min {ψ (q), 1/2} ⩽ λ (K_{q}) ⩽ 2 min {ψ (q), 1/2} .

\mathcal{A}_{q}:=[0,1]\cap\bigcup_{\begin{subarray}{c}1\leqslant a\leqslant q\\ \gcd(a,q)=1\end{subarray}}\Big{[}\frac{a-\psi(q)}{q},\frac{a+\psi(q)}{q}\Big{]}.

\mathcal{A}_{q}:=[0,1]\cap\bigcup_{\begin{subarray}{c}1\leqslant a\leqslant q\\ \gcd(a,q)=1\end{subarray}}\Big{[}\frac{a-\psi(q)}{q},\frac{a+\psi(q)}{q}\Big{]}.

A := q \to \infty lim sup A_{q} .

A := q \to \infty lim sup A_{q} .

q = 1 \sum \infty \frac{φ ( q ) ψ ( q )}{q} < \infty ⟹ λ (A) = 0.

q = 1 \sum \infty \frac{φ ( q ) ψ ( q )}{q} < \infty ⟹ λ (A) = 0.

q = 1 \sum \infty \frac{φ ( q ) ψ ( q )}{q} = \infty ⟹ λ (A) = 1.

q = 1 \sum \infty \frac{φ ( q ) ψ ( q )}{q} = \infty ⟹ λ (A) = 1.

q = 1 \sum \infty \frac{ψ ( q ) φ ( q )}{q} = \infty.

q = 1 \sum \infty \frac{ψ ( q ) φ ( q )}{q} = \infty.

\begin{split}\bigg{|}\alpha-\frac{a}{q}\bigg{|}\leqslant\frac{\psi(q)}{q}\end{split}

\begin{split}\bigg{|}\alpha-\frac{a}{q}\bigg{|}\leqslant\frac{\psi(q)}{q}\end{split}

ψ^{*} (q) := φ (q) sup {ψ (n) / n : n \in N, q ∣ n} .

ψ^{*} (q) := φ (q) sup {ψ (n) / n : n \in N, q ∣ n} .

Q \to \infty lim sup \frac{\sum _{q ⩽ Q} ψ ( q ) φ ( q ) / q}{\sum _{q ⩽ Q} ψ ( q )} > 0.

Q \to \infty lim sup \frac{\sum _{q ⩽ Q} ψ ( q ) φ ( q ) / q}{\sum _{q ⩽ Q} ψ ( q )} > 0.

q = 1 \sum \infty \frac{φ ( q ) ψ ( q )}{q ( lo g q ) ^{ε}} = \infty ⟹ λ (A) = 1,

q = 1 \sum \infty \frac{φ ( q ) ψ ( q )}{q ( lo g q ) ^{ε}} = \infty ⟹ λ (A) = 1,

2^{2^{j}} < q ⩽ 2^{2^{j + 1}} \sum \frac{ψ ( q ) φ ( q )}{q} = O (1/ j) for all j ⩾ 1,

2^{2^{j}} < q ⩽ 2^{2^{j + 1}} \sum \frac{ψ ( q ) φ ( q )}{q} = O (1/ j) for all j ⩾ 1,

s=\inf\Bigl{\{}\beta\in\mathbb{R}_{\geqslant 0}:\,\sum_{q=1}^{\infty}\varphi(q)(\psi(q)/q)^{\beta}<\infty\Bigl{\}}.

s=\inf\Bigl{\{}\beta\in\mathbb{R}_{\geqslant 0}:\,\sum_{q=1}^{\infty}\varphi(q)(\psi(q)/q)^{\beta}<\infty\Bigl{\}}.

dim_{H} (A) = min (s, 1) .

dim_{H} (A) = min (s, 1) .

S = q = 1 \sum \infty φ (q) n \in N q ∣ n sup \frac{ψ ( n )}{n} .

S = q = 1 \sum \infty φ (q) n \in N q ∣ n sup \frac{ψ ( n )}{n} .

n \in N d ∣ n sup \frac{ψ ( n )}{n} ⩾ \frac{ψ ( q _{i} )}{q _{i}} ⩾ \frac{1}{2 q _{i}} .

n \in N d ∣ n sup \frac{ψ ( n )}{n} ⩾ \frac{ψ ( q _{i} )}{q _{i}} ⩾ \frac{1}{2 q _{i}} .

q_{i - 1} < q ⩽ q_{i} \sum φ (q) n \in N q ∣ n sup \frac{ψ ( n )}{n} ⩾ q_{i - 1} < q ⩽ q_{i} q ∣ q_{i} \sum \frac{φ ( q )}{2 q _{i}} ⩾ \frac{1}{2 q _{i}} q ∣ q_{i} \sum φ (q) - \frac{1}{2 q _{i}} q ⩽ q_{i - 1} \sum φ (q) ⩾ \frac{1}{4},

q_{i - 1} < q ⩽ q_{i} \sum φ (q) n \in N q ∣ n sup \frac{ψ ( n )}{n} ⩾ q_{i - 1} < q ⩽ q_{i} q ∣ q_{i} \sum \frac{φ ( q )}{2 q _{i}} ⩾ \frac{1}{2 q _{i}} q ∣ q_{i} \sum φ (q) - \frac{1}{2 q _{i}} q ⩽ q_{i - 1} \sum φ (q) ⩾ \frac{1}{4},

\frac{ξ ( q )}{q} = n \in N q ∣ n max \frac{ψ ( n )}{n}

\frac{ξ ( q )}{q} = n \in N q ∣ n max \frac{ψ ( n )}{n}

\mathcal{C}_{q}=[0,1]\cap\bigcup_{\begin{subarray}{c}1\leqslant a\leqslant q\\ \gcd(a,q)=1\end{subarray}}\Big{[}\frac{a-\xi(q)}{q},\frac{a+\xi(q)}{q}\Big{]}\qquad\text{and}\qquad\mathcal{C}:=\limsup_{q\to\infty}\mathcal{C}_{q}.

\mathcal{C}_{q}=[0,1]\cap\bigcup_{\begin{subarray}{c}1\leqslant a\leqslant q\\ \gcd(a,q)=1\end{subarray}}\Big{[}\frac{a-\xi(q)}{q},\frac{a+\xi(q)}{q}\Big{]}\qquad\text{and}\qquad\mathcal{C}:=\limsup_{q\to\infty}\mathcal{C}_{q}.

C ∖ Q = K ∖ Q .

C ∖ Q = K ∖ Q .

q \in [x_{i}, 2 x_{i}] \sum \frac{φ ( q )}{q} ψ (q) \in [1, 2] .

q \in [x_{i}, 2 x_{i}] \sum \frac{φ ( q )}{q} ψ (q) \in [1, 2] .

q, r \in S g c d (q, r) ⩽ M (q, r) \sum \frac{φ ( q )}{q} \cdot \frac{φ ( r )}{r} \cdot P (q, r) ≪ x^{2 c},

q, r \in S g c d (q, r) ⩽ M (q, r) \sum \frac{φ ( q )}{q} \cdot \frac{φ ( r )}{r} \cdot P (q, r) ≪ x^{2 c},

S

S

M (q, r)

P (q, r)

q \in S \sum \frac{φ ( q )}{q} ≍ x^{c} q \in S \sum \frac{φ ( q )}{q} ψ (q) ≍ x^{c},

q \in S \sum \frac{φ ( q )}{q} ≍ x^{c} q \in S \sum \frac{φ ( q )}{q} ψ (q) ≍ x^{c},

p ∣ q r / g c d (q, r)^{2} p ⩾ t \sum \frac{1}{p} \approx 1

p ∣ q r / g c d (q, r)^{2} p ⩾ t \sum \frac{1}{p} \approx 1

q, r \in S g c d (q, r) ⩾ x^{1 - c} / t \sum \frac{φ ( q )}{q} \cdot \frac{φ ( r )}{r} ≪ \frac{x ^{2 c}}{t} .

q, r \in S g c d (q, r) ⩾ x^{1 - c} / t \sum \frac{φ ( q )}{q} \cdot \frac{φ ( r )}{r} ≪ \frac{x ^{2 c}}{t} .

δ_{j} = \frac{# {( v , w ) \in V _{j} \times W _{j} : g cd ( v , w ) > x ^{1 - c} / t }}{# V _{j} \cdot # W _{j}},

δ_{j} = \frac{# {( v , w ) \in V _{j} \times W _{j} : g cd ( v , w ) > x ^{1 - c} / t }}{# V _{j} \cdot # W _{j}},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

On the Duffin-Schaeffer conjecture

Dimitris Koukoulopoulos

Département de mathématiques et de statistique

Université de Montréal

CP 6128 succ. Centre-Ville

Montréal, QC H3C 3J7

Canada

[email protected]

and

James Maynard

Mathematical Institute, Radcliffe Observatory quarter, Woodstock Road, Oxford OX2 6GG, England

[email protected]

Abstract.

Let $\psi:\mathbb{N}\to\mathbb{R}_{\geqslant 0}$ be an arbitrary function from the positive integers to the non-negative reals. Consider the set $\mathcal{A}$ of real numbers $\alpha$ for which there are infinitely many reduced fractions $a/q$ such that $|\alpha-a/q|\leqslant\psi(q)/q$ . If $\sum_{q=1}^{\infty}\psi(q)\varphi(q)/q=\infty$ , we show that $\mathcal{A}$ has full Lebesgue measure. This answers a question of Duffin and Schaeffer. As a corollary, we also establish a conjecture due to Catlin regarding non-reduced solutions to the inequality $|\alpha-a/q|\leqslant\psi(q)/q$ , giving a refinement of Khinchin’s Theorem.

Key words and phrases:

Diophantine approximation, Metric Number Theory, Duffin-Schaeffer conjecture, graph theory, density increment, compression arguments

2010 Mathematics Subject Classification:

Primary: 11J83. Secondary: 05C40

1. Introduction

Let $\psi:\mathbb{N}\to\mathbb{R}_{\geqslant 0}$ be an arbitrary function from the positive integers to the non-negative reals. Given $\alpha\in\mathbb{R}$ , we wish to understand when we can find infinitely many integers $a$ and $q$ such that

[TABLE]

Clearly, it suffices to restrict our attention to numbers $\alpha\in[0,1]$ .

When $\psi(q)=1/q$ for all $q$ , Dirichlet’s approximation theorem implies that, given any irrational $\alpha\in[0,1]$ , there are infinitely many coprime integers $a$ and $q$ satisfying (1.1). On the other hand, the situation can become significantly more complicated if $\psi$ behaves more irregularly. Even small variations in the size of $\psi$ can cause (1.1) to have no solutions for certain numbers $\alpha$ . However, there are several results in the literature that show that, under rather general conditions on $\psi$ , (1.1) has infinitely many solutions for almost all $\alpha\in[0,1]$ , in the sense that the residual set has null Lebesgue measure.

The prototypical such ‘metric’ result was proven by Khinchin in 1924 [15] (see also [16, Theorem 32]). To state his result, we let $\lambda$ denote the Lebesgue measure on $\mathbb{R}$ .

Khinchin’s theorem.

Consider a function $\psi:\mathbb{N}\to[0,+\infty)$ such that the sequence $(q\psi(q))_{q=1}^{\infty}$ is decreasing, and let $\mathcal{K}$ denote the set of real numbers $\alpha\in[0,1]$ for which (1.1) has infinitely many solutions $(a,q)\in\mathbb{Z}^{2}$ with $0\leqslant a\leqslant q$ . Then the following hold:

(a)

If $\sum_{q\geqslant 1}\psi(q)<\infty$ , then $\lambda(\mathcal{K})=0$ . 2. (b)

If $\sum_{q\geqslant 1}\psi(q)=\infty$ , then $\lambda(\mathcal{K})=1$ .

There is an intuitive way to explain why Khinchin’s result ought to be true. Consider the sets

[TABLE]

so that111Recall that if $X_{1},X_{2},\dots$ is a sequence of sets of real numbers, then $\limsup_{n\to\infty}X_{n}$ denotes the set of real numbers lying in infinitely many $X_{n}$ ’s.

[TABLE]

In addition,

[TABLE]

Thus, part (a) of Khinchin’s theorem is an immediate corollary of the ‘easy’ direction of the Borel-Cantelli lemma from Probability Theory [13, Lemma 1.2] applied to the probability space $[0,1]$ equipped with the measure $\lambda$ . If we knew, in addition, that the sets $\mathcal{K}_{q}$ were mutually independent, then we could apply the ‘hard’ direction of the Borel-Cantelli lemma [13, Lemma 1.3] to deduce part (b) of Khinchin’s theorem. Of course, the sets $\mathcal{K}_{q}$ are not mutually independent, so the difficulty in Khinchin’s proof is showing that there is enough ‘approximate independence’, so that $\mathcal{K}$ still has full measure.

In 1941, Duffin and Schaeffer [8] undertook a study of the limitations to the validity of Khinchin’s theorem, since the condition that $q\psi(q)$ is decreasing is not a necessary condition. They discovered that it is more natural to focus on reduced solutions $a/q$ to (1.1) that avoid overcounting issues arising when working with arbitrary fractions $a/q$ . To this end, let

[TABLE]

and

[TABLE]

Just like before, using the ‘easy’ direction of the Borel-Cantelli lemma, we immediately find that

[TABLE]

In analogy to Khinchin’s result, Duffin and Schaeffer conjectured that we also have the implication

[TABLE]

This is listed as Problem 46 in Montgomery’s lectures [18, Page 204].

The main result of the present paper is a proof of the Duffin-Schaeffer conjecture:

Theorem 1.

Let $\psi:\mathbb{N}\rightarrow\mathbb{R}_{\geqslant 0}$ be a function such that

[TABLE]

Let $\mathcal{A}$ be the set of $\alpha\in[0,1]$ for which the inequality

[TABLE]

has infinitely many coprime solutions $a$ and $q$ . Then $\mathcal{A}$ has Lebesgue measure 1.

As a direct corollary, we obtain Catlin’s conjecture [7] that deals with solutions to (1.7) where the approximations are not necessarily reduced fractions, giving an extension of Khinchin’s Theorem.

Theorem 2.

Let $\psi:\mathbb{N}\rightarrow\mathbb{R}_{\geqslant 0}$ and let $\mathcal{K}$ denote the set of $\alpha\in[0,1]$ for which the inequality (1.7) has infinitely many solutions $(a,q)\in\mathbb{Z}^{2}$ with $0\leqslant a\leqslant q$ . Define $\psi^{*}:\mathbb{N}\rightarrow\mathbb{R}_{\geqslant 0}$ by

[TABLE]

Then the following hold:

(a)

If $\sum_{q=1}^{\infty}\psi^{*}(q)<\infty$ , then $\lambda(\mathcal{K})=0$ . 2. (b)

If $\sum_{q=1}^{\infty}\psi^{*}(q)=\infty$ , then $\lambda(\mathcal{K})=1$ .

There has been much partial progress on the Duffin-Schaeffer conjecture in previous work. The assumption that the sequence $(q\psi(q))_{q=1}^{\infty}$ is decreasing implies that $(\psi(q)/q)_{q=1}^{\infty}$ is also decreasing. In particular, if $a/q$ is a fraction satisfying (1.1), then so is its reduction $a_{1}/q_{1}$ . Thus, as observed by Walfisz [24] (in work predating Duffin and Schaeffer’s conjecture), Khinchin’s Theorem implies the Duffin-Schaeffer conjecture when $q\psi(q)$ is decreasing. In the same paper, he strengthened part (b) of Khinchin’s theorem as follows: if $\sum_{q\geqslant 1}\psi(q)=\infty$ and $\psi(q)\ll\psi(2q)$ for all $q\in\mathbb{N}$ , then the set of $\alpha\in[0,1]$ for which (1.1) has infinitely many coprime solutions $a$ and $q$ has Lebesgue measure 1.

Duffin and Schaeffer [8] had already established their conjecture (1.6) when $\psi$ is sufficiently ‘regular’, in the sense that the function $\varphi(q)/q$ behaves like the constant function 1 when weighted with $\psi$ . More precisely, they proved (1.6) under the assumption that

[TABLE]

Since then, a variety of results towards the Duffin-Schaeffer conjecture have been proven. The first significant step was achieved by Erdős [10] and then improved by Vaaler [23], who demonstrated (1.6) when $\psi(q)=O(1/q)$ . In addition, Pollington and Vaughan [19] proved that the $d$ -dimensional analogue of the Duffin-Schaeffer conjecture holds for any $d\geqslant 2$ .

The proof of all three aforementioned results can be found in Harman’s book [13] (see Theorems 2.5, 2.6 and 3.6, respectively), along with various other cases of the Duffin-Schaeffer conjecture (see Theorems 2.9, 2.10, 3.7 and 3.8).

More recently, the focus shifted towards establishing variations of (1.6), where the assumption that the series $\sum_{q\geqslant 1}\psi(q)\varphi(q)/q$ diverges is replaced by a slightly stronger assumption. The first result of this kind was proven in 2006 by Haynes, Pollington and Velani [14], and was improved in 2013 by Beresnevich, Harman, Haynes and Velani [5]. The strongest published such result is the recent theorem of Aistleitner, Lachmann, Munsch, Technau and Zafeiropoulos [3] who showed that

[TABLE]

for any fixed $\varepsilon>0$ . In 2014, Aistleitner [1] established a companion result to the above one: he showed that if $\sum_{q=1}^{\infty}\psi(q)\varphi(q)/q$ diverges and $\psi$ is not ‘too concentrated’, in the sense that

[TABLE]

then $\lambda(\mathcal{A})=1$ .

*Remark**.*

In the recent progress report [2], Aistleitner explains how to improve on (1.8) and (1.9). In particular, his refined arguments allow him to replace $(\log q)^{\varepsilon}$ by $(\log\log q)^{\varepsilon}$ in (1.8).

Finally, Beresnevich and Velani [6] have proven that the Duffin-Schaeffer conjecture implies a Hausdorff measure version of itself. An immediate corollary of their results when combined with Theorem 1 is the following.

Corollary 3.

Let $\psi:\mathbb{N}\rightarrow[0,1/2]$ . Write $\mathcal{A}$ for the set of $\alpha\in[0,1]$ such that (1.7) has infinitely many coprime solutions $a$ and $q$ , and set

[TABLE]

Then the Hausdorff dimension $\dim_{\mathcal{H}}(\mathcal{A})$ of $\mathcal{A}$ satisfies

[TABLE]

The proof of Theorem 2, assuming Theorem 1, is explained in Section 2. For an outline of the proof of Theorem 1, we refer the readers to Section 3. Finally, the structure of the rest of the paper is presented in Section 4.

Notation

The letter $\mu$ will always denote a generic measure on $\mathbb{N}$ . We reserve the letter $\lambda$ for the Lebesgue measure on $\mathbb{R}$ .

Sets will be typically denoted by capital calligraphic letters such as $\mathcal{A},\mathcal{V}$ and $\mathcal{E}$ . A triple $G=(\mathcal{V},\mathcal{W},\mathcal{E})$ denotes a bipartite graph with vertex sets $\mathcal{V}$ and $\mathcal{W}$ and edge set $\mathcal{E}\subseteq\mathcal{V}\times\mathcal{W}$ .

Given a set or an event $\mathcal{E}$ , we let $\mathds{1}_{\mathcal{E}}$ denote its indicator function.

The letter $p$ will always denote a prime number. We also write $p^{k}\|n$ to mean that $p^{k}$ is the exact power of $p$ dividing the integer $n$ .

When we write $(a,b)$ , we mean the pair of $a$ and $b$ . In contrast, we write $\gcd(a,b)$ for the greatest common divisor of the integers $a$ and $b$ and $\operatorname{lcm}(a,b)$ for the least common multiple of $a$ and $b$ .

Finally, we adopt the usual asymptotic notation of Vinogradov: given two functions $f,g:X\to\mathbb{R}$ and a set $Y\subseteq X$ , we write “ $f(x)\ll g(x)$ for all $x\in Y$ ” if there is a constant $c=c(f,g,Y)>0$ such that $|f(x)|\leqslant cg(x)$ for all $x\in Y$ . The constant is absolute unless otherwise noted by the presence of a subscript. If $h:X\to\mathbb{R}$ is a third function, we use Landau’s notation “ $f=g+O(h)$ on $Y$ ” to mean that $|f-g|\ll h$ on $Y$ . Typically the set $Y$ is clear from the context and so not stated explicitly.

We introduce several new quantities and associated notation in Section 6 which are tailored to our application. In the interest of concreteness we have decided to use explicit constants in several parts of the argument, but we encourage the reader not to concern themselves with numerics on a first reading.

Acknowledgements

First and foremost, we would like to thank Sam Chow, Leo Goldmakher and Andrew Pollington for their valuable insights to this project: we have had extended discussions with them on various aspects of the Duffin-Schaeffer conjecture and are indebted to them for their contributions. In addition, we would like to thank Sam Chow for pointing out the connection of our paper to Catlin’s conjecture and the construction of the counterexample given in Section 15, and Sanju Velani for introducing J.M. to this problem. Finally, we are grateful to Christopher Aistleitner, Ben Green, Alan Haynes and Sam Chow for sending us various comments and corrections on an earlier version of our paper, as well as to the anonymous referees of the paper for their very detailed comments.

Our project began in the Spring of 2017 during our visit to the Mathematical Sciences Research Institute in Berkeley, California (supported by the National Science Foundation under Grant No. DMS-1440140). In addition, a significant part of our work took place during two visits of J.M. to the Centre de recherche mathématiques in Montréal in November 2017 and May 2018, and during the visit of D.K. to the University of Oxford in the Spring of 2019 (supported by Ben Green’s Simons Investigator Grant 376201). We would like to thank our hosts for their support and hospitality.

D.K. was also supported by the Natural Sciences and Engineering Research Council of Canada (Discovery Grant 2018-05699) and by the Fonds de recherche du Québec - Nature et technologies (projet de recherche en équipe - 256442). J.M. was also supported by a Clay Research Fellowship during the first half of this project, and this project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 851318) for the later stages.

2. Deduction of Theorem 2 from Theorem 1

Most of the details of this deduction can be found in Catlin’s original paper [7]. We give them here as well for the sake of completeness. For easy reference, let

[TABLE]

Firstly, we deal with a rather trivial case.

Case 1: There is a sequence of integers $q_{1}<q_{2}<\cdots$ such that $\psi(q_{i})\geqslant 1/2$ for all $i$ .

By passing to a subsequence if necessary, we may assume that $q_{i+1}\geqslant 2q_{i}^{2}$ for all $i$ . Recall the definition of the set $\mathcal{K}_{q}$ from (1.2). Since $\psi(q_{i})\geqslant 1/2$ , we infer that $\mathcal{K}_{q_{i}}=[0,1]$ for each $i$ . As a consequence, $\mathcal{K}=[0,1]$ . We claim that we also have $S=\infty$ . Indeed, for each $d|q_{i}$ , we have

[TABLE]

Consequently,

[TABLE]

since $\sum_{q|q_{i}}\varphi(q)=q_{i}$ and $\sum_{q\leqslant q_{i-1}}\varphi(q)\leqslant q_{i-1}^{2}\leqslant q_{i}/2$ . Summing (2.1) over all $i\geqslant 2$ proves our claim that $S=\infty$ .

Hence, if we are in Case 1, we see that $S=\infty$ and $\mathcal{K}=[0,1]$ , so that Theorem 2 holds.

Case 2: There are finitely many $q\in\mathbb{N}$ with $\psi(q)\geqslant 1/2$ .

Note that in this case replacing $\psi$ by $\min\{\psi,1/2\}$ does not affect either the convergence of $S$ , nor which numbers lie in the set $\mathcal{K}=\limsup_{q\to\infty}\mathcal{K}_{q}$ . Hence, we may assume without loss of generality that $\psi\leqslant 1/2$ . In particular, we have that $\lim_{n\to\infty}\psi(n)/n=0$ , so that we may replace $\sup$ by $\max$ in the definition of $S$ . We now follow an argument due to Catlin.

Consider the function $\xi$ defined by

[TABLE]

and the sets

[TABLE]

These are the analogues of the sets $\mathcal{A}_{q}$ and $\mathcal{A}$ that appear in Theorem 1, but with $\xi$ in place of $\psi$ . We claim that

[TABLE]

This will immediately complete the proof of Theorem 2(b) by applying Theorem 1. In addition, Theorem 2(a) will follow from (1.5).

Indeed, if $\alpha\in\mathcal{C}\setminus\mathbb{Q}$ , then there are infinitely many reduced fractions $a_{j}/q_{j}$ such $|\alpha-a_{j}/q_{j}|\leqslant\xi(q_{j})/q_{j}$ . By the definition of $\xi$ , there is some $n_{j}$ that is a multiple of $q_{j}$ such that $\xi(q_{j})/q_{j}=\psi(n_{j})/n_{j}$ . If we let $m_{j}=a_{j}n_{j}/q_{j}$ , then $|\alpha-m_{j}/n_{j}|\leqslant\psi(n_{j})/n_{j}$ for all $j$ . Since $\lim_{j\to\infty}q_{j}=\infty$ and $n_{j}\geqslant q_{j}$ for each $j$ , we also have that $\lim_{j\to\infty}n_{j}=\infty$ , whence $\alpha\in\mathcal{K}$ .

Conversely, let $\alpha\in\mathcal{K}\setminus\mathbb{Q}$ . Then there are infinitely many pairs $(m_{j},n_{j})\in\mathbb{N}^{2}$ such that $|\alpha-m_{j}/n_{j}|\leqslant\psi(n_{j})/n_{j}$ . If we let $a_{j}/q_{j}$ be the fraction $m_{j}/n_{j}$ in reduced form, we also have that $|\alpha-a_{j}/q_{j}|\leqslant\psi(n_{j})/n_{j}\leqslant\xi(q_{j})/q_{j}$ , where the last inequality follows by noticing that $q_{j}|n_{j}$ . This shows that $\alpha\in\mathcal{C}$ , as long as we can show that infinitely many of the fractions $a_{j}/q_{j}$ are distinct. But if this were not the case, there would exist a fraction $a/q$ such that $a_{j}/q_{j}=a/q$ for infinitely many $j$ , so that $|\alpha-a/q|\leqslant\psi(n_{j})/n_{j}\leqslant 1/(2n_{j})$ for all such $j$ . Letting $j\to\infty$ , we find that $\alpha=a/q\in\mathbb{Q}$ , a contradiction.

This completes the proof of (2.2), and hence of Theorem 2 in all cases.

3. Outline of the proof of Theorem 1

The purpose of this section is to explain in rough terms the main ideas that go into the proof of our main result. To simplify various technicalities, let us consider the special case where the function $\psi$ satisfies the following conditions:

(a)

$\psi(q)=0$ or $\psi(q)=q^{-c}$ for every $q\in\mathbb{N}$ ; 2. (b)

$\psi$ is non-zero only on square-free integers $q$ ; 3. (c)

There exists an infinite sequence $2<x_{1}<x_{2}<\dots$ such that:

(i)

$x_{j}>x_{j-1}^{2}$ ; 2. (ii)

$\psi$ is supported on $\cup_{i=1}^{\infty}[x_{i},2x_{i}]$ ; 3. (iii)

for each $i$ we have

[TABLE]

In this set-up, it follows from a well-known second moment argument (which will be explained in detail in Section 5) that to establish the Duffin-Schaeffer conjecture it is sufficient to show that for any $x\in\{x_{1},\,x_{2},\dots\}$ we have

[TABLE]

where

[TABLE]

Note that we have the estimate

[TABLE]

so the key to the proof is to show that $P(q,r)\ll 1$ on average over $q,r\in\mathcal{S}$ . This would then show suitable ‘approximate independence’ of the sets $\mathcal{A}_{q}$ defined by (1.3). The size of $P(q,r)$ is controlled by small primes dividing exactly one of $q,r$ . With this in mind, let us consider separately the contribution from $q,r$ with

[TABLE]

for different thresholds $t$ (which we think of as small compared with $x$ ). A calculation then shows that it is sufficient to show that for each $t\geqslant 1$

[TABLE]

In particular, we need to understand the structure of a set $\mathcal{S}$ where many of the pairs $(q,r)\in\mathcal{S}^{2}$ have a large common factor. There are $O(x^{c})$ choices of $q\in\mathcal{S}$ weighted by $\varphi(q)/q$ . Given $q\in\mathcal{S}$ , there are $x^{o(1)}$ divisors of $q$ that are at least $x^{1-c}/t$ . In turn, given such a divisor $d$ , there are $O(x^{c}t)$ integers $r\in[x,2x]$ which are a multiple of $d$ (forgetting the constraint $r\in\mathcal{S}$ ). This gives a bound $tx^{2c+o(1)}$ for the sum in (3.2), and so the key problem is to win back a little bit more than the $x^{o(1)}$ factor from the divisor bound. We wish to do this by gaining a structural understanding of sets $\mathcal{S}$ where many pairs have a large GCD. One way that many pairs in $\mathcal{S}$ can have a large GCD is if a positive proportion of elements of $\mathcal{S}$ are a multiple of some fixed divisor $d$ . It is natural to ask if this is the only such construction. If we ignore the $\varphi(q)/q$ weights, this leads us to the following prototypical question that we shall refer to as the Model Problem.

Model Problem.

Let $\mathcal{S}\subseteq[x,2x]$ satisfy $\#\mathcal{S}\asymp x^{c}$ and be such that there are $\#\mathcal{S}^{2}/100$ pairs $(a_{1},a_{2})\in\mathcal{S}^{2}$ with $\gcd(a_{1},a_{2})>x^{1-c}$ . Must it be the case that there is an integer $d\gg x^{1-c}$ which divides $\gg\#\mathcal{S}$ elements of $\mathcal{S}$ ?

It turns out that the answer to this Model Problem as stated is ‘no’, but a technical variant of it that is sufficient for proving Theorem 1 has a positive answer. For the purposes of this section, we will ignore this subtle issue; we will return to it and discuss it in detail in Section 15.

To attack our Model Problem, we use a ‘compression’ argument, roughly inspired by the papers of Erdős-Ko-Rado [11] and Dyson [9]. We will repeatedly pass to subsets of $\mathcal{S}$ where we have increasing control over whether given primes occur in the GCDs or not, whilst at the same time showing that the size of the original set is controlled in terms of the size of the new set. At the end of the iteration procedure we will then have arrived at a subset which controls the size of $\mathcal{S}$ , and where we know that all large GCDs are caused by a fixed divisor. Since the final set then has a very simple GCD structure, we will have enough information to establish (3.2).

To enable the iterations, we pass to a bipartite setup. We start out with sets $\mathcal{V}_{0}=\mathcal{W}_{0}=\mathcal{S}$ . Then, we construct two decreasing sequences of sets $\mathcal{V}_{0}\supset\mathcal{V}_{1}\supset\mathcal{V}_{2}\supset\cdots$ and $\mathcal{W}_{0}\supset\mathcal{W}_{1}\supset\mathcal{W}_{2}\supset\cdots$ , as well as a sequence of primes $p_{1},p_{2},\dots$ such that either $p_{j}$ divides all elements of $\mathcal{V}_{j}$ , or $p_{j}$ is coprime to all elements of $\mathcal{V}_{j}$ (and similarly with $\mathcal{W}_{j}$ ). Since $\mathcal{S}$ contains only square-free integers in the simplified set-up of this section, this means that there will be exponents $k_{j},\ell_{j}\in\{0,1\}$ such that $p_{j}^{k_{j}}\|v$ for all $v\in\mathcal{V}_{j}$ , and $p_{j}^{\ell_{j}}\|w$ for all $w\in\mathcal{W}_{j}$ . Hence, if we let $a_{j}=p_{1}^{k_{1}}\cdots p_{j}^{k_{j}}$ and $b_{j}=p_{1}^{\ell_{1}}\cdots p_{j}^{\ell_{j}}$ , then $a_{j}$ will divide all elements of $\mathcal{W}_{j}$ and $b_{j}$ will divide all elements of $\mathcal{V}_{j}$ .

We will construct the sets $\mathcal{V}_{1},\mathcal{V}_{2},\dots$ and $\mathcal{W}_{1},\mathcal{W}_{2},\dots$ in an iterative fashion. Assume that after $j$ iterations we have arrived at the sets $\mathcal{V}_{j},\mathcal{W}_{j}\subseteq\mathcal{S}$ . We then pick a prime $p_{j+1}$ that is different from $p_{1},\dots,p_{j}$ , and that occurs as the prime factor of $\gcd(v,w)$ for some $v\in\mathcal{V}_{j}$ , $w\in\mathcal{W}_{j}$ with $\gcd(v,w)>x^{1-c}/t$ . Our goal is to pass judiciously to subsets $\mathcal{V}_{j+1}\subseteq\mathcal{V}_{j}$ and $\mathcal{W}_{j+1}\subseteq\mathcal{W}_{j}$ where either $\mathcal{V}_{j+1}$ is all elements of $\mathcal{V}_{j}$ that are divisible by $p_{j+1}$ , or $\mathcal{V}_{j+1}$ is all elements of $\mathcal{V}_{j}$ coprime to $p_{j+1}$ (and similarly with $\mathcal{W}_{j+1}$ ). Since we’re assuming that $\mathcal{S}$ contains only square-free integers, we then will completely know the $p_{j+1}$ -divisibility of all elements of $\mathcal{V}_{j+1}$ and $\mathcal{W}_{j+1}$ , so in particular all GCDs between an element of $\mathcal{V}_{j+1}$ and $\mathcal{W}_{j+1}$ will either be multiple of $p_{j+1}$ , or all will be coprime to $p_{j+1}$ .

Eventually, we will arrive at a pair of sets $(\mathcal{V}_{J},\mathcal{W}_{J})$ such that every pair $(v,w)\in\mathcal{V}_{J}\times\mathcal{W}_{J}$ with $\gcd(v,w)>x^{1-c}/t$ has the property that all prime factors of $\gcd(v,w)$ will lie in the set $\{p_{1},\dots,p_{J}\}$ (and moreover we will ensure that there is at least one such pair). This terminates the iterative procedure. By construction, all elements of $\mathcal{V}=\mathcal{V}_{J}$ will be divisible by the fixed integer $a=a_{J}$ , and similarly all elements of $\mathcal{W}=\mathcal{W}_{J}$ will be divisible by the fixed integer $b=b_{J}$ . In addition, if $v\in\mathcal{V}$ and $w\in\mathcal{W}$ has $\gcd(v,w)>x^{1-c}/t$ , then in fact $\gcd(v,w)$ will be exactly equal to $\gcd(a,b)$ since we know the $p_{j}$ -divisibility for all elements of $\mathcal{V}$ and $\mathcal{W}$ . Thus, $\gcd(a,b)>x^{1-c}/t$ and actually every pair $v\in\mathcal{V}$ and $w\in\mathcal{W}$ has $\gcd(v,w)=\gcd(a,b)$ .

Naturally, the success of the above strategy depends on improving the ‘structure’ of the pair of sets $(\mathcal{V}_{j},\mathcal{W}_{j})$ at each stage of the algorithm. This will enable us to control a quantity like the left hand side of (3.2) in terms of a related quantity for $(\mathcal{V},\mathcal{W})=(\mathcal{V}_{J},\mathcal{W}_{J})$ . An initially appealing choice to measure the ‘structure’ might be

[TABLE]

namely the density of pairs $(v,w)$ with large GCD at stage $j$ . Iteratively increasing this quantity would try to mimic a ‘density increment’ strategy such as that used in the proof of Roth’s Theorem on arithmetic progressions [21, 22]. Unfortunately, such an argument loses all control over the size of the sets $\mathcal{V}_{j},\mathcal{W}_{j}$ , and so we lose control over the sum in (3.2).

An alternative suggestion might be to consider a different quantity which focuses on the size of the sets. Recall that all elements of $\mathcal{V}_{j}$ are a multiple of $a_{j}$ , all elements of $\mathcal{W}_{j}$ are a multiple of $b_{j}$ , and that in our final step we have $\gcd(a,b)>x^{1-c}/t$ . Thus

[TABLE]

where we used that $\mathcal{V}=\mathcal{V}_{J}$ and $\mathcal{W}=\mathcal{W}_{J}$ are subsets of $[x,2x]$ in (3.3). Thus, one might try to iteratively increase the quantity

[TABLE]

This would adequately control (3.2), but unfortunately it is not possible to guarantee that this quantity increases at each stage, and so this proposal also fails.

However, the variant

[TABLE]

turns out to (more-or-less) work well. Indeed, if the quantity (3.4) increases at each iteration, and at the final iteration all elements of $\mathcal{V}=\mathcal{V}_{J}$ are a multiple of $a=a_{J}$ , all elements of $\mathcal{W}=\mathcal{W}_{J}$ are a multiple of $b=b_{J}$ , and all edges come from pairs $(v,w)$ with $\gcd(v,w)=\gcd(a,b)>x^{1-c}/t$ , then we find that

[TABLE]

We note that in our setup $\#\mathcal{S}\asymp x^{c}$ , and that

[TABLE]

If it so happens that $\delta_{0}\leqslant 1/t$ , then we trivially obtain (3.2) (ignoring the $\varphi(q)/q$ weighting) from (3.6). On the other hand, if $\delta_{0}\gg 1/t$ , then (3.5) falls short of (3.2) only by a factor $t^{12}$ .

Finally, to win the additional factor of $t^{12}$ we make use of the fact that any edge $(q,r)$ in our graph satisfies (3.1). The crucial estimate is that

[TABLE]

This was the key idea in the earlier work of Erdős [10] and Vaaler [23] on the Duffin-Schaeffer conjecture. In our case, our iteration procedure has essentially reduced the proof to a similar situation to their work.

Indeed, in (3.3), we may restrict our attention to pairs $(v,w)$ such that $a|v$ , $b|w$ and

[TABLE]

Unless most of the contribution to the above sum of comes from primes in $a$ and $b$ , we can apply (3.7) to win a factor of size $e^{-t}=o(t^{-12})$ in (3.3). Finally, if the small primes in $a$ and $b$ do cause a problem, then a more careful analysis of our iteration procedure shows that we actually are able to increase the quantity (3.4) by more than $t^{12}$ by the final stage $J$ , which also suffices for establishing (3.2) in this case.

The above description has ignored several important technicalities; it turns out that the $\varphi(q)/q$ weights are vital for our argument to work (see the discussion in Section 15). In addition, we do not quite work with (3.4) but with a closely related (but more complicated) expression to enable this quantity to increase at each iteration. The iteration procedure of our argument is broken up into different stages. In between two of the principal iterative stages, we perform a certain ‘clean-up’ step at which we allow a small loss in the quantity (3.4). This step is essential in order to keep track of the condition (3.8) (which could otherwise become meaningless after too many iterations).

4. Structure of the paper

In the first half of the paper that consists of Sections 5-10, we reduce the proof of Theorem 1 to three technical iterative statements about particular graphs, which we call ‘GCD graphs’ (see Definition 6.1). Specifically, in Section 5 we use a second moment argument to reduce the proof to Proposition 5.4, which claims a suitable bound for sums of the form (3.2). Here, we make use of Lemmas 5.1-5.3 which are standard results from the literature. In Section 6 we introduce the key terminology of the paper and translate Proposition 5.4 into Proposition 6.3, a statement about edges in a particular ‘GCD graph’. In Section 7 we use results about the anatomy of integers (Lemmas 7.2 and 7.3) to reduce the situation to establishing Proposition 7.1, a technical statement claiming the existence of a ‘good’ GCD subgraph (where ‘good’ here means that there are integers $a$ and $b$ such that all vertices in $\mathcal{V}$ are divisible by $a$ , those in $\mathcal{W}$ are divisible by $b$ , and if $(v,w)$ is an edge, then $\gcd(v,w)=\gcd(a,b)$ ). Then in Section 8, we reduce the proof of Proposition 7.1 to five iterative claims which form the heart of the paper: Propositions 8.1-8.3 and Lemmas 8.4-8.5. In Sections 9 and 10 we then directly establish Lemmas 8.4 and 8.5, respectively, leaving the second half of the paper to demonstrate the key statements of Propositions 8.1-8.3.

The dependency diagram for the first half of the paper is as follows:

Theorem 1

Proposition 5.4

Proposition 6.3

Proposition 7.1

Proposition 8.1

Proposition 8.2

Proposition 8.3

Lemma 5.1

Lemma 5.2

Lemma 5.3

Lemma 7.3

Lemma 7.2

Lemma 10.1

Lemma 8.4

Lemma 8.5

The second half of the paper consists of Sections 11-14, and it is devoted to proving each of Proposition 8.1, 8.2 and 8.3. Before we embark on the proofs directly, we first establish several preparatory lemmas in Section 11. In particular we prove Lemmas 11.2-11.6 which are minor results on GCD graphs we will use later on. Section 12 is dedicated to the proof of Proposition 8.1, which is the easier iteration step, and relies on two auxiliary results: Lemmas 12.1 and 12.2. Section 13 is dedicated to the proof of Proposition 8.3, the iteration procedure for small primes. This proposition follows from Lemma 13.2, in turn relying on Lemmas 11.2, 11.3 and 13.1. Finally, in Section 14 we prove Proposition 8.2, which is the most delicate part of the iteration procedure. This follows quickly from Lemma 14.1, which in turn relies on Lemmas 11.3-11.6. The dependency diagram for the second half of the paper is as follows:

Proposition 8.1

Proposition 8.3

Proposition 8.2

Lemma 12.2

Lemma 12.1

Lemma 14.1

Lemma 11.4

Lemma 11.6

Lemma 11.3

Lemma 13.2

Lemma 11.2

Lemma 13.1

Lemma 10.1

Lemma 11.5

(We have not included the essentially trivial statement of Lemma 11.1 or Lemma 6.7 which are used frequently in the later sections.) All lemmas are proven in the section where they appear with the exception of Lemma 8.4 and Lemma 8.5, which are proven in Sections 9 and 10 respectively. All propositions are proven in sections later than they appear.

5. Preliminaries

We first reduce the proof of Theorem 1 to a second moment bound given by Proposition 5.4 below. This reduction is standard and appears in several previous works on the Duffin-Schaeffer conjecture. In particular, a vital component is the following ergodic 0-1 law due to Gallagher [12].

Lemma 5.1 (Gallagher’s 0-1 law).

Consider a function $\psi:\mathbb{N}\to\mathbb{R}_{\geqslant 0}$ and let $\mathcal{A}$ be as in (1.4). Then either $\lambda(\mathcal{A})=0$ or $\lambda(\mathcal{A})=1$ .

Proof.

This is Theorem 1 of [12]. ∎

Lemma 5.2 (The Duffin-Schaeffer Conjecture when $\psi$ only takes large values).

Let $\psi:\mathbb{N}\rightarrow\mathbb{R}_{\geqslant 0}$ be a function, and let $\mathcal{A}$ be as in (1.4). Assume, further, that:

(a)

For every $q\in\mathbb{Z}$ , either $\psi(q)=0$ or $\psi(q)\geqslant 1/2$ ; 2. (b)

$\sum_{q=1}^{\infty}\psi(q)\varphi(q)/q=\infty$ .

Then $\lambda(\mathcal{A})=1$ .

Proof.

This follows from [19, Theorem 2]. ∎

Lemma 5.3 (Bound for $\lambda(\mathcal{A}_{q}\cap\mathcal{A}_{r})$ ).

Consider a function $\psi:\mathbb{N}\to[0,1/2]$ and let $\mathcal{A}_{q}$ be as in (1.3). In addition, given $q,r\in\mathbb{N}$ , set

[TABLE]

If $q\neq r$ , then we have

[TABLE]

Proof.

This bound is given in [19, p. 195-196]. ∎

Given the above lemma, we introduce the notation

[TABLE]

for $a,b\in\mathbb{N}$ and $t\geqslant 1$ . The key result to proving Theorem 1 is:

Proposition 5.4 (Second moment bound).

Let $\psi$ and $M(q,r)$ be as in as in Lemma 5.3, and consider $Y\geqslant X\geqslant 1$ such that

[TABLE]

For each $t\geqslant 1$ , set

[TABLE]

Then

[TABLE]

Proof of Theorem 1 assuming Proposition 5.4.

We wish to prove that

[TABLE]

where $\mathcal{A}=\limsup_{q\to\infty}\mathcal{A}_{q}$ with $\mathcal{A}_{q}$ defined by (1.4). We first write

[TABLE]

In particular, $\psi_{2}(q)=\psi(q)$ if $\psi(q)\leqslant 1/2$ , and $\psi_{2}(q)=0$ otherwise.

If it so happens that $\sum_{q=1}^{\infty}\psi_{1}(q)\varphi(q)/q=\infty$ , then we apply Lemma 5.2 to $\psi_{1}$ to find that $\lambda(\limsup_{q\to\infty}\mathcal{B}_{q})=1$ , where $\mathcal{B}_{q}$ is defined as $\mathcal{A}_{q}$ but with $\psi$ replaced by $\psi_{1}$ . This proves (5.3), since $\psi_{1}(q)\leqslant\psi(q)$ , and so $\mathcal{B}_{q}\subseteq\mathcal{A}_{q}$ .

Therefore we may assume without loss of generality that $\sum_{q=1}^{\infty}\psi_{1}(q)\varphi(q)/q<\infty$ , and so $\sum_{q=1}^{\infty}\psi_{2}(q)\varphi(q)/q=\infty$ . Thus, we have reduced Theorem 1 to the case when

[TABLE]

By Lemma 5.1, the Duffin-Schaeffer conjecture will follow if we prove that $\lambda(\mathcal{A})>0$ , since this means $\mathcal{A}$ cannot have measure 0. Note that

[TABLE]

Now, let $X$ be a large parameter and fix $Y=Y(X)$ to be minimal such that

[TABLE]

(Such a $Y$ exists since $\psi(q)\leqslant 1/2$ for all $q$ .) Hence, we see that it suffices to prove that

[TABLE]

uniformly for all large enough $X$ , since this implies that $\lambda(\mathcal{A})>0$ by virtue of (5.4), and hence Theorem 1 follows.

For each $\alpha\in\mathbb{R}$ , consider the counting function

[TABLE]

We then have

[TABLE]

Hence, the Cauchy-Schwarz inequality implies that

[TABLE]

Thus, to establish (5.5), it is enough to prove that

[TABLE]

The terms with $q=r$ contribute a total

[TABLE]

and so we only need to consider the contribution of those terms with $q\neq r$ . Applying Lemma 5.3, we see that

[TABLE]

where we recall that

[TABLE]

Thus, (5.6) is reduced to showing that

[TABLE]

To prove this inequality, we divide the range of $q$ and $r$ into convenient subsets.

The pairs $(q,r)\in(\mathbb{Z}\cap[X,Y])^{2}$ with

[TABLE]

contribute a total of at most

[TABLE]

to the right hand side of (5.7), and so can be ignored.

For any other pair $(q,r)$ , we see that

[TABLE]

so certainly we have

[TABLE]

For any such pair, we let $j=j(q,r)$ be the largest integer such that

[TABLE]

Since $j$ is chosen maximally, we have

[TABLE]

Mertens’ theorem then implies that

[TABLE]

Therefore

[TABLE]

where we used again Mertens’ theorem. As above, those pairs with

[TABLE]

make an acceptable contribution to (5.7). Therefore we only need to consider pairs $(q,r)$ with $M(q,r)/\gcd(q,r)<\exp\exp(j)$ .

We have thus reduced (5.7) to showing that

[TABLE]

where $\mathcal{E}_{t}$ is defined by (5.2). To prove (5.8), we apply Proposition 5.4, which shows that the inner sum is $O(1/\exp\exp(j))$ . Since the sum of $e^{j}/\exp\exp(j)$ over $j\geqslant 0$ converges, this completes the proof of Theorem 1. ∎

Thus we are left to establish Proposition 5.4.

6. Bipartite GCD graphs

In this section we introduce the key notation that will underlie the rest of the paper. In particular, we show that Proposition 5.4 follows from a statement given by Proposition 6.3 about a weighted graph with additional information about divisibility of the integers making up its vertices. The rest of the paper is then dedicated to establishing suitable properties of such graphs, which we call ‘GCD graphs’.

If we let

[TABLE]

and we weight the elements of $\mathcal{V}$ with the measure

[TABLE]

then Proposition 5.4 can be interpreted as an estimate for the weighted edge density of the graph with set of vertices $\mathcal{V}$ and set of edges $\mathcal{E}_{t}$ defined by (5.2).

Our strategy for proving Proposition 5.4 is to use a ‘compression’ argument. More precisely, if $G_{1}$ denotes the graph described in the above paragraph, we will construct a finite sequence of graphs $G_{1},\dots,G_{J}$ where we make a small local change to pass from $G_{j}$ to $G_{j+1}$ that increases the amount of structure in the graph. The final graph $G_{J}$ will then be highly structured and easy to analyze. To keep control over the procedure, we keep track of how certain statistics of the graph change at each step. This enables us to show that the relevant properties of $G_{j}$ are suitably controlled by $G_{j+1}$ , and so $G_{1}$ is controlled by $G_{J}$ , where everything is explicit.

To perform the above construction, we introduce some new notation to take into account the extra information about prime power divisibility which we need to carry at each stage.

Definition 6.1 (GCD graph).

Let $G$ be a septuple $(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ such that:

(a)

$\mu$ is a measure on $\mathbb{N}$ such that $\mu(n)<\infty$ for all $n\in\mathbb{N}$ ; we extend to $\mathbb{N}^{2}$ by letting

[TABLE] 2. (b)

$\mathcal{V}$ and $\mathcal{W}$ are finite sets of positive integers; 3. (c)

$\mathcal{E}\subseteq\mathcal{V}\times\mathcal{W}$ , that is to say $(\mathcal{V},\mathcal{W},\mathcal{E})$ is a bipartite graph; 4. (d)

$\mathcal{P}$ is a set of primes; 5. (e)

$f$ and $g$ are functions from $\mathcal{P}$ to $\mathbb{Z}_{\geqslant 0}$ such that for all $p\in\mathcal{P}$ we have:

(i)

$p^{f(p)}|v$ for all $v\in\mathcal{V}$ , and $p^{g(p)}|w$ for all $w\in\mathcal{W}$ ; 2. (ii)

if $(v,w)\in\mathcal{E}$ , then $p^{\min\{f(p),g(p)\}}\|\gcd(v,w)$ ; 3. (iii)

if $f(p)\neq g(p)$ , then $p^{f(p)}\|v$ for all $v\in\mathcal{V}$ , and $p^{g(p)}\|w$ for all $w\in\mathcal{W}$ .

We then call $G$ a (bipartite) GCD graph with sets of vertices $(\mathcal{V},\mathcal{W})$ , set of edges $\mathcal{E}$ and multiplicative data $(\mathcal{P},f,g)$ . We will also refer to $\mathcal{P}$ as the set of primes of $G$ . If $\mathcal{P}=\emptyset$ , we say that $G$ has trivial set of primes and we view $f=f_{\emptyset}$ and $g=g_{\emptyset}$ as two copies of the empty function from $\emptyset$ to $\mathbb{Z}_{\geqslant 0}$ .

Definition 6.2 (Non-trivial GCD graph).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ . We say that $G$ is non-trivial if $\mu(\mathcal{E})>0$ .

We now recast Proposition 5.4 in the language of GCD graphs.

Proposition 6.3 (Edge set bound).

Let $\psi:\mathbb{N}\rightarrow\mathbb{R}_{\geqslant 0}$ , $t\geqslant 1$ and $\mu$ be the measure $\mu(v)=\psi(v)\varphi(v)/v$ . Let $\mathcal{V}\subseteq\mathbb{N}$ satisfy $0<\mu(\mathcal{V})\ll 1$ . Let $G=(\mu,\mathcal{V},\mathcal{V},\mathcal{E},\emptyset,f_{\emptyset},g_{\emptyset})$ be a bipartite GCD graph with measure $\mu$ , vertex sets $\mathcal{V}$ , trivial set of primes, and edge set $\mathcal{E}\subseteq\mathcal{E}_{t}$ , where $\mathcal{E}_{t}$ is defined as in Proposition 5.4. Then

[TABLE]

Proof of Proposition 5.4 assuming Proposition 6.3.

Recall the notation $\psi$ , $M(q,r)$ , $L_{t}(a,b)$ , $X$ , $Y$ and $\mathcal{E}_{t}$ of Proposition 5.4. We wish to show that

[TABLE]

Let $\mu$ be the measure on $\mathbb{N}$ defined by $\mu(v):=\psi(v)\varphi(v)/v$ and let $\mathcal{V}=\mathbb{Z}\cap[X,Y]$ , so that

[TABLE]

Now define $\mathcal{E}=\mathcal{E}_{t}$ to be as in Proposition 5.4. We see that $(\mathcal{V},\mathcal{V},\mathcal{E})$ forms a bipartite graph with vertex sets two copies of $\mathcal{V}$ and edge set $\mathcal{E}$ . We now turn this bipartite graph into a GCD graph $G=(\mu,\mathcal{V},\mathcal{V},\mathcal{E},\emptyset,f_{\emptyset},g_{\emptyset})$ by attaching trivial multiplicative data to the bipartite graph (here $f_{\emptyset}$ and $g_{\emptyset}$ are viewed as two copies of the function of the empty set to $\mathbb{Z}_{\geqslant 0}$ ).

Since $0<\mu(\mathcal{V})\ll 1$ , Proposition 6.3 now applies, showing that

[TABLE]

This completes the proof. ∎

Thus we are left to establish Proposition 6.3. As we briefly explained in Section 3, this will be done by passing iteratively to subgraphs of $G$ on which we control the divisibility by more and more primes. To formalize this procedure, we introduce the concept of a GCD subgraph.

Definition 6.4 (GCD subgraph).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ and $G^{\prime}=(\mu^{\prime},\mathcal{V}^{\prime},\mathcal{W}^{\prime},\mathcal{E}^{\prime},\mathcal{P}^{\prime},f^{\prime},g^{\prime})$ be two GCD graphs. We say that $G^{\prime}$ is a GCD subgraph of $G$ if:

[TABLE]

We write $G^{\prime}\preceq G$ if $G^{\prime}$ is a GCD subgraph of $G$ . Lastly, we say that $G^{\prime}$ is a non-trivial GCD subgraph of $G$ if $\mu(\mathcal{E}^{\prime})>0$ , that is to say $G^{\prime}$ is non-trivial as a GCD graph.

We thus see from the above definition that we only accept $G^{\prime}$ as a subgraph of $G$ if we have at least as much information about the divisibility of the vertices of $G^{\prime}$ compared to those of $G$ . In particular, we have that $p^{\min(f(p),g(p))}\|\gcd(v^{\prime},w^{\prime})$ for all $(v^{\prime},w^{\prime})\in\mathcal{E}^{\prime}$ and all $p\in\mathcal{P}$ .

We will devise an iterative argument that adds one prime at a time to $\mathcal{P}$ , so that we will eventually control the multiplicative structure of GCDs of connected vertices in the graph very well by the end of this process.

The main way we will produce a GCD subgraph of a GCD graph $G$ is by restricting to vertex sets with certain divisibility properties. Since we will use this several times, we introduce a specific notation for these GCD subgraphs.

Definition 6.5 (Special GCD subgraphs from prime power divisibility).

Let $p$ be a prime number, and let $k,\ell\in\mathbb{Z}_{\geqslant 0}$ .

(a)

If $\mathcal{V}$ is a set of integers and $k\in\mathbb{Z}_{\geqslant 0}$ , we set

[TABLE]

that is to say $\mathcal{V}_{p^{k}}$ is the set of integers in $\mathcal{V}$ whose $p$ -adic valuation is exactly $k$ . Here we have the understanding that $\mathcal{V}_{p^{0}}$ denotes the set of $v\in\mathcal{V}$ that are coprime to $p$ . In particular, $\mathcal{V}_{2^{0}}$ and $\mathcal{V}_{3^{0}}$ denote different sets of integers. 2. (b)

Let $G=(\mathcal{V},\mathcal{W},\mathcal{E})$ be a bipartite graph. If $\mathcal{V}^{\prime}\subseteq\mathcal{V}$ and $\mathcal{W}^{\prime}\subseteq\mathcal{W}$ , we define

[TABLE]

We also write for brevity

[TABLE] 3. (c)

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph such that $p\notin\mathcal{P}$ . We then define the septuple

[TABLE]

where the functions $f_{p^{k}}$ , $g_{p^{\ell}}$ are defined on $\mathcal{P}\cup\{p\}$ by the relations $f_{p^{k}}|_{\mathcal{P}}=f$ , $g_{p^{\ell}}|_{\mathcal{P}}=g$ ,

[TABLE]

It is easy to check that $G_{p^{k},p^{\ell}}$ is a GCD subgraph of $G$ .

The aim of our iterative procedure is to obtain a simple GCD subgraph $G^{\prime}$ of our initial graph $G$ where the key quantitative aspects of $G$ are controlled by the corresponding quantities of $G^{\prime}$ . Here ‘simple’ graphs have many primes occurring in $\gcd(v,w)$ for $(v,w)\in\mathcal{E}$ to a fixed exponent, whilst for subgraphs to maintain control over the original graph we need to maintain sufficiently many edges relative to the number of vertices. This leads us to our last four definitions:

Definition 6.6 (Quantities associated to GCD graphs).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph.

(a)

If $\mu(\mathcal{V}),\mu(\mathcal{W})>0$ , then we define the edge density of $G$ by

[TABLE]

If $\mu(\mathcal{V})=0$ or $\mu(\mathcal{W})=0$ , we define the edge density of $G$ to be [math]. 2. (b)

The neighbourhood sets are defined by

[TABLE]

and similarly

[TABLE] 3. (c)

We let $\mathcal{R}(G)$ be given by

[TABLE]

That is to say $\mathcal{R}(G)$ is the set of primes occurring in a GCD which we haven’t yet accounted for. We split this into two further subsets:

[TABLE]

and

[TABLE] 4. (d)

The quality of $G$ is defined by

[TABLE]

where $\delta$ is the edge density of $G$ .

*Remark**.*

If $\mu(\mathcal{V}),\mu(\mathcal{W})>0$ we see that

[TABLE]

As mentioned in Section 3, there are two natural candidates for a quantity to increment; either $\delta$ or $\mu(\mathcal{V})\mu(\mathcal{W})\prod_{p\in\mathcal{P}}p^{|f(p)-g(p)|}$ (this is the natural generalization to non-squarefree integers). One should essentially think of the quality as a ‘hybrid’ of the two quantities, but with some additional factors which are included for technical reasons. The factor

[TABLE]

always lies in the interval $[1,\zeta(31/30)^{10}]$ , and so is always of size bounded away from 0 and from $\infty$ . This factor is included merely for convenience, and allows us to have a quality increment even if there is a tiny loss in our arguments in terms of $p$ . The factor

[TABLE]

is crucial for the proof of a quality increment in Lemma 14.1 and Proposition 8.2. This is related to the technical point that it is vital that the weights of our vertices contain the factor $\varphi(q)/q$ . We will discuss this feature in more detail in Section 15.

We will repeatedly make use of some trivial properties of GCD graphs, given by Lemma 6.7 below, without further comment.

Lemma 6.7 (Basic properties of GCD graphs).

Let $G_{1},G_{2},G_{3}$ be GCD graphs.

(a)

The property of being a GCD subgraph is transitive: If $G_{1}\preceq G_{2}$ and $G_{2}\preceq G_{3}$ , then $G_{1}\preceq G_{3}$ 2. (b)

If $G_{1}\preceq G_{2}$ , then $\mathcal{R}(G_{1})\subseteq\mathcal{R}(G_{2})$ . 3. (c)

If $G_{1}=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ is non-trivial, then $\mu(\mathcal{V}),\mu(\mathcal{W})>0$ . 4. (d)

Let $G_{1}$ have edge density $\delta$ . Then the following are equivalent:

(i)

$G_{1}$ * is non-trivial.* 2. (ii)

$\delta>0$ . 3. (iii)

$q(G_{1})>0$ .

Proof.

All statements are immediate from the definition of GCD subgraphs and of non-trivial GCD graphs. ∎

*Remark**.*

In part (b) of Lemma 6.7, it is not necessarily the case that $\mathcal{R}^{\flat}(G_{1})\subseteq\mathcal{R}^{\flat}(G_{2})$ nor that $\mathcal{R}^{\sharp}(G_{1})\subseteq\mathcal{R}^{\sharp}(G_{2})$ .

Having introduced all necessary terminology, we turn to the task of establishing Proposition 6.3.

7. Reduction to a good GCD subgraph

In this section, we reduce the proof of Proposition 6.3 (and hence of Theorem 1) to finding a ‘good’ GCD subgraph as described in Proposition 7.1 below. This reduction utilizes some results showing that few integers have lots of fairly small prime factors (based on ‘the anatomy of integers’).

Proposition 7.1 (Existence of a good GCD subgraph).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\emptyset,f_{\emptyset},g_{\emptyset})$ be a GCD graph with trivial set of primes and edge density $\delta>0$ . Assume further that

[TABLE]

for some $t$ satisfying

[TABLE]

Then there is a GCD subgraph $G^{\prime}=(\mu,\mathcal{V}^{\prime},\mathcal{W}^{\prime},\mathcal{E}^{\prime},\mathcal{P}^{\prime},f^{\prime},g^{\prime})$ of $G$ with edge density $\delta^{\prime}>0$ such that:

(a)

$\mathcal{R}(G^{\prime})=\emptyset$ ; 2. (b)

For all $v\in\mathcal{V}^{\prime}$ , we have $\mu(\Gamma_{G^{\prime}}(v))\geqslant(9\delta^{\prime}/10)\mu(\mathcal{W}^{\prime})$ ; 3. (c)

For all $w\in\mathcal{W}^{\prime}$ , we have $\mu(\Gamma_{G^{\prime}}(w))\geqslant(9\delta^{\prime}/10)\mu(\mathcal{V}^{\prime})$ ; 4. (d)

One of the following holds:

(i)

$q(G^{\prime})\gg\delta t^{50}q(G)$ ; 2. (ii)

$q(G^{\prime})\gg q(G)$ , and if $(v,w)\in\mathcal{E}^{\prime}$ and we write them as $v=v^{\prime}\prod_{p\in\mathcal{P}^{\prime}}p^{f^{\prime}(p)}$ and $w=w^{\prime}\prod_{p\in\mathcal{P}^{\prime}}p^{g^{\prime}(p)}$ , then $L_{t}(v^{\prime},w^{\prime})\geqslant 4$ .

Our task is to prove that Proposition 7.1 implies Proposition 6.3. To do so, we need a couple of preparatory lemmas that exploit the condition that $L_{t}(v^{\prime},w^{\prime})\geqslant 4$ in Case (d)-(ii) of Proposition 7.1.

Lemma 7.2 (Bounds on multiplicative functions).

Let $k\in\mathbb{N}$ and write $\tau_{k}$ for the $k$ -th divisor function. If $f$ is a multiplicative function such that $0\leqslant f\leqslant\tau_{k}$ , then

[TABLE]

Proof.

This is [17, Theorem 14.2, p. 145]. ∎

Lemma 7.3 (Few numbers with many prime factors).

For $x,t\geqslant 1$ and $c\in[1,10]$ , we have

[TABLE]

the implied constant is absolute.

Proof.

We may assume that $t$ is large enough, since the result is trivial when $t$ is bounded. Set $T=t^{e^{c-1}}$ , so that $\sum_{t\leqslant p<T}1/p\leqslant c-1/2$ by Mertens’ theorem. Hence

[TABLE]

We wish to apply Lemma 7.2 when $f$ is the multiplicative function with $f(p^{\nu})=e^{2T/p}$ for $p\geqslant T$ and all $\nu\geqslant 1$ , and $f(p^{\nu})=1$ for $p<T$ . In particular, $f(p^{\nu})\leqslant e^{2}\leqslant 8$ for all prime powers $p^{\nu}$ , so that $f\leqslant\tau_{8}$ . Thus

[TABLE]

Since $e^{2T/p}=1+O(T/p)$ for $p\geqslant T$ , and $\sum_{p\geqslant T}T/p^{2}\ll 1$ , the sum of $(e^{2T/p}-1)/p$ over $p\in[T,x]$ is $O(1)$ . We thus conclude that

[TABLE]

Since $T=t^{e^{c-1}}$ , the lemma has been proven. ∎

Proof of Proposition 6.3 assuming Proposition 7.1.

Fix $t\geqslant 1$ and let $G$ be the GCD graph of Proposition 6.3 with set of edges $\mathcal{E}\subseteq\mathcal{E}_{t}$ (where $\mathcal{E}_{t}$ is defined in Proposition 5.4), weight $\mu(v)=\psi(v)\varphi(v)/v$ and edge density $\delta$ .

If $\delta\ll 1/t$ , then $\mu(\mathcal{E})\ll 1/t$ and so we are done. Therefore we may assume that

[TABLE]

Note that this implies that

[TABLE]

We apply Proposition 7.1 to $G$ to find a GCD subgraph $G^{\prime}=(\mu,\mathcal{V}^{\prime},\mathcal{W}^{\prime},\mathcal{E}^{\prime},\mathcal{P}^{\prime},f^{\prime},g^{\prime})$ of $G$ with edge density $\delta^{\prime}$ satisfying either case (d)-(d)(i) or (d)-(d)(ii) of its statement. In addition, we have that:

(a)

$\mathcal{R}(G^{\prime})=\emptyset$ ; 2. (b)

$\mu(\Gamma_{G^{\prime}}(v))\geqslant(9\delta^{\prime}/10)\mu(\mathcal{W}^{\prime})$ for all $v\in\mathcal{V}^{\prime}$ ; 3. (c)

$\mu(\Gamma_{G^{\prime}}(w))\geqslant(9\delta^{\prime}/10)\mu(\mathcal{V}^{\prime})$ for all $w\in\mathcal{W}^{\prime}$ .

Set

[TABLE]

The definition of a GCD graph implies that

[TABLE]

Moreover, since $\mathcal{R}(G^{\prime})=\emptyset$ , and $p^{\min\{f^{\prime}(p),g^{\prime}(p)\}}\|\gcd(v,w)$ for all $(v,w)\in\mathcal{E}^{\prime}$ , we have that

[TABLE]

Now, note that

[TABLE]

as well as

[TABLE]

Consequently, from the definition of $q(\cdot)$ , we find

[TABLE]

Proposition 7.1 offers a lower bound on $q(G^{\prime})/q(G)$ . Since

[TABLE]

we can obtain an upper bound on the size of $\mu(\mathcal{E})$ by estimating $q(G^{\prime})$ from above.

Note that

[TABLE]

where we recall that $M(v,w)=\max\{v\psi(w),w\psi(v)\}$ . Since $\gcd(v,w)=\gcd(a,b)$ for all $(v,w)\in\mathcal{E}^{\prime}$ , we infer that

[TABLE]

The vertex sets $\mathcal{V}^{\prime},\mathcal{W}^{\prime}$ are finite sets of positive integers. For each $v\in\mathcal{V}^{\prime}$ , let $w_{\max}(v)$ be the largest integer in $\mathcal{W}^{\prime}$ such that $(v,w_{\max}(v))\in\mathcal{E}^{\prime}$ . (This quantity is well-defined in virtue of property (b) above. In addition, we emphasise to the reader that ‘largest’ refers to the size of elements as positive integers, and does not depend on the measure $\mu$ .) Similarly, for each $w\in\mathcal{W}^{\prime}$ , let $v_{\max}(w)$ be the largest element of $\mathcal{V}^{\prime}$ such that $(v_{\max}(w),w)\in\mathcal{E}^{\prime}$ . Consequently,

[TABLE]

Now, let $w_{0}$ be the largest integer in $\mathcal{W}^{\prime}$ and $\mathcal{E}^{\prime\prime}=\{(v,w)\in\mathcal{E}^{\prime}:(v,w_{0})\in\mathcal{E}^{\prime}\}$ . We then have

[TABLE]

In addition, since $G^{\prime}$ satisfies conditions (b) and (c) in the statement of Proposition 7.1, we have

[TABLE]

Substituting this bound into (7.2), we find

[TABLE]

Here we used the trivial bound $\delta^{\prime}\leqslant 1$ in the second inequality. In addition,

[TABLE]

Since $a|v$ and $b|w$ , we have $\varphi(v)/v\leqslant\varphi(a)/a$ and $\varphi(w)/w\leqslant\varphi(b)/b$ . Therefore

[TABLE]

Together with (7.3), (7.4) and (7.5), this implies that

[TABLE]

We now split our argument depending on whether (d)-(d)(i) or (d)-(d)(ii) of Proposition 7.1 holds.

Case 1: (d)-(d)(i) of Proposition 7.1 holds.

In this case we have $q(G^{\prime})\gg\delta t^{50}q(G)$ . Writing $v=v^{\prime}a$ and $w=w^{\prime}b$ , we find that

[TABLE]

Together with (7.6), this implies that

[TABLE]

Since $q(G^{\prime})\gg\delta t^{50}q(G)$ in this case, and since $\delta\geqslant 1/t$ , this gives

[TABLE]

This establishes Proposition 6.3 in this case.

Case 2: (d)-(d)(ii) of Proposition 7.1 holds.

Write $v=v^{\prime}a$ and $w=w^{\prime}b$ . In this case

[TABLE]

We also have $q(G^{\prime})\gg q(G)$ .

From (7.7), we see that either

[TABLE]

whenever $(v,w)\in\mathcal{E}^{\prime}$ . Consequently,

[TABLE]

where

[TABLE]

For $S_{1}$ , we note that

[TABLE]

by Lemma 7.3, since $\exp(t^{e})\gg\exp(t)t^{2}$ . Similarly for $S_{2}$ , we find that

[TABLE]

by applying Lemma 7.3 once again. Substituting these bounds into (7.6), we conclude that

[TABLE]

Since we have $q(G)\ll q(G^{\prime})$ and $\delta\geqslant 1/t$ , this gives

[TABLE]

This establishes Proposition 6.3 in all cases. ∎

Thus we are left to prove Proposition 7.1.

8. Reduction of Proposition 7.1 to three iterative propositions

We will prove Proposition 7.1 by an iterative argument, where we repeatedly find GCD subgraphs with progressively nicer properties. In this section we reduce the proof to five technical iterative statements, given by three key propositions (Propositions 8.1-8.3) and two auxiliary lemmas (Lemmas 8.4-8.5) given below.

Proposition 8.1 (Iteration when $\mathcal{R}^{\flat}(G)\neq\emptyset$ ).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph with edge density $\delta>0$ such that

[TABLE]

Then there is a GCD subgraph $G^{\prime}$ of $G$ with edge density $\delta^{\prime}>0$ and multiplicative data $(\mathcal{P}^{\prime},f^{\prime},g^{\prime})$ such that

[TABLE]

where $N=\#\{p\in\mathcal{P}^{\prime}\setminus\mathcal{P}:f^{\prime}(p)\neq g^{\prime}(p)\}$ .

Proposition 8.2 (Iteration when $\mathcal{R}^{\flat}(G)=\emptyset$ ).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph with edge density $\delta>0$ such that

[TABLE]

Then there is a GCD subgraph $G^{\prime}=(\mu,\mathcal{V}^{\prime},\mathcal{W}^{\prime},\mathcal{E}^{\prime},\mathcal{P}^{\prime},f^{\prime},g^{\prime})$ of $G$ such that

[TABLE]

Propositions 8.1 and 8.2 deal with large primes. We need a complementary result that handles the small primes.

Proposition 8.3 (Bounded quality loss for small primes).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\emptyset,f_{\emptyset},g_{\emptyset})$ be a GCD graph with edge density $\delta>0$ and trivial set of primes. Then there is a GCD subgraph $G^{\prime}=(\mu,\mathcal{V}^{\prime},\mathcal{W}^{\prime},\mathcal{E}^{\prime},\mathcal{P}^{\prime},f^{\prime},g^{\prime})$ of $G$ with edge density $\delta^{\prime}>0$ such that

[TABLE]

Finally, we need two further technical estimates. The first one strengthens the quality of the inequality $L_{t}(v,w)\geqslant 10$ when the set $\mathcal{R}^{\flat}(G)$ is empty, whereas the second allows one to pass to a subgraph where all vertices have high degree.

Lemma 8.4 (Removing the effect of $\mathcal{R}(G)$ from $L_{t}(v,w)$ ).

Let $t\geqslant 300$ and $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph with edge density $\delta>0$ such that

[TABLE]

Then there exists a GCD subgraph $G^{\prime}=(\mu,\mathcal{V},\mathcal{W},\mathcal{E}^{\prime},\mathcal{P},f,g)$ of $G$ such that

[TABLE]

Lemma 8.5 (Subgraph with high-degree vertices).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph with edge density $\delta>0$ . Then there is a GCD subgraph $G^{\prime}=(\mu,\mathcal{V}^{\prime},\mathcal{W}^{\prime},\mathcal{E}^{\prime},\mathcal{P},f,g)$ of $G$ with edge density $\delta^{\prime}>0$ such that:

(a)

$q(G^{\prime})\geqslant q(G)$ ; 2. (b)

$\delta^{\prime}\geqslant\delta$ ; 3. (c)

For all $v\in\mathcal{V}^{\prime}$ and for all $w\in\mathcal{W}^{\prime}$ , we have

[TABLE]

Proof of Proposition 7.1 assuming Propositions 8.1-8.3 and Lemmas 8.4-8.5.

We will construct the required subgraph $G^{\prime}$ in several stages. It suffices to produce a GCD subgraph $G^{\prime}$ of $G$ satisfying only conclusions $(a)$ and $(d)$ of Proposition 7.1, since an application of Lemma 8.5 then produces a GCD subgraph satisfying all the conclusions.

Stage 1: Obtaining a GCD subgraph $G^{(1)}$ with $\mathcal{R}(G^{(1)})\subseteq\{p>10^{2000}\}$ .

Since $G$ has set of primes equal to the empy set, we may apply Proposition 8.3 to $G$ to produce a GCD subgraph $G^{(1)}=(\mu,\mathcal{V}^{(1)},\mathcal{W}^{(1)},\mathcal{E}^{(1)},\mathcal{P}^{(1)},f^{(1)},g^{(1)})$ of $G$ with edge density $\delta^{(1)}$ and for which

[TABLE]

In particular, we have

[TABLE]

for any $H\preceq G^{(1)}$ by Lemma 6.7(b).

Stage 2: Obtaining a GCD subgraph $G^{(2)}$ with $\mathcal{R}^{\flat}(G^{(2)})=\emptyset$ .

If $\mathcal{R}^{\flat}(G^{(1)})\neq\emptyset$ , then $G^{(1)}$ satisfies the conditions of Proposition 8.1. We then repeatedly apply Proposition 8.1 to produce a sequence of GCD subgraphs of $G^{(1)}$ given by

[TABLE]

until we obtain a GCD subgraph $G^{(2)}$ of $G^{(1)}$ which does not satisfy the conditions of Proposition 8.1. Since $\mathcal{R}(G^{(1)}_{i+1})\subsetneq\mathcal{R}(G^{(1)}_{i})$ and $\mathcal{R}(G^{(1)})$ is a finite set, this process must indeed terminate after a finite number of steps and produce a GCD graph $G^{(2)}:=(\mu,\mathcal{V}^{(2)},\mathcal{W}^{(2)},\mathcal{E}^{(2)},\mathcal{P}^{(2)},f^{(2)},g^{(2)})\preceq G^{(1)}$ that does not satisfy the conditions of Proposition 8.1. Since $\mathcal{R}(G^{(2)})\subseteq\{p>10^{2000}\}$ by (8.2), it must be the case that

[TABLE]

In addition, Proposition 8.1 implies that

[TABLE]

where

[TABLE]

Together with (8.1), this yields that

[TABLE]

On the other hand, if $\mathcal{R}^{\flat}(G^{(1)})=\emptyset$ , then we simply take $G^{(2)}=G^{(1)}$ and note that (8.3) is trivially satisfied by (8.1).

This completes Stage 2. The remaining part of the proof deviates according to whether the ratio $q(G^{(2)})/q(G)$ is larger or smaller than $(t/10)^{50}\delta/10^{10^{3000}}$ .

Case (a): $q(G^{(2)})/q(G)\geqslant(t/10)^{50}\delta/10^{10^{3000}}$ .

In this case we do not need to keep track of the condition that $L_{t}(v,w)\geqslant 10$ because we have a very large gain in the quality of the new graph. The next stage of the argument is then:

Stage 3a: Obtaining a GCD subgraph with $\mathcal{R}(G^{(3\text{a})})=\emptyset$ .

Notice that if $H\preceq G^{(2)}$ , then $\mathcal{R}(H)\subseteq\{p>10^{2000}\}$ by (8.2). Consequently, if $\mathcal{R}(H)\neq\emptyset$ , then either Proposition 8.1 or Proposition 8.2 is applicable to $H$ , thus producing a GCD subgraph $H^{\prime}$ of $H$ such that

[TABLE]

Since $\mathcal{R}(G^{(2)})$ is finite, starting with $H_{1}=G^{(2)}$ and iterating the above fact, we can construct a finite sequence of GCD subgraphs

[TABLE]

such that

[TABLE]

Applying the assumption that $q(G^{(2)})/q(G)\geqslant(t/10)^{50}\delta/10^{10^{3000}}$ , we infer that

[TABLE]

Hence, the GCD graph $G^{\prime}=G^{(3\text{a})}$ satisfies condition (a) and condition (d)-(d)(i) of Proposition 7.1, giving the result in this case. (Recall that we may also guarantee conditions (b) and (c) of Proposition 7.1 by feeding our graph into Lemma 8.5.)

In order to complete the proof of Proposition 7.1, it remains to consider the situation when $q(G^{(2)})/q(G)$ is not large.

Case (b): $q(G^{(2)})/q(G)<(t/10)^{50}\delta/10^{10^{3000}}$ .

In this case, the quality increment is small and we must make sure not to lose track of the condition $L_{t}(v,w)\geqslant 10$ . For this reason, we perform some cosmetic surgery to our graph before applying Proposition 8.2. This consists of Stage 3b that we present below.

Stage 3b: Removing the effect of primes in $\mathcal{R}(G^{(2)})$ from the anatomical condition $L_{t}(v,w)\geqslant 10$ .

Note that (8.3) implies that

[TABLE]

and that

[TABLE]

where we recall that

[TABLE]

(here we used the trivial bound $\delta\leqslant 1$ ).

Since $\delta^{(2)}\geqslant(10/t)^{50}$ , $\mathcal{R}^{\flat}(G^{(2)})=\emptyset$ , and $L_{t}(v,w)\geqslant 10$ for all $(v,w)\in\mathcal{E}^{(2)}$ , it is the case that $G^{(2)}$ satisfies the conditions of Lemma 8.4. Consequently, there exists a GCD subgraph $G^{(3\text{b})}=(\mu,\mathcal{V}^{(3\text{b})},\mathcal{W}^{(3\text{b})},\mathcal{E}^{(3\text{b})},\mathcal{P}^{(3\text{b})},f^{(3\text{b})},g^{(3\text{b})})$ of $G^{(2)}$ with

[TABLE]

and such that

[TABLE]

We claim that an inequality of the form (8.8) holds even if we remove from consideration the primes lying in the set

[TABLE]

It turns out that we can do this rather crudely, starting from the estimate

[TABLE]

Recalling that $t>10^{2000}$ and $\mathcal{P}^{(1)}\subseteq\{p\leqslant 10^{2000}\}$ , we deduce that

[TABLE]

Since $t\geqslant 10^{2000}$ , relation (8.5) implies that $N\leqslant 2\log(t^{50})=100\log t<t$ , that is to say the right hand side of (8.9) is $\leqslant 1$ . As a consequence,

[TABLE]

Having removed the effect to the condition $L_{t}(v,w)\geqslant 10$ of primes from the sets $\mathcal{R}(G^{(2)})\cup\mathcal{P}_{\text{diff}}^{(2)}$ , we are ready to complete the construction of $G^{\prime}$ in Case (b).

Stage 4b: Obtaining a GCD subgraph with $\mathcal{R}(G^{(4\text{b})})=\emptyset$ .

We argue as in Stage 3a: for each $H\preceq G^{(3\text{b})}$ , we have $\mathcal{R}(H)\subseteq\{p>10^{2000}\}$ by (8.2). Hence, if $\mathcal{R}(H)\neq\emptyset$ , then either Proposition 8.1 or Proposition 8.2 is applicable to $H$ , thus producing a GCD subgraph $H^{\prime}$ of $H$ such that

[TABLE]

where $\mathcal{P}_{H}$ and $\mathcal{P}_{H^{\prime}}$ denote the set of primes of $H$ and of $H^{\prime}$ , respectively. Since $\mathcal{R}(G^{(3\text{b})})$ is finite, starting with $H_{1}=G^{(3\text{b})}$ and iterating the above fact, we can construct a finite sequence of GCD subgraphs

[TABLE]

such that

[TABLE]

In addition, note that

[TABLE]

where the second relation follows by fact (8.6) that $\mathcal{P}^{(3\text{b})}=\mathcal{P}^{(2)}$ . We now verify that if we let

[TABLE]

then condition (d)-(d)(ii) of Proposition 7.1 is satisfied. This suffices for the completion of the proof, since $G^{\prime}$ clearly satisfies condition (a) of Proposition 7.1, and an application of Lemma 8.5 can also ensure conditions (b) and (c).

First of all, note that by (8.7) and (8.3) and $q(G^{(4\text{b})})\geqslant q(G^{(3\text{b})})$ , we have

[TABLE]

Let $(v,w)\in\mathcal{E}^{(4\text{b})}$ . It remains to check that $L_{t}(v^{\prime},w^{\prime})\geqslant 4$ , where $v^{\prime}$ and $w^{\prime}$ are defined by the relations

[TABLE]

By the definition of the set $\mathcal{R}(G^{(4\text{b})})$ and since $\mathcal{R}(G^{(4b)})=\emptyset$ , all prime factors of $\gcd(v,w)$ belong to $\mathcal{P}^{(4\text{b})}$ . But for each prime $p\in\mathcal{P}^{(4\text{b})}$ we have $p^{\min\{f^{(4\text{b})}(p),g^{(4\text{b})}(p)\}}\|\gcd(v,w)$ . Thus

[TABLE]

In particular, we must have that

[TABLE]

Now, let $p$ be a prime such that

[TABLE]

Since $p\nmid v^{\prime}w^{\prime}$ but $p|vw$ , we must have $p\in\mathcal{P}^{(4\text{b})}$ , and so $p^{\min\{f^{(4\text{b})}(p),g^{(4\text{b})}(p)\}}\|\gcd(v,w)$ . In addition, our assumptions that $p\nmid v^{\prime}w^{\prime}$ and $p|vw/\gcd(v,w)^{2}$ imply that $f^{(4\text{b})}(p)\neq g^{(4\text{b})}(p)$ . If $p\in\mathcal{P}^{(2)}$ , we infer that $p\in\mathcal{P}^{(2)}_{\text{diff}}$ . On the other hand, if $p\notin\mathcal{P}^{(2)}$ , then the inclusion $\mathcal{P}^{(4\text{b})}\subseteq\mathcal{P}^{(2)}\cup\mathcal{R}(G^{(2)})$ implies that $p\in\mathcal{R}(G^{(2)})$ . In either case, we have that $p\in\mathcal{P}^{(2)}_{\text{diff}}\cup\mathcal{R}(G^{(2)})$ . Thus, since $\mathcal{E}^{(4\text{b})}\subseteq\mathcal{E}^{(3\text{b})}$ , we may use the bound (8.10), which gives

[TABLE]

In particular, $G^{\prime}=G^{(4\text{b})}$ satisfies the conditions of case (d)-(d)(ii) of Proposition 7.1. This completes the proof of Proposition 7.1 in Case (b) too. ∎

Thus we are left to establish Propositions 8.1-8.3 and Lemmas 8.4-8.5. We begin with the last two results because they are easier to establish.

9. Proof of Lemma 8.4

In this section we establish Lemma 8.4 directly.

For brevity, let

[TABLE]

We have

[TABLE]

Fix for the moment a prime $p\in\mathcal{R}(G)$ . Since we have $\mathcal{R}^{\flat}(G)=\emptyset$ , it must be the case that $p\in\mathcal{R}^{\sharp}(G)$ , that is to say there exists some $k\in\mathbb{Z}_{\geqslant 0}$ such that

[TABLE]

Now we note that if $p|vw/\gcd(v,w)^{2}$ , then $p^{j}\|v$ and $p^{\ell}\|w$ for some $j\neq\ell$ . In particular we cannot have $p^{k}\|v$ and $p^{k}\|w$ . Thus

[TABLE]

Thus we conclude that

[TABLE]

where in the final line we used the fact that $\delta=\mu(\mathcal{E})/\mu(\mathcal{V})\mu(\mathcal{W})\geqslant(10/t)^{50}$ .

Now, let us define

[TABLE]

Evidently, we have that

[TABLE]

Thus $\mu(\mathcal{E}^{\prime})\geqslant 99\mu(\mathcal{E})/100$ . We then take $G^{\prime}:=(\mu,\mathcal{V},\mathcal{W},\mathcal{E}^{\prime},\mathcal{P},f,g)$ and note that

[TABLE]

Finally, we note that

[TABLE]

for $t\geqslant 300$ by [20, Theorem 5], and so if $(v^{\prime},w^{\prime})\in\mathcal{E}^{\prime}$ then

[TABLE]

Hence, since $\mathcal{E}^{\prime}\subseteq\mathcal{E}\subseteq\{(v,w)\in\mathcal{V}\times\mathcal{W}:L_{t}(v,w)\geqslant 10\}$ , for any $(v^{\prime},w^{\prime})\in\mathcal{E}^{\prime}$ we have

[TABLE]

This completes the proof of Lemma 8.4.∎

We are left to establish Propositions 8.1-8.3 and Lemma 8.5.

10. Proof of Lemma 8.5

In this section we establish Lemma 8.5. We begin with an auxiliary lemma.

Lemma 10.1 (Quality increment or all vertices have high degree).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph with edge density $\delta>0$ . For each $v\in\mathcal{V}$ and for each $w\in\mathcal{W}$ , we let

[TABLE]

be the sets of their neighbours. Then one of the following holds:

(a)

For all $v\in\mathcal{V}$ and for all $w\in\mathcal{W}$ , we have

[TABLE] 2. (b)

There is a GCD subgraph $G^{\prime}=(\mu,\mathcal{V}^{\prime},\mathcal{W}^{\prime},\mathcal{E}^{\prime},\mathcal{P},f,g)$ of $G$ with edge density $\delta^{\prime}\geqslant\delta$ , quality $q(G^{\prime})\geqslant q(G)$ , and such that either $\mathcal{V}^{\prime}\subsetneq\mathcal{V}$ or $\mathcal{W}^{\prime}\subsetneq\mathcal{W}$ .

Proof.

Assume that (a) fails. Then either its first or its second inequality fails. Assume that the first one fails for some $v\in\mathcal{V}$ ; the other case is entirely analogous. Let $\mathcal{E}^{\prime}$ be the set of edges between the vertex sets $\mathcal{V}\setminus\{v\}$ and $\mathcal{W}$ . Note that

[TABLE]

because $\mu(\Gamma_{G}(v))<9\delta\mu(\mathcal{W})/10$ , $\mu(v)\leqslant\mu(\mathcal{V})$ , and $\mu(\mathcal{E})>0$ by the assumption $\delta>0$ . In particular, we have $\mu(\mathcal{W}),\mu(\mathcal{V}\setminus\{v\})>0$ . We then consider $G^{\prime}=(\mu,\mathcal{V}\setminus\{v\},\mathcal{W},\mathcal{E}^{\prime},\mathcal{P},f,g)$ , which is a GCD subgraph of $G$ . Let $G^{\prime}$ have edge density $\delta^{\prime}$ . We claim that $\delta^{\prime}\geqslant\delta$ and $q(G^{\prime})\geqslant q(G)$ .

Indeed, we have

[TABLE]

Thus the edge density $\delta^{\prime}$ of $G^{\prime}$ satisfies

[TABLE]

Thus we see that $\delta^{\prime}\geqslant\delta$ , and that

[TABLE]

This proves our claim that $q(G^{\prime})\geqslant q(G)$ too, thus completing the proof of the lemma. ∎

Proof of Lemma 8.5.

We note that conclusion $(c)$ of Lemma 8.5 is the same as conclusion $(a)$ of Lemma 10.1. Thus, if $G$ does not satisfy conclusion $(c)$ of Lemma 8.5, then we may repeatedly apply Lemma 10.1 to produce a sequence of GCD subgraphs

[TABLE]

until we arrive at a GCD subgraph of $G$ which satisfies conclusion $(a)$ of Lemma 10.1. This process must terminate after a finite number of steps since at least one of the vertex sets of $G_{i+1}$ has one less element than the corresponding vertex set of $G_{i}$ . Let the process terminate at $G_{J}$ , which satisfies conclusion $(a)$ of Lemma 10.1, and let $\delta_{i}$ be the edge density of $G_{i}$ . Since $\delta_{i+1}\geqslant\delta_{i}$ and $q(G_{i+1})\geqslant q(G_{i})$ by Lemma 10.1, we have that

[TABLE]

Since the multiplicative data are also maintained at each iteration, we see that taking $G^{\prime}=G_{J}$ gives the result. ∎

Thus we are left to establish Propositions 8.1-8.3.

11. Preparatory Lemmas on GCD graphs

Our remaining task is to prove Propositions 8.1-8.3. Before we attack these directly, we establish various preliminary results about GCD graphs in this section, which we will then use in the remaining sections to prove Propositions 8.1-8.3.

Lemma 11.1 (Quality variation for special GCD subgraphs).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph, $p\in\mathcal{R}(G)$ and $k,\ell\in\mathbb{Z}_{\geqslant 0}$ . If $G_{p^{k},p^{\ell}}$ is as in Definition 6.5, then $G_{p^{k},p^{\ell}}$ is a GCD subgraph of $G$ . In addition, if $G$ is non-trivial and $\mu(\mathcal{V}_{p^{k}}),\mu(\mathcal{W}_{p^{\ell}})>0$ , then we have

[TABLE]

Proof.

This follows directly from the definitions. ∎

Lemma 11.2 (One subgraph must have limited quality loss).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph with edge density $\delta>0$ , and let $\mathcal{V}=\mathcal{V}_{1}\sqcup\dots\sqcup\mathcal{V}_{I}$ and $\mathcal{W}=\mathcal{W}_{1}\sqcup\dots\sqcup\mathcal{W}_{J}$ be partitions of $\mathcal{V}$ and $\mathcal{W}$ . Then there is a GCD subgraph $G^{\prime}=(\mu,\mathcal{V}^{\prime},\mathcal{W}^{\prime},\mathcal{E}^{\prime},\mathcal{P},f,g)$ of $G$ with edge density $\delta^{\prime}>0$ such that

[TABLE]

and with $\mathcal{V}^{\prime}\in\{\mathcal{V}_{1},\dots,\mathcal{V}_{I}\}$ , $\mathcal{W}^{\prime}\in\{\mathcal{W}_{1},\dots,\mathcal{W}_{J}\}$ , and $\mathcal{E}^{\prime}=\mathcal{E}\cap(\mathcal{V}^{\prime}\times\mathcal{W}^{\prime})$ .

Proof.

For brevity let $\mathcal{E}_{i,j}=\mathcal{E}\cap(\mathcal{V}_{i}\times\mathcal{W}_{j})$ be the edges between $\mathcal{V}_{i}$ and $\mathcal{W}_{j}$ for $i\in\{1,\dots,I\}$ and $j\in\{1,\dots,J\}$ . Since the partitions of $\mathcal{V}$ and $\mathcal{W}$ induce a partition $\mathcal{E}=\sqcup_{i=1}^{I}\sqcup_{j=1}^{J}\mathcal{E}_{i,j}$ of $\mathcal{E}$ , we have

[TABLE]

Thus, by the pigeonhole principle, there is a choice of $i_{0}$ and $j_{0}$ such that $\mu(\mathcal{E}_{i_{0},j_{0}})\geqslant\mu(\mathcal{E})/(IJ)>0$ . We then let $G^{\prime}=(\mu,\mathcal{V}_{i_{0}},\mathcal{W}_{j_{0}},\mathcal{E}_{i_{0},j_{0}},\mathcal{P},f,g)$ , which is clearly a non-trivial GCD subgraph of $G$ . We see that

[TABLE]

and

[TABLE]

This gives the result. ∎

Lemma 11.3 (Few edges between unbalanced sets, I).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph with edge density $\delta>0$ . Let $p\in\mathcal{R}(G)$ , $r\in\mathbb{Z}_{\geqslant 1}$ and $k\in\mathbb{Z}_{\geqslant 0}$ be such that $p^{r}>10^{2000}$ and

[TABLE]

(In particular, if $p\leqslant 10^{40}$ , the last hypothesis is vacuous.)

If we set $\mathcal{L}_{k,r}=\{\ell\in\mathbb{Z}_{\geqslant 0}:|\ell-k|\geqslant r+1\}$ and write $\delta_{p^{k},p^{\ell}}$ for the edge density of the graph $G_{p^{k},p^{\ell}}$ , then one of the following holds:

(a)

There is $\ell\in\mathcal{L}_{k,r}$ such that $q(G_{p^{k},p^{\ell}})>2q(G)$ and $\delta_{p^{k},p^{\ell}}q(G_{p^{k},p^{\ell}})>2\delta q(G)>0$ . 2. (b)

$\sum_{\ell\in\mathcal{L}_{k,r}}\mu(\mathcal{E}_{p^{k},p^{\ell}})\leqslant\mu(\mathcal{E})/(4p^{31/30})$ .

Proof.

Assume that conclusion $(b)$ does not hold, so $\sum_{\ell\in\mathcal{L}_{k,r}}\mu(\mathcal{E}_{p^{k},p^{\ell}})>\mu(\mathcal{E})/(4p^{31/30})$ and we wish to establish $(a)$ . Then there must exist some $\ell\in\mathcal{L}_{k,r}$ such that

[TABLE]

where we used that $\sum_{|j|\geqslant 0}2^{-|j|/20}\leqslant 2/(1-2^{-1/20})\leqslant 60$ . In particular, $G_{p^{k},p^{\ell}}$ is a non-trivial GCD graph. Since $\mu(\mathcal{W}_{p^{k}})\geqslant(1-10^{40}/p)\mu(\mathcal{W})$ , we have that $\mu(\mathcal{W}_{p^{\ell}})\leqslant 10^{40}\mu(\mathcal{W})/p$ . Consequently,

[TABLE]

Since $|k-\ell|\geqslant r+1\geqslant 2r/3+4/3$ , we have

[TABLE]

In addition, note that $(p/2^{1/2})\geqslant p^{1/2}$ for all primes. Therefore

[TABLE]

by our assumption that $p^{r}>10^{2000}$ .

Similarly, we have

[TABLE]

Since $p/2^{11/20}\geqslant p^{9/20}$ and $p^{r}>10^{2000}$ , we conclude that

[TABLE]

This completes the proof of the lemma. ∎

The symmetric version of Lemma 11.3 to the above one also clearly holds:

Lemma 11.4 (Few edges between unbalanced sets, II).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph with edge density $\delta>0$ . Let $p\in\mathcal{R}(G)$ , $r\in\mathbb{Z}_{\geqslant 1}$ and $\ell\in\mathbb{Z}_{\geqslant 0}$ be such that $p^{r}>10^{2000}$ and

[TABLE]

and set $\mathcal{K}_{\ell,r}=\{k\in\mathbb{Z}_{\geqslant 0}:|\ell-k|\geqslant r+1\}$ . If $\delta_{p^{k},p^{\ell}}$ denotes the edge density of the graph $G_{p^{k},p^{\ell}}$ , then one of the following holds:

(a)

There is $k\in\mathcal{K}_{\ell,r}$ such that $q(G_{p^{k},p^{\ell}})>2q(G)$ and $\delta_{p^{k},p^{\ell}}q(G_{p^{k},p^{\ell}})>2\delta q(G)>0$ . 2. (b)

$\sum_{k\in\mathcal{K}_{\ell,r}}\mu(\mathcal{E}_{p^{k},p^{\ell}})\leqslant\mu(\mathcal{E})/(4p^{31/30})$ .

Next, we prove a lemma about the connectivity of small vertex sets of a GCD graph.

Lemma 11.5 (Few edges between small sets).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph with edge density $\delta>0$ and let $\eta\in(0,1)$ . Then one of the following holds:

(a)

For all sets $\mathcal{A}\subseteq\mathcal{V}$ and $\mathcal{B}\subseteq\mathcal{W}$ such that $\mu(\mathcal{A})\leqslant\eta\cdot\mu(\mathcal{V})$ and $\mu(\mathcal{B})\leqslant\eta\cdot\mu(\mathcal{W})$ , we have $\mu(\mathcal{E}\cap(\mathcal{A}\times\mathcal{B}))\leqslant\eta^{9/5}\cdot\mu(\mathcal{E})$ . 2. (b)

There is a GCD subgraph $G^{\prime}=(\mu,\mathcal{V}^{\prime},\mathcal{W}^{\prime},\mathcal{E}^{\prime},\mathcal{P},f,g)$ of $G$ such that $q(G^{\prime})>q(G)$ , $\mathcal{V}^{\prime}\subsetneq\mathcal{V}$ and $\mathcal{W}^{\prime}\subsetneq\mathcal{W}$ .

Proof.

Assume that (a) fails. Hence, there exist sets $\mathcal{A}\subseteq\mathcal{V}$ and $\mathcal{B}\subseteq\mathcal{W}$ such that $\mu(\mathcal{A})\leqslant\eta\cdot\mu(\mathcal{V})$ , $\mu(\mathcal{B})\leqslant\eta\cdot\mu(\mathcal{W})$ and $\mu(\mathcal{E}\cap(\mathcal{A}\times\mathcal{B}))>\eta^{9/5}\cdot\mu(\mathcal{E})$ . We then set $\mathcal{E}^{\prime}=\mathcal{E}\cap(\mathcal{A}\times\mathcal{B})$ and consider the GCD subgraph $G^{\prime}=(\mu,\mathcal{A},\mathcal{B},\mathcal{E}^{\prime},\mathcal{P},f,g)$ of $G$ . Since $\mu(\mathcal{E}^{\prime})>0$ , this is a non-trivial GCD graph. In addition, since $\mu(\mathcal{V})>0$ (because $G$ is non-trivial) and $\eta<1$ (by assumption), we have $\mu(\mathcal{A})\leqslant\eta\mu(\mathcal{V})<\mu(\mathcal{V})$ , and thus $\mathcal{A}\subsetneq\mathcal{V}$ . Similarly, we find that $\mathcal{B}\subsetneq\mathcal{W}$ . Finally, for the quality of $G^{\prime}$ , we have

[TABLE]

This completes the proof of the lemma. ∎

By iterating this lemma, we arrive at the following result.

Lemma 11.6 (Subgraph with few edges between all small sets).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph with edge density $\delta>0$ , and let $\eta\in(0,1)$ . Then there is a GCD subgraph $G^{\prime}=(\mu,\mathcal{V}^{\prime},\mathcal{W}^{\prime},\mathcal{E}^{\prime},\mathcal{P},f,g)$ of $G$ with edge density $\delta^{\prime}>0$ such that both of the following hold:

(a)

$q(G^{\prime})\geqslant q(G)>0$ . 2. (b)

For all sets $\mathcal{A}\subseteq\mathcal{V}^{\prime}$ and $\mathcal{B}\subseteq\mathcal{W}^{\prime}$ such that $\mu(\mathcal{A})\leqslant\eta\cdot\mu(\mathcal{V}^{\prime})$ and $\mu(\mathcal{B})\leqslant\eta\cdot\mu(\mathcal{W}^{\prime})$ , we have $\mu(\mathcal{E}^{\prime}\cap(\mathcal{A}\times\mathcal{B}))\leqslant\eta^{9/5}\mu(\mathcal{E}^{\prime})$ .

Proof.

We note that conclusion $(b)$ of Lemma 11.6 is the same as conclusion $(a)$ of Lemma 11.5. Thus, if $G$ does not satisfy conclusion $(b)$ of Lemma 11.6, then we may repeatedly apply Lemma 11.5 to produce a sequence of GCD subgraphs

[TABLE]

until we arrive at a GCD subgraph of $G$ which satisfies conclusion $(a)$ of Lemma 11.5. This process must terminate after a finite number of steps since $G_{i+1}$ has strictly smaller vertex sets than those of $G_{i}$ . Let the process terminate at $G_{J}$ , which satisfies conclusion $(a)$ of Lemma 11.5. Since $q(G_{i+1})>q(G_{i})$ by Lemma 11.5, we have that

[TABLE]

Lastly, since the multiplicative data are maintained at each iteration, we see that taking $G^{\prime}=G_{J}$ gives the result. ∎

12. Proof of Proposition 8.1

In this section we prove Proposition 8.1, which is the iteration procedure for ‘generic’ primes. This section is essentially self-contained (relying only on the notation of Section 6 and the trivial Lemma 11.1), and serves as a template for the proofs of the harder Propositions 8.2 and 8.3.

Lemma 12.1 (Bounds on edge sets).

Consider a GCD graph $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ and a prime $p\in\mathcal{R}(G)$ . For each $k,\ell\in\mathbb{Z}_{\geqslant 0}$ , let

[TABLE]

Then there exist $k,\ell\in\mathbb{Z}_{\geqslant 0}$ such that $\alpha_{k},\beta_{\ell}>0$ and

[TABLE]

Proof.

Let $\mathcal{X}=\{(k,\ell)\in\mathbb{Z}_{\geqslant 0}^{2}:\alpha_{k},\beta_{\ell}>0\}$ . Note that if $(k,\ell)\in\mathbb{Z}_{\geqslant 0}^{2}\setminus\mathcal{X}$ , then $\mu(\mathcal{E}_{p^{k},p^{\ell}})\leqslant\mu(\mathcal{V}_{p^{k}})\mu(\mathcal{W}_{p^{\ell}})=\alpha_{k}\beta_{\ell}\mu(\mathcal{V})\mu(\mathcal{W})=0$ . Thus $\sum_{(k,\ell)\in\mathcal{X}}\mu(\mathcal{E}_{p^{k},p^{\ell}})=\mu(\mathcal{E})$ . Hence, if we assume that the inequality in the statement of the lemma does not hold for any pair $(k,\ell)\in\mathcal{X}$ , we must have

[TABLE]

where

[TABLE]

and

[TABLE]

Thus, to arrive at a contradiction, it suffices to show that

[TABLE]

First of all, note that $\sum_{|j|\geqslant 1}2^{-|j|/20}=2/(2^{1/20}-1)\leqslant 100$ , whence

[TABLE]

Observing that

[TABLE]

we conclude that

[TABLE]

Since $\alpha_{k},\beta_{\ell}$ are non-negative reals which sum to 1, there exists some $k_{0}\geqslant 0$ such that

[TABLE]

We thus find that

[TABLE]

where we used the Cauchy-Schwarz inequality to bound $\sum_{k}(\alpha_{k}\beta_{k})^{1/2}$ from above. We also find that

[TABLE]

As a consequence,

[TABLE]

The function $x\mapsto x^{2/5}+2(1-x)/5$ is increasing for $0\leqslant x\leqslant 1$ , and so maximized at $x=1$ . Thus we infer that $S_{1}+S_{2}\leqslant 1$ as required, completing the proof of the lemma. ∎

Lemma 12.2 (Quality increment unless a prime power divides almost all).

Consider a GCD graph $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ with edge density $\delta>0$ and a prime $p\in\mathcal{R}(G)$ with $p>10^{40}$ . Then one of the following holds:

(a)

There is a GCD subgraph $G^{\prime}$ of $G$ with multiplicative data $(\mathcal{P}^{\prime},f^{\prime},g^{\prime})$ and edge density $\delta^{\prime}>0$ such that

[TABLE] 2. (b)

There is some $k\in\mathbb{Z}_{\geqslant 0}$ such that

[TABLE]

Proof.

Let $\alpha_{k}$ and $\beta_{\ell}$ be defined as in the statement of Lemma 12.1. Consequently, there are $k,\ell\in\mathbb{Z}_{\geqslant 0}$ such that $\alpha_{k},\beta_{\ell}>0$ and

[TABLE]

In particular, $\mu(\mathcal{E}_{p^{k},p^{\ell}})>0$ , so that $G_{p^{k},p^{\ell}}$ is a non-trivial GCD subgraph of $G$ . We separate two cases, according to whether $k=\ell$ or not.

Case 1: $k=\ell$ .

Let $G^{\prime}=G_{p^{k},p^{k}}$ . Lemma 11.1 and our lower bound $\mu(\mathcal{E}_{p^{k},p^{k}})\geqslant(\alpha_{k}\beta_{k})^{9/10}\mu(\mathcal{E})$ imply that

[TABLE]

In addition,

[TABLE]

This establishes conclusion (a) in this case, noting that $f^{\prime}(p)=g^{\prime}(p)=k$ so $\mathds{1}_{f^{\prime}(p)\neq g^{\prime}(p)}=0$ .

Case 2: $k\neq\ell$

As before, we let $G^{\prime}=G_{p^{k},p^{\ell}}$ , and use Lemma 11.1 and our lower bound on $\mathcal{E}_{p^{k},p^{\ell}}$ to find that

[TABLE]

where

[TABLE]

In addition, we have

[TABLE]

Note that

[TABLE]

Indeed, this follows by our assumption that $k\neq\ell$ , which implies that $\beta_{k}+\beta_{\ell}\leqslant\sum_{j\geqslant 0}\beta_{j}=1$ . Combining the above, we conclude that

[TABLE]

Now, assume that conclusion (a) of the lemma does not hold, so that the left hand side of (12.3) is $\leqslant 2$ . Since $|k-\ell|\geqslant 1$ and all primes are at least $2$ , we must then have that

[TABLE]

where we used our assumption that $p\geqslant 10^{40}$ for the last inequality. In particular, this gives

[TABLE]

We note that

[TABLE]

Thus by the arithmetic-geometric mean inequality, and relations (12.5) and (12.4), we have

[TABLE]

In particular, $\max\{\alpha_{\ell},\beta_{k}\}\geqslant 1/2$ .

We consider the case when $\beta_{k}\geqslant 1/2$ ; the case with $\alpha_{\ell}\geqslant 1/2$ is entirely analogous with the roles of $\beta$ and $\alpha$ swapped, and the roles of $k$ and $\ell$ swapped. Thus, to complete the proof of the lemma, it suffices to show that

[TABLE]

The first inequality of (12.4) states that

[TABLE]

Since $\beta_{k}\geqslant 1/2$ , we infer that

[TABLE]

In particular, $\alpha_{k}\geqslant 1-10^{40}/p$ and $\alpha_{k}\geqslant 1/2$ , whence

[TABLE]

This completes the proof of (12.6) and hence of the lemma. ∎

Proof of Proposition 8.1.

This follows almost immediately from Lemma 12.2. Since $\mathcal{R}(G)\subseteq\{p>10^{2000}\}$ by assumption, if $p\in\mathcal{R}(G)$ then $p>10^{2000}$ . We have also assumed that $\mathcal{R}^{\flat}(G)\neq\emptyset$ . Consequently, there is a prime $p\in\mathcal{R}^{\flat}(G)$ with $p>10^{2000}>10^{40}$ . We now apply Lemma 12.2 with this choice of $p$ . By definition of $\mathcal{R}^{\flat}(G)$ , conclusion $(b)$ cannot hold, and so conclusion $(a)$ must hold. This then gives the result. ∎

We are left to establish Proposition 8.3 and Proposition 8.2.

13. Proof of Proposition 8.3

In this section we prove Proposition 8.3, which is the iteration procedure for small primes. This section relies on the notation of Section 6, Lemma 10.1, the Lemmas 11.1-11.3 from Section 11 and Lemma 12.2. The basic idea of the proof is similar to that of Proposition 8.1, but we can no longer ensure a quality increment when the primes are small; instead we show that there is only a bounded loss.

Lemma 13.1 (Small quality loss or prime power divides positive proportion).

Consider a GCD graph $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ with edge density $\delta>0$ , and let $p\in\mathcal{R}(G)$ be a prime. Then one of the following holds:

(a)

There is a GCD subgraph $G^{\prime}$ of $G$ with multiplicative data $(\mathcal{P}^{\prime},f^{\prime},g^{\prime})$ and edge density $\delta^{\prime}>0$ such that

[TABLE] 2. (b)

There is some $k\in\mathbb{Z}_{\geqslant 0}$ such that

[TABLE]

Proof.

Assume that conclusion $(a)$ does not hold, so we intend to establish $(b)$ . For $k,\ell\in\mathbb{Z}_{\geqslant 0}$ , let $\mu(\mathcal{V}_{p^{k}})=\alpha_{k}\mu(\mathcal{V})$ and $\mu(\mathcal{W}_{p^{\ell}})=\beta_{\ell}\mu(\mathcal{W})$ . We begin as in the proof of Lemma 12.2, by considering $k,\ell\in\mathbb{Z}_{\geqslant 0}$ satisfying (12.1) and the inequalities $\alpha_{k},\beta_{\ell}>0$ . In particular, $G_{p^{k},p^{\ell}}$ is a non-trivial GCD subgraph of $G$ .

We note that the proof of Lemma 12.2 up to relation (12.3) requires no assumption on the size of $p$ . Now, if $k=\ell$ , then Case 1 of the proof of Lemma 12.2 shows that conclusion $(a)$ must hold, contradicting our assumption. Therefore we may assume that $k\neq\ell$ . Now, arguing as in Case 2 of the proof of Lemma 12.2, and setting $G^{\prime}=G_{p^{k},p^{\ell}}$ and

[TABLE]

we infer that

[TABLE]

Therefore we have that

[TABLE]

Since $S\geqslant(\alpha_{k}+\beta_{\ell})(1-\max\{\alpha_{\ell},\beta_{k}\})$ , we have

[TABLE]

so $\max\{\alpha_{\ell},\beta_{k}\}\geqslant 9/10$ . We deal with the case when $\beta_{k}\geqslant 9/10$ ; the case with $\alpha_{\ell}\geqslant 9/10$ is entirely analogous with the roles of $k$ and $\ell$ and the roles of $\alpha$ and $\beta$ swapped.

Since $\beta_{k}\geqslant 9/10$ , we have

[TABLE]

In particular, $\alpha_{k}\geqslant 9/10$ and so conclusion $(b)$ holds, as required. ∎

Lemma 13.2 (Adding small primes to $\mathcal{P}$ ).

Let $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ be a GCD graph with edge density $\delta>0$ . Let $p\in\mathcal{R}(G)$ be a prime with $p\leqslant 10^{2000}$ .

Then there is a GCD subgraph $G^{\prime}$ of $G$ with set of primes $\mathcal{P}^{\prime}$ and edge density $\delta^{\prime}>0$ such that

[TABLE]

Proof.

We first repeatedly apply Lemma 10.1 until we arrive at a GCD subgraph

[TABLE]

of $G$ with edge density $\delta^{(1)}$ such that

[TABLE]

as well as

[TABLE]

(We must eventually arrive at such a subgraph since the vertex sets are strictly decreasing at each stage but can never become empty since the edge density remains bounded away from 0.)

We now apply Lemma 13.1 to $G^{(1)}$ . If conclusion $(a)$ of Lemma 13.1 holds, then there is a GCD subgraph $G^{(2)}$ of $G^{(1)}$ satisfying the conclusion of Lemma 13.2, so we are done by taking $G^{\prime}=G^{(2)}$ . Therefore we may assume that instead conclusion $(b)$ of Lemma 13.1 holds, so there is some $k\in\mathbb{Z}_{\geqslant 0}$ such that

[TABLE]

In fact we claim that either the conclusion of Lemma 13.2 holds, or we have the stronger condition

[TABLE]

Relation (13.2) follows immediately from (13.1) if $p\leqslant 10^{41}$ , so let us assume that $p>10^{41}$ . We then apply Lemma 12.2 to $G^{(1)}$ . If conclusion $(a)$ of Lemma 12.2 holds, then there is a GCD subgraph $G^{(3)}$ of $G^{(1)}$ satisfying the required conditions of Lemma 13.2, so we are done by taking $G^{\prime}=G^{(3)}$ . Therefore we may assume that conclusion $(b)$ of Lemma 12.2 holds, so that there is some $k^{\prime}\geqslant 0$ such that $\mu(\mathcal{V}^{(1)}_{p^{k^{\prime}}})/\mu(\mathcal{V}^{(1)})\geqslant 1-10^{40}/p\geqslant 9/10$ and $\mu(\mathcal{W}^{(1)}_{p^{k^{\prime}}})/\mu(\mathcal{W}^{(1)})\geqslant 1-10^{40}/p\geqslant 9/10$ . Since there cannot be two disjoint subsets of $\mathcal{V}^{(1)}$ of density $\geqslant 9/10$ , we must then have $k^{\prime}=k$ , thus proving (13.2) in this case too.

In conclusion, regardless of the size of $p$ we have established (13.2). Next, we fix an integer $r\leqslant 6644$ such that $p^{r}>10^{2000}$ (such an integer exists because $2^{6644}>10^{2000}$ ) and we apply Lemma 11.3.

If conclusion $(a)$ of Lemma 11.3 holds, then we take $G^{\prime}=G^{(1)}_{p^{k},p^{\ell}}$ , whose quality satisfies

[TABLE]

and whose edge density $\delta^{\prime}$ satisfies

[TABLE]

In particular, $\delta^{\prime}>0$ , so the proof is complete in this case.

Thus we may assume that conclusion $(b)$ of Lemma 11.3 holds, so that

[TABLE]

where we recall the notation $\mathcal{L}_{k,r}:=\{\ell\in\mathbb{Z}_{\geqslant 0}:|\ell-k|\geqslant r+1\}$ . Let

[TABLE]

and let

[TABLE]

be the set of edges between $\mathcal{V}^{(1)}_{p^{k}}$ and $\widetilde{\mathcal{W}}^{(1)}$ in $G^{(1)}$ . Since $\mu(\mathcal{V}^{(1)}_{p^{k}})\geqslant 9\mu(\mathcal{V}^{(1)})/10$ and $\mu(\Gamma_{G^{(1)}}(v))\geqslant 9\delta^{(1)}\mu(\mathcal{W}^{(1)})/10$ for all $v\in\mathcal{V}^{(1)}_{p^{k}}$ , we have

[TABLE]

Let $G^{(2)}=(\mu,\mathcal{V}^{(1)}_{p^{k}},\widetilde{\mathcal{W}}^{(1)},\mathcal{E}^{(2)},\mathcal{P},f,g)$ be the GCD subgraph of $G^{(1)}$ formed by restricting to $\mathcal{V}^{(1)}_{p^{k}}$ and $\widetilde{\mathcal{W}}^{(1)}$ . Since $\mu(\mathcal{E}^{(2)})>0$ , $G^{(2)}$ is a non-trivial GCD subgraph. If $\delta^{(2)}$ denotes its edge density, then

[TABLE]

In addition, we have that

[TABLE]

Finally, we apply Lemma 11.2 to the partition

[TABLE]

of $\widetilde{\mathcal{W}}^{(1)}$ into $\leqslant 2\cdot 6644+1\leqslant 15000$ subsets. This produces a GCD subgraph

[TABLE]

of $G^{(2)}$ for some $\ell\geqslant 0$ with $|\ell-k|\leqslant r$ such that

[TABLE]

In addition, Lemma 11.2 implies that the density of $G^{(3)}$ , call it $\delta^{(3)}$ , satisfies

[TABLE]

Finally, we note that $G^{(1)}_{p^{k},p^{\ell}}$ is a GCD subgraph of $G^{(3)}$ with set of primes $\mathcal{P}\cup\{p\}$ , edge density $\delta^{(1)}_{p^{k},p^{\ell}}=\delta^{(3)}$ , and quality $q(G^{(1)}_{p^{k},p^{\ell}})\geqslant q(G^{(3)})$ . Taking $G^{\prime}=G^{(1)}_{p^{k},p^{\ell}}$ then gives the result. ∎

Proof of Proposition 8.3.

If $\mathcal{R}(G)\cap\{p\leqslant 10^{2000}\}=\emptyset$ , then we can simply take $G^{\prime}=G$ .

If $\mathcal{R}(G)\cap\{p\leqslant 10^{2000}\}\neq\emptyset$ , then we can choose a prime $p\in\mathcal{R}(G)\cap\{p\leqslant 10^{2000}\}$ and apply Lemma 13.2. We do this repeatedly to produce a sequence of GCD subgraphs

[TABLE]

such that

[TABLE]

for each $i$ , where $\delta_{i}$ denotes the edge density of $G_{i}$ . In addition, we let $\mathcal{P}_{i}$ denote the set of primes associated to $G_{i}$ , so that $\emptyset=\mathcal{P}_{1}\subseteq\mathcal{P}_{2}\subseteq\cdots\subseteq\{p\leqslant 10^{2000}\}$ .

At each stage, the set $\mathcal{R}(G_{i})\cap\{p\leqslant 10^{2000}\}$ is strictly smaller than before. So, after at most $10^{2000}$ steps we arrive at a GCD subgraph $G^{(1)}=(\mu,\mathcal{V}^{(1)},\mathcal{W}^{(1)},\mathcal{E}^{(1)},\mathcal{P}^{(1)},f^{(1)},g^{(1)})$ of $G$ with

[TABLE]

Let $\delta^{(1)}$ denote the edge density of the end graph $G^{(1)}$ . Iterating the two inequalities of (13.3) at most $10^{2000}$ times, we find that

[TABLE]

Thus, taking $G^{\prime}=G^{(1)}$ gives the result. ∎

Thus we are just left to establish Proposition 8.2.

14. Proof of Proposition 8.2

Finally, in this section we prove Proposition 8.2, and hence complete the proof of Theorem 1. The proof is similar to that of Proposition 8.1, but more care is required when dealing with the primes coming from $\mathcal{R}^{\sharp}(G)$ .

Lemma 14.1 (Quality increment even when a prime power divides almost all).

Consider a GCD graph $G=(\mu,\mathcal{V},\mathcal{W},\mathcal{E},\mathcal{P},f,g)$ with edge density $\delta>0$ and let $p\in\mathcal{R}(G)$ be a prime with $p\geqslant 10^{2000}$ . Then there is a GCD subgraph $G^{\prime}$ of $G$ with set of primes $\mathcal{P}^{\prime}=\mathcal{P}\cup\{p\}$ such that

[TABLE]

Proof.

First of all, we may assume without loss of generality that for all sets $\mathcal{A}\subseteq\mathcal{V}$ and $\mathcal{B}\subseteq\mathcal{W}$ , we have that

[TABLE]

Indeed, if $G$ does not satisfy (14.1), then we apply Lemma 11.6 with $\eta=10^{40}/p$ to replace $G$ by a non-trivial subgraph $G^{(1)}$ that does have this property (noticing that $(10^{40}/p)^{9/5}\leqslant 1/(2p^{3/2})$ for $p\geqslant 10^{2000}$ ). In addition, $G^{(1)}$ has the same multiplicative data as $G$ and its quality is strictly larger. Hence, we may work with $G^{(1)}$ instead. So, from now on, we assume that (14.1) holds.

We now apply Lemma 12.2. If conclusion $(a)$ of Lemma 12.2 holds, then we are done. Thus we may assume that conclusion $(b)$ holds, that is to say there is some $k\in\mathbb{Z}_{\geqslant 0}$ such that

[TABLE]

In particular, by (14.1) we see that

[TABLE]

Now, set

[TABLE]

with the convention that $\mathcal{V}_{p^{-1}}=\emptyset=\mathcal{W}_{p^{-1}}$ . In view of Lemmas 11.3 and 11.4 applied with $r=1$ , we may assume that

[TABLE]

and

[TABLE]

Hence, if we let

[TABLE]

then (14.2)-(14.4) imply that

[TABLE]

where we used our assumption that $p>10^{2000}$ and the inequality $(1-x)^{2/3}\leqslant 1-2x/3$ for $x\in[0,1]$ that follows from Taylor’s theorem. We then consider the non-trivial GCD subgraph $G^{*}=(\mu,\mathcal{V},\mathcal{W},\mathcal{E}^{*},\mathcal{P},f,g)$ of $G$ formed by restricting the edge set to $\mathcal{E}^{*}$ . Note that

[TABLE]

Now, let $(v,w)\in\mathcal{E}^{*}$ . We have the following five possibilities:

(a)

$v\in\mathcal{V}_{p^{k}}$ and $w\in\mathcal{W}_{p^{k}}$ , in which case $p^{k}\|v,w$ and $p^{k}\|\gcd(v,w)$ ; 2. (b)

$v\in\mathcal{V}_{p^{k}}$ and $w\in\mathcal{W}_{p^{k+1}}$ , in which case $p^{k}|v,w$ and $p^{k}\|\gcd(v,w)$ ; 3. (c)

$v\in\mathcal{V}_{p^{k+1}}$ and $w\in\mathcal{W}_{p^{k}}$ , in which case $p^{k}|v,w$ and $p^{k}\|\gcd(v,w)$ ; 4. (d)

$v\in\mathcal{V}_{p^{k}}$ and $w\in\mathcal{W}_{p^{k-1}}$ , in which case $p^{k}\|v$ , $p^{k-1}\|w$ and $p^{k-1}\|\gcd(v,w)$ ; 5. (e)

$v\in\mathcal{V}_{p^{k-1}}$ and $w\in\mathcal{W}_{p^{k}}$ , in which case $p^{k-1}\|v$ , $p^{k}\|w$ and $p^{k-1}\|\gcd(v,w)$ .

We then set $G^{+}=(\mu,\mathcal{V}^{+},\mathcal{W}^{+},\mathcal{E}^{+},\mathcal{P}\cup\{p\},f^{+},g^{+})$ , where:

[TABLE]

as well as

[TABLE]

By looking at possibilities (a), (b) and (c), it is easy to check that $G^{+}$ is a GCD subgraph of $G^{*}$ (and hence of $G$ ). Note that $\mu(\mathcal{V}^{+})\geqslant\mu(\mathcal{V}_{p^{k}})\geqslant 1-10^{40}/p>0$ . Similarly, we have $\mu(\mathcal{W}^{+})>0$ . Consequently, its quality satisfies the relation

[TABLE]

(This relation is valid even if $\mu(\mathcal{E}^{+})=0$ .) We separate two cases.

Case 1: $k=0$ .

In this case $\mathcal{V}_{p^{k-1}}=\mathcal{W}_{p^{k-1}}=\emptyset$ , so all parameters of $G^{+}$ are the same as those of $G^{*}$ except that the set of primes of $G^{+}$ is $\mathcal{P}\cup\{p\}$ instead of $\mathcal{P}$ and $f$ , $g$ have been extended to take the value 0 at $p$ . As a consequence,

[TABLE]

In particular, by (14.5) we have

[TABLE]

Thus the lemma follows by taking $G^{\prime}=G^{+}$ .

Case 2: $k\geqslant 1$ .

In this case we have

[TABLE]

We also consider the GCD subgraphs $G_{p^{k},p^{k-1}}$ and $G_{p^{k-1},p^{k}}$ of $G$ . Notice that $\mu(\mathcal{V}_{p^{k}})\geqslant 1-10^{40}/p>0$ for $p\geqslant 10^{2000}$ . Hence, if $\mu(\mathcal{W}_{p^{k-1}})>0$ , then Lemma 11.1 implies that

[TABLE]

Similarly, if $\mu(\mathcal{V}_{p^{k-1}})>0$ , then we have

[TABLE]

Since $\mu(\mathcal{V}_{p^{k}})\geqslant(1-10^{40}/p)\mu(\mathcal{V})$ , we have that $\mu(\mathcal{V}_{p^{k-1}})\leqslant 10^{40}\mu(\mathcal{V})/p$ . Similarly, we have that $\mu(\mathcal{W}_{p^{k-1}})\leqslant 10^{40}\mu(\mathcal{W})/p$ . To this end, let $0\leqslant A,B\leqslant 10^{40}$ be such that

[TABLE]

We note that this implies that

[TABLE]

We also note that $\mu(\mathcal{E}_{p^{k},p^{k-1}})\leqslant\mu(\mathcal{V}_{p^{k}})\mu(\mathcal{W}_{p^{k-1}})\leqslant B\mu(\mathcal{V})/p$ , so if $\mu(\mathcal{E}_{p^{k},p^{k-1}})>0$ then $B>0$ . Similarly if $\mu(\mathcal{E}_{p^{k-1},p^{k}})>0$ then $A>0$ .

Combining (14.6) and (14.9) with (14.5), we find

[TABLE]

Similarly, provided $B>0$ , (14.7), (14.9) and (14.5) give

[TABLE]

and, provided $A>0$ , (14.8), (14.9) and (14.5) give

[TABLE]

We now claim that at least one of the following inequalities holds:

[TABLE]

If (14.13) holds then $q(G^{+})\geqslant q(G)$ by (14.10). If (14.14) holds, then $\mu(\mathcal{E}_{p^{k},p^{k-1}})>0$ , so $B>0$ , and so $q(G_{p^{k},p^{k-1}})\geqslant q(G)$ by (14.11) and (14.14). Finally, if (14.15) holds, then $\mu(\mathcal{E}_{p^{k-1},p^{k}})>0$ , so $A>0$ , and so $q(G_{p^{k-1},p^{k}})\geqslant q(G)$ by (14.12) and (14.15). Therefore this claim would complete the proof by choosing $G^{\prime}\in\{G^{+},G_{p^{k},p^{k+1}},G_{p^{k+1},p^{k}}\}$ according to which of the inequalities (14.13)-(14.15) hold.

Since $\mu(\mathcal{E}^{+})+\mu(\mathcal{E}_{p^{k},p^{k-1}})+\mu(\mathcal{E}_{p^{k-1},p^{k}})=\mu(\mathcal{E}^{*})$ , at least one of (14.13)-(14.15) holds if we can prove that

[TABLE]

Using the inequality $1-x\leqslant e^{-x}$ three times, we find that

[TABLE]

Since we also have that $e^{-x}\leqslant 1-x+x^{2}/2$ for $x\geqslant 0$ , as well as $0\leqslant A,B\leqslant 10^{40}$ , we conclude that

[TABLE]

By the arithmetic-geometric mean inequality, we have that $(9A+1)/10\geqslant A^{9/10}$ and $(9B+1)/10\geqslant B^{9/10}$ , whence

[TABLE]

Since $(1-x)^{1/3}\leqslant 1-x/3$ for $x\in[0,1]$ , we must have that $S<1$ for $p\geqslant 10^{2000}$ , thus completing the proof of the lemma. ∎

Proof of Proposition 8.2.

This follows almost immediately from Lemma 14.1. Our assumptions that $\mathcal{R}(G)\subseteq\{p>10^{2000}\}$ and $\mathcal{R}^{\sharp}(G)\neq\emptyset$ imply that there is a prime $p>10^{2000}$ lying in $\mathcal{R}(G)$ . Thus we can apply Lemma 14.1 with this choice of $p$ and complete the proof. ∎

This completes the proof of Proposition 8.2, and hence Theorem 1.

15. Concluding remarks and counterexamples to the Model Problem

It is a vital feature of our proof that the weight of all vertices $v$ has a factor $\varphi(v)/v$ , as naturally arises from the setup of the Duffin-Schaeffer conjecture. This allows our proof to (just) work, but without weights of this type our argument would fail. At first sight this point may appear to be a mere technicality, but without these weights there are genuine counterexamples to the entire approach.

First, let us see where the proof breaks down without the $\varphi(v)/v$ factors. Although most of the argument holds for a general measure $\mu$ , in Proposition 6.3 we specialize to the measure $\mu(v)=\psi(v)\varphi(v)/v$ . In the proof of Proposition 6.3 (in particular, in relation (7.6)), the $\varphi(v)\varphi(w)/vw$ factor cancels out the factor $ab/\varphi(a)\varphi(b)$ coming from

[TABLE]

in the definition of quality. Otherwise, the proof of Proposition 6.3 would fail. On the other hand, if we were to modify the definition of the quality and remove from it the product in (15.1), then instead the proof of Lemma 14.1 would break down and we would not obtain a quality increment when there are many primes dividing a proportion of $1-1/p$ of each vertex set. Thus the argument we present fails without the $\varphi(q)/q$ weights.

Now, let use explain why the presence of the weight $\varphi(v)/v$ is essential for the kind of argument we have given to work. Without using the $\varphi(v)/v$ weights, we essentially are attempting to prove that the Model Problem of Section 3 has an affirmative answer. However, one can construct examples to show that this is not the case. Such examples are based on the observation that all pairwise GCDs of elements of $\{n!/j:\,n/2\leqslant j\leqslant n\}$ are at least $(n-2)!$ , but there is no fixed integer of size $\gg(n-2)!$ dividing a positive proportion of elements of this set. (We thank Sam Chow for showing us this construction.)

Specifically, we select an integer $n\sim\log\log{x}$ , a prime $p\in[x^{1-c}n^{2}/n!,(9/8)x^{1-c}n^{2}/n!]$ , and then take

[TABLE]

It is straightforward to verify that $\mathcal{S}\subseteq[x,2x]$ , and $\#\mathcal{S}\asymp x^{c}$ . Moreover, we see that if $v_{1}=n!pm_{1}/j_{1}$ and $v_{2}=n!pm_{2}/j_{2}$ are two elements of $\mathcal{S}$ , then

[TABLE]

so all pairs $v_{1},v_{2}$ in $\mathcal{S}$ have a large gcd. However, we can easily check that there is no integer $d\gg x^{1-c}$ dividing a positive proportion of elements of $\mathcal{S}$ , and so this shows that the Model Problem of Section 3 has a negative answer.

On the other hand, if we count integers $v$ with weight $\mu(v)=\varphi(v)/v$ , then the set $\mathcal{S}$ we defined above has total weight $\mu(\mathcal{S})\asymp x^{c}/\log{n}$ , and so it fails to be of a sufficiently large size unless we take $n$ bounded (in which case the prime $p$ is of size $\asymp x^{1-c}$ and it divides a positive proportion of the elements of $\mathcal{S}$ ). Thus the above counterexample no longer works if we count integers with weight $\mu$ .

Bibliography24

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] C. Aistleitner, A note on the Duffin-Schaeffer conjecture with slow divergence. Bull. Lond. Math. Soc. 46 (2014), no. 1, 164–168.
2[2] by same author, Decoupling theorems for the Duffin-Schaeffer problem. Progress report (2019), 24 pages, ar Xiv:1907.04590.
3[3] C. Aistleitner, T. Lachmann, M. Munsch, N. Technau, and A. Zafeiropoulos, The Duffin-Schaeffer conjecture with extra divergence. Preprint, https://arxiv.org/abs/1803.05703 .
4[4] V. Beresnevich, V. Bernik, M. Dodson and S. Velani, Classical metric Diophantine approximation revisited. Analytic number theory, 38–61, Cambridge Univ. Press, Cambridge, 2009.
5[5] V. Beresnevich, G. Harman, A. K. Haynes and S. Velani, The Duffin-Schaeffer conjecture with extra divergence II. Math. Z. 275 (2013), no. 1-2, 127–133.
6[6] V. Beresnevich and S. Velani, A mass transference principle and the Duffin-Schaeffer conjecture for Hausdorff measures. Ann. of Math. (2) 164 (2006), no. 3, 971–992.
7[7] P. A. Catlin, Two problems in metric Diophantine approximation. I. J. Number Theory 8 (1976), no. 3, 282–288.
8[8] R. J. Duffin and A. C. Schaeffer, Khinchin’s problem in metric Diophantine approximation. Duke Math. J. 8 (1941), 243–255.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

On the Duffin-Schaeffer conjecture

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

Khinchin’s theorem**.**

Theorem 1**.**

Theorem 2**.**

Remark*.*

Corollary 3**.**

Notation

Acknowledgements

2. Deduction of Theorem 2 from Theorem 1

3. Outline of the proof of Theorem 1

Model Problem**.**

4. Structure of the paper

5. Preliminaries

Lemma 5.1** (Gallagher’s 0-1 law).**

Proof.

Lemma 5.2** (The Duffin-Schaeffer Conjecture when ψ\psiψ only takes large values).**

Proof.

Lemma 5.3** (Bound for λ(Aq∩Ar)\lambda(\mathcal{A}_{q}\cap\mathcal{A}_{r})λ(Aq​∩Ar​)).**

Proof.

Proposition 5.4** (Second moment bound).**

Proof of Theorem 1 assuming Proposition 5.4.

6. Bipartite GCD graphs

Definition 6.1** (GCD graph).**

Definition 6.2** (Non-trivial GCD graph).**

Proposition 6.3** (Edge set bound).**

Proof of Proposition 5.4 assuming Proposition 6.3.

Definition 6.4** (GCD subgraph).**

Definition 6.5** (Special GCD subgraphs from prime power divisibility).**

Definition 6.6** (Quantities associated to GCD graphs).**

Remark*.*

Lemma 6.7** (Basic properties of GCD graphs).**

Proof.

Remark*.*

7. Reduction to a good GCD subgraph

Proposition 7.1** (Existence of a good GCD subgraph).**

Lemma 7.2** (Bounds on multiplicative functions).**

Proof.

Lemma 7.3** (Few numbers with many prime factors).**

Proof.

Proof of Proposition 6.3 assuming Proposition 7.1.

8. Reduction of Proposition 7.1 to three iterative propositions

Proposition 8.1** (Iteration when R♭(G)≠∅\mathcal{R}^{\flat}(G)\neq\emptysetR♭(G)=∅).**

Proposition 8.2** (Iteration when R♭(G)=∅\mathcal{R}^{\flat}(G)=\emptysetR♭(G)=∅).**

Proposition 8.3** (Bounded quality loss for small primes).**

Lemma 8.4** (Removing the effect of R(G)\mathcal{R}(G)R(G) from Lt(v,w)L_{t}(v,w)Lt​(v,w)).**

Lemma 8.5** (Subgraph with high-degree vertices).**

Proof of Proposition 7.1 assuming Propositions 8.1-8.3 and Lemmas 8.4-8.5.

9. Proof of Lemma 8.4

10. Proof of Lemma 8.5

Lemma 10.1** (Quality increment or all vertices have high degree).**

Proof.

Proof of Lemma 8.5.

11. Preparatory Lemmas on GCD graphs

Lemma 11.1** (Quality variation for special GCD subgraphs).**

Proof.

Lemma 11.2** (One subgraph must have limited quality loss).**

Proof.

Lemma 11.3** (Few edges between unbalanced sets, I).**

Proof.

Lemma 11.4** (Few edges between unbalanced sets, II).**

Lemma 11.5** (Few edges between small sets).**

Proof.

Lemma 11.6** (Subgraph with few edges between all small sets).**

Proof.

12. Proof of Proposition 8.1

Lemma 12.1** (Bounds on edge sets).**

Proof.

Lemma 12.2** (Quality increment unless a prime power divides almost all).**

Proof.

Proof of Proposition 8.1.

Khinchin’s theorem.

Theorem 1.

Theorem 2.

*Remark**.*

Corollary 3.

Model Problem.

Lemma 5.1 (Gallagher’s 0-1 law).

Lemma 5.2 (The Duffin-Schaeffer Conjecture when $\psi$ only takes large values).

Lemma 5.3 (Bound for $\lambda(\mathcal{A}_{q}\cap\mathcal{A}_{r})$ ).

Proposition 5.4 (Second moment bound).

Definition 6.1 (GCD graph).

Definition 6.2 (Non-trivial GCD graph).

Proposition 6.3 (Edge set bound).

Definition 6.4 (GCD subgraph).

Definition 6.5 (Special GCD subgraphs from prime power divisibility).

Definition 6.6 (Quantities associated to GCD graphs).

*Remark**.*

Lemma 6.7 (Basic properties of GCD graphs).

*Remark**.*

Proposition 7.1 (Existence of a good GCD subgraph).

Lemma 7.2 (Bounds on multiplicative functions).

Lemma 7.3 (Few numbers with many prime factors).

Proposition 8.1 (Iteration when $\mathcal{R}^{\flat}(G)\neq\emptyset$ ).

Proposition 8.2 (Iteration when $\mathcal{R}^{\flat}(G)=\emptyset$ ).

Proposition 8.3 (Bounded quality loss for small primes).

Lemma 8.4 (Removing the effect of $\mathcal{R}(G)$ from $L_{t}(v,w)$ ).

Lemma 8.5 (Subgraph with high-degree vertices).

Lemma 10.1 (Quality increment or all vertices have high degree).

Lemma 11.1 (Quality variation for special GCD subgraphs).

Lemma 11.2 (One subgraph must have limited quality loss).

Lemma 11.3 (Few edges between unbalanced sets, I).

Lemma 11.4 (Few edges between unbalanced sets, II).

Lemma 11.5 (Few edges between small sets).

Lemma 11.6 (Subgraph with few edges between all small sets).

Lemma 12.1 (Bounds on edge sets).

Lemma 12.2 (Quality increment unless a prime power divides almost all).

Lemma 13.1 (Small quality loss or prime power divides positive proportion).

Lemma 13.2 (Adding small primes to $\mathcal{P}$ ).

Lemma 14.1 (Quality increment even when a prime power divides almost all).