On the Duffin-Schaeffer conjecture
Dimitris Koukoulopoulos, James Maynard

TL;DR
This paper proves that the set of real numbers approximable by fractions with a certain error rate has full measure under a divergence condition, resolving a longstanding conjecture and refining classical approximation theorems.
Contribution
It establishes the Duffin-Schaeffer conjecture for Lebesgue measure and confirms Catlin's conjecture on non-reduced solutions, advancing metric number theory.
Findings
Proves the Duffin-Schaeffer conjecture for full measure.
Confirms Catlin's conjecture on non-reduced solutions.
Refines Khinchin's Theorem with new approximation conditions.
Abstract
Let be an arbitrary function from the positive integers to the non-negative reals. Consider the set of real numbers for which there are infinitely many reduced fractions such that . If , we show that has full Lebesgue measure. This answers a question of Duffin and Schaeffer. As a corollary, we also establish a conjecture due to Catlin regarding non-reduced solutions to the inequality , giving a refinement of Khinchin's Theorem.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the Duffin-Schaeffer conjecture
Dimitris Koukoulopoulos
Département de mathématiques et de statistique
Université de Montréal
CP 6128 succ. Centre-Ville
Montréal, QC H3C 3J7
Canada
and
James Maynard
Mathematical Institute, Radcliffe Observatory quarter, Woodstock Road, Oxford OX2 6GG, England
Abstract.
Let be an arbitrary function from the positive integers to the non-negative reals. Consider the set of real numbers for which there are infinitely many reduced fractions such that . If , we show that has full Lebesgue measure. This answers a question of Duffin and Schaeffer. As a corollary, we also establish a conjecture due to Catlin regarding non-reduced solutions to the inequality , giving a refinement of Khinchin’s Theorem.
Key words and phrases:
Diophantine approximation, Metric Number Theory, Duffin-Schaeffer conjecture, graph theory, density increment, compression arguments
2010 Mathematics Subject Classification:
Primary: 11J83. Secondary: 05C40
1. Introduction
Let be an arbitrary function from the positive integers to the non-negative reals. Given , we wish to understand when we can find infinitely many integers and such that
[TABLE]
Clearly, it suffices to restrict our attention to numbers .
When for all , Dirichlet’s approximation theorem implies that, given any irrational , there are infinitely many coprime integers and satisfying (1.1). On the other hand, the situation can become significantly more complicated if behaves more irregularly. Even small variations in the size of can cause (1.1) to have no solutions for certain numbers . However, there are several results in the literature that show that, under rather general conditions on , (1.1) has infinitely many solutions for almost all , in the sense that the residual set has null Lebesgue measure.
The prototypical such ‘metric’ result was proven by Khinchin in 1924 [15] (see also [16, Theorem 32]). To state his result, we let denote the Lebesgue measure on .
Khinchin’s theorem**.**
Consider a function such that the sequence is decreasing, and let denote the set of real numbers for which (1.1) has infinitely many solutions with . Then the following hold:
- (a)
If , then . 2. (b)
If , then .
There is an intuitive way to explain why Khinchin’s result ought to be true. Consider the sets
[TABLE]
so that111Recall that if is a sequence of sets of real numbers, then denotes the set of real numbers lying in infinitely many ’s.
[TABLE]
In addition,
[TABLE]
Thus, part (a) of Khinchin’s theorem is an immediate corollary of the ‘easy’ direction of the Borel-Cantelli lemma from Probability Theory [13, Lemma 1.2] applied to the probability space equipped with the measure . If we knew, in addition, that the sets were mutually independent, then we could apply the ‘hard’ direction of the Borel-Cantelli lemma [13, Lemma 1.3] to deduce part (b) of Khinchin’s theorem. Of course, the sets are not mutually independent, so the difficulty in Khinchin’s proof is showing that there is enough ‘approximate independence’, so that still has full measure.
In 1941, Duffin and Schaeffer [8] undertook a study of the limitations to the validity of Khinchin’s theorem, since the condition that is decreasing is not a necessary condition. They discovered that it is more natural to focus on reduced solutions to (1.1) that avoid overcounting issues arising when working with arbitrary fractions . To this end, let
[TABLE]
and
[TABLE]
Just like before, using the ‘easy’ direction of the Borel-Cantelli lemma, we immediately find that
[TABLE]
In analogy to Khinchin’s result, Duffin and Schaeffer conjectured that we also have the implication
[TABLE]
This is listed as Problem 46 in Montgomery’s lectures [18, Page 204].
The main result of the present paper is a proof of the Duffin-Schaeffer conjecture:
Theorem 1**.**
Let be a function such that
[TABLE]
Let be the set of for which the inequality
[TABLE]
has infinitely many coprime solutions and . Then has Lebesgue measure 1.
As a direct corollary, we obtain Catlin’s conjecture [7] that deals with solutions to (1.7) where the approximations are not necessarily reduced fractions, giving an extension of Khinchin’s Theorem.
Theorem 2**.**
Let and let denote the set of for which the inequality (1.7) has infinitely many solutions with . Define by
[TABLE]
Then the following hold:
- (a)
If , then . 2. (b)
If , then .
There has been much partial progress on the Duffin-Schaeffer conjecture in previous work. The assumption that the sequence is decreasing implies that is also decreasing. In particular, if is a fraction satisfying (1.1), then so is its reduction . Thus, as observed by Walfisz [24] (in work predating Duffin and Schaeffer’s conjecture), Khinchin’s Theorem implies the Duffin-Schaeffer conjecture when is decreasing. In the same paper, he strengthened part (b) of Khinchin’s theorem as follows: if and for all , then the set of for which (1.1) has infinitely many coprime solutions and has Lebesgue measure 1.
Duffin and Schaeffer [8] had already established their conjecture (1.6) when is sufficiently ‘regular’, in the sense that the function behaves like the constant function 1 when weighted with . More precisely, they proved (1.6) under the assumption that
[TABLE]
Since then, a variety of results towards the Duffin-Schaeffer conjecture have been proven. The first significant step was achieved by Erdős [10] and then improved by Vaaler [23], who demonstrated (1.6) when . In addition, Pollington and Vaughan [19] proved that the -dimensional analogue of the Duffin-Schaeffer conjecture holds for any .
The proof of all three aforementioned results can be found in Harman’s book [13] (see Theorems 2.5, 2.6 and 3.6, respectively), along with various other cases of the Duffin-Schaeffer conjecture (see Theorems 2.9, 2.10, 3.7 and 3.8).
More recently, the focus shifted towards establishing variations of (1.6), where the assumption that the series diverges is replaced by a slightly stronger assumption. The first result of this kind was proven in 2006 by Haynes, Pollington and Velani [14], and was improved in 2013 by Beresnevich, Harman, Haynes and Velani [5]. The strongest published such result is the recent theorem of Aistleitner, Lachmann, Munsch, Technau and Zafeiropoulos [3] who showed that
[TABLE]
for any fixed . In 2014, Aistleitner [1] established a companion result to the above one: he showed that if diverges and is not ‘too concentrated’, in the sense that
[TABLE]
then .
Remark*.*
In the recent progress report [2], Aistleitner explains how to improve on (1.8) and (1.9). In particular, his refined arguments allow him to replace by in (1.8).
Finally, Beresnevich and Velani [6] have proven that the Duffin-Schaeffer conjecture implies a Hausdorff measure version of itself. An immediate corollary of their results when combined with Theorem 1 is the following.
Corollary 3**.**
Let . Write for the set of such that (1.7) has infinitely many coprime solutions and , and set
[TABLE]
Then the Hausdorff dimension of satisfies
[TABLE]
The proof of Theorem 2, assuming Theorem 1, is explained in Section 2. For an outline of the proof of Theorem 1, we refer the readers to Section 3. Finally, the structure of the rest of the paper is presented in Section 4.
Notation
The letter will always denote a generic measure on . We reserve the letter for the Lebesgue measure on .
Sets will be typically denoted by capital calligraphic letters such as and . A triple denotes a bipartite graph with vertex sets and and edge set .
Given a set or an event , we let denote its indicator function.
The letter will always denote a prime number. We also write to mean that is the exact power of dividing the integer .
When we write , we mean the pair of and . In contrast, we write for the greatest common divisor of the integers and and for the least common multiple of and .
Finally, we adopt the usual asymptotic notation of Vinogradov: given two functions and a set , we write “ for all ” if there is a constant such that for all . The constant is absolute unless otherwise noted by the presence of a subscript. If is a third function, we use Landau’s notation “ on ” to mean that on . Typically the set is clear from the context and so not stated explicitly.
We introduce several new quantities and associated notation in Section 6 which are tailored to our application. In the interest of concreteness we have decided to use explicit constants in several parts of the argument, but we encourage the reader not to concern themselves with numerics on a first reading.
Acknowledgements
First and foremost, we would like to thank Sam Chow, Leo Goldmakher and Andrew Pollington for their valuable insights to this project: we have had extended discussions with them on various aspects of the Duffin-Schaeffer conjecture and are indebted to them for their contributions. In addition, we would like to thank Sam Chow for pointing out the connection of our paper to Catlin’s conjecture and the construction of the counterexample given in Section 15, and Sanju Velani for introducing J.M. to this problem. Finally, we are grateful to Christopher Aistleitner, Ben Green, Alan Haynes and Sam Chow for sending us various comments and corrections on an earlier version of our paper, as well as to the anonymous referees of the paper for their very detailed comments.
Our project began in the Spring of 2017 during our visit to the Mathematical Sciences Research Institute in Berkeley, California (supported by the National Science Foundation under Grant No. DMS-1440140). In addition, a significant part of our work took place during two visits of J.M. to the Centre de recherche mathématiques in Montréal in November 2017 and May 2018, and during the visit of D.K. to the University of Oxford in the Spring of 2019 (supported by Ben Green’s Simons Investigator Grant 376201). We would like to thank our hosts for their support and hospitality.
D.K. was also supported by the Natural Sciences and Engineering Research Council of Canada (Discovery Grant 2018-05699) and by the Fonds de recherche du Québec - Nature et technologies (projet de recherche en équipe - 256442). J.M. was also supported by a Clay Research Fellowship during the first half of this project, and this project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 851318) for the later stages.
2. Deduction of Theorem 2 from Theorem 1
Most of the details of this deduction can be found in Catlin’s original paper [7]. We give them here as well for the sake of completeness. For easy reference, let
[TABLE]
Firstly, we deal with a rather trivial case.
Case 1: There is a sequence of integers such that for all .
By passing to a subsequence if necessary, we may assume that for all . Recall the definition of the set from (1.2). Since , we infer that for each . As a consequence, . We claim that we also have . Indeed, for each , we have
[TABLE]
Consequently,
[TABLE]
since and . Summing (2.1) over all proves our claim that .
Hence, if we are in Case 1, we see that and , so that Theorem 2 holds.
Case 2: There are finitely many with .
Note that in this case replacing by does not affect either the convergence of , nor which numbers lie in the set . Hence, we may assume without loss of generality that . In particular, we have that , so that we may replace by in the definition of . We now follow an argument due to Catlin.
Consider the function defined by
[TABLE]
and the sets
[TABLE]
These are the analogues of the sets and that appear in Theorem 1, but with in place of . We claim that
[TABLE]
This will immediately complete the proof of Theorem 2(b) by applying Theorem 1. In addition, Theorem 2(a) will follow from (1.5).
Indeed, if , then there are infinitely many reduced fractions such . By the definition of , there is some that is a multiple of such that . If we let , then for all . Since and for each , we also have that , whence .
Conversely, let . Then there are infinitely many pairs such that . If we let be the fraction in reduced form, we also have that , where the last inequality follows by noticing that . This shows that , as long as we can show that infinitely many of the fractions are distinct. But if this were not the case, there would exist a fraction such that for infinitely many , so that for all such . Letting , we find that , a contradiction.
This completes the proof of (2.2), and hence of Theorem 2 in all cases.
3. Outline of the proof of Theorem 1
The purpose of this section is to explain in rough terms the main ideas that go into the proof of our main result. To simplify various technicalities, let us consider the special case where the function satisfies the following conditions:
- (a)
or for every ; 2. (b)
is non-zero only on square-free integers ; 3. (c)
There exists an infinite sequence such that:
- (i)
; 2. (ii)
is supported on ; 3. (iii)
for each we have
[TABLE]
In this set-up, it follows from a well-known second moment argument (which will be explained in detail in Section 5) that to establish the Duffin-Schaeffer conjecture it is sufficient to show that for any we have
[TABLE]
where
[TABLE]
Note that we have the estimate
[TABLE]
so the key to the proof is to show that on average over . This would then show suitable ‘approximate independence’ of the sets defined by (1.3). The size of is controlled by small primes dividing exactly one of . With this in mind, let us consider separately the contribution from with
[TABLE]
for different thresholds (which we think of as small compared with ). A calculation then shows that it is sufficient to show that for each
[TABLE]
In particular, we need to understand the structure of a set where many of the pairs have a large common factor. There are choices of weighted by . Given , there are divisors of that are at least . In turn, given such a divisor , there are integers which are a multiple of (forgetting the constraint ). This gives a bound for the sum in (3.2), and so the key problem is to win back a little bit more than the factor from the divisor bound. We wish to do this by gaining a structural understanding of sets where many pairs have a large GCD. One way that many pairs in can have a large GCD is if a positive proportion of elements of are a multiple of some fixed divisor . It is natural to ask if this is the only such construction. If we ignore the weights, this leads us to the following prototypical question that we shall refer to as the Model Problem.
Model Problem**.**
Let satisfy and be such that there are pairs with . Must it be the case that there is an integer which divides elements of ?
It turns out that the answer to this Model Problem as stated is ‘no’, but a technical variant of it that is sufficient for proving Theorem 1 has a positive answer. For the purposes of this section, we will ignore this subtle issue; we will return to it and discuss it in detail in Section 15.
To attack our Model Problem, we use a ‘compression’ argument, roughly inspired by the papers of Erdős-Ko-Rado [11] and Dyson [9]. We will repeatedly pass to subsets of where we have increasing control over whether given primes occur in the GCDs or not, whilst at the same time showing that the size of the original set is controlled in terms of the size of the new set. At the end of the iteration procedure we will then have arrived at a subset which controls the size of , and where we know that all large GCDs are caused by a fixed divisor. Since the final set then has a very simple GCD structure, we will have enough information to establish (3.2).
To enable the iterations, we pass to a bipartite setup. We start out with sets . Then, we construct two decreasing sequences of sets and , as well as a sequence of primes such that either divides all elements of , or is coprime to all elements of (and similarly with ). Since contains only square-free integers in the simplified set-up of this section, this means that there will be exponents such that for all , and for all . Hence, if we let and , then will divide all elements of and will divide all elements of .
We will construct the sets and in an iterative fashion. Assume that after iterations we have arrived at the sets . We then pick a prime that is different from , and that occurs as the prime factor of for some , with . Our goal is to pass judiciously to subsets and where either is all elements of that are divisible by , or is all elements of coprime to (and similarly with ). Since we’re assuming that contains only square-free integers, we then will completely know the -divisibility of all elements of and , so in particular all GCDs between an element of and will either be multiple of , or all will be coprime to .
Eventually, we will arrive at a pair of sets such that every pair with has the property that all prime factors of will lie in the set (and moreover we will ensure that there is at least one such pair). This terminates the iterative procedure. By construction, all elements of will be divisible by the fixed integer , and similarly all elements of will be divisible by the fixed integer . In addition, if and has , then in fact will be exactly equal to since we know the -divisibility for all elements of and . Thus, and actually every pair and has .
Naturally, the success of the above strategy depends on improving the ‘structure’ of the pair of sets at each stage of the algorithm. This will enable us to control a quantity like the left hand side of (3.2) in terms of a related quantity for . An initially appealing choice to measure the ‘structure’ might be
[TABLE]
namely the density of pairs with large GCD at stage . Iteratively increasing this quantity would try to mimic a ‘density increment’ strategy such as that used in the proof of Roth’s Theorem on arithmetic progressions [21, 22]. Unfortunately, such an argument loses all control over the size of the sets , and so we lose control over the sum in (3.2).
An alternative suggestion might be to consider a different quantity which focuses on the size of the sets. Recall that all elements of are a multiple of , all elements of are a multiple of , and that in our final step we have . Thus
[TABLE]
where we used that and are subsets of in (3.3). Thus, one might try to iteratively increase the quantity
[TABLE]
This would adequately control (3.2), but unfortunately it is not possible to guarantee that this quantity increases at each stage, and so this proposal also fails.
However, the variant
[TABLE]
turns out to (more-or-less) work well. Indeed, if the quantity (3.4) increases at each iteration, and at the final iteration all elements of are a multiple of , all elements of are a multiple of , and all edges come from pairs with , then we find that
[TABLE]
We note that in our setup , and that
[TABLE]
If it so happens that , then we trivially obtain (3.2) (ignoring the weighting) from (3.6). On the other hand, if , then (3.5) falls short of (3.2) only by a factor .
Finally, to win the additional factor of we make use of the fact that any edge in our graph satisfies (3.1). The crucial estimate is that
[TABLE]
This was the key idea in the earlier work of Erdős [10] and Vaaler [23] on the Duffin-Schaeffer conjecture. In our case, our iteration procedure has essentially reduced the proof to a similar situation to their work.
Indeed, in (3.3), we may restrict our attention to pairs such that , and
[TABLE]
Unless most of the contribution to the above sum of comes from primes in and , we can apply (3.7) to win a factor of size in (3.3). Finally, if the small primes in and do cause a problem, then a more careful analysis of our iteration procedure shows that we actually are able to increase the quantity (3.4) by more than by the final stage , which also suffices for establishing (3.2) in this case.
The above description has ignored several important technicalities; it turns out that the weights are vital for our argument to work (see the discussion in Section 15). In addition, we do not quite work with (3.4) but with a closely related (but more complicated) expression to enable this quantity to increase at each iteration. The iteration procedure of our argument is broken up into different stages. In between two of the principal iterative stages, we perform a certain ‘clean-up’ step at which we allow a small loss in the quantity (3.4). This step is essential in order to keep track of the condition (3.8) (which could otherwise become meaningless after too many iterations).
4. Structure of the paper
In the first half of the paper that consists of Sections 5-10, we reduce the proof of Theorem 1 to three technical iterative statements about particular graphs, which we call ‘GCD graphs’ (see Definition 6.1). Specifically, in Section 5 we use a second moment argument to reduce the proof to Proposition 5.4, which claims a suitable bound for sums of the form (3.2). Here, we make use of Lemmas 5.1-5.3 which are standard results from the literature. In Section 6 we introduce the key terminology of the paper and translate Proposition 5.4 into Proposition 6.3, a statement about edges in a particular ‘GCD graph’. In Section 7 we use results about the anatomy of integers (Lemmas 7.2 and 7.3) to reduce the situation to establishing Proposition 7.1, a technical statement claiming the existence of a ‘good’ GCD subgraph (where ‘good’ here means that there are integers and such that all vertices in are divisible by , those in are divisible by , and if is an edge, then ). Then in Section 8, we reduce the proof of Proposition 7.1 to five iterative claims which form the heart of the paper: Propositions 8.1-8.3 and Lemmas 8.4-8.5. In Sections 9 and 10 we then directly establish Lemmas 8.4 and 8.5, respectively, leaving the second half of the paper to demonstrate the key statements of Propositions 8.1-8.3.
The dependency diagram for the first half of the paper is as follows:
Theorem 1
Proposition 5.4
Proposition 6.3
Proposition 7.1
Proposition 8.1
Proposition 8.2
Proposition 8.3
Lemma 5.1
Lemma 5.2
Lemma 5.3
Lemma 7.3
Lemma 7.2
Lemma 10.1
Lemma 8.4
Lemma 8.5
The second half of the paper consists of Sections 11-14, and it is devoted to proving each of Proposition 8.1, 8.2 and 8.3. Before we embark on the proofs directly, we first establish several preparatory lemmas in Section 11. In particular we prove Lemmas 11.2-11.6 which are minor results on GCD graphs we will use later on. Section 12 is dedicated to the proof of Proposition 8.1, which is the easier iteration step, and relies on two auxiliary results: Lemmas 12.1 and 12.2. Section 13 is dedicated to the proof of Proposition 8.3, the iteration procedure for small primes. This proposition follows from Lemma 13.2, in turn relying on Lemmas 11.2, 11.3 and 13.1. Finally, in Section 14 we prove Proposition 8.2, which is the most delicate part of the iteration procedure. This follows quickly from Lemma 14.1, which in turn relies on Lemmas 11.3-11.6. The dependency diagram for the second half of the paper is as follows:
Proposition 8.1
Proposition 8.3
Proposition 8.2
Lemma 12.2
Lemma 12.1
Lemma 14.1
Lemma 11.4
Lemma 11.6
Lemma 11.3
Lemma 13.2
Lemma 11.2
Lemma 13.1
Lemma 10.1
Lemma 11.5
(We have not included the essentially trivial statement of Lemma 11.1 or Lemma 6.7 which are used frequently in the later sections.) All lemmas are proven in the section where they appear with the exception of Lemma 8.4 and Lemma 8.5, which are proven in Sections 9 and 10 respectively. All propositions are proven in sections later than they appear.
5. Preliminaries
We first reduce the proof of Theorem 1 to a second moment bound given by Proposition 5.4 below. This reduction is standard and appears in several previous works on the Duffin-Schaeffer conjecture. In particular, a vital component is the following ergodic 0-1 law due to Gallagher [12].
Lemma 5.1** (Gallagher’s 0-1 law).**
Consider a function and let be as in (1.4). Then either or .
Proof.
This is Theorem 1 of [12]. ∎
Lemma 5.2** (The Duffin-Schaeffer Conjecture when only takes large values).**
Let be a function, and let be as in (1.4). Assume, further, that:
- (a)
For every , either or ; 2. (b)
.
Then .
Proof.
This follows from [19, Theorem 2]. ∎
Lemma 5.3** (Bound for ).**
Consider a function and let be as in (1.3). In addition, given , set
[TABLE]
If , then we have
[TABLE]
Proof.
This bound is given in [19, p. 195-196]. ∎
Given the above lemma, we introduce the notation
[TABLE]
for and . The key result to proving Theorem 1 is:
Proposition 5.4** (Second moment bound).**
Let and be as in as in Lemma 5.3, and consider such that
[TABLE]
For each , set
[TABLE]
Then
[TABLE]
Proof of Theorem 1 assuming Proposition 5.4.
We wish to prove that
[TABLE]
where with defined by (1.4). We first write
[TABLE]
In particular, if , and otherwise.
If it so happens that , then we apply Lemma 5.2 to to find that , where is defined as but with replaced by . This proves (5.3), since , and so .
Therefore we may assume without loss of generality that , and so . Thus, we have reduced Theorem 1 to the case when
[TABLE]
By Lemma 5.1, the Duffin-Schaeffer conjecture will follow if we prove that , since this means cannot have measure 0. Note that
[TABLE]
Now, let be a large parameter and fix to be minimal such that
[TABLE]
(Such a exists since for all .) Hence, we see that it suffices to prove that
[TABLE]
uniformly for all large enough , since this implies that by virtue of (5.4), and hence Theorem 1 follows.
For each , consider the counting function
[TABLE]
We then have
[TABLE]
Hence, the Cauchy-Schwarz inequality implies that
[TABLE]
Thus, to establish (5.5), it is enough to prove that
[TABLE]
The terms with contribute a total
[TABLE]
and so we only need to consider the contribution of those terms with . Applying Lemma 5.3, we see that
[TABLE]
where we recall that
[TABLE]
Thus, (5.6) is reduced to showing that
[TABLE]
To prove this inequality, we divide the range of and into convenient subsets.
The pairs with
[TABLE]
contribute a total of at most
[TABLE]
to the right hand side of (5.7), and so can be ignored.
For any other pair , we see that
[TABLE]
so certainly we have
[TABLE]
For any such pair, we let be the largest integer such that
[TABLE]
Since is chosen maximally, we have
[TABLE]
Mertens’ theorem then implies that
[TABLE]
Therefore
[TABLE]
where we used again Mertens’ theorem. As above, those pairs with
[TABLE]
make an acceptable contribution to (5.7). Therefore we only need to consider pairs with .
We have thus reduced (5.7) to showing that
[TABLE]
where is defined by (5.2). To prove (5.8), we apply Proposition 5.4, which shows that the inner sum is . Since the sum of over converges, this completes the proof of Theorem 1. ∎
Thus we are left to establish Proposition 5.4.
6. Bipartite GCD graphs
In this section we introduce the key notation that will underlie the rest of the paper. In particular, we show that Proposition 5.4 follows from a statement given by Proposition 6.3 about a weighted graph with additional information about divisibility of the integers making up its vertices. The rest of the paper is then dedicated to establishing suitable properties of such graphs, which we call ‘GCD graphs’.
If we let
[TABLE]
and we weight the elements of with the measure
[TABLE]
then Proposition 5.4 can be interpreted as an estimate for the weighted edge density of the graph with set of vertices and set of edges defined by (5.2).
Our strategy for proving Proposition 5.4 is to use a ‘compression’ argument. More precisely, if denotes the graph described in the above paragraph, we will construct a finite sequence of graphs where we make a small local change to pass from to that increases the amount of structure in the graph. The final graph will then be highly structured and easy to analyze. To keep control over the procedure, we keep track of how certain statistics of the graph change at each step. This enables us to show that the relevant properties of are suitably controlled by , and so is controlled by , where everything is explicit.
To perform the above construction, we introduce some new notation to take into account the extra information about prime power divisibility which we need to carry at each stage.
Definition 6.1** (GCD graph).**
Let be a septuple such that:
- (a)
is a measure on such that for all ; we extend to by letting
[TABLE] 2. (b)
and are finite sets of positive integers; 3. (c)
, that is to say is a bipartite graph; 4. (d)
is a set of primes; 5. (e)
and are functions from to such that for all we have:
- (i)
for all , and for all ; 2. (ii)
if , then ; 3. (iii)
if , then for all , and for all .
We then call a (bipartite) GCD graph with sets of vertices , set of edges and multiplicative data . We will also refer to as the set of primes of . If , we say that has trivial set of primes and we view and as two copies of the empty function from to .
Definition 6.2** (Non-trivial GCD graph).**
Let . We say that is non-trivial if .
We now recast Proposition 5.4 in the language of GCD graphs.
Proposition 6.3** (Edge set bound).**
Let , and be the measure . Let satisfy . Let be a bipartite GCD graph with measure , vertex sets , trivial set of primes, and edge set , where is defined as in Proposition 5.4. Then
[TABLE]
Proof of Proposition 5.4 assuming Proposition 6.3.
Recall the notation , , , , and of Proposition 5.4. We wish to show that
[TABLE]
Let be the measure on defined by and let , so that
[TABLE]
Now define to be as in Proposition 5.4. We see that forms a bipartite graph with vertex sets two copies of and edge set . We now turn this bipartite graph into a GCD graph by attaching trivial multiplicative data to the bipartite graph (here and are viewed as two copies of the function of the empty set to ).
Since , Proposition 6.3 now applies, showing that
[TABLE]
This completes the proof. ∎
Thus we are left to establish Proposition 6.3. As we briefly explained in Section 3, this will be done by passing iteratively to subgraphs of on which we control the divisibility by more and more primes. To formalize this procedure, we introduce the concept of a GCD subgraph.
Definition 6.4** (GCD subgraph).**
Let and be two GCD graphs. We say that is a GCD subgraph of if:
[TABLE]
We write if is a GCD subgraph of . Lastly, we say that is a non-trivial GCD subgraph of if , that is to say is non-trivial as a GCD graph.
We thus see from the above definition that we only accept as a subgraph of if we have at least as much information about the divisibility of the vertices of compared to those of . In particular, we have that for all and all .
We will devise an iterative argument that adds one prime at a time to , so that we will eventually control the multiplicative structure of GCDs of connected vertices in the graph very well by the end of this process.
The main way we will produce a GCD subgraph of a GCD graph is by restricting to vertex sets with certain divisibility properties. Since we will use this several times, we introduce a specific notation for these GCD subgraphs.
Definition 6.5** (Special GCD subgraphs from prime power divisibility).**
Let be a prime number, and let .
- (a)
If is a set of integers and , we set
[TABLE]
that is to say is the set of integers in whose -adic valuation is exactly . Here we have the understanding that denotes the set of that are coprime to . In particular, and denote different sets of integers. 2. (b)
Let be a bipartite graph. If and , we define
[TABLE]
We also write for brevity
[TABLE] 3. (c)
Let be a GCD graph such that . We then define the septuple
[TABLE]
where the functions , are defined on by the relations , ,
[TABLE]
It is easy to check that is a GCD subgraph of .
The aim of our iterative procedure is to obtain a simple GCD subgraph of our initial graph where the key quantitative aspects of are controlled by the corresponding quantities of . Here ‘simple’ graphs have many primes occurring in for to a fixed exponent, whilst for subgraphs to maintain control over the original graph we need to maintain sufficiently many edges relative to the number of vertices. This leads us to our last four definitions:
Definition 6.6** (Quantities associated to GCD graphs).**
Let be a GCD graph.
- (a)
If , then we define the edge density of by
[TABLE]
If or , we define the edge density of to be [math]. 2. (b)
The neighbourhood sets are defined by
[TABLE]
and similarly
[TABLE] 3. (c)
We let be given by
[TABLE]
That is to say is the set of primes occurring in a GCD which we haven’t yet accounted for. We split this into two further subsets:
[TABLE]
and
[TABLE] 4. (d)
The quality of is defined by
[TABLE]
where is the edge density of .
Remark*.*
If we see that
[TABLE]
As mentioned in Section 3, there are two natural candidates for a quantity to increment; either or (this is the natural generalization to non-squarefree integers). One should essentially think of the quality as a ‘hybrid’ of the two quantities, but with some additional factors which are included for technical reasons. The factor
[TABLE]
always lies in the interval , and so is always of size bounded away from 0 and from . This factor is included merely for convenience, and allows us to have a quality increment even if there is a tiny loss in our arguments in terms of . The factor
[TABLE]
is crucial for the proof of a quality increment in Lemma 14.1 and Proposition 8.2. This is related to the technical point that it is vital that the weights of our vertices contain the factor . We will discuss this feature in more detail in Section 15.
We will repeatedly make use of some trivial properties of GCD graphs, given by Lemma 6.7 below, without further comment.
Lemma 6.7** (Basic properties of GCD graphs).**
Let be GCD graphs.
- (a)
The property of being a GCD subgraph is transitive: If and , then 2. (b)
If , then . 3. (c)
If is non-trivial, then . 4. (d)
Let have edge density . Then the following are equivalent:
- (i)
* is non-trivial.* 2. (ii)
. 3. (iii)
.
Proof.
All statements are immediate from the definition of GCD subgraphs and of non-trivial GCD graphs. ∎
Remark*.*
In part (b) of Lemma 6.7, it is not necessarily the case that nor that .
Having introduced all necessary terminology, we turn to the task of establishing Proposition 6.3.
7. Reduction to a good GCD subgraph
In this section, we reduce the proof of Proposition 6.3 (and hence of Theorem 1) to finding a ‘good’ GCD subgraph as described in Proposition 7.1 below. This reduction utilizes some results showing that few integers have lots of fairly small prime factors (based on ‘the anatomy of integers’).
Proposition 7.1** (Existence of a good GCD subgraph).**
Let be a GCD graph with trivial set of primes and edge density . Assume further that
[TABLE]
for some satisfying
[TABLE]
Then there is a GCD subgraph of with edge density such that:
- (a)
; 2. (b)
For all , we have ; 3. (c)
For all , we have ; 4. (d)
One of the following holds:
- (i)
; 2. (ii)
, and if and we write them as and , then .
Our task is to prove that Proposition 7.1 implies Proposition 6.3. To do so, we need a couple of preparatory lemmas that exploit the condition that in Case (d)-(ii) of Proposition 7.1.
Lemma 7.2** (Bounds on multiplicative functions).**
Let and write for the -th divisor function. If is a multiplicative function such that , then
[TABLE]
Proof.
This is [17, Theorem 14.2, p. 145]. ∎
Lemma 7.3** (Few numbers with many prime factors).**
For and , we have
[TABLE]
the implied constant is absolute.
Proof.
We may assume that is large enough, since the result is trivial when is bounded. Set , so that by Mertens’ theorem. Hence
[TABLE]
We wish to apply Lemma 7.2 when is the multiplicative function with for and all , and for . In particular, for all prime powers , so that . Thus
[TABLE]
Since for , and , the sum of over is . We thus conclude that
[TABLE]
Since , the lemma has been proven. ∎
Proof of Proposition 6.3 assuming Proposition 7.1.
Fix and let be the GCD graph of Proposition 6.3 with set of edges (where is defined in Proposition 5.4), weight and edge density .
If , then and so we are done. Therefore we may assume that
[TABLE]
Note that this implies that
[TABLE]
We apply Proposition 7.1 to to find a GCD subgraph of with edge density satisfying either case (d)-(d)(i) or (d)-(d)(ii) of its statement. In addition, we have that:
- (a)
; 2. (b)
for all ; 3. (c)
for all .
Set
[TABLE]
The definition of a GCD graph implies that
[TABLE]
Moreover, since , and for all , we have that
[TABLE]
Now, note that
[TABLE]
as well as
[TABLE]
Consequently, from the definition of , we find
[TABLE]
Proposition 7.1 offers a lower bound on . Since
[TABLE]
we can obtain an upper bound on the size of by estimating from above.
Note that
[TABLE]
where we recall that . Since for all , we infer that
[TABLE]
The vertex sets are finite sets of positive integers. For each , let be the largest integer in such that . (This quantity is well-defined in virtue of property (b) above. In addition, we emphasise to the reader that ‘largest’ refers to the size of elements as positive integers, and does not depend on the measure .) Similarly, for each , let be the largest element of such that . Consequently,
[TABLE]
Now, let be the largest integer in and . We then have
[TABLE]
In addition, since satisfies conditions (b) and (c) in the statement of Proposition 7.1, we have
[TABLE]
Substituting this bound into (7.2), we find
[TABLE]
Here we used the trivial bound in the second inequality. In addition,
[TABLE]
Since and , we have and . Therefore
[TABLE]
Together with (7.3), (7.4) and (7.5), this implies that
[TABLE]
We now split our argument depending on whether (d)-(d)(i) or (d)-(d)(ii) of Proposition 7.1 holds.
Case 1: (d)-(d)(i) of Proposition 7.1 holds.
In this case we have . Writing and , we find that
[TABLE]
Together with (7.6), this implies that
[TABLE]
Since in this case, and since , this gives
[TABLE]
This establishes Proposition 6.3 in this case.
Case 2: (d)-(d)(ii) of Proposition 7.1 holds.
Write and . In this case
[TABLE]
We also have .
From (7.7), we see that either
[TABLE]
whenever . Consequently,
[TABLE]
where
[TABLE]
For , we note that
[TABLE]
by Lemma 7.3, since . Similarly for , we find that
[TABLE]
by applying Lemma 7.3 once again. Substituting these bounds into (7.6), we conclude that
[TABLE]
Since we have and , this gives
[TABLE]
This establishes Proposition 6.3 in all cases. ∎
Thus we are left to prove Proposition 7.1.
8. Reduction of Proposition 7.1 to three iterative propositions
We will prove Proposition 7.1 by an iterative argument, where we repeatedly find GCD subgraphs with progressively nicer properties. In this section we reduce the proof to five technical iterative statements, given by three key propositions (Propositions 8.1-8.3) and two auxiliary lemmas (Lemmas 8.4-8.5) given below.
Proposition 8.1** (Iteration when ).**
Let be a GCD graph with edge density such that
[TABLE]
Then there is a GCD subgraph of with edge density and multiplicative data such that
[TABLE]
where .
Proposition 8.2** (Iteration when ).**
Let be a GCD graph with edge density such that
[TABLE]
Then there is a GCD subgraph of such that
[TABLE]
Propositions 8.1 and 8.2 deal with large primes. We need a complementary result that handles the small primes.
Proposition 8.3** (Bounded quality loss for small primes).**
Let be a GCD graph with edge density and trivial set of primes. Then there is a GCD subgraph of with edge density such that
[TABLE]
Finally, we need two further technical estimates. The first one strengthens the quality of the inequality when the set is empty, whereas the second allows one to pass to a subgraph where all vertices have high degree.
Lemma 8.4** (Removing the effect of from ).**
Let and be a GCD graph with edge density such that
[TABLE]
Then there exists a GCD subgraph of such that
[TABLE]
Lemma 8.5** (Subgraph with high-degree vertices).**
Let be a GCD graph with edge density . Then there is a GCD subgraph of with edge density such that:
- (a)
; 2. (b)
; 3. (c)
For all and for all , we have
[TABLE]
Proof of Proposition 7.1 assuming Propositions 8.1-8.3 and Lemmas 8.4-8.5.
We will construct the required subgraph in several stages. It suffices to produce a GCD subgraph of satisfying only conclusions and of Proposition 7.1, since an application of Lemma 8.5 then produces a GCD subgraph satisfying all the conclusions.
Stage 1: Obtaining a GCD subgraph with .
Since has set of primes equal to the empy set, we may apply Proposition 8.3 to to produce a GCD subgraph of with edge density and for which
[TABLE]
In particular, we have
[TABLE]
for any by Lemma 6.7(b).
Stage 2: Obtaining a GCD subgraph with .
If , then satisfies the conditions of Proposition 8.1. We then repeatedly apply Proposition 8.1 to produce a sequence of GCD subgraphs of given by
[TABLE]
until we obtain a GCD subgraph of which does not satisfy the conditions of Proposition 8.1. Since and is a finite set, this process must indeed terminate after a finite number of steps and produce a GCD graph that does not satisfy the conditions of Proposition 8.1. Since by (8.2), it must be the case that
[TABLE]
In addition, Proposition 8.1 implies that
[TABLE]
where
[TABLE]
Together with (8.1), this yields that
[TABLE]
On the other hand, if , then we simply take and note that (8.3) is trivially satisfied by (8.1).
This completes Stage 2. The remaining part of the proof deviates according to whether the ratio is larger or smaller than .
Case (a): .
In this case we do not need to keep track of the condition that because we have a very large gain in the quality of the new graph. The next stage of the argument is then:
Stage 3a: Obtaining a GCD subgraph with .
Notice that if , then by (8.2). Consequently, if , then either Proposition 8.1 or Proposition 8.2 is applicable to , thus producing a GCD subgraph of such that
[TABLE]
Since is finite, starting with and iterating the above fact, we can construct a finite sequence of GCD subgraphs
[TABLE]
such that
[TABLE]
Applying the assumption that , we infer that
[TABLE]
Hence, the GCD graph satisfies condition (a) and condition (d)-(d)(i) of Proposition 7.1, giving the result in this case. (Recall that we may also guarantee conditions (b) and (c) of Proposition 7.1 by feeding our graph into Lemma 8.5.)
In order to complete the proof of Proposition 7.1, it remains to consider the situation when is not large.
Case (b): .
In this case, the quality increment is small and we must make sure not to lose track of the condition . For this reason, we perform some cosmetic surgery to our graph before applying Proposition 8.2. This consists of Stage 3b that we present below.
Stage 3b: Removing the effect of primes in from the anatomical condition .
Note that (8.3) implies that
[TABLE]
and that
[TABLE]
where we recall that
[TABLE]
(here we used the trivial bound ).
Since , , and for all , it is the case that satisfies the conditions of Lemma 8.4. Consequently, there exists a GCD subgraph of with
[TABLE]
and such that
[TABLE]
We claim that an inequality of the form (8.8) holds even if we remove from consideration the primes lying in the set
[TABLE]
It turns out that we can do this rather crudely, starting from the estimate
[TABLE]
Recalling that and , we deduce that
[TABLE]
Since , relation (8.5) implies that , that is to say the right hand side of (8.9) is . As a consequence,
[TABLE]
Having removed the effect to the condition of primes from the sets , we are ready to complete the construction of in Case (b).
Stage 4b: Obtaining a GCD subgraph with .
We argue as in Stage 3a: for each , we have by (8.2). Hence, if , then either Proposition 8.1 or Proposition 8.2 is applicable to , thus producing a GCD subgraph of such that
[TABLE]
where and denote the set of primes of and of , respectively. Since is finite, starting with and iterating the above fact, we can construct a finite sequence of GCD subgraphs
[TABLE]
such that
[TABLE]
In addition, note that
[TABLE]
where the second relation follows by fact (8.6) that . We now verify that if we let
[TABLE]
then condition (d)-(d)(ii) of Proposition 7.1 is satisfied. This suffices for the completion of the proof, since clearly satisfies condition (a) of Proposition 7.1, and an application of Lemma 8.5 can also ensure conditions (b) and (c).
First of all, note that by (8.7) and (8.3) and , we have
[TABLE]
Let . It remains to check that , where and are defined by the relations
[TABLE]
By the definition of the set and since , all prime factors of belong to . But for each prime we have . Thus
[TABLE]
In particular, we must have that
[TABLE]
Now, let be a prime such that
[TABLE]
Since but , we must have , and so . In addition, our assumptions that and imply that . If , we infer that . On the other hand, if , then the inclusion implies that . In either case, we have that . Thus, since , we may use the bound (8.10), which gives
[TABLE]
In particular, satisfies the conditions of case (d)-(d)(ii) of Proposition 7.1. This completes the proof of Proposition 7.1 in Case (b) too. ∎
Thus we are left to establish Propositions 8.1-8.3 and Lemmas 8.4-8.5. We begin with the last two results because they are easier to establish.
9. Proof of Lemma 8.4
In this section we establish Lemma 8.4 directly.
For brevity, let
[TABLE]
We have
[TABLE]
Fix for the moment a prime . Since we have , it must be the case that , that is to say there exists some such that
[TABLE]
Now we note that if , then and for some . In particular we cannot have and . Thus
[TABLE]
Thus we conclude that
[TABLE]
where in the final line we used the fact that .
Now, let us define
[TABLE]
Evidently, we have that
[TABLE]
Thus . We then take and note that
[TABLE]
Finally, we note that
[TABLE]
for by [20, Theorem 5], and so if then
[TABLE]
Hence, since , for any we have
[TABLE]
This completes the proof of Lemma 8.4.∎
We are left to establish Propositions 8.1-8.3 and Lemma 8.5.
10. Proof of Lemma 8.5
In this section we establish Lemma 8.5. We begin with an auxiliary lemma.
Lemma 10.1** (Quality increment or all vertices have high degree).**
Let be a GCD graph with edge density . For each and for each , we let
[TABLE]
be the sets of their neighbours. Then one of the following holds:
- (a)
For all and for all , we have
[TABLE] 2. (b)
There is a GCD subgraph of with edge density , quality , and such that either or .
Proof.
Assume that (a) fails. Then either its first or its second inequality fails. Assume that the first one fails for some ; the other case is entirely analogous. Let be the set of edges between the vertex sets and . Note that
[TABLE]
because , , and by the assumption . In particular, we have . We then consider , which is a GCD subgraph of . Let have edge density . We claim that and .
Indeed, we have
[TABLE]
Thus the edge density of satisfies
[TABLE]
Thus we see that , and that
[TABLE]
This proves our claim that too, thus completing the proof of the lemma. ∎
Proof of Lemma 8.5.
We note that conclusion of Lemma 8.5 is the same as conclusion of Lemma 10.1. Thus, if does not satisfy conclusion of Lemma 8.5, then we may repeatedly apply Lemma 10.1 to produce a sequence of GCD subgraphs
[TABLE]
until we arrive at a GCD subgraph of which satisfies conclusion of Lemma 10.1. This process must terminate after a finite number of steps since at least one of the vertex sets of has one less element than the corresponding vertex set of . Let the process terminate at , which satisfies conclusion of Lemma 10.1, and let be the edge density of . Since and by Lemma 10.1, we have that
[TABLE]
Since the multiplicative data are also maintained at each iteration, we see that taking gives the result. ∎
Thus we are left to establish Propositions 8.1-8.3.
11. Preparatory Lemmas on GCD graphs
Our remaining task is to prove Propositions 8.1-8.3. Before we attack these directly, we establish various preliminary results about GCD graphs in this section, which we will then use in the remaining sections to prove Propositions 8.1-8.3.
Lemma 11.1** (Quality variation for special GCD subgraphs).**
Let be a GCD graph, and . If is as in Definition 6.5, then is a GCD subgraph of . In addition, if is non-trivial and , then we have
[TABLE]
Proof.
This follows directly from the definitions. ∎
Lemma 11.2** (One subgraph must have limited quality loss).**
Let be a GCD graph with edge density , and let and be partitions of and . Then there is a GCD subgraph of with edge density such that
[TABLE]
and with , , and .
Proof.
For brevity let be the edges between and for and . Since the partitions of and induce a partition of , we have
[TABLE]
Thus, by the pigeonhole principle, there is a choice of and such that . We then let , which is clearly a non-trivial GCD subgraph of . We see that
[TABLE]
and
[TABLE]
This gives the result. ∎
Lemma 11.3** (Few edges between unbalanced sets, I).**
Let be a GCD graph with edge density . Let , and be such that and
[TABLE]
(In particular, if , the last hypothesis is vacuous.)
If we set and write for the edge density of the graph , then one of the following holds:
- (a)
There is such that and . 2. (b)
.
Proof.
Assume that conclusion does not hold, so and we wish to establish . Then there must exist some such that
[TABLE]
where we used that . In particular, is a non-trivial GCD graph. Since , we have that . Consequently,
[TABLE]
Since , we have
[TABLE]
In addition, note that for all primes. Therefore
[TABLE]
by our assumption that .
Similarly, we have
[TABLE]
Since and , we conclude that
[TABLE]
This completes the proof of the lemma. ∎
The symmetric version of Lemma 11.3 to the above one also clearly holds:
Lemma 11.4** (Few edges between unbalanced sets, II).**
Let be a GCD graph with edge density . Let , and be such that and
[TABLE]
and set . If denotes the edge density of the graph , then one of the following holds:
- (a)
There is such that and . 2. (b)
.
Next, we prove a lemma about the connectivity of small vertex sets of a GCD graph.
Lemma 11.5** (Few edges between small sets).**
Let be a GCD graph with edge density and let . Then one of the following holds:
- (a)
For all sets and such that and , we have . 2. (b)
There is a GCD subgraph of such that , and .
Proof.
Assume that (a) fails. Hence, there exist sets and such that , and . We then set and consider the GCD subgraph of . Since , this is a non-trivial GCD graph. In addition, since (because is non-trivial) and (by assumption), we have , and thus . Similarly, we find that . Finally, for the quality of , we have
[TABLE]
This completes the proof of the lemma. ∎
By iterating this lemma, we arrive at the following result.
Lemma 11.6** (Subgraph with few edges between all small sets).**
Let be a GCD graph with edge density , and let . Then there is a GCD subgraph of with edge density such that both of the following hold:
- (a)
. 2. (b)
For all sets and such that and , we have .
Proof.
We note that conclusion of Lemma 11.6 is the same as conclusion of Lemma 11.5. Thus, if does not satisfy conclusion of Lemma 11.6, then we may repeatedly apply Lemma 11.5 to produce a sequence of GCD subgraphs
[TABLE]
until we arrive at a GCD subgraph of which satisfies conclusion of Lemma 11.5. This process must terminate after a finite number of steps since has strictly smaller vertex sets than those of . Let the process terminate at , which satisfies conclusion of Lemma 11.5. Since by Lemma 11.5, we have that
[TABLE]
Lastly, since the multiplicative data are maintained at each iteration, we see that taking gives the result. ∎
12. Proof of Proposition 8.1
In this section we prove Proposition 8.1, which is the iteration procedure for ‘generic’ primes. This section is essentially self-contained (relying only on the notation of Section 6 and the trivial Lemma 11.1), and serves as a template for the proofs of the harder Propositions 8.2 and 8.3.
Lemma 12.1** (Bounds on edge sets).**
Consider a GCD graph and a prime . For each , let
[TABLE]
Then there exist such that and
[TABLE]
Proof.
Let . Note that if , then . Thus . Hence, if we assume that the inequality in the statement of the lemma does not hold for any pair , we must have
[TABLE]
where
[TABLE]
and
[TABLE]
Thus, to arrive at a contradiction, it suffices to show that
[TABLE]
First of all, note that , whence
[TABLE]
Observing that
[TABLE]
we conclude that
[TABLE]
Since are non-negative reals which sum to 1, there exists some such that
[TABLE]
We thus find that
[TABLE]
where we used the Cauchy-Schwarz inequality to bound from above. We also find that
[TABLE]
As a consequence,
[TABLE]
The function is increasing for , and so maximized at . Thus we infer that as required, completing the proof of the lemma. ∎
Lemma 12.2** (Quality increment unless a prime power divides almost all).**
Consider a GCD graph with edge density and a prime with . Then one of the following holds:
- (a)
There is a GCD subgraph of with multiplicative data and edge density such that
[TABLE] 2. (b)
There is some such that
[TABLE]
Proof.
Let and be defined as in the statement of Lemma 12.1. Consequently, there are such that and
[TABLE]
In particular, , so that is a non-trivial GCD subgraph of . We separate two cases, according to whether or not.
Case 1: .
Let . Lemma 11.1 and our lower bound imply that
[TABLE]
In addition,
[TABLE]
This establishes conclusion (a) in this case, noting that so .
Case 2:
As before, we let , and use Lemma 11.1 and our lower bound on to find that
[TABLE]
where
[TABLE]
In addition, we have
[TABLE]
Note that
[TABLE]
Indeed, this follows by our assumption that , which implies that . Combining the above, we conclude that
[TABLE]
Now, assume that conclusion (a) of the lemma does not hold, so that the left hand side of (12.3) is . Since and all primes are at least , we must then have that
[TABLE]
where we used our assumption that for the last inequality. In particular, this gives
[TABLE]
We note that
[TABLE]
Thus by the arithmetic-geometric mean inequality, and relations (12.5) and (12.4), we have
[TABLE]
In particular, .
We consider the case when ; the case with is entirely analogous with the roles of and swapped, and the roles of and swapped. Thus, to complete the proof of the lemma, it suffices to show that
[TABLE]
The first inequality of (12.4) states that
[TABLE]
Since , we infer that
[TABLE]
In particular, and , whence
[TABLE]
This completes the proof of (12.6) and hence of the lemma. ∎
Proof of Proposition 8.1.
This follows almost immediately from Lemma 12.2. Since by assumption, if then . We have also assumed that . Consequently, there is a prime with . We now apply Lemma 12.2 with this choice of . By definition of , conclusion cannot hold, and so conclusion must hold. This then gives the result. ∎
We are left to establish Proposition 8.3 and Proposition 8.2.
13. Proof of Proposition 8.3
In this section we prove Proposition 8.3, which is the iteration procedure for small primes. This section relies on the notation of Section 6, Lemma 10.1, the Lemmas 11.1-11.3 from Section 11 and Lemma 12.2. The basic idea of the proof is similar to that of Proposition 8.1, but we can no longer ensure a quality increment when the primes are small; instead we show that there is only a bounded loss.
Lemma 13.1** (Small quality loss or prime power divides positive proportion).**
Consider a GCD graph with edge density , and let be a prime. Then one of the following holds:
- (a)
There is a GCD subgraph of with multiplicative data and edge density such that
[TABLE] 2. (b)
There is some such that
[TABLE]
Proof.
Assume that conclusion does not hold, so we intend to establish . For , let and . We begin as in the proof of Lemma 12.2, by considering satisfying (12.1) and the inequalities . In particular, is a non-trivial GCD subgraph of .
We note that the proof of Lemma 12.2 up to relation (12.3) requires no assumption on the size of . Now, if , then Case 1 of the proof of Lemma 12.2 shows that conclusion must hold, contradicting our assumption. Therefore we may assume that . Now, arguing as in Case 2 of the proof of Lemma 12.2, and setting and
[TABLE]
we infer that
[TABLE]
Therefore we have that
[TABLE]
Since , we have
[TABLE]
so . We deal with the case when ; the case with is entirely analogous with the roles of and and the roles of and swapped.
Since , we have
[TABLE]
In particular, and so conclusion holds, as required. ∎
Lemma 13.2** (Adding small primes to ).**
Let be a GCD graph with edge density . Let be a prime with .
Then there is a GCD subgraph of with set of primes and edge density such that
[TABLE]
Proof.
We first repeatedly apply Lemma 10.1 until we arrive at a GCD subgraph
[TABLE]
of with edge density such that
[TABLE]
as well as
[TABLE]
(We must eventually arrive at such a subgraph since the vertex sets are strictly decreasing at each stage but can never become empty since the edge density remains bounded away from 0.)
We now apply Lemma 13.1 to . If conclusion of Lemma 13.1 holds, then there is a GCD subgraph of satisfying the conclusion of Lemma 13.2, so we are done by taking . Therefore we may assume that instead conclusion of Lemma 13.1 holds, so there is some such that
[TABLE]
In fact we claim that either the conclusion of Lemma 13.2 holds, or we have the stronger condition
[TABLE]
Relation (13.2) follows immediately from (13.1) if , so let us assume that . We then apply Lemma 12.2 to . If conclusion of Lemma 12.2 holds, then there is a GCD subgraph of satisfying the required conditions of Lemma 13.2, so we are done by taking . Therefore we may assume that conclusion of Lemma 12.2 holds, so that there is some such that and . Since there cannot be two disjoint subsets of of density , we must then have , thus proving (13.2) in this case too.
In conclusion, regardless of the size of we have established (13.2). Next, we fix an integer such that (such an integer exists because ) and we apply Lemma 11.3.
If conclusion of Lemma 11.3 holds, then we take , whose quality satisfies
[TABLE]
and whose edge density satisfies
[TABLE]
In particular, , so the proof is complete in this case.
Thus we may assume that conclusion of Lemma 11.3 holds, so that
[TABLE]
where we recall the notation . Let
[TABLE]
and let
[TABLE]
be the set of edges between and in . Since and for all , we have
[TABLE]
Let be the GCD subgraph of formed by restricting to and . Since , is a non-trivial GCD subgraph. If denotes its edge density, then
[TABLE]
In addition, we have that
[TABLE]
Finally, we apply Lemma 11.2 to the partition
[TABLE]
of into subsets. This produces a GCD subgraph
[TABLE]
of for some with such that
[TABLE]
In addition, Lemma 11.2 implies that the density of , call it , satisfies
[TABLE]
Finally, we note that is a GCD subgraph of with set of primes , edge density , and quality . Taking then gives the result. ∎
Proof of Proposition 8.3.
If , then we can simply take .
If , then we can choose a prime and apply Lemma 13.2. We do this repeatedly to produce a sequence of GCD subgraphs
[TABLE]
such that
[TABLE]
for each , where denotes the edge density of . In addition, we let denote the set of primes associated to , so that .
At each stage, the set is strictly smaller than before. So, after at most steps we arrive at a GCD subgraph of with
[TABLE]
Let denote the edge density of the end graph . Iterating the two inequalities of (13.3) at most times, we find that
[TABLE]
Thus, taking gives the result. ∎
Thus we are just left to establish Proposition 8.2.
14. Proof of Proposition 8.2
Finally, in this section we prove Proposition 8.2, and hence complete the proof of Theorem 1. The proof is similar to that of Proposition 8.1, but more care is required when dealing with the primes coming from .
Lemma 14.1** (Quality increment even when a prime power divides almost all).**
Consider a GCD graph with edge density and let be a prime with . Then there is a GCD subgraph of with set of primes such that
[TABLE]
Proof.
First of all, we may assume without loss of generality that for all sets and , we have that
[TABLE]
Indeed, if does not satisfy (14.1), then we apply Lemma 11.6 with to replace by a non-trivial subgraph that does have this property (noticing that for ). In addition, has the same multiplicative data as and its quality is strictly larger. Hence, we may work with instead. So, from now on, we assume that (14.1) holds.
We now apply Lemma 12.2. If conclusion of Lemma 12.2 holds, then we are done. Thus we may assume that conclusion holds, that is to say there is some such that
[TABLE]
In particular, by (14.1) we see that
[TABLE]
Now, set
[TABLE]
with the convention that . In view of Lemmas 11.3 and 11.4 applied with , we may assume that
[TABLE]
and
[TABLE]
Hence, if we let
[TABLE]
[TABLE]
where we used our assumption that and the inequality for that follows from Taylor’s theorem. We then consider the non-trivial GCD subgraph of formed by restricting the edge set to . Note that
[TABLE]
Now, let . We have the following five possibilities:
- (a)
and , in which case and ; 2. (b)
and , in which case and ; 3. (c)
and , in which case and ; 4. (d)
and , in which case , and ; 5. (e)
and , in which case , and .
We then set , where:
[TABLE]
as well as
[TABLE]
By looking at possibilities (a), (b) and (c), it is easy to check that is a GCD subgraph of (and hence of ). Note that . Similarly, we have . Consequently, its quality satisfies the relation
[TABLE]
(This relation is valid even if .) We separate two cases.
Case 1: .
In this case , so all parameters of are the same as those of except that the set of primes of is instead of and , have been extended to take the value 0 at . As a consequence,
[TABLE]
In particular, by (14.5) we have
[TABLE]
Thus the lemma follows by taking .
Case 2: .
In this case we have
[TABLE]
We also consider the GCD subgraphs and of . Notice that for . Hence, if , then Lemma 11.1 implies that
[TABLE]
Similarly, if , then we have
[TABLE]
Since , we have that . Similarly, we have that . To this end, let be such that
[TABLE]
We note that this implies that
[TABLE]
We also note that , so if then . Similarly if then .
Combining (14.6) and (14.9) with (14.5), we find
[TABLE]
Similarly, provided , (14.7), (14.9) and (14.5) give
[TABLE]
and, provided , (14.8), (14.9) and (14.5) give
[TABLE]
We now claim that at least one of the following inequalities holds:
[TABLE]
If (14.13) holds then by (14.10). If (14.14) holds, then , so , and so by (14.11) and (14.14). Finally, if (14.15) holds, then , so , and so by (14.12) and (14.15). Therefore this claim would complete the proof by choosing according to which of the inequalities (14.13)-(14.15) hold.
Since , at least one of (14.13)-(14.15) holds if we can prove that
[TABLE]
Using the inequality three times, we find that
[TABLE]
Since we also have that for , as well as , we conclude that
[TABLE]
By the arithmetic-geometric mean inequality, we have that and , whence
[TABLE]
Since for , we must have that for , thus completing the proof of the lemma. ∎
Proof of Proposition 8.2.
This follows almost immediately from Lemma 14.1. Our assumptions that and imply that there is a prime lying in . Thus we can apply Lemma 14.1 with this choice of and complete the proof. ∎
This completes the proof of Proposition 8.2, and hence Theorem 1.
15. Concluding remarks and counterexamples to the Model Problem
It is a vital feature of our proof that the weight of all vertices has a factor , as naturally arises from the setup of the Duffin-Schaeffer conjecture. This allows our proof to (just) work, but without weights of this type our argument would fail. At first sight this point may appear to be a mere technicality, but without these weights there are genuine counterexamples to the entire approach.
First, let us see where the proof breaks down without the factors. Although most of the argument holds for a general measure , in Proposition 6.3 we specialize to the measure . In the proof of Proposition 6.3 (in particular, in relation (7.6)), the factor cancels out the factor coming from
[TABLE]
in the definition of quality. Otherwise, the proof of Proposition 6.3 would fail. On the other hand, if we were to modify the definition of the quality and remove from it the product in (15.1), then instead the proof of Lemma 14.1 would break down and we would not obtain a quality increment when there are many primes dividing a proportion of of each vertex set. Thus the argument we present fails without the weights.
Now, let use explain why the presence of the weight is essential for the kind of argument we have given to work. Without using the weights, we essentially are attempting to prove that the Model Problem of Section 3 has an affirmative answer. However, one can construct examples to show that this is not the case. Such examples are based on the observation that all pairwise GCDs of elements of are at least , but there is no fixed integer of size dividing a positive proportion of elements of this set. (We thank Sam Chow for showing us this construction.)
Specifically, we select an integer , a prime , and then take
[TABLE]
It is straightforward to verify that , and . Moreover, we see that if and are two elements of , then
[TABLE]
so all pairs in have a large gcd. However, we can easily check that there is no integer dividing a positive proportion of elements of , and so this shows that the Model Problem of Section 3 has a negative answer.
On the other hand, if we count integers with weight , then the set we defined above has total weight , and so it fails to be of a sufficiently large size unless we take bounded (in which case the prime is of size and it divides a positive proportion of the elements of ). Thus the above counterexample no longer works if we count integers with weight .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. Aistleitner, A note on the Duffin-Schaeffer conjecture with slow divergence. Bull. Lond. Math. Soc. 46 (2014), no. 1, 164–168.
- 2[2] by same author, Decoupling theorems for the Duffin-Schaeffer problem. Progress report (2019), 24 pages, ar Xiv:1907.04590.
- 3[3] C. Aistleitner, T. Lachmann, M. Munsch, N. Technau, and A. Zafeiropoulos, The Duffin-Schaeffer conjecture with extra divergence. Preprint, https://arxiv.org/abs/1803.05703 .
- 4[4] V. Beresnevich, V. Bernik, M. Dodson and S. Velani, Classical metric Diophantine approximation revisited. Analytic number theory, 38–61, Cambridge Univ. Press, Cambridge, 2009.
- 5[5] V. Beresnevich, G. Harman, A. K. Haynes and S. Velani, The Duffin-Schaeffer conjecture with extra divergence II. Math. Z. 275 (2013), no. 1-2, 127–133.
- 6[6] V. Beresnevich and S. Velani, A mass transference principle and the Duffin-Schaeffer conjecture for Hausdorff measures. Ann. of Math. (2) 164 (2006), no. 3, 971–992.
- 7[7] P. A. Catlin, Two problems in metric Diophantine approximation. I. J. Number Theory 8 (1976), no. 3, 282–288.
- 8[8] R. J. Duffin and A. C. Schaeffer, Khinchin’s problem in metric Diophantine approximation. Duke Math. J. 8 (1941), 243–255.
