Explicit estimates for the distribution of numbers free of large prime factors

Jared D. Lichtman; Carl Pomerance

arXiv:1705.02442·math.NT·January 8, 2019


TL;DR

This paper provides explicit, fairly tight upper and lower bounds for the count of smooth numbers, going beyond the traditional asymptotic estimates; in its larger example the bounds are tight enough to rule out two classical main terms as the true value.

Contribution

It makes the saddle point method of Hildebrand and Tenenbaum numerically explicit, producing intervals in which the true count of smooth numbers lies.

Findings

01

Explicit bounds for smooth number counts are tighter than previous estimates.

02

The method can exclude the Dickman-de Bruijn asymptotic estimate as too small.

03

In the larger of the two examples, the bounds also exclude the Hildebrand-Tenenbaum main term as too large.

Abstract

There is a large literature on the asymptotic distribution of numbers free of large prime factors, so-called smooth or friable numbers. But there is very little known about this distribution that is numerically explicit. In this paper we follow the general plan for the saddle point argument of Hildebrand and Tenenbaum, giving explicit and fairly tight intervals in which the true count lies. We give two numerical examples of our method, and with the larger one, our interval is so tight we can exclude the famous Dickman-de Bruijn asymptotic estimate as too small and the Hildebrand-Tenenbaum main term as too large.

Equations

$$\Psi(x,y)\sim x\rho(u)\qquad(y\to\infty,\ x=y^{u})$$

$$u\rho'(u)+\rho(u-1)=0\quad(u>1),\qquad\rho(u)=1\quad(0\leq u\leq 1)$$

$$\Psi(x,y)=x\rho(u)\Big(1+O_{\varepsilon}\Big(\frac{\log(1+u)}{\log y}\Big)\Big)$$

$$\zeta(s,y)=\sum_{\substack{n\geq 1\\ P(n)\leq y}}n^{-s}=\prod_{p\leq y}(1-p^{-s})^{-1}$$

$$\Psi(x,y)=\sum_{\substack{n\leq x\\ P(n)\leq y}}1\leq\sum_{P(n)\leq y}(x/n)^{\sigma}=x^{\sigma}\zeta(\sigma,y).$$

$$\phi_{j}(s,y)=\frac{\partial^{j}}{\partial s^{j}}\log\zeta(s,y).$$

$$\phi_{1}(s,y)=-\sum_{p\leq y}\frac{\log p}{p^{s}-1}$$

$$\Psi(x,y)=\frac{x^{\alpha}\zeta(\alpha,y)}{\alpha\sqrt{2\pi\sigma_{2}(x,y)}}\Big(1+O\Big(\frac{1}{u}+\frac{\log y}{y}\Big)\Big)$$

$$\alpha(x,y)\sim\frac{\log(1+y/\log x)}{\log y}$$

$$\sigma_{2}(x,y)\sim\Big(1+\frac{\log x}{y}\Big)\log x\log y,$$

$$\Psi(x,y)$$

$$\Psi(x,y)\geq x^{1-\log\log x/\log y}=\frac{x}{(\log x)^{u}}$$

$$\Psi(x,y)\leq 1.39\,y^{1-\sigma}x^{\sigma}\zeta(\sigma,y)/\log x$$

$$\frac{B^{-}}{\Psi}\geq 1-\frac{\log x}{c\log 3/\log 2}\quad\text{and}\quad\frac{B^{+}}{\Psi}\leq 1+\frac{2\log x}{c\log 3/\log 2},$$

$$O\Big(\frac{y}{\log_{2}y}+\frac{y\log x}{\log^{2}y}+c\log x\log c\Big)$$

$$O\Big(c\frac{y^{2/3}}{\log y}+c\log x\log c\Big)$$

$$\begin{array}{c|cc}x&10^{100}&10^{500}\\ y&10^{15}&10^{35}\\ \hline \text{KP}&1.786\cdot 10^{84}&1.857\cdot 10^{456}\\ \text{R}&4.599\cdot 10^{96}&9.639\cdot 10^{484}\\ \text{GS}&5.350\cdot 10^{95}&6.596\cdot 10^{483}\\ \text{DD}&2.523\cdot 10^{94}&1.472\cdot 10^{482}\\ \text{HT}&2.652\cdot 10^{94}&1.5127\cdot 10^{482}\\ \Psi^{-}&2.330\cdot 10^{94}&1.4989\cdot 10^{482}\\ \Psi^{+}&2.923\cdot 10^{94}&1.5118\cdot 10^{482}\end{array}$$

KP is the Konyagin-Pomerance lower bound $x/(\log x)^{u}$,

R is the Rankin upper bound $x^{\alpha}\zeta(\alpha,y)$,

GS is the Granville-Soundararajan upper bound $1.39\,y^{1-\alpha}x^{\alpha}\zeta(\alpha,y)/\log x$,

DD is the Dickman-de Bruijn main term $\rho(u)x$, and

HT is the Hildebrand-Tenenbaum main term $x^{\alpha}\zeta(\alpha,y)/(\alpha\sqrt{2\pi\sigma_{2}})$.

$$\Psi(x,y)=\frac{1}{2\pi i}\int_{\sigma-i\infty}^{\sigma+i\infty}\zeta(s,y)\frac{x^{s}}{s}\,ds,$$

$$\Psi(x,y)=\frac{1}{2\pi i}\int_{\alpha-iT}^{\alpha+iT}\zeta(s,y)\frac{x^{s}}{s}\,ds+\text{Error}.$$

$$\Big|\frac{\zeta(s,y)}{\zeta(\alpha,y)}\Big|\leq\exp\Big\{-\sum_{p\leq y}\frac{1-\cos(t\log p)}{p^{\alpha}}\Big\}.$$

$$\sum_{p\leq y}\frac{1-\cos(t\log p)}{p^{\alpha}}.$$

$$W(y,w)=\frac{y^{1-\alpha}-w^{1-\alpha}}{1-\alpha}+\text{error}.$$

$$\big|\text{Error}\big|\leq x^{\alpha}\sum_{\substack{P(n)\leq y\\ T|\log(x/n)|>T^{d}}}\frac{1}{n^{\alpha}}\frac{1}{\pi T|\log(x/n)|}+\sum_{\substack{P(n)\leq y\\ T|\log(x/n)|\leq T^{d}}}\Big(\frac{x}{n}\Big)^{\alpha}\leq\frac{x^{\alpha}\zeta(\alpha,y)}{\pi T^{d}}+e^{\alpha T^{d-1}}\Big[\Psi(xe^{T^{d-1}},y)-\Psi(xe^{-T^{d-1}},y)\Big].$$

$$\psi(x)=\sum_{p^{m}\leq x}\log p,\qquad\vartheta(x)=\sum_{p\leq x}\log p,$$

$$.05\sqrt{x}\leq x-\vartheta(x)\leq 1.95\sqrt{x}.$$


Full text

Explicit estimates for the distribution of numbers free of large prime factors

Jared D. Lichtman

Department of Mathematics, Dartmouth College, Hanover, NH 03755


and

Carl Pomerance

Department of Mathematics, Dartmouth College, Hanover, NH 03755


Abstract.

There is a large literature on the asymptotic distribution of numbers free of large prime factors, so-called smooth or friable numbers. But there is very little known about this distribution that is numerically explicit. In this paper we follow the general plan for the saddle point argument of Hildebrand and Tenenbaum, giving explicit and fairly tight intervals in which the true count lies. We give two numerical examples of our method, and with the larger one, our interval is so tight we can exclude the famous Dickman–de Bruijn asymptotic estimate as too small and the Hildebrand–Tenenbaum main term as too large.

1. Introduction

For a positive integer $n>1$, denote by $P(n)$ the largest prime factor of $n$, and let $P(1)=1$. Let $\Psi(x,y)$ denote the number of $n\leq x$ with $P(n)\leq y$. Such integers $n$ are known as $y$-smooth, or $y$-friable. Asymptotic estimates for $\Psi(x,y)$ are quite useful in many applications, not least of which is in the analysis of factorization and discrete logarithm algorithms.
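For very small parameters the definition of $\Psi(x,y)$ can be checked by direct enumeration. The following Python sketch is an illustration only; the helper names `largest_prime_factor` and `psi_bruteforce` are hypothetical and do not come from the paper, and the approach is hopeless for the sizes of $x$ treated later.

```python
def largest_prime_factor(n: int) -> int:
    """Return P(n), the largest prime factor of n, with P(1) = 1."""
    largest = 1
    p = 2
    while p * p <= n:
        while n % p == 0:
            largest = p
            n //= p
        p += 1
    # Whatever remains (if > 1) is a prime exceeding every factor removed so far.
    return n if n > 1 else largest


def psi_bruteforce(x: int, y: int) -> int:
    """Count the integers 1 <= n <= x with P(n) <= y by direct enumeration."""
    return sum(1 for n in range(1, x + 1) if largest_prime_factor(n) <= y)


print(psi_bruteforce(100, 5))  # number of 5-smooth integers up to 100
```

The cost grows at least linearly in $x$, which is why explicit analytic bounds are needed for values such as $x=10^{100}$.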

One of the earliest results is due to Dickman [7] in 1930, who gave an asymptotic formula for $\Psi(x,y)$ in the case that $x$ is a fixed power of $y$. Dickman showed that

$$\Psi(x,y)\sim x\rho(u)\qquad(y\to\infty,\ x=y^{u})\tag{1.1}$$

for every fixed $u\geq 1$ , where $\rho(u)$ is the “Dickman–de Bruijn” function, defined to be the continuous solution of the delay differential equation

$$u\rho'(u)+\rho(u-1)=0\quad(u>1),\qquad\rho(u)=1\quad(0\leq u\leq 1).$$
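The Dickman–de Bruijn function has no closed form beyond small $u$, but it is easy to tabulate numerically. The sketch below is an illustration under stated assumptions: it uses the standard equivalent integral form $u\rho(u)=\int_{u-1}^{u}\rho(t)\,dt$ with initial condition $\rho(u)=1$ on $[0,1]$, discretised by the trapezoid rule. The function name `dickman_rho_table` and the step size are our own choices, and no rigorous error control is attempted.

```python
import math


def dickman_rho_table(u_max: float, steps_per_unit: int = 1000) -> list[float]:
    """Tabulate rho(k / steps_per_unit) for 0 <= k <= u_max * steps_per_unit.

    Uses u * rho(u) = integral of rho over [u - 1, u], discretised with the
    trapezoid rule; the unknown rho(u) appears on both sides and is solved for.
    """
    n = steps_per_unit
    h = 1.0 / n
    total = int(round(u_max * n))
    rho = [1.0] * (total + 1)  # rho(u) = 1 on [0, 1]
    # window = sum of rho[k-n+1 .. k-1], the interior trapezoid nodes.
    window = float(n - 1)  # for k = n + 1 every interior node equals 1
    for k in range(n + 1, total + 1):
        u = k * h
        rho[k] = h * (0.5 * rho[k - n] + window) / (u - 0.5 * h)
        # Slide the window so it covers rho[k-n+2 .. k] for the next step.
        window += rho[k] - rho[k - n + 1]
    return rho


table = dickman_rho_table(3.0)
print(table[2000], 1.0 - math.log(2.0))  # rho(2) against the exact value 1 - log 2
```

On $1\leq u\leq 2$ the equation integrates to $\rho(u)=1-\log u$, which gives a convenient sanity check for the discretisation.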

There remain the questions of the error in the approximation (1.1), and also the case when $u=\log x/\log y$ is allowed to grow with $x$ and $y$ . In 1951, de Bruijn [4] proved that

$$\Psi(x,y)=x\rho(u)\Big(1+O_{\varepsilon}\Big(\frac{\log(1+u)}{\log y}\Big)\Big)$$

holds uniformly for $x\geq 2$, $\exp\{(\log x)^{5/8+\varepsilon}\}<y\leq x$, for any fixed $\varepsilon>0$. After improvements in the range of this result by Maier and Hensley, Hildebrand [12] showed that the de Bruijn estimate holds when $\exp\{(\log\log x)^{5/3+\varepsilon}\}\leq y\leq x$.

In 1986, Hildebrand and Tenenbaum [13] provided a uniform estimate for $\Psi(x,y)$ for all $x\geq y\geq 2$ , yielding an asymptotic formula when $y$ and $u$ tend to infinity. The starting point for their method is an elementary argument of Rankin [17] from 1938, commonly known now as Rankin’s “trick”. For complex $s$ , define

$$\zeta(s,y)=\sum_{\substack{n\geq 1\\ P(n)\leq y}}n^{-s}=\prod_{p\leq y}(1-p^{-s})^{-1}$$

(where $p$ runs over primes) as the partial Euler product of the Riemann zeta function $\zeta(s)$ . Then for $0<\sigma<1$ , we have

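$$\Psi(x,y)=\sum_{\substack{n\leq x\\ P(n)\leq y}}1\leq\sum_{P(n)\leq y}(x/n)^{\sigma}=x^{\sigma}\zeta(\sigma,y).\tag{1.2}$$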

Then $\sigma$ can be chosen optimally to minimize $x^{\sigma}\zeta(\sigma,y)$ .
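Rankin's bound is simple enough to try on a toy case. The following Python sketch is illustrative only: it minimises $x^{\sigma}\zeta(\sigma,y)$ over a coarse grid of $\sigma\in(0,1]$ (a crude stand-in for solving the saddle point equation) and compares the result with an exact recursive count. The helper names and the parameters $x=10^{6}$, $y=20$ are our own assumptions.

```python
def primes_up_to(y: int) -> list[int]:
    """Sieve of Eratosthenes for the primes p <= y (requires y >= 2)."""
    sieve = [True] * (y + 1)
    sieve[0] = sieve[1] = False
    for p in range(2, int(y ** 0.5) + 1):
        if sieve[p]:
            for m in range(p * p, y + 1, p):
                sieve[m] = False
    return [p for p in range(2, y + 1) if sieve[p]]


def count_smooth(x: int, primes: list[int]) -> int:
    """Exact count of n <= x all of whose prime factors lie in `primes`."""
    if x < 1:
        return 0
    total = 1  # n = 1
    for i, p in enumerate(primes):
        if p > x:
            break
        # Integers whose smallest prime factor is p: n = p * m, m built from primes[i:].
        total += count_smooth(x // p, primes[i:])
    return total


def zeta_partial(sigma: float, primes: list[int]) -> float:
    """Partial Euler product zeta(sigma, y) over the given primes."""
    prod = 1.0
    for p in primes:
        prod /= 1.0 - p ** (-sigma)
    return prod


def rankin_bound(x: int, primes: list[int], grid: int = 200) -> tuple[float, float]:
    """Minimise x^sigma * zeta(sigma, y) over sigma = k / grid, 1 <= k <= grid."""
    best_sigma, best = 1.0, float("inf")
    for k in range(1, grid + 1):
        sigma = k / grid
        value = x ** sigma * zeta_partial(sigma, primes)
        if value < best:
            best_sigma, best = sigma, value
    return best_sigma, best


primes = primes_up_to(20)
x = 10 ** 6
sigma, bound = rankin_bound(x, primes)
print(count_smooth(x, primes), sigma, bound)
```

Any $\sigma>0$ gives a valid upper bound, so the grid search cannot produce a false inequality; it only affects how close the bound is to optimal.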

Let

$$\phi_{j}(s,y)=\frac{\partial^{j}}{\partial s^{j}}\log\zeta(s,y).$$

The function

$$\phi_{1}(s,y)=-\sum_{p\leq y}\frac{\log p}{p^{s}-1}$$

is especially useful since the solution $\alpha=\alpha(x,y)$ to $\phi_{1}(\alpha,y)+\log x=0$ gives the optimal $\sigma$ in (1.2). We also denote $\sigma_{j}(x,y)=|\phi_{j}(\alpha(x,y),y)|$ .

In this language, Hildebrand and Tenenbaum [13] proved that the estimate

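$$\Psi(x,y)=\frac{x^{\alpha}\zeta(\alpha,y)}{\alpha\sqrt{2\pi\sigma_{2}(x,y)}}\Big(1+O\Big(\frac{1}{u}+\frac{\log y}{y}\Big)\Big)$$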

holds uniformly for $x\geq y\geq 2$ . As suggested by this formula, quantities $\alpha(x,y)$ and $\sigma_{2}(x,y)$ are of interest in their own right, and were given uniform estimates which imply the formulae

$$\alpha(x,y)\sim\frac{\log(1+y/\log x)}{\log y}$$

and

$$\sigma_{2}(x,y)\sim\Big(1+\frac{\log x}{y}\Big)\log x\log y,$$

which together imply

[TABLE]

These formulae indicate that $\Psi(x,y)$ undergoes a “phase change” when $y$ is of order $\log x$ , see [3]. This paper concentrates on the range where $y$ is considerably larger, say $y>(\log x)^{4}$ .

The primary aim of this paper is to make the Hildebrand–Tenenbaum method explicit and so effectively construct an algorithm for obtaining good bounds for $\Psi(x,y)$ .

1.1. Explicit Results

Beyond the Rankin upper bound $\Psi(x,y)\leq x^{\alpha}\zeta(\alpha,y)$ , we have the explicit lower bound

$$\Psi(x,y)\geq x^{1-\log\log x/\log y}=\frac{x}{(\log x)^{u}}$$

due to Konyagin and Pomerance [11]. Recently Granville and Soundararajan [10] found an elementary improvement of Rankin’s upper bound, which they have graciously permitted us to include in an appendix in this paper. In particular, they show that

$$\Psi(x,y)\leq 1.39\,y^{1-\sigma}x^{\sigma}\zeta(\sigma,y)/\log x$$

for every value of $\sigma\in[1/\log y,1]$ , see Theorem 5.1.

In another direction, by relinquishing the goal of a compact formula, several authors have devised algorithms to compute bounds on $\Psi(x,y)$ for given $x,y$ as inputs. For example, using an accuracy parameter $c$ , Bernstein [2] created an algorithm to generate bounds $B^{-}(x,y)\leq\Psi(x,y)\leq B^{+}(x,y)$ with

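$$\frac{B^{-}}{\Psi}\geq 1-\frac{\log x}{c\log 3/\log 2}\quad\text{and}\quad\frac{B^{+}}{\Psi}\leq 1+\frac{2\log x}{c\log 3/\log 2},$$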

running in

$$O\Big(\frac{y}{\log_{2}y}+\frac{y\log x}{\log^{2}y}+c\log x\log c\Big)$$

time. Parsell and Sorenson [15] refined this algorithm to run in

$$O\Big(c\frac{y^{2/3}}{\log y}+c\log x\log c\Big)$$

time, as well as obtaining faster and tighter bounds assuming the Riemann Hypothesis. The largest example computed by this method was an approximation of $\Psi(2^{255},2^{28})$ .

As seen in Figure 1, the bounds presented in this paper far outshine best-known upper and lower bounds for the two examples presented. We also provide the main term estimates $x^{\alpha}\zeta(\alpha,y)/\alpha\sqrt{2\pi\sigma_{2}}$ from [13] and $\rho(u)x$ from [7] as points of reference. It is interesting that our estimates in the second example are closer to the truth than are the Dickman–de Bruijn and Hildebrand–Tenenbaum main terms. The second-named author has asked if $\Psi(x,y)\geq x\rho(u)$ holds in general for $x\geq 2y\geq 2$ , see [9, (1.25)]. This inequality is known for $u$ bounded and $x$ sufficiently large, see the discussion in [14, Section 9].

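Figure 1.

$$\begin{array}{c|cc}x&10^{100}&10^{500}\\ y&10^{15}&10^{35}\\ \hline \text{KP}&1.786\cdot 10^{84}&1.857\cdot 10^{456}\\ \text{R}&4.599\cdot 10^{96}&9.639\cdot 10^{484}\\ \text{GS}&5.350\cdot 10^{95}&6.596\cdot 10^{483}\\ \text{DD}&2.523\cdot 10^{94}&1.472\cdot 10^{482}\\ \text{HT}&2.652\cdot 10^{94}&1.5127\cdot 10^{482}\\ \Psi^{-}&2.330\cdot 10^{94}&1.4989\cdot 10^{482}\\ \Psi^{+}&2.923\cdot 10^{94}&1.5118\cdot 10^{482}\end{array}$$

Here,

KP is the Konyagin–Pomerance lower bound $x/(\log x)^{u}$,

R is the Rankin upper bound $x^{\alpha}\zeta(\alpha,y)$,

GS is the Granville–Soundararajan upper bound $1.39\,y^{1-\alpha}x^{\alpha}\zeta(\alpha,y)/\log x$,

DD is the Dickman–de Bruijn main term $\rho(u)x$, and

HT is the Hildebrand–Tenenbaum main term $x^{\alpha}\zeta(\alpha,y)/(\alpha\sqrt{2\pi\sigma_{2}})$.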

Our principal result, which benefits from some notation developed over the course of the paper, is Theorem 3.11. It is via this theorem that we were able to estimate $\Psi(10^{100},10^{15})$ and $\Psi(10^{500},10^{35})$ as in the table above.
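Of the reference values in Figure 1, the Konyagin–Pomerance lower bound $x/(\log x)^{u}$ is the only one with a closed form that needs no prime sums, so it can be re-derived in a few lines. The sketch below is an illustration, with a hypothetical function name; it works in base-10 logarithms because $10^{500}$ overflows floating point.

```python
import math


def kp_lower_bound_log10(log10_x: float, log10_y: float) -> float:
    """Return log10 of x / (log x)^u, where u = log x / log y."""
    u = log10_x / log10_y
    ln_x = log10_x * math.log(10.0)  # natural logarithm of x
    return log10_x - u * math.log10(ln_x)


for lx, ly in [(100, 15), (500, 35)]:
    value = kp_lower_bound_log10(lx, ly)
    exponent = math.floor(value)
    mantissa = 10.0 ** (value - exponent)
    print(f"x = 10^{lx}, y = 10^{ly}: KP ~ {mantissa:.3f} * 10^{exponent}")
```

The printed mantissas and exponents can be compared with the KP row of Figure 1; the remaining rows depend on $\alpha$, $\zeta(\alpha,y)$ and $\sigma_{2}$ and require the machinery of Section 4.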

2. Plan for the paper

The basic strategy of the saddle-point method relies on Perron’s formula, which implies the identity

$$\Psi(x,y)=\frac{1}{2\pi i}\int_{\sigma-i\infty}^{\sigma+i\infty}\zeta(s,y)\frac{x^{s}}{s}\,ds,$$

for any $\sigma>0$ . It turns out that the best value of $\sigma$ to use is $\alpha=\alpha(x,y)$ discussed in the Introduction. We are interested in abridging the integral at a certain height $T$ and then approximating the contribution given by the tail. To this end, we have

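$$\Psi(x,y)=\frac{1}{2\pi i}\int_{\alpha-iT}^{\alpha+iT}\zeta(s,y)\frac{x^{s}}{s}\,ds+\text{Error}.\tag{2.1}$$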

There is a change in behavior occurring in $\zeta(s,y)$ when $t$ is on the order $1/\log y$ . In [13] it is shown that

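$$\Big|\frac{\zeta(s,y)}{\zeta(\alpha,y)}\Big|\leq\exp\Big\{-\sum_{p\leq y}\frac{1-\cos(t\log p)}{p^{\alpha}}\Big\}.\tag{2.2}$$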

Thus when $t$ is small (compared to $1/\log y$ ) the oscillatory terms are in resonance, and when $t$ is large the oscillatory terms should exhibit cancellation. This behavior suggests we should divide our range of integration into $|t|\leq T_{0}$ and $T_{0}<|t|<T$ , where $T_{0}\approx 1/\log y$ is a parameter to be optimized.
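The change of behaviour near $t\approx 1/\log y$ is visible numerically in the factor $\exp\{-\sum_{p\leq y}(1-\cos(t\log p))/p^{\alpha}\}$ that bounds $|\zeta(\alpha+it,y)/\zeta(\alpha,y)|$. The following Python sketch evaluates this factor for a few values of $t$. It is illustrative only: $y=10^{4}$ and $\alpha=0.7$ are arbitrary choices, not a computed saddle point, and the helper names are hypothetical.

```python
import math


def primes_up_to(y: int) -> list[int]:
    """Primes p <= y by trial division (adequate for the small y used here)."""
    return [p for p in range(2, y + 1)
            if all(p % q for q in range(2, math.isqrt(p) + 1))]


def decay_factor(t: float, alpha: float, primes: list[int]) -> float:
    """exp(-sum_p (1 - cos(t log p)) / p^alpha), the right side of the bound."""
    total = sum((1.0 - math.cos(t * math.log(p))) / p ** alpha for p in primes)
    return math.exp(-total)


primes = primes_up_to(10_000)
alpha = 0.7  # illustrative value only, not a computed saddle point
scale = 1.0 / math.log(10_000)
for t in (0.01 * scale, scale, 1.0, 10.0):
    print(f"t = {t:.4g}: factor = {decay_factor(t, alpha, primes):.3e}")
```

For $t$ well below $1/\log y$ every cosine is close to $1$ and the factor stays near $1$, whereas for larger $t$ the terms $1-\cos(t\log p)$ accumulate and the factor becomes small.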

The contribution for $|t|\leq T_{0}$ will constitute a “main term”, and so we will try to estimate this part very carefully. In this range we forgo (2.2) and attack the integrand $\zeta(s,y)x^{s}/s$ directly. The basic idea is to expand $\phi(s,y)=\log\zeta(s,y)$ as a Taylor series in $t$. This approach, when carefully done, gives us fairly close upper and lower bounds for the integral. In our smaller example, the upper bound is less than 1% higher than the lower bound, and in the larger example, this is better by a factor of 20. Considerably more noise is encountered beyond $T_{0}$ and in the Error in (2.1).

For the second range $T_{0}<|t|<T$ , we focus on obtaining a satisfactory lower bound on the sum over primes,

$$\sum_{p\leq y}\frac{1-\cos(t\log p)}{p^{\alpha}}.$$

Our strategy is to sum the first $L$ terms directly, and then obtain an analytic formula $W(y,w)$ to lower bound the remaining terms starting at some $w\geq L$ , where essentially

$$W(y,w)=\frac{y^{1-\alpha}-w^{1-\alpha}}{1-\alpha}+\text{error}.$$

With an explicit version of Perron’s formula, the Error in (2.1) may be handled by

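$$\big|\text{Error}\big|\leq x^{\alpha}\sum_{\substack{P(n)\leq y\\ T|\log(x/n)|>T^{d}}}\frac{1}{n^{\alpha}}\frac{1}{\pi T|\log(x/n)|}+\sum_{\substack{P(n)\leq y\\ T|\log(x/n)|\leq T^{d}}}\Big(\frac{x}{n}\Big)^{\alpha}\leq\frac{x^{\alpha}\zeta(\alpha,y)}{\pi T^{d}}+e^{\alpha T^{d-1}}\Big[\Psi(xe^{T^{d-1}},y)-\Psi(xe^{-T^{d-1}},y)\Big].$$

Here $d\approx\frac{1}{2}$ is a parameter of our choosing, which we set to balance the two terms above. Thus the problem of bounding $|\text{Error}|$ is reduced to estimating the number of $y$-smooth integers in the “short” interval $\big(xe^{-T^{d-1}},xe^{T^{d-1}}\big]$.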
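The passage from the two sums to a short-interval count is a one-line computation. The following worked step is a sketch added for the reader, using only the two summation conditions in the bound for $|\text{Error}|$. Note that the exponent which arises is $d-1<0$, so the interval shrinks as $T$ grows.

$$T|\log(x/n)|\leq T^{d}\iff|\log(x/n)|\leq T^{d-1}\iff xe^{-T^{d-1}}\leq n\leq xe^{T^{d-1}},$$

and for such $n$,

$$\Big(\frac{x}{n}\Big)^{\alpha}\leq e^{\alpha T^{d-1}},$$

so the second sum is at most $e^{\alpha T^{d-1}}$ times the number of $y$-smooth integers in that range. In the complementary range $T|\log(x/n)|>T^{d}$ each summand satisfies

$$\frac{1}{n^{\alpha}}\cdot\frac{1}{\pi T|\log(x/n)|}<\frac{1}{n^{\alpha}}\cdot\frac{1}{\pi T^{d}},$$

and summing $n^{-\alpha}$ over all $y$-smooth $n$ gives $\zeta(\alpha,y)$, which accounts for the term $x^{\alpha}\zeta(\alpha,y)/(\pi T^{d})$.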

This latter portion is better handled when $T$ is large, but the earlier portion in the range $[T_{0},T]$ is better handled when $T$ is small. Thus, $T$ is numerically set to balance these two forces.

In our proofs we take full advantage of some recent calculations involving the prime-counting function $\pi(x)$ and the Chebyshev functions

$$\psi(x)=\sum_{p^{m}\leq x}\log p,\qquad\vartheta(x)=\sum_{p\leq x}\log p,$$

with $p$ running over primes and $m$ running over positive integers. As a corollary of the papers [5], [6] of Büthe we have the following excellent result.

Proposition 2.1.

For $1427\leq x\leq 10^{19}$ we have

$$.05\sqrt{x}\leq x-\vartheta(x)\leq 1.95\sqrt{x}.$$

We have

[TABLE]

Proof.

The first assertion is one of the main results in Büthe [6]. Let $H$ be a number such that all zeros of the Riemann zeta-function with imaginary parts in $[0,H]$ lie on the $1/2$ -line. Inequality (7.4) in Büthe [5] asserts that if $x/\log x\leq H^{2}/4.92^{2}$ and $x\geq 5000$ , then

[TABLE]

We can take $H=3\cdot 10^{10}$ , see Platt [16]. Thus, we have the result in the range $10^{19}\leq x\leq e^{45}$ . For $x\geq e^{45}$ we have from Büthe [5] that $|\psi(x)-x|/x\leq 1.118\cdot 10^{-8}$ . Further, we have (see [18, (3.39)]) for $x>0$ ,

[TABLE]

(This result can be improved, but it is not important to us.) Thus, for $x\geq e^{45}$ we have $|\vartheta(x)-x|/x\leq 1.151\cdot 10^{-8}$, establishing our result in this range. For the latter two ranges we argue similarly, using $|\psi(x)-x|/x\leq 1.165\cdot 10^{-9}$ when $x\geq e^{50}$ and $|\psi(x)-x|/x\leq 2.885\cdot 10^{-10}$ for $x\geq e^{55}$, both of these inequalities coming from [5]. ∎

We remark that there are improved inequalities at higher values of $x$ , found in [5] and [8], which one would want to use if estimating $\Psi(x,y)$ for larger values of $y$ than we have done here.

3. The main argument

As in the Introduction, for complex $s$ , define

$$\zeta(s,y)=\sum_{\substack{n\geq 1\\ P(n)\leq y}}n^{-s}=\prod_{p\leq y}(1-p^{-s})^{-1},$$

which is the Riemann zeta function restricted to $y$ -smooth numbers, and for $j\geq 0$ , let

$$\phi_{j}(s,y)=\frac{\partial^{j}}{\partial s^{j}}\log\zeta(s,y).$$

We have the explicit formulae,

[TABLE]

Note that for $y\geq 2$, $\sigma>0$, $\phi_{1}(\sigma,y)$ is strictly increasing from $-\infty$ to $0$, so there is a unique solution $\alpha=\alpha(x,y)>0$ to the equation

$$\phi_{1}(\alpha,y)+\log x=0.$$

Since we cannot exactly solve this equation, we shall assume any choice of $\alpha$ that we use is a reasonable approximation to the exact solution, and we must take into account an upper bound for the difference between our value and the exact value. We denote

[TABLE]

so that the Taylor series of $\phi(s,y)=\log\zeta(s,y)$ about $s=\alpha$ is

[TABLE]

Our first result, which is analogous to Lemma 10 in [13], sets the stage for our estimates.

Lemma 3.1.

Let $0<d<1$ and $T>1$ . We have that

[TABLE]

Proof.

We have

[TABLE]

where the interchange of sum and integral is justified since $\zeta(s,y)$ is a finite product, hence uniformly convergent as a sum.

By Perron’s formula (see [1, §11.12]), we have

[TABLE]

Together these imply

[TABLE]

This completes the proof. ∎

In using this result we have the problems of performing the integration from $\alpha-iT$ to $\alpha+iT$ and estimating the number of $y$-smooth integers in the interval $\big(xe^{-T^{d-1}},xe^{T^{d-1}}\big]$. We turn first to the integral evaluation.

Recall that $B_{j}=B_{j}(t)=\sigma_{j}(x,y)t^{j}/j!$ and let $B_{1}^{*}=B_{1}^{*}(t)=t\log x-B_{1}(t)$ . Note that $B_{1}^{*}=0$ if $\alpha$ is chosen perfectly.

Lemma 3.2.

For $s=\alpha+it$ , we have

[TABLE]

where $a_{5},b_{5}$ are real numbers, depending on the choice of $t$ , with $|a_{5}+ib_{5}|\leq B_{5}(t)$ .

Proof.

We expand $\phi(\alpha+it,y)=\log\zeta(\alpha+it,y)$ in a Taylor series around $t=0$ . There exists some real $\xi$ between 0 and $t$ such that

[TABLE]

Since $\zeta(s,y)=\exp(\phi(s,y))$ , we obtain

[TABLE]

Letting $i\phi_{5}(\alpha+i\xi)t^{5}/5!=a_{5}+b_{5}i$ , we have

[TABLE]

and taking the real part gives the result. ∎

The main contribution to the integral in Lemma 3.1 turns out to come from the interval $[-T_{0},T_{0}]$ , where $T_{0}$ is fairly small. We have

[TABLE]

Note that the integrand, written as a Taylor series around $s=\alpha$ , has real coefficients, so the real part is an even function of $t$ and the imaginary part is an odd function. Thus, the integral is real, and its value is double the value of the integral on $[0,T_{0}]$ .

Consider the cosine, sine combination in Lemma 3.2:

[TABLE]

and let

[TABLE]

We have, for each value of $t$ , the constraint that $|v|\leq v_{0}(t)$ . The partial derivative of $f(t,v)$ with respect to $v$ is zero when $\arctan(t/\alpha)-B_{3}(t)\equiv 0\pmod{\pi}$ . Let

[TABLE]

If $u(t)\not\in[-v_{0}(t),v_{0}(t)]$ , then $f(t,v)$ is monotone in $v$ on that interval; otherwise it has a min or max at $u(t)$ . Let $T_{3},T_{2},T_{1},T_{0}$ be defined, respectively, as the least positive solutions of the equations

[TABLE]

Then $0<T_{3}<T_{2}<T_{1}<T_{0}$ . We have the following properties for $f(t,v)$ :

(1)

For $t$ in the interval $[0,T_{3}]$ we have $f(t,v)$ increasing for $v\in[-v_{0}(t),v_{0}(t)]$ , so that

[TABLE]

(2)

For $t$ in the interval $[T_{3},T_{2}]$ , we have $f(t,v)$ increasing for $-v_{0}(t)\leq v\leq u(t)$ and then decreasing for $u(t)\leq v\leq v_{0}(t)$ . Thus,

[TABLE]

(3)

For $t\in[T_{2},T_{1}]$ , $f(t,v)$ is decreasing for $v\in[-v_{0}(t),v_{0}(t)]$ , so that

[TABLE]

(4)

For $t\in[T_{1},T_{0}]$ , we have $f(t,v)$ decreasing for $v\in[-v_{0}(t),u(t)+\pi]$ and increasing for $v\in[u(t)+\pi,v_{0}(t)]$ ; that is,

[TABLE]

Note too that $f(t,v)$ has a sign change from positive to negative in the interval $[T_{2},T_{1}]$ . Let $Z^{-},Z^{+}$ be, respectively, the least positive roots of $f(t,v(t))=0$ , $f(t,-v(t))=0$ .

Let $I_{0}^{+}$ be an upper bound for the function appearing in Lemma 3.2 on $[0,T_{0}]$ using $|a_{5}|,|b_{5}|\leq B_{5}$ and the above facts about $f(t,v)$ , and let $I_{0}^{-}$ be the corresponding lower bound. We choose $a_{5}=B_{5}$ in $I_{0}^{+}$ when the cos, sin combination is positive, and $a_{5}=-B_{5}$ when it is negative. For $I_{0}^{-}$ , we choose $a_{5}$ in the reverse way.

Let

[TABLE]

We thus have the following result, which is our analogue of Lemma 11 in [13].

Lemma 3.3.

We have

[TABLE]

In order to estimate the integral in Lemma 3.1 when $|t|>T_{0}$ we must know something about prime sums to $y$ .

Lemma 3.4.

We have

[TABLE]

where

[TABLE]

and

[TABLE]

Proof.

For $0\leq v\leq 1<t$ , equation (3.14) in [13] states that

[TABLE]

Applied to (3.17) in [13] with $v=(1-\cos(t\log p))/2$ , we have that

[TABLE]

This completes the proof. ∎

Our goal now is to find a way to estimate $W(v,w,t)$ . The following result is analogous to Lemma 6 in [13].

Lemma 3.5.

Let $s$ be a complex number, let $1<w<v$ , and define

[TABLE]

(i) If $v\leq 10^{19}$ we have

[TABLE]

(ii) If $10^{19}\leq w\leq v$ we have

[TABLE]

where $\beta=1-\alpha$ and

[TABLE]

Proof.

(i) By partial summation,

[TABLE]

so by the first part of Proposition 2.1,

[TABLE]

(ii) Similarly, by the second part of Proposition 2.1,

[TABLE]

∎

The following result plays the role of Corollary 6.1 in [13].

Lemma 3.6.

For $t\in{\mathbb{R}}$ , $z>1$ , and $\beta=1-\alpha$ , let

[TABLE]

(i) For $1427\leq w<v\leq 10^{19}$ we have that $W(v,w,t)\geq W_{0}(v,w,t)$ , where

[TABLE]

(ii) For $10^{19}\leq w<v$ we have that $W(v,w,t)\geq W_{0}(v,w,t)$ , where

[TABLE]

Proof.

We apply Lemma 3.5 with $s=1-\beta$ and $s=1-\beta+it$ , and take the real part of the difference. Letting the difference of the sums be $S$ , we have that

[TABLE]

which is the sum we wish to bound.

For a positive real number $z$ , let $S_{z}:=\frac{z^{\beta}}{\beta}-\frac{z^{\beta-it}}{\beta-it}$ . We have that

[TABLE]

so by Lemma 3.6,

[TABLE]

Thus,

[TABLE]

Recalling the definition of $F_{s}(v,w)$ , we have

[TABLE]

which gives the desired result by (3.4) and Lemma 3.5. ∎

From Lemma 3.4, we see that a goal is to bound $W(y,1,t)$ from below, and pieces of this sum are bounded by Lemma 3.6. Ideally, if $y$ were sufficiently small $W$ could be computed directly and the problem settled. In practice $W$ might only be computed up to some convenient number $L$ , suitable for numerical integration, after which the analytic bound $W_{0}(y,w,t)$ may be used. Still, there are further refinements to be made. Just as $x/\log x$ loses out to $\textnormal{li}(x)$ , $W_{0}$ on a long interval is smaller than $W_{0}$ summed on a partition of the interval into shorter parts. This plan is reflected in the following lemma.

Lemma 3.7.

If $v,w$ satisfy the hypotheses of Lemma 3.5, let

[TABLE]

Suppose that $w,L$ satisfy $1427,L\leq w$ . If $y\leq 10^{19}$ , then

[TABLE]

If $y>e^{55}$ and $1427,L\leq w\leq 10^{19}$ , let

[TABLE]

Then

[TABLE]

We remark that if $10^{19}<y\leq e^{55}$ , then there is an appropriate inequality for $J_{1}$ involving fewer $W_{j}$ ’s. If $y$ is much larger than our largest example of $y=10^{35}$ , one might wish to use better approximations to $\vartheta(y)$ than were used in Proposition 2.1.

Proof.

If $1427\leq w<v$ and $[w,v]$ satisfy the hypotheses of Lemma 3.5, we have

[TABLE]

The result then follows from Lemma 3.4. ∎

Remark 3.8.

We implement Lemma 3.7 by choosing $L$ as large as possible so as not to interfere overly with numerical integration. We have found that $L=10^{6}$ works well. The ratio $e$ in the definition of $W_{*}$ is convenient, but might be tweaked for slightly better results. The individual terms in the sum $W(L,1,t)$ are as in (3.2), except for the first 30 primes, where instead we forgo using the inequality in (3.3), using instead the slightly larger expression

[TABLE]

We choose $w$ as a function $w(t)$ in such a way that the bound in Lemma 3.6 is minimized. For simplicity, we ignore the oscillating terms, i.e., we set

[TABLE]

equal to 0. Multiplying by $w^{1/2+\alpha}$ and solving for $w$ gives

[TABLE]

We let

[TABLE]

Our next result, based on [13, Lemma 9], gives a bound on the number of $y$ -smooth integers in a short interval.

Lemma 3.9.

Let $0<d<1$ , $T>1$ be such that $z:=(e^{2T^{d-1}}-1)^{-1}>1$ . We have

[TABLE]

where, with $W(y,w,t)$ as in Lemma 3.6,

[TABLE]

Proof.

Let $\xi=xe^{-T^{d-1}}$ , so that

[TABLE]

For $\xi<n\leq\xi+\frac{\xi}{z}$ , we have that

[TABLE]

so $0>\log(\xi/n)\geq-\log(1+1/z)\geq-\frac{1}{z}$ , which implies that $0<[z\log(\xi/n)]^{2}\leq 1.$ Thus,

[TABLE]

For $\sigma,v\in{\mathbb{R}}$ , we have the formula

[TABLE]

Letting $\sigma=\alpha/z$ , $v=-z\log(\xi/n)$ , we obtain

[TABLE]

Since $\alpha\leq 1\leq z$ , changing variables $t\mapsto t/z$ and taking the modulus gives

[TABLE]

This last integral may be estimated by the method of Lemma 3.4, giving

[TABLE]

We have

[TABLE]

and the lemma now follows from (3.5) and the definition of $\xi$ . ∎

Remark 3.10.

For $t$ large, say $t>2z\log z$ , we can ignore the term $W(y,1,t)$ in $J_{2}$ , getting a suitably tiny numerical estimate for the tail of this rapidly converging integral. The part for $t$ small may be integrated numerically with $w(t),L$ as in Remark 3.8.

With these lemmas, we now have our principal result.

Theorem 3.11.

Let $d,T,z$ be as in Lemma 3.9, let $J_{0}^{\pm}$ be as in (3.1), $J_{1}$ as in Lemma 3.4, and $J_{2}$ as in Lemma 3.9. We have

[TABLE]

and

[TABLE]

4. Computations

In this section we give some guidance on how, for a given pair $x,y$ , the numbers $\alpha$ , $\zeta(\alpha,y)$ , and $\sigma_{j}$ for $j\leq 5$ may be numerically approximated. Further, we discuss how these data may be used to numerically approximate $\Psi(x,y)$ via Theorem 3.11.

4.1. Computing $\alpha$

Given a number $a\in(0,1)$ and a large number $y$ we may obtain upper and lower bounds for the sum

[TABLE]

First, we choose a moderate bound $w_{0}\leq y$ where we can compute the sum $\sigma_{1}(a,w_{0})$ relatively easily, such as $w_{0}=179{,}424{,}673$ , the ten-millionth prime. The sum

[TABLE]

may be approximated easily with Proposition 2.1 and partial summation. Let $l^{-}(a,w_{0},y)$ be a lower bound for this sum and let $l^{+}(a,w_{0},y)$ be an upper bound. Then

[TABLE]

We choose $\alpha$ as a number $a$ where $\log x$ lies between these two bounds. If a given trial for $a$ is too small, this is detected by our lower bound for $\sigma_{1}(a,y)$ lying above $\log x$ , and if $a$ is too large, we see this if our upper bound for $\sigma_{1}(a,y)$ lies below $\log x$ . It does not take long via linear interpolation to find a reasonable choice for $\alpha$ . While narrowing in, one might use a less ambitious choice for $w_{0}$ .
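The search for $\alpha$ can be sketched for small $y$, where the sum over primes can be evaluated directly. The Python sketch below is illustrative and differs from the procedure described above in two labelled ways: it sums over all primes $p\leq y$ exactly rather than bounding the tail with Proposition 2.1, and it uses plain bisection instead of linear interpolation. The names `sigma1` and `solve_alpha` are hypothetical; `sigma1(a, primes)` stands for $\sum_{p\leq y}\log p/(p^{a}-1)$, which is decreasing in $a$.

```python
import math


def primes_up_to(y: int) -> list[int]:
    """Primes p <= y by trial division (adequate for small y)."""
    return [p for p in range(2, y + 1)
            if all(p % q for q in range(2, math.isqrt(p) + 1))]


def sigma1(a: float, primes: list[int]) -> float:
    """Sum of log p / (p^a - 1) over the given primes; decreasing in a."""
    # expm1 keeps p^a - 1 accurate when a is close to 0.
    return sum(math.log(p) / math.expm1(a * math.log(p)) for p in primes)


def solve_alpha(x: float, primes: list[int], tol: float = 1e-12) -> float:
    """Bisection for the root a of sigma1(a) = log x."""
    target = math.log(x)
    lo, hi = 1e-9, 1.0
    while sigma1(hi, primes) > target:  # enlarge the bracket if needed
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if sigma1(mid, primes) > target:
            lo = mid  # trial value too small: the sum lies above log x
        else:
            hi = mid  # trial value too large: the sum lies below log x
    return 0.5 * (lo + hi)


primes = primes_up_to(100)
alpha = solve_alpha(10 ** 6, primes)
print(alpha, sigma1(alpha, primes), math.log(10 ** 6))
```

For large $y$ the exact sum is out of reach, which is where the bracketing by $l^{-}$ and $l^{+}$ becomes necessary: one then only certifies an interval containing $\alpha$.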

The partial summation used to estimate (4.1) and similar sums may be summarized in the following result.

Lemma 4.1.

Suppose $f(t)$ is positive and $f^{\prime}(t)$ is negative on $[w_{0},w_{1}]$ . Suppose too that $t-2\sqrt{t}<\vartheta(t)\leq t$ on $[w_{0},w_{1}]$ . Then

[TABLE]

Because of Proposition 2.1, the condition on $\vartheta$ holds if $[w_{0},w_{1}]\subset[1427,10^{19}]$ . For intervals beyond $10^{19}$ , it is easy to fashion an analogue of Lemma 4.1 using the other estimates of Proposition 2.1.

4.2. Computing $\sigma_{0}=\log\zeta(\alpha,y)$ and the other $\sigma_{j}$ ’s

Once a choice for $\alpha$ is computed it is straightforward to compute $\sigma_{0}$ and the other $\sigma_{j}$ ’s.

We have

[TABLE]

We may compute this sum up to some moderate $w_{0}$ as with the $\alpha$ computation. For the range $w_{0}<p\leq y$ we may approximate the summand by $p^{-\alpha}$ and sum this over $(w_{0},y]$ using partial summation (Lemma 4.1) and Proposition 2.1, say a lower bound is $l_{0}^{-}$ and an upper bound is $l_{0}^{+}$ . Then

[TABLE]

The other $\sigma_{j}$ ’s are computed in a similar manner.
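For a short list of primes these quantities can be evaluated directly, which is useful as a check on any tail approximation. The Python sketch below is an illustration under stated assumptions: the formula used for $\sigma_{0}$ is the logarithm of the Euler product, the formula for $\sigma_{2}$ is obtained by differentiating $\phi_{1}(s,y)=-\sum_{p\leq y}\log p/(p^{s}-1)$ once, and the value of $\alpha$ passed in is assumed to have been computed separately. All function names are hypothetical.

```python
import math


def sigma0(alpha: float, primes: list[int]) -> float:
    """log zeta(alpha, y) = -sum_p log(1 - p^(-alpha))."""
    return -sum(math.log1p(-p ** (-alpha)) for p in primes)


def sigma1(alpha: float, primes: list[int]) -> float:
    """|phi_1(alpha, y)| = sum_p log p / (p^alpha - 1)."""
    return sum(math.log(p) / (p ** alpha - 1.0) for p in primes)


def sigma2(alpha: float, primes: list[int]) -> float:
    """phi_2(alpha, y) = sum_p p^alpha (log p)^2 / (p^alpha - 1)^2."""
    total = 0.0
    for p in primes:
        pa = p ** alpha
        total += pa * math.log(p) ** 2 / (pa - 1.0) ** 2
    return total


def ht_main_term(x: float, alpha: float, primes: list[int]) -> float:
    """x^alpha * zeta(alpha, y) / (alpha * sqrt(2 pi sigma_2)) for a given alpha."""
    log_numerator = alpha * math.log(x) + sigma0(alpha, primes)
    return math.exp(log_numerator) / (alpha * math.sqrt(2.0 * math.pi * sigma2(alpha, primes)))


small_primes = [2, 3, 5]
print(math.exp(sigma0(1.0, small_primes)))  # Euler product 2 * (3/2) * (5/4)
print(sigma2(0.5, small_primes))
```

A finite-difference check, comparing $\sigma_{2}$ with the slope of $\sigma_{1}$ in $\alpha$, is a cheap way to catch sign or indexing mistakes before moving to the higher $\sigma_{j}$.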

4.3. Data

We record our calculations of $\alpha$ and the numbers $\sigma_{j}$ for two examples. Note that we obtain bounds for $\zeta$ via $\sigma_{0}=\log\zeta$ .

Note that $\sigma_{1}^{*}$ is an upper bound for $|\sigma_{1}-\log x|$ , and $\sigma_{5}^{+}$ is an upper bound for $\sigma_{5}$ .

The functions $\alpha(x,y)$ and $\sigma_{j}(x,y)$ are of interest in their own right. A simple observation from their definitions allows for more general bounds on $\alpha$ and $\sigma_{j}$ using the data in Figure 2, as described in the following remark.

Remark 4.2.

For pairs $x,y$ and $x^{\prime},y^{\prime}$ , if $x\geq x^{\prime}$ and $y\leq y^{\prime}$ then $\alpha(x,y)\leq\alpha(x^{\prime},y^{\prime})$ . Similarly, if $\alpha(x,y)\geq\alpha(x^{\prime},y^{\prime})$ and $y\leq y^{\prime}$ then $\sigma_{j}(x,y)\leq\sigma_{j}(x^{\prime},y^{\prime})$ .

4.4. A word on numerical integration

The numerical integration needed to estimate $J_{1},J_{2}$ is difficult, especially when we choose a large value of $L$ , like $L=10^{6}$ . We performed these integrals independently on both Mathematica and Sage platforms. It helps to segment the range of integration, but even so, the software can report an error bound in addition to the main estimate. In such cases we have always added on this error bound and then rounded up, since we seek upper bounds for these integrals. In a case where one wants to be assured of a rigorous estimate, there are several options, each carrying some costs. One can use a Simpson or midpoint quadrature with a mesh say of $0.1$ together with a careful estimation of the higher derivatives needed to estimate the error. An alternative is to do a Riemann sum with mesh $0.1$ , where on each interval and for each separate cosine term appearing, the maximum contribution is calculated. If this is done with $T=4\cdot 10^{5}$ and $L=10^{6}$ , there would be magnitude $10^{11}$ of these calculations. The extreme value of the cosine contribution would either be at an endpoint of an interval or $-1$ if the argument straddles a number that is $\pi\bmod 2\pi$ . We have done a mild form of this method in our estimation of the integrals $J_{0}^{\pm}$ .
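The interval rule for the cosine terms is easy to state in code. The following Python sketch is illustrative, with a hypothetical function name: it returns the minimum of $\cos$ over an interval $[a,b]$, which is $-1$ when the interval contains a point congruent to $\pi$ modulo $2\pi$ and is otherwise attained at an endpoint. It ignores floating-point rounding, which a fully rigorous implementation would have to control.

```python
import math


def cos_min_on_interval(a: float, b: float) -> float:
    """Minimum of cos over [a, b], assuming a <= b."""
    # Smallest integer k with pi + 2*pi*k >= a.
    k = math.ceil((a - math.pi) / (2.0 * math.pi))
    if math.pi + 2.0 * math.pi * k <= b:
        return -1.0  # the interval straddles an odd multiple of pi
    # Otherwise cos has no interior local minimum on [a, b].
    return min(math.cos(a), math.cos(b))


# One cosine term cos(t * log p) on a mesh cell [t0, t0 + 0.1], here with p = 7.
t0, mesh, p = 1.6, 0.1, 7
lo_arg, hi_arg = t0 * math.log(p), (t0 + mesh) * math.log(p)
print(cos_min_on_interval(lo_arg, hi_arg))
```

Applying such a bound to every cosine term on every mesh cell is what leads to the count of roughly $10^{11}$ evaluations mentioned above.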

4.5. Example estimates

We list some example values of $x,y$ and the corresponding estimates in the figure below.

5. Appendix

We prove the following theorem.

Theorem 5.1 (Granville and Soundararajan).

If $3\leq y\leq x$ and $1/\log y\leq\sigma\leq 1$ , then

[TABLE]

Proof.

By the identity $\log n=\sum_{d|n}\Lambda(d)$ , we have

[TABLE]

Thus,

[TABLE]

Using the estimates in [18] we see that the maximum of $(1+\pi(t))/(t/\log t)$ occurs at $t=7$ , so that

[TABLE]

for all $t>1$ . The above estimate then gives

[TABLE]

We now note that if $1/\log y\leq\sigma\leq 1$ , then

[TABLE]

Indeed, in the first case, since $t^{1-\sigma}$ is non-decreasing in $t$ , we have $(x/n)^{1-\sigma}\leq y^{1-\sigma}$ . And in the second case, since $t^{-\sigma}\log t$ is decreasing in $t$ for $t\geq y$ , we have $(x/n)^{-\sigma}\log(x/n)\leq y^{-\sigma}\log y$ .

We thus have

[TABLE]

This completes the proof. ∎

Acknowledgments

We warmly thank Jan Büthe, Anne Gelb, Habiba Kadiri, Dave Platt, Brad Rodgers, Jon Sorenson, Tim Trudgian, and John Voight for their interest and help. We are also very appreciative of Andrew Granville and Kannan Soundararajan for allowing us to include their elementary upper bound prior to the publication of their book. The first author was partially supported by a Byrne Scholarship at Dartmouth. The second author was partially supported by NSF grant number DMS-1440140 while in residence at the Mathematical Sciences Research Institute in Berkeley.

Bibliography


[1] T. M. Apostol, An introduction to analytic number theory, Springer-Verlag, New York–Heidelberg, 1976.
[2] D. J. Bernstein, Arbitrarily tight bounds on the distribution of smooth numbers, in M. Bennett, et al., eds., Proceedings of the Millennial Conference on Number Theory, volume 1, pages 49–66. A. K. Peters, 2002.
[3] N. G. de Bruijn, On the number of positive integers $\leq x$ and free of prime factors $>y$, Nederl. Akad. Wetensch. Proc. Ser. A 54 (1951), 50–60.
[4] N. G. de Bruijn, On the number of positive integers $\leq x$ and free of prime factors $>y$. II, Nederl. Akad. Wetensch. Proc. Ser. A 69 = Indag. Math. 28 (1966), 239–247.
[5] J. Büthe, Estimating $\pi(x)$ and related functions under partial RH assumptions, Math. Comp. 85 (2016), 2483–2498.
[6] J. Büthe, An analytic method for bounding $\psi(x)$, Math. Comp., to appear, https://doi.org/10.1090/mcom/3264. Also see arxiv.org 1511.02032.
[7] K. Dickman, On the frequency of numbers containing prime factors of a certain relative magnitude, Ark. Mat. Astr. Fys. 22 (1930), 1–14.
[8] L. Faber and H. Kadiri, New bounds for $\psi(x)$, Math. Comp. 84 (2015), 1339–1357.
