MAJOR ARCS FOR GOLDBACH'S PROBLEM
arXiv:1305.2897v4 [math.NT] 14 Apr 2014
H. A. HELFGOTT
Abstract. The ternary Goldbach conjecture states that every odd number $n \geq 7$ is the sum of three primes. The estimation of the Fourier series $\sum_{p \leq x} e(\alpha p)$ and related sums has been central to the study of the problem since Hardy and Littlewood (1923).
Here we show how to estimate such Fourier series for $\alpha$ in the so-called major arcs, i.e., for $\alpha$ close to a rational of small denominator. This is part of the author's proof of the ternary Goldbach conjecture.
In contrast to most previous work on the subject, we will rely on a finite verification of the Generalized Riemann Hypothesis up to a bounded conductor and bounded height, rather than on zero-free regions. We apply a rigorous verification due to D. Platt; the results we obtain are both rigorous and unconditional.
The main point of the paper will be the development of estimates on parabolic cylinder functions that make it possible to use smoothing functions based on the Gaussian. The generality of our explicit formulas will allow us to work with a wide variety of such functions.
Contents
1. Introduction
1.1. Results
1.2. Main ideas
1.3. Acknowledgments
2. Preliminaries
2.1. Notation
2.2. Dirichlet characters and L functions
2.3. Mellin transforms
3. The Mellin transform of the twisted Gaussian
3.1. How to choose a smoothing function?
3.2. The Mellin transform of the twisted Gaussian
3.3. General approach and situation
3.4. Setup
3.5. The saddle point
3.6. The contour
3.7. The integral over the main contour segment $C_2$
3.8. The integral over the rest of the contour
3.9. Conclusion
4. Explicit formulas
4.1. A general explicit formula
4.2. Sums and decay for the Gaussian: $\eta_\heartsuit(t) = e^{-t^2/2}$
4.3. The case of $\eta(t) = t^2 e^{-t^2/2}$ and $\eta_*(t)$
4.4. The case of $\eta_+(t)$
4.5. A sum for $\eta_+(t)^2$
4.6. A verification of zeros and its consequences
Appendix A. Extrema via bisection and truncated series
Appendix B. Norms of smoothing functions
B.1. The decay of $Mh(i\tau)$
B.2. The difference $\eta_+ - \eta_\circ$ in $\ell_2$ norm
B.3. Norms involving $\eta_+$
B.5. The $\ell_\infty$-norm of $\eta_+'$
References
1. Introduction
The ternary Goldbach conjecture (or three-prime problem) states that every odd number $n$ greater than 5 can be written as the sum of three primes. Hardy and Littlewood (1923) were the first to treat the problem by means of the circle method, i.e., Fourier analysis over $\mathbb{Z}$. (Fourier transforms of functions on $\mathbb{Z}$ live on the circle $\mathbb{R}/\mathbb{Z}$.)
I. M. Vinogradov [Vin37] showed in 1937 that the ternary Goldbach conjecture is true for all $n$ above a large constant $C$. The main innovation in his work consisted in the estimation of sums of the form
\[ \sum_{n \leq N} \Lambda(n) e(\alpha n) \tag{1.1} \]
for $\alpha$ outside the so-called major arcs, these being a union of short intervals in $\mathbb{R}/\mathbb{Z}$ around the rationals with small denominator. (Here $\Lambda(n)$ is the von Mangoldt function, defined as $\Lambda(n) = \log p$ for $n$ a power of a prime $p$ and $\Lambda(n) = 0$ for $n$ having at least two distinct prime factors, whereas $e(t) = e^{2\pi i t}$.)
The estimation of such sums for $\alpha$ in the major arcs is also important, and goes back to Hardy and Littlewood [HL23]. In some ways, their work is rather modern; in particular, it studies a version of (1.1) with smooth truncation:
\[ S_\eta(\alpha, x) = \sum_n \Lambda(n) e(\alpha n) \eta(n/x), \]
where $\eta : \mathbb{R}^+ \to \mathbb{C}$ is a smooth function; in [HL23], $\eta(t) = e^{-t}$.
We will show how to estimate sums such as $S_\eta(\alpha, x)$ for $\alpha$ in the major arcs. We will see how we can obtain good estimates by using smooth functions $\eta$ based on the Gaussian $e^{-t^2/2}$. This will involve proving new, fully explicit bounds for the Mellin transform¹ of the twisted Gaussian, or, what is the same, parabolic cylinder functions in certain ranges. It will also require explicit formulae that are general and strong enough, even for moderate values of $x$.
Any estimate on $S_\eta(\alpha, x)$ for $\alpha$ in the major arcs relies on the properties of $L$-functions $L(s, \chi) = \sum_n \chi(n) n^{-s}$, where $\chi : (\mathbb{Z}/q\mathbb{Z})^* \to \mathbb{C}$ is a multiplicative character. In particular, what is key is the location of the zeroes of $L(s, \chi)$ in the critical strip $0 \leq \Re(s) \leq 1$ (a region in which $L(s, \chi)$ can be defined by analytic continuation). In contrast to most previous work, we will not use zero-free regions, which are too narrow for our purposes. Rather, we use a verification of the Generalized Riemann Hypothesis up to bounded height for all conductors $q \leq 300000$ (due to D. Platt [Plab]).
¹ See §2.3 for definitions.
The bounds we will obtain have shown themselves to be strong enough to prove the ternary Goldbach conjecture. See [Helb]. A key feature of the present work is that it allows one to mimic a wide variety of smoothing functions by means of estimates on the Mellin transform of a single smoothing function: here, the Gaussian $e^{-t^2/2}$.
1.1. Results. Write $\eta_\heartsuit(t) = e^{-t^2/2}$. Let us first give a bound for exponential sums on the primes using $\eta_\heartsuit$ as the smooth weight.
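Theorem 1.1. Let $x$ be a real number $\geq 10^8$. Let $\chi$ be a primitive character mod $q$, $1 \leq q \leq r$, where $r = 300000$.
Then, for any $\delta \in \mathbb{R}$ with $|\delta| \leq 4r/q$,
\[ \sum_{n=1}^\infty \Lambda(n)\chi(n)\, e\!\left(\frac{\delta}{x} n\right) e^{-\frac{(n/x)^2}{2}} = I_{q=1} \cdot \widehat{\eta_\heartsuit}(-\delta) \cdot x + E \cdot x, \]
where $I_{q=1} = 1$ if $q = 1$, $I_{q=1} = 0$ if $q \neq 1$, and
\[ |E| \leq 5.281 \cdot 10^{-22} + \frac{1}{\sqrt{x}} \left( \frac{650400}{\sqrt{q}} + 112 \right). \]
We normalize the Fourier transform $\widehat{f}$ as follows: $\widehat{f}(t) = \int_{-\infty}^{\infty} e(-xt) f(x)\,dx$. Of course, $\widehat{\eta_\heartsuit}(-\delta)$ is just $\sqrt{2\pi}\, e^{-2\pi^2\delta^2}$.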
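As a sanity check on the normalization just stated, the following minimal sketch compares a trapezoid-rule approximation of $\int_{-\infty}^{\infty} e(\delta x) e^{-x^2/2}\,dx$ with the closed form $\sqrt{2\pi}\,e^{-2\pi^2\delta^2}$, and evaluates the bound on $|E|$ from Theorem 1.1. The function names, the truncation of the integral to $[-12, 12]$ and the step count are illustrative assumptions, not part of the paper; the integral is taken over all of $\mathbb{R}$, as in the normalization above.

```python
import math

def gaussian_transform_numeric(delta, half_width=12.0, steps=24000):
    """Trapezoid-rule value of the integral of e(delta*x) * exp(-x^2/2) over [-half_width, half_width].

    The sine part cancels by symmetry, so only the cosine part is summed.
    (half_width and steps are illustrative choices, not values from the paper.)
    """
    h = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps + 1):
        x = -half_width + k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * math.exp(-0.5 * x * x) * math.cos(2.0 * math.pi * delta * x)
    return total * h

def gaussian_transform_closed_form(delta):
    """Closed form sqrt(2*pi) * exp(-2*pi^2*delta^2) quoted after Theorem 1.1."""
    return math.sqrt(2.0 * math.pi) * math.exp(-2.0 * math.pi ** 2 * delta ** 2)

def theorem_1_1_error_bound(x, q):
    """Right-hand side of the bound on |E| in Theorem 1.1 (x >= 1e8, 1 <= q <= 300000)."""
    return 5.281e-22 + (650400.0 / math.sqrt(q) + 112.0) / math.sqrt(x)

if __name__ == "__main__":
    for delta in (0.0, 0.25, 0.5):
        print(delta, gaussian_transform_numeric(delta), gaussian_transform_closed_form(delta))
    print(theorem_1_1_error_bound(1e20, 1))
```

The second term of the bound dominates unless $x$ is very large, which is why the theorem is only of interest for $x$ well beyond the stated minimum.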
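As it turns out, smooth weights based on the Gaussian are often better in applications than the Gaussian $\eta_\heartsuit$ itself. Let us give a bound based on $\eta(t) = t^2 \eta_\heartsuit(t)$.
Theorem 1.2. Let $\eta(t) = t^2 e^{-t^2/2}$. Let $x$ be a real number $\geq 10^8$. Let $\chi$ be a primitive character mod $q$, $1 \leq q \leq r$, where $r = 300000$.
Then, for any $\delta \in \mathbb{R}$ with $|\delta| \leq 4r/q$,
\[ \sum_{n=1}^\infty \Lambda(n)\chi(n)\, e\!\left(\frac{\delta}{x} n\right) \eta(n/x) = I_{q=1} \cdot \widehat{\eta}(-\delta) \cdot x + E \cdot x, \]
where $I_{q=1} = 1$ if $q = 1$, $I_{q=1} = 0$ if $q \neq 1$, and
\[ |E| \leq \frac{4.269 \cdot 10^{-14}}{\sqrt{q}} + \frac{1}{\sqrt{x}} \left( \frac{276600}{\sqrt{q}} + 56 \right). \]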
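The advantage of $\eta(t) = t^2 \eta_\heartsuit(t)$ over $\eta_\heartsuit$ is that it vanishes at the origin (to second order); as we shall see, this makes it easier to estimate exponential sums with the smoothing $\eta *_M g$, where $*_M$ is a Mellin convolution and $g$ is nearly arbitrary. Here is a good example that is used, crucially, in [Helc].
Corollary 1.3. Let $\eta(t) = t^2 e^{-t^2/2} *_M \eta_2(t)$, where $\eta_2 = \eta_1 *_M \eta_1$ and $\eta_1 = 2 \cdot I_{[1/2,1]}$. Let $x$ be a real number $\geq 10^8$. Let $\chi$ be a primitive character mod $q$, $1 \leq q \leq r$, where $r = 300000$.
Then, for any $\delta \in \mathbb{R}$ with $|\delta| \leq 4r/q$,
\[ \sum_{n=1}^\infty \Lambda(n)\chi(n)\, e\!\left(\frac{\delta}{x} n\right) \eta(n/x) = I_{q=1} \cdot \widehat{\eta}(-\delta) \cdot x + E \cdot x, \]
where $I_{q=1} = 1$ if $q = 1$, $I_{q=1} = 0$ if $q \neq 1$, and
\[ |E| \leq \frac{4.269 \cdot 10^{-14}}{\sqrt{q}} + \frac{1}{\sqrt{x}} \left( \frac{380600}{\sqrt{q}} + 76 \right). \]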
Let us now look at a different kind of modification of the Gaussian smoothing. Say we would like a weight of a specific shape; for example, for the purposes of [Helc], we would like an approximation to the function
\[ \eta_\circ : t \mapsto \begin{cases} t^3 (2-t)^3 e^{-(t-1)^2/2} & \text{for } t \in [0, 2], \\ 0 & \text{otherwise.} \end{cases} \tag{1.2} \]
At the same time, what we have is an estimate for the Mellin transform of the Gaussian $e^{-t^2/2}$, centered at $t = 0$.
The route taken here is to work with an approximation $\eta_+$ to $\eta_\circ$. We let
\[ \eta_+(t) = h_H(t) \cdot t e^{-t^2/2}, \tag{1.3} \]
where $h_H$ is a band-limited approximation to
\[ h(t) = \begin{cases} t^2 (2-t)^3 e^{t - 1/2} & \text{if } t \in [0, 2], \\ 0 & \text{otherwise.} \end{cases} \tag{1.4} \]
By band-limited we mean that the restriction of the Mellin transform of $h_H$ to the imaginary axis is of compact support. (We could, alternatively, let $h_H$ be a function whose Fourier transform is of compact support; this would be technically easier in some ways, but it would also lead to using GRH verifications less efficiently.)
To be precise: we define
\[ F_H(y) = \frac{\sin(H \log y)}{\pi \log y}, \qquad h_H(t) = (h *_M F_H)(t) = \int_0^\infty h(t y^{-1}) F_H(y) \frac{dy}{y}, \tag{1.5} \]
where $H$ is a positive constant. It is easy to check that $M F_H(i\tau) = 1$ for $-H < \tau < H$ and $M F_H(i\tau) = 0$ for $\tau > H$ or $\tau < -H$ (unsurprisingly, since $F_H$ is a Dirichlet kernel under a change of variables). Since, in general, the Mellin transform of a multiplicative convolution $f *_M g$ equals $Mf \cdot Mg$, we see that the Mellin transform of $h_H$, on the imaginary axis, equals the truncation of the Mellin transform of $h$ to $[-iH, iH]$. Thus, $h_H$ is a band-limited approximation to $h$, as we desired.
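The distinction between the odd and the even case in the statement that follows simply reflects the two different points up to which computations were carried out in [Plab]; these computations, in turn, were tailored to the needs of [Hela] (as was the shape of $\eta_+$ itself).
Theorem 1.4. Let $\eta(t) = \eta_+(t) = h_H(t)\, t e^{-t^2/2}$, where $h_H$ is as in (1.5) and $H = 200$. Let $x$ be a real number $\geq 10^{12}$. Let $\chi$ be a primitive character mod $q$, where $1 \leq q \leq 150000$ if $q$ is odd, and $1 \leq q \leq 300000$ if $q$ is even.
Then, for any $\delta \in \mathbb{R}$ with $|\delta| \leq 600000 \cdot \gcd(q, 2)/q$,
\[ \sum_{n=1}^\infty \Lambda(n)\chi(n)\, e\!\left(\frac{\delta}{x} n\right) \eta(n/x) = I_{q=1} \cdot \widehat{\eta}(-\delta) \cdot x + E \cdot x, \]
where $I_{q=1} = 1$ if $q = 1$, $I_{q=1} = 0$ if $q \neq 1$, and
\[ |E| \leq \frac{6.18 \cdot 10^{-12}}{\sqrt{q}} + \frac{1.14 \cdot 10^{-10}}{q} + \frac{1}{\sqrt{x}} \left( \frac{499100}{\sqrt{q}} + 52 \right). \]
If $q = 1$, we have the sharper bound
\[ |E| \leq 3.34 \cdot 10^{-11} + \frac{251100}{\sqrt{x}}. \]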
This is a paradigmatic example, in that, following the proof given in §4.4, we can bound exponential sums with weights of the form $h_H(t) e^{-t^2/2}$, where $h_H$ is a band-limited approximation to just about any continuous function of our choosing.
Lastly, we will need an explicit estimate of the $\ell_2$ norm corresponding to the sum in Thm. 1.4, for the trivial character.
Proposition 1.5. Let $\eta(t) = \eta_+(t) = h_H(t)\, t e^{-t^2/2}$, where $h_H$ is as in (1.5) and $H = 200$. Let $x$ be a real number $\geq 10^{12}$.
Then
\[ \begin{aligned} \sum_{n=1}^\infty \Lambda(n) (\log n)\, \eta^2(n/x) &= x \cdot \int_0^\infty \eta_+^2(t) \log xt \, dt + E_1 \cdot x \log x \\ &= 0.640206\, x \log x - 0.021095\, x + E_2 \cdot x \log x, \end{aligned} \]
where
\[ |E_1| \leq 1.536 \cdot 10^{-15} + \frac{310.84}{\sqrt{x}}, \qquad |E_2| \leq 2 \cdot 10^{-6} + \frac{310.84}{\sqrt{x}}. \]
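1.2. Main ideas. We will be working with smoothed sums
\[ S_\eta(\alpha, x) = \sum_{n=1}^\infty \Lambda(n) e(\alpha n) \eta(n/x). \tag{1.6} \]
Our integral will actually be of the form
\[ \int_{\mathfrak{M}} S_{\eta_+}(\alpha, x)^2 S_{\eta_*}(\alpha, x) e(-N\alpha)\, d\alpha, \tag{1.7} \]
where $\eta_+$ and $\eta_*$ are two different smoothing functions to be discussed soon.
Estimating the sums (1.6) on $\mathfrak{M}$ reduces to estimating the sums
\[ S_{\eta,\chi}(\delta/x, x) = \sum_{n=1}^\infty \Lambda(n)\chi(n) e(\delta n/x) \eta(n/x) \tag{1.8} \]
for $\chi$ varying among all Dirichlet characters modulo $q \leq r_0$ and for $|\delta| \leq c r_0/q$, i.e., $|\delta|$ small. Sums such as (1.8) are estimated using Dirichlet $L$-functions $L(s, \chi)$ (see §2.2). An explicit formula gives an expression
\[ S_{\eta,\chi}(\delta/x, x) = I_{q=1} \widehat{\eta}(-\delta) x - \sum_\rho F_\delta(\rho) x^\rho + \text{small error}, \tag{1.9} \]
where $I_{q=1} = 1$ if $q = 1$ and $I_{q=1} = 0$ otherwise. Here $\rho$ runs over the complex numbers $\rho$ with $L(\rho, \chi) = 0$ and $0 < \Re(\rho) < 1$ (non-trivial zeros). The function $F_\delta$ is the Mellin transform of $e(\delta t)\eta(t)$ (see §2.3).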
The questions are then: where are the non-trivial zeros $\rho$ of $L(s, \chi)$? How fast does $F_\delta(\rho)$ decay as $\Im(\rho) \to \pm\infty$?
Write $\sigma = \Re(s)$, $\tau = \Im(s)$. The belief is, of course, that $\sigma = 1/2$ for every non-trivial zero (Generalized Riemann Hypothesis), but this is far from proven. Most work to date has used zero-free regions of the form $\sigma \leq 1 - 1/(C \log q|\tau|)$, $C$ a constant. This is a classical zero-free region, going back, qualitatively, to de la Vallée-Poussin (1899). The best values of $C$ known are due to McCurley [McC84] and Kadiri [Kad05].
These regions seem too narrow to yield a proof of the three-primes theorem. What we will use instead is a finite verification of GRH up to $T_q$, i.e., a computation showing that, for every Dirichlet character of conductor $q \leq r_0$ ($r_0$ a constant, as above), every non-trivial zero $\rho = \sigma + i\tau$ with $|\tau| \leq T_q$ satisfies $\Re(\rho) = 1/2$. Such verifications go back to Riemann; modern computer-based methods are descended in part from a paper by Turing [Tur53]. (See the historical article [Boo06].) In his thesis [Pla11], D. Platt gave a rigorous verification for $r_0 = 10^5$, $T_q = 10^8/q$. In coordination with the present work, he has extended this to
• all odd $q \leq 3 \cdot 10^5$, with $T_q = 10^8/q$,
• all even $q \leq 4 \cdot 10^5$, with $T_q = \max(10^8/q,\, 200 + 7.5 \cdot 10^7/q)$.
This was a major computational effort, involving, in particular, a fast implementation of interval arithmetic (used for the sake of rigor).
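The verification ranges quoted above can be encoded directly. The following is only a sketch of that lookup: the function name `verified_height` and the choice to raise an error outside the quoted ranges are assumptions made for illustration, not anything taken from [Plab].

```python
def verified_height(q):
    """Height T_q up to which GRH is stated above to be verified for conductor q.

    Odd q <= 3*10^5:  T_q = 10^8 / q.
    Even q <= 4*10^5: T_q = max(10^8 / q, 200 + 7.5*10^7 / q).
    Raising ValueError outside these ranges is an illustrative choice.
    """
    if q < 1:
        raise ValueError("conductor must be a positive integer")
    if q % 2 == 1:
        if q > 3 * 10**5:
            raise ValueError("odd conductor outside the quoted range")
        return 1e8 / q
    if q > 4 * 10**5:
        raise ValueError("even conductor outside the quoted range")
    return max(1e8 / q, 200.0 + 7.5e7 / q)

if __name__ == "__main__":
    for q in (1, 3, 4, 200000, 400000):
        print(q, verified_height(q))
```

For even $q$ the second term in the maximum takes over once $2.5 \cdot 10^7/q < 200$, i.e., for $q > 125000$.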
What remains to discuss, then, is how to choose $\eta$ in such a way that $F_\delta(\rho)$ decreases fast enough as $|\tau|$ increases, so that (1.9) gives a good estimate. We cannot hope for $F_\delta(\rho)$ to start decreasing consistently before $|\tau|$ is at least as large as a multiple of $2\pi|\delta|$. Since $\delta$ varies within $(-c r_0/q, c r_0/q)$, this explains why $T_q$ is taken inversely proportional to $q$ in the above. As we will work with $r_0 \geq 150000$, we also see that we have little margin for maneuver: we want $F_\delta(\rho)$ to be extremely small already for, say, $|\tau| \geq 80|\delta|$. We also have a Scylla-and-Charybdis situation, courtesy of the uncertainty principle: roughly speaking, $F_\delta(\rho)$ cannot decrease faster than exponentially on $|\tau|/|\delta|$ both for $|\delta| \leq 1$ and for $\delta$ large.
The most delicate case is that of $\delta$ large, since then $|\tau|/|\delta|$ is small. It turns out we can manage to get decay that is much faster than exponential for $\delta$ large, while no slower than exponential for $\delta$ small. This we will achieve by working with smoothing functions based on the (one-sided) Gaussian $\eta_\heartsuit(t) = e^{-t^2/2}$.
The Mellin transform of the twisted Gaussian $e(\delta t) e^{-t^2/2}$ is a parabolic cylinder function $U(a, z)$ with $z$ purely imaginary. Since fully explicit estimates for $U(a, z)$, $z$ imaginary, have not been worked out in the literature, we will have to derive them ourselves.
Once we have fully explicit estimates for the Mellin transform of the twisted Gaussian, we are able to use essentially any smoothing function based on the Gaussian $\eta_\heartsuit(t) = e^{-t^2/2}$. As we already saw, we can and will consider smoothing functions obtained by convolving the twisted Gaussian with another function and also functions obtained by multiplying the twisted Gaussian with another function. All we need to do is use an explicit formula of the right kind, that is, a formula that does not assume too much about the smoothing function or the region of holomorphy of its Mellin transform, but still gives very good error terms, with simple expressions.
All results here will be based on a single, general explicit formula (Lem. 4.1) valid for all our purposes. The contribution of the zeros in the critical strip can be handled in a unified way (Lemmas 4.3 and 4.4). All that has to be done for each smoothing function is to bound a simple integral (in (4.24)). We then apply a finite verification of GRH and are done.
1.3. Acknowledgments. The author is very thankful to D. Platt, who, working in close coordination with him, provided GRH verifications in the necessary ranges, and also helped him with the usage of interval arithmetic. Warm thanks are due to A. Córdoba and J. Cilleruelo, for discussions on the method of stationary phase, and to V. Blomer and N. Temme, for further help with parabolic cylinder functions. Gratitude is also felt towards A. Booker, B. Green, R. Heath-Brown, H. Kadiri, O. Ramaré, T. Tao and M. Watkins, for discussions on Goldbach's problem and related issues. Additional references were graciously provided by R. Bryant, S. Huntsman and I. Rezvyakova.
Travel and other expenses were funded in part by the Adams Prize and the Philip Leverhulme Prize. The author's work on the problem started at the Université de Montréal (CRM) in 2006; he is grateful to both the Université de Montréal and the École Normale Supérieure for providing pleasant working environments.
The present work would most likely not have been possible without free and publicly available software: PARI, Maxima, Gnuplot, VNODE-LP, PROFIL/BIAS, SAGE, and, of course, LaTeX, Emacs, the gcc compiler and GNU/Linux in general. Some exploratory work was done in SAGE and Mathematica. Rigorous calculations used either D. Platt's interval-arithmetic package (based in part on Crlibm) or the PROFIL/BIAS interval arithmetic package underlying VNODE-LP.
The calculations contained in this paper used a nearly trivial amount of resources; they were all carried out on the author's desktop computers at home and work. However, D. Platt's computations [Plab] used a significant amount of resources, kindly donated to D. Platt and the author by several institutions. This crucial help was provided by MesoPSL (affiliated with the Observatoire de Paris and Paris Sciences et Lettres), Université de Paris VI/VII (UPMC - DSI - Pôle Calcul), University of Warwick (thanks to Bill Hart), University of Bristol, France Grilles (French National Grid Infrastructure, DIRAC instance), Université de Lyon 1 and Université de Bordeaux 1. Both D. Platt and the author would like to thank the donating organizations, their technical staff, and all academics who helped to make these resources available to them.
2. Preliminaries
2.1. Notation. As is usual, we write $\mu$ for the Moebius function, $\Lambda$ for the von Mangoldt function. We let $\tau(n)$ be the number of divisors of an integer $n$ and $\omega(n)$ the number of prime divisors. For $p$ prime, $n$ a non-zero integer, we define $v_p(n)$ to be the largest non-negative integer $\alpha$ such that $p^\alpha | n$.
We write $(a, b)$ for the greatest common divisor of $a$ and $b$. If there is any risk of confusion with the pair $(a, b)$, we write $\gcd(a, b)$. Denote by $(a, b^\infty)$ the divisor $\prod_{p|b} p^{v_p(a)}$ of $a$. (Thus, $a/(a, b^\infty)$ is coprime to $b$, and is in fact the maximal divisor of $a$ with this property.)
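To make the notation $(a, b^\infty)$ concrete, here is a small sketch that computes $v_p(n)$ and $(a, b^\infty)$ for integers. The function names are illustrative; rather than factoring $b$, the second function repeatedly strips $\gcd(a, b)$ from $a$, which yields the same divisor $\prod_{p|b} p^{v_p(a)}$.

```python
from math import gcd

def p_adic_valuation(p, n):
    """Largest non-negative integer alpha such that p^alpha divides n (n non-zero, p >= 2)."""
    if n == 0:
        raise ValueError("v_p(0) is not defined here")
    alpha = 0
    while n % p == 0:
        n //= p
        alpha += 1
    return alpha

def gcd_with_infinite_power(a, b):
    """The divisor (a, b^infinity) of a: the product of p^{v_p(a)} over primes p dividing b."""
    if a == 0 or b == 0:
        raise ValueError("both arguments must be non-zero")
    a, b = abs(a), abs(b)
    result = 1
    g = gcd(a, b)
    while g > 1:
        # Move the common factor from a into the result until a is coprime to b.
        result *= g
        a //= g
        g = gcd(a, b)
    return result

if __name__ == "__main__":
    a, b = 360, 6            # 360 = 2^3 * 3^2 * 5
    d = gcd_with_infinite_power(a, b)
    print(d, a // d, gcd(a // d, b))
```

For $a = 360$ and $b = 6$ this gives $(a, b^\infty) = 72$, and the cofactor $5$ is coprime to $6$.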
As is customary, we write $e(x)$ for $e^{2\pi i x}$. We write $|f|_r$ for the $L_r$ norm of a function $f$.
We write $O^*(R)$ to mean a quantity at most $R$ in absolute value.
2.2. Dirichlet characters and L functions. A Dirichlet character $\chi : \mathbb{Z} \to \mathbb{C}$ of modulus $q$ is a character $\chi$ of $(\mathbb{Z}/q\mathbb{Z})^*$ lifted to $\mathbb{Z}$ with the convention that $\chi(n) = 0$ when $(n, q) \neq 1$. Again by convention, there is a Dirichlet character of modulus $q = 1$, namely, the trivial character $\chi_T : \mathbb{Z} \to \mathbb{C}$ defined by $\chi_T(n) = 1$ for every $n \in \mathbb{Z}$.
If $\chi$ is a character modulo $q$ and $\chi'$ is a character modulo $q' | q$ such that $\chi(n) = \chi'(n)$ for all $n$ coprime to $q$, we say that $\chi'$ induces $\chi$. A character is primitive if it is not induced by any character of smaller modulus. Given a character $\chi$, we write $\chi^*$ for the (uniquely defined) primitive character inducing $\chi$. If a character $\chi$ mod $q$ is induced by the trivial character $\chi_T$, we say that $\chi$ is principal and write $\chi_0$ for $\chi$ (provided the modulus $q$ is clear from the context). In other words, $\chi_0(n) = 1$ when $(n, q) = 1$ and $\chi_0(n) = 0$ when $(n, q) \neq 1$.
A Dirichlet $L$-function $L(s, \chi)$ ($\chi$ a Dirichlet character) is defined as the analytic continuation of $\sum_n \chi(n) n^{-s}$ to the entire complex plane; there is a pole at $s = 1$ if $\chi$ is principal.
A non-trivial zero of $L(s, \chi)$ is any $s \in \mathbb{C}$ such that $L(s, \chi) = 0$ and $0 < \Re(s) < 1$. (In particular, a zero at $s = 0$ is called trivial, even though its contribution can be a little tricky to work out. The same would go for the other zeros with $\Re(s) = 0$ occurring for $\chi$ non-primitive, though we will avoid this issue by working mainly with $\chi$ primitive.) The zeros that occur at (some) negative integers are called trivial zeros.
The critical line is the line $\Re(s) = 1/2$ in the complex plane. Thus, the generalized Riemann hypothesis for Dirichlet $L$-functions reads: for every Dirichlet character $\chi$, all non-trivial zeros of $L(s, \chi)$ lie on the critical line. Verifiable finite versions of the generalized Riemann hypothesis generally read: for every Dirichlet character $\chi$ of modulus $q \leq Q$, all non-trivial zeros of $L(s, \chi)$ with $|\Im(s)| \leq f(q)$ lie on the critical line (where $f : \mathbb{Z} \to \mathbb{R}^+$ is some given function).
2.3. Mellin transforms. The Mellin transform of a function $\phi : (0, \infty) \to \mathbb{C}$ is
\[ M\phi(s) := \int_0^\infty \phi(x) x^{s-1}\, dx. \tag{2.1} \]
If $\phi(x) x^{\sigma-1}$ is in $\ell_1$ with respect to $dx$ (i.e., $\int_0^\infty |\phi(x)| x^{\sigma-1}\, dx < \infty$), then the Mellin transform is defined on the line $\sigma + i\mathbb{R}$. Moreover, if $\phi(x) x^{\sigma-1}$ is in $\ell_1$ for $\sigma = \sigma_1$ and for $\sigma = \sigma_2$, where $\sigma_2 > \sigma_1$, then it is easy to see that it is also in $\ell_1$ for all $\sigma \in (\sigma_1, \sigma_2)$, and that, moreover, the Mellin transform is holomorphic on $\{s : \sigma_1 < \Re(s) < \sigma_2\}$. We then say that $\{s : \sigma_1 < \Re(s) < \sigma_2\}$ is a strip of holomorphy for the Mellin transform.
The Mellin transform becomes a Fourier transform (of $\phi(e^{-2\pi v}) e^{-2\pi v \sigma}$) by means of the change of variables $x = e^{-2\pi v}$. We thus obtain, for example, that the Mellin transform is an isometry, in the sense that
\[ \int_0^\infty |f(x)|^2 x^{2\sigma} \frac{dx}{x} = \frac{1}{2\pi} \int_{-\infty}^{\infty} |Mf(\sigma + it)|^2\, dt. \tag{2.2} \]
Recall that, in the case of the Fourier transform, for $|\widehat{f}|_2 = |f|_2$ to hold, it is enough that $f$ be in $\ell_1 \cap \ell_2$. This gives us that, for (2.2) to hold, it is enough that $f(x) x^{\sigma-1}$ be in $\ell_1$ and $f(x) x^{\sigma-1/2}$ be in $\ell_2$ (again, with respect to $dx$, in both cases).
We write $f *_M g$ for the multiplicative, or Mellin, convolution of $f$ and $g$:
\[ (f *_M g)(x) = \int_0^\infty f(w)\, g\!\left(\frac{x}{w}\right) \frac{dw}{w}. \tag{2.3} \]
In general,
\[ M(f *_M g) = Mf \cdot Mg \tag{2.4} \]
and
\[ M(f \cdot g)(s) = \frac{1}{2\pi i} \int_{\sigma - i\infty}^{\sigma + i\infty} Mf(z)\, Mg(s - z)\, dz \qquad \text{[GR00, §17.32]} \tag{2.5} \]
provided that $z$ and $s - z$ are within the strips on which $Mf$ and $Mg$ (respectively) are well-defined.
We also have several useful transformation rules, just as for the Fourier transform. For example,
\[ \begin{aligned} M(f'(t))(s) &= -(s-1) \cdot Mf(s-1), \\ M(t f'(t))(s) &= -s \cdot Mf(s), \\ M((\log t) f(t))(s) &= (Mf)'(s) \end{aligned} \tag{2.6} \]
(as in, e.g., [BBO10, Table 1.11]).

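Since (see, e.g., [BBO10, Table 11.3] or [GR00, §16.43])
\[ (M I_{[a,b]})(s) = \frac{b^s - a^s}{s}, \]
we see that
\[ M\eta_2(s) = \left( \frac{1 - 2^{-s}}{s} \right)^2, \qquad M\eta_4(s) = \left( \frac{1 - 2^{-s}}{s} \right)^4. \tag{2.7} \]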
Let $f_z = e^{-zt}$, where $\Re(z) > 0$. Then
\[ (Mf)(s) = \int_0^\infty e^{-zt} t^{s-1}\, dt = \frac{1}{z^s} \int_0^{z\infty} e^{-u} u^{s-1}\, du = \frac{1}{z^s} \int_0^\infty e^{-t} t^{s-1}\, dt = \frac{\Gamma(s)}{z^s}, \]
where the next-to-last step holds by contour integration, and the last step holds by the definition of the Gamma function $\Gamma(s)$.
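The identity $M f_z(s) = \Gamma(s)/z^s$ can be checked numerically in the simplest case of real $z > 0$ and real $s > 1$. The sketch below uses a plain trapezoid rule on a truncated interval; the function name, the truncation point and the step count are illustrative assumptions, and the restriction to real parameters is a simplification of the statement above.

```python
import math

def mellin_of_exponential(s, z, upper=60.0, steps=60000):
    """Trapezoid-rule value of the integral of exp(-z*t) * t^(s-1) over [0, upper].

    Assumes real s > 1 and real z > 0, so that the integrand vanishes at t = 0
    and is negligible beyond `upper` (both bounds are illustrative choices).
    """
    h = upper / steps
    total = 0.0
    for k in range(1, steps + 1):       # the k = 0 term is zero because s > 1
        t = k * h
        weight = 0.5 if k == steps else 1.0
        total += weight * math.exp(-z * t) * t ** (s - 1.0)
    return total * h

if __name__ == "__main__":
    s, z = 2.5, 1.5
    print(mellin_of_exponential(s, z), math.gamma(s) / z ** s)
```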
3. The Mellin transform of the twisted Gaussian
Our aim in this section is to give fully explicit, yet relatively simple bounds for the Mellin transform $F_\delta(\rho)$ of $e(\delta t)\eta_\heartsuit(t)$, where $\eta_\heartsuit(t) = e^{-t^2/2}$ and $\delta$ is arbitrary. The rapid decay that results will establish that the Gaussian $\eta_\heartsuit$ is a very good choice for a smoothing, particularly when the smoothing has to be twisted by an additive character $e(\delta t)$.
Gaussian smoothing has been used before in number theory; see, notably, Heath-Brown's well-known paper on the fourth power moment of the Riemann zeta function [HB79]. What is new here is that we will derive fully explicit bounds on the Mellin transform of the twisted Gaussian. This means that the Gaussian smoothing will be a real option in explicit work on exponential sums in number theory and elsewhere from now on.
(There has also been work using the Gaussian after a logarithmic change of variables; see, in particular, [Leh66]. In that case, the Mellin transform is simply a Gaussian (as in, e.g., [MV07, Ex. XII.2.9]). However, for $\delta$ non-zero, the Mellin transform of a twist $e(\delta t) e^{-(\log t)^2/2}$ decays very slowly, and thus would not be in general useful.)
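Theorem 3.1. Let $f_\delta(t) = e^{-t^2/2} e(\delta t)$, $\delta \in \mathbb{R}$. Let $F_\delta$ be the Mellin transform of $f_\delta$. Let $s = \sigma + i\tau$, $\sigma \geq 0$, $\tau \neq 0$. Let $\ell = -2\pi\delta$. Then, if $\mathrm{sgn}(\delta) \neq \mathrm{sgn}(\tau)$,
\[ |F_\delta(s)| \leq C_{0,\tau,\ell} \cdot e^{-E\left(\frac{|\tau|}{(\pi\delta)^2}\right) \cdot |\tau|} + C_{1,\tau} \cdot e^{-0.4798|\tau|} + C_{2,\tau,\ell} \cdot e^{-\min\left(\frac{1}{8}\left(\frac{\tau}{\pi\delta}\right)^2,\ \frac{25}{32}|\tau|\right)}, \tag{3.1} \]
where
\[ \begin{aligned} E(\rho) &= \frac{1}{2} \left( \arccos\frac{1}{\upsilon(\rho)} - \frac{2(\upsilon(\rho) - 1)}{\rho} \right), \\ C_{0,\tau,\ell} &= \min\left(2, \frac{3.3\sqrt{|\tau|}}{2\pi|\delta|}\right) \left(1 + \max(7.83^{1-\sigma}, 1.63^{\sigma-1})\right) \left( \frac{3/2}{\min\left(\frac{|\tau|}{2\pi|\delta|}, \sqrt{|\tau|}\right)} \right)^{1-\sigma}, \\ C_{1,\tau} &= \left(1 + \frac{1 + \sqrt{2}}{|\tau|}\right) |\tau|^{\sigma/2}, \qquad \upsilon(\rho) = \sqrt{\frac{1 + \sqrt{1 + \rho^2}}{2}}, \\ C_{2,\tau,\ell} &= P_\sigma\left( \min\left( \frac{|\tau|}{2\pi|\delta|}, \frac{5}{4}\sqrt{|\tau|} \right) \right), \end{aligned} \tag{3.2} \]
where $P_\sigma(x) = x^{\sigma-2}$ if $\sigma \in [0, 2]$, $P_\sigma(x) = x^{\sigma-2} + (\sigma - 2) x^{\sigma-4}$ if $\sigma \in (2, 4]$ and $P_\sigma(x) = x^{\sigma-2} + (\sigma - 2) x^{\sigma-4} + \ldots + (\sigma - 2k) x^{\sigma - 2(k+1)}$ if $\sigma \in (2k, 2(k+1)]$.
If $\mathrm{sgn}(\delta) = \mathrm{sgn}(\tau)$ (or $\delta = 0$) and $|\tau| \geq 2$,
\[ |F_\delta(s)| \leq C'_{\tau,\ell}\, e^{-\frac{\pi}{4}|\tau|}, \tag{3.3} \]
where
\[ C'_{\tau,\ell} \leq \frac{e^{\pi/2} |\tau|^{\sigma/2}}{2} \cdot \begin{cases} 1 + \dfrac{2\pi^{3/2}|\delta|}{\sqrt{|\tau|}} & \text{for } \sigma \in [0, 1], \\ \Gamma(\sigma/2) & \text{for } \sigma > 0 \text{ arbitrary.} \end{cases} \]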
As we shall see, the choice of $\eta(t) = e^{-t^2/2}$ can be easily motivated by the method of stationary phase, but the problem is actually solved by the saddle-point method. One of the challenges here is to keep all expressions explicit and practical. This turns out to require the use of validated numerics (Appendix A); in particular, the bisection method (implemented using interval arithmetic) gets combined with a closer examination at infinity and near extrema.
The expressions in Thm. 3.1 can be easily simplified further in applications with some mild constraints, especially if one is ready to make some sacrifices in the main term.
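Corollary 3.2. Let $f_\delta(t) = e^{-t^2/2} e(\delta t)$, $\delta \in \mathbb{R}$. Let $F_\delta$ be the Mellin transform of $f_\delta$. Let $s = \sigma + i\tau$, where $\sigma \in [0, 1]$ and $|\tau| \geq \max(100, 4\pi^2|\delta|)$. Then, for $0 \leq k \leq 2$,
\[ |F_\delta(s + k)| + |F_\delta(k + 1 - s)| \leq c_k \cdot \begin{cases} \left( \dfrac{|\tau|}{2\pi|\delta|} \right)^k e^{-0.1065 \left( \frac{\tau}{\pi\delta} \right)^2} & \text{if } |\tau| < \frac{3}{2}(\pi\delta)^2, \\ |\tau|^{k/2}\, e^{-0.1598|\tau|} & \text{if } |\tau| \geq \frac{3}{2}(\pi\delta)^2, \end{cases} \tag{3.4} \]
where $c_0 = 4.226$, $c_1 = 3.516$, $c_2 = 3.262$.
It is natural to look at $|F_\delta(s + k)| + |F_\delta(k + 1 - s)|$ with $s$ in the critical strip $\Re(s) \in [0, 1]$, since such expressions are key to the study of exponential sums with a smoothing function equal to or based on $t^k e^{-t^2/2}$.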
Let us end with a remark that may be relevant to applications outside number theory. By (3.8), Thm. 3.1 gives us bounds on the parabolic cylinder function $U(a, z)$ for $z$ purely imaginary and $|\Re(a)| \leq 1/2$. The bounds are useful when $|\Im(a)|$ is at least somewhat larger than $|\Im(z)|$ (i.e., when $|\tau|$ is large compared to $\ell$). While Thm. 3.1 is stated for $\sigma \geq 0$ (i.e., for $\Re(a) \geq -1/2$), extending the result to larger half-planes for $a$ is not too hard; integration by parts can be used to push $a$ to the right.
As we shall see in §3.2, the literature on parabolic cylinder functions is rich and varied. However, it stopped short of giving fully explicit expressions for $a$ general and $z$ imaginary. That precluded, for instance, the use of the Gaussian in explicit work on exponential sums in number theory. Such work will now be possible.
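3.1. How to choose a smoothing function? The method of stationary phase ([Olv74, §4.11], [Won01, §II.3]) suggests that the main contribution to the integral
\[ F_\delta(s) = \int_0^\infty e(\delta t) \eta(t) t^s \frac{dt}{t} \tag{3.5} \]
should come when the phase has derivative 0. The phase part of (3.5) is
\[ e(\delta t) \cdot t^{i\Im(s)} = e^{2\pi i \delta t + i\tau \log t} \]
(where we write $s = \sigma + i\tau$); clearly,
\[ (2\pi\delta t + \tau \log t)' = 2\pi\delta + \frac{\tau}{t} = 0 \]
when $t = -\tau/2\pi\delta$. This is meaningful when $t \geq 0$, i.e., $\mathrm{sgn}(\tau) \neq \mathrm{sgn}(\delta)$. The contribution of $t = -\tau/2\pi\delta$ to (3.5) is then
\[ \eta(t) e(\delta t) t^{s-1} = \eta\!\left(\frac{-\tau}{2\pi\delta}\right) e^{-i\tau} \left(\frac{-\tau}{2\pi\delta}\right)^{\sigma + i\tau - 1} \tag{3.6} \]
multiplied by a width approximately equal to a constant divided by
\[ \sqrt{|(2\pi i \delta t + i\tau \log t)''|} = \sqrt{|-i\tau/t^2|} = \frac{2\pi|\delta|}{\sqrt{|\tau|}}. \]
The absolute value of (3.6) is
\[ \eta\!\left(-\frac{\tau}{2\pi\delta}\right) \cdot \left|\frac{-\tau}{2\pi\delta}\right|^{\sigma-1}. \tag{3.7} \]
In other words, if $\mathrm{sgn}(\tau) \neq \mathrm{sgn}(\delta)$ and $\delta$ is not too small, asking that $F_\delta(\sigma + i\tau)$ decay rapidly as $|\tau| \to \infty$ amounts to asking that $\eta(t)$ decay rapidly as $t \to \infty$. Thus, if we ask for $F_\delta(\sigma + i\tau)$ to decay rapidly as $|\tau| \to \infty$ for all moderate $\delta$, we are requesting that
(1) $\eta(t)$ decay rapidly as $t \to \infty$,
(2) the Mellin transform $F_0(\sigma + i\tau)$ decay rapidly as $\tau \to \pm\infty$.
Requirement (2) is there because we also need to consider $F_\delta(\sigma + i\tau)$ for $\delta$ very small, and, in particular, for $\delta = 0$.
There is clearly an uncertainty-principle issue here; one cannot do arbitrarily well in both aspects at the same time. Once we are conscious of this, the choice $\eta(t) = e^{-t}$ in Hardy-Littlewood actually looks fairly good: obviously, $\eta(t) = e^{-t}$ decays exponentially, and its Mellin transform $\Gamma(\sigma + i\tau)$ also decays exponentially as $\tau \to \pm\infty$. Moreover, for this choice of $\eta$, the Mellin transform $F_\delta(s)$ can be written explicitly: $F_\delta(s) = \Gamma(s)/(1 - 2\pi i \delta)^s$.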
It is not hard to work out an explicit formula² for $\eta(t) = e^{-t}$. However, it is not hard to see that, for $F_\delta(s)$ as above, $F_\delta(1/2 + it)$ decays like $e^{-t/2\pi|\delta|}$, just as we expected from (3.7). This is a little too slow for our purposes: we will often have to work with relatively large $\delta$, and we would like to have to check the zeroes of $L$ functions only up to relatively low heights $t$. We will settle for a different choice of $\eta$: the Gaussian.
The decay of the Gaussian smoothing function $\eta(t) = e^{-t^2/2}$ is much faster than exponential. Its Mellin transform is $2^{s/2-1}\Gamma(s/2)$, which decays exponentially as $\Im(s) \to \pm\infty$. Moreover, the Mellin transform $F_\delta(s)$ ($\delta \neq 0$), while not an elementary or very commonly occurring function, equals (after a change of variables) a relatively well-studied special function, namely, a parabolic cylinder function $U(a, z)$ (or, in Whittaker's [Whi03] notation, $D_{-a-1/2}(z)$).
For $\delta$ not too small, the main term will indeed work out to be proportional to $e^{-(\tau/2\pi\delta)^2/2}$, as the method of stationary phase indicated. This is, of course, much better than $e^{-\tau/2\pi|\delta|}$. The cost is that the Mellin transform for $\delta = 0$ (essentially $\Gamma(s/2)$) now decays like $e^{-(\pi/4)|\tau|}$ rather than $e^{-(\pi/2)|\tau|}$. This we can certainly afford.
3.2. The Mellin transform of the twisted Gaussian. We wish to approximate the Mellin transform
\[ F_\delta(s) = \int_0^\infty e^{-t^2/2} e(\delta t) t^s \frac{dt}{t}, \]
where $\delta \in \mathbb{R}$. The parabolic cylinder function $U : \mathbb{C}^2 \to \mathbb{C}$ is given by
\[ U(a, z) = \frac{e^{-z^2/4}}{\Gamma\left(\frac{1}{2} + a\right)} \int_0^\infty t^{a - \frac{1}{2}} e^{-\frac{1}{2}t^2 - zt}\, dt \]
for $\Re(a) > -1/2$; this can be extended to all $a, z \in \mathbb{C}$ either by analytic continuation or by other integral representations ([AS64, §19.5], [Tem10, §12.5(i)]). Hence
\[ F_\delta(s) = e^{(\pi i \delta)^2}\, \Gamma(s)\, U\!\left(s - \frac{1}{2}, -2\pi i \delta\right). \tag{3.8} \]
The second argument of $U$ is purely imaginary; it would be otherwise if a Gaussian of non-zero mean were chosen.
Let us briefly discuss the state of knowledge up to date on Mellin transforms of twisted Gaussian smoothings, that is, $e^{-t^2/2}$ multiplied by an additive character $e(\delta t)$. As we have just seen, these Mellin transforms are precisely the parabolic cylinder functions $U(a, z)$.
The function $U(a, z)$ has been well-studied for $a$ and $z$ real; see, e.g., [Tem10]. Less attention has been paid to the more general case of $a$ and $z$ complex. The most notable exception is by far the work of Olver [Olv58], [Olv59], [Olv61], [Olv65]; he gave asymptotic series for $U(a, z)$, $a, z \in \mathbb{C}$. These were asymptotic series in the sense of Poincaré, and thus not in general convergent; they would solve our problem if and only if they came with error term bounds. Unfortunately, it would seem that all fully explicit error terms in the literature are either for $a$ and $z$ real, or for $a$ and $z$ outside our range of interest (see both Olver's work and [TV03]). The bounds in [Olv61] involve non-explicit constants. Thus, we will have to find expressions with explicit error bounds ourselves. Our case is that of $a$ in the critical strip, $z$ purely imaginary.
² There may be a minor gap in the literature in this respect. The explicit formula given in [HL23, Lemma 4] does not make all constants explicit. The constants and trivial-zero terms were fully worked out for $q = 1$ by [Wig20] (cited in [MV07, Exercise 12.1.1.8(c)]; the sign of hyp,q (z) there seems to be off). As was pointed out by Landau (see [Har66, p. 628]), [HL23, Lemma 4] actually has mistaken terms for $\chi$ non-primitive. (The author thanks R. C. Vaughan for this information and the references.)
3.3. General approach and situation. We will use the saddle-point method (see, e.g., [dB81, §5], [Olv74, §4.7], [Won01, §II.4]) to obtain bounds with an optimal leading-order term and small error terms. (We used the stationary-phase method solely as an exploratory tool.)
What do we expect to obtain? Both the asymptotic expressions in [Olv59] and the bounds in [Olv61] make clear that, if the sign of $\tau = \Im(s)$ is different from that of $\delta$, there will be a change in behavior when $\tau$ gets to be of size about $(2\pi\delta)^2$. This is unsurprising, given our discussion using stationary phase: for $|\Im(a)|$ smaller than a constant times $|\Im(z)|^2$, the term proportional to $e^{-(\pi/4)|\tau|} = e^{-\pi|\Im(a)|/4}$ should be dominant, whereas for $|\Im(a)|$ much larger than a constant times $|\Im(z)|^2$, the term proportional to $e^{-\frac{1}{2}\left(\frac{\tau}{2\pi\delta}\right)^2}$ should be dominant.
3.4. Setup. We write
\[ \phi(u) = \frac{u^2}{2} - (2\pi i \delta) u - i\tau \log u \tag{3.9} \]
for $u$ real or complex, so that
\[ F_\delta(s) = \int_0^\infty e^{-\phi(u)} u^\sigma \frac{du}{u}. \]
We will be able to shift the contour of integration as we wish, provided that it starts at 0 and ends at a point at infinity while keeping within the sector $\arg(u) \in (-\pi/4, \pi/4)$.
We wish to find a saddle point. At a saddle point, $\phi'(u) = 0$. This means that
\[ u - 2\pi i \delta - \frac{i\tau}{u} = 0, \quad \text{i.e.,} \quad u^2 + i\ell u - i\tau = 0, \tag{3.10} \]
where $\ell = -2\pi\delta$. The solutions to $\phi'(u) = 0$ are thus
\[ u_0 = \frac{-i\ell \pm \sqrt{-\ell^2 + 4i\tau}}{2}. \tag{3.11} \]
The second derivative at $u_0$ is
\[ \phi''(u_0) = \frac{1}{u_0^2} \left( u_0^2 + i\tau \right) = \frac{1}{u_0^2} \left( -i\ell u_0 + 2i\tau \right). \tag{3.12} \]
Assign the names $u_{0,+}$, $u_{0,-}$ to the roots in (3.11) according to the sign in front of the square-root (where the square-root is defined so as to have argument in $(-\pi/2, \pi/2]$).
We assume without loss of generality that $\tau \geq 0$. We shall also assume at first that $\ell \geq 0$ (i.e., $\delta \leq 0$), as the case $\ell < 0$ is much easier.
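The saddle-point equations (3.10)-(3.12) are easy to check numerically. The sketch below computes $u_{0,\pm}$ with the principal square root (whose argument lies in $(-\pi/2, \pi/2]$, matching the convention above) and evaluates $\phi'$ and $\phi''$ there. The function names and the sample values of $\delta$ and $\tau$ are illustrative assumptions.

```python
import cmath
import math

def saddle_points(delta, tau):
    """Roots u_{0,+}, u_{0,-} of u^2 + i*ell*u - i*tau = 0 with ell = -2*pi*delta, as in (3.11)."""
    ell = -2.0 * math.pi * delta
    root = cmath.sqrt(-ell * ell + 4j * tau)   # principal branch: argument in (-pi/2, pi/2]
    return (-1j * ell + root) / 2.0, (-1j * ell - root) / 2.0

def phi_prime(u, delta, tau):
    """Derivative of phi(u) = u^2/2 - (2*pi*i*delta)*u - i*tau*log(u)."""
    return u - 2j * math.pi * delta - 1j * tau / u

def phi_second(u, tau):
    """Second derivative of phi: 1 + i*tau/u^2."""
    return 1.0 + 1j * tau / (u * u)

if __name__ == "__main__":
    delta, tau = -3.0, 50.0                    # sample values with ell >= 0 and tau >= 0
    ell = -2.0 * math.pi * delta
    for u0 in saddle_points(delta, tau):
        rhs_312 = (-1j * ell * u0 + 2j * tau) / (u0 * u0)
        print(u0, abs(phi_prime(u0, delta, tau)), abs(phi_second(u0, tau) - rhs_312))
```

Both printed residuals should be at the level of rounding error, confirming that the two forms of $\phi''(u_0)$ in (3.12) agree at a saddle point.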
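3.5. The saddle point. Let us start by estimating
\[ \left| u_{0,+}^s e^{y/2} \right| = |u_{0,+}|^\sigma e^{-\arg(u_{0,+})\tau} e^{y/2}, \tag{3.13} \]
where $y = \Re(-\ell i u_{0,+})$. (This is the main part of the contribution of the saddle point, without the factor that depends on the contour.) We have
\[ y = \Re\left( -\frac{i\ell}{2} \left( -i\ell + \sqrt{-\ell^2 + 4i\tau} \right) \right) = \Re\left( -\frac{\ell^2}{2} - \frac{i\ell^2}{2} \sqrt{-1 + \frac{4i\tau}{\ell^2}} \right). \tag{3.14} \]
Solving a quadratic equation, we get that
\[ \sqrt{-1 + \frac{4i\tau}{\ell^2}} = \sqrt{\frac{j(\rho) - 1}{2}} + i \sqrt{\frac{j(\rho) + 1}{2}}, \tag{3.15} \]
where $j(\rho) = (1 + \rho^2)^{1/2}$ and $\rho = 4\tau/\ell^2$. Thus
\[ y = \frac{\ell^2}{2} \left( \sqrt{\frac{j(\rho) + 1}{2}} - 1 \right). \]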
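Let us now compute the argument of $u_{0,+}$:
\[ \begin{aligned} \arg(u_{0,+}) &= \arg\left( -i\ell + \sqrt{-\ell^2 + 4i\tau} \right) = \arg\left( -i + \sqrt{-1 + i\rho} \right) \\ &= \arg\left( -i + \sqrt{\frac{-1 + j(\rho)}{2}} + i \sqrt{\frac{1 + j(\rho)}{2}} \right) \\ &= \arcsin\left( \frac{\sqrt{\frac{1 + j(\rho)}{2}} - 1}{\sqrt{2 \sqrt{\frac{1 + j(\rho)}{2}} \left( \sqrt{\frac{1 + j(\rho)}{2}} - 1 \right)}} \right) \\ &= \arcsin \sqrt{\frac{1}{2} \left( 1 - \sqrt{\frac{2}{1 + j(\rho)}} \right)} = \frac{1}{2} \arccos \sqrt{\frac{2}{1 + j(\rho)}} \end{aligned} \tag{3.16} \]
(by $\cos 2\theta = 1 - 2\sin^2\theta$). Thus
\[ \begin{aligned} -\arg(u_{0,+})\tau + \frac{y}{2} &= -\left( \frac{1}{2} \arccos \sqrt{\frac{2}{1 + j(\rho)}} - \frac{\ell^2}{4\tau} \left( \sqrt{\frac{j(\rho) + 1}{2}} - 1 \right) \right) \tau \\ &= -\frac{1}{2} \left( \arccos \frac{1}{\upsilon(\rho)} - \frac{2(\upsilon(\rho) - 1)}{\rho} \right) \tau, \end{aligned} \tag{3.17} \]
where $\upsilon(\rho) = \sqrt{(1 + j(\rho))/2}$.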
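It is clear that
\[ \lim_{\rho \to \infty} \left( \arccos \frac{1}{\upsilon(\rho)} - \frac{2(\upsilon(\rho) - 1)}{\rho} \right) = \frac{\pi}{2}, \tag{3.18} \]
whereas
\[ \arccos \frac{1}{\upsilon(\rho)} - \frac{2(\upsilon(\rho) - 1)}{\rho} \sim \frac{\rho}{2} - \frac{\rho}{4} = \frac{\rho}{4} \tag{3.19} \]
as $\rho \to 0^+$.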
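The identities (3.16)-(3.19) lend themselves to a quick numerical check. The sketch below compares the exponent $-\arg(u_{0,+})\tau + y/2$, computed directly from the saddle point, with $-E(\rho)\tau$, where $E(\rho) = \frac{1}{2}\left(\arccos\frac{1}{\upsilon(\rho)} - \frac{2(\upsilon(\rho)-1)}{\rho}\right)$ as in Theorem 3.1. The function names and sample values are illustrative assumptions; the check is done in ordinary floating point and is not a rigorous verification.

```python
import cmath
import math

def j(rho):
    return math.sqrt(1.0 + rho * rho)

def upsilon(rho):
    return math.sqrt((1.0 + j(rho)) / 2.0)

def E(rho):
    """E(rho) = (1/2) * (arccos(1/upsilon(rho)) - 2*(upsilon(rho) - 1)/rho)."""
    return 0.5 * (math.acos(1.0 / upsilon(rho)) - 2.0 * (upsilon(rho) - 1.0) / rho)

def saddle_exponent(ell, tau):
    """-arg(u_{0,+})*tau + y/2 with y = Re(-i*ell*u_{0,+}), computed directly (ell > 0, tau > 0)."""
    u = (-1j * ell + cmath.sqrt(-ell * ell + 4j * tau)) / 2.0
    y = (-1j * ell * u).real
    return -cmath.phase(u) * tau + y / 2.0

if __name__ == "__main__":
    ell, tau = 5.0, 40.0
    rho = 4.0 * tau / (ell * ell)
    print(saddle_exponent(ell, tau), -E(rho) * tau)   # the two sides of (3.17)
    print(E(1e8), math.pi / 4.0)                      # half of the limit in (3.18)
    print(E(1e-3), 1e-3 / 8.0)                        # half of the asymptotic in (3.19)
```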
We are still missing a factor of $|u_{0,+}|^\sigma$ (from (3.13)), a factor of $|u_{0,+}|^{-1}$ (from the invariant differential $du/u$) and a factor of $1/\sqrt{|\phi''(u_{0,+})|}$ (from the passage by the saddle-point along a path of steepest descent). By (3.12), this is
\[ \frac{|u_{0,+}|^{\sigma-1}}{\sqrt{|\phi''(u_{0,+})|}} = \frac{|u_{0,+}|^{\sigma-1}}{\frac{1}{|u_{0,+}|} \sqrt{|-i\ell u_{0,+} + 2i\tau|}} = \frac{|u_{0,+}|^{\sigma}}{\sqrt{|-i\ell u_{0,+} + 2i\tau|}}. \]
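By (3.11) and (3.15),
\[ \begin{aligned}
|u_{0,+}| &= \left|\frac{-i\ell + \sqrt{-\ell^2 + 4i\tau}}{2}\right| = \frac{\ell}{2}\left|\sqrt{\frac{-1 + j(\rho)}{2}} + \left(\sqrt{\frac{1 + j(\rho)}{2}} - 1\right)i\right| \\
&= \frac{\ell}{2}\sqrt{\frac{-1 + j(\rho)}{2} + \frac{1 + j(\rho)}{2} + 1 - 2\sqrt{\frac{1 + j(\rho)}{2}}} \\
&= \frac{\ell}{2}\sqrt{1 + j(\rho) - 2\sqrt{\frac{1 + j(\rho)}{2}}} = \frac{\ell}{\sqrt{2}}\sqrt{\upsilon(\rho)^2 - \upsilon(\rho)}.
\end{aligned} \tag{3.20} \]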
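Proceeding as in (3.14), we obtain that
\[ \begin{aligned}
|-i\ell u_{0,+} + 2i\tau| &= \left|-\frac{i\ell^2}{2}\left(-i + \sqrt{-1 + \frac{4i\tau}{\ell^2}}\right) + 2i\tau\right| \\
&= \left|-\frac{\ell^2}{2} + 2i\tau + \frac{\ell^2}{2}\sqrt{\frac{j(\rho) + 1}{2}} - \frac{i\ell^2}{2}\sqrt{\frac{j(\rho) - 1}{2}}\right| \\
&= \frac{\ell^2}{2}\sqrt{\left(-1 + \sqrt{\frac{j(\rho) + 1}{2}}\right)^2 + \left(\rho - \sqrt{\frac{j(\rho) - 1}{2}}\right)^2} \\
&= \frac{\ell^2}{2}\sqrt{j(\rho) + \rho^2 + 1 - 2\sqrt{\frac{j(\rho) + 1}{2}} - 2\rho\sqrt{\frac{j(\rho) - 1}{2}}}.
\end{aligned} \tag{3.21} \]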
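Since $\sqrt{j(\rho) - 1} = \rho/\sqrt{j(\rho) + 1}$, this means that
\[ \begin{aligned}
|-i\ell u_{0,+} + 2i\tau| &= \frac{\ell^2}{2}\sqrt{j(\rho) + \rho^2 + 1 - \frac{\sqrt{2}}{\sqrt{j(\rho) + 1}}\left(j(\rho) + 1 + \rho^2\right)} \\
&= \frac{\ell^2}{2}\sqrt{j(\rho) + j(\rho)^2 - (\upsilon(\rho))^{-1}\left(j(\rho) + j(\rho)^2\right)} \\
&= \frac{\ell^2}{2}\sqrt{2\upsilon(\rho)^2 j(\rho)\left(1 - (\upsilon(\rho))^{-1}\right)} = \frac{\ell^2\sqrt{j(\rho)}}{\sqrt{2}}\sqrt{\upsilon(\rho)^2 - \upsilon(\rho)}.
\end{aligned} \tag{3.22} \]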
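The closed forms (3.16), (3.20) and (3.22) can be compared against a direct evaluation at the saddle point. The sketch below does this for sample values; the function names and the sample pairs $(\ell, \tau)$ are assumptions made for illustration only.

```python
import cmath
import math

def closed_forms(ell, tau):
    rho = 4 * tau / ell**2
    jr = math.sqrt(1 + rho**2)            # j(rho)
    ups = math.sqrt((1 + jr) / 2)         # upsilon(rho)
    mod_u0 = ell / math.sqrt(2) * math.sqrt(ups**2 - ups)                         # (3.20)
    arg_u0 = 0.5 * math.acos(1 / ups)                                             # (3.16)
    mod_num = ell**2 * math.sqrt(jr) / math.sqrt(2) * math.sqrt(ups**2 - ups)     # (3.22)
    return mod_u0, arg_u0, mod_num

def direct(ell, tau):
    # u_{0,+} from (3.11), principal square root
    u0 = (-1j * ell + cmath.sqrt(-ell**2 + 4j * tau)) / 2
    return abs(u0), cmath.phase(u0), abs(-1j * ell * u0 + 2j * tau)

samples = [(3.0, 7.0), (0.5, 40.0), (10.0, 1.0)]
results = [(closed_forms(l, t), direct(l, t)) for l, t in samples]
```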
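Hence, by (3.20) and (3.22),
\[ \begin{aligned}
\frac{|u_{0,+}|^{\sigma}}{\sqrt{|-i\ell u_{0,+} + 2i\tau|}} &= \frac{\frac{\ell^{\sigma}}{2^{\sigma/2}}\left(\upsilon(\rho)^2 - \upsilon(\rho)\right)^{\sigma/2}}{\frac{\ell}{2^{1/4}}\,j(\rho)^{1/4}\left(\upsilon(\rho)^2 - \upsilon(\rho)\right)^{1/4}} = \frac{\ell^{\sigma-1}}{2^{\frac{\sigma}{2} - \frac{1}{4}}\,j(\rho)^{\frac{1}{4}}}\left(\upsilon(\rho)^2 - \upsilon(\rho)\right)^{\frac{\sigma}{2} - \frac{1}{4}} \\
&= \frac{2^{\frac{\sigma}{2} - \frac{3}{4}}\,\tau^{\frac{\sigma-1}{2}}}{\rho^{\frac{\sigma-1}{2}}\,j(\rho)^{\frac{1}{4}}}\left(\upsilon(\rho)^2 - \upsilon(\rho)\right)^{\frac{\sigma}{2} - \frac{1}{4}},
\end{aligned} \]
where the last step uses $\ell = 2\sqrt{\tau/\rho}$.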
It remains to determine the direction of steepest descent at the saddle-point $u_{0,+}$. Let $v \in \mathbb{C}$ point in that direction. Then, by definition, $v^2\phi''(u_{0,+})$ is real and positive, where $\phi$ is as in (3.9). Thus $\arg(v) = -\arg(\phi''(u_{0,+}))/2$. By (3.12),
\[ \arg(\phi''(u_{0,+})) = \arg(-i\ell u_{0,+} + 2i\tau) - 2\arg(u_{0,+}). \]
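Starting as in (3.21), we obtain that
\[ \arg(-i\ell u_{0,+} + 2i\tau) = \arctan\left(\frac{\rho - \sqrt{\frac{j-1}{2}}}{-1 + \sqrt{\frac{j+1}{2}}}\right) \]
and, writing $j = j(\rho)$ and $\upsilon = \upsilon(\rho)$,
\[ \begin{aligned}
\frac{\rho - \sqrt{\frac{j-1}{2}}}{-1 + \sqrt{\frac{j+1}{2}}} &= \frac{\left(\rho - \sqrt{\frac{j-1}{2}}\right)\left(\sqrt{\frac{j+1}{2}} + 1\right)}{\frac{j+1}{2} - 1} = \frac{-\sqrt{2(j-1)} + \rho\sqrt{2(j+1)} + \rho}{j - 1} \\
&= \frac{\rho\left(-\frac{1}{\upsilon} + 2\upsilon + 1\right)}{j - 1} = \frac{\rho(1 + j/\upsilon)}{j - 1} \\
&= \frac{(j + 1)(1 + j/\upsilon)}{\rho} = \frac{2\upsilon(\upsilon + j)}{\rho}.
\end{aligned} \tag{3.23} \]
(Here we use $\sqrt{2(j-1)} = \rho/\upsilon$, $\sqrt{2(j+1)} = 2\upsilon$, $2\upsilon^2 = j + 1$ and $(j-1)(j+1) = \rho^2$.)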
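Hence, by (3.16),
\[ \arg(\phi''(u_{0,+})) = \arctan\frac{2\upsilon(\upsilon + j)}{\rho} - \arccos\upsilon(\rho)^{-1}. \]
Therefore, the direction of steepest descent is
\[ \arg(v) = -\frac{\arg(\phi''(u_{0,+}))}{2} = \arg(u_{0,+}) - \frac{1}{2}\arctan\frac{2\upsilon(\upsilon + j)}{\rho} = \arg(u_{0,+}) - \arctan\varpi, \tag{3.24} \]
where
\[ \varpi = \tan\left(\frac{1}{2}\arctan\frac{2\upsilon(\upsilon + j)}{\rho}\right). \tag{3.25} \]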
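Since
\[ \tan\frac{\alpha}{2} = \frac{1}{\sin\alpha} - \frac{1}{\tan\alpha} = \sqrt{1 + \frac{1}{\tan^2\alpha}} - \frac{1}{\tan\alpha} \]
for $\alpha \in (0, \pi/2)$, we see that
\[ \varpi = \sqrt{1 + \frac{\rho^2}{4\upsilon^2(\upsilon + j)^2}} - \frac{\rho}{2\upsilon(\upsilon + j)}. \tag{3.26} \]
Recall as well that
\[ \cos\frac{\alpha}{2} = \sqrt{\frac{1 + \cos\alpha}{2}}, \qquad \sin\frac{\alpha}{2} = \sqrt{\frac{1 - \cos\alpha}{2}}. \]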
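Hence, if we let
\[ \theta_0 = \arg(u_{0,+}) = \frac{1}{2}\arccos\frac{1}{\upsilon(\rho)}, \tag{3.27} \]
we get that
\[ \begin{aligned}
\cos\theta_0 &= \cos\left(\frac{1}{2}\arccos\frac{1}{\upsilon(\rho)}\right) = \sqrt{\frac{1}{2} + \frac{1}{2\upsilon(\rho)}}, \\
\sin\theta_0 &= \sin\left(\frac{1}{2}\arccos\frac{1}{\upsilon(\rho)}\right) = \sqrt{\frac{1}{2} - \frac{1}{2\upsilon(\rho)}}.
\end{aligned} \tag{3.28} \]
We will prove now the useful inequality
\[ \arctan\varpi > \theta_0, \tag{3.29} \]
i.e., $\arg(v) < 0$. By (3.24), (3.25) and (3.27), this is equivalent to $\arccos(1/\upsilon) < \arctan 2\upsilon(\upsilon + j)/\rho$. Since $\tan\alpha = \sqrt{1/\cos^2\alpha - 1}$, we know that $\arccos(1/\upsilon) = \arctan\sqrt{\upsilon^2 - 1}$; thus, in order to prove (3.29), it is enough to check that
\[ \sqrt{\upsilon^2 - 1} < \frac{2\upsilon(\upsilon + j)}{\rho}. \]
This is easy, since $j > \rho$ and $\sqrt{\upsilon^2 - 1} < \upsilon < 2\upsilon$.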
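The following sketch evaluates $\varpi$ both from the definition (3.25) and from the closed form (3.26), and checks the inequality (3.29) on a small grid of values of $\rho$. The function names and the grid are illustrative assumptions; the threshold 0.798 in the accompanying check is a deliberately loose stand-in for the rigorous bound quoted from (A.2).

```python
import math

def upsilon(rho):
    # upsilon(rho) = sqrt((1 + j(rho))/2), with j(rho) = sqrt(1 + rho^2)
    return math.sqrt((1 + math.sqrt(1 + rho**2)) / 2)

def tangent_arg(rho):
    # T = 2*upsilon*(upsilon + j)/rho, the argument of arctan in (3.25)
    jr = math.sqrt(1 + rho**2)
    ups = upsilon(rho)
    return 2 * ups * (ups + jr) / rho

def varpi_def(rho):
    # definition (3.25)
    return math.tan(0.5 * math.atan(tangent_arg(rho)))

def varpi_closed(rho):
    # closed form (3.26)
    t = tangent_arg(rho)
    return math.sqrt(1 + 1 / t**2) - 1 / t

def theta0(rho):
    # (3.27)
    return 0.5 * math.acos(1 / upsilon(rho))

grid = [0.01, 0.1, 1.0, 2.5, 10.0, 1000.0]
values = [(rho, varpi_def(rho), varpi_closed(rho), theta0(rho)) for rho in grid]
```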

3.6. The contour. We must now choose the contour of integration. First, let us discuss our options. By (A.2), $\varpi \geq 0.79837$; moreover, it is easy to show that $\varpi$ tends to 1 when either $\rho \to 0^+$ or $\rho \to \infty$. This means that neither the simplest contour (a straight ray from the origin within the first quadrant) nor what is arguably the second simplest contour (leaving the origin on a ray within the first quadrant, then sliding down a circle centered at the origin, passing past the saddle point until you reach the x-axis, which you then follow to infinity) is acceptable: either contour passes through the saddle point on a direction close to 45 degrees (= arctan(1)) off from the direction of steepest descent. (The saddle-point method allows in principle for any direction less than 45 degrees off from the direction of steepest descent, but the bounds can degrade rapidly, by more than a constant factor, when 45 degrees are approached.)
It is thus best to use a curve that goes through the saddle point $u_{0,+}$ in the direction of steepest descent. We thus should use an element of a two-parameter family of curves. The curve should also have a simple description in terms of polar coordinates.
We decide that our contour $C$ will be a limaçon of Pascal. (Eccentric circles would have been another reasonable choice.) Let $C$ be parameterized by
\[ y = \left(-\frac{c_1}{\ell}r + c_0\right)r, \qquad x = \sqrt{r^2 - y^2} \tag{3.30} \]
for $r \in [(c_0 - 1)\ell/c_1, c_0\ell/c_1]$, where $c_0$ and $c_1$ are parameters to be set later. The curve goes from $(0, (c_0 - 1)\ell/c_1)$ to $(c_0\ell/c_1, 0)$, and stays within the first quadrant.$^3$ In order for the curve to go through the point $u_{0,+}$, we must have
\[ -\frac{c_1 r_0}{\ell} + c_0 = \sin\theta_0, \tag{3.31} \]
where
\[ r_0 = |u_{0,+}| = \frac{\ell}{\sqrt{2}}\sqrt{\upsilon(\rho)^2 - \upsilon(\rho)}, \tag{3.32} \]
and $\theta_0$ and $\sin\theta_0$ are as in (3.27) and (3.28). We must also check that the curve $C$ goes through $u_{0,+}$ in the direction of steepest descent. The argument of the point $(x, y)$ is
\[ \theta = \arcsin\frac{y}{r} = \arcsin\left(-\frac{c_1 r}{\ell} + c_0\right). \]
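Hence
\[ r\frac{d\theta}{dr} = r\,\frac{d\arcsin\left(-\frac{c_1 r}{\ell} + c_0\right)}{dr} = r\,\frac{-c_1/\ell}{\cos\arcsin\left(-\frac{c_1 r}{\ell} + c_0\right)} = \frac{-c_1 r}{\ell\cos\theta}. \]
This means that, if $v$ is tangent to $C$ at the point $u_{0,+}$,
\[ \tan(\arg(v) - \arg(u_{0,+})) = r\frac{d\theta}{dr}\Big|_{r = r_0} = \frac{-c_1 r_0}{\ell\cos\theta_0}, \]
and so, by (3.24),
\[ c_1 = \frac{\ell\cos\theta_0}{r_0}\,\varpi, \tag{3.33} \]
where $\varpi$ is as in (3.25). In consequence,
\[ c_0 = \frac{c_1 r_0}{\ell} + \sin\theta_0 = (\cos\theta_0)\varpi + \sin\theta_0, \]
and so, by (3.28) and (3.32),
\[ c_1 = \sqrt{\frac{1 + 1/\upsilon}{\upsilon^2 - \upsilon}}\,\varpi, \qquad c_0 = \sqrt{\frac{1}{2} + \frac{1}{2\upsilon}}\,\varpi + \sqrt{\frac{1}{2} - \frac{1}{2\upsilon}}. \tag{3.34} \]
$^3$ Because $c_0 \geq 1$, by (A.14).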
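Incidentally, we have also shown that the arc-length infinitesimal is
\[ |du| = \sqrt{1 + \left(r\frac{d\theta}{dr}\right)^2}\,dr = \sqrt{1 + \frac{(c_1 r/\ell)^2}{\cos^2\theta}}\,dr = \sqrt{1 + \frac{r^2}{\frac{\ell^2}{c_1^2} - \left(\frac{c_0}{c_1}\ell - r\right)^2}}\,dr. \tag{3.35} \]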
The contour will be as follows: first we go out of the origin along a straight
radial segment C1 ; then we meet the curve C, and we follow it clockwise for a
segment C2 , with the saddle-point roughly at its midpoint; then we follow another
radial ray C3 up to infinity. For small, C3 will just be the x-axis. Both C1 and
C3 will be contained within the first quadrant; we will specify them later.
3.7. The integral over the main contour segment $C_2$. We recall that
\[ \phi(u) = \frac{u^2}{2} + \ell i u - i\tau\log u. \tag{3.36} \]
Our aim is now to bound the integral
\[ \int_{C_2} e^{-\Re(\phi(u))} u^{\sigma-1}\,du \]
over the main contour segment $C_2$. We will proceed as follows. First, we will parameterize the integral using a real variable $\nu$, with the value $\nu = 0$ corresponding to the saddle point $u = u_{0,+}$. We will bound $\Re(\phi(u))$ from below by an expression of the form $\Re(\phi(u_{0,+})) + \eta\nu^2$. We then bound $|u|^{\sigma-1}|du/d\nu|$ from above by a constant. This procedure will give a bound that is larger than the true value by at most a (very moderate) constant factor.
For u = x + iy (or (r, ) in polar coordinates), (3.36) gives us
(3.37)
r 2 2y 2
y
x2 y 2
y + =
y + arcsin
2
2
r
2
4
=
0 () = 2 2 0 (),
((u)) =
where, by (3.30), (3.31), and (3.34),
( + 0 )2
(1 2(sin 0 c1 )2 )
2
arcsin(sin 0 c1 )
+ 0
(sin 0 c1 ) +
,
0 () =
and
(3.38)
r r0
,
0 =
r0
.
By (3.30), (3.31) and (3.38),

y
= c0 c1 ( + 0 ) = sin 0 c1
r
(3.39)
and so
(3.40)
c0 c1 0 = sin 0 .
The variable will range within an interval

(3.41)
[0 , 1 ]

1 sin 0 sin 0
,
.
c1
c1
(Here = (1 sin 0 )/(c1 ) corresponds to the intersection with the y-axis, and
= (sin 0 )/(c1 ) corresponds to the intersection with the x-axis.)
We work out the expansions around 0 of
(3.42)
( + 0 )2
2 cos 20
(1 2(sin 0 c1 )2 ) = 0
+ (0 cos 20 + 202 c1 sin 0 )
2
2

cos 20
2 2 2
+ 4c1 0 sin 0 c1 0 2
+
2
+ 2(0 c21 2 + c1 sin 0 ) 3 c21 2 4 ,

0 sin 0
sin 0
+ 0
(sin 0 c1 ) =
+
+ c1 0 + c1 2 ,
0
1 X Pk (sin 0 ) (c1 )k k
arcsin(sin 0 c1 )
=
+
4
4 4
(cos 0 )2k1
k!
k=1

c1
1
(c1 )2 sin 0 2
0
+
+
+ ... ,
=
4 4 cos 0
2(cos 0 )3
where P1 (t) = 1 and P

k+1 (t) = Pk (t)(1t )+(2k1)tP (t) for k 1. (This follows
from (arcsin z) = 1/ 1 z 2 ; it is easy to show that (arcsin z)(k) = Pk (z)(1
z 2 )(k1/2) .)
P
k
We sum these three expressions and obtain a series 0 () =
k ak . We
already know that
(1) a0 equals the value of ((u))/(2 2 ) at the saddle point u0,+ ,

(2) a1 = 0,
(3)
1
a2 =
2
2
dr
d
2
2
2

du
du
1

| (u0,+ )| |r=r0 = | (u0,+ )| |r=r0 .
dr
2
dr
Here, as we know from (3.12), (3.22) and (3.20),

| iu0,+ + 2i |
| (u0,+ )| =
=
|u0,+ |2
j() 2
2
2
2 (
2j()
,
2
and, by (3.34) and (3.35),
(3.43)
v

u
|du|
du
u
|r=r =
|
=
t1 +
r=r
0
0
dr
|dr|
=
=
Thus,
1
a2 =
2
2
c21
c2
1 + 12
r02
c0
c1
r0
2
2
2 ( )
1
1
2 + 2
2 =
1+
1 + c21
r02
c21
2 1 sin2 0
2
1 + 1/
1 + 2 .
2j()
(1 + 2 ),
2
where is as in (3.26).
Let us simplify our expression for 0 () somewhat. We can replace the third
series in (3.42) by a truncated Taylor series ending at k = 2, namely,

c1
0
1
(c1 )2 sin 1 2
arcsin(sin 0 c1 )
=
+
+
4
4 4 cos 0
2(cos 1 )3
for some 1 between 0 and . Then 1 [0, /2], and so
0
1 c1
arcsin(sin 0 c1 )
.
4
4 4 cos 0
Since
R() = c21 2 2 + 2(sin 0 c1 0 )c1
is a quadratic with negative leading coefficient, its minimum within [0 , 1 ] (see
(3.41)) is bounded from below by min(R((1 sin 0 )/(c1 )), R((sin 0 )/(c1 ))).
We compare

sin 0
= 2c3 sin 0 sin2 0 ,
R
c1
where c3 = sin 0 c1 0 , and

1 sin 0
R
= 2c3 (1 sin 0 ) (1 sin 0 )2
c1
= 2c3 sin 0 sin 0 2c3 1 + 2 sin 0
The question is whether

sin 0
1 sin 0
R
= 2c3 1 + 2 sin 0
R
c1
c1
= 2(sin 0 c1 0 ) 1 + 2 sin 0
= 2c1 0 1
is positive. It is:
c1 0
c1 0 =
= c1
2
=
2
1 + 1/
,
2
2
and, as we know from (A.2), > 0.79837 is greater than 1/ 2 = 0.70710 . . . .

Hence, by (3.40),

sin 0
R() R
= 2c3 sin 0 sin2 0 = sin2 0 2c1 0 sin 0
c1
= sin2 0 2(c0 sin 0 ) sin 0 = 3 sin2 0 2c0 sin 0
= 3 sin2 0 2((cos 0 ) + sin 0 ) sin 0 = sin2 0 (sin 20 ) .
We conclude that
(3.44)
0 ()
((u0,+ ))
+ 2 ,
2 2
where
(3.45)
1 (c1 )2 sin 0
+ sin2 0 (sin 20 ) .
4 2(cos 0 )3
= a2
We can simplify this further, using

)2 sin
1 + 1/
1 (c1
0
2
= 2
3
4 2(cos 0 )
8
=
1
2

1
1 3/2
+
2
2
2
1
2
p
1 1/
2
p
=
2
4 1 + 1/
2
2
q
=
= p
= q
4 2 1
4 j+1 j1
4 2 /4
2
2
and (by (3.28))
1
1
2 1
2 =
sin 20 = 2 sin 0 cos 0 = 2
4 4
2
/2
1
=
=
.
=
2
(j + 1)/2
j+1
Therefore (again by (3.28))

r
2j
1
2 1
1
2
(3.46)
=
(1
+
.
2
2
2
2 2 j + 1
Now recall that our task is to bound the integral

Z 1
Z

2 2 0 ()
1 du dr
((u))
1
e
(( + 0 ))
e
|u|
|du| =
dr d d
0
C2

(3.47)
Z 1

1 du
2 2 2
((u0,+ ))
( + 0 )
e
() e
dr d.
0
(We are using (3.37) and (3.44).) Since u0,+ is a solution to equation (3.10), we
see from (3.9) that
!
u20,+
((u0,+ )) =
+ iu0,+ i log u0,+
2

iu0,+ i
1
=
+
+ arg(u0,+ ) = (iu0,+ ) + arg(u0 , +).
2
2
2
We defined y = (iu0 ) (after (3.13)), and we computed y/2 arg(u0,+ ) in

(3.17). This gives us
((u0 ,+))

arccos
=e
2(1)
1

If 1, we can bound
1
(3.48)
( + 0 )
(
01
(0 + 0 )1
if 0,
if < 0,
provided that 0 + 0 > 0 (as will be the case). If > 1, then

(
01
if 0,
1
( + 0 )
(1 + 0 )1 if > 0.
By (3.35),
s
r
2
du
(c
r/)
(c1 ( + 0 ))2
1
= 1+
1
+
=
dr
cos2
1 (sin 0 c1 )2
(This diverges as /2; this is main reason why we cannot actually follow the
curve all the way to the y-axis.) Since we are aiming at a bound that is tight
only up to an order of magnitude, we can be quite brutal here, as we were when
using (3.48): we bound (c1 r/)2 from above by its value when the curve meets
the x-axis (i.e., when r = c0 /c1 ). We bound cos2 from below by its value when
= 1 . We obtain
s
s
2
du
c
c20
0
= 1+
1
+
=
,
dr
1 (sin 0 c1 1 )2
cos2
where is the value of when = 1 .
Finally, we complete the integral in (3.47), we split it in two (depending on
whether 0 or < 0) and use
Z
Z
/2
1
2
2 2 2
e d = .
e
d
0

0
Therefore,
(3.49)
Z
C2
e((u)) |u|1 |du|

c20
/2
1+
= () e
01 + (j + 0 )1
2
cos

2(1)

1 ! s
arccos 1
2
2
j
1
c0 e
r0
1+ 1+
=
1+
,
2
0
cos2

arccos
2(1)
1

where j = 0 if 1 and j = 1 if > 1. We can set 1 = (sin 0 )/(c1 ). We

can also express 0 + 0 in terms of :
(3.50)
0 + 0 =
(c0 sin ) c1
c0 sin
r
=
=
.
c1
Since 0 = r0 /() (by (3.38)) and r0 is as in (3.32),

p
()2 ()
0 =
.
2
Definition (3.25) implies immediately that 1. Thus, by (3.34),
p
(3.51)
c1 0 = 2(1 + 1/) 2 2,
while, by (A.2),
(3.52)
c1 0 =
2(1 + 1/) 0.79837
By (3.50) and (3.51),

0 1
0
c1 0
2
(3.53)
1+
=
=
,
0
0 + 0
c0 sin
c0 sin
whereas

1/ 2
sin 0
1
1.62628.
1+
=1+
1+
0
c1 0
0.79837 2
We will now use some (rigorous) numerical bounds, proven in Appendix A. First
of all, by (A.14), c0 > 1 for all > 0; this assures us that c0 sin > 0, and
so the last expression in (3.53) is well defined. By (3.50), this also shows that
0 + 0 > 0, i.e., the curve C stays within the first quadrant for 0 /2, as
we said before.
We would also like to have an upper bound for
s

c20
1
,
1+
(3.54)
cos2

using (3.46). With this in mind, we finally choose :
(3.55)
= .
4
Thus, by (A.29),
s
s
c20
1
1 + 2c20 p
min(5, 0.86) min( 5, 0.93 ).
1+
2
cos
We also get
2
2
7.82843.
c0 sin
1 1/ 2
Finally, by (A.32),
(
r
/6
if 4,
2

1
1
2
if > 4
2 23/2 1 23/2
2
and so, since = 4 /, = 2 and (1 1/23/2 ) 2/3, (3.32) gives us

(

2
if 2
2
=
min
, .
r0 23
3
if > 2
3
We conclude that

Z
2(1)
arccos 1
((u))
1
2
e
|u|
|du| = C, e
(3.56)
,
C2
where
(3.57)
1

3.3
3/2
C, = min 2,
1 + max 7.831 , 1.631
min( /, )
for all > 0, > 0 and all . By reflection on the x-axis, the same bound holds
for < 0, < 0 and all . Lastly, (3.56) is also valid for = 0, provided we
replace (3.57) and the exponent of (3.56) by their limits as 0+ .
3.8. The integral over the rest of the contour. It remains to complete the
contour. Since we have set = /4, C1 will be a segment of the ray at 45
degrees from the x-axis, counterclockwise
(i.e., y = x, x 0). The segment will
go from (0, 0) up to (x, y) = (r / 2, r / 2), where, by (3.30),

c1
y
1
=
= r + c0 ,
r
2
and so

1
(3.58)
r =
c0
.
c1
2
Let w = (1 + i)/ 2. Looking at (3.9), we see that

(3.59)
Z
Z

u2 /2
s1
(u) 1

e
e(u)u du =
e
u
du

C1
C
Z r
Z 1
((u))
1
e((tw)) t1 dt,
e
|u|
|du| =
C1
where (u) is as in (3.36). Here

2

t
t

= + ,
i + iwt i log t + i
(3.60)
((tw)) =
2
4
4
2
and, by (A.33),

r
+ 0.076392 + =
0.30556 > 0.4798.
4
4
4
2
Consider first the case 1. Then
Z r
Z r
r
1
2 4
((tw)) 1
e
.
e
t
dt r
e 2 4 dt r
0
By (3.58) and (A.33),
(3.61)
Hence, for 1,
(3.62)
C1
u2 /2
e(u)u
/2
s1

du /2 e0.4798 .
Assume now that 0 < 1, s 6= 0. We can see that it is wise to start by an

integration by parts, so as to avoid convergence problems arising from the term
t1 within the integral as 0+ . We have
Z
Z
us
us wr
2
u2 /2
s1
u2 /2
eu /2 e(u)
du.
e
e(u)u du = e
e(u) |0
s
s
C1
C1
By (3.60),
r
u2 /2
r
us wr

((wr )) r
4
e
2
e(u)
|
e
=
e
0

s
|s|
As for the integral,

(3.63)
Z
Z
us
us
2
u2 /2
(u + i)eu /2iu du
du =
e
e(u)
s
s
C1
C1
Z
Z
1
2
i
2
=
eu /2 e(u)us+1 du
eu /2 e(u)us du.
s C1
s C1
Hence, by (3.59) and (3.60),

Z
Z r
Z r
us

t
t
4 +1
u2 /2

2
t
dt +
e
e(u)
du
e
e 2 4 t dt

s
|s| 0
|s| 0
C1
!Z
+1
r t
r
r

e 2 4 dt
+
|s|
|s|
0
!
+1
r
+ r
r
2
min
, r e 2 4
+2
r
r
2r
e 2 4.
+
By (3.61),
+2
r
(1 + 2)r
2 +1 + (1 + 2) 2
+
.
We conclude that
2 +1 + (1 + 2) 2 r +

u2 /2
s1

e
e(u)u du
e 2 4

C1
!
1+ 2
2 e0.4798
1+
when [0, 1); by (3.62), this is true for 1 as well.

Now let us examine the contribution of the last segment C3 of the contour.
Since C2 hits the x-axis at c0 /c1 , we define C3 to be the segment of the x-axis
going from x = c0 /c1 till x = . Then
Z
Z
Z

dx
dx
dt
2 /2
2
2 /2

x
s
t
s

=
e
e(x)x
ex /2 x .
e
e(t)t

(3.64)
c0

c0

t
x
x
C3
c1
c1
Now

2
2
2
ex /2 x2 = ex /2 x1 ( 2)ex /2 x3

2
2
2
ex /2 (x2 + ( 2)x4 ) = ex /2 x1 ( 2)( 4)ex /2 x5
and so on, implying that

Z
dx
2
ex /2 x
x
t
x

x2 /2
e
x2 + ( 2)x4
2
x
+ ( 2)x4 + ( 2)( 4)x6
and so on. By (A.36),
c0
min
c1
,
4
if 0 2,
if 2 4.
if 4 6,
We conclude that

Z

1 2 25

2 /2
dt
,
min
(
)
t
s
2
32
P min

,
,
e
e
e(t)t

t
4
C3
where we can set P (x) = x2 if [0, 2], P (x) = x2 +(2)x4 if [2, 4]

and P (x) = x2 + ( 2)x4 + . . . + ( 2k)x2(k+1) if [2k, 2(k + 1)].
***
We have left the case < 0 for the very end. In this case, we can afford to use
a straight ray from the origin as our contour of integration. Let C be the ray at
angle /4 from the origin, i.e., y = (tan(/4 ))x, x > 0, where > 0 is
small. Write v = e(/4)i . The integral to be estimated is
Z
2
eu /2 e(u)us1 du.
I=
C
Let us try = 0 first. Much as in (3.59) and (3.60), we obtain, for < 0,
Z t
Z
dt
+ 4 1
4
2
t
dt = e
|I|
e
e||t/ 2 t
t
0
0
! Z
!
(3.65)
2
2
dt
= e 4
et t =
() e 4
||
t
||
0
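for $\sigma > 0$. Recall that $\Gamma(\sigma) \leq \sigma^{-1}$ for $0 < \sigma < 1$ (because $\sigma\Gamma(\sigma) = \Gamma(\sigma + 1)$ and $\Gamma(\sigma) \leq 1$ for all $\sigma \in [1, 2]$; the inequality $\Gamma(\sigma) \leq 1$ for $\sigma \in [1, 2]$ can in turn be proven by $\Gamma(1) = \Gamma(2) = 1$, $\Gamma'(1) < 0 < \Gamma'(2)$ and the convexity of $\Gamma(\sigma)$). We see that, while (3.65) is very good in most cases, it poses problems when either $\ell$ or $\sigma$ is close to 0.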
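The two elementary facts about $\Gamma$ used here are easy to spot-check numerically. The sketch below samples $\Gamma(\sigma) \leq 1$ on $[1, 2]$ and $\Gamma(\sigma) \leq 1/\sigma$ on $(0, 1)$ with the standard-library `math.gamma`; the grid sizes and the small floating-point tolerance are arbitrary choices, and a sampled check is of course not a proof.

```python
import math

TOL = 1e-12  # slack for floating-point rounding at the endpoints

def gamma_at_most_one_on_1_2(samples=1000):
    # Gamma(sigma) <= 1 for sigma in [1, 2]
    return all(math.gamma(1 + k / samples) <= 1 + TOL for k in range(samples + 1))

def gamma_at_most_inverse_on_0_1(samples=1000):
    # Gamma(sigma) <= 1/sigma for 0 < sigma < 1,
    # which follows from sigma*Gamma(sigma) = Gamma(sigma + 1) <= 1
    return all(
        math.gamma(k / samples) <= samples / k + TOL
        for k in range(1, samples)
    )

ok_12 = gamma_at_most_one_on_1_2()
ok_01 = gamma_at_most_inverse_on_0_1()
```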
Let us first deal with the issue of small. For general and 0,

Z t2
sin 2t cos( 4 )+( 4 ) 1
t
dt
|I|
e 2
0
( 4 )
e
=
( 4 )
e( 4 )
=
t
t
(sin 2)/2
t2 sin 2 dt
e
2/21
(sin 2)/2
ey y 2
t2
e 2 t
dt
t
dy
e
/2
4
.
2
(/2)e
=
y
2(sin 2)/2
Here we can choose = (arcsin 2/ )/2 (for 2). Then 2 (/2) (2/ ) =
/ , and so
(3.66)
|I|
e 2
e/2 /2
/2
4
(/2) e 4 .
2
(/2)e
2
2(2/ )/2
The only issue that remains is that may be close to 0, in which case (/2)
can be large. We can resolve this, as before, by doing an integration by parts. In
general, for 1 < < 1, s 6= 0:
Z
us
us v
2
u2 /2
eu /2 e(u)
|I| e
e(u) |0
du
s
s
C
Z
2
us
(u + i)eu /2iu du
=
(3.67)
s
C
Z
Z
2
i
2
1
eu /2 e(u)us+1 du +
eu /2 e(u)us du.
=
s C
s C
Now we apply (3.65) with s + 1 and s + 2 instead of s, and get that

!+2
!+1
2
||
2
1
( + 2) e 4 +
( + 1) e 4
|I| =
|s| ||
|s| ||
!

4
2
1
+ 2 e 4 .
2
||
Alternatively, we may apply (3.66) and obtain
1 e/2
|| e/2
(( + 2)/2) (+2)/2 e 4 +
(( + 1)/2) (+1)/2 e 4
|s| 2
|s| 2

e/2 /2
||
1+
e 4
2
for [0, 1], where we are using the facts that (s) for s [1/2, 1] and
(s) 1 for s [1, 2].
|I|
3.9. Conclusion. Summing (3.56) with the bounds obtained in 3.8, we obtain
our final estimate. Recall that we can reduce the case < 0 to the case > 0 by
reflection. This finishes the proof of Theorem 3.1. Let us now extract its main
corollary in effect, a simplified restatement of the theorem.
Proof of Corollary 3.2. Let E() be as in (3.2). Let
(
0.1598
if 1.5,
(3.68)
L() =
0.1065 if < 1.5.
Note that 0.1598 E(1.5), whereas 0.1065 E(1.5)/1.5. We claim that E()
L(): this is so for 1.5 because E() is increasing on , and for 1.19
because of (3.79), and for [1.19, 1.5] by the bisection method (with 20 iterations).
By Thm. 3.1, for 0 k 2 and s = + i with [0, 1] and | |
max(4 2 ||, 100),
|F (s + k)| + |F (k + 1 s)|
is at most
E
(3.69) C0,, e
| |
()2
0.4798
+C1, e

2 25
min 81 (
) , 32 | |
+C2,, e
+C e 4 | | ,
where C0,, is at most

3/2 1
4.217
if k = 0,
2 (1 + 7.831 )
2

| p

|
|
min |
,
||
| | p
2 (1 + 1.63)
, | | if k = 1,
3.507 min
3/2
||

2
!
| p
2
|
|
min |
,
||
3.251 min
, | |
if k = 2,
2 (1 + 1.632 )
3/2
||
and where
(3.70)
C1,
C,
!
k+1
1+ 2
1+
| | 2 ,
| |
e/2 | |
k+1
2
C2,,
||
1
+
| |

1
| | 5 p
min
if k = 0,
,
|
|
|| 4
1
if k = 1,

min | | , 5 p| | + 1 if k = 2,
|| 4
for k = 0,
for k = 1,
for k = 2.
(We define, as usual, = 2.)

Since | | max(2||, 100),
!
!
e/2
||
1+ 2
+
max 1 + p , e( 4 0.4798)| |
1+
| |
2
| |
!
(3.71)
| |1/2
1.0242 + 2.406 max 1 + , e( 4 0.4798)| |
2
1.0242 + 9.194e( 4 0.4798)100 1.025

and so
(3.72)
C1, e0.4798| | + C e 4 | | 1.025| |
k+1
2
e0.4798| | .
Since | | 100 and tr ect (r, c 0) is decreasing for t r/c,

(3.73)
1.025| |(k+1)/2 e0.4798| | 1.025 10k+1 e(0.47980.1598)100 e0.1598| |

1.3 1011 e0.1598| |
for k 2.
Yet again by | | max(4 2 ||, 100),
(3.74)
e 8 ( ) e0.0184( ) e0.1066( ) 0.055 e0.1066( ) ,

25
e 32 1.03 1027 e0.1598| | .
We also get (starting from (3.70))
0.055/ min(2, 12.5) 0.0088

0.055 C2,,

0.055/2 0.0088
(3.75)
k
k
2
, | |
min
0.055((2)1 + (2)2 ) 0.0102
||
if k = 0,
if k = 1,
if k = 2.
It is easy to see from (3.68) that

L
| |
()2
| |

2
max e0.1066( ) , e0.1598| | ,
whether | |/()2 is greater than 1.5 or not. We conclude that (3.69) is at most
2)
(4.217 + 0.0089)eL(| |/()

if k = 0, at most
(3.507 + 0.0089) min(| |/,
2)
| |)eL(| |/()
if k = 1, and at most
2)
(3.251 + 0.0103) min((| |/)2 , | |)eL(| |/()
if k = 2. (We see that the error terms coming from (3.73) and the second line
of (3.74) are being absorbed by the least significant digits of the bound from
(3.75).) We simplify matters further by using min((| |/)2 , | |) = (| |/||)2 for
| | < 1.5()2 , and bounding min((| |/)2 , | |) | | for | | 1.5()2 .

Let us take a second look at Thm. 3.1. While, in the present paper, we will
use it only through Corollary 3.2, it seems worthwhile to say a few words more
on the behavior of its bounds.
2
The terms in (3.1) other than C0,, eE(| |/() )| | are usually very small. In
practice, Thm. 3.1 should be applied when | |/2|| is larger than a moderate
constant (say 8) and | | is larger than a somewhat larger constant (say 100). The
assumptions of Cor. 3.2 are thus typical.
2
For comparison, the Mellin transform of et /2 (i.e., F0 = M f0 ) is 2s/21 (s/2),
which decays like e(/4)| | . For very small (e.g., | | < 2), it can make sense to
use the trivial bound
Z
2/2
dt
2
et /2 t = 2/21 (/2)
(3.76)
|F (s)| F0 () =
t
0
for (0, 1]. Alternatively, we could use integration by parts (much as in (3.67)),
followed by the trivial bound:
Z
us
F (s + 2) 2i
2
du =
F (s + 1),
eu /2 e(u)
(3.77)
F (s) =
s
s
s
0
and so
(3.78)
|F (s)|
+2
1
2
+2
2
+1
+2 2
|s|
|2|
+1
2
1 + 2||
2
|s|
for 0 1, since 2x (x) 2 for x [1/2, 3/2].

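In the proof of Corollary 3.2, we used a somewhat crude approximation to the function $E(\rho)$ defined in (3.2). It is worthwhile to give some approximations to $E(\rho)$ that, while still simple, are a little better.
Lemma 3.3. Let $E(\rho)$ and $\upsilon(\rho)$ be as in (3.2). Then
\[ E(\rho) \geq \frac{1}{8}\rho - \frac{5}{384}\rho^3 \tag{3.79} \]
for all $\rho > 0$. We can also write
\[ E(\rho) = \frac{\pi}{4} - \frac{\beta}{2} - \frac{\sin 2\beta}{4(1 + \sin\beta)}, \tag{3.80} \]
where $\beta = \arcsin 1/\upsilon(\rho)$.
Clearly, (3.79) is useful for $\rho$ small, whereas (3.80) is useful for $\rho$ large (since then $\beta$ is close to 0). Taking derivatives, we see that (3.80) implies that $E(\rho)$ is decreasing on $\beta$; thus, $E(\rho)$ is increasing on $\rho$. Note that (3.79) gives us that
\[ E\left(\frac{|\tau|}{(\pi\delta)^2}\right)\cdot|\tau| \geq \frac{1}{2}\left(\frac{\tau}{2\pi\delta}\right)^2\left(1 - \frac{5}{48}\,\frac{\tau^2}{\pi^4\delta^4}\right). \tag{3.81} \]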
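Proof. Let $\theta = \arccos 1/\upsilon(\rho)$. Then $\upsilon(\rho) = 1/(\cos\theta)$, whereas
\[ \begin{aligned}
\sqrt{1 + \rho^2} &= 2\upsilon^2(\rho) - 1 = \frac{2}{\cos^2\theta} - 1, \\
\rho &= \sqrt{\left(\frac{2}{\cos^2\theta} - 1\right)^2 - 1} = \sqrt{\frac{4}{\cos^4\theta} - \frac{4}{\cos^2\theta}} = \frac{2\sqrt{1 - \cos^2\theta}}{\cos^2\theta} = \frac{2\sin\theta}{\cos^2\theta}.
\end{aligned} \tag{3.82} \]
Thus
\[ \begin{aligned}
2E(\rho) &= \theta - \frac{2\left(\frac{1}{\cos\theta} - 1\right)}{\frac{2\sin\theta}{\cos^2\theta}} = \theta - \frac{(1 - \cos\theta)\cos\theta}{\sin\theta} = \theta - \frac{(1 - \cos^2\theta)\cos\theta}{\sin\theta\,(1 + \cos\theta)} \\
&= \theta - \frac{\sin\theta\cos\theta}{1 + \cos\theta} = \theta - \frac{\sin 2\theta}{4\cos^2\frac{\theta}{2}}.
\end{aligned} \tag{3.83} \]
By (A.37) and (3.82), this implies that
\[ 2E(\rho) \geq \frac{\rho}{4} - \frac{5\rho^3}{24\cdot 8}, \]
giving us (3.79).
To obtain (3.80), simply define $\beta = \pi/2 - \theta$; the desired identity follows from the last two steps of (3.83).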
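A numerical illustration of Lemma 3.3: the sketch below evaluates $E(\rho)$ in two ways and compares it with the lower bound (3.79). It assumes that $E(\rho) = \frac{1}{2}\left(\arccos\frac{1}{\upsilon(\rho)} - \frac{2(\upsilon(\rho)-1)}{\rho}\right)$, which is the expression read off from (3.17) and used in (3.83); the function names and the grid are illustrative.

```python
import math

def upsilon(rho):
    # upsilon(rho) = sqrt((1 + sqrt(1 + rho^2))/2)
    return math.sqrt((1 + math.sqrt(1 + rho**2)) / 2)

def E_def(rho):
    # assumed form of E(rho), as read off from (3.17)
    u = upsilon(rho)
    return 0.5 * (math.acos(1 / u) - 2 * (u - 1) / rho)

def E_beta(rho):
    # the alternative expression (3.80), with beta = arcsin(1/upsilon)
    b = math.asin(1 / upsilon(rho))
    return math.pi / 4 - b / 2 - math.sin(2 * b) / (4 * (1 + math.sin(b)))

def lower_bound(rho):
    # right side of (3.79)
    return rho / 8 - 5 * rho**3 / 384

grid = [0.1, 0.5, 1.0, 2.0, 5.0, 10.0, 100.0]
table = [(rho, E_def(rho), E_beta(rho), lower_bound(rho)) for rho in grid]
```

On this grid the two expressions for $E$ agree to rounding error, the values increase with $\rho$, and the bound (3.79) is tight only for small $\rho$.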

4. Explicit formulas
An explicit formula is an expression restating a sum such as $S_{\eta,\chi}(\delta/x, x)$ as a sum of the Mellin transform $G_\delta(s)$ over the zeros of the L function $L(s, \chi)$. More specifically, for us, $G_\delta(s)$ is the Mellin transform of $\eta(t)e(\delta t)$ for some smoothing function $\eta$ and some $\delta \in \mathbb{R}$. We want a formula whose error terms are good both for $\delta$ very close or equal to 0 and for $\delta$ farther away from 0. (Indeed, our choice(s) of $\eta$ will be made so that $F_\delta(s)$ decays rapidly in both cases.)
We will be able to base all of our work on a single, general explicit formula, namely, Lemma 4.1. This explicit formula has simple error terms given purely in terms of a few norms of the given smoothing function $\eta$. We also give a common framework for estimating the contribution of zeros on the critical strip (Lemmas 4.3 and 4.4).
The first example we work out is that of the Gaussian smoothing $\eta_\heartsuit(t) = e^{-t^2/2}$. We actually do this in part for didactic purposes and in part because of its likely applicability elsewhere; for our applications, we will always use smoothing functions based on $te^{-t^2/2}$ and $t^2e^{-t^2/2}$, generally in combination with something else. Since $\eta_\heartsuit(t) = e^{-t^2/2}$ does not vanish at $t = 0$, its Mellin transform has a pole at $s = 0$, something that requires some additional work (Lemma 4.2; see also the proof of Lemma 4.1).
Other than that, for each function $\eta(t)$, all that has to be done is to bound an integral (from Lemma 4.3) and bound a few norms. Still, both for $\eta_*$ and for $\eta_+$, we find a few interesting complications. Since $\eta_+$ is defined in terms of a truncation of a Mellin transform (or, alternatively, in terms of a multiplicative convolution with a Dirichlet kernel, as in (1.3) and (1.5)), bounding the norms of $\eta_+$ and $\eta_+'$ takes a little work. We leave this to Appendix B. The effect of the convolution is then just to delay the decay by a shift, in that a rapidly decaying function $f(\tau)$ will get replaced by $f(\tau - H)$, $H$ a constant.
The smoothing function $\eta_*$ is defined as a multiplicative convolution of $t^2e^{-t^2/2}$ with something else. Given that we have an explicit formula for $t^2e^{-t^2/2}$, we obtain an explicit formula for $\eta_*$ by what amounts to just exchanging the order of a sum and an integral; this is an idea valid in general (see (4.54)).
4.1. A general explicit formula. We will prove an explicit formula valid whenever the smoothing $\eta$ and its derivative $\eta'$ satisfy rather mild assumptions: they will be assumed to be $L_2$-integrable and to have strips of definition containing $\{s : 1/2 \leq \Re(s) \leq 3/2\}$, though any strip of the form $\{s : \epsilon \leq \Re(s) \leq 1 + \epsilon\}$ would do just as well.
(For explicit formulas with different sets of assumptions, see, e.g., [IK04, §5.5] and [MV07, Ch. 12].)
The main idea in deriving any explicit formula is to start with an expression giving a sum as an integral over a vertical line with an integrand involving a Mellin transform (here, $G_\delta(s)$) and an L-function (here, $L(s, \chi)$). We then shift the line of integration to the left. If stronger assumptions were made (as in Exercise 5 in [IK04, §5.5]), we could shift the integral all the way to $\Re(s) = -\infty$; the integral would then disappear, replaced entirely by a sum over zeros (or even, as in the same Exercise 5, by a particularly simple integral). Another possibility is to shift the line only to $\Re(s) = 1/2 + \epsilon$ for some $\epsilon > 0$, but this gives a weaker result, and at any rate the factor $L'(s, \chi)/L(s, \chi)$ can be large and messy to estimate within the critical strip $0 < \Re(s) < 1$.
Instead, we will shift the line to $\Re s = -1/2$. We can do this because the assumptions on $\eta$ and $\eta'$ are enough to continue $G_\delta(s)$ analytically up to there (with a possible pole at $s = 0$). The factor $L'(s, \chi)/L(s, \chi)$ is easy to estimate for $\Re s < 0$ and $s = 0$ (by the functional equation), and the part of the integral on $\Re s = -1/2$ coming from $G_\delta(s)$ can be estimated easily using the fact that the Mellin transform is an isometry.
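Lemma 4.1. Let $\eta : \mathbb{R}_0^+ \to \mathbb{R}$ be in $C^1$. Let $x \in \mathbb{R}^+$, $\delta \in \mathbb{R}$. Let $\chi$ be a primitive character mod $q$, $q \geq 1$.
Write $G_\delta(s)$ for the Mellin transform of $\eta(t)e(\delta t)$. Assume that $\eta(t)$ and $\eta'(t)$ are in $\ell_2$ (with respect to the measure $dt$) and that $\eta(t)t^{\sigma-1}$ and $\eta'(t)t^{\sigma-1}$ are in $\ell_1$ (again with respect to $dt$) for all $\sigma$ in an open interval containing $[1/2, 3/2]$.
Then
\[ \sum_{n=1}^{\infty} \Lambda(n)\chi(n)e\left(\frac{\delta}{x}n\right)\eta(n/x) = I_{q=1}\cdot\widehat{\eta}(-\delta)x - \sum_{\rho} G_\delta(\rho)x^{\rho} - R + O^*\left((\log q + 6.01)\cdot(|\eta'|_2 + 2\pi|\delta||\eta|_2)\right)x^{-1/2}, \tag{4.1} \]
where
\[ I_{q=1} = \begin{cases} 1 & \text{if } q = 1, \\ 0 & \text{if } q \neq 1, \end{cases} \qquad R = \eta(0)\left(\log\frac{2\pi}{q} + \gamma - \frac{L'(1, \overline{\chi})}{L(1, \overline{\chi})}\right) + O^*(c_0) \tag{4.2} \]
for $q > 1$, $R = \eta(0)\log 2\pi$ for $q = 1$ and
\[ c_0 = \frac{2}{3}\,O^*\left(\left|\frac{\eta'(t)}{\sqrt{t}}\right|_1 + \left|\eta'(t)\sqrt{t}\right|_1 + 2\pi|\delta|\left(\left|\frac{\eta(t)}{\sqrt{t}}\right|_1 + \left|\eta(t)\sqrt{t}\right|_1\right)\right). \tag{4.3} \]
The norms $|\eta|_2$, $|\eta'|_2$, $|\eta'(t)/\sqrt{t}|_1$, etc., are taken with respect to the usual measure $dt$. The sum $\sum_\rho$ is a sum over all non-trivial zeros $\rho$ of $L(s, \chi)$.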
Proof. Since (a) (t)t1 is in 1 for in an open interval containing 3/2 and (b)
(t)e(t) has bounded variation (since , 1 , implying that the derivative of
(t)e(t) is also in 1 ), the Mellin inversion formula (as in, e.g., [IK04, 4.106])
holds:
Z 3 +i
2
1
G (s)xs ns ds.
(n/x)e(n/x) =
2i 3 i
2
P
Since G (s) is bounded for (s) = 3/2 (by (t)t3/21 1 ) and n (n)n3/2
is bounded as well, we can change the order of summation and integration as
follows:
Z 3 +i
X
X
2
1
(n)(n)
(n)(n)e(n/x)(n/x) =
G (s)xs ns ds
3
2i
i
n=1
n=1
2
Z 3 +i X
2
1
(4.4)
(n)(n)G (s)xs ns ds
=
2i 3 i n=1
2
Z 3 +i
2
L (s, )
1
G (s)xs ds.
=
2i 3 i
L(s, )
2
(This is the way the procedure always starts: see, for instance, [HL23, Lemma
1] or, to look at a recent standard reference, [MV07, p. 144]. We are being very
scrupulous about integration because we are working with general .)
The first question we should ask ourselves is: up to where can we extend
G (s)? Since (t)t1 is in 1 for in an open interval I containing [1/2, 3/2],
the transform G (s) is defined for (s) in the same interval I. However, we
also know that the transformation rule M (tf (t))(s) = s M f (s) (see (2.6); by
integration by parts) is valid when s is in the holomorphy strip for both M (tf (t))
and M f . In our case (f (t) = (t)e(t)), this happens when (s) (I 1) I (so
that both sides of the equation in the rule are defined). Hence s G (s) (which
equals s M f (s)) can be analytically continued to (s) in (I 1) I, which is an
open interval containing [1/2, 3/2]. This implies immediately that G (s) can
be analytically continued to the same region, with a possible pole at s = 0.
When does G (s) have a pole at s = 0? This happens when sG (s) is non-zero
at s = 0, i.e., when M (tf (t))(0) 6= 0 for f (t) = (t)e(t). Now
Z
M (tf (t))(0) =
f (t)dt = lim f (t) f (0).
0
f (t)
We already know that

= (d/dt)((t)e(t)) is in 1 . Hence, limt f (t)
exists, and must be 0 because f is in 1 . Hence M (tf (t))(0) = f (0) = (0).
Let us look at the next term in the Laurent expansion of G (s) at s = 0. It is
Z
M (tf (t))(s) f (0)
1
sG (s) (0)
= lim
= lim
f (t)(ts 1)dt
lim
s0
s0 s 0
s0
s
s
Z
Z
ts 1
f (t) log t dt.
dt =
f (t) lim
=
s0
s
0
0
Here we were able to exchange the limit and the integral because $f'(t)t^{\sigma}$ is in $\ell_1$ for $\sigma$ in a neighborhood of 0; in turn, this is true because $f'(t) = (\eta'(t) + 2\pi i\delta\,\eta(t))e(\delta t)$ and $\eta'(t)t^{\sigma}$ and $\eta(t)t^{\sigma}$ are both in $\ell_1$ for $\sigma$ in a neighborhood of 0. In fact, we will use the easy bounds $|\eta(t)\log t|_1 \leq (2/3)(|\eta(t)t^{-1/2}|_1 + |\eta(t)t^{1/2}|_1)$, $|\eta'(t)\log t|_1 \leq (2/3)(|\eta'(t)t^{-1/2}|_1 + |\eta'(t)t^{1/2}|_1)$, resulting from the inequality
\[ \frac{2}{3}\left(t^{-\frac{1}{2}} + t^{\frac{1}{2}}\right) \geq |\log t|, \tag{4.5} \]
valid for all $t > 0$.
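Inequality (4.5) is tight: writing $t = e^{2x}$ turns it into $\cosh x \geq \frac{3}{2}|x|$, and the margin is smallest near $x = \operatorname{arcsinh}(3/2) \approx 1.195$. The sketch below samples the margin on a logarithmic grid; the function name and the grid are illustrative choices, and a sampled check is not a proof.

```python
import math

def margin(t):
    # (2/3)(t^(-1/2) + t^(1/2)) - |log t|, the slack in (4.5)
    return (2 / 3) * (t ** -0.5 + t ** 0.5) - abs(math.log(t))

# sample t = e^x for x in [-20, 20] in steps of 0.01
xs = [k / 100 for k in range(-2000, 2001)]
smallest = min(margin(math.exp(x)) for x in xs)

# the critical point in the variable t = e^{2x}, x = arcsinh(3/2)
t_crit = math.exp(2 * math.asinh(1.5))
slack_at_crit = margin(t_crit)
```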
We conclude that the Laurent expansion of G (s) at s = 0 is
(4.6)
G (s) =
(0)
+ c0 + c1 s + . . . ,
s
where
c0 = O (|f (t) log t|1 )

(t)
2 (t)

= O + (t) t + 2 + |(t) t|1
.
3
1
t
t
1
We shift the line of integration in (4.4) to (s) = 1/2. We obtain

Z 2+i
X
1
L (s, )
G (s)xs ds = Iq=1 G (1)x
G ()x R
2i 2i
L(s, )
(4.7)
Z 1/2+i
L (s, )
1
G (s)xs ds,
2i 1/2i L(s, )
where
R = Ress=0
L (s, )
G (s).
L(s, )
Of course,
G (1) = M ((t)e(t))(1) =
(t)e(t)dt = b().
Let us work out the Laurent expansion of L (s, )/L(s, ) at s = 0. By the

functional equation (as in, e.g., [IK04, Thm. 4.15]),

s+
1s+
1
1
L (1 s, )
L (s, )
= log
,

(4.8)
L(s, )
q
2
2
2
2
L(1 s, )
where (s) = (s)/(s) and
(
0
=
1
if (1) = 1
if (1) = 1.
By (1 x) (x) = cot x (immediate from (s)(1 s) = / sin s) and

(s) + (s + 1/2) = 2((2s) log 2) (Legendre; [AS64, (6.3.8)]),

s+
1s+
(s + )
1
.
+
= (1 s) + log 2 + cot
(4.9)
2
2
2
2
2
Hence, unless q = 1, the Laurent expansion of L (s, )/L(s, ) at s = 0 is

1
2
L (1, )
a1 a2
+ log
(1)
+ 2 + ... .
+
s
q
L(1, )
s
s
Here (1) = , the Euler gamma constant [AS64, (6.3.2)].

There is a special case for q = 1 due to the pole of (s) at s = 1. We know
that (0)/(0) = log 2 (see, e.g., [MV07, p. 331]).
From this and (4.6), we conclude that, if (0) = 0, then

(
c0 if q > 1 and (1) = 1,
R=
0 otherwise,
where c0 = O (| (t) log t|1 + 2|||(t) log t|1 ). If (0) 6= 0, then

(
2
L (1, )
c0 if (1) = 1
R = (0) log
+
+
q
L(1, )
0 otherwise.
for q > 1, and
R = (0) log 2
for q = 1.
It is time to estimate the integral on the right side of (4.7). For that, we will
need to estimate L (s, )/L(s, ) for (s) = 1/2 using (4.8) and (4.9).
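If $\Re(z) = 3/2$, then $|t^2 + z^2| \geq 9/4$ for all real $t$. Hence, by [OLBC10, (5.9.15)] and [GR00, (3.411.1)],
\[ \begin{aligned}
\psi(z) &= \log z - \frac{1}{2z} - 2\int_0^\infty \frac{t\,dt}{(t^2 + z^2)(e^{2\pi t} - 1)} \\
&= \log z - \frac{1}{2z} + 2\cdot O^*\left(\int_0^\infty \frac{t\,dt}{\frac{9}{4}(e^{2\pi t} - 1)}\right) \\
&= \log z - \frac{1}{2z} + \frac{8}{9}\,O^*\left(\int_0^\infty \frac{t\,dt}{e^{2\pi t} - 1}\right) \\
&= \log z - \frac{1}{2z} + \frac{8}{9}\cdot O^*\left(\frac{1}{(2\pi)^2}\Gamma(2)\zeta(2)\right) \\
&= \log z - \frac{1}{2z} + O^*\left(\frac{1}{27}\right) = \log z + O^*\left(\frac{10}{27}\right).
\end{aligned} \tag{4.10} \]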
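The constants in (4.10) can be checked directly: $\int_0^\infty t\,dt/(e^{2\pi t} - 1) = \Gamma(2)\zeta(2)/(2\pi)^2 = 1/24$, so the integral term contributes at most $\frac{8}{9}\cdot\frac{1}{24} = \frac{1}{27}$, and $\left|\frac{1}{2z}\right| \leq \frac{1}{3}$ when $\Re(z) = 3/2$. The sketch below confirms the value of the integral with a composite Simpson rule (a non-rigorous floating-point check, with the truncation point 10 and the step count chosen arbitrarily) and the arithmetic with exact fractions.

```python
import math
from fractions import Fraction

def integrand(t):
    # t / (e^{2 pi t} - 1), with the limiting value 1/(2 pi) at t = 0
    if t == 0.0:
        return 1 / (2 * math.pi)
    return t / math.expm1(2 * math.pi * t)

def simpson(f, a, b, n):
    # composite Simpson rule with n (even) subintervals
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

# the tail beyond t = 10 is of size about e^{-62} and is ignored here
integral = simpson(integrand, 0.0, 10.0, 20000)

# 1/(2|z|) <= 1/3 for Re(z) = 3/2, plus (8/9)*(1/24) from the integral
total_error = Fraction(1, 3) + Fraction(8, 9) * Fraction(1, 24)
```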
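Thus, in particular, $\psi(1 - s) = \log(3/2 - i\tau) + O^*(10/27)$, where we write $s = -1/2 + i\tau$. Now
\[ \left|\cot\frac{\pi(s + \kappa)}{2}\right| = \left|\frac{e^{\mp\frac{\pi}{4}i - \frac{\pi}{2}\tau} + e^{\pm\frac{\pi}{4}i + \frac{\pi}{2}\tau}}{e^{\mp\frac{\pi}{4}i - \frac{\pi}{2}\tau} - e^{\pm\frac{\pi}{4}i + \frac{\pi}{2}\tau}}\right| = 1. \]
Since $\Re(s) = -1/2$, a comparison of Dirichlet series gives
\[ \left|\frac{L'(1 - s, \overline{\chi})}{L(1 - s, \overline{\chi})}\right| \leq \frac{|\zeta'(3/2)|}{|\zeta(3/2)|} \leq 1.50524, \tag{4.11} \]
where $\zeta(3/2)$ and $\zeta'(3/2)$ can be evaluated by Euler-Maclaurin. Therefore, (4.8) and (4.9) give us that, for $s = -1/2 + i\tau$,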

L (s, )

log q + log 3 + i + 10 + log 2 + + 1.50524
L(s, )
27

2
2

(4.12)

9
q 1

+ 4.1396.
log + log 2 +
2
4
Recall that we must bound the integral on the right side of (4.7). The absolute
value of the integral is at most x1/2 times

Z 1 +i
L (s, )

2
1

ds.
G
(s)
(4.13)

2 1 i L(s, )
2
By Cauchy-Schwarz, this is at most

v
v
u Z 1 +i
u Z 1 +i
2
u 1
u 1
L (s, ) 1
2
2
t
t

|G (s)s|2 |ds|
|ds|

1
1
2 i L(s, ) s
2 i
2
By (4.12),
v
v
uZ 1 +i
uZ 1 +i

2
u
u
L (s, ) 1
log q 2
2
2
t
t

L(s, ) s |ds|
s |ds|
1
1
2 i
2 i
v
uZ

u 1 log 2 + 9 + 4.1396 + log 2
t
2
4
+
d
1
2
4 +
2 log q + 226.844,
where we compute the last integral numerically.4

Again, we use the fact that, by (2.6), sG (s) is the Mellin transform of
d(e(t)(t))
= 2ite(t)(t) te(t) (t)
dt
Hence, by Plancherel (as in (2.2)),
(4.15)
v
sZ
u Z 1 +i
u 1
2
2
t
|2ite(t)(t) te(t) (t)|2 t2 dt
|G (s)s| |ds| =
2 1 i
0
2
sZ
sZ
(4.14)
= 2||
Thus, (4.13) is at most

log q +
226.844
2
|(t)|2 dt +

| |2 + 2||||2 .
| (t)|2 dt.
Lemma 4.1 leaves us with three tasks: bounding the sum of $G_\delta(\rho)x^{\rho}$ over all non-trivial zeroes $\rho$ with small imaginary part, bounding the sum of $G_\delta(\rho)x^{\rho}$ over all non-trivial zeroes $\rho$ with large imaginary part, and bounding $L'(1, \chi)/L(1, \chi)$.
Let us start with the last task: while, in a narrow sense, it is optional (in that, in most of our applications, we will have $\eta(0) = 0$, thus making the term $L'(1, \chi)/L(1, \chi)$ disappear), it is also very easy and can be dealt with quickly.
Since we will be using a finite GRH check in all later applications, we might as well use it here.
Lemma 4.2. Let $\chi$ be a primitive character mod $q$, $q > 1$. Assume that all non-trivial zeroes $\rho = \sigma + it$ of $L(s,\chi)$ with $|t| \le 5/8$ satisfy $\Re(\rho) = 1/2$. Then
\[
\left|\frac{L'(1,\chi)}{L(1,\chi)}\right| \le \frac{5}{2}\log M(q) + c,
\]
where $M(q) = \max_n\left|\sum_{m\le n}\chi(m)\right|$ and
\[
c = 5\log\frac{2\sqrt{3}}{\zeta(9/4)/\zeta(9/8)} = 15.07016\ldots.
\]

4 By a rigorous integration from $\tau = -100000$ to $\tau = 100000$ using VNODE-LP [Ned06], which runs on the PROFIL/BIAS interval arithmetic package [Knü99].
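The constant $c$ of Lemma 4.2 is easy to reproduce in floating point. The sketch below is an illustration only: it assumes $c = 5\log\bigl(2\sqrt{3}\,\zeta(9/8)/\zeta(9/4)\bigr)$ and evaluates $\zeta$ by a short Euler-Maclaurin expansion; the helper `zeta_em` is ours and is not a routine from the paper.

```python
import math

def zeta_em(s, n=1000):
    """Riemann zeta for real s > 1: direct sum up to n-1 plus an
    Euler-Maclaurin tail starting at n."""
    head = sum(k ** -s for k in range(1, n))
    tail = n ** (1 - s) / (s - 1) + 0.5 * n ** -s
    tail += s * n ** (-s - 1) / 12
    tail -= s * (s + 1) * (s + 2) * n ** (-s - 3) / 720
    return head + tail

# c = 5 log( 2 sqrt(3) / (zeta(9/4)/zeta(9/8)) )
c = 5 * math.log(2 * math.sqrt(3) * zeta_em(9 / 8) / zeta_em(9 / 4))
print(c)
```

Here $8H = 5$ comes from the choice $H = 5/8$, and $s_0 = 1/2 + H = 9/8$ explains the two zeta values.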
Proof. By a lemma of Landau's (see, e.g., [MV07, Lemma 6.3], where the constants are easily made explicit) based on the Borel-Carathéodory Lemma (as in [MV07, Lemma 6.2]), any function $f$ analytic and zero-free on a disc $C_{s_0,R} = \{s : |s - s_0| \le R\}$ of radius $R > 0$ around $s_0$ satisfies
\[
\frac{f'(s)}{f(s)} = O^*\left(\frac{2R\log M/|f(s_0)|}{(R-r)^2}\right)
\tag{4.16}
\]
for all $s$ with $|s - s_0| \le r$, where $0 < r < R$ and $M$ is the maximum of $|f(z)|$ on $C_{s_0,R}$. Assuming $L(s,\chi)$ has no non-trivial zeros off the critical line with $|\Im(s)| \le H$, where $H > 1/2$, we set $s_0 = 1/2 + H$, $r = H - 1/2$, and let $R \to H^-$. We obtain
\[
\frac{L'(1,\chi)}{L(1,\chi)} = O^*\left(8H\log\frac{\max_{s\in C_{s_0,H}}|L(s,\chi)|}{|L(s_0,\chi)|}\right).
\tag{4.17}
\]
Now
\[
|L(s_0,\chi)| \ge \prod_p (1 + p^{-s_0})^{-1} = \prod_p\frac{(1 - p^{-2s_0})^{-1}}{(1 - p^{-s_0})^{-1}} = \frac{\zeta(2s_0)}{\zeta(s_0)}.
\]
Since $s_0 = 1/2 + H$, $C_{s_0,H}$ is contained in $\{s \in \mathbb{C} : \Re(s) > 1/2\}$ for any value of $H$. We choose (somewhat arbitrarily) $H = 5/8$.
By partial summation, for $s = \sigma + it$ with $1/2 \le \sigma < 1$ and any $N \in \mathbb{Z}^+$,
\[
\begin{aligned}
L(s,\chi) &= \sum_{n\le N}\chi(n)n^{-s} - \left(\sum_{m\le N}\chi(m)\right)(N+1)^{-s} + \sum_{n\ge N+1}\left(\sum_{m\le n}\chi(m)\right)\left(n^{-s} - (n+1)^{-s}\right)\\
&= O^*\left(\frac{N^{1-1/2}}{1-1/2} + N^{1-\sigma} + M(q)N^{-\sigma}\right),
\end{aligned}
\tag{4.18}
\]
where $M(q) = \max_n\left|\sum_{m\le n}\chi(m)\right|$. We set $N = M(q)/3$, and obtain
\[
|L(s,\chi)| \le 2M(q)N^{-1/2} = 2\sqrt{3}\sqrt{M(q)}.
\tag{4.19}
\]
We put this into (4.17) and are done.
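Let $M(q)$ be as in the statement of Lem. 4.2. Since the sum of $\chi(n)$ ($\chi$ mod $q$, $q > 1$) over any interval of length $q$ is 0, it is easy to see that $M(q) \le q/2$. We also have the following explicit version of the Pólya-Vinogradov inequality:
\[
M(q) \le
\begin{cases}
\frac{2}{\pi^2}\sqrt{q}\log q + \frac{4}{\pi^2}\sqrt{q}\log\log q + \frac{3}{2}\sqrt{q} & \text{if } \chi(-1) = 1,\\
\frac{1}{2\pi}\sqrt{q}\log q + \frac{1}{\pi}\sqrt{q}\log\log q + \sqrt{q} & \text{if } \chi(-1) = -1.
\end{cases}
\tag{4.20}
\]
Taken together with $M(q) \le q/2$, this implies that
\[
M(q) \le q^{4/5}
\tag{4.21}
\]
for all $q \ge 1$, and also that
\[
M(q) \le 2q^{3/5}
\tag{4.22}
\]
for all $q \ge 1$.
Notice, lastly, that
\[
\left|\log\frac{2\pi}{q} + \gamma\right| \le \log q + \log\frac{e^\gamma\cdot 2\pi}{3^2}
\]
for all $q \ge 3$. (There are no primitive characters modulo 2, so we can omit $q = 2$.)
We conclude that, for $\chi$ primitive and non-trivial,
\[
\begin{aligned}
\left|\log\frac{2\pi}{q} + \gamma - \frac{L'(1,\chi)}{L(1,\chi)}\right|
&\le \log\frac{e^\gamma\cdot 2\pi}{3^2} + \log q + \frac{5}{2}\log q^{\frac{4}{5}} + 15.07017\\
&\le 3\log q + 15.289.
\end{aligned}
\]
Obviously, 15.289 is more than $\log 2\pi$, the bound for $\chi$ trivial. Hence, the absolute value of the quantity $R$ in the statement of Lemma 4.1 is at most
\[
|\eta(0)|(3\log q + 15.289) + |c_0|
\tag{4.23}
\]
for all primitive $\chi$.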
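The passage from (4.20) to (4.21) and (4.22) and the constant 15.289 can be spot-checked numerically. The sketch below is a finite sanity check, not a proof. It assumes the even-character case of (4.20) reads $\frac{2}{\pi^2}\sqrt{q}\log q + \frac{4}{\pi^2}\sqrt{q}\log\log q + \frac{3}{2}\sqrt{q}$ (the larger of the two cases for $q \ge 3$), and it tests only $3 \le q < 10^5$; the function names are ours.

```python
import math

EULER_GAMMA = 0.5772156649015329

def pv_bound(q):
    """Even-character case of the explicit Polya-Vinogradov bound (4.20)."""
    rq = math.sqrt(q)
    return (2 / math.pi ** 2) * rq * math.log(q) \
        + (4 / math.pi ** 2) * rq * math.log(math.log(q)) + 1.5 * rq

def m_bound(q):
    """Best available bound on M(q): the trivial q/2 or (4.20)."""
    return min(q / 2, pv_bound(q))

# Worst ratios against q^{4/5} and 2 q^{3/5} on the tested range
worst_45 = max(m_bound(q) / q ** 0.8 for q in range(3, 10 ** 5))
worst_35 = max(m_bound(q) / (2 * q ** 0.6) for q in range(3, 10 ** 5))

# log(e^gamma * 2 pi / 9) + 15.07017, to be compared with 15.289
const_423 = math.log(math.exp(EULER_GAMMA) * 2 * math.pi / 9) + 15.07017
print(worst_45, worst_35, const_423)
```

Both ratios stay below 1 on the tested range; the coefficient 3 of $\log q$ comes from $1 + \frac{5}{2}\cdot\frac{4}{5}$.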

It now remains to bound the sum $\sum_\rho G_\delta(\rho)x^\rho$ in (4.1). Clearly
\[
\left|\sum_\rho G_\delta(\rho)x^\rho\right| \le \sum_\rho|G_\delta(\rho)|\cdot x^{\Re(\rho)}.
\]
Recall that these are sums over the non-trivial zeros $\rho$ of $L(s,\chi)$.
We first prove a general lemma on sums of values of functions on the non-trivial zeros of $L(s,\chi)$. This is little more than partial summation, given a (classical) bound for the number of zeroes $N(T,\chi)$ of $L(s,\chi)$ with $|\Im(s)| \le T$. The error term becomes particularly simple if $f$ is real-valued and decreasing; the statement is then practically identical to that of [Leh66, Lemma 1] (for $\chi$ principal), except for the fact that the error term is improved here.
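Lemma 4.3. Let $f : \mathbb{R}^+ \to \mathbb{C}$ be piecewise $C^1$. Assume $\lim_{t\to\infty} f(t)\,t\log t = 0$. Let $\chi$ be a primitive character mod $q$, $q \ge 1$; let $\rho$ denote the non-trivial zeros of $L(s,\chi)$. Then, for any $y \ge 1$,
\[
\begin{aligned}
\sum_{\substack{\rho \text{ non-trivial}\\ \Im(\rho)>y}} f(\Im(\rho)) &= \frac{1}{2\pi}\int_y^\infty f(T)\log\frac{qT}{2\pi}\,dT\\
&\quad + \frac{1}{2}\,O^*\left(|f(y)|g_\chi(y) + \int_y^\infty|f'(T)|\cdot g_\chi(T)\,dT\right),
\end{aligned}
\tag{4.24}
\]
where
\[
g_\chi(T) = 0.5\log qT + 17.7.
\tag{4.25}
\]
If $f$ is real-valued and decreasing on $[y,\infty)$, the second line of (4.24) equals
\[
O^*\left(\frac{1}{4}\int_y^\infty\frac{f(T)}{T}\,dT\right).
\]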
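Proof. Write $N(T,\chi)$ for the number of non-trivial zeros of $L(s,\chi)$ with $|\Im(s)| \le T$. Write $N^+(T,\chi)$ for the number of (necessarily non-trivial) zeros of $L(s,\chi)$ with $0 < \Im(s) \le T$. Then, for any $f : \mathbb{R}^+ \to \mathbb{C}$ with $f$ piecewise differentiable and $\lim_{t\to\infty} f(t)N(T,\chi) = 0$,
\[
\begin{aligned}
\sum_{\rho:\Im(\rho)>y} f(\Im(\rho)) &= \int_y^\infty f(T)\,dN^+(T,\chi)\\
&= -\int_y^\infty f'(T)\left(N^+(T,\chi) - N^+(y,\chi)\right)dT\\
&= -\frac{1}{2}\int_y^\infty f'(T)\left(N(T,\chi) - N(y,\chi)\right)dT.
\end{aligned}
\]
Now, by [Ros41, Thms. 17–19] and [McC84, Thm. 2.1] (see also [Tru, Thm. 1]),
\[
N(T,\chi) = \frac{T}{\pi}\log\frac{qT}{2\pi e} + O^*(g_\chi(T))
\tag{4.26}
\]
for $T \ge 1$, where $g_\chi(T)$ is as in (4.25). (This is a classical formula; the references serve to prove the explicit form (4.25) for the error term $g_\chi(T)$.)
Thus, for $y \ge 1$,
\[
\begin{aligned}
\sum_{\rho:\Im(\rho)>y} f(\Im(\rho)) &= -\frac{1}{2}\int_y^\infty f'(T)\left(\frac{T}{\pi}\log\frac{qT}{2\pi e} - \frac{y}{\pi}\log\frac{qy}{2\pi e}\right)dT\\
&\quad + \frac{1}{2}\,O^*\left(|f(y)|g_\chi(y) + \int_y^\infty|f'(T)|\cdot g_\chi(T)\,dT\right).
\end{aligned}
\tag{4.27}
\]
Here
\[
-\frac{1}{2}\int_y^\infty f'(T)\left(\frac{T}{\pi}\log\frac{qT}{2\pi e} - \frac{y}{\pi}\log\frac{qy}{2\pi e}\right)dT = \frac{1}{2\pi}\int_y^\infty f(T)\log\frac{qT}{2\pi}\,dT.
\tag{4.28}
\]
If $f$ is real-valued and decreasing (and so, by $\lim_{t\to\infty} f(t) = 0$, non-negative),
\[
\begin{aligned}
|f(y)|g_\chi(y) + \int_y^\infty|f'(T)|\cdot g_\chi(T)\,dT &= f(y)g_\chi(y) - \int_y^\infty f'(T)g_\chi(T)\,dT\\
&= 0.5\int_y^\infty\frac{f(T)}{T}\,dT,
\end{aligned}
\]
since $g_\chi'(T) \le 0.5/T$ for all $T \ge T_0$.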

Now we can bound the sum $\sum_\rho G_\delta(\rho)x^\rho$. The bound we will give is proportional to $\sqrt{T_0}\log qT_0$, whereas a more obvious approach would give a bound proportional to $T_0\log qT_0$. This (large) improvement is due to our usage of isometry (after an application of Cauchy-Schwarz) to bound integrals throughout. It is also this usage that allows us to give a general bound depending only on a few norms of $\eta$ and its variants.
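Lemma 4.4. Let $\eta : \mathbb{R}_0^+ \to \mathbb{R}$ be such that both $\eta(t)$ and $(\log t)\eta(t)$ lie in $L^1 \cap L^2$ and $\eta(t)/\sqrt{t}$ lies in $L^1$ (with respect to $dt$). Let $\delta \in \mathbb{R}$. Let $G_\delta(s)$ be the Mellin transform of $\eta(t)e(\delta t)$.
Let $\chi$ be a primitive character mod $q$, $q \ge 1$. Let $T_0 \ge 1$. Assume that all non-trivial zeros $\rho$ of $L(s,\chi)$ with $|\Im(\rho)| \le T_0$ lie on the critical line. Then
\[
\sum_{\substack{\rho \text{ non-trivial}\\ |\Im(\rho)|\le T_0}}|G_\delta(\rho)|
\]
is at most
\[
(|\eta|_2 + |\eta\cdot\log|_2)\sqrt{T_0}\log qT_0 + \left(17.21\,|\eta\cdot\log|_2 - (\log 2\pi\sqrt{e})\,|\eta|_2\right)\sqrt{T_0} + \left|\eta(t)/\sqrt{t}\right|_1\cdot(1.32\log q + 34.5).
\tag{4.29}
\]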
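Proof. For $s = 1/2 + i\tau$, we have the trivial bound
\[
|G_\delta(s)| \le \int_0^\infty|\eta(t)|\,t^{1/2}\,\frac{dt}{t} = \left|\eta(t)/\sqrt{t}\right|_1,
\tag{4.30}
\]
where $F_\delta$ is as in (4.44). We also have the trivial bound
\[
|G_\delta'(s)| = \left|\int_0^\infty(\log t)\eta(t)t^s\,\frac{dt}{t}\right| \le \int_0^\infty|(\log t)\eta(t)|\,t^\sigma\,\frac{dt}{t} = \left|(\log t)\eta(t)t^{\sigma-1}\right|_1
\tag{4.31}
\]
for $s = \sigma + i\tau$.
Let us start by bounding the contribution of very low-lying zeros ($|\Im(\rho)| \le 1$). By (4.26) and (4.25),
\[
N(1,\chi) = \frac{1}{\pi}\log\frac{q}{2\pi e} + O^*(0.5\log q + 17.7) = O^*(0.819\log q + 16.8).
\]
Therefore,
\[
\sum_{\substack{\rho \text{ non-trivial}\\ |\Im(\rho)|\le 1}}|G_\delta(\rho)| \le \left|\eta(t)t^{-1/2}\right|_1\cdot(0.819\log q + 16.8).
\]
Let us now consider zeros $\rho$ with $|\Im(\rho)| > 1$. Apply Lemma 4.3 with $y = 1$ and
\[
f(t) = \begin{cases}|G_\delta(1/2 + it)| & \text{if } t \le T_0,\\ 0 & \text{if } t > T_0.\end{cases}
\]
This gives us that
\[
\begin{aligned}
\sum_{\rho: 1<|\Im(\rho)|\le T_0} f(\Im(\rho)) &= \frac{1}{\pi}\int_1^{T_0} f(T)\log\frac{qT}{2\pi}\,dT\\
&\quad + O^*\left(|f(1)|g_\chi(1) + \int_1^\infty|f'(T)|\cdot g_\chi(T)\,dT\right),
\end{aligned}
\tag{4.32}
\]
where we are using the fact that $f(\sigma + i\tau) = f(\sigma - i\tau)$ (because $\eta$ is real-valued).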
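By Cauchy-Schwarz,
\[
\frac{1}{\pi}\int_1^{T_0} f(T)\log\frac{qT}{2\pi}\,dT \le \sqrt{\frac{1}{\pi}\int_1^{T_0}|f(T)|^2\,dT}\cdot\sqrt{\frac{1}{\pi}\int_1^{T_0}\left(\log\frac{qT}{2\pi}\right)^2dT}.
\]
Now
\[
\frac{1}{\pi}\int_1^{T_0}|f(T)|^2\,dT \le \frac{1}{2\pi}\int_{-\infty}^{\infty}\left|G_\delta\left(\frac{1}{2} + iT\right)\right|^2dT \le \int_0^\infty|e(\delta t)\eta(t)|^2\,dt = |\eta|_2^2
\]
by Plancherel (as in (2.2)). We also have
\[
\int_1^{T_0}\left(\log\frac{qT}{2\pi}\right)^2dT \le \frac{2\pi}{q}\int_0^{\frac{qT_0}{2\pi}}(\log t)^2\,dt \le \left(\left(\log\frac{qT_0}{2\pi e}\right)^2 + 1\right)\cdot T_0.
\]
Hence
\[
\frac{1}{\pi}\int_1^{T_0} f(T)\log\frac{qT}{2\pi}\,dT \le \sqrt{\left(\log\frac{qT_0}{2\pi e}\right)^2 + 1}\cdot|\eta|_2\sqrt{T_0}.
\]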
Again by Cauchy-Schwarz,
Z
|f (T )| g (T ) dT
1
2
|f (T )|2 dT
T0
1
|g (T )|2 dT .
Since $|f'(T)| = |G_\delta'(1/2 + iT)|$ and $(M\eta)'(s)$ is the Mellin transform of $\log(t)\cdot e(\delta t)\eta(t)$ (by (2.6)),
\[
\frac{1}{2\pi}\int_{-\infty}^{\infty}|f'(T)|^2\,dT = |\eta(t)\log(t)|_2^2.
\]
Much as before,
\[
\int_1^{T_0}|g_\chi(T)|^2\,dT \le \int_0^{T_0}(0.5\log qT + 17.7)^2\,dT = \left(0.25(\log qT_0)^2 + 17.2(\log qT_0) + 296.09\right)T_0.
\]
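Summing, we obtain
\[
\begin{aligned}
&\frac{1}{\pi}\int_1^{T_0} f(T)\log\frac{qT}{2\pi}\,dT + \int_1^\infty|f'(T)|\cdot g_\chi(T)\,dT\\
&\qquad\le \left(\left(\log\frac{qT_0}{2\pi e} + \frac{1}{2}\right)|\eta|_2 + \left(\frac{\log qT_0}{2} + 17.21\right)|\eta(t)(\log t)|_2\right)\sqrt{T_0}.
\end{aligned}
\]
Finally, by (4.30) and (4.25),
\[
|f(1)|g_\chi(1) \le \left|\eta(t)/\sqrt{t}\right|_1\cdot(0.5\log q + 17.7).
\]
By (4.32) and the assumption that all non-trivial zeros with $|\Im(\rho)| \le T_0$ lie on the line $\Re(s) = 1/2$, we conclude that
\[
\begin{aligned}
\sum_{\substack{\rho \text{ non-trivial}\\ 1<|\Im(\rho)|\le T_0}}|G_\delta(\rho)| &\le (|\eta|_2 + |\eta\cdot\log|_2)\sqrt{T_0}\log qT_0\\
&\quad + \left(17.21\,|\eta\cdot\log|_2 - (\log 2\pi\sqrt{e})\,|\eta|_2\right)\sqrt{T_0}\\
&\quad + \left|\eta(t)/\sqrt{t}\right|_1\cdot(0.5\log q + 17.7).
\end{aligned}
\]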
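Several constants in the proof of Lemma 4.4 come from elementary manipulations that are quick to re-derive. The sketch below is a sanity check with our own helper names; it assumes $g_\chi(T) = 0.5\log qT + 17.7$ as in (4.25) and $N(1,\chi) = \frac{1}{\pi}\log\frac{q}{2\pi e} + O^*(g_\chi(1))$ as in (4.26).

```python
import math

def g_sq_integral(q, t0, n=100_000):
    """Midpoint rule for int_0^{T0} (0.5 log(qT) + 17.7)^2 dT."""
    h = t0 / n
    return h * sum((0.5 * math.log(q * (k + 0.5) * h) + 17.7) ** 2
                   for k in range(n))

def g_sq_closed(q, t0):
    """Closed form quoted in the proof of Lemma 4.4."""
    big_l = math.log(q * t0)
    return (0.25 * big_l ** 2 + 17.2 * big_l + 296.09) * t0

# Low-lying zeros: coefficient of log q and constant term in N(1, chi)
low_log_coeff = 1 / math.pi + 0.5
low_const = 17.7 - math.log(2 * math.pi * math.e) / math.pi

# Last term of (4.29): zeros with |Im| <= 1 plus the |f(1)| g(1) term
final_log_coeff = 0.819 + 0.5
final_const = 16.8 + 17.7

numeric = g_sq_integral(3, 250.0)
exact = g_sq_closed(3, 250.0)
print(numeric, exact, low_log_coeff, low_const, final_log_coeff, final_const)
```

The square root of the closed form is at most $(0.5\log qT_0 + 17.21)\sqrt{T_0}$, since $17.21^2 > 296.09$; this is where the constant 17.21 in (4.29) comes from.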
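4.2. Sums and decay for the Gaussian: $\eta_\heartsuit(t) = e^{-t^2/2}$. It is now time to derive our bounds for the Gaussian smoothing. Thanks to the general work we have done so far, there is really only one thing left to do, namely, an estimate for the sum $\sum_\rho|F_\delta(\rho)|$ over non-trivial zeros with $|\Im(\rho)| > T_0$.

Lemma 4.5. Let $\eta_\heartsuit(t) = e^{-t^2/2}$. Let $x \in \mathbb{R}^+$, $\delta \in \mathbb{R}$. Let $\chi$ be a primitive character mod $q$, $q \ge 1$. Assume that all non-trivial zeros $\rho$ of $L(s,\chi)$ with $|\Im(\rho)| \le T_0$ satisfy $\Re(s) = 1/2$. Assume that $T_0 \ge \max(4\pi^2|\delta|, 100)$.
Write $F_\delta(s)$ for the Mellin transform of $\eta_\heartsuit(t)e(\delta t)$. Then
\[
\sum_{\substack{\rho \text{ non-trivial}\\ |\Im(\rho)|>T_0}}|F_\delta(\rho)| \le \log\frac{qT_0}{2\pi}\cdot\left(4.329\,e^{-0.1598T_0} + 0.802\,|\delta|\,e^{-0.1065\left(\frac{T_0}{\pi|\delta|}\right)^2}\right).
\]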
Here we have preferred to give a bound with a simple form. It is probably feasible to derive from Theorem 3.1 a bound essentially proportional to $e^{-E(\rho)T_0}$, where $\rho = T_0/(\pi\delta)^2$ and $E(\rho)$ is as in (3.2). (As we discussed in §3.9, $E(\rho)$ behaves as $e^{-(\pi/4)T_0}$ for $\rho$ large and as $e^{-0.125(T_0/(\pi\delta))^2}$ for $\rho$ small.)
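Proof. First of all,
\[
\sum_{\substack{\rho \text{ non-trivial}\\ |\Im(\rho)|>T_0}}|F_\delta(\rho)| = \sum_{\substack{\rho \text{ non-trivial}\\ \Im(\rho)>T_0}}\left(|F_\delta(\rho)| + |F_\delta(1-\rho)|\right),
\]
by the functional equation (which implies that non-trivial zeros come in pairs $\rho$, $1-\rho$). Hence, by Cor. 3.2,
\[
\sum_{\substack{\rho \text{ non-trivial}\\ |\Im(\rho)|>T_0}}|F_\delta(\rho)| \le \sum_{\substack{\rho \text{ non-trivial}\\ \Im(\rho)>T_0}} f(\Im(\rho)),
\tag{4.33}
\]
where
\[
f(\tau) = 4.226\cdot\begin{cases} e^{-0.1065\left(\frac{\tau}{\pi\delta}\right)^2} & \text{if } |\tau| < \frac{3}{2}(\pi\delta)^2,\\ e^{-0.1598|\tau|} & \text{if } |\tau| \ge \frac{3}{2}(\pi\delta)^2.\end{cases}
\tag{4.34}
\]
It is easy to check that $f(\tau)$ is a decreasing function of $\tau$ for $\tau \ge T_0$.

We now apply Lemma 4.3. We obtain that
\[
\sum_{\substack{\rho \text{ non-trivial}\\ \Im(\rho)>T_0}} f(\Im(\rho)) \le \int_{T_0}^\infty f(T)\left(\frac{1}{2\pi}\log\frac{qT}{2\pi} + \frac{1}{4T}\right)dT.
\tag{4.35}
\]
We just need to estimate some integrals. For any $y \ge 1$, $c, c_1 > 0$,
\[
\begin{aligned}
\int_y^\infty\left(\log t + \frac{c_1}{t}\right)e^{-ct}\,dt &\le \int_y^\infty\left(\log t - \frac{1}{ct}\right)e^{-ct}\,dt + \left(\frac{1}{c} + c_1\right)\int_y^\infty\frac{e^{-ct}}{t}\,dt\\
&= \frac{(\log y)e^{-cy}}{c} + \left(\frac{1}{c} + c_1\right)E_1(cy),
\end{aligned}
\]
where $E_1(x) = \int_x^\infty e^{-t}\,dt/t$. Clearly, $E_1(x) \le \int_x^\infty e^{-t}\,dt/x = e^{-x}/x$. Hence
\[
\int_y^\infty\left(\log t + \frac{c_1}{t}\right)e^{-ct}\,dt \le \left(\log y + \left(\frac{1}{c} + c_1\right)\frac{1}{y}\right)\frac{e^{-cy}}{c}.
\]
We conclude that
\[
\begin{aligned}
\int_{T_0}^\infty e^{-0.1598t}\left(\frac{1}{2\pi}\log\frac{qt}{2\pi} + \frac{1}{4t}\right)dt
&= \frac{1}{2\pi}\int_{T_0}^\infty\left(\log t + \frac{\pi/2}{t}\right)e^{-ct}\,dt + \frac{\log\frac{q}{2\pi}}{2\pi}\int_{T_0}^\infty e^{-ct}\,dt\\
&\le \frac{1}{2\pi c}\left(\log T_0 + \log\frac{q}{2\pi} + \frac{\frac{1}{c} + \frac{\pi}{2}}{T_0}\right)e^{-cT_0}
\end{aligned}
\]
with $c = 0.1598$. Since $T_0 \ge 100$, this is at most
\[
1.0242\,\log\frac{qT_0}{2\pi}\,e^{-cT_0}.
\tag{4.36}
\]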
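Now let us deal with the Gaussian term. (It appears only if $T_0 < (3/2)(\pi\delta)^2$, as otherwise $|\tau| \ge (3/2)(\pi\delta)^2$ holds whenever $|\tau| \ge T_0$.) For any $y \ge e$, $c \ge 0$,
\[
\int_y^\infty e^{-ct^2}\,dt = \frac{1}{\sqrt{c}}\int_{\sqrt{c}y}^\infty e^{-t^2}\,dt \le \frac{1}{cy}\int_{\sqrt{c}y}^\infty te^{-t^2}\,dt \le \frac{e^{-cy^2}}{2cy},
\tag{4.37}
\]
\[
\int_y^\infty\frac{e^{-ct^2}}{t}\,dt = \int_{cy^2}^\infty\frac{e^{-t}}{2t}\,dt = \frac{E_1(cy^2)}{2} \le \frac{e^{-cy^2}}{2cy^2},
\tag{4.38}
\]
\[
\int_y^\infty(\log t)e^{-ct^2}\,dt \le \int_y^\infty\left(\log t + \frac{\log t - 1}{2ct^2}\right)e^{-ct^2}\,dt = \frac{\log y}{2cy}\,e^{-cy^2}.
\tag{4.39}
\]
Hence
\[
\begin{aligned}
&\int_{T_0}^\infty e^{-0.1065\left(\frac{T}{\pi\delta}\right)^2}\left(\frac{1}{2\pi}\log\frac{qT}{2\pi} + \frac{1}{4T}\right)dT\\
&\qquad= \int_{\frac{T_0}{\pi|\delta|}}^\infty\left(\frac{|\delta|}{2}\log\frac{q|\delta|t}{2} + \frac{1}{4t}\right)e^{-0.1065t^2}\,dt\\
&\qquad\le \left(\frac{|\delta|}{2}\cdot\frac{\log\frac{T_0}{\pi|\delta|}}{2c\frac{T_0}{\pi|\delta|}} + \frac{|\delta|}{2}\cdot\frac{\log\frac{q|\delta|}{2}}{2c\frac{T_0}{\pi|\delta|}} + \frac{1}{8c\left(\frac{T_0}{\pi|\delta|}\right)^2}\right)e^{-c\left(\frac{T_0}{\pi|\delta|}\right)^2}
\end{aligned}
\tag{4.40}
\]
with $c = 0.1065$. Since $T_0 \ge 100$ and $q \ge 1$,
\[
\frac{2\pi}{8T_0} \le \frac{\pi}{400} \le 0.0057\cdot\frac{1}{2}\log\frac{qT_0}{2\pi}.
\]
Thus, the last line of (4.40) is less than
\[
1.0057\,\frac{|\delta|}{2}\cdot\frac{\log\frac{qT_0}{2\pi}}{2c\frac{T_0}{\pi|\delta|}}\,e^{-c\left(\frac{T_0}{\pi|\delta|}\right)^2} = 1.0057\,\frac{\pi|\delta|^2}{4cT_0}\,\log\frac{qT_0}{2\pi}\,e^{-c\left(\frac{T_0}{\pi|\delta|}\right)^2}.
\tag{4.41}
\]
Again by $T_0 \ge 4\pi^2|\delta|$, we see that $1.0057\,\pi|\delta|/(4cT_0) \le 1.0057/(16c\pi) \le 0.18787$.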
To obtain our final bound, we simply sum (4.36) and (4.41), and multiply by 4.226 (the constant in (4.34)). We conclude that the integral in (4.35) is at most
\[
\left(4.329\,e^{-0.1598T_0} + 0.8015\,|\delta|\,e^{-0.1065\left(\frac{T_0}{\pi|\delta|}\right)^2}\right)\log\frac{qT_0}{2\pi}.
\]
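The numerical constants in the proof of Lemma 4.5 can be replayed directly. The sketch below is a non-rigorous floating-point check; it assumes the reading of (4.36), (4.40) and (4.41) in which the worst case is $T_0 = 100$, $q = 1$, and it uses the constant 4.226 of (4.34). All variable names are ours.

```python
import math

T0, q = 100.0, 1.0
log_term = math.log(q * T0 / (2 * math.pi))

# (4.36): (1/(2 pi c)) * (1 + (1/c + pi/2) / (T0 log(qT0/2pi))), c = 0.1598
c_exp = 0.1598
factor_436 = (1 + (1 / c_exp + math.pi / 2) / (T0 * log_term)) / (2 * math.pi * c_exp)

# Step before (4.41): pi/400 <= 0.0057 * (1/2) * log(qT0/2pi)
lhs_small = math.pi / 400
rhs_small = 0.0057 * 0.5 * log_term

# After (4.41): 1.0057 / (16 c pi) with c = 0.1065
c_gauss = 0.1065
factor_441 = 1.0057 / (16 * c_gauss * math.pi)

# Final constants of the lemma, using the constant 4.226 from (4.34)
final_exp = 4.226 * 1.0242
final_gauss = 4.226 * 0.18787
print(factor_436, lhs_small, rhs_small, factor_441, final_exp, final_gauss)
```

Both ratios that are bounded at $T_0 = 100$, $q = 1$ only decrease as $T_0$ or $q$ grows, so checking that single point is enough for this purpose.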

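We need to record a few norms related to the Gaussian $\eta_\heartsuit(t) = e^{-t^2/2}$ before we proceed. Recall we are working with the one-sided Gaussian, i.e., we set $\eta_\heartsuit(t) = 0$ for $t < 0$. Symbolic integration then gives
\[
\begin{aligned}
|\eta_\heartsuit|_2^2 &= \int_0^\infty e^{-t^2}\,dt = \frac{\sqrt{\pi}}{2},\\
|\eta_\heartsuit'|_2^2 &= \int_0^\infty\left(te^{-t^2/2}\right)^2dt = \frac{\sqrt{\pi}}{4},\\
|\eta_\heartsuit\cdot\log|_2^2 &= \int_0^\infty e^{-t^2}(\log t)^2\,dt\\
&= \frac{\sqrt{\pi}}{16}\left(\pi^2 + 2\gamma^2 + 8\gamma\log 2 + 8(\log 2)^2\right) \le 1.94753,\\
\left|\eta_\heartsuit(t)/\sqrt{t}\right|_1 &= \int_0^\infty\frac{e^{-t^2/2}}{\sqrt{t}}\,dt = \frac{\Gamma(1/4)}{2^{3/4}} \le 2.15581,\\
\left|\eta_\heartsuit'(t)/\sqrt{t}\right|_1 &= \left|\eta_\heartsuit(t)\sqrt{t}\right|_1 = \int_0^\infty e^{-\frac{t^2}{2}}\sqrt{t}\,dt = \frac{\Gamma(3/4)}{2^{1/4}} \le 1.03045,\\
\left|\eta_\heartsuit'(t)t^{1/2}\right|_1 &= \left|\eta_\heartsuit(t)t^{3/2}\right|_1 = \int_0^\infty e^{-\frac{t^2}{2}}t^{\frac{3}{2}}\,dt \le 1.07791.
\end{aligned}
\tag{4.42}
\]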
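We can now state what is really our main result for the Gaussian smoothing. (The version in the introduction will, as we shall later see, follow from this, given numerical inputs.)

Proposition 4.6. Let $\eta(t) = e^{-t^2/2}$. Let $x \ge 1$, $\delta \in \mathbb{R}$. Let $\chi$ be a primitive character mod $q$, $q \ge 1$. Assume that all non-trivial zeros $\rho$ of $L(s,\chi)$ with $|\Im(\rho)| \le T_0$ lie on the critical line. Assume that $T_0 \ge \max(4\pi^2|\delta|, 100)$.
Then
\[
\sum_{n=1}^\infty\Lambda(n)\chi(n)e\left(\frac{\delta}{x}n\right)\eta\left(\frac{n}{x}\right) =
\begin{cases}
\widehat{\eta}(-\delta)x + O^*\left(\mathrm{err}_{\eta,\chi}(\delta,x)\right)\cdot x & \text{if } q = 1,\\
O^*\left(\mathrm{err}_{\eta,\chi}(\delta,x)\right)\cdot x & \text{if } q > 1,
\end{cases}
\tag{4.43}
\]
where
\[
\begin{aligned}
\mathrm{err}_{\eta,\chi}(\delta,x) &= \log\frac{qT_0}{2\pi}\cdot\left(4.329\,e^{-0.1598T_0} + 0.802\,|\delta|\,e^{-0.1065\frac{T_0^2}{(\pi\delta)^2}}\right)\\
&\quad + \left(2.337\sqrt{T_0}\log qT_0 + 21.817\sqrt{T_0} + 2.85\log q + 74.38\right)x^{-\frac{1}{2}}\\
&\quad + (3\log q + 14|\delta| + 17)\,x^{-1} + (\log q + 6)\cdot(1 + 5|\delta|)\cdot x^{-3/2}.
\end{aligned}
\]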
Proof. Let $F_\delta(s)$ be the Mellin transform of $\eta_\heartsuit(t)e(\delta t)$. By Lemmas 4.4 (with $G_\delta = F_\delta$) and 4.5,
\[
\left|\sum_{\rho \text{ non-trivial}} F_\delta(\rho)x^\rho\right|
\]
is at most (4.29) (with $\eta = \eta_\heartsuit$) times $\sqrt{x}$, plus
\[
\log\frac{qT_0}{2\pi}\cdot\left(4.329\,e^{-0.1598T_0} + 0.802\,|\delta|\,e^{-0.1065\frac{T_0^2}{(\pi\delta)^2}}\right)\cdot x.
\]
By the norm computations in (4.42), we see that (4.29) is at most
\[
2.337\sqrt{T_0}\log qT_0 + 21.817\sqrt{T_0} + 2.85\log q + 74.38.
\]
Let us now apply Lemma 4.1. We saw that the value of $R$ in Lemma 4.1 is bounded by (4.23). We know that $\eta_\heartsuit(0) = 1$. Again by (4.42), we get from (4.3) that $c_0 \le 1.4056 + 13.3466|\delta|$. Hence
\[
|R| \le 3\log q + 13.347|\delta| + 16.695.
\]
Lastly,
|2 + 2||| |2 0.942 + 4.183|| 1 + 5||.

|
Clearly
\[
(6.01 - 6)\cdot(1 + 5|\delta|) + 13.347|\delta| + 16.695 < 14|\delta| + 17,
\]
and so we are done.
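The coefficients in Proposition 4.6 follow from (4.29), (4.3) and the norms in (4.42) by plain arithmetic. The sketch below recomputes them in floating point. It assumes the closed forms $|\eta_\heartsuit|_2^2 = \sqrt{\pi}/2$, $|\eta_\heartsuit\cdot\log|_2^2 = \frac{\sqrt{\pi}}{16}(\pi^2 + 2\gamma^2 + 8\gamma\log 2 + 8(\log 2)^2)$, $|\eta_\heartsuit(t)/\sqrt{t}|_1 = \Gamma(1/4)/2^{3/4}$, $|\eta_\heartsuit(t)\sqrt{t}|_1 = \Gamma(3/4)/2^{1/4}$, $|\eta_\heartsuit(t)t^{3/2}|_1 = 2^{1/4}\Gamma(5/4)$, and that $c_0$ is $\frac{2}{3}$ times the sum of the four $\ell_1$ norms with weights $1, 1, 2\pi|\delta|, 2\pi|\delta|$; the variable names are ours.

```python
import math

G = 0.5772156649015329  # Euler's constant
sqrt_pi = math.sqrt(math.pi)

# Norms of the one-sided Gaussian, following (4.42)
eta_2 = math.sqrt(sqrt_pi / 2)
eta_log_2 = math.sqrt(sqrt_pi / 16 * (math.pi ** 2 + 2 * G * G
                                      + 8 * G * math.log(2)
                                      + 8 * math.log(2) ** 2))
eta_over_sqrt_1 = math.gamma(0.25) / 2 ** 0.75
eta_sqrt_1 = math.gamma(0.75) / 2 ** 0.25
eta_t32_1 = 2 ** 0.25 * math.gamma(1.25)

# Coefficients of (4.29) specialised to the Gaussian
coef_log = eta_2 + eta_log_2
coef_sqrt = 17.21 * eta_log_2 - math.log(2 * math.pi * math.sqrt(math.e)) * eta_2
coef_q = 1.32 * eta_over_sqrt_1
coef_const = 34.5 * eta_over_sqrt_1

# c_0 = const part + delta part * |delta|
c0_const = (2 / 3) * (eta_sqrt_1 + eta_t32_1)
c0_delta = (4 * math.pi / 3) * (eta_over_sqrt_1 + eta_sqrt_1)
print(coef_log, coef_sqrt, coef_q, coef_const, c0_const, c0_delta)
```

Adding 15.289 from (4.23) to the constant part of $c_0$ gives the 16.695 in the bound on $|R|$.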
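4.3. The case of $\eta(t) = t^2e^{-t^2/2}$ and $\eta_*(t)$. We will now work with a weight based on the Gaussian:
\[
\eta(t) = \begin{cases} t^2e^{-t^2/2} & \text{if } t \ge 0,\\ 0 & \text{if } t < 0.\end{cases}
\]
The fact that this vanishes at $t = 0$ actually makes it easier to work with at several levels.
Its Mellin transform is just a shift of that of the Gaussian. Write
\[
\begin{aligned}
F_\delta(s) &= \left(M\left(e^{-\frac{t^2}{2}}e(\delta t)\right)\right)(s),\\
G_\delta(s) &= \left(M(\eta(t)e(\delta t))\right)(s).
\end{aligned}
\tag{4.44}
\]
Then, by the definition of the Mellin transform,
\[
G_\delta(s) = F_\delta(s + 2).
\]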
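We start by bounding the contribution of zeros with large imaginary part, just as before.

Lemma 4.7. Let $\eta(t) = t^2e^{-t^2/2}$. Let $x \in \mathbb{R}^+$, $\delta \in \mathbb{R}$. Let $\chi$ be a primitive character mod $q$, $q \ge 1$. Assume that all non-trivial zeros $\rho$ of $L(s,\chi)$ with $|\Im(\rho)| \le T_0$ satisfy $\Re(s) = 1/2$. Assume that $T_0 \ge \max(4\pi^2|\delta|, 100)$.
Write $G_\delta(s)$ for the Mellin transform of $\eta(t)e(\delta t)$. Then
\[
\sum_{\substack{\rho \text{ non-trivial}\\ |\Im(\rho)|>T_0}}|G_\delta(\rho)| \le T_0\log\frac{qT_0}{2\pi}\cdot\left(3.5\,e^{-0.1598T_0} + 0.64\,e^{-0.1065\cdot\frac{T_0^2}{(\pi\delta)^2}}\right).
\]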
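Proof. We start by writing
\[
\sum_{\substack{\rho \text{ non-trivial}\\ |\Im(\rho)|>T_0}}|G_\delta(\rho)| = \sum_{\substack{\rho \text{ non-trivial}\\ \Im(\rho)>T_0}}\left(|F_\delta(\rho + 2)| + |F_\delta((1-\rho) + 2)|\right),
\]
where we are using $G_\delta(\rho) = F_\delta(\rho + 2)$ and the fact that non-trivial zeros come in pairs $\rho$, $1-\rho$.
By Cor. 3.2 with $k = 2$,
\[
\sum_{\substack{\rho \text{ non-trivial}\\ |\Im(\rho)|>T_0}}|G_\delta(\rho)| \le \sum_{\substack{\rho \text{ non-trivial}\\ \Im(\rho)>T_0}} f(\Im(\rho)),
\]
where
\[
f(\tau) = c_2\cdot\begin{cases}
|\tau|e^{-0.1598|\tau|} + \frac{1}{4}\left(\frac{|\tau|}{\pi|\delta|}\right)^2e^{-0.1065\left(\frac{|\tau|}{\pi|\delta|}\right)^2} & \text{if } |\tau| < \frac{3}{2}(\pi\delta)^2,\\
|\tau|e^{-0.1598|\tau|} & \text{if } |\tau| \ge \frac{3}{2}(\pi\delta)^2,
\end{cases}
\tag{4.45}
\]
where $c_2 = 3.262$. We are including the term $c_2|\tau|e^{-0.1598|\tau|}$ in both cases in part because we will not bother to take it out (just as we did not bother in the proof of Lem. 4.5) and in part to ensure that $f(\tau)$ is a decreasing function of $\tau$ for $\tau \ge T_0$.
We can now apply Lemma 4.3. We obtain, again,
\[
\sum_{\substack{\rho \text{ non-trivial}\\ \Im(\rho)>T_0}} f(\Im(\rho)) \le \int_{T_0}^\infty f(T)\left(\frac{1}{2\pi}\log\frac{qT}{2\pi} + \frac{1}{4T}\right)dT.
\tag{4.46}
\]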
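Just as before, we will need to estimate some integrals. For any $y \ge 1$, $c, c_1 > 0$,
\[
\int_y^\infty te^{-ct}\,dt = \left(\frac{y}{c} + \frac{1}{c^2}\right)e^{-cy},
\]
\[
\begin{aligned}
\int_y^\infty\left(t\log t + \frac{c_1}{t}\right)e^{-ct}\,dt &\le \int_y^\infty\left(\left(t + \frac{a-1}{c}\right)\log t - \frac{1}{c} - \frac{a}{c^2t}\right)e^{-ct}\,dt\\
&= \left(\frac{y}{c} + \frac{a}{c^2}\right)e^{-cy}\log y,
\end{aligned}
\]
where
\[
a = \frac{\frac{\log y}{c} + \frac{1}{c} + \frac{c_1}{y}}{\frac{\log y}{c} - \frac{1}{c^2y}}.
\]
Setting $c = 0.1598$, $c_1 = \pi/2$, $y = T_0 \ge 100$, we obtain that
\[
\begin{aligned}
&\int_{T_0}^\infty\left(\frac{1}{2\pi}\log\frac{qT}{2\pi} + \frac{1}{4T}\right)Te^{-0.1598T}\,dT\\
&\qquad\le \frac{1}{2\pi}\left(\log\frac{q}{2\pi}\cdot\left(\frac{T_0}{c} + \frac{1}{c^2}\right) + \left(\frac{T_0}{c} + \frac{a}{c^2}\right)\log T_0\right)e^{-0.1598T_0}
\end{aligned}
\]
and
\[
a = \frac{\frac{\log T_0}{0.1598} + \frac{1}{0.1598} + \frac{\pi/2}{T_0}}{\frac{\log T_0}{0.1598} - \frac{1}{0.1598^2\,T_0}} \le 1.235.
\]
Multiplying by $c_2 = 3.262$ and simplifying by the assumption $T_0 \ge 100$, we obtain that
\[
\int_{T_0}^\infty 3.255\,Te^{-0.1598T}\left(\frac{1}{2\pi}\log\frac{qT}{2\pi} + \frac{1}{4T}\right)dT \le 3.5\,T_0\log\frac{qT_0}{2\pi}\cdot e^{-0.1598T_0}.
\tag{4.47}
\]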
Now let us examine the Gaussian term. First of all, when does it arise? If $T_0 \ge (3/2)(\pi\delta)^2$, then $|\tau| \ge (3/2)(\pi\delta)^2$ holds whenever $|\tau| \ge T_0$, and so (4.45) does not give us a Gaussian term. Recall that $T_0 \ge 4\pi^2|\delta|$, which means that $|\delta| \le 8/3$ implies that $T_0 \ge (3/2)(\pi\delta)^2$. We can thus assume from now on that $|\delta| > 8/3$, since otherwise there is no Gaussian term to treat.
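For any $y \ge 1$, $c, c_1 > 0$,
\[
\int_y^\infty t^2e^{-ct^2}\,dt < \int_y^\infty\left(t^2 + \frac{1}{4c^2t^2}\right)e^{-ct^2}\,dt = \left(\frac{y}{2c} + \frac{1}{4c^2y}\right)e^{-cy^2},
\]
\[
\begin{aligned}
\int_y^\infty\left(t^2\log t + c_1t\right)e^{-ct^2}\,dt &\le \int_y^\infty\left(t^2\log t + \frac{at\log et}{2c} - \frac{\log et}{2c} - \frac{a}{4c^2t}\right)e^{-ct^2}\,dt\\
&= \frac{(2cy + a)\log y + a}{4c^2}\,e^{-cy^2},
\end{aligned}
\]
where
\[
a = \frac{c_1y + \frac{\log ey}{2c}}{\frac{y\log ey}{2c} - \frac{1}{4c^2y}} = \frac{1}{y} + \frac{c_1y + \frac{1}{4c^2y^2}}{\frac{y\log ey}{2c} - \frac{1}{4c^2y}}.
\]
(Note that $a$ decreases as $y \ge 1$ increases.) Setting $c = 0.1065$, $c_1 = 1/(2|\delta|) \le 3/16$ and $y = T_0/(\pi|\delta|) \ge 4\pi$, we obtain
\[
\begin{aligned}
&\int_{\frac{T_0}{\pi|\delta|}}^\infty\left(\frac{1}{2\pi}\log\frac{q|\delta|t}{2} + \frac{1}{4\pi|\delta|t}\right)t^2e^{-0.1065t^2}\,dt\\
&\qquad\le \left(\frac{1}{2\pi}\log\frac{q|\delta|}{2}\cdot\left(\frac{T_0}{2\pi c|\delta|} + \frac{1}{4c^2\cdot 4\pi}\right)
+ \frac{1}{2\pi}\cdot\frac{\left(2c\frac{T_0}{\pi|\delta|} + a\right)\log\frac{T_0}{\pi|\delta|} + a}{4c^2}\right)e^{-0.1065\left(\frac{T_0}{\pi|\delta|}\right)^2}
\end{aligned}
\]
and
\[
a \le \frac{1}{4\pi} + \frac{4\pi\cdot\frac{3}{16} + \frac{1}{4\cdot 0.1065^2(4\pi)^2}}{\frac{4\pi\log 4\pi e}{2\cdot 0.1065} - \frac{1}{4\cdot 0.1065^2\cdot 4\pi}} \le 0.092.
\]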
Multiplying by (c2 /4)||, we get that

2
Z
T
qT0
1
T 2 0.1065 ||
c2
1
log
+
e
dT
(4.48)
||
2
2
4T
T0 4

T0 2
0.1065 ||
is at most e

(4.49)
times
q||
T0
eT0
(0.61T0 + 0.716||) log
+ 0.61T0 log
+ 0.827|| log
2
||
||
!
1
1 + log T0 /||
qT0
qT0
0.64T0 log
,
0.61 + 0.828
T0 log
T0 /||
2
2
where we are using several times the assumption that $T_0 \ge 4\pi^2|\delta|$.

We sum (4.47) and the estimate for (4.48) we have just got to reach our conclusion.

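Again, we record some norms obtained by symbolic integration:
\[
\begin{aligned}
|\eta|_2^2 &= \frac{3}{8}\sqrt{\pi}, \qquad |\eta'|_2^2 = \frac{7}{16}\sqrt{\pi},\\
|\eta\cdot\log|_2^2 &= \frac{\sqrt{\pi}}{64}\left(8(3\gamma - 8)\log 2 + 3\pi^2 + 6\gamma^2 + 24(\log 2)^2 + 16 - 32\gamma\right) \le 0.16364,\\
\left|\eta(t)/\sqrt{t}\right|_1 &= \frac{2^{1/4}\,\Gamma(1/4)}{4} \le 1.07791, \qquad \left|\eta(t)\sqrt{t}\right|_1 = \frac{3}{4}\,2^{3/4}\,\Gamma(3/4) \le 1.54568,
\end{aligned}
\tag{4.50}
\]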
Z 2
Z
2
2
t
t
| (t)/ t|1 =
t3/2 e 2 dt t3/2 e 2 dt 1.48469,
0
2
| (t) t|1 1.72169.
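Proposition 4.8. Let $\eta(t) = t^2e^{-t^2/2}$. Let $x \ge 1$, $\delta \in \mathbb{R}$. Let $\chi$ be a primitive character mod $q$, $q \ge 1$. Assume that all non-trivial zeros $\rho$ of $L(s,\chi)$ with $|\Im(\rho)| \le T_0$ lie on the critical line. Assume that $T_0 \ge \max(4\pi^2|\delta|, 100)$.
Then
\[
\sum_{n=1}^\infty\Lambda(n)\chi(n)e\left(\frac{\delta}{x}n\right)\eta(n/x) =
\begin{cases}
\widehat{\eta}(-\delta)x + O^*\left(\mathrm{err}_{\eta,\chi}(\delta,x)\right)\cdot x & \text{if } q = 1,\\
O^*\left(\mathrm{err}_{\eta,\chi}(\delta,x)\right)\cdot x & \text{if } q > 1,
\end{cases}
\tag{4.51}
\]
where
\[
\begin{aligned}
\mathrm{err}_{\eta,\chi}(\delta,x) &= T_0\log\frac{qT_0}{2\pi}\cdot\left(3.5\,e^{-0.1598T_0} + 0.64\,e^{-0.1065\cdot\frac{T_0^2}{(\pi\delta)^2}}\right)\\
&\quad + \left(1.22\sqrt{T_0}\log qT_0 + 5.053\sqrt{T_0} + 1.423\log q + 37.19\right)\cdot x^{-1/2}\\
&\quad + (3 + 11|\delta|)\,x^{-1} + (\log q + 6)\cdot(1 + 6|\delta|)\cdot x^{-3/2}.
\end{aligned}
\tag{4.52}
\]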
Proof. We proceed as in the proof of Prop. 4.6. The contribution of Lemma 4.7 is
\[
T_0\log\frac{qT_0}{2\pi}\cdot\left(3.5\,e^{-0.1598T_0} + 0.64\,e^{-0.1065\cdot\frac{T_0^2}{(\pi\delta)^2}}\right)\cdot x,
\]
whereas the contribution of Lemma 4.4 is at most
\[
\left(1.22\sqrt{T_0}\log qT_0 + 5.053\sqrt{T_0} + 1.423\log q + 37.188\right)\sqrt{x}.
\]
Let us now apply Lemma 4.1. Since $\eta(0) = 0$, we have
\[
R = O^*(c_0) = O^*(2.138 + 10.99|\delta|).
\]
Lastly,
\[
|\eta'|_2 + 2\pi|\delta|\,|\eta|_2 \le 0.881 + 5.123|\delta|.
\]
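Most constants in the proof of Proposition 4.8 follow from (4.50) by arithmetic that is easy to replay. The sketch below is a floating-point check under stated assumptions: the closed forms $|\eta|_2^2 = \frac{3}{8}\sqrt{\pi}$, $|\eta'|_2^2 = \frac{7}{16}\sqrt{\pi}$, $|\eta(t)/\sqrt{t}|_1 = 2^{1/4}\Gamma(1/4)/4$, $|\eta(t)\sqrt{t}|_1 = \frac{3}{4}2^{3/4}\Gamma(3/4)$, the quoted bounds 1.48469 and 1.72169 for the two norms of $\eta'$, and $c_0$ equal to $\frac{2}{3}$ times the weighted sum of the four $\ell_1$ norms. Variable names are ours.

```python
import math

G = 0.5772156649015329  # Euler's constant
sqrt_pi = math.sqrt(math.pi)

eta_2 = math.sqrt(3 * sqrt_pi / 8)
deta_2 = math.sqrt(7 * sqrt_pi / 16)
eta_log_sq = sqrt_pi / 64 * (8 * (3 * G - 8) * math.log(2) + 3 * math.pi ** 2
                             + 6 * G * G + 24 * math.log(2) ** 2
                             + 16 - 32 * G)
eta_over_sqrt_1 = 2 ** 0.25 * math.gamma(0.25) / 4
eta_sqrt_1 = 0.75 * 2 ** 0.75 * math.gamma(0.75)

# Quoted (not recomputed) bounds for the two l_1 norms involving eta'
DETA_OVER_SQRT_1 = 1.48469
DETA_SQRT_1 = 1.72169

c0_const = (2 / 3) * (DETA_OVER_SQRT_1 + DETA_SQRT_1)
c0_delta = (4 * math.pi / 3) * (eta_over_sqrt_1 + eta_sqrt_1)
coef_q = 1.32 * eta_over_sqrt_1
coef_const = 34.5 * eta_over_sqrt_1
print(eta_2, deta_2, eta_log_sq, c0_const, c0_delta, coef_q, coef_const)
```

The values $|\eta'|_2 \le 0.881$ and $2\pi|\eta|_2 \le 5.123$ give the last display of the proof.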
Now that we have Prop. 4.8, we can derive from it similar bounds for a smoothing defined as the multiplicative convolution of $\eta$ with something else. In general, for $\varphi_1, \varphi_2 : [0,\infty) \to \mathbb{C}$, if we know how to bound sums of the form
\[
S_{f,\varphi_1}(x) = \sum_n f(n)\varphi_1(n/x),
\tag{4.53}
\]
we can bound sums of the form $S_{f,\varphi_1 *_M \varphi_2}$, simply by changing the order of summation and integration:
\[
\begin{aligned}
S_{f,\varphi_1 *_M \varphi_2} &= \sum_n f(n)(\varphi_1 *_M \varphi_2)\left(\frac{n}{x}\right)\\
&= \int_0^\infty\sum_n f(n)\varphi_1\left(\frac{n}{wx}\right)\varphi_2(w)\,\frac{dw}{w} = \int_0^\infty S_{f,\varphi_1}(wx)\varphi_2(w)\,\frac{dw}{w}.
\end{aligned}
\tag{4.54}
\]
This is particularly nice if $\varphi_2(t)$ vanishes in a neighbourhood of the origin, since then the argument $wx$ of $S_{f,\varphi_1}(wx)$ is always large.
We will use $\varphi_1(t) = t^2e^{-t^2/2}$, $\varphi_2(t) = \eta_1 *_M \eta_1$, where $\eta_1$ is 2 times the characteristic function of the interval $[1/2, 1]$. This is the example that will be used in [Helc]. The motivation for the choice of $\varphi_1$ and $\varphi_2$ is clear: we have just got bounds based on $\varphi_1(t)$ in the major arcs, and the minor-arc bounds in [Helb] (used in [Helc], not here) were obtained for the weight $\varphi_2(t)$.
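Corollary 4.9. Let $\eta(t) = t^2e^{-t^2/2}$, $\eta_1 = 2\cdot I_{[1/2,1]}$, $\eta_2 = \eta_1 *_M \eta_1$. Let $\eta_* = \eta_2 *_M \eta$. Let $x \in \mathbb{R}^+$, $\delta \in \mathbb{R}$. Let $\chi$ be a primitive character mod $q$, $q \ge 1$. Assume that all non-trivial zeros $\rho$ of $L(s,\chi)$ with $|\Im(\rho)| \le T_0$ lie on the critical line. Assume that $T_0 \ge \max(4\pi^2|\delta|, 100)$.
Then
\[
\sum_{n=1}^\infty\Lambda(n)\chi(n)e\left(\frac{\delta}{x}n\right)\eta_*(n/x) =
\begin{cases}
\widehat{\eta_*}(-\delta)x + O^*\left(\mathrm{err}_{\eta_*,\chi}(\delta,x)\right)\cdot x & \text{if } q = 1,\\
O^*\left(\mathrm{err}_{\eta_*,\chi}(\delta,x)\right)\cdot x & \text{if } q > 1,
\end{cases}
\tag{4.55}
\]
where
\[
\begin{aligned}
\mathrm{err}_{\eta_*,\chi}(\delta,x) &= T_0\log\frac{qT_0}{2\pi}\cdot\left(3.5\,e^{-0.1598T_0} + 0.0019\cdot e^{-0.1065\cdot\frac{T_0^2}{(\pi\delta)^2}}\right)\\
&\quad + \left(1.675\sqrt{T_0}\log qT_0 + 6.936\sqrt{T_0} + 1.954\log q + 51.047\right)\cdot x^{-\frac{1}{2}}\\
&\quad + (6 + 22|\delta|)\,x^{-1} + (\log q + 6)\cdot(3 + 17|\delta|)\cdot x^{-3/2}.
\end{aligned}
\tag{4.56}
\]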
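Proof. The left side of (4.55) equals
\[
\begin{aligned}
&\int_0^\infty\sum_{n=1}^\infty\Lambda(n)\chi(n)e\left(\frac{\delta n}{x}\right)\eta\left(\frac{n}{wx}\right)\eta_2(w)\,\frac{dw}{w}\\
&\qquad= \int_{\frac{1}{4}}^1\sum_{n=1}^\infty\Lambda(n)\chi(n)e\left(\frac{w\delta n}{wx}\right)\eta\left(\frac{n}{wx}\right)\eta_2(w)\,\frac{dw}{w},
\end{aligned}
\]
since $\eta_2$ is supported on $[1/4, 1]$. By Prop. 4.8, the main term (if $q = 1$) contributes
\[
\begin{aligned}
\int_{\frac{1}{4}}^1\widehat{\eta}(-w\delta)\,xw\,\eta_2(w)\,\frac{dw}{w} &= x\int_0^\infty\widehat{\eta}(-w\delta)\eta_2(w)\,dw\\
&= x\int_0^\infty\int_{-\infty}^\infty\eta(t)e(w\delta t)\,dt\;\eta_2(w)\,dw = x\int_0^\infty\int_{-\infty}^\infty\eta\left(\frac{r}{w}\right)e(\delta r)\,\frac{dr}{w}\,\eta_2(w)\,dw\\
&= x\int_{-\infty}^\infty\left(\int_0^\infty\eta\left(\frac{r}{w}\right)\eta_2(w)\,\frac{dw}{w}\right)e(\delta r)\,dr = \widehat{\eta_*}(-\delta)\cdot x.
\end{aligned}
\]
The error term is
\[
\int_{\frac{1}{4}}^1\mathrm{err}_{\eta,\chi}(w\delta, wx)\cdot wx\cdot\eta_2(w)\,\frac{dw}{w} = x\cdot\int_{\frac{1}{4}}^1\mathrm{err}_{\eta,\chi}(w\delta, wx)\,\eta_2(w)\,dw.
\tag{4.57}
\]
Since
\[
\begin{aligned}
\int_w\eta_2(w)\,dw &= 1, & \int_w w^{-1/2}\eta_2(w)\,dw &\le 1.37259,\\
\int_w w^{-1}\eta_2(w)\,dw &= 4(\log 2)^2 \le 1.92182, & \int_w w^{-3/2}\eta_2(w)\,dw &\le 2.74517
\end{aligned}
\]
and
\[
\int_w e^{-0.1065\cdot(4\pi)^2\left(\frac{1}{w^2} - 1\right)}\eta_2(w)\,dw \le 0.002866,
\]
we see that (4.52) and (4.57) imply (4.56).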
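The four moments of $\eta_2$ used in this proof can be checked independently, because the Mellin transform turns multiplicative convolution into a product: each moment of $\eta_2 = \eta_1 *_M \eta_1$ is the square of the corresponding moment of $\eta_1$. The sketch below assumes $\eta_1 = 2\cdot I_{[1/2,1]}$ and the resulting closed form $\eta_2(w) = 4\log 4w$ on $[1/4,1/2]$, $\eta_2(w) = -4\log w$ on $[1/2,1]$ (derived here, not quoted from the paper); the function names are ours.

```python
import math

def eta2(w):
    """(eta_1 *_M eta_1)(w) for eta_1 = 2 * indicator of [1/2, 1]."""
    if 0.25 <= w <= 0.5:
        return 4 * math.log(4 * w)
    if 0.5 < w <= 1:
        return -4 * math.log(w)
    return 0.0

def moment(s, n=20_000):
    """int w^{-s} eta_2(w) dw, midpoint rule on each smooth piece."""
    total = 0.0
    for a, b in ((0.25, 0.5), (0.5, 1.0)):
        h = (b - a) / n
        total += h * sum((a + (k + 0.5) * h) ** -s * eta2(a + (k + 0.5) * h)
                         for k in range(n))
    return total

def eta1_moment(s):
    """int_{1/2}^{1} 2 w^{-s} dw in closed form."""
    if s == 1:
        return 2 * math.log(2)
    return 2 * (1 - 0.5 ** (1 - s)) / (1 - s)

moments = {s: moment(s) for s in (0, 0.5, 1, 1.5)}
print(moments)
```

Multiplying the coefficients of (4.52) by these moments gives those of (4.56); for instance $1.22 \cdot 1.37259 \le 1.675$ and $37.19 \cdot 1.37259 \le 51.047$.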

5By rigorous integration from 1/4 to 1/2 and from 1/2 to 1 using VNODE-LP [Ned06].
4.4. The case of $\eta_+(t)$. We will work with
\[
\eta(t) = \eta_+(t) = h_H(t)\,t\,\eta_\heartsuit(t) = h_H(t)\,te^{-t^2/2},
\tag{4.58}
\]
where $h_H$ is as in (1.5). We recall that $h_H$ is a band-limited approximation to the function $h$ defined in (1.4); to be more precise, $Mh_H(it)$ is the truncation of $Mh(it)$ to the interval $[-H, H]$.
We are actually defining $h$, $h_H$ and $\eta$ in a slightly different way from what was done in the first draft of the present paper. The difference is instructive. There, $\eta(t)$ was defined as $h_H(t)e^{-t^2/2}$, and $h_H$ was a band-limited approximation to a function $h$ defined as in (1.4), but with $t^3(2-t)^3$ instead of $t^2(2-t)^3$. The reason for our new definitions is that now the truncation of $Mh(it)$ will not break the holomorphy of $M\eta$, and so we will be able to prove the general results we proved in §4.1.
In essence, $Mh$ will still be holomorphic because the Mellin transform of $t\eta_\heartsuit(t)$ is holomorphic in the domain we care about, unlike the Mellin transform of $\eta_\heartsuit(t)$, which does have a pole at $s = 0$.
As usual, we start by bounding the contribution of zeros with large imaginary part. The procedure is much as before: since $\eta_+(t) = h_H(t)\,t\eta_\heartsuit(t)$, the Mellin transform $M\eta_+$ is a convolution of $M(te^{-t^2/2})$ and something of support in $[-H, H]i$, namely, $Mh_H$ restricted to the imaginary axis. This means that the decay of $M\eta_+$ is (at worst) like the decay of $M(te^{-t^2/2})$, delayed by $H$.
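Lemma 4.10. Let $\eta = \eta_+$ be as in (4.58) for some $H \ge 5$. Let $x \in \mathbb{R}^+$, $\delta \in \mathbb{R}$. Let $\chi$ be a primitive character mod $q$, $q \ge 1$. Assume that all non-trivial zeros $\rho$ of $L(s,\chi)$ with $|\Im(\rho)| \le T_0$ satisfy $\Re(s) = 1/2$, where $T_0 \ge H + \max(4\pi^2|\delta|, 100)$.
Write $G_\delta(s)$ for the Mellin transform of $\eta(t)e(\delta t)$. Then
\[
\sum_{\substack{\rho \text{ non-trivial}\\ |\Im(\rho)|>T_0}}|G_\delta(\rho)| \le \left(9.462\sqrt{T_0'}\,e^{-0.1598T_0'} + 11.287\,|\delta|\,e^{-0.1065\left(\frac{T_0'}{\pi\delta}\right)^2}\right)\log\frac{qT_0}{2\pi},
\]
where $T_0' = T_0 - H$.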
Proof. As usual,
non-trivial
|()|>T0
|G ()| =
non-trivial
()>T0
(|G ()| + |G (1 )|) .

2
Let F be as in (4.44). Then, since + (t)e(t) = hH (t)tet /2 e(t), where hH is

as in (1.5), we see by (2.5) that
Z H
1
M h(ir)F (s + 1 ir)dr,
G (s) =
2 H
and so, since |M h(ir)| = |M h(ir)|,

(4.59)
Z H
1
|G ()| + |G (1 )|
|M h(ir)|(|F (1 + ir)| + |F (2 ( ir))|)dr.
2 H
We apply Cor. 3.2 with k = 1 and T0 H instead of T0 , and obtain that

|F ()| + |F (1 )| g( ), where

p
2
| |
| |e0.1598| | +
e0.1065( )
(4.60)
g( ) = c1
2||
where c1 = 3.516. (As in the proof of Lemma 4.7, we are really putting in extra
terms so as to simplify our integrals.)
From (4.59), we conclude that
|G ()| + |G (1 )| f ( ),
for = + i , > 0, where
|M h(ir)|1
g( H)
2
is decreasing for T0 (because g( ) is decreasing for T0 H). By (B.17),
|M h(ir)|1 16.193918.
We apply Lemma 4.3, and get that

Z
X
1
qT
1
|G ()|
f (T )
log
+
dT
2
2
4T
T0
non-trivial
|()|>T0
(4.61)

Z
|M h(ir)|1
qT
1
1
=
log
+
dT.
g(T H)
2
2
2
4T
T0
f ( ) =
Now we just need to estimate some integrals. For any y e2 , c > 0 and
, 1 0,

Z
ct
y
1
te dt
+ 2
ecy ,
c
2c
y
y

Z
y
1
a
ct
t log(t + ) + e dt
+ 2
log(y + )ecy ,
c
c
y
t
y
where
a=
1 + c1
1
+
.
2 log(y + )
The contribution of the exponential term in (4.60) to (4.61) thus equals

Z
c1 |M h(ir)|1 1
qT
1
T H e0.1598(T H) dT
log
+
2
2
2
4T
T0

Z
q
log 2
1
1 0.1598T
9.06194
log(T + H) +
+
Te
dT
(4.62)
2
2
4T
T0 H

a
qT0 0.1598(T0 H)
9.06194
T0 H
+
e
,
log
2
0.1598
2
0.15982 T0 H
where a = 1/2 + (1 + 0.1598/2)/ log T0 . Since T0 H 100 and T0 105, this

is at most
p
qT0 0.1598(T0 H)
9.4612 T0 H log
e
.
2
We now estimate a few more integrals so that we can handle the Gaussian
term in (4.60). For any y > 1, c > 0, , 1 0,
Z
2
ecy
2
,
tect dt =
2c
y
!
Z
2
1
1 + 2cy
log(y + ) ecy
ct2
(t log(t + ) + 1 )e
dt 1 +
y log(y + )
2c
y
Proceeding just as before, we see that the contribution of the Gaussian term in
(4.60) to (4.61) is at most
(4.63)

Z
c1 |M h(ir)|1 1
qT
1 T H 0.1065( T H )2
log
+
e
dT
2
2
2
4T
2||
T0

Z
q|| /2
2
||
H
+
+ log
T e0.1065T dT
9.06194
log T +
4 T0 H
||
2
T
||
||

H 2
+
qT0 0.1065 T0
||
2
20.1065(T0 H)
e
1+
,
log
9.06194
T0 H
T0
8 0.1065
2
|| log ||
Since (T0 H)/(||) 4, this is at most
qT0 0.1065
e
11.287|| log
2
T0 H
2
.
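Proposition 4.11. Let $\eta = \eta_+$ be as in (4.58) for some $H \ge 50$. Let $x \ge 10^3$, $\delta \in \mathbb{R}$. Let $\chi$ be a primitive character mod $q$, $q \ge 1$. Assume that all non-trivial zeros $\rho$ of $L(s,\chi)$ with $|\Im(\rho)| \le T_0$ lie on the critical line, where $T_0 \ge H + \max(4\pi^2|\delta|, 100)$.
Then
\[
\sum_{n=1}^\infty\Lambda(n)\chi(n)e\left(\frac{\delta}{x}n\right)\eta_+(n/x) =
\begin{cases}
\widehat{\eta_+}(-\delta)x + O^*\left(\mathrm{err}_{\eta_+,\chi}(\delta,x)\right)\cdot x & \text{if } q = 1,\\
O^*\left(\mathrm{err}_{\eta_+,\chi}(\delta,x)\right)\cdot x & \text{if } q > 1,
\end{cases}
\tag{4.64}
\]
where
\[
\begin{aligned}
\mathrm{err}_{\eta_+,\chi}(\delta,x) &= \left(9.462\sqrt{T_0'}\cdot e^{-0.1598T_0'} + 11.287\,|\delta|\,e^{-0.1065\left(\frac{T_0'}{\pi\delta}\right)^2}\right)\log\frac{qT_0}{2\pi}\\
&\quad + \left(1.631\sqrt{T_0}\log qT_0 + 12.42\sqrt{T_0} + 1.321\log q + 34.51\right)x^{-1/2}\\
&\quad + (9 + 11|\delta|)\,x^{-1} + (\log q)(11 + 6|\delta|)\,x^{-3/2},
\end{aligned}
\tag{4.65}
\]
where $T_0' = T_0 - H$.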
Proof. We can apply Lemmas 4.1 and Lemma 4.4 because + (t), (log t)+ (t) and
(t) are in (by (B.25), (B.28) and (B.32)) and (t)t1 and (t)t1 are in
+
2
+
+
1 for in an open interval containing [1/2, 3/2] (by (B.30) and (B.33)). (Because
of (4.5), the fact that + (t)t1/2 and + (t)t1/2 are in 1 implies that + (t) log t is
also in 1 , as is required by Lemma 4.4.)
We apply Lemmas 4.1, 4.4 and 4.10. We bound the norms involving + using
the estimates in B.3 and B.4. Since + (0) = 0 (by the definition (B.3) of + ),
the term R in (4.2) is at most c0 , where c0 is as in (4.3). We bound
p

p

p
p
2
2.922875
(1/2) + (3/2) + 1.062319
(5/2) + (7/2)
c0
3
p

p
4
(3/2) + (5/2) 6.536232 + 9.319578||
+
|| 1.062319
3
using (B.30) and (B.33). By (B.25), (B.32) and the assumption H 50,
|+ |2 0.80044,
|+
|2 10.845789.
Thus, the error terms in (4.1) total at most

(4.66)
6.536232+9.319578|| + (log q + 6.01)(10.845789 + 2 0.80044||)x1/2
9 + 11|| + (log q)(11 + 6||)x1/2 .

P
The part of the sum G ()x in (4.1) corresponding to zeros with |()| >
T0 gets estimated by Lem 4.10. By Lemma 4.4, the part of the sum corresponding
to zeros with |()| T0 is at most
p
p
(1.631 T0 log qT0 + 12.42 T0 + 1.321 log q + 34.51)x1/2 ,
where we estimate the norms |+ |2 , | log |2 and |(t)/ t|1 by (B.25), (B.28)
and (B.30).

4.5. A sum for $\eta_+(t)^2$. Using a smoothing function sometimes leads to considering sums involving the square of the smoothing function. In particular, [Hela] requires a result involving $\eta_+^2$, something that could be slightly challenging to prove, given the way in which $\eta_+$ is defined. Fortunately, we have bounds on $|\eta_+|_\infty$ and other $\ell_\infty$-norms (see Appendix B.5). Our task will also be made easier by the fact that we do not have a phase $e(\delta n/x)$ this time. All in all, this will be yet another demonstration of the generality of the framework developed in §4.1.

Proposition 4.12. Let $\eta = \eta_+$ be as in (4.58), $H \ge 50$. Let $x \ge 10^8$. Assume that all non-trivial zeros $\rho$ of the Riemann zeta function $\zeta(s)$ with $|\Im(\rho)| \le T_0$ lie on the critical line, where $T_0 \ge 2H + \max(H/4, 50)$.
Then
\[
\sum_{n=1}^\infty\Lambda(n)(\log n)\,\eta_+^2(n/x) = x\cdot\int_0^\infty\eta_+^2(t)\log xt\,dt + O^*\left(\mathrm{err}_{\ell_2,\eta_+}\right)\cdot x\log x,
\tag{4.67}
\]
where
\[
\begin{aligned}
\mathrm{err}_{\ell_2,\eta_+} &= \left(0.607\,\frac{(\log T_0)^2}{\log x} + 1.21(\log T_0)\right)\sqrt{T_0}\,e^{-\frac{\pi(T_0 - 2H)}{4}}\\
&\quad + \left(2.06\sqrt{T_0}\log T_0 + 43.87\right)\cdot x^{-1/2}.
\end{aligned}
\tag{4.68}
\]
Proof. We will need to consider two smoothing functions, namely, +,0 (t) =
+ (t)2 and +,1 = + (t)2 log t. Clearly,
2
(n)(log n)+
(n/x) = (log x)
(n)+,0 (n/x) +
n=1
n=1
2 /2
Since + (t) = hH (t)tet
(n)+,1 (n/x).
n=1
2
+,1 (r)
+,0 (r) = h2H (t)tet ,
2
(log x)+,0 + +,1 = + (t) log xt.
= h2H (t)(log t)tet .
Let +,2 =
We wish to apply Lemma 4.1. For this, we must first check that some norms
are finite. Clearly,
2
2
+,2 (t) = +
(t) log x + +
(t) log t
(t) log x + 2+ (t)+

(t) log t + +
(t)/t.
+,2
(t) = 2+ (t)+
Thus, we see that +,2 (t) is in 2 because + (t) is in 2 and + (t), + (t) log t are
both in (see (B.25), (B.38), (B.40)):
2
2

|+,2 (t)|2 +
(t)2 log x + +
(t) log t2
(4.69)
|+ | |+ |2 log x + |+ (t) log t| |+ |2 .
(t) is in because (t) is in , (t) is in (B.32), and (t),

Similarly, +,2
2
+
2
2
+
+
+ (t) log t and + (t)/t (see (B.41)) are all in :
(4.70)

2

log x + 2+ (t)+
+ +

+,2 (t) 2+ (t)+
(t)
(t)
log
t
(t)/t
2
2
2
2
2 |+ | + log x + 2 |+ (t) log t| + + |+ (t)/t| |+ | .
(t)t1
In the same way, we see that +,2

is in 1 for all in (1, ) (because
1
the same is true of + (t)t
(B.30), and + (t), + (t) log t are both in ) and
(t)t1 is in for all in (0, ) (because the same is true of (t)t1 and
+,2
1
+
1
+ (t)t
(B.33), and + (t), + (t) log t, + (t)/t are all in ).
We now apply Lemma 4.1 with q = 1, = 0. Since +,2 (0) = 0, the residue
term R equals c0 , which, in turn, is at most 2/3 times

(|+ | log x + |+ (t) log t| ) + (t)/ t + + (t) t
1
1

+ 2 (|+ | log x + |+ (t) log t| ) + (t)/ t + + (t) t
1
1

+ |+ (t)/t| + (t)/ t + + (t) t .
1
Using the bounds (B.38), (B.40), (B.41) (with the assumption H 50), (B.30)
and (B.33), we get that this means that
c0 18.15014 log x + 7.84532.
Since q = 1 and = 0, we get from (4.70) that

1/2
+
2||
|
|
(log q + 6.01) +,2
+,2 2 x
2
1/2
x
= 6.01 +,2
(162.56 log x + 59.325)x1/2 .
2
Using the assumption x 108 , we obtain

(4.71)
c0 + (162.56 log x + 59.325)x1/2 18.593 log x.
We now apply Lemma 4.4 as we may, because of the finiteness of the norms
we have already checked, together with
(4.72)
2

2

|+,2 (t) log t|2 +
(t) log t2 log x + +
(t)(log t)2 2
|+ (t) log t| (|+ (t)|2 log x + |+ (t) log t|2 )
0.40742 (0.80044 log x + 0.82999) 0.32396 log x + 0.33592
(by (B.40), (B.25) and (B.28); use the assumption H 50). We will also need
the bounds
(4.73)
|+,2 (t)|2 0.99811 log x + 0.32612
(from (4.69), by the norm bounds (B.38), (B.40) and (B.25), all with H 50)
and

+,2 (t)/ t (|+ (t)| log x + |+ (t) log t| ) + (t)/ t
1
1
(4.74)
1.24703 log x + 0.40745
(by
P (B.38), (B.40) (again with H 50) and (B.30)). We obtain that the sum
|G0 ()|x (where G0 () = M +,2 ()) over all non-trivial zeros with |()|
T0 is at most x1/2 times

(4.75)
p
p
(1.3221 log x + 0.6621) T0 log T0 + (3.2419 log x + 5.0188) T0
+ 43.1 log x + 14.1,
where we are bounding

norms by (4.73), (4.72)and (4.74). (We areusing
the fact
that T0 2 e to ensure that the quantity T0 log T0 (log 2 e) T0 being

multiplied by |+,2 |2 is positive; thus, an upper bound for |+,2 |2 suffices.) By
the assumptions x 108 , T0 150, (4.75) is at most
p
2.06 log x T0 log T0 + 43.866 log x.
Note that the term 18.539 log x from (4.71) is at most 0.002x1/2 log x.
It remains to bound the sum of M +,2 () over non-trivial zeros with |()| >
T0 . This we will do, as usual, by Lemma 4.3. For that, we will need to bound
M +,2 () for in the critical strip.
2
2
The Mellin transform of et is (s/2)/2, and so the Mellin transform of tet
2
is ((s + 1)/2)/2. By (2.6), this implies that the Mellin transform of (log t)tet
is ((s + 1)/2)/4. Hence, by (2.5),
Z
1
M (h2H )(ir) Fx (s ir) dr,
(4.76)
M +,2 (s) =
4
where
(4.77)
Fx (s) = (log x)
s+1
2
1
+
2
s+1
2
Moreover,
(4.78)
M (h2H )(ir)
1
=
2
M hH (iu)M hH (i(r u)) du,
and so M (h2H )(ir) is supported on [2H, 2H]. We also see that |M h2H (ir)|1
|M hH (ir)|21 /2. We know that |M hH (ir)|21 /2 41.73727 by (B.17).
Hence
Z
1
|M +,2 (s)|
|M (h2H )(ir)|dr max |Fx (s ir)|
4
|r|2H
(4.79)
41.73727
max |Fx (s ir)| 3.32135 max |Fx (s ir)|.

4
|r|2H
|r|2H
By [OLBC10, 5.6.9] (Stirling with explicit constants),
(4.80)
|(s)| 2|s|(s)1/2 e|(s)|/2 e1/6|z| ,
and so
(4.81)
p
|(s)| 2.526 |(s)|e|(s)|/2
for s C with 0 < (s) 1 and |(s)| 25. Moreover, by [OLBC10, 5.11.2]
and the remarks at the beginning of [OLBC10, 5.11(ii)],

1
1
1
(s)
= log s
+ O
(s)
2s
12|s|2 cos3 /2
for | arg(s)| < ( (, )). Again, for s = + i with 0 < 1 and | | 25,
this gives us
p

| |2 + 1
1
+O
+ O
| |
2| |

1
O (0.236)
1
+
= log | | + O
+
2| |2 2| |
| |2

0.53
.
= log | | + O
| |
(s)
= log | | + log
(s)
12| |2 cos3
1
arctan | |
2
Hence, for 1 (s) 1 and |(s)| 50,

1
1
0.53
s+1

|Fx (s)| (log x) + log + O
2
2
2
| /2|
2
p
1
2.526((log x) + log | | 0.335) | |e| |/2 .
2
(4.82)
Thus, by (4.79), for = + i with | | T0 2H + 50 and 1 1,

|M +,2 ()| f ( )
where

1
f (T ) = 8.39 log x + log T
2
(4.83)
r
(| |2H)
| |
4
H e
.
2
The functions t 7 tet/2 and t 7 (log t) tet/2 are decreasing for t 3/;
setting t = T /2 H, we see that the right side of (4.83) is a decreasing function
of T for T T0 , since T0 /2 H 25 > 3/.
We can now apply Lemma 4.3, and get that
X
(4.84)
non-trivial
|()|>T0
|M +,2 ()|
f (T )
T0
1
T
1
log
+
2
2 4T
dT.
Since T T0 150 > 2, we know that ((1/2) log(T /2)+1/4T ) (1/2) log T .
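Hence, the right side of (4.84) is at most
\[ \frac{8.39}{2\pi}\int_{T_0}^{\infty}\left((\log x)(\log T) + \frac{(\log T)^2}{2}\right)\sqrt{\frac{T}{2}}\;e^{-\pi(T-2H)/4}\,dT. \tag{4.85} \]
In general, for $T_0 \ge e^2$,
\[ \int_{T_0}^{\infty}\sqrt{T}(\log T)^2 e^{-\pi T/4}\,dT \le \frac{4}{\pi}\left(\sqrt{T_0}(\log T_0)^2 + \frac{2/\pi}{\sqrt{T_0}}(\log e^2T_0)^2\right)e^{-\pi T_0/4}, \]
\[ \int_{T_0}^{\infty}\sqrt{T}(\log T)\,e^{-\pi T/4}\,dT \le \frac{4}{\pi}\left(\sqrt{T_0}(\log T_0) + \frac{2/\pi}{\sqrt{T_0}}\log e^2T_0\right)e^{-\pi T_0/4}; \]
for $T_0 \ge 150$, the quantities on the right are at most $1.284\sqrt{T_0}(\log T_0)^2e^{-\pi T_0/4}$ and $1.281\sqrt{T_0}(\log T_0)e^{-\pi T_0/4}$, respectively. Thus, (4.84) and (4.85) give us that
\[ \sum_{\substack{\rho\ \text{non-trivial}\\ |\Im(\rho)| > T_0}} |M\eta_{+,2}(\rho)| \le \frac{8.39}{2\pi\sqrt{2}}\left(\frac{1.284}{2}(\log T_0)^2 + 1.281(\log x)(\log T_0)\right)\sqrt{T_0}\;e^{-\pi(T_0-2H)/4} \le \left(0.607(\log T_0)^2 + 1.21(\log x)(\log T_0)\right)\sqrt{T_0}\;e^{-\pi(T_0-2H)/4}. \]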
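As a quick check of the constants 1.284, 1.281, 0.607 and 1.21 above, one can evaluate the tail bounds at the worst case $T_0 = 150$. This is a floating-point sketch, not a proof; `tail_factor` is a helper name introduced here for illustration.

```python
import math

def tail_factor(T0, power):
    """Ratio of the tail bound to sqrt(T0) (log T0)^power e^{-pi T0/4}, i.e.
    (4/pi) * (1 + (2/pi)/T0 * (log(e^2 T0) / log T0)^power)."""
    ratio = math.log(math.e**2 * T0) / math.log(T0)
    return (4 / math.pi) * (1 + (2 / math.pi) / T0 * ratio**power)

c_sq = tail_factor(150.0, 2)    # for the (log T)^2 integral
c_lin = tail_factor(150.0, 1)   # for the (log T) integral

# Coefficients in the final bound: 8.39 / (2 pi sqrt 2) times 1.284/2 and 1.281.
prefactor = 8.39 / (2 * math.pi * math.sqrt(2))
coef_sq = prefactor * 1.284 / 2
coef_lin = prefactor * 1.281

print(c_sq, c_lin, coef_sq, coef_lin)
```

Since `tail_factor` decreases as $T_0$ grows, the value at $T_0 = 150$ is the largest one over the range $T_0 \ge 150$.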
4.6. A verification of zeros and its consequences. David Platt verified in his doctoral thesis [Pla11] that, for every primitive character $\chi$ of conductor $q \le 10^5$, all the non-trivial zeroes of $L(s, \chi)$ with imaginary part $\le 10^8/q$ lie on the critical line, i.e., have real part exactly $1/2$. (We call this a GRH verification up to $10^8/q$.)
In work undertaken in coordination with the present project [Plab], Platt has extended these computations to
• all odd $q \le 3\cdot 10^5$, with $T_q = 10^8/q$,
• all even $q \le 4\cdot 10^5$, with $T_q = \max(10^8/q,\ 200 + 7.5\cdot 10^7/q)$.
The method used was rigorous; its implementation uses interval arithmetic.
Let us see what this verification gives us when used as an input to Prop. 4.6. We are interested in bounds on $|\mathrm{err}_{\eta,\chi}(\delta, x)|$ for $q \le r$ and $|\delta| \le \delta_0 r/2q$. We set $r = 3\cdot 10^5$ and $\delta_0 = 8$, and so $|\delta| \le 4r/q$. (We will not be using the verification for $q$ even with $3\cdot 10^5 < q \le 4\cdot 10^5$.)
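We let $T_0 = 10^8/q$. Thus,
\[ T_0 \ge \frac{10^8}{3\cdot 10^5} = \frac{1000}{3}, \qquad \frac{T_0}{\pi\delta} \ge \frac{10^8/q}{\pi\cdot 4r/q} = \frac{1000}{12\pi}, \tag{4.86} \]
and so
\[ 4.329\,e^{-0.1598T_0} \le 3.184\cdot 10^{-23}, \qquad 0.802\,e^{-0.1065\frac{T_0^2}{(\pi\delta)^2}} \le 4.3166\cdot 10^{-33}. \]
Since $|\delta| \le 4r/q \le 1.2\cdot 10^6/q \le 1.2\cdot 10^6$ and $qT_0 \le 10^8$, this gives us
\[ \log\frac{qT_0}{2\pi}\cdot\left(4.329\,e^{-0.1598T_0} + 0.802|\delta|\,e^{-0.1065\frac{T_0^2}{(\pi\delta)^2}}\right) \le 5.28\cdot 10^{-22} + \frac{8.59\cdot 10^{-26}}{q} \le 5.281\cdot 10^{-22}. \]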
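Again by $T_0 = 10^8/q$,
\[ 2.337\sqrt{T_0}\log qT_0 + 21.817\sqrt{T_0} + 2.85\log q + 74.38 \]
is at most
\[ \frac{648662}{\sqrt{q}} + 111, \]
and
\[ 3\log q + 14|\delta| + 17 \le 55 + \frac{1.7\cdot 10^7}{q}, \qquad (\log q + 6)\cdot(1 + 5|\delta|) \le 19 + \frac{1.2\cdot 10^8}{q}. \]
Hence, assuming $x \ge 10^8$ to simplify, we see that Prop. 4.6 gives us that
\[ \mathrm{err}_{\eta,\chi}(\delta, x) \le 5.281\cdot 10^{-22} + \frac{\frac{648662}{\sqrt{q}} + 111}{\sqrt{x}} + \frac{55 + \frac{1.7\cdot 10^7}{q}}{x} + \frac{19 + \frac{1.2\cdot 10^8}{q}}{x^{3/2}} \le 5.281\cdot 10^{-22} + \frac{1}{\sqrt{x}}\left(\frac{650400}{\sqrt{q}} + 112\right) \]
for $\eta(t) = e^{-t^2/2}$. This proves Theorem 1.1.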
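The bookkeeping behind the final line can be replayed numerically. The sketch below (with helper names of our own choosing) recomputes the intermediate constants from $T_0 = 10^8/q$, $q \le r = 3\cdot 10^5$, and checks on a few sample values of $q$ and $x \ge 10^8$ that the four-term bound is indeed dominated by the simplified one.

```python
import math

R = 3 * 10**5   # r in the text; q <= r

# With T0 = 10^8/q we have q*T0 = 10^8 and sqrt(T0) = 10^4/sqrt(q).
log_qT0 = math.log(1e8)
main = math.log(1e8 / (2 * math.pi)) * 3.184e-23    # dominant part of the first term
sqrt_coeff = (2.337 * log_qT0 + 21.817) * 1e4       # coefficient of 1/sqrt(q)
const_term = 2.85 * math.log(R) + 74.38             # worst case q = r

def intermediate_rhs(q, x):
    """Four-term bound obtained from Prop. 4.6, as displayed in the text."""
    return (5.281e-22 + (648662 / math.sqrt(q) + 111) / math.sqrt(x)
            + (55 + 1.7e7 / q) / x + (19 + 1.2e8 / q) / x**1.5)

def simplified_rhs(q, x):
    """Simplified bound stated for eta(t) = exp(-t^2/2)."""
    return 5.281e-22 + (650400 / math.sqrt(q) + 112) / math.sqrt(x)

for q in (1, 2, 1000, R):
    for x in (1e8, 1e12, 1e30):
        print(q, x, intermediate_rhs(q, x) <= simplified_rhs(q, x))
```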

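Let us now see what Platt's calculations give us when used as an input to Prop. 4.8 and Cor. 4.9. Again, we set $r = 3\cdot 10^5$, $\delta_0 = 8$, $|\delta| \le 4r/q$ and $T_0 = 10^8/q$, so (4.86) is still valid. We obtain
\[ T_0\log\frac{qT_0}{2\pi}\cdot\left(3.5\,e^{-0.1598T_0} + 0.64\,e^{-0.1065\frac{T_0^2}{(\pi\delta)^2}}\right) \le \frac{4.269\cdot 10^{-14}}{q}. \]
We use the same bound when we have 0.0019 instead of 0.64 on the left side, as in (4.56). (The coefficient affects what is by far the smaller term, so we are wasting nothing.) Again by $T_0 = 10^8/q$ and $q \le r$,
\[ 1.22\sqrt{T_0}\log qT_0 + 5.053\sqrt{T_0} + 1.423\log q + 37.19 \le \frac{275263}{\sqrt{q}} + 55.2, \]
\[ 1.675\sqrt{T_0}\log qT_0 + 6.936\sqrt{T_0} + 1.954\log q + 51.047 \le \frac{377907}{\sqrt{q}} + 75.7. \]
For $x \ge 10^8$, we use $|\delta| \le 4r/q \le 1.2\cdot 10^6/q$ to bound
\[ (3 + 11|\delta|)x^{-1} + (\log q + 6)\cdot(1 + 6|\delta|)\cdot x^{-3/2} \le \left(0.0004 + \frac{1322}{q}\right)x^{-1/2}, \]
\[ (6 + 22|\delta|)x^{-1} + (\log q + 6)\cdot(3 + 17|\delta|)\cdot x^{-3/2} \le \left(0.0007 + \frac{2644}{q}\right)x^{-1/2}. \]
Summing, we obtain
\[ \mathrm{err}_{\eta,\chi} \le \frac{4.269\cdot 10^{-14}}{q} + \frac{1}{\sqrt{x}}\left(\frac{276600}{\sqrt{q}} + 56\right) \]
for $\eta(t) = t^2e^{-t^2/2}$ and
\[ \mathrm{err}_{\eta,\chi} \le \frac{4.269\cdot 10^{-14}}{q} + \frac{1}{\sqrt{x}}\left(\frac{380600}{\sqrt{q}} + 76\right) \]
for $\eta(t) = t^2e^{-t^2/2} *_M \eta_2(t)$. This proves Theorem 1.2 and Corollary 1.3.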
Now let us work with the smoothing weight $\eta_+$. This time around, set $r = 150000$ if $q$ is odd, and $r = 300000$ if $q$ is even. As before, we assume
\[ q \le r, \qquad |\delta| \le 4r/q. \]
We can see that Platt's verification [Plab], mentioned before, allows us to take
\[ T_0 = H + \frac{250r}{q}, \qquad H = 200, \]
since $T_q$ is always at least this ($T_q = 10^8/q > 200 + 3.75\cdot 10^7/q$ for $q \le 150000$ odd, $T_q \ge 200 + 7.5\cdot 10^7/q$ for $q \le 300000$ even).
Thus,
\[ T_0 - H \ge \frac{250r}{r} = 250, \qquad \frac{T_0 - H}{\pi\delta} \ge \frac{250r/q}{4\pi r/q} = \frac{250}{4\pi} = 19.89436\ldots \]
and also
\[ T_0 \le 200 + 250\cdot 150000 \le 3.751\cdot 10^7, \qquad qT_0 \le rH + 250r \le 1.35\cdot 10^8. \]
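The claim that $T_0 = H + 250r/q$ never exceeds the verified height $T_q$ can be checked exhaustively over all admissible $q$. The following sketch encodes the two heights exactly as quoted above; the function names are ours.

```python
import math

def platt_height(q):
    """Height T_q up to which GRH is verified for conductor q, as quoted from [Plab]."""
    if q % 2 == 1:
        if q > 3 * 10**5:
            raise ValueError("odd q out of range")
        return 1e8 / q
    if q > 4 * 10**5:
        raise ValueError("even q out of range")
    return max(1e8 / q, 200 + 7.5e7 / q)

def chosen_height(q, H=200):
    """T0 = H + 250 r / q, with r = 150000 for odd q and r = 300000 for even q."""
    r = 150000 if q % 2 == 1 else 300000
    if q > r:
        raise ValueError("q must be at most r")
    return H + 250 * r / q

admissible = [q for q in range(1, 300001) if q % 2 == 0 or q <= 150000]
worst_gap = min(platt_height(q) - chosen_height(q) for q in admissible)
max_qT0 = max(q * chosen_height(q) for q in admissible)
ratio = 250 / (4 * math.pi)    # lower bound for (T0 - H) / (pi * delta)

print(worst_gap, max_qT0, ratio)
```

For even $q$ the gap can be exactly zero, which is why the check is `>= 0` rather than a strict inequality.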
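Hence
\[ 9.462\sqrt{T_0 - H}\;e^{-0.1598(T_0-H)} + 11.287|\delta|\,e^{-0.1065\frac{(T_0-H)^2}{(\pi\delta)^2}} \le 4.2259\cdot 10^{-17}\sqrt{\frac{250r}{q}} + \frac{4r}{q}\cdot 5.57888\cdot 10^{-18} \le \frac{3.6598\cdot 10^{-13}}{\sqrt{q}} + \frac{6.6947\cdot 10^{-12}}{q}. \]
Examining (4.65), we get
\[ \mathrm{err}_{\eta_+,\chi}(\delta, x) \le \log\frac{1.35\cdot 10^8}{2\pi}\cdot\left(\frac{3.6598\cdot 10^{-13}}{\sqrt{q}} + \frac{6.6947\cdot 10^{-12}}{q}\right) \]
\[ \qquad + \left(\left(1.631\log(1.35\cdot 10^8) + 12.42\right)\sqrt{\frac{1.35\cdot 10^8}{q}} + 1.321\log 300000 + 34.51\right)x^{-1/2} \]
\[ \qquad + \left(9 + 11\cdot\frac{1.2\cdot 10^6}{q}\right)x^{-1} + (\log 300000)\left(11 + 6\cdot\frac{1.2\cdot 10^6}{q}\right)x^{-3/2} \]
\[ \le \frac{6.18\cdot 10^{-12}}{\sqrt{q}} + \frac{1.14\cdot 10^{-10}}{q} + \left(\frac{499076}{\sqrt{q}} + 51.17 + \frac{1.32\cdot 10^7}{q\sqrt{x}} + \frac{9}{\sqrt{x}} + \frac{9.1\cdot 10^7}{qx} + \frac{139}{x}\right)\frac{1}{\sqrt{x}}. \]
Making the assumption $x \ge 10^{12}$, we obtain
\[ \mathrm{err}_{\eta_+,\chi}(\delta, x) \le \frac{6.18\cdot 10^{-12}}{\sqrt{q}} + \frac{1.14\cdot 10^{-10}}{q} + \frac{1}{\sqrt{x}}\left(\frac{499100}{\sqrt{q}} + 52\right). \]
This proves Theorem 1.4 for general $q$.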

Let us optimize things a little more carefully for the trivial character $\chi_T$. Again, we will make the assumption $x \ge 10^{12}$. We will also assume, as we did before, that $|\delta| \le 4r/q$; this now gives us $|\delta| \le 600000$, since $q = 1$ and $r = 150000$ for $q$ odd. We will go up to a height $T_0 = H + 600000\pi\cdot t$, where $H = 200$ and $t \ge 10$. Then
\[ \frac{T_0 - H}{\pi\delta} \ge \frac{600000\pi t}{4\pi r} = t. \]
Hence
\[ 9.462\sqrt{T_0 - H}\;e^{-0.1598(T_0-H)} + 11.287|\delta|\,e^{-0.1065\frac{(T_0-H)^2}{(\pi\delta)^2}} \le 10^{-1300000} + 6773000\,e^{-0.1065t^2}. \]
Looking at (4.65), we get
\[ \mathrm{err}_{\eta_+,\chi_T}(\delta, x) \le \log\frac{T_0}{2\pi}\cdot\left(10^{-1300000} + 6773000\,e^{-0.1065t^2}\right) + \left((1.631\log T_0 + 12.42)\sqrt{T_0} + 34.51\right)x^{-1/2} + 6600009\,x^{-1}. \]
The value $t = 20$ seems good enough; we choose it because it is not far from optimal in the range $10^{27} \le x \le 10^{30}$. We get that $T_0 = 12000000\pi + 200$; since $T_0 < 10^8$, we are within the range of the computations in [Plab] (or for that matter [Wed03] or [Plaa]). We obtain
\[ \mathrm{err}_{\eta_+,\chi_T}(\delta, x) \le 3.34\cdot 10^{-11} + \frac{251100}{\sqrt{x}}. \]
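For the choice $t = 20$, the two constants in the last display can be recomputed directly. The sketch below drops the term $10^{-1300000}$, which is far below floating-point range and plays no role at this precision; `err_bound` is a name introduced here for illustration only.

```python
import math

H, t = 200, 20
T0 = H + 600000 * math.pi * t                       # = 12000000*pi + 200

gauss_term = 6773000 * math.exp(-0.1065 * t**2)
const_part = math.log(T0 / (2 * math.pi)) * gauss_term
sqrt_coeff = (1.631 * math.log(T0) + 12.42) * math.sqrt(T0) + 34.51

def err_bound(x):
    """Right-hand side of the bound obtained from (4.65), for the trivial character."""
    return const_part + sqrt_coeff / math.sqrt(x) + 6600009 / x

# For x >= 10^12 we have 1/x <= 10^{-6}/sqrt(x), so the x^{-1} term
# contributes at most 6600009e-6 to the coefficient of 1/sqrt(x).
print(T0, const_part, sqrt_coeff + 6600009e-6)
```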
Lastly, let us look at the sum estimated in (4.67). Here it will be enough to go up to just $T_0 = 2H + \max(50, H/4) = 450$, where, as before, $H = 200$. Of course, the verification of the zeros of the Riemann zeta function does go that far; as we already said, it goes until $10^8$ (or rather more: see [Wed03] and [Plaa]). We make, again, the assumption $x \ge 10^{12}$. We look at (4.68) and obtain
\[ \mathrm{err}_{\ell_2,\eta_+} \le \left(0.607\,\frac{(\log 450)^2}{\log 10^{12}} + 1.21\log 450\right)\sqrt{450}\;e^{-\frac{\pi}{4}\cdot 50} + \left(2.06\sqrt{450}\log 450 + 43.87\right)x^{-1/2} \le 1.536\cdot 10^{-15} + \frac{310.84}{\sqrt{x}}. \tag{4.87} \]
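It remains only to estimate the integral in (4.67). First of all,
\[ \int_0^\infty \eta_+^2(t)\log xt\,dt = \int_0^\infty \eta_\circ^2(t)\log xt\,dt + 2\int_0^\infty (\eta_+(t) - \eta_\circ(t))\,\eta_\circ(t)\log xt\,dt + \int_0^\infty (\eta_+(t) - \eta_\circ(t))^2\log xt\,dt. \]
The main term will be given by
\[ \int_0^\infty \eta_\circ^2(t)\log xt\,dt = \left(0.64020599736635 + O^*\left(10^{-14}\right)\right)\log x - 0.021094778698867 + O^*\left(10^{-15}\right), \]
where the integrals were computed rigorously using VNODE-LP [Ned06]. (The integral $\int_0^\infty \eta_\circ^2(t)\,dt$ can also be computed symbolically.) By Cauchy-Schwarz and the triangle inequality,
\[ \int_0^\infty (\eta_+(t) - \eta_\circ(t))\,\eta_\circ(t)\log xt\,dt \le |\eta_+ - \eta_\circ|_2\,|\eta_\circ(t)\log xt|_2 \le |\eta_+ - \eta_\circ|_2\left(|\eta_\circ|_2\log x + |\eta_\circ\cdot\log|_2\right) \]
\[ \le \frac{274.86}{H^{7/2}}\,(0.80013\log x + 0.214) \le 1.944\cdot 10^{-6}\log x + 5.2\cdot 10^{-7}, \]
where we are using (B.23) and evaluate $|\eta_\circ\cdot\log|_2$ rigorously as above. By (B.23) and (B.24),
\[ \int_0^\infty (\eta_+(t) - \eta_\circ(t))^2\log xt\,dt \le \left(\frac{274.86}{H^{7/2}}\right)^2\log x + \frac{27428}{H^7} \le 5.903\cdot 10^{-12}\log x + 2.143\cdot 10^{-12}. \]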
We conclude that
\[ \int_0^\infty \eta_+^2(t)\log xt\,dt = \left(0.640206 + O^*(1.95\cdot 10^{-6})\right)\log x - 0.021095 + O^*(5.3\cdot 10^{-7}). \tag{4.88} \]
We add to this the error term $1.536\cdot 10^{-15} + 310.84/\sqrt{x}$ from (4.87), and simplify using the assumption $x \ge 10^{12}$. We obtain:
\[ \sum_{n=1}^{\infty}\Lambda(n)(\log n)\,\eta_+^2(n/x) = 0.640206\,x\log x - 0.021095\,x + O^*\left(2\cdot 10^{-6}\,x\log x + 310.84\sqrt{x}\log x\right), \tag{4.89} \]
and so Prop. 4.12 gives us Proposition 1.5.
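The two constants in the main term are the integrals $\int_0^2 \eta_\circ^2(t)\,dt$ and $\int_0^2 \eta_\circ^2(t)\log t\,dt$, with $\eta_\circ$ as in (B.2). The paper computes them rigorously with VNODE-LP; the sketch below merely reproduces the leading digits with a composite Simpson rule in ordinary floating point, and also replays the arithmetic with $H = 200$. All function names are ours.

```python
import math

def eta_circ(t):
    """eta_circ(t) = t^3 (2 - t)^3 exp(-(t - 1)^2 / 2) on [0, 2], zero elsewhere; see (B.2)."""
    if t <= 0.0 or t >= 2.0:
        return 0.0
    return t**3 * (2 - t)**3 * math.exp(-(t - 1)**2 / 2)

def simpson(f, a, b, n=4000):
    """Composite Simpson rule with n (even) subintervals; not a rigorous enclosure."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

norm_sq = simpson(lambda t: eta_circ(t)**2, 0.0, 2.0)
log_moment = simpson(lambda t: eta_circ(t)**2 * math.log(t) if t > 0 else 0.0, 0.0, 2.0)

# Cross term and square term for H = 200, using the constants from (B.23) and (B.24).
H = 200
cross_log_x = 274.86 / H**3.5 * 0.80013
cross_const = 274.86 / H**3.5 * 0.214
square_log_x = (274.86 / H**3.5)**2
square_const = 27428 / H**7

print(norm_sq, log_moment, cross_log_x, cross_const, square_log_x, square_const)
```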

As we can see, the relatively large error term $4\cdot 10^{-6}$ comes from the fact that we have wanted to give the main term in (4.67) as an explicit constant, rather than as an integral. This is satisfactory; Prop. 1.5 is an auxiliary result needed for [Helc], as opposed to Thms. 1.1–1.4, which, while crucial for [Helc], are also of general applicability and interest.
Appendix A. Extrema via bisection and truncated series
In the above, we found ourselves several times in the following familiar situation. Let $f : I \to \mathbb{R}$, $I \subset \mathbb{R}$. We wish to find the minima and maxima of $f$ in $I$ numerically, but rigorously.
(This is a situation in which a “proof by plot” would be convincing, but not, of course, rigorous.)
The bisection method (as described in, e.g., [Tuc11, §5.2]) can be used to show that the minimum (or maximum) of $f$ on a compact interval $I$ lies within an interval (usually a very small one). We will need to complement it by other arguments if either (a) $I$ is not compact, or (b) we want to know the minimum or maximum exactly.
As in §3.2, let $j(\rho) = (1 + \rho^2)^{1/2}$ and $\upsilon(\rho) = \sqrt{(1 + j(\rho))/2}$ for $\rho \ge 0$. Let $\Upsilon$, $\cos\theta_0$, $\sin\theta_0$, $c_0$ and $c_1$ be understood as one-variable real-valued functions on $\rho$, given by (3.26), (3.28) and (3.34).
First, let us bound $\Upsilon(\rho)$ from below. By the bisection method^6 applied with 32 iterations,
\[ 0.798375987 \le \min_{0\le\rho\le 10}\Upsilon(\rho) \le 0.798375989. \]
Since $j(\rho) \ge \rho$ and $\upsilon(\rho) \ge \sqrt{j(\rho)/2} \ge \sqrt{\rho/2}$,
\[ 0 \le \frac{\rho}{2\upsilon(\rho)(\upsilon(\rho) + j(\rho))} \le \frac{\rho}{\sqrt{2}\,\rho^{3/2}} = \frac{1}{\sqrt{2\rho}}, \]
and so
\[ \Upsilon(\rho) \ge 1 - \frac{\rho}{2\upsilon(\rho)(\upsilon(\rho) + j(\rho))} \ge 1 - \frac{1}{\sqrt{2\rho}}. \tag{A.1} \]
Hence $\Upsilon(\rho) \ge 0.8418$ for $\rho \ge 20$. We conclude that
\[ 0.798375987 \le \min_{\rho\ge 0}\Upsilon(\rho) \le 0.798375989. \tag{A.2} \]
^6 Implemented by the author from the description in [Tuc11, p. 87–88], using D. Platt's interval arithmetic package.
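The bisection argument can be illustrated with a small branch-and-bound routine. The sketch below assumes the closed form $\Upsilon(\rho) = \sqrt{1 + x^2} - x$ with $x = \rho/(2\upsilon(\upsilon + j))$, which is how we read (A.1) and (A.12); the definition itself, (3.26), lies outside this section. It uses plain floating point, so, unlike the interval-arithmetic implementation described in the text, it is not rigorous. On each subinterval $[a, b]$ the lower bound uses only monotonicity: $\rho \le b$, the denominator is increasing, and $x \mapsto \sqrt{1+x^2} - x$ is decreasing.

```python
import math

def denom(rho):
    """2 v (v + j), with j = sqrt(1 + rho^2) and v = sqrt((1 + j)/2); increasing in rho."""
    j = math.sqrt(1.0 + rho * rho)
    v = math.sqrt((1.0 + j) / 2.0)
    return 2.0 * v * (v + j)

def g(x):
    """x -> sqrt(1 + x^2) - x, which is decreasing for x >= 0."""
    return math.sqrt(1.0 + x * x) - x

def upsilon(rho):
    return g(rho / denom(rho))

def lower_bound_on(a, b):
    """Lower bound for upsilon on [a, b]: there rho <= b and denom(rho) >= denom(a)."""
    return g(b / denom(a))

def bisect_min(lo=0.0, hi=10.0, iterations=24):
    intervals = [(lo, hi)]
    best_upper = min(upsilon(lo), upsilon(hi))
    for _ in range(iterations):
        halves = []
        for a, b in intervals:
            m = 0.5 * (a + b)
            best_upper = min(best_upper, upsilon(m))
            halves.append((a, m))
            halves.append((m, b))
        # Discard every piece whose lower bound already exceeds a known value of upsilon.
        intervals = [(a, b) for a, b in halves if lower_bound_on(a, b) <= best_upper]
    best_lower = min(lower_bound_on(a, b) for a, b in intervals)
    return best_lower, best_upper

lower, upper = bisect_min()
print(lower, upper)
```

With 24 iterations the two bounds already agree to about seven decimal places; the 32 iterations quoted in the text narrow the enclosure further.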
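Now let us bound $c_0(\rho)$ from below. For $\rho \ge 8$,
\[ \sin\theta_0 = \sqrt{\frac{1}{2} - \frac{1}{2\upsilon}} \ge \sqrt{\frac{1}{2} - \frac{1}{\sqrt{2\rho}}} \ge \frac{1}{2}, \]
whereas $\cos\theta_0 \ge 1/\sqrt{2}$ for all $\rho \ge 0$. Hence, by (A.2),
\[ c_0(\rho) \ge \frac{0.7983}{\sqrt{2}} + \frac{1}{2} > 1.06 \tag{A.3} \]
for $\rho \ge 8$. The bisection method applied with 28 iterations gives us that
\[ \min_{0.01\le\rho\le 8} c_0(\rho) \ge 1 + 5\cdot 10^{-8} > 1. \tag{A.4} \]
It remains to study $c_0(\rho)$ for $\rho \in [0, 0.01]$. The method we are about to give actually works for all $\rho \in [0, 1]$.
Since
\[ \left(\sqrt{1+x}\right)' = \frac{1}{2\sqrt{1+x}}, \qquad \left(\sqrt{1+x}\right)'' = -\frac{1}{4(1+x)^{3/2}}, \]
\[ \left(\frac{1}{\sqrt{1+x}}\right)' = -\frac{1/2}{(1+x)^{3/2}} = -\frac{1}{2(1+x)^{3/2}}, \qquad \left(\frac{1}{\sqrt{1+x}}\right)'' = \frac{3/4}{(1+x)^{5/2}}, \]
a truncated Taylor expansion gives us that, for $x \ge 0$,
\[ 1 + \frac{1}{2}x - \frac{1}{8}x^2 \le \sqrt{1+x} \le 1 + \frac{1}{2}x, \qquad 1 - \frac{1}{2}x \le \frac{1}{\sqrt{1+x}} \le 1 - \frac{1}{2}x + \frac{3}{8}x^2. \tag{A.5} \]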
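The four inequalities in (A.5) are easy to spot-check on a grid. This is only an illustration in floating point (the proof is the Taylor argument above); `check_a5` is a name chosen here.

```python
import math

def check_a5(x):
    """Return True if both two-sided bounds of (A.5) hold at x >= 0."""
    s = math.sqrt(1 + x)
    sqrt_ok = 1 + x / 2 - x * x / 8 <= s <= 1 + x / 2
    inv_ok = 1 - x / 2 <= 1 / s <= 1 - x / 2 + 3 * x * x / 8
    return sqrt_ok and inv_ok

grid = [k / 100 for k in range(0, 1001)]    # x in [0, 10]
all_ok = all(check_a5(x) for x in grid)
print(all_ok)
```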
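Hence, for $\rho \ge 0$,
\[ 1 + \rho^2/2 - \rho^4/8 \le j(\rho) \le 1 + \rho^2/2, \qquad 1 + \rho^2/8 - 5\rho^4/128 + \rho^6/256 - \rho^8/2048 \le \upsilon(\rho) \le 1 + \rho^2/8, \tag{A.6} \]
and so
\[ \upsilon(\rho) \ge 1 + \rho^2/8 - 5\rho^4/128 \tag{A.7} \]
for $\rho \le \sqrt{8}$. We also get from (A.5) that
\[ \frac{1}{\upsilon(\rho)} = \frac{1}{\sqrt{1 + \frac{j(\rho)-1}{2}}} \ge 1 - \frac{1}{2}\cdot\frac{j(\rho)-1}{2} \ge 1 - \frac{\rho^2}{8}, \]
\[ \frac{1}{\upsilon(\rho)} = \frac{1}{\sqrt{1 + \frac{j(\rho)-1}{2}}} \le 1 - \frac{1}{2}\cdot\frac{j(\rho)-1}{2} + \frac{3}{8}\left(\frac{j(\rho)-1}{2}\right)^2 \le 1 - \frac{1}{2}\left(\frac{\rho^2}{4} - \frac{\rho^4}{16}\right) + \frac{3}{8}\cdot\frac{\rho^4}{16} = 1 - \frac{\rho^2}{8} + \frac{7\rho^4}{128}. \tag{A.8} \]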
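Hence
\[ \sin\theta_0 = \sqrt{\frac{1}{2} - \frac{1}{2\upsilon(\rho)}} \ge \sqrt{\frac{\rho^2}{16} - \frac{7\rho^4}{256}} = \frac{\rho}{4}\sqrt{1 - \frac{7}{16}\rho^2}, \qquad \sin\theta_0 \le \sqrt{\frac{\rho^2}{16}} = \frac{\rho}{4}, \tag{A.9} \]
while
\[ \cos\theta_0 = \sqrt{\frac{1}{2} + \frac{1}{2\upsilon(\rho)}} \ge \sqrt{1 - \frac{\rho^2}{16}}, \qquad \cos\theta_0 \le \sqrt{1 - \frac{\rho^2}{16} + \frac{7\rho^4}{256}}. \tag{A.10} \]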
By (A.6) and (A.8),
1 2 /8
2
2( + j)
2 2 + 5 /8
2
(A.11)
Assuming 0 1,
1
1+
52
16
94
64
52 94
+
+
1
16
64
1 32
2
32
52 94
16
64
2 !
33
.
4
64
52 614
+
,
16
256
and so, by (A.7) and (A.8),

2
7
1 8 + 128
2( + j)
2 2 + 52 214
8 2 128 4

52 464
+
+
1
1
4
8
128
16
256

2
35 4
81 6 161 8
7
73 355
+

+ 14
+
.
1
4
16
128
2048
2
4
64
512
Hence, we obtain
s
(A.12)
2
() = 1 +
2( + j)
2( + j)

2

2
73 355
1 33
1 2
1+
+
2 4
64
8 16
4
64
512

2
3
5
3
7
1
96

35
+
+
+ 13
1 +
4
4 32
64
256 2048
512
2
2
3
4
7
165

+
,
1 +
4 32
64
2048

where, in the last line, we use again the assumption 1.

For x [1/4, 0],
1
x2
x2
1
x
1+x 1+ x
=
1
+
2
2 4(1 1/4)3/2
2 33/2
x2
1
1
1 + x.
1+x 1+ x
2
8
2
Hence
(A.13)
r
2
2
4
74
2
74
cos 0 1
1
3/2
+
1
+
32 3 256
16 256
32 512

49
7 2
4 sin 0
1 3/2
4
32
4
3 256
for 1. Therefore,
c0 () = () cos 0 + sin 0

73
165 4
4
2
2
+
3/2
1
1 +
4 32
64
2048
32 3 256
7 3
49
+
5
3/2
4 128
3 1024 !
!
!
3
7 3 3
7
167
3
3
1+
+
+
+
+
4
16
2304 2048
2048 192
147456
3
0.09494 ,
16
where we are again using $\rho \le 1$. We conclude that, for all $\rho \in (0, 1/2]$,
1+
\[ c_0(\rho) > 1. \]
Together with (A.3) and (A.4), this shows that
\[ c_0(\rho) > 1 \qquad \forall\rho > 0. \tag{A.14} \]
It is easy to check that $c_0(0) = 1$.

(The truncated-Taylor series estimates above could themselves have been done
automatically; see [Tuc11, Ch. 4] (automatic differentiation). The footnote in
[Tuc11, p. 72] (referring to the work of Berz and Makino [BM98] on Taylor
models) seems particularly relevant here. We have preferred to do matters by
hand in the above.)
Now let us examine (), given as in (3.46). Let us first focus on the case of
large. We can use the lower bound (A.1) on (). To obtain a good upper
bound on (), we need to get truncated series expansions on 1/ for and j.
These are:
(A.15)
r

p
1
1
1
2
j() = + 1 = 1 + 2 1 + 2 = + ,
2
2
r
r
r r
r

1
1
1
1
1+j
1
() =
1+ + 2
+ +
=
1+
,
2
2 2 4
2
2
2
2
p
p
together with the trivial bounds j() and () j()/2 /2. By
(A.15),

q
2
1
+
1
1

2 q =

q
q
2
2

1
2
1
1
+
1 + 2
1 + 2
2
2
2
2
(A.16)

q
2
2
1
+
2
8

q + 3/2
=
2

+ 12
1 2 +
1+ 2
for 15, and so
j
2+
2

(A.17)
for 15. In fact, the bisection method (applied with 20 iterations, including 10
initial iterations after which the possibility of finding a minimum within each
interval is tested) shows that (A.16) (and hence (A.17)) holds for all 1. By
(A.15),

q

2( + j)
1
1
2 1 + 12
1
+
+
+
2
2
2
(A.18)
1
1
1
1
1

2 1 + 12 + 1
2 2
23/2
for 16. (Again, (A.18) is also true for 1 16 by the bisection method; it
is trivially true for [0, 1], since the last term of (A.18) is then negative.) We
also have the easy upper bound
1
1
1
1
q
q
+

(A.19)
=
2( + j)
2 + 1
2 2 (2)3/2
2 ( + )
2
valid for 1/2.

Hence, by (A.5), (A.18) and (A.19),
s
2

= 1+
2( + j)
2( + j)

2
1
1
1
1
1
1
1
1

+
1 +
+
1+
3/2
2
2 2
2 2
2
2
for 3. Again,
we use the bisection method (with 20 iterations) on [1/2, 3],
and note that 1/ 2 < 1/ for < 1/2; we thus obtain
(A.20)
1
1
1 +
2
for all > 0.

We recall (3.46) and the lower bounds (A.17) and (A.1). We get
s
r

2 !
1
1
1
8
1 2
1
1+ 1
1 +
2+
2
2
2
2

1
1
1
1
+
1 +
2
2 + 1
2
!

1
1
2
1
2
5
1
(A.21)
1+
2 +
1 +
2
2
2 4
2 2

1
1
1
1
+ 1 1 + 2 1 +
2
2
2
1
1
1
1
1
9
37
1+
2+
3 1+
2 4 8
2 16
25/2
for 2. This implies that () > 1 for 11. (Since our estimates always
give an error of at most O(1/ ), we also get lim () = 1.) The bisection
method (with 20 iterations, including 6 initial iterations) gives that () > 1 also
holds for 1 11.
Let us now look at what happens for 1. From (A.12), we get the simpler
bound
(A.22)
valid for 1, implying that

2 1
33
2
+
+
1
4 32
32
4
234
2 113
+
+
2
8
64
1024
for 1. We also have, by (3.26) and (A.11),

2

33
1
1 2
1+
1+
2 2( + j)
2( + j)
2 4
4
64
(A.23)
2
3
2

3
5
1 +
+
1 +
4 32
64
4
64
for 1. (This immediately implies the easy bound 1, which follows
anyhow from (3.25) for all 0.)
By (A.6),
2
1 + 2 /2 4 /8
8
1 + 2 /2 4 /8
j

4
2
2
2
2
2
1 + 8
1 + 8
8 + 64
for 1. Therefore, by (3.46),

r

2
1 1
34
1
2 113
1
52
8
+
+
2 +
1 +
2
2
8
64
128
2
4
64
2 2 2
2

2
3
2
4
11
3
1
4 3 152 33
7
1+ +
1 +
+
4
32
64
2
2
32
2
2
64
64
4 3

2
for 1. This implies the bound () > 1 for all 1. Conversely, ()
4/ 3/2 follows from () > 1 for > 8/5. We check () 4/ 3/2 for
[1, 8/5] by the bisection method (5 iterations).
We conclude that, for all > 0,

4 3
(A.24)
max 1,
.
2
This bound has the right asymptotics for 0+ and +.
Let us now bound c0 from above. By (A.13) and (A.23),

52
74
2
c0 () = () cos 0 + sin 0 1 +
+
1
+
4
64
32 512
4
(A.25)
3
4
5
6
2
2
23
7
35
3
+
+
+ 15 1 +
1+
64
128 2048 2048
2
15
for 1. Since 1 and 0 [0, /4] [0, /2], the bound
(A.26)
c0 () cos 0 + sin 0 2
holds for all 0. By (A.20), we also know that, for 2,

1
1
cos 0 + sin 0
c0 () 1 +
2
s
(A.27)

1
1
1 2
9
1 +
+1 2 1 +
.
2
2 2 16
From (A.24) and (A.26), we obtain that
(A.28)

1
1 + 2c20 1 (1 + 2 2) = 5
for all 0. At the same time, (A.24) and (A.25) imply that

4 3 1
42 24
1
2
+ 2
1 + 2c0
3+
2
15
15

1

2

4
3
3
4
3
1+
+
=
1
1+
4
8
45
675
4
2
for 0.4. Hence (1 + 2c20 )/ 0.86 for < 0.29. The bisection method
(20 iterations, starting by splitting the range into 28 equal intervals) shows that
(1 + 2c20 )/ 0.86 also holds for 0.29 6; for > 6, the same inequality
holds by (A.28).
We have thus shown that
(A.29)
1 + 2c20
min(5, 0.86)
for all > 0.

p
Now we wish to bound ( 2 )/2 from below. By (A.7) and (A.6),
2

2
54
2
1+
2 1 +
8
128
8
(A.30)

2
52
2 54
1
2
2 54
4
=1+
,
1+
4
64
128 8
8
8
64
for 1, and so
r
52
,
1
2
4
8
and this is greater than /6p
for 1/3. The bisection method (20 iterations, 5
initial steps) confirms that ( 2 )/2 > /6 also holds for 2/3 < 4. On
the other hand, by (A.15) and 2 = (1 + j)/2 (1 + )/2,
v
q

u
r
u +1 1 + 1
r

s
2
1
2 t 2
2
1
2
1+
1+
2
2
2
2
(A.31)
s
r
r
1
1
2
1
1
3/2
1
=
2
2
2
2
2
2
p
for 4. We check by the bisection method (20 iterations) that ( 2 )/2
/2 1/23/2 also holds for all 0 4.

r
We conclude that
(A.32)
(
/6
1
23/2
if 4,
for all .
We still have a few other inequalities to check. Let us first derive an easy lower
bound on c1 () for large: by (A.1), (A.16) and (A.5),
r
r

s

1
1 + 1/
1
2
8
1
1
+
c1 () =
1
2
2
3/2
2
2
r
r

2
2
1
1
3
1
1+
1
1
=
4
2 4
2
for 1. Together with (A.27), this implies that, for 2,

1
9
1 + 9
2
1 12 + 8
2
16
1
c0 1/ 2
2 2

,
3
2 2 1 3
2c1
2 2 1 4
4
again for 1. This is 1/ 8 for 8. Hence it is 1/ 8 25 < 0.071 for

25.
Let us now look at small. By (A.6),

2

2
2 54
2 94
2 1 +
1+
+
=
8
8
32
8
32
for any > 0. Hence, by (A.8) and (A.22),
s
r

4
5 2
1 + 1/
2 2 /8
,
1
c1 () =
4
2
2
4
4
4
+ 9
8
32
whereas, for 1,
c0 () = () cos 0 + sin 0 1 + sin 0 1 + /4
by (A.13). Thus
1 + 4 12
c0 1/ 2

0.0584
2c1
2 4 1 54 2 1 4
for 0.1.
We check the remaining interval [0.1, 25] (or [0.1, 8], if we aim at the
bound 1/ 8) by the bisection method (with 24 iterations, including 12 initial
iterations or 15 iterations and 10 initial iterations, in the case of [0.1, 8]) and
obtain that
c0 1/ 2
0.0763896
0.0763895 max
0
2c1
(A.33)
1
c0 1/ 2
.
sup
2
1
0
In the same way, we see that
1
c0
1
3 0.171
c1
1 4
for 36 and
1 + 4
c0

0.267
c1
4 1 54 2 1 4
for 0.1. The bisection method applied to [0.1, 36] with 24 iterations (including
12 initial iterations) now gives
c0
0.29888.
(A.34)
0.29887 max
>0 c1
We would also like a lower bound for c0 /c1 . For c0 , we can use the lower bound
c0 1 given by (A.14). By (A.8), (A.23) and (A.30),
s
r

2
74
2 8 + 128
52
1 + 1/
1 +
c1 () =
2
2 /8 54 /64
4
64

2
2
4
5
5
4
1+
1 +
<
16
4
64
for 1/4. Thus, c0 /(c1 ) 1/4 for [0, 1/4]. The bisection method (with 20
iterations, including 10 initial iterations) gives us that c0 /(c1 ) 1/4 also holds
for [1/4, 6.2]. Hence
c0
c1
4
for 6.2.
Now consider the case of large . By and 1,
p
( 2 )/
1 1 1/ 2
c0
1/
p
.
(A.35)
p
q
1+1/
c1
2 1 + 1/
1 + 1/
(This is off from optimal by a factor of about 2.) For 200, (A.35) implies
that c0 /(c1 ) 0.6405. The bisection method (with 20 iterations, including
5 initial iterations) gives us c0 /(c1 ) 5/8 = 0.625 for [6.2, 200]. We

conclude that

5
c0
.
min
,
(A.36)
c1
4 8
Finally, we verify an inequality that will be useful for the estimation of a crucial
exponent in Thm. 3.1. We wish to show that, for all [0, /2],
(A.37)
5 sin3
sin
sin 2
4 cos2 2
2 cos2 24 cos6
The left side is positive for all (0, /2], since cos2 /2 1/ 2 and (sin 2)/2
is less than 2/2 = . The right side is negative for > 1 (since it is negative
for = 1, and (sin )/(cos )2 is increasing on ). Hence, it is enough to check
(A.37) for [0, 1]. The two sides of (A.37) are equal for = 0; moreover,
the first four derivatives also match at = 0. We take the fifth derivatives of
both sides; the bisection method (running on [0, 1] with 20 iterations, including
10 initial iterations) gives us that the fifth derivative of the left side minus the
fifth derivative of the right side is always positive on [0, 1] (and minimal at 0,
where it equals 30.5 + O 109 ).
Appendix B. Norms of smoothing functions
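Our aim here is to give bounds on the norms of some smoothing functions and, in particular, on several norms of a smoothing function $\eta_+ : [0, \infty) \to \mathbb{R}$ based on the Gaussian $\eta_\heartsuit(t) = e^{-t^2/2}$.
As before, we write
\[ h : t \mapsto \begin{cases} t^2(2-t)^3e^{t-1/2} & \text{if } t \in [0, 2],\\ 0 & \text{otherwise.} \end{cases} \tag{B.1} \]
We recall that we will work with an approximation $\eta_+$ to the function $\eta_\circ : [0, \infty) \to \mathbb{R}$ defined by
\[ \eta_\circ(t) = h(t)\,\eta_\diamondsuit(t) = \begin{cases} t^3(2-t)^3e^{-(t-1)^2/2} & \text{for } t \in [0, 2],\\ 0 & \text{otherwise.} \end{cases} \tag{B.2} \]
The approximation $\eta_+$ is defined by
\[ \eta_+(t) = h_H(t)\,te^{-t^2/2} \tag{B.3} \]
where
\[ F_H(y) = \frac{\sin(H\log y)}{\pi\log y}, \qquad h_H(t) = (h *_M F_H)(t) = \int_0^\infty h(ty^{-1})\,F_H(y)\,\frac{dy}{y} \tag{B.4} \]
and $H$ is a positive constant to be set later. By (2.4), $Mh_H = Mh\cdot MF_H$. Now $F_H$ is just a Dirichlet kernel under a change of variables; using this, we get that, for $\tau$ real,
\[ MF_H(i\tau) = \begin{cases} 1 & \text{if } |\tau| < H,\\ 1/2 & \text{if } |\tau| = H,\\ 0 & \text{if } |\tau| > H. \end{cases} \tag{B.5} \]
Thus,
\[ Mh_H(i\tau) = \begin{cases} Mh(i\tau) & \text{if } |\tau| < H,\\ \frac{1}{2}Mh(i\tau) & \text{if } |\tau| = H,\\ 0 & \text{if } |\tau| > H. \end{cases} \tag{B.6} \]
As it turns out, $h$, $\eta_\circ$ and $Mh$ (and hence $Mh_H$) are relatively easy to work with, whereas we can already see that $h_H$ and $\eta_+$ have more complicated definitions. Part of our work will consist in expressing norms of $h_H$ and $\eta_+$ in terms of norms of $h$, $\eta_\circ$ and $Mh$.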
B.1. The decay of $Mh(i\tau)$. Now, consider any $\phi : [0, \infty) \to \mathbb{C}$ that (a) has compact support (or fast decay), (b) satisfies $\phi^{(k)}(t)\,t^{k-1} = O(1)$ for $t \to 0^+$ and $0 \le k \le 3$, and (c) is $C^2$ everywhere and quadruply differentiable outside a finite set of points.
By definition,
\[ M\phi(s) = \int_0^\infty \phi(x)\,x^s\,\frac{dx}{x}. \]
Thus, by integration by parts, for $\Re(s) > -1$ and $s \ne 0$,
\[ M\phi(s) = \int_0^\infty \phi(x)x^s\frac{dx}{x} = \lim_{t\to 0^+}\int_t^\infty \phi(x)x^s\frac{dx}{x} = -\lim_{t\to 0^+}\int_t^\infty \phi'(x)\frac{x^s}{s}\,dx \]
\[ = \lim_{t\to 0^+}\int_t^\infty \phi''(x)\frac{x^{s+1}}{s(s+1)}\,dx = -\lim_{t\to 0^+}\int_t^\infty \phi^{(3)}(x)\frac{x^{s+2}}{s(s+1)(s+2)}\,dx \tag{B.7} \]
\[ = \lim_{t\to 0^+}\int_t^\infty \phi^{(4)}(x)\frac{x^{s+3}}{s(s+1)(s+2)(s+3)}\,dx, \]
where $\phi^{(4)}(x)$ is understood in the sense of distributions at the finitely many points where it is not well-defined as a function.
Let $s = it$, $\phi = h$. Let $C_k = \lim_{t\to 0^+}\int_t^\infty |h^{(k)}(x)|\,x^{k-1}\,dx$ for $0 \le k \le 4$. Then (B.7) gives us that
\[ |Mh(it)| \le \min\left(C_0,\ \frac{C_1}{|t|},\ \frac{C_2}{|t||t+i|},\ \frac{C_3}{|t||t+i||t+2i|},\ \frac{C_4}{|t||t+i||t+2i||t+3i|}\right). \tag{B.8} \]
We must estimate the constants $C_j$, $0 \le j \le 4$.

Clearly, $h(t)t^{-1} = O(1)$ as $t \to 0^+$, $h^{(k)}(t) = O(1)$ as $t \to 0^+$ for all $k \ge 1$, $h(2) = h'(2) = h''(2) = 0$, and $h(x)$, $h'(x)$ and $h''(x)$ are all continuous. The function $h'''$ has a discontinuity at $t = 2$. As we said, we understand $h^{(4)}$ in the sense of distributions at $t = 2$; for example, $\lim_{\epsilon\to 0}\int_{2-\epsilon}^{2+\epsilon}h^{(4)}(t)\,dt = \lim_{\epsilon\to 0}(h^{(3)}(2+\epsilon) - h^{(3)}(2-\epsilon))$.
Symbolic integration easily gives that
\[ C_0 = \int_0^2 t(2-t)^3e^{t-1/2}\,dt = 92e^{-1/2} - 12e^{3/2} = 2.02055184\ldots \tag{B.9} \]
We will have to compute $C_k$, $1 \le k \le 4$, with some care, due to the absolute value involved in the definition.
The function $(x^2(2-x)^3e^{x-1/2})' = ((x^2(2-x)^3)' + x^2(2-x)^3)e^{x-1/2}$ has the same zeros as $H_1(x) = (x^2(2-x)^3)' + x^2(2-x)^3$, namely, $-4$, $0$, $1$ and $2$. The sign of $H_1(x)$ (and hence of $h'(x)$) is $+$ within $(0, 1)$ and $-$ within $(1, 2)$. Hence
\[ C_1 = \int_0^\infty |h'(x)|\,dx = |h(1) - h(0)| + |h(2) - h(1)| = 2h(1) = 2\sqrt{e}. \tag{B.10} \]
The situation with $(x^2(2-x)^3e^{x-1/2})''$ is similar: it has zeros at the roots of $H_2(x) = 0$, where $H_2(x) = H_1(x) + H_1'(x)$ (and, in general, $H_{k+1}(x) = H_k(x) + H_k'(x)$). This time, we will prefer to find the roots numerically. It is enough to find (candidates for) the roots using any available tool^7 and then check rigorously that the sign does change around the purported roots. In this way, we check that $H_2(x) = 0$ has two roots $\alpha_{2,1}$, $\alpha_{2,2}$ in the interval $(0, 2)$, another root at $2$, and two more roots outside $[0, 2]$; moreover,
\[ \alpha_{2,1} = 0.48756597185712\ldots, \qquad \alpha_{2,2} = 1.48777169309489\ldots, \tag{B.11} \]
where we verify the root using interval arithmetic. The sign of $H_2(x)$ (and hence of $h''(x)$) is first $+$, then $-$, then $+$. Write $\alpha_{2,0} = 0$, $\alpha_{2,3} = 2$. By integration by parts,
\[ C_2 = \int_0^\infty |h''(x)|\,x\,dx = \int_0^{\alpha_{2,1}}h''(x)\,x\,dx - \int_{\alpha_{2,1}}^{\alpha_{2,2}}h''(x)\,x\,dx + \int_{\alpha_{2,2}}^{2}h''(x)\,x\,dx \]
\[ = \sum_{j=1}^{3}(-1)^{j+1}\left(h'(x)\,x\Big|_{\alpha_{2,j-1}}^{\alpha_{2,j}} - \int_{\alpha_{2,j-1}}^{\alpha_{2,j}}h'(x)\,dx\right) \tag{B.12} \]
\[ = 2\sum_{j=1}^{2}(-1)^{j+1}\left(h'(\alpha_{2,j})\,\alpha_{2,j} - h(\alpha_{2,j})\right) = 10.79195821037\ldots. \]
^7 Routine find_root in SAGE was used here.
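The computation of $C_0$, $C_1$ and $C_2$ can be reproduced with elementary polynomial arithmetic, since $h^{(k)}(x) = H_k(x)\,e^{x-1/2}$ with $H_0(x) = x^2(2-x)^3$ and $H_{k+1} = H_k + H_k'$. The sketch below locates the two roots of $H_2$ in $(0, 2)$ by sign-change bisection and evaluates the closed form in (B.12). It runs in plain floating point, whereas the paper verifies the roots with interval arithmetic; the helper names are ours.

```python
import math

# Coefficients (lowest degree first) of p(x) = x^2 (2 - x)^3 = 8x^2 - 12x^3 + 6x^4 - x^5.
P = [0, 0, 8, -12, 6, -1]

def deriv(c):
    return [k * c[k] for k in range(1, len(c))]

def add(a, b):
    n = max(len(a), len(b))
    return [(a[k] if k < len(a) else 0) + (b[k] if k < len(b) else 0) for k in range(n)]

def ev(c, x):
    """Horner evaluation of the polynomial with coefficient list c at x."""
    r = 0.0
    for a in reversed(c):
        r = r * x + a
    return r

H1 = add(deriv(P), P)      # h'(x)  = H1(x) e^{x - 1/2}
H2 = add(deriv(H1), H1)    # h''(x) = H2(x) e^{x - 1/2}

def bisect_root(c, a, b, steps=60):
    """Bisection on a sign change of the polynomial c over [a, b]."""
    fa = ev(c, a)
    for _ in range(steps):
        m = 0.5 * (a + b)
        fm = ev(c, m)
        if (fa < 0) == (fm < 0):
            a, fa = m, fm
        else:
            b = m
    return 0.5 * (a + b)

alpha1 = bisect_root(H2, 0.1, 1.0)
alpha2 = bisect_root(H2, 1.0, 1.9)

def h(x):
    return ev(P, x) * math.exp(x - 0.5)

def h1(x):
    return ev(H1, x) * math.exp(x - 0.5)

C0 = 92 * math.exp(-0.5) - 12 * math.exp(1.5)
C1 = 2 * h(1.0)
C2 = 2 * sum((-1) ** (j + 1) * (h1(a) * a - h(a)) for j, a in ((1, alpha1), (2, alpha2)))

print(alpha1, alpha2, C0, C1, C2)
```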
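To compute $C_3$, we proceed in the same way, finding two roots of $H_3(x) = 0$ (numerically) within the interval $(0, 2)$, viz.,
\[ \alpha_{3,1} = 1.04294565694978\ldots, \qquad \alpha_{3,2} = 1.80999654602916\ldots \]
The sign of $H_3(x)$ on the interval $[0, 2]$ is first $-$, then $+$, then $-$. Write $\alpha_{3,0} = 0$, $\alpha_{3,3} = 2$. Proceeding as before (with the only difference that the integration by parts is iterated once now), we obtain that
\[ C_3 = \int_0^\infty |h'''(x)|\,x^2\,dx = \sum_{j=1}^{3}(-1)^j\int_{\alpha_{3,j-1}}^{\alpha_{3,j}}h'''(x)\,x^2\,dx = \sum_{j=1}^{3}(-1)^j\left(h''(x)\,x^2\Big|_{\alpha_{3,j-1}}^{\alpha_{3,j}} - \int_{\alpha_{3,j-1}}^{\alpha_{3,j}}h''(x)\cdot 2x\,dx\right) \]
\[ = \sum_{j=1}^{3}(-1)^j\left(h''(x)\,x^2 - h'(x)\cdot 2x + 2h(x)\right)\Big|_{\alpha_{3,j-1}}^{\alpha_{3,j}} \tag{B.13} \]
\[ = 2\sum_{j=1}^{2}(-1)^j\left(h''(\alpha_{3,j})\,\alpha_{3,j}^2 - 2h'(\alpha_{3,j})\,\alpha_{3,j} + 2h(\alpha_{3,j})\right) \]
and so interval arithmetic gives us
\[ C_3 = 75.1295251672\ldots \tag{B.14} \]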
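The treatment of the integral in $C_4$ is very similar, at least at first. There are two roots of $H_4(x) = 0$ in the interval $(0, 2)$, namely,
\[ \alpha_{4,1} = 0.45839599852663\ldots, \qquad \alpha_{4,2} = 1.54626346975533\ldots \]
The sign of $H_4(x)$ on the interval $[0, 2]$ is first $-$, then $+$, then $-$. Using integration by parts as before, we obtain
\[ \int_{0^+}^{2^-}\left|h^{(4)}(x)\right|x^3\,dx = -\int_{0^+}^{\alpha_{4,1}}h^{(4)}(x)\,x^3\,dx + \int_{\alpha_{4,1}}^{\alpha_{4,2}}h^{(4)}(x)\,x^3\,dx - \int_{\alpha_{4,2}}^{2^-}h^{(4)}(x)\,x^3\,dx \]
\[ = 2\sum_{j=1}^{2}(-1)^j\left(h^{(3)}(\alpha_{4,j})\,\alpha_{4,j}^3 - 3h^{(2)}(\alpha_{4,j})\,\alpha_{4,j}^2 + 6h'(\alpha_{4,j})\,\alpha_{4,j} - 6h(\alpha_{4,j})\right) - \lim_{t\to 2^-}h^{(3)}(t)\,t^3 = 1152.69754862\ldots, \]
since $\lim_{t\to 0^+}h^{(k)}(t)\,t^k = 0$ for $0 \le k \le 3$, $\lim_{t\to 2^-}h^{(k)}(t) = 0$ for $0 \le k \le 2$ and $\lim_{t\to 2^-}h^{(3)}(t) = -24e^{3/2}$. Now
\[ \int_{2^-}^{\infty}|h^{(4)}(x)\,x^3|\,dx = \lim_{\epsilon\to 0^+}|h^{(3)}(2+\epsilon) - h^{(3)}(2-\epsilon)|\cdot 2^3 = 2^3\cdot 24e^{3/2}. \]
Hence
\[ C_4 = \int_{0^+}^{2^-}\left|h^{(4)}(x)\right|x^3\,dx + 24e^{3/2}\cdot 2^3 = 2013.18185012\ldots \tag{B.15} \]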
We finish by remarking that we can write down $Mh$ explicitly:

(B.16)
M h = e1/2 (1)s (8(s+2, 2)+12(s+3, 2)+6(s+4, 2)+(s+ 5, 2)),
where (s, x) is the (lower) incomplete Gamma function
Z x
et ts1 dt.
(s, x) =
0
We will, however, find it easier to deal with M h by means of the bound (B.8), in
part because (B.16) amounts to an invitation to numerical instability.
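For instance, it is easy to use (B.8) to give a bound for the $\ell_1$-norm of $Mh(it)$. Since $C_4/C_3 > C_3/C_2 > C_2/C_1 > C_1/C_0$,
\[ |Mh(it)|_1 = 2\int_0^\infty |Mh(it)|\,dt \le 2\left(C_0\cdot\frac{C_1}{C_0} + C_1\int_{C_1/C_0}^{C_2/C_1}\frac{dt}{t} + C_2\int_{C_2/C_1}^{C_3/C_2}\frac{dt}{t^2} + C_3\int_{C_3/C_2}^{C_4/C_3}\frac{dt}{t^3} + C_4\int_{C_4/C_3}^{\infty}\frac{dt}{t^4}\right) \]
\[ = 2\left(C_1 + C_1\log\frac{C_2C_0}{C_1^2} + C_2\left(\frac{C_1}{C_2} - \frac{C_2}{C_3}\right) + \frac{C_3}{2}\left(\frac{C_2^2}{C_3^2} - \frac{C_3^2}{C_4^2}\right) + \frac{C_4}{3}\cdot\frac{C_3^3}{C_4^3}\right), \]
and so
\[ |Mh(it)|_1 \le 16.1939176. \tag{B.17} \]
This bound is far from tight, but it will certainly be useful.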
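The closed-form expression preceding (B.17) is simple to evaluate. The sketch below plugs in $C_0$ and $C_1$ from (B.9) and (B.10) and the quoted decimal values of $C_2$, $C_3$, $C_4$ (which are truncations, so the last digits of the result are only indicative); it also recovers the constant 41.73727 used in Section 4.5. The function name is ours.

```python
import math

C0 = 92 * math.exp(-0.5) - 12 * math.exp(1.5)    # (B.9)
C1 = 2 * math.sqrt(math.e)                        # (B.10)
C2, C3, C4 = 10.79195821037, 75.1295251672, 2013.18185012   # quoted from (B.12), (B.14), (B.15)

def l1_bound(C0, C1, C2, C3, C4):
    """Evaluate 2 * (sum of the five pieces) in the display preceding (B.17)."""
    # The bound switches from C_k/t^k to C_{k+1}/t^{k+1} at t = C_{k+1}/C_k,
    # which requires the breakpoints to be increasing.
    assert C4 / C3 > C3 / C2 > C2 / C1 > C1 / C0
    pieces = [
        C1,
        C1 * math.log(C2 * C0 / C1**2),
        C2 * (C1 / C2 - C2 / C3),
        (C3 / 2) * (C2**2 / C3**2 - C3**2 / C4**2),
        (C4 / 3) * (C3 / C4) ** 3,
    ]
    return 2 * sum(pieces)

bound = l1_bound(C0, C1, C2, C3, C4)
conv_bound = 16.1939176**2 / (2 * math.pi)       # the quantity bounded by 41.73727 in Section 4.5
print(bound, conv_bound)
```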

Similarly, |(t + i)M h(it)|1 is at most two times

Z C3
Z C4
Z
C2 dt
C3 dt
dt
i
dt + C2
+ C3
+ C4
|t + i| dt + C1
C0

2
3
C
C1
C
C
4 t
2
3
t
t
t
0
C0
C1
C2
C3
s
!!
!
C2
p
2+11
C0
C14 C12
t
C
1 C1
2
| C11
=
+ C1
t + 1 + log
+ 2 + sinh
4
2
C
t
C0
C0
0
C0

2
C4 C3
C3 C1
C2 C3
+
+ C2 log
+ C3
,
2
C3 C4
2 C42
C2
Z
C1
C0
C2
C1

1 +

and so
\[ |(t + i)Mh(it)|_1 \le 27.8622803. \tag{B.18} \]
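B.2. The difference $\eta_+ - \eta_\circ$ in $\ell_2$ norm. We wish to estimate the distance in $\ell_2$ norm between $\eta_\circ$ and its approximation $\eta_+$. This will be an easy affair, since, on the imaginary axis, the Mellin transform of $\eta_+$ is just a truncation of the Mellin transform of $\eta_\circ$.
By (B.2) and (B.3),
\[ |\eta_+ - \eta_\circ|_2^2 = \int_0^\infty\left|h_H(t)\,te^{-t^2/2} - h(t)\,te^{-t^2/2}\right|^2dt \le \left(\max_{t\ge 0}e^{-t^2}t^3\right)\cdot\int_0^\infty |h_H(t) - h(t)|^2\,\frac{dt}{t}. \tag{B.19} \]
The maximum $\max_{t\ge 0}t^3e^{-t^2}$ is $(3/2)^{3/2}e^{-3/2}$. Since the Mellin transform is an isometry (i.e., (2.2) holds),
\[ \int_0^\infty |h_H(t) - h(t)|^2\,\frac{dt}{t} = \frac{1}{2\pi}\int_{-\infty}^{\infty}|Mh_H(it) - Mh(it)|^2\,dt = \frac{1}{\pi}\int_H^\infty |Mh(it)|^2\,dt. \tag{B.20} \]
By (B.8),
\[ \int_H^\infty |Mh(it)|^2\,dt \le \int_H^\infty \frac{C_4^2}{t^8}\,dt \le \frac{C_4^2}{7H^7}. \tag{B.21} \]
Hence
\[ \int_0^\infty |h_H(t) - h(t)|^2\,\frac{dt}{t} \le \frac{C_4^2}{7\pi H^7}. \tag{B.22} \]
Using the bound (B.15) for $C_4$, we conclude that
\[ |\eta_+ - \eta_\circ|_2 \le \frac{C_4}{\sqrt{7\pi}}\left(\frac{3}{2e}\right)^{3/4}\cdot\frac{1}{H^{7/2}} \le \frac{274.856893}{H^{7/2}}. \tag{B.23} \]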
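It will also be useful to bound
\[ \left|\int_0^\infty (\eta_+(t) - \eta_\circ(t))^2\log t\,dt\right|. \]
This is at most
\[ \left(\max_{t\ge 0}e^{-t^2}t^3|\log t|\right)\cdot\int_0^\infty |h_H(t) - h(t)|^2\,\frac{dt}{t}. \]
Now
\[ \max_{t\ge 0}e^{-t^2}t^3|\log t| = \max\left(\max_{t\in[0,1]}e^{-t^2}t^3(-\log t),\ \max_{t\in[1,5]}e^{-t^2}t^3\log t\right) = 0.14882234545\ldots \]
where we find the maximum by the bisection method with 40 iterations.^8 Hence, by (B.22),
\[ \int_0^\infty (\eta_+(t) - \eta_\circ(t))^2|\log t|\,dt \le 0.148822346\cdot\frac{C_4^2}{7\pi H^7} \le \frac{27427.502}{H^7} \le \left(\frac{165.61251}{H^{7/2}}\right)^2. \tag{B.24} \]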
H 7/2
B.3. Norms involving $\eta_+$. Let us now bound some $\ell_1$- and $\ell_2$-norms involving $\eta_+$. Relatively crude bounds will suffice in most cases.
First, by (B.23),
\[ |\eta_+|_2 \le |\eta_\circ|_2 + |\eta_+ - \eta_\circ|_2 \le 0.800129 + \frac{274.8569}{H^{7/2}}, \tag{B.25} \]
where we obtain
\[ |\eta_\circ|_2 = \sqrt{0.640205997\ldots} = 0.8001287\ldots \tag{B.26} \]
by symbolic integration.
^8 The bisection method (as described in, e.g., [Tuc11, §5.2]) can be used to show that the minimum (or maximum) of $f$ on a compact interval $I$ lies within an interval (usually a very small one). Here, the bisection method was carried out rigorously, using interval arithmetic. The method was implemented by the author from the description in [Tuc11, p. 87–88], using D. Platt's interval arithmetic package.
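Let us now bound $|\eta_+\cdot\log|_2^2$. By isometry and (2.6),
\[ |\eta_+\cdot\log|_2^2 = \frac{1}{2\pi i}\int_{\frac{1}{2}-i\infty}^{\frac{1}{2}+i\infty}|M(\eta_+\cdot\log)(s)|^2\,ds = \frac{1}{2\pi i}\int_{\frac{1}{2}-i\infty}^{\frac{1}{2}+i\infty}|(M\eta_+)'(s)|^2\,ds. \]
Now, $(M\eta_+)'(1/2 + it)$ equals $1/2\pi$ times the additive convolution of $Mh_H(it)$ and $(M\eta_\diamondsuit)'(1/2 + it)$, where $\eta_\diamondsuit(t) = te^{-t^2/2}$. Hence, by Young's inequality,
\[ |(M\eta_+)'(1/2 + it)|_2 \le (1/2\pi)\,|Mh_H(it)|_1\,|(M\eta_\diamondsuit)'(1/2 + it)|_2. \]
Again by isometry and (2.6),
\[ |(M\eta_\diamondsuit)'(1/2 + it)|_2 = \sqrt{2\pi}\,|\eta_\diamondsuit\cdot\log|_2. \]
Hence, by (B.17),
\[ |\eta_+\cdot\log|_2 \le \frac{1}{2\pi}\,|Mh_H(it)|_1\,|\eta_\diamondsuit\cdot\log|_2 \le 2.5773421\,|\eta_\diamondsuit\cdot\log|_2. \]
Since, by symbolic integration,
\[ |\eta_\diamondsuit\cdot\log|_2 = \sqrt{\frac{\sqrt{\pi}}{32}\left(8(\log 2)^2 + 2\gamma^2 + \pi^2 + 8(\gamma - 2)\log 2 - 8\gamma\right)} \le 0.3220301, \tag{B.27} \]
we get that
\[ |\eta_+\cdot\log|_2 \le 0.8299818. \tag{B.28} \]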
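The closed form in (B.27) can be compared against a direct numerical evaluation of $\int_0^\infty t^2e^{-t^2}(\log t)^2\,dt$, and the constants 2.5773421 and 0.8299818 then follow by multiplication. The sketch below hard-codes Euler's constant $\gamma$ and uses a composite Simpson rule truncated at $t = 10$; both choices are ours, and the result is a floating-point illustration rather than a rigorous enclosure.

```python
import math

EULER_GAMMA = 0.5772156649015329

inner = (8 * math.log(2)**2 + 2 * EULER_GAMMA**2 + math.pi**2
         + 8 * (EULER_GAMMA - 2) * math.log(2) - 8 * EULER_GAMMA)
norm_diamond_log = math.sqrt(math.sqrt(math.pi) * inner / 32)    # closed form in (B.27)

def simpson(f, a, b, n=20000):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def integrand(t):
    # (eta_diamond(t) * log t)^2 with eta_diamond(t) = t exp(-t^2/2); the limit at t = 0 is 0.
    return 0.0 if t <= 0 else t * t * math.exp(-t * t) * math.log(t)**2

numeric_sq = simpson(integrand, 0.0, 10.0)

factor = 16.1939176 / (2 * math.pi)    # (1/2pi) |M h_H(it)|_1, using (B.17)
bound = factor * 0.3220301             # right-hand side of (B.28)

print(norm_diamond_log, math.sqrt(numeric_sq), factor, bound)
```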
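Let us bound $|\eta_+(t)\,t^\sigma|_1$ for $\sigma \in (-2, \infty)$. By Cauchy-Schwarz and Plancherel,
\[ |\eta_+(t)\,t^\sigma|_1 = \left|h_H(t)\,t^{1+\sigma}e^{-t^2/2}\right|_1 \le \left|t^{\sigma+3/2}e^{-t^2/2}\right|_2\,|h_H(t)/\sqrt{t}|_2 \]
\[ = \left|t^{\sigma+3/2}e^{-t^2/2}\right|_2\sqrt{\int_0^\infty |h_H(t)|^2\,\frac{dt}{t}} = \left|t^{\sigma+3/2}e^{-t^2/2}\right|_2\sqrt{\frac{1}{2\pi}\int_{-H}^{H}|Mh(ir)|^2\,dr} \tag{B.29} \]
\[ \le \left|t^{\sigma+3/2}e^{-t^2/2}\right|_2\sqrt{\frac{1}{2\pi}\int_{-\infty}^{\infty}|Mh(ir)|^2\,dr} = \left|t^{\sigma+3/2}e^{-t^2/2}\right|_2\,|h(t)/\sqrt{t}|_2. \]
Since
\[ \left|t^{\sigma+3/2}e^{-t^2/2}\right|_2 = \sqrt{\int_0^\infty e^{-t^2}t^{2\sigma+3}\,dt} = \sqrt{\frac{\Gamma(\sigma+2)}{2}}, \qquad |h(t)/\sqrt{t}|_2 = \sqrt{\frac{31989}{8e} - \frac{585e^3}{8}} \le 1.5023459, \]
we conclude that
\[ |\eta_+(t)\,t^\sigma|_1 \le 1.062319\cdot\sqrt{\Gamma(\sigma+2)} \tag{B.30} \]
for $\sigma > -2$.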
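B.4. Norms involving $\eta_+'$. By one of the standard transformation rules (see (2.6)), the Mellin transform of $\eta_+'$ equals $-(s-1)\cdot M\eta_+(s-1)$. Since the Mellin transform is an isometry in the sense of (2.2),
\[ |\eta_+'|_2^2 = \frac{1}{2\pi i}\int_{\frac{1}{2}-i\infty}^{\frac{1}{2}+i\infty}\left|M(\eta_+')(s)\right|^2ds = \frac{1}{2\pi i}\int_{-\frac{1}{2}-i\infty}^{-\frac{1}{2}+i\infty}|s\cdot M\eta_+(s)|^2\,ds. \]
Recall that $\eta_+(t) = h_H(t)\,\eta_\diamondsuit(t)$, where $\eta_\diamondsuit(t) = te^{-t^2/2}$. Thus, by (2.5), the function $M\eta_+(-1/2 + it)$ equals $1/2\pi$ times the (additive) convolution of $Mh_H(it)$ and $M\eta_\diamondsuit(-1/2 + it)$. Therefore, for $s = -1/2 + it$,
\[ |s|\,|M\eta_+(s)| = \frac{|s|}{2\pi}\left|\int_{-H}^{H}Mh(ir)\,M\eta_\diamondsuit(s - ir)\,dr\right| \le \frac{3}{2\pi}\int_{-H}^{H}|ir - 1|\,|Mh(ir)|\cdot|s - ir|\,|M\eta_\diamondsuit(s - ir)|\,dr = \frac{3}{2\pi}(f * g)(t), \tag{B.31} \]
where $f(t) = |it - 1|\,|Mh(it)|$ and $g(t) = |-1/2 + it|\,|M\eta_\diamondsuit(-1/2 + it)|$. (Since $|(-1/2 + i(t-r)) + (1 + ir)| = |1/2 + it| = |s|$, either $|-1/2 + i(t-r)| \ge |s|/3$ or $|1 + ir| \ge 2|s|/3$; hence $|s - ir|\,|ir - 1| = |-1/2 + i(t-r)|\,|1 + ir| \ge |s|/3$.) By Young's inequality (in a special case that follows from Cauchy-Schwarz), $|f * g|_2 \le |f|_1|g|_2$. By (B.18),
\[ |f|_1 = |(r + i)Mh(ir)|_1 \le 27.8622803. \]
Yet again by Plancherel,
\[ |g|_2^2 = \frac{1}{i}\int_{-\frac{1}{2}-i\infty}^{-\frac{1}{2}+i\infty}|s|^2|M\eta_\diamondsuit(s)|^2\,ds = \frac{1}{i}\int_{\frac{1}{2}-i\infty}^{\frac{1}{2}+i\infty}|(M(\eta_\diamondsuit'))(s)|^2\,ds = 2\pi|\eta_\diamondsuit'|_2^2 = \frac{3\pi^{3/2}}{4}. \]
Hence
\[ |\eta_+'|_2 \le \frac{1}{\sqrt{2\pi}}\cdot\frac{3}{2\pi}\,|f * g|_2 \le \frac{1}{\sqrt{2\pi}}\cdot\frac{3}{2\pi}\cdot 27.8622803\sqrt{\frac{3\pi^{3/2}}{4}} \le 10.845789. \tag{B.32} \]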
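Let us now bound $|\eta_+'(t)\,t^\sigma|_1$ for $\sigma \in (-1, \infty)$. First of all,
\[ |\eta_+'(t)\,t^\sigma|_1 = \left|\left(h_H(t)\,te^{-t^2/2}\right)'t^\sigma\right|_1 = \left|\left(h_H'(t)\,te^{-t^2/2} + h_H(t)(1 - t^2)e^{-t^2/2}\right)\cdot t^\sigma\right|_1 \]
\[ \le \left|h_H'(t)\,t^{\sigma+1}e^{-t^2/2}\right|_1 + |\eta_+(t)\,t^{\sigma-1}|_1 + |\eta_+(t)\,t^{\sigma+1}|_1. \]
We can bound the last two terms by (B.30). Much as in (B.29), we note that
\[ \left|h_H'(t)\,t^{\sigma+1}e^{-t^2/2}\right|_1 \le \left|t^{\sigma+1/2}e^{-t^2/2}\right|_2\,|h_H'(t)\sqrt{t}|_2, \]
and then see that
\[ |h_H'(t)\sqrt{t}|_2 = \sqrt{\int_0^\infty |h_H'(t)|^2\,t\,dt} = \sqrt{\frac{1}{2\pi}\int_{-\infty}^{\infty}|M(h_H')(1 + ir)|^2\,dr} = \sqrt{\frac{1}{2\pi}\int_{-\infty}^{\infty}|(-ir)Mh_H(ir)|^2\,dr} \]
\[ = \sqrt{\frac{1}{2\pi}\int_{-H}^{H}|(-ir)Mh(ir)|^2\,dr} = \sqrt{\frac{1}{2\pi}\int_{-H}^{H}|M(h')(1 + ir)|^2\,dr} \le \sqrt{\frac{1}{2\pi}\int_{-\infty}^{\infty}|M(h')(1 + ir)|^2\,dr} = |h'(t)\sqrt{t}|_2, \]
where we use the first rule in (2.6) twice. Since
\[ \left|t^{\sigma+1/2}e^{-t^2/2}\right|_2 = \sqrt{\frac{\Gamma(\sigma+1)}{2}}, \qquad |h'(t)\sqrt{t}|_2 = \sqrt{\frac{103983}{16e} - \frac{1899e^3}{16}} = 2.6312226, \]
we conclude that
\[ |\eta_+'(t)\,t^\sigma|_1 \le 1.062319\cdot\left(\sqrt{\Gamma(\sigma+1)} + \sqrt{\Gamma(\sigma+3)}\right) + \sqrt{\frac{\Gamma(\sigma+1)}{2}}\cdot 2.6312226 \le 2.922875\sqrt{\Gamma(\sigma+1)} + 1.062319\sqrt{\Gamma(\sigma+3)} \tag{B.33} \]
for $\sigma > -1$.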
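B.5. The $\ell_\infty$-norm of $\eta_+$. Let us now get a bound for $|\eta_+|_\infty$. Recall that $\eta_+(t) = h_H(t)\,\eta_\diamondsuit(t)$, where $\eta_\diamondsuit(t) = te^{-t^2/2}$. Clearly
\[ |\eta_+|_\infty = |h_H(t)\,\eta_\diamondsuit(t)|_\infty \le |\eta_\circ|_\infty + |(h(t) - h_H(t))\,\eta_\diamondsuit(t)|_\infty \le |\eta_\circ|_\infty + \left|\frac{h(t) - h_H(t)}{t}\right|_\infty|\eta_\diamondsuit(t)\,t|_\infty. \tag{B.34} \]
Taking derivatives, we easily see that
\[ |\eta_\circ|_\infty = \eta_\circ(1) = 1, \qquad |\eta_\diamondsuit(t)\,t|_\infty = 2/e. \]
It remains to bound $|(h(t) - h_H(t))/t|_\infty$. By (1.5),
\[ h_H(t) = \int_{t/2}^{\infty}h(ty^{-1})\,\frac{\sin(H\log y)}{\pi\log y}\,\frac{dy}{y} = \int_{-H\log\frac{2}{t}}^{\infty}h\left(\frac{t}{e^{w/H}}\right)\frac{\sin w}{\pi w}\,dw. \tag{B.35} \]
The sine integral
\[ \mathrm{Si}(x) = \int_0^x\frac{\sin t}{t}\,dt \]
is defined for all $x$; it tends to $\pi/2$ as $x \to +\infty$ and to $-\pi/2$ as $x \to -\infty$ (see [AS64, (5.2.25)]). We apply integration by parts to the second integral in (B.35), and obtain
\[ h_H(t) - h(t) = -\frac{1}{\pi}\int_{-H\log\frac{2}{t}}^{\infty}\left(\frac{d}{dw}h\left(\frac{t}{e^{w/H}}\right)\right)\mathrm{Si}(w)\,dw - h(t) \]
\[ = -\frac{1}{\pi}\int_0^\infty\left(\frac{d}{dw}h\left(\frac{t}{e^{w/H}}\right)\right)\left(\mathrm{Si}(w) - \frac{\pi}{2}\right)dw - \frac{1}{\pi}\int_{-H\log\frac{2}{t}}^{0}\left(\frac{d}{dw}h\left(\frac{t}{e^{w/H}}\right)\right)\left(\mathrm{Si}(w) + \frac{\pi}{2}\right)dw. \]
Now
\[ \left|\frac{d}{dw}h\left(\frac{t}{e^{w/H}}\right)\right| = \frac{te^{-w/H}}{H}\left|h'\left(\frac{t}{e^{w/H}}\right)\right| \le \frac{t\,|h'|_\infty}{He^{w/H}}. \]
Integration by parts easily yields the bounds $|\mathrm{Si}(x) - \pi/2| < 2/x$ for $x > 0$ and $|\mathrm{Si}(x) + \pi/2| < 2/|x|$ for $x < 0$; we also know that $0 \le \mathrm{Si}(x) \le x < \pi/2$ for $x \in [0, 1]$ and $-\pi/2 < x \le \mathrm{Si}(x) \le 0$ for $x \in [-1, 0]$. Hence
\[ |h_H(t) - h(t)| \le \frac{2t\,|h'|_\infty}{\pi H}\left(\int_0^1\frac{\pi}{2}\,e^{-w/H}\,dw + \int_1^\infty\frac{2e^{-w/H}}{w}\,dw\right) = t\,|h'|_\infty\cdot\left((1 - e^{-1/H}) + \frac{4}{\pi}\,\frac{E_1(1/H)}{H}\right), \]
where $E_1$ is the exponential integral
\[ E_1(z) = \int_z^\infty\frac{e^{-t}}{t}\,dt. \]
By [AS64, (5.1.20)],
\[ 0 < E_1(1/H) < \frac{\log(H+1)}{e^{1/H}}, \]
and, since $\log(H+1) = \log H + \log(1 + 1/H) < \log H + 1/H < (\log H)(1 + 1/H) < (\log H)e^{1/H}$ for $H \ge e$, we see that this gives us that $E_1(1/H) < \log H$ (again for $H \ge e$, as is the case). Hence
\[ \frac{|h_H(t) - h(t)|}{t} < |h'|_\infty\cdot\left(1 - e^{-\frac{1}{H}} + \frac{4}{\pi}\,\frac{\log H}{H}\right) < |h'|_\infty\cdot\frac{1 + \frac{4}{\pi}\log H}{H}, \tag{B.36} \]
and so, by (B.34),
\[ |\eta_+|_\infty \le 1 + \frac{2}{e}\left|\frac{h(t) - h_H(t)}{t}\right|_\infty < 1 + \frac{2}{e}\,|h'|_\infty\cdot\frac{1 + \frac{4}{\pi}\log H}{H}. \]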
By (B.11) and interval arithmetic, we determine that

(B.37)
|h | = |h (2,2 )| 2.805820379671,
where 2,2 is a root of h (x) = 0 as in (B.11). We have proven

1 + 4 log H
1 + 4 log H
2
2.80582038
< 1 + 2.06440727
.
e
H
H
We will need three other bounds of this kind, namely, for + (t) log t, + (t)/t
and + (t)t. We start as in (B.34):
(B.38) |+ | < 1 +
|+ log t| | log t| + |(h(t) hH (t)) (t) log t|

(B.39)
| log t| + |(h hH (t))/t| | (t)t log t| ,
|+ (t)/t| | (t)/t| + |(h hH (t))/t| | (t)|
|+ (t)t| | (t)t| + |(h hH (t))/t| | (t)t2 | .
By the bisection method with 30 iterations, implemented with interval arithmetic,

| (t) log t| 0.279491,
Hence, by (B.36) and (B.37),

(B.40)
| (t)t log t| 0.3811561.
|+ log t| 0.279491 + 1.069456
1+
log H
.
H
By the bisection method with 32 iterations,
\[
|\eta_\circ(t)/t|_\infty \le 1.08754396.
\]
(We can also obtain this by solving $(\eta_\circ(t)/t)' = 0$ symbolically.) It is easy to show that $|\eta_\diamondsuit|_\infty = 1/\sqrt{e}$. Hence, again by (B.36) and (B.37),
\[
|\eta_+(t)/t|_\infty \le 1.08754396 + 1.70181609 \cdot \frac{1 + \frac{4}{\pi} \log H}{H}. \tag{B.41}
\]
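By the bisection method with 32 iterations,
\[
|\eta_\circ(t) \, t|_\infty \le 1.06473476.
\]
Taking derivatives, we see that $|\eta_\diamondsuit(t) \, t^2|_\infty = 3^{3/2} e^{-3/2}$. Hence, yet again by (B.36) and (B.37),
\[
|\eta_+(t) \, t|_\infty \le 1.06473476 + 3.25312 \cdot \frac{1 + \frac{4}{\pi} \log H}{H}. \tag{B.42}
\]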
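The numerical constants in (B.38) and (B.40)-(B.42) are products of the bound (B.37) with a supremum involving $\eta_\diamondsuit$, so they can be rechecked with a few lines of arithmetic. The sketch below does this and also gives a non-rigorous grid estimate of $|h'|_\infty$. Two assumptions are ours and should be checked against the definitions earlier in the paper: $\eta_\diamondsuit(t) = t e^{-t^2/2}$, and $h(t) = t^2 (2 - t)^3 e^{t - 1/2}$ on $[0, 2]$ (with $h = 0$ elsewhere). The function names are hypothetical.

```python
import math


def h_prime(t):
    """h'(t) for the ASSUMED h(t) = t^2 (2 - t)^3 e^{t - 1/2} on [0, 2].

    Differentiating and factoring gives e^{t - 1/2} t (2 - t)^2 (1 - t)(4 + t).
    """
    return math.exp(t - 0.5) * t * (2 - t) ** 2 * (1 - t) * (4 + t)


def sup_abs_h_prime(n=200_000):
    """Non-rigorous grid estimate of sup |h'| on [0, 2] (grid step 2/n)."""
    return max(abs(h_prime(2 * k / n)) for k in range(n + 1))


def sup_eta_diamond_times_power(k):
    """Exact sup over t > 0 of t^(k+1) e^{-t^2/2}, attained at t = sqrt(k + 1).

    This is |eta_diamond(t) t^k|_inf under the ASSUMPTION eta_diamond(t) = t e^{-t^2/2}.
    """
    m = k + 1
    return m ** (m / 2) * math.exp(-m / 2)


H_PRIME_SUP = 2.80582038  # rounded-up form of the bound in (B.37)

# (label, product of the two factors, constant printed in the text)
checks = [
    ("B.38", sup_eta_diamond_times_power(1) * H_PRIME_SUP, 2.06440727),  # 2/e
    ("B.40", 0.3811561 * H_PRIME_SUP, 1.069456),
    ("B.41", sup_eta_diamond_times_power(0) * H_PRIME_SUP, 1.70181609),  # 1/sqrt(e)
    ("B.42", sup_eta_diamond_times_power(2) * H_PRIME_SUP, 3.25312),     # 3^{3/2} e^{-3/2}
]

for label, product, printed in checks:
    print(label, product, printed, product <= printed)

print("grid estimate of |h'|_inf:", sup_abs_h_prime())
```

Each printed constant should be at least the corresponding product, since the text rounds upwards; the grid estimate can only undershoot the true supremum, so it serves as a plausibility check on (B.37) rather than a proof.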
References
[AS64] M. Abramowitz and I. A. Stegun. Handbook of mathematical functions with formulas, graphs, and mathematical tables, volume 55 of National Bureau of Standards Applied Mathematics Series. For sale by the Superintendent of Documents, U.S. Government Printing Office, Washington, D.C., 1964.
[BBO10] J. Bertrand, P. Bertrand, and J.-P. Ovarlez. Mellin transform. In A. D. Poularikas, editor, Transforms and applications handbook. CRC Press, Boca Raton, FL, 2010.
[BM98] Martin Berz and Kyoko Makino. Verified integration of ODEs and flows using differential algebraic methods on high-order Taylor models. Reliab. Comput., 4(4):361-369, 1998.
[Boo06] A. R. Booker. Turing and the Riemann hypothesis. Notices Amer. Math. Soc., 53(10):1208-1211, 2006.
[dB81] N. G. de Bruijn. Asymptotic methods in analysis. Dover Publications Inc., New York, third edition, 1981.
[GR00] I. S. Gradshteyn and I. M. Ryzhik. Table of integrals, series, and products. Academic Press Inc., San Diego, CA, sixth edition, 2000. Translated from the Russian, Translation edited and with a preface by Alan Jeffrey and Daniel Zwillinger.
[Har66] G. H. Hardy. Collected papers of G. H. Hardy (Including Joint papers with J. E. Littlewood and others). Vol. I. Edited by a committee appointed by the London Mathematical Society. Clarendon Press, Oxford, 1966.
[HB79] D. R. Heath-Brown. The fourth power moment of the Riemann zeta function. Proc. London Math. Soc. (3), 38(3):385-422, 1979.
[Hela] H. A. Helfgott. Major arcs for Goldbach's problem. Preprint. Available at arXiv:1203.5712.
[Helb] H. A. Helfgott. Minor arcs for Goldbach's problem. Preprint. Available as arXiv:1205.5252.
[Helc] H. A. Helfgott. The Ternary Goldbach Conjecture is true. Preprint.
[HL23] G. H. Hardy and J. E. Littlewood. Some problems of 'Partitio numerorum'; III: On the expression of a number as a sum of primes. Acta Math., 44(1):1-70, 1923.
[IK04] H. Iwaniec and E. Kowalski. Analytic number theory, volume 53 of American Mathematical Society Colloquium Publications. American Mathematical Society, Providence, RI, 2004.
[Kad05] H. Kadiri. Une région explicite sans zéros pour la fonction $\zeta$ de Riemann. Acta Arith., 117(4):303-339, 2005.
[Knü99] O. Knüppel. PROFIL/BIAS, February 1999. version 2.
[Leh66] R. Sherman Lehman. On the difference $\pi(x) - \mathrm{li}(x)$. Acta Arith., 11:397-410, 1966.
[McC84] K. S. McCurley. Explicit estimates for the error term in the prime number theorem for arithmetic progressions. Math. Comp., 42(165):265-285, 1984.
[MV07] H. L. Montgomery and R. C. Vaughan. Multiplicative number theory. I. Classical theory, volume 97 of Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge, 2007.
[Ned06] N. S. Nedialkov. VNODE-LP: a validated solver for initial value problems in ordinary differential equations, July 2006. version 0.3.
[OLBC10] F. W. J. Olver, D. W. Lozier, R. F. Boisvert, and Ch. W. Clark, editors. NIST handbook of mathematical functions. U.S. Department of Commerce National Institute of Standards and Technology, Washington, DC, 2010. With 1 CD-ROM (Windows, Macintosh and UNIX).
[Olv58] F. W. J. Olver. Uniform asymptotic expansions of solutions of linear second-order differential equations for large values of a parameter. Philos. Trans. Roy. Soc. London. Ser. A, 250:479-517, 1958.
[Olv59] F. W. J. Olver. Uniform asymptotic expansions for Weber parabolic cylinder functions of large orders. J. Res. Nat. Bur. Standards Sect. B, 63B:131-169, 1959.
[Olv61] F. W. J. Olver. Two inequalities for parabolic cylinder functions. Proc. Cambridge Philos. Soc., 57:811-822, 1961.
[Olv65] F. W. J. Olver. On the asymptotic solution of second-order differential equations having an irregular singularity of rank one, with an application to Whittaker functions. J. Soc. Indust. Appl. Math. Ser. B Numer. Anal., 2:225-243, 1965.
[Olv74] F. W. J. Olver. Asymptotics and special functions. Academic Press [A subsidiary of Harcourt Brace Jovanovich, Publishers], New York-London, 1974. Computer Science and Applied Mathematics.
[Plaa] D. Platt. Computing $\pi(x)$ analytically. To appear in Math. Comp. Available as arXiv:1203.5712.
[Plab] D. Platt. Numerical computations concerning GRH. Preprint. Available at arXiv:1305.3087.
[Pla11] D. Platt. Computing degree 1 L-functions rigorously. PhD thesis, Bristol University, 2011.
[Ros41] B. Rosser. Explicit bounds for some functions of prime numbers. Amer. J. Math., 63:211-232, 1941.
[Tem10] N. M. Temme. Parabolic cylinder functions. In NIST handbook of mathematical functions, pages 303-319. U.S. Dept. Commerce, Washington, DC, 2010.
[Tru] T. S. Trudgian. An improved upper bound for the error in the zero-counting formulae for Dirichlet L-functions and Dedekind zeta-functions. Preprint.
[Tuc11] W. Tucker. Validated numerics: A short introduction to rigorous computations. Princeton University Press, Princeton, NJ, 2011.
[Tur53] A. M. Turing. Some calculations of the Riemann zeta-function. Proc. London Math. Soc. (3), 3:99-117, 1953.
[TV03] N. M. Temme and R. Vidunas. Parabolic cylinder functions: examples of error bounds for asymptotic expansions. Anal. Appl. (Singap.), 1(3):265-288, 2003.
[Vin37] I. M. Vinogradov. Representation of an odd number as a sum of three primes. Dokl. Akad. Nauk. SSR, 15:291-294, 1937.
[Wed03] S. Wedeniwski. ZetaGrid - Computational verification of the Riemann hypothesis. Conference in Number Theory in honour of Professor H. C. Williams, Banff, Alberta, Canada, May 2003.
[Whi03] E. T. Whittaker. On the functions associated with the parabolic cylinder in harmonic analysis. Proc. London Math. Soc., 35:417-427, 1903.
[Wig20] S. Wigert. Sur la théorie de la fonction $\zeta(s)$ de Riemann. Ark. Mat., 14:1-17, 1920.
[Won01] R. Wong. Asymptotic approximations of integrals, volume 34 of Classics in Applied Mathematics. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, 2001. Corrected reprint of the 1989 original.
Harald Helfgott, École Normale Supérieure, Département de Mathématiques, 45 rue d'Ulm, F-75230 Paris, France
E-mail address: harald.helfgott@ens.fr
