Search

An Application of a Theorem I Just Learned

Question : Recall that a polygonal number is a number that counts dots arranged in the shape of a regular polygon. Prove that the n-th non s-gonal number is given by
12+14+2ns2+n
where x represents the least integer less than or equal to x.

Answer : It is well known that
(s2)n(n1)2+n
is the n-th s-gonal number.

Now, we will use the Lambek-Moser Theorem to get
f(n)=(s2)n(n1)2
and hence
f(n)=12+14+2ns2
and hence
F(n)=12+14+2ns2+n
hence completing the proof.

Remark : A careful induction on n (keeping s fixed) would probably have also done the job.

Acknowledgement : I learnt about the Lambek-Moser Theorem from the book The Irrationals: a Story of the Numbers You Can't Count On by Julian Havil (section Theoretical Matters of chapter Does Irrationality Matter?) which I also reviewed for zbMATH

The Law of Iterated Logarithm

1. Introduction

The goal is to give a complete proof of the Law of Iterated Logarithm. The setup we will work in is the following. Let {Xi}i=1 be a sequence of iid random variables with mean 0 and variance 1. Let
Sn:=i=1nXi and S¯n:=Snn
for nZ+. From now on, we will use Xn and Sn as defined in our discussions.

We wish to prove the following theorem.
Theorem 1 (The Law of Iterated Logarithm) : Let
Λn:=2nloglogn
wherever it is defined. Then,
lim supnSnΛn=1
almost surely. By replacing Xj by Xj, this also proves that
lim infnSnΛn=1
almost surely. In fact, for any fC(R,R),
lim supnf(SnΛn)=supt[1,1]f(t)
almost surely. That is, the set of limit points of
{SnΛn:n1}
coincides with the interval [1,1] almost surely.

In what follows, we will use the notation
Λβ:=Λβ and S~β:=SβΛβ
where β is greatest integer less than or equal to β.

The main reference we are following is Probability Theory, An Analytic View by Daniel W. Stroock.

2. Lemmas

To prove Theorem 1, we need three lemmas.

Lemma 1 : For a(0,) and β(1,), if
m=1P(|S~βm|aβ)<
then
lim supn|S~n|a
almost surely.

Lemma 2 : For any sequence {Xn:n1} of iid random variables with mean 0 and variance σ2, we have
lim supn|S~n|8σ
almost surely.

Lemma 3 : Let the sequence {Xn} have a common distribution μ. Further, let the moment-generating function
Mμ(ξ):=Reξxμ(dx)<
for all ξR. Then, for all R(0,), there is an N(R)Z+ and a suitable constant K such that
P(|S~n|R)2exp[(1K8Rloglognn)R2loglogn]
for all nN(R). Also, for each ε(0,1], there is an N(ε)Z+ such that for all for all nN(ε) and |a|1ε,
P(|S~na|<ε)12exp[(a2+4K|a|ε)loglogn]
for some suitable constant K.

3. Proof of Theorem
We will first give a proof assuming the three lemmas and then give a proof of the lemmas afterwards.

Proof of Theorem 1 : Observe first that for any Xn and ε>0, it is easy to find a ψCb(R) so that Yn:=ψXn has mean 0 and variance 1 while Zn:=XnYn has variance less than ε2 (what we want is to find a smooth truncation - one way to do this is by first considering Xn1|Xn|<n and then appropriately smoothening the edges). Applying Lemma 2 to this Zn, we have
lim supn|S~n(ω)|1+lim supn|m=1nZm(ω)Λn|1+8ε
and for a[1,1], we have
lim infn|S~n(ω)a|lim supn|m=1nZm(ω)Λn|8ε
for almost every ωΩ. So, it is enough to assume that the random variables are bounded, and that is what will be assumed in the rest of the proof.

Now, for a given β>1, we can use Lemma 3 to argue
P(|S~βm|β)2exp[βloglogβm]
for all sufficiently large mZ+. So, using Lemma 1 with a=β, we see that
lim supn|S~n|β
almost surely for all β(1,).

So, to complete the proof, we need to show
P(lim infn|S~na|<ε)=1
for all a(1,1) and all ε>0. To do this, we wish to use the Borel-Cantelli Lemma. But for that, we need to make sure we are dealing with independent events. For this, we can use the previously proved result to argue that
lim infn|S~na|infk2lim infm|S~kma|lim infklim infn|SkmSkm1Λkma|
almost surely for all integers k2.

Now, since the events
Ak,m:={|SkmSkm1Λkma|<ε}mZ+
are mutually independent for each k2, it is enough to check that
m=1P(Ak,m)=
for sufficiently large k.

But, since
limkmaxmZ+|ΛkmΛkmkm11|=0
and
P(Ak,m)=P(|S~kmkm1aΛkmΛkmkm1|<εΛkmΛkmkm1)
for all k2, it is enough to show
m=1P(|S~kmkm1a|<ε)=
for every k2 and a(1,1) and ε>0. This follows by using the second part of Lemma 3 and choosing ε~>0 so small that ρ:=a2+4Kε~|a|<1 from which it is possible to conclude that when 0<ε<ε~, we have
P(|S~na|<ε)12exp[ρloglogn]
for all sufficiently large n.

This completes the proof.

4. Proof of Lemma 1

We first introduce the idea of a median. Give a real valued random variable Y, we say αR is a median of Y if
P(Yα)12 and P(Yα)12
and write as αmed(Y). It is easy to see that
α:=inf{tR:P(Yt)12}
is a median of Y. In other words, every Y has a median. Additionally, it can be shown that if YL2(R) and m is the mean of Y, then
|αm|2Var(Y)
for all αmed(Y).

We also have the following theorem.
Theorem 2 (Lévy Reflection Principle) : For k, choose α,kmed(SSk). Then, for all ε>0, we have
P(max1nN|Sn+αn,m|ε)2P(|SN|ε)
for all NZ+.

With this at hand, we move to the proof.

Proof of Lemma 1 : Let β(1,) be given. For each mN and 1nβm, let αm,n be a median of SβmSn. This gives |αm,n|2βm so that
lim supn|S~n|=lim supmmaxβm1nβm|S~n|βlim supmmaxnβm|Sn|Λβmβlim supmmaxnβm|Sn+αm,n|Λβm
which gives
P(lim supn|S~n|a)P(lim supmmaxnβm|Sn+αm,n|Λβmaβ)
and hence on application of Theorem 2, we have
P(maxnβm|Sn+αm,n|Λβmaβ)2P(|S~βm|aβ)
from which the desired result follows using Borel-Cantelli Lemma.

5. Proof of Lemma 2

Again, we need to introduce a few objects before we can move to our proof.

We define the Rademacher functions {Rn}nZ+ on [0,1) as follows. Consider the function R:R{1,1} given by
R(t):={1iftt[0,12)1iftt[12,1)
and define Rn on [0,1) by
Rn(ω):=R(2n1ω)
for nZ+ and ω[0,1). It can be shown that the Rademacher functions are mutually independent.

We will use these functions in our proof.

Proof of Lemma 2 : Without loss of generality, it can be assumed that σ=1. We begin by considering the case where the Xn's are symmetric, i.e., Xn and Xn has the same distribution as Xn itself. For this case, we will be able to prove the result with the constant 8 replaced by 4 once we can show
m=0P(|S~2m23/2|)<
since we can then use Lemma 1.

Now, let (Ω,F,P) be the probability space on which the Xn's are defined and let {Rn}nZ+ be the sequence of Rademacher functions on [0,1). Set Q:=λ[0,1)×P on ([0,1)×Ω,B[0,1)×F) where λ[0,1) is the Lebesgue measure on [0,1). It is easy to check that the symmetry of Xn is equivalent to the statement that
ωΩ(X1(ω),X2(ω),)RZ+
has the same distribution under P as
(t,ω)[0,1)×Ω(R1(t)X1(ω),R2(t)X2(ω),)RZ+
has under Q.

Now, from Hoeffding's inequality, we know that for a given {σk}k=1nR and mutually independent random variables {Xi}i=1n with common distribution μ, the centered Bernoulli distribution given by μ({±1})=12, if ν denotes the distribution S:=1nσkXk, then for all a[0,), we have
P(|S|a)2exp[a22Σ2]
where Σ2:=1nσk2. We use this with σk=Xk(ω) and note that
λ[0,1)({t[0,1):|n=12mRn(t)Xn(ω)|a})2exp[a2212mXn(ω)2]
for all a[0,) and ωΩ.

So, if we define
Am:={ωΩ:12mn=12mXm(ω)22}
and
Fm(ω):=λ[0,1)({t[0,1):|n=12mRn(t)Xn(ω)|23/2Λ2m})
then we have
P({ωΩ:|S2m(ω)|23/2Λ2m})=ΩFm(ω)P(dω)2Ωexp[8Λ2m2212mXn(ω)]P(dω)2exp[4loglog2m]+2P(Am)
due to Tonelli's Theorem.

So, it is enough to show 0P(Am)<. To do so, let us set
Tm:=n=12mXn2Bm:={Tm+1Tm2m2}T¯m:=Tm2m
for mZ+. Clearly {Bm}mZ+ are independent and P(Am)=P(Bm). So, it is enough to show
P(lim supmBm)=P(lim supmTm+1Tm2m2)=0
and then use Borel-Cantelli Lemma.

But, by Strong Law of Large Numbers, we know T¯m1 almost surely, and hence
Tm+1Tm2m1
almost surely. This completes the proof for the symmetric case replacing the constant 8 by 4.

To eliminate the symmetry assumption, let (Ω,F,P) be the space on which Xn's are defined and define {Yn} on (Ω2,F2,P2) as
Yn(ω1,ω2):=Xn(ω1)Xn(ω2)
for nZ+. It is easy to check that the Yn's are symmetric. This gives
lim supn|Sn(ω1)Sn(ω2)|Λn25/28
for Q almost every (ω1,ω2)Ω2.

Now, we wish to proceed by contradiction. If possible, let
lim supn|Sn(ω)|Λn>8
on a set of positive P measure so that by Kolmogorov's 01 Law, there is an ε>0 such that
lim supn|Sn(ω)|Λn8+ε
for P almost every ωΩ. Using Fubini's Theorem, this gives that for P2 almost every (ω1,ω2)Ω2, there is a {nm(ω1):mZ+}Z+ such that nm(ω1) increases to so that the limit
limmSnm(ω1)(ω1)Λnm(ω)8+ε
exists and
lim infm|Snm(ω1)(ω2)|Λnm(ω1)limmSnm(ω1)(ω1)Λnm(ω)lim supm|Snm(ω)(ω1)Snm(ω)(ω2)|Λnm(ω1)ε
so that by Fubini's Theorem, we have an {nm:mZ+}Z+ such that nm increases to and
lim infm|Snm(ω2)|Λnmε
for P almost every ω2Ω. This contradicts Markov's Inequality owing to
E[(SnΛn)2]=12loglogn0
hence completing the proof.

Proof of Lemma 3

As usual, we need to discuss some new ideas before we can move to our proof. We define
Λμ(ξ):=log(Mμ(ξ))
and define the Legendre transform of Λμ as
Iμ(x):=sup{ξxΛμ(ξ):ξR}
for xR. Also, let Ξμ:=(Λμ)1.

Lemma 4 : There exists a δ(0,1] and a K(0,) such that [mδ,m+δ](α,β) and
|Λμ(Ξ(x))|K|Ξμ(x)|K|xm|
and
|Iμ(x)(xm)22σ2|K|xm|3
for all x[mδ,m+δ]. In particular, if 0<ε<δ, then
P(|S¯nm|ε)2exp[n(ε22σ2Kε3)]
and if |am|<δ, then
P(|S¯nm|<ε)(1Knε2)exp[n(|am|22σ2+K|am|(ε+|am|2))]
for any ε>0.

Proof : Recall that we started with the assumption that our random variables have mean 0 and variance 1. So, we have Λμ(0)=Λμ(0)=0 and Λμ(0)=1 and hence Ξμ(0)=0 and Ξμ(0)=1. So, we can find an M(0,) and a δ(0,1] with α<δ<δ<β for which
|Ξμ(x)x|M|x|2
whenever |x|δ and
|Λμ(ξ)ξ22|M|ξ|3
whenever |ξ|(M+1)δ. This immediately proves
|Ξμ(x)|(M+1)|x|
for |x|δ and the estimate for Iμ follows since Iμ(x)=Ξ(x)xΛμ(Ξμ(x)).

This gives us enough room to discuss our proof.

Proof of Lemma 3 : Begin by setting
λn:=Λnn=2loglognn
wherever it is defined.

To prove the first part, we apply the upper bound of P(|S¯nm|ε) obtained in Lemma 4 to see that
P(|S~n|R)=P(|S¯n|Rλn)2exp[n(R2λn22KR3λn3)]
for sufficiently large nZ+.

To prove the next part of the lemma, we note
P(|S~na|<ε)=P(|S¯nan|<εn)
where an=aλn and εn=ελn.So, using the lower bound on P(|S¯nm|<ε) obtained in Lemma 4 to conclude
P(|S~na|<ε)(1Knεn2)exp[n(an22+K|an|(εn+an2))](1K2ε2loglogn)exp[(a2+2K|a|(ε+a2λn))loglogn]
for sufficiently large n.


Acknowledgement : I'd like to thank Satvik Saha, who has been my partner in reading this proof, patiently answering my barrage of inane questions and making clever observations that I've shamelessly passed off as my own in this write-up. I feel safe admitting it here because, let’s be real, no one’s reading this far anyway.

Introduction to Sumsets II

This is a sequel to a previous post . However, one can technically read it without reading the previous post. Given A,BZ for a c...