Search

Another Beautiful Pigeon-Hole Argument

I first encountered this question in an assignment in our Graph Theory and Combinatorics course. Later, I also saw this in an MSE post and answered immediately. I was extremely happy to be able to solve this problem and now seems to be a good time to put it on this blog.

Problem Statement -
Let S={1,2,,280}. Find the smallest natural number n such that every n-element subset of S contains 5 pairwise relatively prime numbers.

Solution -
Before we begin with the real question, we note that 280=16. This means that all composite numbers below 280 must have one prime factor in the set {2,3,5,7,11,13}. We will use this fact in our answer.

First, we will consider the subset A of S that only contains multiples of 2,3,5 and 7. We can use Principle of Inclusion and Exclusion to find the cardinality of A as
|A|=2802+2807+2803+280528014280628010280152802128035+28070+280105+28042+28030280210
which gives |A|=216.

Now, note that, A only has multiples of four prime numbers. So, using Pigeon Hole Principle, we can say that any five elements of A must have at least two multiples of a particular prime, which proves that any 5 element subset of A is NOT pairwise relatively prime.

This proves that if we want to have an n-element subset of S such that any five of them will be relatively prime, then we must have n>216.

Now, notice that the cardinality of A is 216 and A contains exactly 4 prime numbers (namely, 2,3,5 and 7). So, the other 212 numbers in A (and hence in S) are composite.

Let us try to find the other composite numbers in S. These must be of the form 11m or 13n for some m,nN such that m,n>10 (since all multiples till 10 have already been counted). It is easy to check that the only such composite numbers are  121,143,169,187,209,221,247,253. That gives precisely 8 more composite numbers in S. So, S has 220 composite numbers and 59 primes (with a 1 which neither).

Now, consider a subset X of S such that |X|=217. We claim that there are five elements in X such that any two of them are pairwise relatively prime. To prove that, let us assume on the contrary that our claim is false. So, any five element subset of X must have two relatively non-prime elements. This forces the number of prime numbers in X to be less than or equal to 4 since if there are more than four prime numbers, then the set of these primes has all pairwise relatively prime numbers. So, there are at most four primes in X which means that there are at least 213 composite numbers in X. This means that the number of composite numbers outside X (i.e., in SX) is at most 7.

Now, consider the following eight sets
{22,32,52,72,132}{2×31,3×29,5×23,7×19,11×17}{2×47,3×43,5×41,7×37,13×19}{2×41,3×37,5×31,7×29,11×23}{2×23,3×19,5×17,7×13,112}{2×43,3×41,5×37,7×31,13×17}{2×37,3×31,5×29,7×23,11×19}{2×29,3×23,5×19,7×17,11×13}
and note that the 40 elements in these 8 sets are all distinct because of the Unique Factorization Theorem. Also, since there are eight sets, and at most seven composite numbers can be outside X, by the Pigeon Hole Principle, at least one of these sets must be in X. Also, clearly, in all of the above sets, the elements are mutually co-prime.

This completes the proof.

Four Proofs of a Theorem

A set of positive integers AN is called a Sidon Set or a Sidon Sequence if the equation a+b=c+d does not have any non-trivial solutions in A. They were named after Hungarian mathematician Simon Sidon who was inspired by certain problems in Fourier series to ask Erdős about the possible growth of such sequences.

Since then, there has been an extensive amount of work on this topic exploring a plethora of different questions about finite and infinite Sidon sets, and various generalizations of them. The classic result regarding bounds on sizes of Sidon sets was given by Erdős and Turan who also provided a first proof of their theorem. However, here we will discuss four other proofs which I find are easier than the original one. They all yield the same bound, with the exception of Ruzsa's proof which when refined can change the final constant to 1/2 instead of 1.


Theorem (Erdős, Turan) : Let A[n] be Sidon Set. Then

|A|<n1/2+n1/4+1

where [n]:={1,2,,n}.


Proof 1 (Lindström) : Let A={a1<a2<ak}[n], ajai are all distinct. Indeed this property of all the differences (except 0) being distinct is a defining feature of Sidon sets and is often one of the most important ingredients in proofs of quite strong results.

Define

S1:=1jk1(aj+1aj)=aka1

so that

(k1)k2S1n

as S1 is the sum of (k1) distinct integers, and is therefore bigger than the sum of the first (k1) integers. This yields the bound k2n and was already observed by Erdős. But to refine this further, we need to test the limits of this idea.

Define

S2:=1jk1(aj+2aj)=ak+ak1a2a12n

so that

(2k3)(2k2)2S1+S23n

which again follows from the fact that the differences considered in S1 are all different from those considered in S2 and hence S1+S2 is the sum of (2k2) distinct integers. This by itself improves the bound to k1.5n. Of course we do not need to stop here and can indeed continue to the -th iteration. After iterating this times, we have

D(D+1)2S1++S(1+2++)n

where D:=(k1)++(k)=k(+1)2.

Rearranging, we get

k+12n(+1) 

which gives

kn+n2+2+12

using the inequality 1+x1+x/2.

It is easy to check that =n1/4 is the optimal choice, hence completing the proof.


Proof 2 (Ruzsa) : This proof uses a lemma which is fairly well known and appears in different texts and in a variety of different contexts. It is usually attributed to Johnson although it has been rediscovered several times.

Lemma : Let A be a family of k-sets A1,,Am so that AiAj has cardinality atmost t for any i,j. Then,

vk2mtm+kt

where v:=|Ai|.

Proof of Lemma : For an element xAi, let dx=|{i:xAi}| so that

tm(m1)+mki(j|AiAj|)=xdx2(dx)2v=k2m2v

hence completing the proof.

Now let A[n] be a Sidon set and let k:=|A|. Take a positive integer m and construct

A:={Ai:=A+(i1):i[m]}

where A+(i1):={a+i1:aA}. Notice that Ai[n+m1].

The key observation is that |AiAj|1 for ij as if x,yAiAj for some i<j, then we have a,b,c,dA with a+i=x=b+j and c+i=y=d+j implying a+d=(xi)+(yj)=(xj)+(yi)=b+c, and hence x=y.

Applying Lemma with t=1 and vn+m1, we get

(n+m1)(m+k1)k2m

hence completing the proof.


Proof 3 (Graham) : This is the shortest proof in the list. In fact, this is probably one of those proofs that are so short and immediate that they feel like magic. The proof is essentially a direct application of a well known lemma.

Recall the Weyl - van der Corput Lemma. This states that if ξ(n) is a real valued function supported on the interval [1,N], then

|nξ(n)|2N+H1H|h|<H(1|h|H)nξ(n)ξ(n+h)

for any positive integer H.

Now, let A be a Sidon set and ξ(n) be the characteristic function of A, i.e., ξ(n)=1 if nA and ξ(n)=0 otherwise. We have

|A|2N+H1H(|A|+H1)

since nξ(n)ξ(n+h)=|A| if h=0 and nξ(n)ξ(n+h)1 otherwise. This is due to the fact that for a fixed constant h, there can be atmost one n for which both n and n+h are in A (since all differences in A are distinct).

Using H=N3/4+1 completes the proof.


Proof 4 (Cilleruelo) : This proof will require the introduction of some new definitions, including set additions. For A,BN we define A+B:={a+b:aA,bB} to be the collection of all sums of A and B. The set AB is defined analogously.

Now, for A,BN, let us define

rA+B(z):=|{z=a+b:aA,bB}|

to be the number of representations of an integer z as sum of an element from A and another from B. We also define

E:=xA+BrA+B2(x)

and call it the additive energy of A+B.

We immediately have rAA(0)=|A|. It is straightforward to check that

xA+BrA+B2(x)=xA+BrAA(x)rBB(x)xA+BrA+B(x)=|A||B||A+B|xA+BrA+B2=|A+B|(|A||B|+x0rAA(x)rBB(x))

for any A,BN.

Now, let A[N] be a Sidon set and B[] so that A+B[N+]. So, using the second property above, we have

|A|2(+1)2(N+)(|A|(+1)+x0rBB(x))

as rAA(x)1 for all x since A is Sidon (and hence all differences are distinct).

Also,

x0rBB(x)=|{b1b2:b1b2;b1,b2B}|=(+1)

so that

|A|2(+1)(N+)(|A|+)

and hence putting =n(|A|1) completes the proof.


A keen reader might try to make a case for Proof 4 being essentially the same as Proof 2, only written in a different language. While this may be true to some extent, it should be noted that Cilleruelo's goal (in this paper) was to consider Sidon sets in Nd and to prove an analogous result for this case using the method of additive energy. So, this proof is almost a byproduct of his actual work.

It is also worth mentioning that the linked document to Graham's proof is an excellent article which tries to show how the Weyl - van der Corput Lemma alone can prove some really strong results in the field. It is quite surprising how one inequality by itself can describe so many results!


The theorem we described remained the best result until very recently when an improvement was provided by Balogh, Füredi and Roy proving

|A|<n1/2+(1γ)n1/4

for some γ0.002. In the same spirit, O'Bryant improved the constant from 0.998 to 0.99703, and then again to 0.98183 with Carter and Hunter.

It should however be noted that we are still quite far away from proving what we expect to be true. Addressing the original question of Sidon, Erdős conjectured (and offered $500 for a proof or disproof) that 

|A|<n1/2+o(nε)

for all ε>0. This is considered to be a pretty hard problem and it will probably require some time before we can answer it. Maybe one of the readers will be able to help someday!


Acknowledgement : I want to express my heartfelt thanks to my supervisor, Prof. Ramachandran Balasubramanian, who introduced me to the beautiful topic of Sidon sets. His straightforward approach to understanding mathematics has greatly influenced my own mathematical intuition and helped me greatly in understanding these proofs.

A Dense Infinite Sidon Sequence

A set of positive integers AN is called a Sidon Set or a Sidon Sequence if the equation a+b=c+d does not have any non-trivial solutions in A. They were named after Hungarian mathematician Simon Sidon who was inspired by certain problems in Fourier series to ask Erdős about the possible growth of such sequences.

Since then, there has been an extensive amount of work on this topic exploring a plethora of different questions about finite and infinite Sidon sets, and various generalizations of them. In this post, we are mainly concerned about a breakthrough theorem in the field of infinite Sidon sets.

First consider the following example.

Example : (Greedy Algorithm) If S[1,n] is already a Sidon set, we can find x such that x does not equal a, b, c or a+bc for any a,b,cS and S{x}N is Sidon. Since there are at most n choices for each of a, b and c and a,b,cn, choosing x>n3 is enough.

It is clear that this sequence satisfies S(x)x1/3 where S(x):=|S[1,x]|. It was conjectured by Erdős that there is a Sidon sequence S with S(x)x1/2ε for all ε>0 (recall that f(x)g(x) means that there exists a constant C such that f(x)Cg(x) for all large x and f(x)g(x) is defined similarly). He also proved that any arbitrary Sidon sequence will have the close upper bound of

S(x)x1/2(logx)1/2

for infinitely many x. This exhibits that finite segments of an infinite Sidon set must sometimes be much smaller than it is expected in finite Sidon sets.

The lower bound was first improved by Ajtai, Komlós and Szemerédi who proved the existence of S satisfying

S(x)x1/3(logx)1/3

and then finally Ruzsa gave a breakthrough by finding another S satisfying

S(x)x21o(1)

which has remained the best result in this area since 1998. It is this proof that I will present in this post.

For those asking why we care, it is known that the number of elements in the biggest (usually called dense in literature) Sidon subset of the interval [1,x] is of the order x (see here) and what Erdős' conjecture claims is that N is quite similar. In other words, we can find Sidon subsets of N with (logarithmic) density arbitrarily close to x (although x is not achievable, and this was proven by Erdős).


Theorem : There exists an infinite Sidon sequence B such that

B(N)=Nγ+o(1)

where γ=21.


Idea of the Proof : Notice that the set of primes P form a multiplicative Sidon set, that is p1p2=p3p4 for piP implies {p1,p2}={p3,p4}. So, {lnp}pP forms a Sidon set in R.

This means that if we can cut the binary representation of lnp in a proper way, we still almost surely retain the Sidon property. In fact, all the observations we made so far are true for {αlnp}pP for all α. Indeed we will start with a bunch of α so that we can later argue probabilistically. 


Proof Setup : The first step is to look at the number αlnp in binary. Choose α[1,2] and write

αlnp=i=0kpεip2i+i=1δip2i

where kp=log(αlnp). So, the ε-string is the integral part of this number and the δ-string is the part after the decimal point (or is it the binary point?). Here, and in the rest of the proof, log will represent log2 and ln will be a placeholder for logarithm to any pre-specified base - the reader can think of it as loge for convinience.

Now, we set β=2+1 and γ=1/β. Then, we define

Kp=min{i>2:2(i1)2>pβ}

so that Kp=2+βlogp. A more natural way to present this proof could have been to start with an arbitrary β and realize at the end of the proof that this is the value of β that optimizes the density.

Now, we need to do the proper cutting. So, we define

λp=i=0kpεip2i+i=1Kp2δip2i

by cutting the binary expansion of αlnp at the Kp2-th place.

Now, we use the square indices to construct

Δip=j=(i1)2+1i2δj2i2j

by cutting the bigger block into smaller pieces.

In binary, the numbers look like

Δ1,p=δ1Δ2,p=δ2δ3δ4Δ3,p=δ5δ6δ7δ8δ9

and so on. From now on, we will often supress p or α from our notation whenever it doesn't interfere with the clarity of the proof.

Finally, we define bp as

010ΔKΔ30ε2000Δ20ε1000Δ10ε0000

which was done in the following way : first write the Δi's in a piece of paper and put five blanks in between each of them, and after Δ1. Fill up the second blank to the right of Δi with εi1. The 4th blank always contains a 0 except the one to the left of ΔK - this is done to keep track of the size of bp. The rest of the blanks are all 0

More formally,

bp=i=1KΔip2(i1)2+5i+i=0kεip2i2+5i+2+tp

where tp=2K2+5K+4.

Further, we put Δip=0 for i>K and εip=0 for i>k. Our Sidon set will be a subset of the set {bp}pP.


Plan of the Proof : Recall how we introduced a padding of 0's in the definition of bp. This was done to ensure that there are no carry overs when we add bp1+bp2. This makes sure that

bp+bs=bq+brtp+ts=tq+tr,εip+εis=εiq+εir,Δip+Δis=Δiq+Δir

for all i. Now, consider

(*)bp+bs=bq+br,bp>bqbr>bs

for some p,q,r,s.

Define PK={p:Kp=K}. These sets provide a proper partitioning of the primes with respect to their sizes, and hence have the following important property. If () holds, then there are integers KL such that

(1)λp+λs=λq+λr

and p,qPK, r,sPL.

The artistic part is now over and what's left of our work is the technical part. This is the part where we try to filter out the largest Sidon subset of {bp}pP. The techniques are all mainly aimed towards estimating the number of solutions of () in the set {bp}pP.


Bounds : Notice that

|λpαlnp|2K2<pβ

follows from definitions. Also, if () holds, then

|psqr|<8qr2L2,qr>2L22,(K1)2>(β1)(L1)2+β(2L3)

where K and L are as in (1).

Now, we define

JKL=|{p,q,r,s:pqPK,rsPL,|psqr|<8qr2L2}|

and try to estimate it.


Lemma : We have

JKL22γ((K1)2+(L1)2)L2

where JKL is defined as above.

Proof : The Diophantine equation psqr=a has the general solution

p=p0+tr,q=q0+ts

where p0, q0 are the smallest solution.

As p<2γ(K1)2, we have t<2γ(K1)2r. So, for fixed a,r,s, the number of solutions is O(2γ(K1)2r).

Again, |a|r2γ(K1)2L2. So, for fixed r,s, the number of solutions is O(22γ(K1)2L2) and r,s<2γ(L1)2.


Lemma : We have

2K2αlnp2K2αlnq(mod2K2L2)

assuming () holds.

Proof : It is clear that

2K2αlnpi=L2+1K2δip2K2i(mod2K2L2)

from definition. Also for L<iK, we have Δip=Δiq so that δjp=δjq for L2<jK2. So, RHS remains unchanged if we replace p by q.


Lemma : Let K>L and pqPK be given. Let there be at least one pair r,sPL and an α[1,2] such that () holds. Then, we have

μ(α:2K2αlnp2K2αlnq(mod2K2L2))2L2K2

where μ is the Lebesgue measure.

Proof : Call A=2K2lnpq, M=2K2L2 and notice that the property we are after is

Aα0 or 1(modM)

and since Aα is evenly distributed modulo M, we have

μ(α:Aα0 or 1(modM))1M

hence completing the proof.


The proofs of the next two lemmas are straightforward and hence we will refrain from exhibiting them.

Lemma : For pqPK and r,sPL, we have

μ(α:bp+bs=bq+br){=0if|psqr|8qr2L22L2K2if|psqr|<8qr2L2

for K>L.

Lemma : If

GKL(α):=|{p,q,r,s:pqPK,r,sPL,() holds }|

then

12GKL(α)dα22γ((L1)2+(K1)2)K2

for K>L.


Lemma : Define

GK(α)=|{p,q,r,s:p,qPK,() holds }|

so that

12GK(α)dα2γ(K1)22K

for all K.

Proof : By definition, we have GK(α)=LKGKL(α). And since GKL(α)0 only if (L1)2<(K1)2/(β1), we obtain an estimate of 2Z with

Z=(2β11)(K1)22K+1

on which we use the fact that 2β11=1β.

In fact, the value of β was chosen in the first place to satisfy 2β11=1β.


Proof of Main Theorem : With all the setup in hand, the rest of the proof is essentially an exercise in probability theory. Define the event

EK:=[GKNK]

where NK:=2γ(K1)2K. This gives

P(EK)=P(GKNK)E(GK)NK(using Markov Inequality)=1NK12GK(α)dα2K

and hence

KP(EK)<P(lim infKEKC)=1

by Borel-Cantelli lemma.

This guarantees the existence of an α such that

GK(α)<2γ(K1)2K

for all (except at most finitely many) K. Fix this α.

Using Prime Number Theorem, the cardinality of PK is

|PK|=π(2γ(K1)2)π(2γ(K2)2)2γ(K1)2γK2ln2

and hence GK(α)<|PK|/2 for K>K0.

Now, for each p,q,r,s satisfying () with pPK, we omit p and define QK to be the set of remaining elements. Then we define

B=K>K0QK

which, by definition, is a Sidon set.

Now, we have

bp<2K2+5K+5<2(K+3)2

implying that for K=logx3, maxQK<x. This gives

B(x)(12ε)π(2γ(K1)2)=xγ+o(1)

hence completing the proof. The reader can check that a similar calculation yields an upper bound.


Although any improvements on the density is still beyond our reach, some analogues of this idea have been suggested and they have been able to yield similar results in more general settings. They have some excellent potential of their own, but that is a topic for a different post!


Acknowledgement : I want to express my heartfelt thanks to my supervisor, Prof. Ramachandran Balasubramanian, who introduced me to the beautiful topic of Sidon sets. His straightforward approach to understanding mathematics has greatly influenced my own mathematical intuition and helped me greatly in understanding this paper.

AMM problem 12451

This is AMM problem 12451 that I solved with my friends Satvik Saha and Sohom Gupta.


Problem Statement -
Let A and B be complex n×n and n×m matrices respectively. Let 0m,n denote the m×n zero matrix and let Im denote the m×m identity matrix. Prove 

exp[AB0m,n0m,m]=[exp(A)(01exp(tA)dt)B0m,nIm]

where exp be the matrix exponential function.


Solution -
Setting 

P=[AB0m,n0m,m]

we observe that the (unique) solution of the IVP z=Pz with z(0)=z0 is precisely z(s)=exp(sP)z0 for all z0Cm+n.

Thus, it suffices to check that for all x0Cn,y0Cm, the curves

x(s)=exp(sA)x0+(01exp(stA)dt)sBy0,y(s)=y0

solve the system of differential equations

x=Ax+By,y=0

and then put s=1.

This is easily verified; with x(s) as above, we have

x(s)=Aexp(sA)x0+(01(In+stA)exp(stA)dt)By0=Aexp(sA)x0+exp(sA)By0

via integration by parts, and

Ax(s)=Aexp(sA)x0+(01sAexp(stA)dt)By0=Aexp(sA)x0+(exp(sA)In)By0=x(s)By(s)

hence completing the proof.

AMM problem 12450

This is AMM problem 12450 that I solved with my friends Satvik Saha and Sohom Gupta.


Problem Statement -
Let n be an odd positive number, and suppose that x1,,xn are chosen randomly and uniformly from the interval [0,1]. For 1in, let yi=xixi2. What is the expected value of the median of {y1,,yn}?


Solution -
Let g(x)=xx2 on [0,1], and let

h(y)=114y2

be the inverse of the g restricted to [0,1/2].

Let XU(0,1), and set Y=g(X).

Then, by symmetry, Y has density 

fY(y)=2fX(h(y))ddyh(y)=2h(y)

from which it follows that the cdf of Y on [0,1/4] is FY(y)=2h(y). From an iid sample y1,,ynFY where n=2k+1, the sample median ξ=y[k+1] has density

fξ(ξ)=n!(k!)2FY(ξ)k(1FY(ξ))kfY(ξ)=n!(k!)2(2h(ξ))k(12h(ξ))k2h(ξ)

and hence, the expected value of the median is

E[ξ]=ξfξ(ξ)dξ=01/4n!(k!)2ξ(2h(ξ))k(12h(ξ))k2h(ξ)dξ=01n!(k!)2g(x/2)xk(1x)kdx=01n!(k!)2[x2x24]xk(1x)kdx=n!(k!)2[k!(k+1)!2(n+1)!k!(k+2)!4(n+2)!]=3n+516n+32

which is the final answer.

Infinitely many Proofs of Pythagoras' Theorem

This post is inspired by an MSE question and the brilliant answers that it received. However, it can be understood independently. Everythin...