Search

Another Beautiful Pigeon-Hole Argument

I first encountered this question in an assignment in our Graph Theory and Combinatorics course. Later, I also saw this in an MSE post and answered immediately. I was extremely happy to be able to solve this problem and now seems to be a good time to put it on this blog.

Problem Statement -
Let S={1,2,,280}. Find the smallest natural number n such that every n-element subset of S contains 5 pairwise relatively prime numbers.

Solution -
Before we begin with the real question, we note that 280=16. This means that all composite numbers below 280 must have one prime factor in the set {2,3,5,7,11,13}. We will use this fact in our answer.

First, we will consider the subset A of S that only contains multiples of 2,3,5 and 7. We can use Principle of Inclusion and Exclusion to find the cardinality of A as
|A|=2802+2807+2803+280528014280628010280152802128035+28070+280105+28042+28030280210
which gives |A|=216.

Now, note that, A only has multiples of four prime numbers. So, using Pigeon Hole Principle, we can say that any five elements of A must have at least two multiples of a particular prime, which proves that any 5 element subset of A is NOT pairwise relatively prime.

This proves that if we want to have an n-element subset of S such that any five of them will be relatively prime, then we must have n>216.

Now, notice that the cardinality of A is 216 and A contains exactly 4 prime numbers (namely, 2,3,5 and 7). So, the other 212 numbers in A (and hence in S) are composite.

Let us try to find the other composite numbers in S. These must be of the form 11m or 13n for some m,nN such that m,n>10 (since all multiples till 10 have already been counted). It is easy to check that the only such composite numbers are  121,143,169,187,209,221,247,253. That gives precisely 8 more composite numbers in S. So, S has 220 composite numbers and 59 primes (with a 1 which neither).

Now, consider a subset X of S such that |X|=217. We claim that there are five elements in X such that any two of them are pairwise relatively prime. To prove that, let us assume on the contrary that our claim is false. So, any five element subset of X must have two relatively non-prime elements. This forces the number of prime numbers in X to be less than or equal to 4 since if there are more than four prime numbers, then the set of these primes has all pairwise relatively prime numbers. So, there are at most four primes in X which means that there are at least 213 composite numbers in X. This means that the number of composite numbers outside X (i.e., in SX) is at most 7.

Now, consider the following eight sets
{22,32,52,72,132}{2×31,3×29,5×23,7×19,11×17}{2×47,3×43,5×41,7×37,13×19}{2×41,3×37,5×31,7×29,11×23}{2×23,3×19,5×17,7×13,112}{2×43,3×41,5×37,7×31,13×17}{2×37,3×31,5×29,7×23,11×19}{2×29,3×23,5×19,7×17,11×13}
and note that the 40 elements in these 8 sets are all distinct because of the Unique Factorization Theorem. Also, since there are eight sets, and at most seven composite numbers can be outside X, by the Pigeon Hole Principle, at least one of these sets must be in X. Also, clearly, in all of the above sets, the elements are mutually co-prime.

This completes the proof.

Four Proofs of a Theorem

A set of positive integers AN is called a Sidon Set or a Sidon Sequence if the equation a+b=c+d does not have any non-trivial solutions in A. They were named after Hungarian mathematician Simon Sidon who was inspired by certain problems in Fourier series to ask Erdős about the possible growth of such sequences.

Since then, there has been an extensive amount of work on this topic exploring a plethora of different questions about finite and infinite Sidon sets, and various generalizations of them. The classic result regarding bounds on sizes of Sidon sets was given by Erdős and Turan who also provided a first proof of their theorem. However, here we will discuss four other proofs which I find are easier than the original one. They all yield the same bound, with the exception of Ruzsa's proof which when refined can change the final constant to 1/2 instead of 1.


Theorem (Erdős, Turan) : Let A[n] be Sidon Set. Then

|A|<n1/2+n1/4+1

where [n]:={1,2,,n}.


Proof 1 (Lindström) : Let A={a1<a2<ak}[n], ajai are all distinct. Indeed this property of all the differences (except 0) being distinct is a defining feature of Sidon sets and is often one of the most important ingredients in proofs of quite strong results.

Define

S1:=1jk1(aj+1aj)=aka1

so that

(k1)k2S1n

as S1 is the sum of (k1) distinct integers, and is therefore bigger than the sum of the first (k1) integers. This yields the bound k2n and was already observed by Erdős. But to refine this further, we need to test the limits of this idea.

Define

S2:=1jk1(aj+2aj)=ak+ak1a2a12n

so that

(2k3)(2k2)2S1+S23n

which again follows from the fact that the differences considered in S1 are all different from those considered in S2 and hence S1+S2 is the sum of (2k2) distinct integers. This by itself improves the bound to k1.5n. Of course we do not need to stop here and can indeed continue to the -th iteration. After iterating this times, we have

D(D+1)2S1++S(1+2++)n

where D:=(k1)++(k)=k(+1)2.

Rearranging, we get

k+12n(+1) 

which gives

kn+n2+2+12

using the inequality 1+x1+x/2.

It is easy to check that =n1/4 is the optimal choice, hence completing the proof.


Proof 2 (Ruzsa) : This proof uses a lemma which is fairly well known and appears in different texts and in a variety of different contexts. It is usually attributed to Johnson although it has been rediscovered several times.

Lemma : Let A be a family of k-sets A1,,Am so that AiAj has cardinality atmost t for any i,j. Then,

vk2mtm+kt

where v:=|Ai|.

Proof of Lemma : For an element xAi, let dx=|{i:xAi}| so that

tm(m1)+mki(j|AiAj|)=xdx2(dx)2v=k2m2v

hence completing the proof.

Now let A[n] be a Sidon set and let k:=|A|. Take a positive integer m and construct

A:={Ai:=A+(i1):i[m]}

where A+(i1):={a+i1:aA}. Notice that Ai[n+m1].

The key observation is that |AiAj|1 for ij as if x,yAiAj for some i<j, then we have a,b,c,dA with a+i=x=b+j and c+i=y=d+j implying a+d=(xi)+(yj)=(xj)+(yi)=b+c, and hence x=y.

Applying Lemma with t=1 and vn+m1, we get

(n+m1)(m+k1)k2m

hence completing the proof.


Proof 3 (Graham) : This is the shortest proof in the list. In fact, this is probably one of those proofs that are so short and immediate that they feel like magic. The proof is essentially a direct application of a well known lemma.

Recall the Weyl - van der Corput Lemma. This states that if ξ(n) is a real valued function supported on the interval [1,N], then

|nξ(n)|2N+H1H|h|<H(1|h|H)nξ(n)ξ(n+h)

for any positive integer H.

Now, let A be a Sidon set and ξ(n) be the characteristic function of A, i.e., ξ(n)=1 if nA and ξ(n)=0 otherwise. We have

|A|2N+H1H(|A|+H1)

since nξ(n)ξ(n+h)=|A| if h=0 and nξ(n)ξ(n+h)1 otherwise. This is due to the fact that for a fixed constant h, there can be atmost one n for which both n and n+h are in A (since all differences in A are distinct).

Using H=N3/4+1 completes the proof.


Proof 4 (Cilleruelo) : This proof will require the introduction of some new definitions, including set additions. For A,BN we define A+B:={a+b:aA,bB} to be the collection of all sums of A and B. The set AB is defined analogously.

Now, for A,BN, let us define

rA+B(z):=|{z=a+b:aA,bB}|

to be the number of representations of an integer z as sum of an element from A and another from B. We also define

E:=xA+BrA+B2(x)

and call it the additive energy of A+B.

We immediately have rAA(0)=|A|. It is straightforward to check that

xA+BrA+B2(x)=xA+BrAA(x)rBB(x)xA+BrA+B(x)=|A||B||A+B|xA+BrA+B2=|A+B|(|A||B|+x0rAA(x)rBB(x))

for any A,BN.

Now, let A[N] be a Sidon set and B[] so that A+B[N+]. So, using the second property above, we have

|A|2(+1)2(N+)(|A|(+1)+x0rBB(x))

as rAA(x)1 for all x since A is Sidon (and hence all differences are distinct).

Also,

x0rBB(x)=|{b1b2:b1b2;b1,b2B}|=(+1)

so that

|A|2(+1)(N+)(|A|+)

and hence putting =n(|A|1) completes the proof.


A keen reader might try to make a case for Proof 4 being essentially the same as Proof 2, only written in a different language. While this may be true to some extent, it should be noted that Cilleruelo's goal (in this paper) was to consider Sidon sets in Nd and to prove an analogous result for this case using the method of additive energy. So, this proof is almost a byproduct of his actual work.

It is also worth mentioning that the linked document to Graham's proof is an excellent article which tries to show how the Weyl - van der Corput Lemma alone can prove some really strong results in the field. It is quite surprising how one inequality by itself can describe so many results!


The theorem we described remained the best result until very recently when an improvement was provided by Balogh, Füredi and Roy proving

|A|<n1/2+(1γ)n1/4

for some γ0.002. In the same spirit, O'Bryant improved the constant from 0.998 to 0.99703, and then again to 0.98183 with Carter and Hunter.

It should however be noted that we are still quite far away from proving what we expect to be true. Addressing the original question of Sidon, Erdős conjectured (and offered $500 for a proof or disproof) that 

|A|<n1/2+o(nε)

for all ε>0. This is considered to be a pretty hard problem and it will probably require some time before we can answer it. Maybe one of the readers will be able to help someday!


Acknowledgement : I want to express my heartfelt thanks to my supervisor, Prof. Ramachandran Balasubramanian, who introduced me to the beautiful topic of Sidon sets. His straightforward approach to understanding mathematics has greatly influenced my own mathematical intuition and helped me greatly in understanding these proofs.

A Dense Infinite Sidon Sequence

A set of positive integers AN is called a Sidon Set or a Sidon Sequence if the equation a+b=c+d does not have any non-trivial solutions in A. They were named after Hungarian mathematician Simon Sidon who was inspired by certain problems in Fourier series to ask Erdős about the possible growth of such sequences.

Since then, there has been an extensive amount of work on this topic exploring a plethora of different questions about finite and infinite Sidon sets, and various generalizations of them. In this post, we are mainly concerned about a breakthrough theorem in the field of infinite Sidon sets.

First consider the following example.

Example : (Greedy Algorithm) If S[1,n] is already a Sidon set, we can find x such that x does not equal a, b, c or a+bc for any a,b,cS and S{x}N is Sidon. Since there are at most n choices for each of a, b and c and a,b,cn, choosing x>n3 is enough.

It is clear that this sequence satisfies S(x)x1/3 where S(x):=|S[1,x]|. It was conjectured by Erdős that there is a Sidon sequence S with S(x)x1/2ε for all ε>0 (recall that f(x)g(x) means that there exists a constant C such that f(x)Cg(x) for all large x and f(x)g(x) is defined similarly). He also proved that any arbitrary Sidon sequence will have the close upper bound of

S(x)x1/2(logx)1/2

for infinitely many x. This exhibits that finite segments of an infinite Sidon set must sometimes be much smaller than it is expected in finite Sidon sets.

The lower bound was first improved by Ajtai, Komlós and Szemerédi who proved the existence of S satisfying

S(x)x1/3(logx)1/3

and then finally Ruzsa gave a breakthrough by finding another S satisfying

S(x)x21o(1)

which has remained the best result in this area since 1998. It is this proof that I will present in this post.

For those asking why we care, it is known that the number of elements in the biggest (usually called dense in literature) Sidon subset of the interval [1,x] is of the order x (see here) and what Erdős' conjecture claims is that N is quite similar. In other words, we can find Sidon subsets of N with (logarithmic) density arbitrarily close to x (although x is not achievable, and this was proven by Erdős).


Theorem : There exists an infinite Sidon sequence B such that

B(N)=Nγ+o(1)

where γ=21.


Idea of the Proof : Notice that the set of primes P form a multiplicative Sidon set, that is p1p2=p3p4 for piP implies {p1,p2}={p3,p4}. So, {lnp}pP forms a Sidon set in R.

This means that if we can cut the binary representation of lnp in a proper way, we still almost surely retain the Sidon property. In fact, all the observations we made so far are true for {αlnp}pP for all α. Indeed we will start with a bunch of α so that we can later argue probabilistically. 


Proof Setup : The first step is to look at the number αlnp in binary. Choose α[1,2] and write

αlnp=i=0kpεip2i+i=1δip2i

where kp=log(αlnp). So, the ε-string is the integral part of this number and the δ-string is the part after the decimal point (or is it the binary point?). Here, and in the rest of the proof, log will represent log2 and ln will be a placeholder for logarithm to any pre-specified base - the reader can think of it as loge for convinience.

Now, we set β=2+1 and γ=1/β. Then, we define

Kp=min{i>2:2(i1)2>pβ}

so that Kp=2+βlogp. A more natural way to present this proof could have been to start with an arbitrary β and realize at the end of the proof that this is the value of β that optimizes the density.

Now, we need to do the proper cutting. So, we define

λp=i=0kpεip2i+i=1Kp2δip2i

by cutting the binary expansion of αlnp at the Kp2-th place.

Now, we use the square indices to construct

Δip=j=(i1)2+1i2δj2i2j

by cutting the bigger block into smaller pieces.

In binary, the numbers look like

Δ1,p=δ1Δ2,p=δ2δ3δ4Δ3,p=δ5δ6δ7δ8δ9

and so on. From now on, we will often supress p or α from our notation whenever it doesn't interfere with the clarity of the proof.

Finally, we define bp as

010ΔKΔ30ε2000Δ20ε1000Δ10ε0000

which was done in the following way : first write the Δi's in a piece of paper and put five blanks in between each of them, and after Δ1. Fill up the second blank to the right of Δi with εi1. The 4th blank always contains a 0 except the one to the left of ΔK - this is done to keep track of the size of bp. The rest of the blanks are all 0

More formally,

bp=i=1KΔip2(i1)2+5i+i=0kεip2i2+5i+2+tp

where tp=2K2+5K+4.

Further, we put Δip=0 for i>K and εip=0 for i>k. Our Sidon set will be a subset of the set {bp}pP.


Plan of the Proof : Recall how we introduced a padding of 0's in the definition of bp. This was done to ensure that there are no carry overs when we add bp1+bp2. This makes sure that

bp+bs=bq+brtp+ts=tq+tr,εip+εis=εiq+εir,Δip+Δis=Δiq+Δir

for all i. Now, consider

(*)bp+bs=bq+br,bp>bqbr>bs

for some p,q,r,s.

Define PK={p:Kp=K}. These sets provide a proper partitioning of the primes with respect to their sizes, and hence have the following important property. If () holds, then there are integers KL such that

(1)λp+λs=λq+λr

and p,qPK, r,sPL.

The artistic part is now over and what's left of our work is the technical part. This is the part where we try to filter out the largest Sidon subset of {bp}pP. The techniques are all mainly aimed towards estimating the number of solutions of () in the set {bp}pP.


Bounds : Notice that

|λpαlnp|2K2<pβ

follows from definitions. Also, if () holds, then

|psqr|<8qr2L2,qr>2L22,(K1)2>(β1)(L1)2+β(2L3)

where K and L are as in (1).

Now, we define

JKL=|{p,q,r,s:pqPK,rsPL,|psqr|<8qr2L2}|

and try to estimate it.


Lemma : We have

JKL22γ((K1)2+(L1)2)L2

where JKL is defined as above.

Proof : The Diophantine equation psqr=a has the general solution

p=p0+tr,q=q0+ts

where p0, q0 are the smallest solution.

As p<2γ(K1)2, we have t<2γ(K1)2r. So, for fixed a,r,s, the number of solutions is O(2γ(K1)2r).

Again, |a|r2γ(K1)2L2. So, for fixed r,s, the number of solutions is O(22γ(K1)2L2) and r,s<2γ(L1)2.


Lemma : We have

2K2αlnp2K2αlnq(mod2K2L2)

assuming () holds.

Proof : It is clear that

2K2αlnpi=L2+1K2δip2K2i(mod2K2L2)

from definition. Also for L<iK, we have Δip=Δiq so that δjp=δjq for L2<jK2. So, RHS remains unchanged if we replace p by q.


Lemma : Let K>L and pqPK be given. Let there be at least one pair r,sPL and an α[1,2] such that () holds. Then, we have

μ(α:2K2αlnp2K2αlnq(mod2K2L2))2L2K2

where μ is the Lebesgue measure.

Proof : Call A=2K2lnpq, M=2K2L2 and notice that the property we are after is

Aα0 or 1(modM)

and since Aα is evenly distributed modulo M, we have

μ(α:Aα0 or 1(modM))1M

hence completing the proof.


The proofs of the next two lemmas are straightforward and hence we will refrain from exhibiting them.

Lemma : For pqPK and r,sPL, we have

μ(α:bp+bs=bq+br){=0if|psqr|8qr2L22L2K2if|psqr|<8qr2L2

for K>L.

Lemma : If

GKL(α):=|{p,q,r,s:pqPK,r,sPL,() holds }|

then

12GKL(α)dα22γ((L1)2+(K1)2)K2

for K>L.


Lemma : Define

GK(α)=|{p,q,r,s:p,qPK,() holds }|

so that

12GK(α)dα2γ(K1)22K

for all K.

Proof : By definition, we have GK(α)=LKGKL(α). And since GKL(α)0 only if (L1)2<(K1)2/(β1), we obtain an estimate of 2Z with

Z=(2β11)(K1)22K+1

on which we use the fact that 2β11=1β.

In fact, the value of β was chosen in the first place to satisfy 2β11=1β.


Proof of Main Theorem : With all the setup in hand, the rest of the proof is essentially an exercise in probability theory. Define the event

EK:=[GKNK]

where NK:=2γ(K1)2K. This gives

P(EK)=P(GKNK)E(GK)NK(using Markov Inequality)=1NK12GK(α)dα2K

and hence

KP(EK)<P(lim infKEKC)=1

by Borel-Cantelli lemma.

This guarantees the existence of an α such that

GK(α)<2γ(K1)2K

for all (except at most finitely many) K. Fix this α.

Using Prime Number Theorem, the cardinality of PK is

|PK|=π(2γ(K1)2)π(2γ(K2)2)2γ(K1)2γK2ln2

and hence GK(α)<|PK|/2 for K>K0.

Now, for each p,q,r,s satisfying () with pPK, we omit p and define QK to be the set of remaining elements. Then we define

B=K>K0QK

which, by definition, is a Sidon set.

Now, we have

bp<2K2+5K+5<2(K+3)2

implying that for K=logx3, maxQK<x. This gives

B(x)(12ε)π(2γ(K1)2)=xγ+o(1)

hence completing the proof. The reader can check that a similar calculation yields an upper bound.


Although any improvements on the density is still beyond our reach, some analogues of this idea have been suggested and they have been able to yield similar results in more general settings. They have some excellent potential of their own, but that is a topic for a different post!


Acknowledgement : I want to express my heartfelt thanks to my supervisor, Prof. Ramachandran Balasubramanian, who introduced me to the beautiful topic of Sidon sets. His straightforward approach to understanding mathematics has greatly influenced my own mathematical intuition and helped me greatly in understanding this paper.

AMM problem 12451

This is AMM problem 12451 that I solved with my friends Satvik Saha and Sohom Gupta.


Problem Statement -
Let A and B be complex n×n and n×m matrices respectively. Let 0m,n denote the m×n zero matrix and let Im denote the m×m identity matrix. Prove 

exp[AB0m,n0m,m]=[exp(A)(01exp(tA)dt)B0m,nIm]

where exp be the matrix exponential function.


Solution -
Setting 

P=[AB0m,n0m,m]

we observe that the (unique) solution of the IVP z=Pz with z(0)=z0 is precisely z(s)=exp(sP)z0 for all z0Cm+n.

Thus, it suffices to check that for all x0Cn,y0Cm, the curves

x(s)=exp(sA)x0+(01exp(stA)dt)sBy0,y(s)=y0

solve the system of differential equations

x=Ax+By,y=0

and then put s=1.

This is easily verified; with x(s) as above, we have

x(s)=Aexp(sA)x0+(01(In+stA)exp(stA)dt)By0=Aexp(sA)x0+exp(sA)By0

via integration by parts, and

Ax(s)=Aexp(sA)x0+(01sAexp(stA)dt)By0=Aexp(sA)x0+(exp(sA)In)By0=x(s)By(s)

hence completing the proof.

AMM problem 12450

This is AMM problem 12450 that I solved with my friends Satvik Saha and Sohom Gupta.


Problem Statement -
Let n be an odd positive number, and suppose that x1,,xn are chosen randomly and uniformly from the interval [0,1]. For 1in, let yi=xixi2. What is the expected value of the median of {y1,,yn}?


Solution -
Let g(x)=xx2 on [0,1], and let

h(y)=114y2

be the inverse of the g restricted to [0,1/2].

Let XU(0,1), and set Y=g(X).

Then, by symmetry, Y has density 

fY(y)=2fX(h(y))ddyh(y)=2h(y)

from which it follows that the cdf of Y on [0,1/4] is FY(y)=2h(y). From an iid sample y1,,ynFY where n=2k+1, the sample median ξ=y[k+1] has density

fξ(ξ)=n!(k!)2FY(ξ)k(1FY(ξ))kfY(ξ)=n!(k!)2(2h(ξ))k(12h(ξ))k2h(ξ)

and hence, the expected value of the median is

E[ξ]=ξfξ(ξ)dξ=01/4n!(k!)2ξ(2h(ξ))k(12h(ξ))k2h(ξ)dξ=01n!(k!)2g(x/2)xk(1x)kdx=01n!(k!)2[x2x24]xk(1x)kdx=n!(k!)2[k!(k+1)!2(n+1)!k!(k+2)!4(n+2)!]=3n+516n+32

which is the final answer.

AMM problem 12449

This is AMM problem 12449 that I solved with my friends Satvik Saha and Sohom Gupta.


Problem Statement -

Let n be a positive integer with n2. The squares of an (n2+n1)-by-(n2+n1) grid are colored with up to n colors. Prove that there exist two rows and two columns whose four squares of intersection have the same color.


Solution -

Define a monochromatic i-pair as an unordered pair of row indices of squares in the same column with the same color i.

Note that if the same i-pair appears in two distinct columns, we have found a monochromatic rectangle. If not, each column must contribute distinct i-pairs, making the maximum number of possible i-pairs (n2+n12), hence the maximum number of monochromatic pairs n(n2+n12).

But if the k-th column contains ki squares of color i, then it contributes

i=1n(ki2)

monochromatic pairs, which is minimized when one ki is n and the remaining n1 many ki are n+1.

Thus, the k-th column contributes at least

(n2)+(n1)(n+12)=n(n2+n2)2

monochromatic pairs, giving a total of at least

(n2+n1)n(n2+n2)2=n(n2+n12)

monochromatic pairs.

Since we have equality, each color i must contribute the same number (i.e. (n2+n12)) of i-pairs, while simultaneously all ki{n,n+1}. The latter means that each color i must contribute a multiple of n many i-pairs, whereas

n(n2+n12)

which is a contradiction.

AMM problem 12448

This is AMM problem 12448 that I solved with my friends Satvik Saha and Sohom Gupta.


Problem Statement -

Given real numbers a1,,an. Prove

S1nS2S12+12n24(maxiaiminiai)2

where S1=i=1nai and S2=i=1nai2.


Solution -

Without loss of generality, let a1an and S1>0.

Setting μ=S1/n, we need to prove 

μσ2+μ2μ2+12n24(ana1)2n2

where σ2=1ni=1n(aiμ)2=S2nμ2.

Squaring both sides, this is equivalent to

()σ2n24(ana1)2n2+14μ2(n24(ana1)2n2)2

since μ>0.

Let μx=1nixi and σx2=1ni(xiμx)2 for real numbers x1,,xn.

Since σx2 is a smooth convex function of x, it attains its maximum on [a,b]n at an extreme point, i.e. when all xi{a,b}.

If exactly k many xi are equal to a and nk many are equal to b, we have

σx2=k(nk)n2(ba)2.

Now, (n2k)20 gives k(nk)n2/4, forcing k(nk)n2/4.

Thus, 

maxx[a,b]nσx2n24(ba)2n2

and σ2n2/4(ana1)2/n2, from which () follows directly.

Another Needlessly Sophisticated Proof of a Simple Result

This post belongs solely to a genre that should be called "joke proofs". The only thing a joke proof provides (except an awfully sophisticated argument for a very simple result) is intellectual stimulation. Some very well known contributions to this field are a proof of irrationality of 21/n using Fermat's Last Theorem or a proof of infinitude of primes using irrationality of ζ(3) (although infinitude of primes have received some more suspicious entries as well).

The following theorem (if you can call it a theorem) follows directly from looking at the integers {(k+1)!+}=2k+1. But our proof instead uses two other well known results, namely the fact that the sum of reciprocals of primes diverges (proof) and something slightly stronger than the fact that the sum of reciprocals of twin primes converges (proof). It is well known that an exact same proof technique runs through to show that if Sk={pP:p+kP} where P is the set of primes, then sSk1s converges. This is what we will use in the proof.

As far as my knowledge goes (which is not too far), this proof is completely original, although that's not something to be very proud of!


Theorem : For any given k, there exists k consecutive composite numbers.

Proof : If possible, let

supp1,p2 are consecutive primes|p1p2|=B

for some integer B.

Then

i=1Bp,p+iP1ppP1p

where P is the set of primes.

The LHS converges by Brun sieve and the RHS diverges, hence giving a contradiction!

AMM problem 12435

This is AMM problem 12435 that I solved with my friends Satvik Saha and Sohom Gupta.


Problem Statement -

For a positive integer n, let d(n) be the number of positive divisors of n, let ϕ(n) be Euler's totient function, and let q(n)=d(ϕ(n))/d(n). Find infnq(n) and supnq(n).


Solution -

We claim that infnq(n)=0 and supnq(n)=.

To show that supnq(n)=, note that q(p)=d(p1)/2 for primes p, which can be made arbitrarily large. Indeed, for any N, the arithmetic progression {!k+1}kN must contain a prime p (due to Dirichlet) whence p1=!k has at least M divisors so that d(p1).

Next, we show that infnq(n)=0. To do so, we choose arbitrary mN and find N such that q(N)1/2m1. Pick a sequence Qm of m primes q1qm such that qm<2q1; we will show later that this is always possible via the Prime Number Theorem. Let P={p1,,pn}  be all the primes less than q1. Note that each qi1 and each pj1 can be factored solely in terms of the primes from P.

For k1,,knN, consider

N=N(k1,,kn)=p1k1pnkn×q1qm,

With this, we have 

d(N)=(k1+1)(kn+1)×2m

and

ϕ(N)=p1k11pnkn1×(p11)(pn1)×(q11)(qm1)=p1k1+α11pnkn+αn1

where the integers αj0 depend only on Qm. Thus,

d(ϕ(N))=(k1+α1)(kn+αn)

and

q(N(k1,,kn))=(k1+α1k1+1)(kn+αnkn+1)×12m.

Taking min{k1,,kn}, we can make q(N(k1,,kn)) arbitrarily close to 1/2m, in particular less than 1/2m1.

To prove the existence of Qm for each mN, it is enough to show that there is nN such that [n,2n] contains at least m primes. By the Prime Number Theorem, the number of primes in [n,2n] is of the order

2nlog(2n)nlog(n)nlog(n)

which grows arbitrarily large as n.



Note : Here's an easier proof of infnq(n)=0 assuming that there are infinitely many Fermat primes. Consider the sequence

N=2kF1Fn

where F1,,Fn are the first n Fermat primes. This gives

q(N)=k+α1++αnk+1×12n

where Fi=2αi+1. Taking k and n to infinity, we complete the proof.


Thanks to this argument, if infnq(n)0, then that implies that there are only finitely many Fermat primes, which resolves a long standing conjecture about Fermat primes. Since that is highly unlikely, the inf has to be 0 - this is a classic case of proof by existence of proof!

AMM problem 12377

This is my first post (or second, depending on how you look at the world). For this, I decide to post the first AMM problem I solved with my friends Satvik Saha and Sohom Gupta.


Problem Statement -

We define a one-drop number as in problem 12377 from Problems and Solutions, The American Mathematical Monthly, Volume 130, 2023, Issue 3.

An integer is a one-drop number if its decimal digits d1,,dn satisfy

1d1d2di>di+1dn

for some i, with 1i<n.

For example, the numbers 13802 and 49557 are one-drop numbers, while the numbers 12345 and 352712 are not.

We answer the following question: for n2, how many n-digit one-drop numbers are there?


Solution -

There are

(n+1818)(n+99)n(n+88)

one-drop numbers of length n2.

The digits of every n-digit one-drop number d1,,dn can be uniquely split into two zero-drop blocks A and B, with digits 1d1di and 0di+1dn. Here, is a zero-drop block consists of a sequence of non-decreasing decimal digits; we allow the leading digit to be 0.

Consider all 19-tuples (a1,,a9,b0,b1,,b9) of non-negative integers such that

a1++a9+b0+b1++b9=n.

Each tuple represents an n-digit number obtained by concatenating two zero-drop blocks A and B, where ad and bd denote the number of occurrences of the digit d in the respective blocks. Clearly, there are (n+1818) such tuples. Note that each tuple of this kind represents either a zero-drop or a one-drop number, and each n-digit one-drop number is uniquely represented by such a tuple.

Discard all tuples where the block A is empty, with all ad=0; the corresponding number is zero-drop. There are as many such tuples as there are non-negative integer solutions of 

b0+b1++b9=n,

that is, (n+99). This leaves (n+1818)(n+99) tuples; call this set of tuples S. Note that for any element in S, the block A is non-empty. Thus, it represents a proper n-digit number since the leftmost digit is non-zero.

Suppose that a tuple (a1,,a9,b0,b1,,b9)S represents a zero-drop number. Since some ad>0, the digit 0 cannot appear in block B, hence the number consists only of digits 1 through 9. There are (n+88) such n-digit zero drop numbers, via counting the non-negative integer solutions of

c1++c9=n.

Each of these numbers has been represented by precisely n tuples from S, because the first i digits of such a number can be placed in block A with the remaining digits in block B for all 1in.

For example, the 5-digit zero-drop number 11234 can be broken into blocks A,B as 1,1234 or 11,234 or 112,34 or 1123,4 or 11234,, each of which correspond to distinct tuples from S.

Thus, precisely n(n+88) tuples from S represent zero-drop numbers. The remaining (n+1818)(n+99)n(n+88) tuples uniquely represent all n-digit one-drop numbers.

Hello World

Hey there!

I am Sayan Dutta, a student of mathematics interested in Number Theory and Combinatorics. I completed my BS-MS from IISER Kolkata and I will be joining a PhD program in 2025 in University of Montreal under Prof. Dimitris Koukoulopoulos. Before that, I completed my Masters' thesis under the supervision of Prof. Ramachandran Balasubramanian and Prof. Soumya Bhattacharya. My thesis was on Sidon sets and can be found in my website.

A far less boring description of me was given by my dear friend Sohom Gupta

Sayan Dutta is an undergraduate student of mathematics, interested in discrete mathematics and vibrant patterns hidden in the simplest of structures. His appreciation of beauty stems from his cinephile present, while his chess moves are just nuances of perfection. He aspires to paint the world in the language of mathematics and the blossoms of the silver screen.


Inspired by What's new, Joni's Math Notes and others, I decided to keep a blog of my own. As of now, I don't have any particular plans and will post anything I have in mind (only related to mathematics). We will see where this goes!

Introduction to Sumsets II

This is a sequel to a previous post . However, one can technically read it without reading the previous post. Given A,BZ for a c...