Search

A Dense Infinite Sidon Sequence

A set of positive integers AN is called a Sidon Set or a Sidon Sequence if the equation a+b=c+d does not have any non-trivial solutions in A. They were named after Hungarian mathematician Simon Sidon who was inspired by certain problems in Fourier series to ask Erdős about the possible growth of such sequences.

Since then, there has been an extensive amount of work on this topic exploring a plethora of different questions about finite and infinite Sidon sets, and various generalizations of them. In this post, we are mainly concerned about a breakthrough theorem in the field of infinite Sidon sets.

First consider the following example.

Example : (Greedy Algorithm) If S[1,n] is already a Sidon set, we can find x such that x does not equal a, b, c or a+bc for any a,b,cS and S{x}N is Sidon. Since there are at most n choices for each of a, b and c and a,b,cn, choosing x>n3 is enough.

It is clear that this sequence satisfies S(x)x1/3 where S(x):=|S[1,x]|. It was conjectured by Erdős that there is a Sidon sequence S with S(x)x1/2ε for all ε>0 (recall that f(x)g(x) means that there exists a constant C such that f(x)Cg(x) for all large x and f(x)g(x) is defined similarly). He also proved that any arbitrary Sidon sequence will have the close upper bound of

S(x)x1/2(logx)1/2

for infinitely many x. This exhibits that finite segments of an infinite Sidon set must sometimes be much smaller than it is expected in finite Sidon sets.

The lower bound was first improved by Ajtai, Komlós and Szemerédi who proved the existence of S satisfying

S(x)x1/3(logx)1/3

and then finally Ruzsa gave a breakthrough by finding another S satisfying

S(x)x21o(1)

which has remained the best result in this area since 1998. It is this proof that I will present in this post.

For those asking why we care, it is known that the number of elements in the biggest (usually called dense in literature) Sidon subset of the interval [1,x] is of the order x (see here) and what Erdős' conjecture claims is that N is quite similar. In other words, we can find Sidon subsets of N with (logarithmic) density arbitrarily close to x (although x is not achievable, and this was proven by Erdős).


Theorem : There exists an infinite Sidon sequence B such that

B(N)=Nγ+o(1)

where γ=21.


Idea of the Proof : Notice that the set of primes P form a multiplicative Sidon set, that is p1p2=p3p4 for piP implies {p1,p2}={p3,p4}. So, {lnp}pP forms a Sidon set in R.

This means that if we can cut the binary representation of lnp in a proper way, we still almost surely retain the Sidon property. In fact, all the observations we made so far are true for {αlnp}pP for all α. Indeed we will start with a bunch of α so that we can later argue probabilistically. 


Proof Setup : The first step is to look at the number αlnp in binary. Choose α[1,2] and write

αlnp=i=0kpεip2i+i=1δip2i

where kp=log(αlnp). So, the ε-string is the integral part of this number and the δ-string is the part after the decimal point (or is it the binary point?). Here, and in the rest of the proof, log will represent log2 and ln will be a placeholder for logarithm to any pre-specified base - the reader can think of it as loge for convinience.

Now, we set β=2+1 and γ=1/β. Then, we define

Kp=min{i>2:2(i1)2>pβ}

so that Kp=2+βlogp. A more natural way to present this proof could have been to start with an arbitrary β and realize at the end of the proof that this is the value of β that optimizes the density.

Now, we need to do the proper cutting. So, we define

λp=i=0kpεip2i+i=1Kp2δip2i

by cutting the binary expansion of αlnp at the Kp2-th place.

Now, we use the square indices to construct

Δip=j=(i1)2+1i2δj2i2j

by cutting the bigger block into smaller pieces.

In binary, the numbers look like

Δ1,p=δ1Δ2,p=δ2δ3δ4Δ3,p=δ5δ6δ7δ8δ9

and so on. From now on, we will often supress p or α from our notation whenever it doesn't interfere with the clarity of the proof.

Finally, we define bp as

010ΔKΔ30ε2000Δ20ε1000Δ10ε0000

which was done in the following way : first write the Δi's in a piece of paper and put five blanks in between each of them, and after Δ1. Fill up the second blank to the right of Δi with εi1. The 4th blank always contains a 0 except the one to the left of ΔK - this is done to keep track of the size of bp. The rest of the blanks are all 0

More formally,

bp=i=1KΔip2(i1)2+5i+i=0kεip2i2+5i+2+tp

where tp=2K2+5K+4.

Further, we put Δip=0 for i>K and εip=0 for i>k. Our Sidon set will be a subset of the set {bp}pP.


Plan of the Proof : Recall how we introduced a padding of 0's in the definition of bp. This was done to ensure that there are no carry overs when we add bp1+bp2. This makes sure that

bp+bs=bq+brtp+ts=tq+tr,εip+εis=εiq+εir,Δip+Δis=Δiq+Δir

for all i. Now, consider

(*)bp+bs=bq+br,bp>bqbr>bs

for some p,q,r,s.

Define PK={p:Kp=K}. These sets provide a proper partitioning of the primes with respect to their sizes, and hence have the following important property. If () holds, then there are integers KL such that

(1)λp+λs=λq+λr

and p,qPK, r,sPL.

The artistic part is now over and what's left of our work is the technical part. This is the part where we try to filter out the largest Sidon subset of {bp}pP. The techniques are all mainly aimed towards estimating the number of solutions of () in the set {bp}pP.


Bounds : Notice that

|λpαlnp|2K2<pβ

follows from definitions. Also, if () holds, then

|psqr|<8qr2L2,qr>2L22,(K1)2>(β1)(L1)2+β(2L3)

where K and L are as in (1).

Now, we define

JKL=|{p,q,r,s:pqPK,rsPL,|psqr|<8qr2L2}|

and try to estimate it.


Lemma : We have

JKL22γ((K1)2+(L1)2)L2

where JKL is defined as above.

Proof : The Diophantine equation psqr=a has the general solution

p=p0+tr,q=q0+ts

where p0, q0 are the smallest solution.

As p<2γ(K1)2, we have t<2γ(K1)2r. So, for fixed a,r,s, the number of solutions is O(2γ(K1)2r).

Again, |a|r2γ(K1)2L2. So, for fixed r,s, the number of solutions is O(22γ(K1)2L2) and r,s<2γ(L1)2.


Lemma : We have

2K2αlnp2K2αlnq(mod2K2L2)

assuming () holds.

Proof : It is clear that

2K2αlnpi=L2+1K2δip2K2i(mod2K2L2)

from definition. Also for L<iK, we have Δip=Δiq so that δjp=δjq for L2<jK2. So, RHS remains unchanged if we replace p by q.


Lemma : Let K>L and pqPK be given. Let there be at least one pair r,sPL and an α[1,2] such that () holds. Then, we have

μ(α:2K2αlnp2K2αlnq(mod2K2L2))2L2K2

where μ is the Lebesgue measure.

Proof : Call A=2K2lnpq, M=2K2L2 and notice that the property we are after is

Aα0 or 1(modM)

and since Aα is evenly distributed modulo M, we have

μ(α:Aα0 or 1(modM))1M

hence completing the proof.


The proofs of the next two lemmas are straightforward and hence we will refrain from exhibiting them.

Lemma : For pqPK and r,sPL, we have

μ(α:bp+bs=bq+br){=0if|psqr|8qr2L22L2K2if|psqr|<8qr2L2

for K>L.

Lemma : If

GKL(α):=|{p,q,r,s:pqPK,r,sPL,() holds }|

then

12GKL(α)dα22γ((L1)2+(K1)2)K2

for K>L.


Lemma : Define

GK(α)=|{p,q,r,s:p,qPK,() holds }|

so that

12GK(α)dα2γ(K1)22K

for all K.

Proof : By definition, we have GK(α)=LKGKL(α). And since GKL(α)0 only if (L1)2<(K1)2/(β1), we obtain an estimate of 2Z with

Z=(2β11)(K1)22K+1

on which we use the fact that 2β11=1β.

In fact, the value of β was chosen in the first place to satisfy 2β11=1β.


Proof of Main Theorem : With all the setup in hand, the rest of the proof is essentially an exercise in probability theory. Define the event

EK:=[GKNK]

where NK:=2γ(K1)2K. This gives

P(EK)=P(GKNK)E(GK)NK(using Markov Inequality)=1NK12GK(α)dα2K

and hence

KP(EK)<P(lim infKEKC)=1

by Borel-Cantelli lemma.

This guarantees the existence of an α such that

GK(α)<2γ(K1)2K

for all (except at most finitely many) K. Fix this α.

Using Prime Number Theorem, the cardinality of PK is

|PK|=π(2γ(K1)2)π(2γ(K2)2)2γ(K1)2γK2ln2

and hence GK(α)<|PK|/2 for K>K0.

Now, for each p,q,r,s satisfying () with pPK, we omit p and define QK to be the set of remaining elements. Then we define

B=K>K0QK

which, by definition, is a Sidon set.

Now, we have

bp<2K2+5K+5<2(K+3)2

implying that for K=logx3, maxQK<x. This gives

B(x)(12ε)π(2γ(K1)2)=xγ+o(1)

hence completing the proof. The reader can check that a similar calculation yields an upper bound.


Although any improvements on the density is still beyond our reach, some analogues of this idea have been suggested and they have been able to yield similar results in more general settings. They have some excellent potential of their own, but that is a topic for a different post!


Acknowledgement : I want to express my heartfelt thanks to my supervisor, Prof. Ramachandran Balasubramanian, who introduced me to the beautiful topic of Sidon sets. His straightforward approach to understanding mathematics has greatly influenced my own mathematical intuition and helped me greatly in understanding this paper.

No comments:

Post a Comment

Worst Way to Calculate Area of a Triangle

A useful technique from Complex Analysis helps us to identify whether a real number u is positive or negative by checking a discontinuous ...