组合数学 (Fall 2011)/Extremal graph theory

From TCS Wiki
Jump to navigation Jump to search

Forbidden Cliques

Extremal grap theory studies the problems like "how many edges that a graph G can have, if G has some property?"

Mantel's theorem

We consider a typical extremal problem for graphs: the largest possible number of edges of triangle-free graphs, i.e. graphs contains no K3.

Theorem (Mantel 1907)
Suppose G(V,E) is graph on n vertice without triangles. Then |E|n24.

We give three different proofs of the theorem. The first one uses induction and an argument based on pigeonhole principle. The second proof uses the famous Cauchy-Schwarz inequality in analysis. And the third proof uses another famous inequality: the inequality of the arithmetic and geometric mean.

First proof. (pigeonhole principle)

We prove an equivalent theorem: Any G(V,E) with |V|=n and |E|>n24 must have a triangle.

Use induction on n. The theorem holds trivially for n3.

Induction hypothesis: assume the theorem hold for |V|n1.

For G with n vertices, without loss of generality, assume that |E|=n24+1, we will show that G must contain a triangle. Take a uvE, and let H be the subgraph of G induced by V{u,v}. Clearly, H has n2 vertices.

Case.1: If H has >(n2)24 edges, then by the induction hypothesis, H has a triangle.
Case.2: If H has (n2)24 edges, then at least (n24+1)(n2)241=n1 edges are between H and {u,v}. By pigeonhole principle, there must be a vertex in H that is adjacent to both u and v. Thus, G has a triangle.
Second proof. (Cauchy-Schwarz inequality)
(Mantel's original proof)

For any edge uvE, no vertex can be a neighbor of both u and v, or otherwise there will be a triangle. Thus, for any edge uvE, du+dvn. It follows that

uvE(du+dv)n|E|.

Note that d(v) appears exactly dv times in the sum, so that

uvE(du+dv)=vVdv2.

Applying Chauchy-Schwarz inequality,

n|E|uvE(du+dv)=vVdv2(vVdv)2n=4|E|2n,

where the last equation is due to Euler's equality vVdv=2|E|. The theorem follows.

Third proof. (inequality of the arithmetic and geometric mean)

Assume that G(V,E) has |V|=n vertices and is triangle-free.

Let A be the largest independent set in G and let α=|A|. Since G is triangle-free, for very vertex v, all its neighbors must form an independent set, thus d(v)α for all vV.

Take B=VA and let β=|B|. Since A is an independent set, all edges in E must have at least one endpoint in B. Counting the edges in E according to their endpoints in B, we obtain |E|vBdv. By the inequality of the arithmetic and geometric mean,

|E|vBdvαβ(α+β2)2=n24.

Turán's theorem

The famous Turán's theorem generalizes the Mantel's theorem for triangles to cliques of any specific size. This theorem is one of the most important results in extremal combinatorics, which initiates the studies of extremal graph theory.

Theorem (Turán 1941)
Let G(V,E) be a graph with |V|=n. If G has no r-clique, r2, then
|E|r22(r1)n2.

We give an example of graphs with many edges which does not contain Kr.

Partition V into r1 disjoint classes V=V1V2Vr1, ni=|Vi|, n1+n2++nr1=n. For every two vertice u,v, uvE if and only if uVi and vVj for distinct Vi and Vj. The resulting graph is a complete (r1)-partite graph, denoted Kn1,n2,,nr1. It is obvious that any (r1)-partite graph contains no r-clique since only those vertices from different classes can be adjacent.

A Kn1,n2,,nr1 has i<jninj edges, which is maximized when the numbers ni are divided as evenly as possible, that is, if ni{nr1,nr1} for every 1ir1.

Definition
We call a complete multipartite graph Kn1,n2,,nr1 with ni{nr1,nr1} for every i a Turán graph, denoted T(n,r1).
Example
Turán graph T(13,4)
Turán graph [math]\displaystyle{ T(13,4) }[/math]
Turán graph T(13,4)

Turán's theorem has been proved for many times by different mathematicians, with different tools. We show just a few.

The first proof uses induction; the second proof uses a technique called "weight shifting"; and the third proof uses the probabilistic method. All of them are very powerful and frequently used proof techniques.

First proof. (induction)
(Turán's original proof)

Induction on n. It is easy to verify that the theorem holds for n<r.

Let G be a graph on n vertices without r-cliques where nr. Suppose that G has a maximum number of edges among such graphs. G certainly has (r1)-cliques, since otherwise we could add edges to G. Let A be an (r1)-clique and let B=VA. Clearly |A|=r1 and |B|=nr+1.

By the induction hypothesis, since B has no r-cliques, |E(B)|r22(r1)(nr+1)2. And E(A)=(r12). Since G has no r-clique, every vB is adjacent to at most r2 vertices in A, since otherwise A and v would form an r-clique. We obtain that the number edges crossing between A and B is |E(A,B)|(r2)|B|=(r2)(nr+1). Combining everything together,

|E|=|E(A)|+|E(B)|+|E(A,B)|(r12)+r22(r1)(nr+1)2+(r2)(nr+1)=r22(r1)n2.
Second proof. (weight shifting)
(due to Motzkin and Straus)

Assign each vertex vV a nonnegative weight wv0, and assume that vVwv=1. We try to maximize the quantity

S=uvEwuwv.

Let Wu=v:vuwv be the sum of the weights of u's neighbors. Note that S can also be computed as S=12uVwuWu. For any nonadjacent pair of vertices uv, supposed that WuWv, then for any ϵ0,

(wu+ϵ)Wu+(wvϵ)WvwuWu+wvWv.

This means that we do not decrease S by shifting all of the weight of the vertex v to the vertex u. It follows that S is maximized when all of the weight is concentrated on a complete subgraph, i.e., a clique.

Now if wu>wv>0, then choose ϵ with 0<ϵ<wuwv and change wu=wuϵ and wv=wv+ϵ. This changes S to S=S+ϵ(wuwv)ϵ2>S. Thus, the maximal value of S is attained when all nonzero weights are equal and concentrated on a clique.

G has at most an (r1)-clique, thus S(r12)1(r1)2=r22(r1).

As we argued above, this inequality hold for any nonnegative weight assignments with vVwv=1. In particular, for the case that all wv=1n,

S=uvEwuwv=|E|n2.

Thus,

|E|n2r22(r1),

which implies the theorem.

Third proof. (the probabilistic method)
(due to Alon and Spencer)

Write ω(G) for the number of vertices in a largest clique, called the clique number of G.

Claim: ω(G)vV1ndv.

We prove this by the probabilistic method. Fix a random ordering of vertices in V, say v1,v2,,vn. We construct a clique as follows:

  • for i=1,2,,n, add vi to S iff all vertices in current S are adjacent to vi.

It is obvious that an S constructed in this way is a clique. We now show that E[|S|]=vV1ndv.

Let Xv be the random variable that indicates whether vS, i.e.,

Xv={1vS,0otherwise.

Note that a vertex vS if and only if v is ranked before all its ndv1 non-neighbors in the random ordering. The probability that this event occurs is 1ndv. Thus,

E[Xv]=Pr[vS]=1ndv.

Observe that |S|=vVXv. Due to linearity of expectation,

E[|S|]=vVE[Xv]=vV1ndv.

There must exists a clique of at least such size, so that ω(G)vV1ndv. The claim is proved.

Apply the Cauchy-Schwarz inequality

(vVavbv)2(vVnav2)(vVnbv2).

Set av=ndv and bv=1ndv, then avbv=1 and so

n2vV(ndv)vV1ndvω(G)vV(ndv).

By the assumption of Turán's theorem, ω(G)r1. Recall the handshaking lemma 2|E|=vVdv. The above inequality gives us

n2(r1)(n22|E|),

which implies the theorem.

Our last proof uses the idea of vertex duplication. It does not only prove the edge bound of Turán's theorem, but also shows that Turán graphs are the only possible extremal graphs.

Fourth proof.

Let G(V,E) be a r-clique-free graph on n vertices with a maximum number of edges.

Claim: G does not contain three vertices u,v,w such that uvE but uwE,vwE.

Suppose otherwise. There are two cases.

  • Case.1: d(w)<d(u) or d(w)<d(v). Without loss of generality, suppose that d(w)<d(u). We duplicate u by creating a new vertex u which has exactly the same neighbors as u (but uu is not an edge). Such duplication will not increase the clique size. We then remove w. The resulting graph G is still r-clique-free, and has n vertices. The number of edges in G is
|E(G)|=|E(G)|+d(u)d(w)>|E(G)|,
which contradicts the assumption that |E(G)| is maximal.
  • Case.2: d(w)d(u) and d(w)d(v). Duplicate w twice and delete u and v. The new graph G has no r-clique, and the number of edges is
|E(G)|=|E(G)|+2d(w)(d(u)+d(v)+1)>|E(G)|.
Contradiction again.

The claim implies that uvE defines an equivalence relation on vertices (to be more precise, it guarantees the transitivity of the relation, while the reflexivity and symmetry hold directly). Graph G must be a complete multipartite graph Kn1,n2,,nr1 with n1+n2++nr1=n. Optimize the edge number, we have the Turán graph.

Forbidden Cycles

Another direction to generalize Mantel's theorem other than Turán's theorem is to see a triangle as a 3-cycle rather than 3-clique. We then ask for the extremal bound for graphs without certain cycle structures.

Girth

Recall that the girth of a graph G is the length of the shortest cycle in G. A graph is triangle-free if and only if its girth g(G)4. Matel's theorem can be seen as a bound on the edge number of graphs with girth g(G)4. The next theorem extends this bound to the graphs with g(G)5, i.e., graphs without triangles and quadrilaterals ("squares").

Theorem
Let G(V,E) be a graph on n vertices. If girth g(G)5 then |E|12nn1.
Proof.

Suppose g(G)5. Let v1,v2,,vd be the neighbors of a vertex u, where d=d(u). Let Si={vVvvivu} be the set of neighbors of vi other than u.

  • For any vi,vj, vivjE since G has no triangle. Thus, Si{u,v1,v2,,vd}= for every i.
  • No vertex other than u can be adjacent to more than one vertices in v1,v2,,vd since there is no C4 in G. Thus, SiSj= for any distinct i and j.

Therefore, {u,v1,v2,,vd}S1S2SdV implies that

(d+1)+|S1|+|S2|++|Sd|=(d+1)+(d(v1)1)+(d(v2)1)++(d(vd)1)n,

so that v:vud(v)n1.

By Cauchy-Schwarz inequality,

n(n1)uVv:vud(v)=vVd(v)2(vVd(v))n=4|E|2n,

which implies that |E|12nn1.

Hamiltonian cycle

We now look at graphs which does not have large cycles. In particular, we consider graphs without Hamiltonian cycles.

For a Hamiltonian graph, every vertex must has degree 2. And the graph satisfying this condition with maximum number of edges is the graph composed by a (n1)-clique and the one remaining vertex is connected to the clique by one edge. This graph has (n12)+1 edges, and has no Hamiltonian cycle. It is not very hard to realize that this is the largest possible number of edges that a non-Hamiltonian graph can have.

Since it is not very interesting to bound the number of edges of non-Hamiltonian graphs, we consider a more informative graph invariant, its degree sequence.

Dirac's Theorem
A graph G(V,E) on n vertices has a Hamiltonian cycle if dvn2 for all vV.
Proof.

Suppose to the contrary, the theorem is not true and there exists a non-Hamiltonian graph with dvn2 for all vV. Let G be such a graph with a maximum number of edges. Then adding any edge to G creates a Hamiltonian cycle. Thus, G must have a Hamiltonian path, say v1v2vn.

Consider the sets,

  • S={ivivnE};
  • T={ivi+1v1E}.

Therefore, S{v1,v2,,vn1} contains the neighbors of vn; and T{v1,v2,,vn1} contains the predecessors (along the Hamiltonian path) of the neighbors of v1. It holds that S,T{v1,v2,,vn1}.

Since dvn2 for all vV, |S|,|T|n2. By the pigeonhole principle, there exists some viST. We can construct the following Hamiltonian cycle:

v1vi+1vi+2vnvivi1v1,

which contradict to the assumption that G is non-Hamiltonian.

Erdős–Stone theorem

We introduce a notation for the number of edges in extremal graphs with a specific forbidden substructure.

Definition
Let ex(n,H) denote the largest number of edges that a graph GH on n vertices can have.

With this notation, Turán's theorem can be restated as

Turán's theorem (restated)
ex(n,Kr)r22(r1)n2.

Let Ksr=Ks,s,,sr be the complete r-partite graph with s vertices in each class, i.e., the Turán graph T(rs,r). The Erdős–Stone theorem (also referred as the fundamental theorem of extremal graph theory) gives an asymptotic bound on ex(n,Ksr), i.e., the largest number of edges that an n-vertex graph can have to not contain Ksr.

Fundamental theorem of extremal graph theory (Erdős–Stone 1946)
For any integers r2 and s1, and any ϵ>0, if n is sufficiently large then every graph on n vertices and with at least (r22(r1)+ϵ)n2 edges contains Kr,s as a subgraph, i.e.,
ex(n,Ksr)=(r22(r1)+o(1))n2.

The theorem is called fundamental because of its single most important corollary: it relate the extremal bound for an arbitrary subgraph H to a very natural parameter of H, its chromatic number.

Recall that χ(G) is the chromatic number of G, the smallest number of colors that one can use to color the vertices so that no adjacent vertices have the same color.

Corollary
For every nonempty graph H,
limnex(n,H)(n2)=χ(H)2χ(H)1.
Proof of corollary

Let r=χ(H).

Note that T(n,r1) can be colored with r1 colors, one color for each part. Thus, HT(n,r1), since otherwise H can also be colored with r1 colors, contradicting that χ(H)=1. By definition, ex(n,H) is the maximum number of edges that an n-vertex graph GH can have. Thus,

|T(n,r1)|ex(n,H).

It is not hard to see that

|T(n,r1)|(r12)nr12(r12)(nr11)2=(r22(r1)o(1))n2.

On the other hand, any finite graph H with chromatic number r has that HKsr for all sufficiently large s. We just connect all pairs of vertices from different color classes. Thus,

ex(n,H)ex(n,Ksr).

Due to Erdős–Stone theorem,

ex(n,Ksr)=(r22(r1)+o(1))n2.

Altogether, we have

r2r1o(1)|T(n,r1)|(n2)ex(n,H)(n2)ex(n,Ksr)(n2)=r2r1+o(1)

The theorem follows.

References

(声明: 资料受版权保护, 仅用于教学.)
(Disclaimer: The following copyrighted materials are meant for educational uses only.)
  • van Lin and Wilson. A course in combinatorics. Cambridge Press. Chapter 4.
  • Aigner and Ziegler. Proofs from THE BOOK, 4th Edition. Springer-Verlag. Chapter 36.
  • Diestel. Graph Theory, 3rd Edition. Springer-Verlag 2000. Chapter 7.