Fubini's theorem in Euclidean space (Understanding 'almost everywhere')

General Idea

In elementary calculus, integrals of continuous functions of several variables are often calculated by iterating one-dimensional integrals. But the properties of measurability give rise to a lot of issues for Lebesgue integration on Rd. What we are looking for is the equation

Rm(Rnf(x,y)dy)dx=Rn(Rmf(x,y)dx)dy=Rdfdm

where d=m+n and m,n are positive integers. If this equation holds for f, the integration would be relatively easy, as the iteration can be taken in any order. In fact, this equation can be generalized to some other abstract measure space, but that’s beyond what this post could cover.

Notations

For d=m+n, we write

Rd=Rm×Rn

A point in Rd therefore takes the form (x,y), where xRm and yRn. If f is defined on Rd, the slice of f is respectively

fy(x)=f(x,y)yRnfx(y)=f(x,y)xRm

For ERm×Rn, we defines its slices by

Ey={xRm:(x,y)E}Ex={yRn:(x,y)E}

But why ‘almost everywhere’?

Unfortunately, even if we assume that f is measurable on Rd, it can be shown that fy is not necessarily measurable for each y. It’s easy to construct a non-measurable set on R (x-axis), namely A. Then A has Lebesgue measure 0 in R×R. But Ey is not measurable for y=0. Nevertheless, the consideration of ‘almost everywhere’ is able to save us from this.

Fubini’s Theorem

Suppose f(x,y) is integrable on Rm×Rn. Then for almost every yRn, we have

  1. fy is integrable on Rm.

  2. The function defined by Rmfy(x)dx is integrable on Rn.

  3. This equation holds (m denotes the Lebesgue measure on Rd):

    Rm(Rnf(x,y)dx)dy=Rn(Rmf(x,y)dy)dx=Rdfdm

    The symmetric conclusion can be obtained for x.

General and more rigorous version

The general version of Fubini’s theorem is developed in abstract product space, which will not be proved here. But it’s worth a peek. Of course, feel free to jump to the next section if you are not interested.

Let (X,S,μ) and (Y,T,λ) be σ-finite measure spaces, and let f be an (S×T)-measurable function defined on X×Y.

  1. If f is an nonnegative real function, and if

    φ(x)=Yfxdλψ(y)=Xfydμ(xX,yY)

    then φ is S-measurable, and ψ is T-measurable, and

    Xdμ(x)Yf(x,y)dλ(y)=Ydλ(y)Xf(x,y)dμ(x)
  2. If f is complex and if

    φ(x)=Y|f|xdλandXφdμ<

    then fL1(μ×λ).

  3. If fL1(μ×λ), then fxL1(λ) for almost all xX, fyL1(μ) for almost all yY. The function therefore defined in 1 a.e. are in L1(μ) and L1(λ) respectively, and the equation holds.

Clearly, if we replace X, Y with Rm and Rn, S and T with the respective Lebesgue σ-algebra, λ and μ with Lebesgue measure, then we obtained the Euclidean version. Notice that f is integrable means that X|f|dμ<.

Before the proof

The proof is relatively long. Instead of proving that f as an integrable function satisfies the three conclusions, we shall show that, however, the family of functions satisfy the three conclusions (say, F) contains all integrable functions. If you check the general version of Fubini’s theorem, you see that integrability was explicitly discussed.

First, we shall show that F is not empty. This is important because we might have been discussing something that never exists. Second. Considering the fact that any integrable function can be “approximated” by simple functions, where simple functions can be generated linear combination, it encourage us to discuss limits and linear combinations in F. Finally, we shall show that if f is integrable, then fF. The power of almost-everywhere will show up along the proof.

Complete proof of Fubini’s Theorem (With explanation)

Step 1 - F is not empty

It’s somewhat absurd to discuss the property of F without proving that it’s not empty. But that can be done easily.

Suppose E is a bounded open cube in Rd such that E=Q1×Q2, where Q1 and Q2 are open cubes in Rm and Rn. Then χEF.

For each y, χE(x,y) is measurable. And the integrability of χE(x,y) follows with

g(y)=RmχE(x,y)dx={vol(Q1)yQ20yQ2

It shows that g(y)=vol(Q1)χQ2, which is measurable and integrable as well. Further,

Rng(y)dy=vol(Q1)vol(Q2)

Since we initially have RdχEdm=vol(E)=vol(Q1)vol(Q2), we see that χE satisfies these three properties, hence χEF.

Step 2 - F is closed under finite linear combination

We have only judged open cubes in Rd, which are far from Lebesgue σ-algebra. To get there, we may have to check some Gδ sets, but we can’t do that since we have no idea about limits in F. We are also looking for some simple functions, which are linear combinations of character functions.

Any finite linear combination of functions in F also belongs to F.

Since there are arbitrarily many bounded open cubes in Rd, we are able to find arbitrarily many members in F. Say,

f1,f2,,fnF

Following the definition of F, for each 1kn, we are able to find a set AkRn such that Ak has measure 0 and whenever yAk, fky is integrable on Rm. If we collect these sets altogether, namely A=Ak, we see that in RnA, all fk’s has the desired property, so does their arbitrary finite linear combination (due to the linear property of Lebesgue integral). Since A has measure zero as well, it turns out that the finite linear combinations belong to F.

Step 3 - Monotone convergence in F

Limits and convergence come into play. One may think about something like complete metric space, where Cauchy sequences converges. In this step we show that the monotone limit does exist in F.

Suppose fk is a sequence of measurable functions in F so that fkfk+1 or fkfk+1 holds for all k, and fkf where f is integrable on Rd, then fF.

Without loss of generality, it suffices to assume that

0f1f2fn

Since for other situations, we can take some fk or fkf1 or something like that. An application of monotone convergence theorem yields that

limkRdfkdm=Rdfdm

Also, we can find some sets with measure 0, namely Ak, carrying the same meaning as is in Step 2. For A=k=1Ak, we also have m(A)=0 in Rn. Also, for yRnA, fky is integrable on Rm for all k. Thus by monotone convergence theorem, we see that

gk(y)=Rmfkydxg(y)=Rmfydx(k)

Clearly we have gkgk+1 for all k, and by assumption, gk is integrable. Use monotone convergence theorem again, we see that

RngkdyRngdy(k)

Combining these two limits, we see

Rngdy=Rdfdm

We’ll show that fF by checking its properties one by one.

  1. Since f is integrable, we see that Rng=Rdf<. Thus g is integrable.

  2. Since g is integrable, we have g(y)< a.e. for y, consequently fy is integrable a.e. for y.

  3. By the definition of g, we have

    Rn(Rmf(x,y)dx)dy=Rdfdm

Thus fF as proved.

Step 4 - Characteristic functions of measurable sets

4.1 - Final destination

We are pretty close to simple functions now. To get rid of infinity, we are going to prove this:

If E is any measurable subset in Rd with m(E)<, then χEF.

Once it’s done, we can construct simple functions, which approximate to any integrable functions, with ease. Fortunately, with the help of the property of Lebesgue measurable sets, we are able to break “measurable subsets” into several pieces. Recall the fact that

ERd is Lebesgue measurable if and only if there are sets A and BRd such that AEB, A is a Fσ and B is a Gδ, and m(BA)=0.

Since BEBA, we also have m(BE)=0. Also, since E(BE)=B, E(BE)=, we have

χB=χE+χBE

which is equivalent toχE=χBχBE. Notice that the right hand of this equation is a finite combination of functions (Step 2 comes into play). If we prove that χB,χBEF, then we are done.

We are going to prove that if E is a Gδ set, or E has measure 0, then χEF. That is, we are going to generalize all Lebesgue measurable sets by proving these two key situations.

4.2 - Finite measure Gδ sets

In Step 1 we proved χEF if E is a bounded open cube. Now we are going to generalize this to Gδ, which is a countable intersection of open sets. Also, since every open sets can be a countable union of closed cubes (Rd is a locally compact Hausdorff space in which every open set is σ-compact). You will see how Step 2 and Step 3 play a role in this section.

4.2.1 - Characteristic function of closed cubes

If Q a closed cube in Rd, then χQF.

Since Q=int(Q)Q, where int(Q) denotes its interior and Q denotes its boundary, we have

χQ=χint(Q)+χQ

As proved in Step 1, int(Q)F. So we have to prove that χQF, and the conclusion follows from Step 2.

Since m(Q)=0, we have RdχQdm=0. Also, it can be seen that for almost every y, we have Qy has measure 0 in Rm, and therefore g(y)=RmχQdx=0 a.e. for y. Consequently, Rngdy=0, therefore χQF.

4.2.2 - Finitely many almost disjoint closed cubes

Suppose E=k=1KQk, where Qk is closed cube, and int(Qi)int(Qj)= for ij, then χEF.

This conclusion is obvious if one notice that

χE=k=1KχQk.

In 4.2.1 we showed that χQkF. Hence χEF according to Step 2.

4.2.3 - Arbitrary open sets with finite measure

Since every open sets in Rd can be a countable union of almost disjoint cubes, we have

E=k=1Qk

If we take EK=k=1KQk, we have fK=χEK=k=1KχQk. And we are going to follow Step 3 to show that χEF if m(E)<.

Since the Lebesgue σ-measure contains all Borel sets, and E is open, we see that E is measurable. If m(E)<, then we see that χE is integrable. Also we have fK+1fK for all K, and fKχE; hence fK is what we described in Step 3. Say, χEF.

4.2.4 - Arbitrary Gδ sets

If E is a Gδ set of finite measure, then χEF.

By the definition of Gδ sets, we have

E=k=1Rk

where Rk are open sets. Since m(E)<, m is regular, we have a open set S0E such that m(S0)<. Let

Sk=S0(j=1kRj)

Then we have

E=k=1Sk

For Sk’s, observe that S0S1, we have fk=χSk decreases to the limit f=χE. Following Step 3, we see that χEF.

4.3 - Sets with measure 0

If m(E)=0, then χEF

If E is a Gδ set, then we are done by following 4.2. If not, it comes to the issue of m’s being a complete measure.

Again, by the regularity of m, we may choose a set G of Gδ such that EG and that m(G)=0. As proved, χGF. Therefore

RmχGdx=0for a.e. y

Thus, the slice Gy has measure 0 a.e. for y, since EyGy, we have Ey has measure 0 a.e. for y. Therefore the fact that χEF can be verified by simple calculation.

Step 5 - All integrable functions

If f is integrable, then fF.

Like the construction of Lebesgue integral, f has the decomposition that f=f+f. Thus it suffice to prove this for nonnegative f (by Step 1).

There exists a sequence of integrable and nonnegative simple functions sk that monotonically converges to f. Since each integrable sk is a finite combination of sets with finite measure, by Step 2 and 4, skF. By Step 3, clearly we have fF.

Comments

Fubini’s theorem shows us that we might be able to evaluate multidimensional integrals in the sense of measure theory with ease (at least ‘almost everywhere’). However there are some counterexamples showing that Fubini’s theorem will fall, which will be discussed later.

This proof is a good example of how to play with the elements of Lebesgue integral. Let’s take a rewind. We want to obtain all integrable functions in F, which however can’t be done directly. So we are looking for simple functions, which are generated by characteristic functions. And luckily we obtained a wide enough range of characteristic functions. With linear combinations and limits, we finally achieved the goal to describe all integrable functions. The properties of ‘almost everywhere’ played a critical role.

Comments