General Idea
In elementary calculus, integrals of continuous functions of several variables are often calculated by iterating one-dimensional integrals. For Lebesgue integration on $\mathbb{R}^d$, however, measurability raises several issues that must be settled first. What we are looking for is the equation
$$
\int_{\mathbb{R}^d}f=\int_{\mathbb{R}^n}\left(\int_{\mathbb{R}^m}f(x,y)\,dx\right)dy=\int_{\mathbb{R}^m}\left(\int_{\mathbb{R}^n}f(x,y)\,dy\right)dx,
$$
where $d=m+n$ and $m,n$ are positive integers. If this equation holds for $f$, integration becomes relatively easy, since the iteration can be taken in either order. In fact, this equation can be generalized to abstract measure spaces, but that is beyond what this post can cover.
Notations
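For $d=m+n$, we write
$$
\mathbb{R}^d=\mathbb{R}^m\times\mathbb{R}^n.
$$
A point in $\mathbb{R}^d$ therefore takes the form $(x,y)$, where $x\in\mathbb{R}^m$ and $y\in\mathbb{R}^n$. If $f$ is defined on $\mathbb{R}^d$, its slices are the functions
$$
f^y(x)=f(x,y)\quad(y\text{ fixed}),\qquad f_x(y)=f(x,y)\quad(x\text{ fixed}),
$$
defined on $\mathbb{R}^m$ and $\mathbb{R}^n$ respectively.
For $E \subset \mathbb{R}^m\times\mathbb{R}^n$, we define its slices by
$$
E^y=\{x\in\mathbb{R}^m:(x,y)\in E\},\qquad E_x=\{y\in\mathbb{R}^n:(x,y)\in E\}.
$$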
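As a concrete illustration (the set below is chosen only for this example and is not part of the original argument), take $m=n=1$ and let $E$ be the open unit disc. Assuming the usual convention $E^y=\{x:(x,y)\in E\}$, the slices work out as follows:

$$
E=\{(x,y)\in\mathbb{R}^2:x^2+y^2<1\},\qquad
E^y=\begin{cases}\left(-\sqrt{1-y^2},\ \sqrt{1-y^2}\right) & \text{if } |y|<1,\\ \varnothing & \text{if } |y|\geq 1.\end{cases}
$$

Correspondingly, the slice of $f=\chi_E$ is $f^y=\chi_{E^y}$, so slicing a characteristic function amounts to slicing the set.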
But why ‘almost everywhere’?
Unfortunately, even if we assume that $f$ is measurable on $\mathbb{R}^d$, $f^y$ is not necessarily measurable for each $y$. For example, take a non-measurable set $A \subset \mathbb{R}$ (a Vitali set, say) and put $E=A\times\{0\}\subset\mathbb{R}\times\mathbb{R}$. Since $E$ lies on the x-axis, it has Lebesgue measure $0$ in $\mathbb{R} \times \mathbb{R}$ and is therefore measurable. But the slice $E^y$ equals $A$ for $y=0$, which is not measurable; equivalently, $f=\chi_E$ is measurable while $f^0=\chi_A$ is not. Nevertheless, working ‘almost everywhere’ saves us from this.
Fubini’s Theorem
Suppose $f(x,y)$ is integrable on $\mathbb{R}^m \times \mathbb{R}^n$. Then for almost every $y \in \mathbb{R}^n$, we have
$f^y$ is integrable on $\mathbb{R}^m$.
The function defined by $\int_{\mathbb{R}^m}f^y(x)dx$ is integrable on $\mathbb{R}^n$.
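Moreover, the following equation holds ($m$ denotes the Lebesgue measure on $\mathbb{R}^d$, not to be confused with the dimension $m$):
$$
\int_{\mathbb{R}^n}\left(\int_{\mathbb{R}^m}f(x,y)dx\right)dy=\int_{\mathbb{R}^d}f\,dm.
$$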
The symmetric conclusion can be obtained for $x$.
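To see the three conclusions in action, here is a small worked example. The function is chosen for illustration and does not appear in the original post. Take $m=n=1$ and $f(x,y)=e^{-x^2-y^2}$, which is integrable on $\mathbb{R}^2$ (its integral is $\pi$ by polar coordinates):

$$
\begin{aligned}
f^y(x)&=e^{-y^2}e^{-x^2}, &&\text{integrable on }\mathbb{R}\text{ for every }y,\\
\int_{\mathbb{R}}f^y(x)dx&=\sqrt{\pi}\,e^{-y^2}, &&\text{integrable on }\mathbb{R},\\
\int_{\mathbb{R}}\left(\int_{\mathbb{R}}f(x,y)dx\right)dy&=\sqrt{\pi}\cdot\sqrt{\pi}=\pi=\int_{\mathbb{R}^2}f\,dm.
\end{aligned}
$$

In this example no exceptional set of $y$ is needed; the ‘almost every’ in the theorem matters only for less regular $f$.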
General and more rigorous version
The general version of Fubini’s theorem is stated for abstract product spaces; it will not be proved here, but it’s worth a peek. Of course, feel free to jump to the next section if you are not interested.
Let $(X,\mathscr{S},\mu)$ and $(Y,\mathscr{T},\lambda)$ be $\sigma$-finite measure spaces, and let $f$ be an $(\mathscr{S} \times \mathscr{T})$-measurable function defined on $X \times Y$.
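If $f$ is a nonnegative real function, and if
$$
\varphi(x)=\int_Y f_x\,d\lambda,\qquad \psi(y)=\int_X f^y\,d\mu\qquad(x\in X,\ y\in Y),\tag{1}
$$
then $\varphi$ is $\mathscr{S}$-measurable, and $\psi$ is $\mathscr{T}$-measurable, and
$$
\int_X\varphi\,d\mu=\int_{X\times Y}f\,d(\mu\times\lambda)=\int_Y\psi\,d\lambda.\tag{2}
$$
If $f$ is complex and if
$$
\varphi^{\ast}(x)=\int_Y|f|_x\,d\lambda\quad\text{and}\quad\int_X\varphi^{\ast}\,d\mu<\infty,
$$
then $f \in L^1(\mu\times\lambda)$.
If $f \in L^1(\mu \times \lambda)$, then $f_x \in L^1(\lambda)$ for almost all $x \in X$ and $f^y \in L^1(\mu)$ for almost all $y \in Y$. The functions $\varphi$ and $\psi$, defined by (1) almost everywhere, are in $L^1(\mu)$ and $L^1(\lambda)$ respectively, and equation (2) holds.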
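If we replace $X$, $Y$ with $\mathbb{R}^m$ and $\mathbb{R}^n$, $\mathscr{S}$ and $\mathscr{T}$ with the respective Lebesgue $\sigma$-algebras, and $\mu$ and $\lambda$ with Lebesgue measure, then we obtain the Euclidean version, up to one technicality: Lebesgue measure on $\mathbb{R}^{m+n}$ is the completion of the product measure $\mu\times\lambda$, not the product measure itself. Recall that $f$ being integrable means that $\int_X|f|d\mu < \infty$.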
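Another specialization may help build intuition. It is an illustration added here under the assumption that $X=Y=\mathbb{N}$, both carrying the counting measure (which is $\sigma$-finite, and for which every function is measurable). A nonnegative function on $X\times Y$ is then just a double sequence $a_{ij}\geq 0$, and the conclusion for nonnegative $f$ reads

$$
\sum_{i=1}^{\infty}\left(\sum_{j=1}^{\infty}a_{ij}\right)=\sum_{(i,j)\in\mathbb{N}\times\mathbb{N}}a_{ij}=\sum_{j=1}^{\infty}\left(\sum_{i=1}^{\infty}a_{ij}\right).
$$

So the familiar fact that a double series of nonnegative terms can be summed in either order is a special case of the abstract theorem.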
Before the proof
The proof is relatively long. Instead of verifying the three conclusions directly for an arbitrary integrable function $f$, we shall show that the family of integrable functions satisfying the three conclusions (call it $\mathcal{F}$) contains all integrable functions. If you check the general version of Fubini’s theorem, you see that integrability is discussed explicitly there as well.
First, we shall show that $\mathcal{F}$ is not empty. This is important because otherwise we might be discussing something that does not exist. Second, since any integrable function can be “approximated” by simple functions, and simple functions are finite linear combinations of characteristic functions, we are encouraged to study linear combinations and limits in $\mathcal{F}$. Finally, we shall show that if $f$ is integrable, then $f \in \mathcal{F}$. The power of almost-everywhere will show up throughout the proof.
Complete proof of Fubini’s Theorem (With explanation)
Step 1 - $\mathcal{F}$ is not empty
It’s somewhat absurd to discuss the property of $\mathcal{F}$ without proving that it’s not empty. But that can be done easily.
Suppose $E$ is a bounded open cube in $\mathbb{R}^d$ such that $E = Q_1 \times Q_2$, where $Q_1$ and $Q_2$ are open cubes in $\mathbb{R}^m$ and $\mathbb{R}^n$. Then $\chi_E \in \mathcal{F}$.
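For each $y$, $\chi_E(x,y)$ is measurable as a function of $x$. Its integrability follows from
$$
g(y)=\int_{\mathbb{R}^m}\chi_E(x,y)dx=\begin{cases}\text{vol}(Q_1) & \text{if } y\in Q_2,\\ 0 & \text{otherwise.}\end{cases}
$$
This shows that $g=\text{vol}(Q_1)\chi_{Q_2}$, which is measurable and integrable as well. Further,
$$
\int_{\mathbb{R}^n}g(y)dy=\text{vol}(Q_1)\text{vol}(Q_2).
$$
Since we initially have $\int_{\mathbb{R}^d}\chi_Edm=\text{vol}(E)=\text{vol}(Q_1)\text{vol}(Q_2)$, we see that $\chi_E$ satisfies these three properties, hence $\chi_E \in \mathcal{F}$.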
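For a concrete instance of this step (the numbers are chosen here purely for illustration), take $m=n=1$, $Q_1=(0,1)$ and $Q_2=(0,2)$, so that $E=(0,1)\times(0,2)$. Writing $g(y)$ for the inner integral, the computation becomes

$$
g(y)=\int_{\mathbb{R}}\chi_E(x,y)dx=1\cdot\chi_{(0,2)}(y),\qquad
\int_{\mathbb{R}}g(y)dy=1\cdot 2=2=\text{vol}(E).
$$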
Step 2 - $\mathcal{F}$ is closed under finite linear combination
So far we have only handled open cubes in $\mathbb{R}^d$, which are far from exhausting the Lebesgue $\sigma$-algebra. To get there, we will have to deal with $G_\delta$ sets, but we can’t do that yet since we know nothing about limits in $\mathcal{F}$. We are also looking for simple functions, which are linear combinations of characteristic functions.
Any finite linear combination of functions in $\mathcal{F}$ also belongs to $\mathcal{F}$.
Since there are arbitrarily many bounded open cubes in $\mathbb{R}^d$, we are able to find arbitrarily many members of $\mathcal{F}$. Say,
$$
f_1,f_2,\cdots,f_N\in\mathcal{F},
$$
where we write $N$ for the number of functions to avoid a clash with the dimension $n$.
Following the definition of $\mathcal{F}$, for each $1 \leq k \leq N$, we are able to find a set $A_k \subset \mathbb{R}^n$ of measure $0$ such that $f_k^y$ is integrable on $\mathbb{R}^m$ whenever $y \notin A_k$. If we collect these sets together, namely $A=\bigcup_{k=1}^{N} A_k$, we see that on $\mathbb{R}^n-A$ all the $f_k$ have the desired property, and so does any finite linear combination of them (by the linearity of the Lebesgue integral). Since $A$ has measure zero as well, the finite linear combinations belong to $\mathcal{F}$.
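The linearity argument can be spelled out as follows. This is a sketch in which the coefficients $a_k$, the number of functions $N$, and the names $g_k(y)=\int_{\mathbb{R}^m}f_k^y(x)dx$ are labels introduced here. For $f=\sum_{k=1}^{N}a_kf_k$ and $y\notin A$,

$$
\begin{aligned}
\int_{\mathbb{R}^m}f^y(x)dx&=\sum_{k=1}^{N}a_k\int_{\mathbb{R}^m}f_k^y(x)dx=\sum_{k=1}^{N}a_kg_k(y),\\
\int_{\mathbb{R}^n}\left(\sum_{k=1}^{N}a_kg_k(y)\right)dy&=\sum_{k=1}^{N}a_k\int_{\mathbb{R}^n}g_k(y)dy=\sum_{k=1}^{N}a_k\int_{\mathbb{R}^d}f_k\,dm=\int_{\mathbb{R}^d}f\,dm.
\end{aligned}
$$

The first line gives integrability of the slices off the null set $A$, and the second gives both the integrability of the inner integral and the final identity.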
Step 3 - Monotone convergence in $\mathcal{F}$
Limits and convergence come into play. One may think of something like a complete metric space, where Cauchy sequences converge. In this step we show that $\mathcal{F}$ is closed under monotone limits, provided the limit is integrable.
Suppose $f_k$ is a sequence of measurable functions in $\mathcal{F}$ such that $f_{k} \leq f_{k+1}$ for all $k$, or $f_k \geq f_{k+1}$ for all $k$, and $f_k \to f$ where $f$ is integrable on $\mathbb{R}^d$. Then $f \in \mathcal{F}$.
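Without loss of generality, it suffices to assume that
$$
0\leq f_1\leq f_2\leq\cdots\leq f_k\leq f_{k+1}\leq\cdots,
$$
since in the other situations we can pass to $-f_k$ or $f_k-f_1$. An application of the monotone convergence theorem yields
$$
\lim_{k\to\infty}\int_{\mathbb{R}^d}f_k\,dm=\int_{\mathbb{R}^d}f\,dm.
$$
Also, we can find sets $A_k$ of measure $0$, carrying the same meaning as in Step 2. For $A=\bigcup_{k=1}^{\infty}A_k$, we also have $m(A)=0$ in $\mathbb{R}^n$, and for $y \in \mathbb{R}^n - A$, $f_k^y$ is integrable on $\mathbb{R}^m$ for all $k$. Thus, by the monotone convergence theorem, as $k\to\infty$,
$$
g_k(y)=\int_{\mathbb{R}^m}f_k^y(x)dx\nearrow g(y)=\int_{\mathbb{R}^m}f^y(x)dx.
$$
Clearly we have $g_k \leq g_{k+1}$ for all $k$, and by assumption, each $g_k$ is integrable. Using the monotone convergence theorem again, we see that
$$
\lim_{k\to\infty}\int_{\mathbb{R}^n}g_k(y)dy=\int_{\mathbb{R}^n}g(y)dy.
$$
Combining these two limits with $\int_{\mathbb{R}^n}g_k(y)dy=\int_{\mathbb{R}^d}f_k\,dm$, which holds because $f_k\in\mathcal{F}$, we see
$$
\int_{\mathbb{R}^n}g(y)dy=\int_{\mathbb{R}^d}f\,dm.
$$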
We’ll show that $f \in \mathcal{F}$ by checking its properties one by one.
Since $f$ is integrable, we see that $\int_{\mathbb{R}^n}g = \int_{\mathbb{R}^d}f<\infty$. Thus $g$ is integrable.
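Since $g$ is integrable, we have $g(y)<\infty$ for a.e. $y$; consequently $f^y$ is integrable for a.e. $y$.
By the definition of $g$, we have
$$
\int_{\mathbb{R}^n}\left(\int_{\mathbb{R}^m}f(x,y)dx\right)dy=\int_{\mathbb{R}^n}g(y)dy=\int_{\mathbb{R}^d}f\,dm.
$$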
Thus $f \in \mathcal{F}$ as proved.
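The reduction to a nonnegative increasing sequence was only hinted at in this step. One way to make it explicit, as a sketch in which the auxiliary sequence $h_k$ is a name introduced here, is the following. In the decreasing case $f_k\geq f_{k+1}$, set

$$
h_k=f_1-f_k,\qquad 0=h_1\leq h_2\leq\cdots,\qquad h_k\to f_1-f.
$$

Each $h_k$ lies in $\mathcal{F}$ by Step 2 and $f_1-f$ is integrable, so the increasing case gives $f_1-f\in\mathcal{F}$, and then $f=f_1-(f_1-f)\in\mathcal{F}$ by Step 2 again. In the increasing case without nonnegativity, the same argument applied to $h_k=f_k-f_1$ gives $f=f_1+(f-f_1)\in\mathcal{F}$.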
Step 4 - Characteristic functions of measurable sets
4.1 - Final destination
We are pretty close to simple functions now. To get rid of infinity, we are going to prove this:
If $E$ is any measurable subset in $\mathbb{R}^d$ with $m(E)<\infty$, then $\chi_E\in\mathcal{F}$.
Once this is done, we can easily construct simple functions in $\mathcal{F}$, which approximate any integrable function. Fortunately, the structure of Lebesgue measurable sets lets us break a measurable set into simpler pieces. Recall the fact that
$E \subset \mathbb{R}^d$ is Lebesgue measurable if and only if there are sets $A$ and $B\subset\mathbb{R}^d$ such that $A \subset E \subset B$, $A$ is an $F_{\sigma}$ and $B$ is a $G_{\delta}$, and $m(B-A)=0$.
Since $B-E \subset B-A$, we also have $m(B-E)=0$, and hence $m(B)=m(E)<\infty$. Also, since $E \cup (B-E)=B$ and $E \cap (B-E) = \varnothing$, we have
$$
\chi_{E}+\chi_{B-E}=\chi_{B},
$$
which is equivalent to $\chi_{E}=\chi_{B}-\chi_{B-E}$. Notice that the right-hand side of this equation is a finite linear combination of functions (Step 2 comes into play). If we prove that $\chi_{B},\chi_{B-E} \in \mathcal{F}$, then we are done.
We are going to prove that if $E$ is a $G_{\delta}$ set of finite measure, or $E$ has measure $0$, then $\chi_{E}\in\mathcal{F}$. By the decomposition above, these two key cases cover all Lebesgue measurable sets of finite measure.
4.2 - Finite measure $G_{\delta}$ sets
In Step 1 we proved that $\chi_{E} \in \mathcal{F}$ if $E$ is a bounded open cube. Now we are going to generalize this to $G_\delta$ sets, which are countable intersections of open sets. We also use the fact that every open set can be written as a countable union of almost disjoint closed cubes ($\mathbb{R}^d$ is a locally compact Hausdorff space in which every open set is $\sigma$-compact). You will see how Step 2 and Step 3 play a role in this section.
4.2.1 - Characteristic function of closed cubes
If $Q$ is a closed cube in $\mathbb{R}^d$, then $\chi_{Q} \in \mathcal{F}$.
Since $Q = \text{int}(Q) \cup \partial{Q}$, where $\text{int}(Q)$ denotes its interior and $\partial{Q}$ denotes its boundary, and the union is disjoint, we have
$$
\chi_{Q}=\chi_{\text{int}(Q)}+\chi_{\partial{Q}}.
$$
As proved in Step 1, $\chi_{\text{int}(Q)} \in \mathcal{F}$. So it remains to prove that $\chi_{\partial{Q}}\in\mathcal{F}$, and the conclusion follows from Step 2.
Since $m(\partial{Q})=0$, we have $\int_{\mathbb{R}^d}\chi_{\partial{Q}}dm=0$. Write $Q=Q_1\times Q_2$ with closed cubes $Q_1\subset\mathbb{R}^m$ and $Q_2\subset\mathbb{R}^n$. For $y\notin\partial{Q_2}$, the slice $(\partial{Q})^y$ is either $\partial{Q_1}$ or empty, so it has measure $0$ in $\mathbb{R}^m$; and $\partial{Q_2}$ itself has measure $0$ in $\mathbb{R}^n$. Therefore $g(y)=\int_{\mathbb{R}^m}\chi_{\partial{Q}}(x,y)dx=0$ for a.e. $y$. Consequently, $\int_{\mathbb{R}^n}gdy=0$, and therefore $\chi_{\partial{Q}} \in \mathcal{F}$.
4.2.2 - Finitely many almost disjoint closed cubes
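Suppose $E = \bigcup_{k=1}^{K}Q_k$, where each $Q_k$ is a closed cube and $\text{int}(Q_i)\cap\text{int}(Q_j)=\varnothing$ for $i \neq j$. Then $\chi_{E} \in \mathcal{F}$.
This conclusion follows once one notices that
$$
\chi_{E}=\sum_{k=1}^{K}\chi_{\text{int}(Q_k)}+\chi_{N},\qquad N=E-\bigcup_{k=1}^{K}\text{int}(Q_k)\subset\bigcup_{k=1}^{K}\partial{Q_k}.
$$
(The simpler identity $\chi_E=\sum_{k=1}^{K}\chi_{Q_k}$ holds only outside the overlapping boundaries.)
In Step 1 we showed that $\chi_{\text{int}(Q_k)} \in \mathcal{F}$, and the argument of 4.2.1 applies to $N$ as well, since $N$ and almost all of its slices are contained in sets of measure $0$. Hence $\chi_{E} \in \mathcal{F}$ according to Step 2.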
4.2.3 - Arbitrary open sets with finite measure
Since every open set in $\mathbb{R}^d$ can be written as a countable union of almost disjoint closed cubes, for an open set $E$ we have
$$
E=\bigcup_{k=1}^{\infty}Q_k.
$$
If we take $E_{K}=\bigcup_{k=1}^{K}Q_k$ and $f_K=\chi_{E_K}$, then $f_K\in\mathcal{F}$ by 4.2.2. We are going to follow Step 3 to show that $\chi_{E} \in \mathcal{F}$ if $m(E)<\infty$.
Since the Lebesgue $\sigma$-algebra contains all Borel sets and $E$ is open, we see that $E$ is measurable. If $m(E)<\infty$, then $\chi_{E}$ is integrable. Also we have $f_{K+1} \geq f_{K}$ for all $K$, and $f_{K} \to \chi_{E}$; hence the sequence $f_{K}$ is exactly of the kind described in Step 3. That is, $\chi_{E} \in \mathcal{F}$.
4.2.4 - Arbitrary $G_\delta$ sets
If $E$ is a $G_\delta$ set of finite measure, then $\chi_{E} \in \mathcal{F}$.
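By the definition of $G_\delta$ sets, we have
$$
E=\bigcap_{k=1}^{\infty}R_k,
$$
where the $R_k$ are open sets. Since $m(E)<\infty$ and $m$ is regular, there is an open set $S_0 \supset E$ such that $m(S_0)<\infty$. Let
$$
S_k=S_0\cap\bigcap_{j=1}^{k}R_j.
$$
Then we have
$$
E=\bigcap_{k=1}^{\infty}S_k.
$$
Each $S_k$ is an open set of finite measure, so $\chi_{S_k}\in\mathcal{F}$ by 4.2.3. Since $S_0 \supset S_1 \supset \cdots$, the sequence $f_k=\chi_{S_k}$ decreases to the limit $f=\chi_E$. Following Step 3, we see that $\chi_E \in \mathcal{F}$.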
4.3 - Sets with measure $0$
If $m(E)=0$, then $\chi_{E} \in \mathcal{F}$.
If $E$ is a $G_{\delta}$ set, then we are done by 4.2. If not, we rely on the fact that $m$ is a complete measure.
Again, by the regularity of $m$, we may choose a $G_\delta$ set $G$ such that $E \subset G$ and $m(G)=0$. As proved, $\chi_{G} \in \mathcal{F}$. Therefore
$$
\int_{\mathbb{R}^n}\left(\int_{\mathbb{R}^m}\chi_{G}(x,y)dx\right)dy=\int_{\mathbb{R}^d}\chi_{G}\,dm=0.
$$
Thus the slice $G^y$ has measure $0$ for a.e. $y$. Since $E^y \subset G^y$ and $m$ is complete, $E^y$ is measurable with measure $0$ for a.e. $y$. Therefore the fact that $\chi_{E}\in\mathcal{F}$ can be verified by a simple calculation.
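For completeness, the calculation can be written out as follows; here $g$ is just a name for the inner integral, and $m(E^y)$ denotes the Lebesgue measure of the slice in $\mathbb{R}^m$. For a.e. $y$,

$$
g(y)=\int_{\mathbb{R}^m}\chi_{E}(x,y)dx=m(E^y)=0,\qquad
\int_{\mathbb{R}^n}g(y)dy=0=m(E)=\int_{\mathbb{R}^d}\chi_{E}\,dm.
$$

So the slices are integrable for a.e. $y$, the inner integral is integrable, and the two sides of the identity agree, which are exactly the three conditions for membership in $\mathcal{F}$.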
Step 5 - All integrable functions
If $f$ is integrable, then $f \in \mathcal{F}$.
As in the construction of the Lebesgue integral, $f$ has the decomposition $f= f^+-f^-$, where $f^+$ and $f^-$ are nonnegative and integrable. Thus it suffices to prove this for nonnegative $f$ (by Step 2).
There exists a sequence of integrable and nonnegative simple functions $s_k$ that increases monotonically to $f$. Since each integrable $s_k$ is a finite linear combination of characteristic functions of sets with finite measure, by Steps 2 and 4, $s_k \in \mathcal{F}$. By Step 3, we have $f \in \mathcal{F}$.
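One standard choice of such a sequence, given here as an illustration rather than as the construction the post necessarily has in mind (the set names $E_{k,j}$ and $H_k$ are introduced for this sketch), is the dyadic truncation of a nonnegative integrable $f$:

$$
s_k=\min\left(k,\ 2^{-k}\lfloor 2^kf\rfloor\right)=\sum_{j=1}^{k2^k-1}\frac{j}{2^k}\chi_{E_{k,j}}+k\,\chi_{H_k},\qquad
E_{k,j}=\left\{\frac{j}{2^k}\leq f<\frac{j+1}{2^k}\right\},\quad H_k=\{f\geq k\}.
$$

Here $0\leq s_k\leq s_{k+1}\leq f$ and $s_k\to f$ pointwise. Every set appearing in the sum is contained in $\{f\geq 2^{-k}\}$, whose measure is at most $2^k\int f\,dm<\infty$ by Chebyshev’s inequality, so each $s_k$ is indeed a finite linear combination of characteristic functions of sets with finite measure.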
Fubini’s theorem shows that multidimensional integrals in the sense of measure theory can often be evaluated by iteration (at least ‘almost everywhere’). However, there are counterexamples in which the conclusion of Fubini’s theorem fails once its hypotheses are dropped, which will be discussed later.
This proof is a good example of how to play with the elements of the Lebesgue integral. Let’s recap. We want to show that every integrable function lies in $\mathcal{F}$, which can’t be done directly. So we look for simple functions, which are generated by characteristic functions. Luckily, we obtained a wide enough range of characteristic functions. With linear combinations and limits, we finally reached all integrable functions. The properties of ‘almost everywhere’ played a critical role.