Introducing Riemann-Stieltjes Integral

Motivation

Riemann-Stieltjes integral is a generalisation of Riemann integral, the one every college student studies in their calculus class, and is a little bit more difficult to understand. Nevertheless it has advantages of its own, as we will show below. Before seeing the definition and properties of this integral, we first raise some questions that will can motivate our study.

When talking about abfdg, one may simply think about abfgdx. But is it even necessary that g is differentiable? What would happen if g is simply continuous, or even not continuous? Further, given that g is differentiable, can we prove that

abf(x)dg(x)=abf(x)g(x)dx

in a general way(without assuming f is differentiable)? Although integration can be connected to differentiation, it should not be mandatory to lock ourselves into C1 functions, C2 functions or smooth functions all the time.

Another motivation comes from probability theory. Oftentimes one need to consider discrete case () and continuous case () separately. One may say that integral is the limit of summation, but it would be weird to write as lim every time. However, if we have a way to write a sum, for example the expected value of a discrete variable (E(X)), as an integral, things would be easier. Of course, we don’t want to write such a sum as another sum by adding up the integral on several disjoint segments. That would be weirder.

If you have learned measure theory, you will know that Lebesgue integral does not perfectly cover Riemann integral. For example, 0sinxxdx is not integrable in the sense of Lebesgue but Riemann. We cannot treat Lebesgue integral as a perfect generalization of Riemann integral. In this blog post however, we will be studying a faithful generalization of Riemann integral, adding the name of Stieltjes.

We are trying our best to prevent ourselves from using sup, inf, and differentiation theory. But εδ language is heavily used here, so make sure that you are good at it.

Riemann-Stieltjes Integral

By a partition P on [a,b] we mean a sequence of numbers (xn) such that

a=x0x1xn=b

and we associate its size by

σ(P)=maxk(xk+1xk).

Let f, g be bounded real function on [a,b] (again, no continuity or differentiability required). Given a partition P and numbers ck with xkckxk+1, we define the Riemann-Stieltjes sum (RS-sum) by

S(P,f,g)=k=0n1f(ck)[g(xk+1)g(xk)].

We say that the limit

limσ(P)0S(P,f,g)

exists if there exists some LR such that give ε>0, there exists δ>0 such that whenever σ(P)<δ, we have

|S(P,f,g)L|<ε.

In this case, we say f is RS(g)-integrable, and the limit is denoted by

abfdg.

This is the so-called Riemann-Stieltjes integral. When g(x)=x, we get Riemann integral naturally.

Remarks: Further generalization still available

This integral method can be generalized to Banach space. Let f, g be bounded maps of [a,b] into Banach spaces E, F respectively. Assume we have a product E×FG denoted by (u,v)uv with uvuv. Then by replacing the absolute value by norm, still we get the Riemann-Stieltjes integral, although in this case we have

abfdgG

and G is not necessary to be R. This is different from Bochner integral, since no measure theory evolved here.

Linearity with respect to f and g

First, we shall show that RS(g)-integrable functions form a vector space. To do this, it suffices to show that

fS(P,f,g)

and

gS(P,f,g)

are linear. This follows directly from the definition of RS-sum. Let’s see the result.

Suppose we have

abfdg=I,abhdg=J,abfdu=K

Then we have the following identities for αI.

  1. abαfdg=αI.
  2. ab(f+h)dg=I+J.
  3. abfd(g+u)=I+K.
  4. abfd(αg)=αI.

Proof. We shall show 2 for example. Other three identities follows in the same way.

Notice that the existence of the limit of RS-sum depends only on the size of P. For ε>0, there exists some δ1,δ2>0 such that

|S(P,f,g)I|<ε2,|S(P,h,g)J|<ε2

when σ(P)<δ1 and σ(P)<δ2 respectively. By picking δ=min(δ1,δ2), we see for σ(P)<δ, we have

|S(P,f+h,g)(I+J)|=|(S(P,f,g)I)+(S(P,h,g)J)||S(P,f,g)I|+|S(P,h,g)J|<ε2+ε2=ε.

Integration by parts but no differentiation

fRS(g) if and only if gRS(f). In this case, we also have integration by parts:

abfdg+abgdf=f(b)g(b)f(a)g(a)

You may not believe it, but differentiation does not play any role here, as promised at the beginning.

Proof. Using the summation by parts (by Abel), we have

S(P,f,g)=k=0n1f(ck)[g(xk+1)g(xk)]=k=1n1g(xk)[f(ck)f(ck1)]+f(cn1)g(b)f(c0)g(a).

By writing

S(P,f,g)=S(P,f,g)+f(a)g(a)f(a)g(a)+f(b)g(b)f(b)g(b)

we have

S(P,f,g)=f(b)g(b)f(a)g(a)S(Q,g,f)

where

S(Q,g,f)=k=1n1g(xk)[f(ck)f(ck1)]+[f(b)f(cn1)]g(b)+[f(c0)f(a)]g(a).

Consider the partition Q by

yk={ak=0ck1k<nbk=n

we have x0,x1,,xn1,xk to be intermediate points, and

S(Q,g,f)=k=0n1g(xk)[f(yk+1)f(yk)].

Since 0<σ(Q)2σ(P)4σ(Q), when σ(P)0, we also have σ(Q)0 and vice versa. Suppose now abgdf exists, we have

limσ(P)0S(P,f,g)=f(b)g(b)f(a)g(a)abgdf=abfdg.

And integration by parts follows.

Suppose abfdg exists, then

limσ(Q)0S(Q,g,f)=f(b)g(b)f(a)g(a)abfdg=abgdf.

This proves the proposition.

The flexibility of Riemann-Stieltjes integral

As said before, we want to represent both continuous and discrete case using integral. For measure theory, we have Lebesgue measure and counting measure. But in some cases, this can be done using Riemann-Stieltjes integral as well. Ordinary Riemann integral and finite or infinite series are all special cases of Riemann-Stieltjes integral.

From integral to series (discrete case)

To do this, we need the unit step function by

I(x)={0x0,1x>0.

If a<s<b, f is bounded on [a,b] and continuous at s, by putting g(x)=I(xs), we have

abfdg=f(s)

Proof. A simple verification shows that abfdg=sbfdg (by unwinding the RS-sum, one see immediately that g(xk)=0 for all xks, therefore the partition before s has no tribute to the value of the integral). Now consider the partition P by

s=x0<x1<<xn=b.

We see

S(P,f,g)=k=0n1f(ck)(g(xk+1)g(xk))=f(c0)(g(x1)g(x0))=f(c0).

As x1s, we have c0s, since f is continuous at s, we have f(c0)f(s) as desired.

By the linearity of RS integral, it is easy to generalize this to the case of finite linear combination. Namely, for g(x)=k=1ncnI(xsn), we have

abfdg=k=1ncnf(sn).

But now we are discussing the infinite case.

Suppose cn0 for all n0 and n0cn converges, (sn) is a sequence of distinct points in (a,b), and

g(x)=n0cnI(xsn).

Let f be continuous on [a,b]. Then

abfdg=ncnf(sn)

Proof. First it’s easy to see that g(x) converges for every x, and is monotonic with g(a)=0, g(b)=ncn. For given ε>0, there exists some N such that

N+1cn<ε.

Putting

g1(x)=n=1NcnI(xsn),g2(x)=N+1cnI(xsn)=g(x)g1(x)

we have

abfdg1=n=1Ncnf(sn).

By putting M=sup|f(x)|, we see

|abfdg2|=|abfdgabfdg1|=|abfdgn=1Ncnf(sn)|Mε

The inequality holds since |g2(b)g2(a)|<ε. Since M is finite, when N, we have the desired result.

Transformed into ordinary Riemann integral (continuous case)

Finally we will discuss differentiation. The following theorem shows the connection between RS integral and Riemann integral.

Let f be continuous and suppose that g is real differentiable on [a,b] while g is Riemann integrable as well, then fRS(g) and

abfdg=abfgdx

Proof. By mean value theorem, for each k, we have

g(xk+1)g(xk)=g(ζk)(xk+1xk).

The RS-sum can be written as

S(P,f,g)=k=0n1f(ck)[g(xk+1)g(xk)]=k=0n1f(ck)g(ζk)(xk+1xk).

Since g is Riemann integrable, we have

k=0n1|g(ck)g(ζk)|(xk+1xk)<ε

given that |S(P,g,x)abgdx|<ε. Therefore

|k=0n1f(ck)g(ζk)(xk+1xk)k=0n1f(ck)g(ck)(xk+1xk)|Mε

where M=sup|f(x)|< (f is assumed to be bounded.) . Also notice that fg is integrable since f is continuous. Therefore

|S(P,f,g)abfgdx|=|S(P,f,g)S(P,fg,x)+S(P,fg,x)abfgdx||S(P,f,g)S(P,fg,x)|+|S(P,fg,x)abfgdx|<(M+1)ε.

Therefore,

abfdg=abfgdx,

which proves the theorem.

To sum up, given ε>0, there exists some δ>0 such that if σ(P)<δ, we have

|S(P,g,x)abgdx|<ε/(M+1)

and

|S(P,fg,x)abfgdx|<ε/(M+1).

After some estimation, we get

|S(P,f,g)abfgdx|<(M+1)εM+1=ε.

Comments