This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
'\T>n)<(l-e)^"/"^, which tends to zero as n ^ o o exponentially rapidly. By property (a) of the
1. Coupling
67
coupling, \p''\u,y)-p''\v,y)\
= \P^-^^\X, = y, Y,^y]-P'^^^\X,^y,
Y, = y]\
which tends to zero by property (b). Since msix^p^^\x, y) decreases and minxp^^\x,y) increases in k for each j , the proof of (1.3) is now easy. Statement (1.4) follows by taking the limit in (1.1) as /c->oo. D The next application of coupling is to the problem of proving that certain Markov chains have no nonconstant bounded harmonic functions. While the primary purpose of this application is to illustrate the use of coupling, bounded harmonic functions for Markov chains will arise naturally in Chapters V and VIII. Suppose now that p{x, y) are the transition probabilities for a Markov chain on the countable state space 5. By a (Markov) coupling of two copies of this chain, we will mean a Markov chain on 5 x 5 with transition probabilities p{{x,y), (w, v)) which satisfy
1 pii^, y)Au, v)) = p{x, u) V
and 1 P(ix, y)Au, v)) = p(y, v). u
This property guarantees that the two marginals of the coupled process are Markovian and have transition probabilities /?(•,•)• The coupling will be said to be successful if for any (x, y)e SxS, P^""' ^^[X„ = ¥„ for all sufficiently large n]= 1. The bivariate chain constructed in the proof of Theorem 1.2 is an example of a successful coupling. A bounded function h on S is said to be harmonic for j9( •, •) if h{x)=lp(x,y)h(y) y
for all xe S. Theorem 1.5. Suppose there exists a successful coupling for the chain /?(•, • )• Then every bounded harmonic function for /?(•,•) is constant. Proof Let (X„, y„) be a successful coupling, and let r be a (finite-valued) random variable such that X„ = Y^ for all n > r. If /i is a bounded harmonic function for /?(•, •), then h(x) = E''h{X,)
68
II. Some Basic Tools
for all X 6 S and n > 0. Therefore
\hix)-hiy)\
= \E''h{X„)-E'hiX„)\
u
<2[sup|/i(u)|]P<"'^^[T>n]. u
Since the right side tends to 0 as n ^ o o , h{x) = h(y) for all x,yeS, hence h is constant. D
and
The Markov chain X„ will be called a random walk if 5 = Z"^ for some J > 1 and p{x, y)=p{0,y — x). It is said to be irreducible if for each (x, y) e SxS there is an m so that p^"'\x, y)>0. A basic fact is that an irreducible random walk has no nonconstant bounded harmonic functions. (When Z"^ is replaced by a general locally compact Abelian group, this result is known as the Choquet-Deny Theorem. See Chapter VIII of Meyer (1966) or Chapter 5 of Revuz (1975), for example.) There are a number of approaches to the proof of this fact, one of which will be given in Section 7 (see Corollary 7.2). To illustrate the use of coupling, suppose we attempt to give a proof based on Theorem 1.5. Consider then the case of an irreducible random walk on Z"^. The simplest coupling which has a chance of succeeding, at least in some cases, is to let the coordinates X„ and y„ evolve independently until they meet, and then to force them to move together afterwards. This was the coupling which was used in the proof of Theorem 1.2. To see when this coupling is successful, note that X„ - Y„ is a symmetric random walk with transition probabilities
q(x,y)=lp{x,u)p(y,u) until it hits 0, after which it remains at 0. The q random walk is irreducible and recurrent provided for example that d = l or 2 and that the p random walk is irreducible, aperiodic, and has finite second moments. In this case the coupling is successful. The finite second moment assumption is easy to remove by modifying the coupling slightly. It suffices to have the coordinates X„ and Yn take the same jump if the jump is large, and take independent jumps if the jumps are small, until the coordinates meet. (This is a modification of a couphng used by Ornstein (1969).) The reader is encouraged to write down explicitly the transition probabilities for the coupled chain which accomplishes this objective.
1. Coupling
69
It might appear from the previous paragraph that the usefulness of Theorem 1.5 is limited to d = \ or d = 2, because it is only in that case that the q random walk has any chance of being recurrent. This is true only if one is inflexible in the choice of a coupling. To illustrate the possibilities, consider the simple random walk on Z"^, in which
for nearest neighbors x and y. This mechanism can be thought of as choosing a coordinate at random, each with probability d~\ and then adding ±1 with probability j each to the coordinate chosen. With this in mind, a successful coupling can be constructed for the simple random walk by having X„ and ¥„ choose the same coordinate to modify, and then having the choice of which of +1 or - 1 to add to that coordinate be independent for the two processes if those coordinates do not agree, and the same choice if those coordinates do agree. This coupling is successful (for initial points X and y so that x-y has all coordinates even) because the simple random walk on ZMs recurrent. Of course, more involved couplings will work for more general random walks on Z"^. Sometimes it is convenient to define a coupling in which the joint mechanism depends on the initial states of the two processes. This will be illustrated by giving a coupling proof of the Choquet-Deny Theorem for a general irreducible random walk on Z"^. We may assume that p{0,0) > 0, since otherwise /?(•, •) could be replaced by
using the fact that p and p have the same harmonic functions. By irreducibility, it suffices to show that h(x) = h{y) for any bounded harmonic function and any x and y satisfying p{x,y)>0. We do this by using the idea of Theorem 1.5, but will construct the coupled process (X„, y„) with XQ = X and Yo = y only on D = {(M, v)eZ'^ XZ"^: V-U is an integer multiple of y — x} for some fixed X9^ y satisfying p{x, y) > 0. The coupled process has the transition probabilities given below until the first time that X„ = Y„. Set s = min{/7(0, 0), p{x, y)} > 0, and let (M + w, i; + w)
with probability p{0, w), for all Wy^O.y — x,
{u,v-\-y-x)
with probability s,
{u,v)-> <j {u-hy-x,v)
with probability e,
(w, v) [ (u-\-y-x,
with probability p{0, 0) - e, v-\-y-x)
with probability/^(x, j ; ) - £ .
70
II. Some Basic Tools
Note that the chain remains on D at all times, and that the difference Yn — X„ has the following transition probabilities until the first time X„ = F„: for integer /c,
i
(/c + l ) ( j - x )
with probability 8,
k{y - x)
with probability 1 - 2e,
{k-\){y-x) with probability e. Therefore by the recurrence of the simple symmetric random walk, eventually the chain will satisfy X„ = y„. At that point the transition probabilities are modified so that X„ — Y^ for all later times, and then the coupling is successful. It should be noted from the proof of Theorem 1.5 that the existence of a successful coupling for a Markov process implies that the total variation distance between the distributions of the process at time n for different initial points tends to zero as n tends to infinity. (The converse to this statement is true also, provided one allows non-Markovian couplings as well—see Theorem 3 in Griffeath (1978b).) This total variation convergence essentially never occurs for particle systems. Therefore one cannot expect couplings to exist in that context which are successful in the sense that the two processes agree from some time on. For a coupling (77^, l^ of processes with state space {0, 1}^, the most which can be hoped for is then that (1-6)
limP[77,(x) = f,(x)]=l
for each JC G 5. Fortunately, this turns out to be sufficient for many purposes, as will be seen in later chapters. Even when (1.6) does not hold, coupling can be extremely useful. The next section expands on this statement.
2. Monotonicity and Positive Correlations As we will see later, it is nearly impossible to prove limit theorems for particle systems by developing and using estimates. It is therefore essential to take advantage of any monotonicity which may be present in a problem. There is an intimate connection between coupling and monotonicity. In the previous section, couplings were used which made two processes agree at large times with high probability. When used in connection with monotonicity arguments, the objective of a coupling is to show that if certain inequalities between the distributions of the processes of interest hold initially, then they continue to hold at later times. This section is devoted to a discussion of a number of ideas related to monotonicity and coupling which will be used often in this book. At the end, two results concerning measures with positive correlations will be presented.
2. Monotonicity and Positive Correlations
71
Throughout this section, X will be a compact metric space on which there is defined a partial order. A typical example to keep in mind is X — {0, 1}^ where 5 is a finite or countable set. The partial order should be compatible with the topology in the sense that {(77,neXxX:77
for
I
fdfx
allfeJt.
Note that by approximation, /xi
feJt implies S(t)fe M for all t>0. fJii^iJii implies iJLiS(t)^ jjL2S(t) for all t^O.
Proof Suppose (a) holds and 1x1^1x2- Given feJi,
S{t)fdfi2
/d[^25(0],
S(t)feM
also, so
72
IL Some Basic Tools
and hence /Xi5(0 ^ ßiSit). For the other direction, assume that (b) holds, and take feJi. Then for rj — L the pointmasses 5^ and 8^ satisfy 8^ < 8^, so that 5(0/(7,) = £ Y ( 7,,) =
^fd[8^Sit)]
<j/d[S,S(f)] = E'f{r,,) = Sit)M).
D
Definition 2.3. A Feller process is said to be monotone if the equivalent conditions of Theorem 2.2 are satisfied. (In the context of particle systems in later chapters, such a process will be called attractive rather than monotone.) One connection between Definitions 2.1 and 2.3 on the one hand and coupling on the other, is that usually condition (a) of Theorem 2.2 is verified by constructing a coupled process (rjt, ^t) which has the property that v — ^ implies
A particularly simple instance of this occurs when X is linearly ordered and the process has continuous paths. (For example, a diffusion process on [0, 1], or a birth and death chain on {0, 1 , . . . , N}.) Then a simple coupling which has the desired property is obtained by letting the two copies of the process evolve independently until the first time (if ever) that they agree, and then letting them evolve together. Thus we see that such processes are monotone. It might appear at first glance that the existence of a coupling which preserves the inequality rjt^^t is a much stronger statement than (a) of Theorem 2.2. The following result shows that this is not the case, and at the same time gives additional insight into the content of Definition 2.1. Theorem 2.4. Suppose fii and /X2 are probability measures on X. A necessary and sufficient condition for /txi < /X2 is that there exist a probability measure v on X xX which satisfies (a)
p{{rj, 0' ^ e A} = )txi(A), and
(b)
v{{7),0'^C^A}
= ^^.2{A)
for all Borel sets in X, and
(c)
p{{ri,n'^7j^n=i-
2. Monotonicity and Positive Correlations
73
Proof. The sufficiency of the condition is clear. For the necessity, suppose that /Xi
$
Note that $ is upper semicontinuous, so that the integral defining p{(p) is meaningful. It is easy to check that p satisfies the properties p{(Pi+(p2)^p{(p\)+p{^2). p{c(p) = cp((p)
and
for c > 0 .
Define a linear functional T on the subspace of all (pe C{X xX) are of the form
(2.5)
which
cp{v,n=Mv)^f2(n
for s o m e / i , / 2 G C ( X ) by (2.6)
T