LARGE DEVIATIONS FOR MINKOWSKI SUMS OF HEAVY-TAILED GENERALLY NON-CONVEX RANDOM COMPACT SETS
Thomas Mikosch1, Zbynek Pawlas2, Gennady Samorodnitsky3
1. Department of Mathematics, University of Copenhagen, Denmark, Professor, [email protected]
2. Department of Probability and Mathematical Statistics,
Charles University in Prague, Czech Republic Professor, [email protected]
3. School of Operations Research and Information Engineering,
Cornell University, New York, USA
Professor, [email protected]
We prove large deviation results for Minkowski sums of iid random compact sets where we assume that the summands have a regularly varying distribution. The result confirms the heavy-tailed large deviation heuristics: “large” values of the sum are essentially due to the “largest” summand.
1. Introduction
Preliminaries on random sets and Minkowski addition. The theory of random sets is summarized in the recent monograph [9]. For all definitions introduced below we refer to [9]. Let F be a separable Banach space with norm || • ||. For A\,A^ C F and a real number A, the Minkowski addition and scalar multiplication, respectively, are defined by
Ai + A2 = + $2 : £ Ai, a2 £ A2}, AA1 = {Aa1 : ai £ A\}.
We denote by K(F) the class of all non-empty compact subsets of F. Note that this is not
a vector space. However, it is well known that K(F) equipped with the Hausdorff distance
d(A;i_,A2) = max < sup inf |a1 — a2||, sup inf |a1 — a2|| > , A1?A2 £K(F), la1£A1 a2GA2 a2EA2 ai^Ai )
forms a complete separable metric space. The Hausdorff metric is subinvariant, i.e.,
d(A1 + A, A2 + A) < d(A1, A2) for any A1, A2, A £ K(F). (1)
For any subset U of K(F), a real number A and a set A £ K(F) we use the notation
AU = {AC : C £U} and U + A = {C + A : C £U}.
A random compact set X in F is a Borel measurable function from an abstract probability space (Q, F, P) into K(F). Since addition and scalar multiplication are defined for random compact sets it is natural to study the strong law of large numbers, the central limit theorem, large deviations, etc., for sequences of such random sets; see Chapter 3 in [9] for
© T. Mikosch, Z. Pawlas, G. Samorodnitsky, 2011
an overview of results obtained until 2005. A general Cramer-type large deviation result for Minkowski sums of iid random compact sets was proved in [2]. Cramer-type large deviations require exponential moments of the summands; see Chapter 8 in Valentin V. Petrov’s classical monograph [13] for the case of sums of independent real-valued variables and [3] in the case of more general random structures. If such moments do not exist, then we are dealing with heavy-tailed random elements. Large deviations results for sums of heavy-tailed random elements significantly differ from Cramer-type results. In this case it is typical that only the largest summand determines the large deviation behavior; see the classical results by A. Nagaev [10, 11] for sums of iid random variables; cf. [12, 6]. It is the aim of this paper to prove large deviation results for sums of heavy-tailed random compact sets. In what follows, we make this notion precise by introducing regularly varying random sets.
Regularly varying random sets. A special element of K(F) is Ao = {0}. In what follows, we say that U C K(F) is bounded away from A0 if A0 £ clU, where clU stands for the closure of U. We consider the subspace K0(F) = K(F) \ {A0}, which is a separable metric space in the relative topology. For any Borel set U C K0(F) and e > 0, we write
Ue = {A £ K0(F) : d(A, C) < e for some C £ U}.
Furthermore, we define the norm ||A|| = d(A, A0) = sup{||a|| : a £ A} for A £ K(F), and denote Br = {A £ K(F) : ||A|| < r}. Let M0(K0(F)) be the collection of Borel measures on K0(F) whose restriction to K(F) \ Br is finite for each r > 0. Let C0 denote the class
of real-valued, bounded and continuous functions f on K0 (F) such that for each f there
exists r > 0 and f vanishes on Br. The convergence jn —► j in M0(K0(F)) is defined
n—— tt
to mean the convergence f f 1jn —> if 1j for all f £ C0. By the portmanteau theorem
n—— tt
([5], Theorem 2.4), jn —> j in M0(K0(F)) if and only if jn(U) —> j(U) for all Borel
n—— tt n—— tt
sets U C K(F) which are bounded away from A0 and satisfy j(dU) = 0, where dU is the boundary of U.
Following [5], for the general case of random elements with values in a separable metric space, we say that a random compact set X is regularly varying if there exist a non-null measure j £ M^K0(F)) and a sequence {a,n}n>1 of positive numbers such that
nP(X £ an) —^ j(-) in M0(K0(F)a. (2)
n——tt
The tail measure j necessarily has the property j(AU) = A-aj(U) for some a > 0 and all Borel sets U in K0F) and all A > 0. We then also refer to regular variation of X with index a. From the definition of regular variation of X we get ([5], Theorem 3.1)
[P(X £ t(K(F) \ B1))]-1 P(X £ t) ^ cj(•) in M0(K0(F)a as t ^m, (3)
for some c > 0. The sequence {an}n>1 will always be chosen such that nP(X £ an(K(F) \ B1)) —► 1. With this choice of {an}n>1, it follows that c =1 in (3).
n——tt ~
An important closed subset of K(F) is the family of non-empty compact convex subsets of F, denoted by coK(F). Denote the topological dual of F by F* and the unit ball of F* by B*, it is endowed with the weak-* topology w*. The support function Ha of a compact convex A £ coK(F) is defined by (see [9])
Ha(u) = sup{w(x) : x £ A}, u £ B*.
Since A is compact, Ha(u) < m for all u £ B*. The support function Ha is sublinear, i.e., it is subadditive (Ha(u + v) < Ha(u) + Ha(v) for all u,v £ B* with u + v £ B*) and positively homogeneous (Ha(cu) = cHa(u) for all c > 0, u £ B* with cu £ B*). Let C(B*,w*) be the set of continuous functions from B* (endowed with the weak-* topology) to R and consider the uniform norm ||f||tt = sup„eB* \f(u)|, f £ C(B*,w*). The map H : coK(F) ^ C(B*,w*) has the following properties
HA1+A2 = HA1 + HA2 , H\A1 = AHA1, A1,A2 £ co K(F), A > 0,
which make it possible to convert the Minkowski sums and scalar multiplication, respectively, of convex sets into the arithmetic sums and scalar multiplication of the corresponding support functions. Furthermore,
d(A1 ,A2) = IIHA1 — HA2 ||tt.
Hence, the support function provides an isometric embedding of co K(F) into C(B*, w*) with the uniform norm. If G = H(coK(F)), then G is a closed convex cone in C(B*, w*), and H is an isometry between co K(F) and G.
A random compact convex set X is a Borel measurable function from a probability space (Q, F, P) into coK(F), which we endow with the relative topology inherited from K(F). The support function of a random compact convex set is, clearly, a C(B*,w*)-valued random variable taking values in G.
The definition of a regularly varying random compact convex set parallels that of a regularly varying random compact set above, and we are using the same notation: a random compact convex set X is regularly varying if there exist a non-zero measure j £ M^coK0(F)) and a sequence {an}n>1 of positive numbers such that
nP(X £ an) —► j() in M0(coK0(F)). (4)
n——tt
Once again, the tail measure j necessarily scales, leading to the notion of the index of regular variation.
The following lemma is elementary.
Lemma 1. (i) A random compact convex set X is regularly varying in co K(F) if and only if its support function hx is regularly varying in C(B*,w*). Specifically, if (4) holds for some sequence {an}, then for the same sequence we have
nP(hx £ an) —► v() in M0 (C(B*,w*)), (5)
n—-tt
where v = j o hX (the “special element” of C(B*,w*) is, of course, the zero function). Conversely, if (5) holds, then (4) holds as well with j = v o hx. In particular, the exponents of regular variation of X and hx are the same.
(ii) If a random compact set X is regularly varying in K(F) then its convex hull co X is a random compact convex set, that is regularly varying in co K(F). Specifically, if (2) holds, then so does (4), with the tail measure replaced by the image of the tail measure from (2)
under the map A ^ co A from K(F) to co K(F). In particular, X and co X have the same
exponents of regular variation.
Proof. Since isometry implies continuity, the support function is homogeneous of order
1, and assigns to the “special set” {0} the “special element”, the zero function, the statement
of part (i) of the lemma follows from the mapping theorem (Theorem 2.5 in [5]). For part (ii) note that the map A ^ co A from K(F) to co K(F) is a contraction in the Hausdorff distance, hence is continuous. It is also homogeneous of order 1. Since the “special set” {0} is already convex, the statement follows once again from the mapping theorem. □
For compact convex sets in Rd, the intrinsic volumes Vj, j = 0,...,d, play an important role. They can be introduced by means of the Steiner formula, see [9], Appendix F. In particular, Vd is the volume, Vd-1 is half of the surface area, V1 is a multiple of the mean width and V0 = 1 is the Euler-Poincare characteristic.
Corollary 1. Let X be a random compact convex set which is regularly varying in coK(Rd) with index a > 0 and tail measure j. Then for j £ {1, ..., d}, Vj (X) is a regularly varying non-negative random variable with index a/j and tail measure Vj = j o V-1.
Proof. Since Vj is continuous, homogeneous of order j and Vj(A0) = 0, the continuous mapping theorem (Theorem 2.5 in [5]) yields that Vj(X) is regularly varying with tail measure Vj = j o V-1. Moreover, Vj(AU) = j(Vj-1(AU)) = j(A1/jV—1(U)) = A-a/jVj(U) for any measurable subset U of R+.
Organization of the paper. In Section 2 we consider various examples of regularly varying compact random sets. In Section 3 we prove large deviation results for Minkowski sums Sn of iid regularly varying random compact sets. We allow the random sets to be, generally, non-convex. To the best of our knowledge, such results are not available in the literature; they parallel those proved by A. and S. Nagaev [10-12] for sums of iid random variables. The price one has to pay for this generality is that the normalizations An of the sums Sn have to exceed the level n. The situation with milder normalizations is more delicate. It is considered in [8]. Our main result there assumes that the random compact summands Xn are convex, but we also include partial results in the non-convex case.
2. Examples of regularly varying random sets
Example 1 (Convex hull of random points). Let k > 2, and let £1,...,£k be iid regularly varying F-valued random elements with index a > 0 and tail measure v and let X = co{^1,...,£k} be their convex hull. The mapping g : (z1,...,zk) ^ co{z1,...,zk} from Fk to co K(F) is continuous and homogeneous of order 1. Moreover, this mapping sends the zero point in Fk to the “special element” A0 of K(F). Since the random vector £ = (£1, .. . ,£k) is regularly varying in Fk, the continuous mapping theorem (Theorem 2.5 in [5]) yields that X is regularly varying with index a, and tail measure V o g-1 (as long as we are using the same sequence {an} for each element £i). Here
k
V = ¿0 x •• • x J0 x v x S0 x ••• x S0
i=1
(with v appearing at the ith place) is the tail measure of the vector £ = (£1,...,£k). Clearly, the convex hull of k points, one of which is x £ F, and the rest are zero points, is the interval [0,x] = {y £ F : y = cx, 0 < c < 1}. Therefore, the convex hull X has tail measure j = kv o T-1, where T : F ^ coK(F) is defined by the relation T(x) = [0,x].
Example 2 (Random zonotopes). As in the previous example, let £1 ,...,£k be iid regularly varying random elements in F with index a > 0 and tail measure v. Starting with the same
ingredients, we construct a different convex compact subset of F. Consider the Minkowski sum of the random segments, X = ^2k=1[0,£i], a so-called zonotope.
The function g : (z1,..., zk) i=1 [0, zi] from Fk to co K(F) is continuous, homogeneous of order 1, and maps the zero point in Fk to A0. The same argument as in Example 1 shows that the random zonotope X is regularly varying with index a, and, if we use the
same sequence {an} as we used for each element £i, has tail measure j = kv o T-1, where
T : F ^ coK(F) is as above.
Examples 1 and 2 construct different compact sets starting from a finite number of iid regularly varying random points in F, but the tail measures in the two cases turn out to be the same.
Example 3 (Multiple of a deterministic set). Let A C K(F) be a deterministic compact set such that ||A|| > 0 and let R be a regularly varying random variable with index a > 0, satisfying the tail balance condition
P(R > x) P(R < —x)
P(|i?| > x) x^oo^ P(|-R| > x) x^oo^
Then the mapping g : z ^ zA from R to K(F) is continuous and homogeneous of order 1, and it maps the origin in R into A0. Therefore, X = RA is regularly varying with index a. Recall that the tail measure of R has density + ^1{K<0^ |x|-(1+a) with respect to
Lebesgue measure on R. Using the sequence {an} that defines the above tail measure on R, we see that the tail measure j of X can be written as
p OO
j(U) = x-(1+a) (p1{xAeU} + q1{-xAeU}) 1x.
0
Example 4 (Stable random compact convex set). A random compact convex set X has an a-stable distribution, a £ (0, 2), if for any a,b > 0 there are compact convex sets C and D such that
aX1 + bX2 + C = (aa + ba)1/aX + D,
where X1X2 are independent copies of X; see [4] and [9], Section 2.3. By Theorem 2.2.14 in [9], the support function of an a-stable random compact convex set X is itself an astable random vector in C(B*,w*), hence is regularly varying in that space (see e.g. [7]). By Lemma 1, X is a regularly varying random compact convex set.
It follows from [4] that an a-stable random compact convex set for a £ [1, 2) must be of the form X = K + £, where £ £ F is an a-stable random element and K £ co K(F) is deterministic.
3. A large deviation result for general random compact sets
In this section we consider an iid sequence {Xn}n>1 of random compact sets which are not necessarily convex. We introduce the sequence of the corresponding Minkowski partial sums Sn = X1 + ••• + Xn, n > 1. Next we formulate our main result on the large deviations in this situation.
Theorem 1. Let {Xn}n>1 be an iid sequence o^f random compact sets which are regularly varying with index a and tail measure j £ M^K0(F)). Let {a,n}n>1 be the normalizing sequence in (2). Consider a sequence An /* <x such that
(i) Xn/an —► ro if a < 1,
n—— O
(ii) An/n —> oo; An/an —> oo; f-E||Xi||l{||X ||<A } —> 0 if a = 1,
n—O n—O L" J n — O
(iii) Xn/n —> ro if a> 1.
n—O
Then, with Yn = [nP(||X1|| > Xn)] 1,
Yn P(Sn £ Xn) —► j() in M0fK0(F)).
n—O
Proof. First observe that our assumptions and an appeal to [14], Theorem 4.13, yield that
X-1(|X1| + ••• + ||Xn||) 0. (6)
n—O
Let U C K0(F) be a j-continuity set (j(dU) = 0), bounded away from A0. We have to prove that YnP(Sn £ XnU) —> j(U). Following [6], Lemma 2.1, we start with an upper bound.
n—O
For any e > 0,
P(Sn £ XnU) = P (Sn £ XnU, un=1 {Xi £ XnUe}) + P (Sn £ XnU, nn=1 {Xi £ XnUe}) < < nPfX1 £ XnU) + P (nn=1 {d(Sn,Xi) > eXn}) = I1 +12.
Relation (3) implies that
in Mo(/Co(F)).
P(I|x1|| > Xn) n—O
Consequently, Ynl1 —* j(Ue) whenever Ue is a j-continuity set. Since j(dU) = 0, we have
n—O
lime\0 j(Ue) = j(U). Next we show that, for every e > 0, YnI2 —► 0. We consider the
n—O
following disjoint partition of i for S > 0:
B1 = U {XU >SXn, UXj|| >SXn},
1<i<j<n
n
B2 = (J {UXiU >SXn, UXj H<SXn,j = = 1,...,n},
< SXn^j .
By regular variation of X1 and definition of Yn, we have YnP(B1) —► 0. As regards B2,
n—O
from (1) we get d(Sn,Xn) < 11Sn—11 and accordingly,
P(nn=1 {d(Sn,Xi) > eXn} n B2) =
n
^53P(nn=1{d(Sn,Xi) >eXn}n{||Xk|| > SXn, UXj|| <SXn,j = kj = 1,...,n}) <
k=1
n
<53P(d(Sn,Xk) > eXn, ||Xk|| >SXn) <
k=1
< peuSn-lU >eXn)[nP(UXlU > SXn)].
B3 = ^ max UXi
■ =1,...,n
Since X1 is regularly varying and X—1||Sn|| —► 0,
n n—O
YnnP(UX1U > SXn)P(USn-1U > eXn) ^ 0.
n—O
For B3, using again (1), we have
Pfnn=1 {d(Sn,Xi) > eXn}nB3) < PfUSn-1U > eXn, max ||Xi|| < SXn) <
i=1,...,n—1
n—1 n—1
< P( || Xi1{\\Xi\\<S\n} > eXn) < P(^3 llXiUl{|Xi|<5A„} > eXnJ <
i=1 i=1
n—1
< P(53 (UXiU1{\Xi\<5A„} - E|XiU1{ eXn — (n — 1)E|X1U1{
i=1
By the Karamata theorem [1], for a < 1,
/■ $An sx
EHXiHliu^K^j = PdlXiH > x) lx ~ - —P(||Xi|| > SXn).
J0 1 — a
Therefore,
(n — 1)Xn1EllX1U1{||Xi||<A} -------> 0. (7)
L" J n—O
If a > 1, then (7) follows directly from the assumptions on {Xn}. Taking into account (7), it suffices to show that
7np (E (llXillllllx.n^} -Ell^lllin^ii^}) > -.0,
which can be accomplished similarly as in the one-dimensional case by an application of the Fuk-Nagaev inequality ([14], p. 78) and the Karamata theorem. We conclude that
limsup YnP(Sn £ XnU) < j(Ue) —► j(U)
n — O e\0
for any j-continuity set U bounded away from A0.
To prove the corresponding lower bound, we consider a j-continuity set U C K0 (F) bounded away from A0 with non-empty interior intU. Introduce the set U—e = ((Uc)e)c, where Uc denotes the complement of U. It is a non-empty j-continuity set for a sequence of e > 0 converging to zero. Notice that (U—e)e C U. Then
PfSn £ XnU) > PfSn £ XnU, un=1{Xi £ XnU—e}) >
> Pfun=1{d(Sn,Xi) < eXn,Xi £ XnU—e}) >
> nV(Xl G AnW-^PdlSn-ill < eXn) - n(n~ 1} [P(X! G XnU-e)]2. Since X—1U Sn—1U —— 0 and U is a j-continuity set,
n n—O
lminfo„P(S„ € KU) > l,m £ A”“-> " - 1 [P№ 6
P(IIX1|I > Xn) 2nYn [P(|X1U > A„)]'
= j(U—e) —-0 j(U).
e\0
From Theorem 1 we get by the continuous mapping theorem (Theorem 2.5 in [5]) the following corollary concerning large deviations of the intrinsic volumes of random compact convex sets.
Corollary 2. Let {Xn}n>1 be an iid sequence of random compact convex sets which are regularly varying with index a and tail measure j £ M^(co K0(Rd)). Under the same assumptions on the sequence {Xn} as in Theorem 1, we have
P(Vj(Sn)/Xjn £ ) „_1
n P(|X1| > Xv
j o V. (•) in M0(R), j = 1,...,d.
Remark 1. For j = 1 we can also use the relation V1(X1 + • • -+Xn) = V1(X1) + - • •+V1(Xn), Corollary 1 and large deviations results for sums of iid random variables [10, 11] in order to obtain large deviations result for V1(Sn). However, for j > 1 similar results are not straightforward.
The assumptions of Theorem 1 imposed on the normalizing sequence {Xn} ensure that (6) holds, in particular, Xn/n —> ro. If a > 1, this is a rather strong assumption. In [8] we show
n—O
that this condition can be weakened significantly if it is possible to introduce the notion of expectation of Sn. In particular, we will assume that the iid sequence {Xn} consists of iid random convex compact sets which are regularly varying with index a > 1 and E||X1|| < ro.
n
A personal remark of Thomas Mikosch. When I was a student of Valentin V. Petrov in the beginning of the 1980s I got familiar with large deviations by reading his monograph [13]. I remember the excitement when I read his proof of Cramer’s theorem for sequences of independent variables: it is an example of extraordinary mathematical elegance and beauty. It was also then that I started getting interested in heavy-tail phenomena, in particular in distributions with power laws. The combination of regular variation and large deviations has fascinated me since then. I would like to thank Valentin Vladimirovich for opening the door to this exciting world.
Acknowledgment. Thomas Mikosch’s research is partly supported by the Danish Natural Science Research Council (FNU) Grant 09-072331, “Point process modelling and statistical inference”. Zbynek Pawlas is partly supported by the Czech Ministry of Education, research project MSM 0021620839 and by the Grant Agency of the Czech Republic, grant P201/10/0472. Gennady Samorodnitsky’s research is partially supported by a US Army Research Office (ARO) grant W911NF-10-1-0289 and a National Science Foundation (NSF) grant DMS-1005903 at Cornell University.
References
1. Bingham N.H., Goldie C.M., Teugels J. L. Regular Variation // Encyclopedia of Mathematics and its Applications. Vol. 27. Cambridge: Cambridge University Press, 1987.
2. Cerf R. Large deviaition results for sums of iid random compact sets // Proceedings of the American Mathematical Society. 1999. Vol. 127. P. 2431-2436.
3. Dembo A., Zeitouni O. Large Deviations Techniques and Applications. Berlin: Springer, 1998.
4. Gine E., Hahn M. Characterization and domain of attraction of p-stable random compact sets // The Annals of Probability. 1985. Vol. 13. P. 447-468.
5. Hult H., Lindskog F. Regular variation for measures on metric spaces // Publications de l’Institut Mathematique, Nouvelle Serie. 2006. Vol. 80. P. 121-140.
6. Hult H., Lindskog F., Mikosch T., Samorodnitsky G. Functional large deviations for multivariate regularly varying random walks // The Annals of Applied Probability. 2005. Vol. 15. P. 26512680.
7. Ledoux M., Talagrand M. Probability in Banach Spaces. Berlin: Springer-Verlag, 1991.
8. Mikosch T., Pawlas Z., Samorodnitsky G. A large deviation principle for Minkowski sums of heavy-tailed random compact convex sets with finite expectation // Glynn P., Mikosch T., Rolski T., Rubinstein R. (Eds.) New Frontiers in Applied Probability. A Volume in Honour of S0ren Asmussen. A special issue of Journal of Applied Probability, to appear. 2011.
9. Molchanov I. Theory of Random Sets. London: Springer, 2005.
10. Nagaev A. V. Limit theorems for large deviations where Cramer’s conditions are violated (in Russian) // Izvestiya Akademii Nauk UzSSR, Ser. Fiz.-Mat. Nauk. 1969. Vol. 6. P. 17-22.
11. Nagaev A. V. Integral limit theorems taking large deviations into account when Cramer’s condition does not hold I, II // Theory of Probability and its Applications. 1969. Vol. 14. P. 51-64, 193-208.
12. Nagaev S. V. Large deviations of sums of independent random variables // The Annals of Probability. 1979. Vol. 7. P. 745-789.
13. Petrov V. V. Sums of Independent Random Variables. Berlin: Springer, 1975.
14. Petrov V. V. Limit Theorems of Probability Theory. Oxford: Oxford University Press, 1995.
Статья поступила в редакцию 21 декабря 2010 г.