REMARKS ON CONVERGENCE OF INDUCTIVE MEANS

PARK, JISU;KIM, SEJONG;

doi:10.14317/jami.2016.28

Journal of applied mathematics & informatics

Volume 34 Issue 3_4 / Pages 285-294 / 2016 / 2734-1194 (pISSN) / 2234-8417 (eISSN)

The Korean Society for Computational and Applied Mathematics (한국전산응용수학회)

PARK, JISU (Department of Mathematics, College of Natural Sciences, Chungbuk National University) ;
KIM, SEJONG (Department of Mathematics, College of Natural Sciences, Chungbuk National University)

Received : 2016.01.12
Accepted : 2016.03.30
Published : 2016.05.30

https://doi.org/10.14317/jami.2016.28

Abstract

We define a new inductive mean constructed from a mean on a complete metric space, and study its convergence when the given mean is intrinsic. We also give several examples of inductive matrix means and claim that the limit of the inductive mean constructed from an intrinsic mean is not the Karcher mean in general.

1. Introduction

Among the many applications of positive definite matrices, the process of averaging has become attractive and widely studied. Since the two-variable geometric mean of positive definite matrices $A$ and $B$,

$$A \# B = A^{1/2}\left(A^{-1/2} B A^{-1/2}\right)^{1/2} A^{1/2},$$

was introduced by Kubo and Ando [9], its algebraic and geometric properties have been studied extensively. On the open convex cone of positive definite matrices equipped with the Riemannian trace metric $d(X, Y) = \|\log(X^{-1/2} Y X^{-1/2})\|_2$, the unique geodesic from $A$ to $B$ is given by

$$t \in [0, 1] \mapsto A \#_t B = A^{1/2}\left(A^{-1/2} B A^{-1/2}\right)^{t} A^{1/2}.$$

See [3] for more details. Extending the theory of the two-variable geometric mean to a multivariable geometric mean has attracted much attention and long remained problematic. Ando, Li, and Mathias [1] proposed a symmetrization procedure for the multivariable geometric mean of positive definite matrices, together with ten desirable properties for extended geometric means. Moreover, the convergence of the symmetrization procedure has recently been proved by Kim and Petz [8].
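As a numerical illustration, the following minimal sketch evaluates the geodesic $A \#_t B = A^{1/2}(A^{-1/2} B A^{-1/2})^{t} A^{1/2}$ with NumPy, computing matrix functions through an eigendecomposition. The helper names `spd_apply` and `geodesic` and the test matrices are assumptions made for this example and do not come from the paper.

```python
import numpy as np

def spd_apply(A, f):
    """Apply a scalar function f to a symmetric positive definite matrix
    through its eigendecomposition A = V diag(w) V^T."""
    w, V = np.linalg.eigh(A)
    return (V * f(w)) @ V.T

def geodesic(A, B, t):
    """Point A #_t B on the geodesic from A (t = 0) to B (t = 1)."""
    A_half = spd_apply(A, np.sqrt)
    A_inv_half = spd_apply(A, lambda w: 1.0 / np.sqrt(w))
    M = A_inv_half @ B @ A_inv_half
    M = (M + M.T) / 2  # symmetrize against round-off
    return A_half @ spd_apply(M, lambda w: w ** t) @ A_half

# Hypothetical positive definite test matrices (not from the paper).
A = np.array([[2.0, 1.0], [1.0, 3.0]])
B = np.array([[4.0, -1.0], [-1.0, 1.0]])
G = geodesic(A, B, 0.5)  # the two-variable geometric mean A # B
```

The midpoint `G` can be checked against the Riccati equation $X A^{-1} X = B$, whose unique positive definite solution is $A \# B$.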

A natural and attractive average of positive definite matrices is the least squares mean, also called the Cartan mean, Riemannian barycenter, or Karcher mean. The Karcher mean $\Lambda(\omega; A_1, \dots, A_n)$ of positive definite matrices $A_1, \dots, A_n$ and a positive probability vector $\omega = (w_1, \dots, w_n)$ is defined as the unique minimizer (provided it exists) of the weighted sum of squares of the Riemannian trace distances to each of the points. That is,

$$\Lambda(\omega; A_1, \dots, A_n) = \underset{X > 0}{\arg\min} \sum_{i=1}^{n} w_i\, d^2(X, A_i).$$

In general, such a minimizer exists uniquely on a Hadamard space, which is a complete metric space satisfying the semiparallelogram law. While many interesting properties of the Karcher mean, including the Ando-Li-Mathias properties, have been developed, a remarkable one is that the limit of the inductive mean given by

$$S_1 = A_1, \qquad S_k = S_{k-1} \#_{t_k} A_{\bar{k}} \quad (k \geq 2),$$

where $t_k = w_{\bar{k}} / \sum_{i=1}^{k} w_{\bar{i}}$ and $\bar{k}$ denotes the residual of $k$ mod $n$, coincides with the Karcher mean [14].
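The inductive mean can be sketched numerically as follows. The sketch assumes the recursion $S_1 = A_1$, $S_k = S_{k-1} \#_{t_k} A_{\bar{k}}$ with $t_k = w_{\bar{k}} / \sum_{i \le k} w_{\bar{i}}$, and it uses commuting (diagonal) matrices, for which the Karcher mean is known to equal $\exp(\sum_i w_i \log A_i)$. All function names and data are hypothetical choices for illustration.

```python
import numpy as np

def spd_apply(A, f):
    """Apply a scalar function to a symmetric positive definite matrix."""
    w, V = np.linalg.eigh(A)
    return (V * f(w)) @ V.T

def geodesic(A, B, t):
    """Weighted geometric mean A #_t B."""
    A_half = spd_apply(A, np.sqrt)
    A_inv_half = spd_apply(A, lambda w: 1.0 / np.sqrt(w))
    M = A_inv_half @ B @ A_inv_half
    return A_half @ spd_apply((M + M.T) / 2, lambda w: w ** t) @ A_half

def inductive_mean(weights, mats, steps):
    """Walk cyclically through mats; `steps` is the number of terms used.
    Assumed recursion: S_k = S_{k-1} #_{t_k} A_k with t_k = w_k / (w_1 + ... + w_k)."""
    n = len(mats)
    S, total = mats[0], weights[0]
    for k in range(1, steps):
        i = k % n  # 0-based index of the next cyclic term
        total += weights[i]
        S = geodesic(S, mats[i], weights[i] / total)
    return S

# Hypothetical commuting example (diagonal matrices).
weights = [0.5, 0.3, 0.2]
mats = [np.diag([1.0, 4.0]), np.diag([2.0, 1.0]), np.diag([8.0, 2.0])]
S = inductive_mean(weights, mats, steps=3 * 50)
karcher_commuting = np.diag(
    np.exp(sum(w * np.log(np.diag(A)) for w, A in zip(weights, mats)))
)
```

In the commuting case the walk reproduces the Karcher mean exactly after every full cycle. For non-commuting matrices the convergence is only asymptotic, so a comparison there would need many more steps and a looser tolerance.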

One can naturally ask what happens if the original inductive mean is replaced by one built from other means. In this article we consider a new inductive mean constructed from a given mean on a general complete metric space. We mainly show that the limit of the inductive mean constructed from a symmetric and multiplicative mean is the given mean, and we present several examples for positive definite matrices.

2. Convergence of inductive means

Let $(X, d)$ be a complete metric space, and let $\Delta_n$ be the simplex of positive probability vectors in $\mathbb{R}^n$ convexly spanned by the unit coordinate vectors. A weighted $n$-mean $G_n$ on $X$ for $n \geq 2$ is a continuous map $G_n : \Delta_n \times X^n \to X$ that is idempotent in the sense that $G_n(\omega; x, \dots, x) = x$ for all $\omega \in \Delta_n$ and $x \in X$. A weighted $n$-mean $G_n$ is symmetric, or permutation invariant, if $G_n(\omega_\sigma; x_\sigma) = G_n(\omega; x)$ for each permutation $\sigma$ on $\{1, \dots, n\}$, where $\omega_\sigma = (w_{\sigma(1)}, \dots, w_{\sigma(n)})$ and $x_\sigma = (x_{\sigma(1)}, \dots, x_{\sigma(n)})$. A mean $G$ on $X$ is a sequence of weighted means $\{G_n\}_{n \geq 2}$.

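For $\omega = (w_1, \dots, w_n) \in \Delta_n$ and $x = (x_1, \dots, x_n) \in X^n$, we denote

$$\omega^k = \frac{1}{k}(w_1, \dots, w_n, \dots, w_1, \dots, w_n) \in \Delta_{nk}, \qquad x^k = (x_1, \dots, x_n, \dots, x_1, \dots, x_n) \in X^{nk},$$

where the number of blocks is $k$. Also,

$$\omega^\infty = (w_1, \dots, w_n, w_1, \dots, w_n, \dots).$$

Note that $\omega^\infty$ is an infinite-dimensional vector, not a probability vector.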

Definition 2.1. Let $\omega \in \Delta_n$. A mean $G = \{G_n\}$ on $X$ is said to be multiplicative if for all $n$, $k \in \mathbb{N}$ and $x \in X^n$,

$$G_{nk}(\omega^k; x^k) = G_n(\omega; x).$$

If $G$ is symmetric and multiplicative, then $G$ is called intrinsic.

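Let $a = (a_1, \dots, a_m) \in X^m$ and let $\omega = (w_1, \dots, w_m) \in \Delta_m$. For a mean $G = \{G_n\}_{n \geq 2}$, we consider the inductive mean defined as

$$S_k^G(\omega; a) := G_k\!\left(\frac{1}{s_k}\left(w_{\bar{1}}, \dots, w_{\bar{k}}\right);\ a_{\bar{1}}, \dots, a_{\bar{k}}\right), \qquad k \geq 2, \tag{2}$$

where $\bar{k}$ denotes the residual of $k$ mod $m$, and

$$s_k = \sum_{i=1}^{k} w_{\bar{i}}.$$

We also take the residual of a multiple of $m$ to be $m$, that is, $\overline{pm} = m$. Note that

$$s_{pm} = p \quad \text{and} \quad S_{pm}^G(\omega; a) = G_{pm}(\omega^p; a^p) \qquad (p \in \mathbb{N}).$$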

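Proposition 2.2. Let $G = \{G_n\}_{n \geq 2}$ be a symmetric mean such that for all $n$, $\omega = (w_1, \dots, w_n) \in \Delta_n$ and $x = (x_1, \dots, x_n) \in X^n$,

$$G_n(\omega; x_1, \dots, x_n) = G_{n-p+1}(w_1 + \cdots + w_p, w_{p+1}, \dots, w_n;\ x_p, x_{p+1}, \dots, x_n) \tag{3}$$

if $x_1 = \cdots = x_p$ for $1 \leq p < n$. Then $G$ is an intrinsic mean.

Proof. Let $\omega = (w_1, \dots, w_m) \in \Delta_m$ and $x = (x_1, \dots, x_m) \in X^m$. It is enough to show that $G$ is multiplicative. Using permutation invariance to group equal entries, and then applying the condition (3) to each group in turn, yields that for $k \geq 2$,

$$G_{mk}(\omega^k; x^k) = G_{mk}\!\left(\tfrac{1}{k}(\underbrace{w_1, \dots, w_1}_{k}, \dots, \underbrace{w_m, \dots, w_m}_{k});\ \underbrace{x_1, \dots, x_1}_{k}, \dots, \underbrace{x_m, \dots, x_m}_{k}\right) = G_m(\omega; x). \qquad \square$$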

We now see our main result about the convergence of inductive means.

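Theorem 2.3. Let $G = \{G_n\}_{n \geq 2}$ be a symmetric mean satisfying the property (3). Then for any $a = (a_1, \dots, a_m) \in X^m$ and $\omega = (w_1, \dots, w_m) \in \Delta_m$, the inductive mean $S_k^G(\omega; a)$ defined in (2) satisfies

$$\lim_{k \to \infty} S_k^G(\omega; a) = G_m(\omega; a).$$

Proof. Let $k \in \mathbb{N}$. Then there are $p, r \in \mathbb{N} \cup \{0\}$ such that $k = pm + r$ and $0 \leq r < m$ by the division algorithm. By permutation invariance and the condition (3),

$$S_k^G(\omega; a) = G_m\!\left(\frac{1}{s_k}\big((p+1)w_1, \dots, (p+1)w_r, p\,w_{r+1}, \dots, p\,w_m\big);\ a_1, \dots, a_m\right),$$

where $s_k = p + \sum_{i=1}^{r} w_i$. Note that $k \to \infty$ is equivalent to $p \to \infty$. Then $(p+1)w_s / s_k \to w_s$ for $s = 1, \dots, r$, and $p\,w_t / s_k \to w_t$ for $t = r + 1, \dots, m$.

Since the mean $G_m$ is continuous, we conclude

$$\lim_{k \to \infty} S_k^G(\omega; a) = G_m(\omega; a). \qquad \square$$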
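The convergence can be observed numerically in the simplest setting of positive real numbers. The sketch below assumes that the inductive mean $S_k^G$ applies $G_k$ to the first $k$ terms of the cyclic sequence $a_{\bar{1}}, a_{\bar{2}}, \dots$ with the corresponding weights renormalized to sum to one. The functions `arithmetic`, `harmonic` and `inductive` and the data are hypothetical names and values chosen for this example.

```python
import numpy as np

def arithmetic(weights, xs):
    """Weighted arithmetic mean of real numbers."""
    return float(np.dot(weights, xs))

def harmonic(weights, xs):
    """Weighted harmonic mean of positive real numbers."""
    return 1.0 / float(np.dot(weights, 1.0 / np.asarray(xs, dtype=float)))

def inductive(G, weights, xs, k):
    """S_k^G: apply G to the first k terms of the cyclic sequence,
    with the cyclic weights renormalized to a probability vector."""
    m = len(xs)
    idx = [i % m for i in range(k)]
    w = np.array([weights[i] for i in idx], dtype=float)
    a = np.array([xs[i] for i in idx], dtype=float)
    return G(w / w.sum(), a)

# Hypothetical data: m = 3 points with a positive probability vector.
omega = [0.5, 0.3, 0.2]
a = [1.0, 4.0, 10.0]

for k in (4, 40, 400, 4000):
    print(k, inductive(arithmetic, omega, a, k), inductive(harmonic, omega, a, k))
```

Both means merge equal arguments by adding their weights, which is the condition (3), so the printed values approach `arithmetic(omega, a)` and `harmonic(omega, a)`. At multiples of $m$ the agreement is exact up to round-off.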

3. Multivariable means of positive definite matrices

In this section we consider multivariable matrix means on the open convex cone $\mathbb{P}$ of positive definite matrices. Let $\mathbb{A} = (A_1, \dots, A_n) \in \mathbb{P}^n$ and $\omega = (w_1, \dots, w_n) \in \Delta_n$.

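Remark 3.1. For $\mathbb{A} = (A_1, \dots, A_n)$, the weighted arithmetic mean and weighted harmonic mean

$$\mathcal{A}(\omega; \mathbb{A}) = \sum_{i=1}^{n} w_i A_i, \qquad \mathcal{H}(\omega; \mathbb{A}) = \left(\sum_{i=1}^{n} w_i A_i^{-1}\right)^{-1}$$

are intrinsic means. Moreover, the identity (3) is satisfied for both $\mathcal{A}$ and $\mathcal{H}$, and hence, by Theorem 2.3,

$$\lim_{k \to \infty} S_k^{\mathcal{A}}(\omega; \mathbb{A}) = \mathcal{A}(\omega; \mathbb{A}), \qquad \lim_{k \to \infty} S_k^{\mathcal{H}}(\omega; \mathbb{A}) = \mathcal{H}(\omega; \mathbb{A}).$$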

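Remark 3.2. The resolvent mean for parameter $\mu \geq 0$ is defined by

$$\mathcal{R}_\mu(\omega; \mathbb{A}) = \left[\sum_{i=1}^{n} w_i (A_i + \mu I)^{-1}\right]^{-1} - \mu I.$$

Bauschke, Moffat, and Wang [2] introduced the resolvent mean, which originates from the proximal average in convex analysis and optimization. Since then, many interesting properties have been found, such as monotonicity in the parameter and nonexpansiveness [7, 10].

One can alternatively write the resolvent mean as

$$\mathcal{R}_\mu(\omega; \mathbb{A}) = \mathcal{H}(\omega; \mathbb{A} + \mu I) - \mu I,$$

where $\mathbb{A} + \mu I = (A_1 + \mu I, \dots, A_n + \mu I)$, $\mathcal{H}$ is the weighted harmonic mean, and $I$ is the identity matrix. So it is an intrinsic mean, and satisfies the equality (3). Therefore,

$$\lim_{k \to \infty} S_k^{\mathcal{R}_\mu}(\omega; \mathbb{A}) = \mathcal{R}_\mu(\omega; \mathbb{A}).$$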
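The relation between the resolvent mean and the harmonic mean is easy to check numerically. The sketch below assumes the parameterization $\mathcal{R}_\mu(\omega; \mathbb{A}) = [\sum_i w_i (A_i + \mu I)^{-1}]^{-1} - \mu I$ with $\mu \geq 0$. The function names and matrices are hypothetical and serve only as an illustration.

```python
import numpy as np

def arithmetic_mean(weights, mats):
    """Weighted arithmetic mean of matrices."""
    return sum(w * A for w, A in zip(weights, mats))

def harmonic_mean(weights, mats):
    """Weighted harmonic mean of positive definite matrices."""
    return np.linalg.inv(sum(w * np.linalg.inv(A) for w, A in zip(weights, mats)))

def resolvent_mean(mu, weights, mats):
    """Assumed form: harmonic mean of the shifted matrices A_i + mu*I, shifted back."""
    I = np.eye(mats[0].shape[0])
    return harmonic_mean(weights, [A + mu * I for A in mats]) - mu * I

# Hypothetical positive definite test matrices.
A1 = np.array([[2.0, 1.0], [1.0, 3.0]])
A2 = np.array([[4.0, -1.0], [-1.0, 1.0]])
w = [0.4, 0.6]

R0 = resolvent_mean(0.0, w, [A1, A2])    # equals the harmonic mean
Rbig = resolvent_mean(1e6, w, [A1, A2])  # close to the arithmetic mean
# Condition (3): equal arguments can be merged by adding their weights.
merged = resolvent_mean(1.0, [0.4, 0.6], [A1, A2])
split = resolvent_mean(1.0, [0.1, 0.3, 0.6], [A1, A1, A2])
```

With this parameterization the resolvent mean interpolates between the harmonic mean at $\mu = 0$ and the arithmetic mean as $\mu \to \infty$.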

We now review a natural and attractive average, the Karcher mean, among many geometric means. Also we see some recent results and the connection with the Log-Euclidean mean.

Remark 3.3. The Karcher mean (or the least squares mean, Riemannian centroid, Cartan barycenter) is defined as the unique minimizer of the weighted sum of squares of the Riemannian trace distances $\delta$:

$$\Lambda(\omega; \mathbb{A}) = \underset{X \in \mathbb{P}}{\arg\min} \sum_{i=1}^{n} w_i\, \delta^2(X, A_i),$$

where $\delta(A, B) = \|\log(A^{-1/2} B A^{-1/2})\|_F$ and $\|\cdot\|_F$ denotes the Frobenius norm. Using Karcher's formula in [5] for the gradient of the objective function, one sees that the Karcher mean is the unique positive definite solution of the Karcher equation

$$\sum_{i=1}^{n} w_i \log\left(X^{-1/2} A_i X^{-1/2}\right) = 0. \tag{7}$$
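For two matrices the Karcher mean with weights $(1-t, t)$ is the geodesic point $A \#_t B$, so the Karcher equation can be checked directly there. The sketch below assumes the form $\sum_i w_i \log(X^{-1/2} A_i X^{-1/2}) = 0$ of the equation. The helper names and matrices are hypothetical.

```python
import numpy as np

def spd_apply(A, f):
    """Apply a scalar function to a symmetric positive definite matrix."""
    w, V = np.linalg.eigh(A)
    return (V * f(w)) @ V.T

def geodesic(A, B, t):
    """Weighted geometric mean A #_t B."""
    A_half = spd_apply(A, np.sqrt)
    A_inv_half = spd_apply(A, lambda w: 1.0 / np.sqrt(w))
    M = A_inv_half @ B @ A_inv_half
    return A_half @ spd_apply((M + M.T) / 2, lambda w: w ** t) @ A_half

def karcher_residual(X, weights, mats):
    """Left-hand side of the Karcher equation at X:
    sum_i w_i log(X^{-1/2} A_i X^{-1/2})."""
    X_inv_half = spd_apply(X, lambda w: 1.0 / np.sqrt(w))
    total = np.zeros_like(X)
    for w, A in zip(weights, mats):
        M = X_inv_half @ A @ X_inv_half
        total += w * spd_apply((M + M.T) / 2, np.log)
    return total

# Hypothetical positive definite test matrices.
A = np.array([[2.0, 1.0], [1.0, 3.0]])
B = np.array([[4.0, -1.0], [-1.0, 1.0]])
t = 0.3
X = geodesic(A, B, t)
res_at_mean = np.linalg.norm(karcher_residual(X, [1 - t, t], [A, B]))
res_at_A = np.linalg.norm(karcher_residual(A, [1 - t, t], [A, B]))
```

The residual vanishes (up to round-off) at $A \#_t B$ but not at $A$ itself, which is consistent with uniqueness of the solution.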

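Recently, many interesting properties of the Karcher mean have been studied. It has been shown in [14] that the power means $P_t(\omega; \mathbb{A})$, defined for $t \in (0, 1]$ as the unique positive definite solution of the equation

$$X = \sum_{i=1}^{n} w_i \left(X \#_t A_i\right),$$

converge to the Karcher mean as $t \to 0^+$, where $A \#_t B = A^{1/2}(A^{-1/2} B A^{-1/2})^{t} A^{1/2}$ is the weighted geometric mean of $A$ and $B$ in $\mathbb{P}$. Also, it has been proved in [6] that a certain sequence of Karcher means converges to the Log-Euclidean mean:

$$\lim_{m \to \infty} \Lambda\left(\omega; A_1^{1/m}, \dots, A_n^{1/m}\right)^{m} = \exp\left(\sum_{i=1}^{n} w_i \log A_i\right). \tag{8}$$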

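The following shows the continuity of the power means $P : (0, 1] \times \Delta_n \times \mathbb{P}^n \to \mathbb{P}$ with respect to the Thompson metric $d$ on $\mathbb{P}$ given by

$$d(A, B) = \left\|\log\left(A^{-1/2} B A^{-1/2}\right)\right\|,$$

where $\|X\|$ denotes the operator norm of $X$.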
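A minimal sketch of the Thompson metric, assuming the formula $d(A, B) = \|\log(A^{-1/2} B A^{-1/2})\|$ with the operator norm. For a symmetric matrix the operator norm of its logarithm is the largest absolute value of the logarithms of its eigenvalues. The function name and test matrices are hypothetical.

```python
import numpy as np

def thompson(A, B):
    """Thompson metric: operator norm of log(A^{-1/2} B A^{-1/2})."""
    w, V = np.linalg.eigh(A)
    A_inv_half = (V / np.sqrt(w)) @ V.T
    M = A_inv_half @ B @ A_inv_half
    eig = np.linalg.eigvalsh((M + M.T) / 2)  # symmetrize against round-off
    return float(np.max(np.abs(np.log(eig))))

# Hypothetical positive definite test matrices.
A = np.array([[2.0, 1.0], [1.0, 3.0]])
B = np.array([[4.0, -1.0], [-1.0, 1.0]])
d_AB = thompson(A, B)
d_scaled = thompson(A, 3.0 * A)  # scaling by c > 0 gives distance |log c|
```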

Lemma 3.1 ([12, Proposition 3.5]). Let ω, μ ∈ Δn and Then for s, t ∈ (0, 1]

where denotes the diameter of and

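Theorem 3.2. Let $\mathbb{A} = (A_1, \dots, A_n) \in \mathbb{P}^n$ and $\omega = (w_1, \dots, w_n) \in \Delta_n$. Then the inductive mean (2) constructed from the Karcher mean $\Lambda$ satisfies

$$\lim_{k \to \infty} S_k^{\Lambda}(\omega; \mathbb{A}) = \Lambda(\omega; \mathbb{A}).$$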

Proof. Let ϵ > 0 be given and let ω, μ ∈ Δn. By Lemma 3.1, there is δ > 0 such that d(ω, μ) < δ implies

For such ω, μ ∈ Δn

for t > 0 small enough, since the power means Pt converges to the Karcher mean as t → 0 in [12, 14]. By the triangle inequality, we have

Moreover, by Theorem 4.4, (P5) in [12]

So the Karcher mean $\Lambda : \Delta_n \times \mathbb{P}^n \to \mathbb{P}$ is continuous, and is a symmetric mean. Indeed, the Karcher mean satisfies all of the Ando-Li-Mathias properties [11].

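Let $\mathbb{A} = (A_1, \dots, A_n) \in \mathbb{P}^n$. If $A_1 = \cdots = A_k$ for $1 \leq k < n$, then the Karcher equation (7) reduces to

$$(w_1 + \cdots + w_k) \log\left(X^{-1/2} A_k X^{-1/2}\right) + \sum_{i=k+1}^{n} w_i \log\left(X^{-1/2} A_i X^{-1/2}\right) = 0.$$

It means that $\Lambda(\omega; \mathbb{A}) = \Lambda(w_1 + \cdots + w_k, w_{k+1}, \dots, w_n;\ A_k, A_{k+1}, \dots, A_n)$, that is, the Karcher mean satisfies the property (3). So by Theorem 2.3 it is proved. □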

Proposition 3.3. Let and ω = (w1, . . . ,wn) ∈ Δn. Then the double limit converges, and

Proof. By Theorem 3.2 and the fact that the map A ∈ ℙ ↦ Am is continuous, we have

By the property (8) we get

By the definition of inductive means and the property (8), on the other hand, we obtain

where and k = pn + r for some p ∈ ℕ and 0 ≤ r < n. Note that k → ∞ is equivalent to p → ∞, and hence,

4. Final Remarks

Hadamard spaces are important examples of complete convex metric spaces [13, 17]. Here, a complete metric space $(M, d)$ is called a Hadamard space if it satisfies the semiparallelogram law: for each $x, y \in M$, there exists an $m \in M$ satisfying

$$d^2(m, z) \leq \frac{1}{2} d^2(x, z) + \frac{1}{2} d^2(y, z) - \frac{1}{4} d^2(x, y) \tag{9}$$

for all $z \in M$. Such spaces are also called (global) CAT(0)-spaces or NPC (non-positively curved) spaces. The point $m$ appearing in (9) is the unique metric midpoint between $x$ and $y$. The midpoint operation gives rise to a unique minimal geodesic $\gamma_{x,y} : [0, 1] \to M$ for any two given points $x$ and $y$. We write $x \#_t y := \gamma_{x,y}(t)$ and call it the weighted geometric mean of $x$ and $y$. A typical example of a Hadamard space is the open convex cone $\mathbb{P}$ of positive definite matrices with the Riemannian trace metric.
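The semiparallelogram law can be tested numerically on $\mathbb{P}$ with the Riemannian trace metric, taking $m = x \#_{1/2} y$. The sketch below assumes the form $d^2(m, z) \leq \frac{1}{2} d^2(x, z) + \frac{1}{2} d^2(y, z) - \frac{1}{4} d^2(x, y)$ of the law and samples random positive definite matrices. All names and the sampling scheme are hypothetical.

```python
import numpy as np

def spd_apply(A, f):
    """Apply a scalar function to a symmetric positive definite matrix."""
    w, V = np.linalg.eigh(A)
    return (V * f(w)) @ V.T

def geodesic(A, B, t):
    """Weighted geometric mean A #_t B."""
    A_half = spd_apply(A, np.sqrt)
    A_inv_half = spd_apply(A, lambda w: 1.0 / np.sqrt(w))
    M = A_inv_half @ B @ A_inv_half
    return A_half @ spd_apply((M + M.T) / 2, lambda w: w ** t) @ A_half

def trace_dist(A, B):
    """Riemannian trace metric: Frobenius norm of log(A^{-1/2} B A^{-1/2})."""
    A_inv_half = spd_apply(A, lambda w: 1.0 / np.sqrt(w))
    M = A_inv_half @ B @ A_inv_half
    eig = np.linalg.eigvalsh((M + M.T) / 2)
    return float(np.sqrt(np.sum(np.log(eig) ** 2)))

def random_spd(rng, n=3):
    """Random positive definite matrix (hypothetical sampling scheme)."""
    R = rng.standard_normal((n, n))
    return R @ R.T + np.eye(n)

rng = np.random.default_rng(0)
gaps = []
for _ in range(20):
    x, y, z = (random_spd(rng) for _ in range(3))
    m = geodesic(x, y, 0.5)  # metric midpoint of x and y
    rhs = 0.5 * trace_dist(x, z) ** 2 + 0.5 * trace_dist(y, z) ** 2 \
        - 0.25 * trace_dist(x, y) ** 2
    gaps.append(rhs - trace_dist(m, z) ** 2)
```

Every entry of `gaps` should be nonnegative up to round-off, reflecting the nonpositive curvature of $\mathbb{P}$.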

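On a Hadamard space $(M, d)$, the least squares mean

$$\Lambda(\omega; a_1, \dots, a_n) = \underset{x \in M}{\arg\min} \sum_{i=1}^{n} w_i\, d^2(x, a_i) \tag{10}$$

exists uniquely. Motivated by the Strong Law of Large Numbers on Hadamard spaces established by K.-T. Sturm [17], J. Holbrook [4] and Y. Lim and M. Pálfia [15] found a deterministic approximation to the least squares mean: for $a = (a_1, \dots, a_n) \in M^n$ and $\omega = (w_1, \dots, w_n) \in \Delta_n$,

$$\lim_{k \to \infty} S_k = \Lambda(\omega; a_1, \dots, a_n), \tag{11}$$

where

$$S_1 = a_1, \qquad S_k = S_{k-1} \#_{t_k} a_{\bar{k}} \quad (k \geq 2),$$

and $t_k = w_{\bar{k}} / \sum_{i=1}^{k} w_{\bar{i}}$, and $\bar{k}$ denotes the residual of $k$ mod $n$.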

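We naturally ask whether or not the inductive mean defined in (2) for a given mean $G = \{G_n\}_{n \geq 2}$ satisfies the property (11). One can see that if the mean $G$ is symmetric and satisfies the property (3), then the property (11) does not hold in general. Indeed, suppose that the inductive mean $S_k^G(\omega; a)$ of $a = (a_1, \dots, a_m)$ converges to the Karcher mean as $k \to \infty$. Then every subsequence of $\{S_k^G(\omega; a)\}$ should converge to the Karcher mean. However, by Proposition 2.2 we have

$$S_{pm}^G(\omega; a) = G_{pm}(\omega^p; a^p) = G_m(\omega; a)$$

for any $p \in \mathbb{N}$. This means that $\lim_{p \to \infty} S_{pm}^G(\omega; a) = G_m(\omega; a)$, which contradicts the convergence to the Karcher mean whenever $G_m(\omega; a)$ differs from it.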

References

[1] T. Ando, C.-K. Li, and R. Mathias, Geometric means, Linear Algebra Appl. 385 (2004), 305-334. https://doi.org/10.1016/j.laa.2003.11.019
[2] H.H. Bauschke, S.M. Moffat, and X. Wang, The resolvent average for positive semidefinite matrices, Linear Algebra Appl. 432 (2010), 1757-1771. https://doi.org/10.1016/j.laa.2009.11.028
[3] R. Bhatia, Positive Definite Matrices, Princeton Series in Applied Mathematics, 2007.
[4] J. Holbrook, No dice: a deterministic approach to the Cartan centroid, J. Ramanujan Math. Soc. 27 (2012), 509-521.
[5] H. Karcher, Riemannian center of mass and mollifier smoothing, Comm. Pure Appl. Math. 30 (1977), 509-541. https://doi.org/10.1002/cpa.3160300502
[6] S. Kim, U. Ji, and S. Kum, An approach to the Log-Euclidean mean via the Karcher mean on symmetric cones, Taiwanese J. Math., to be published.
[7] S. Kim, J. Lawson, and Y. Lim, The matrix geometric mean of parameterized, weighted arithmetic and harmonic means, Linear Algebra Appl. 435 (2011), 2114-2131. https://doi.org/10.1016/j.laa.2011.04.010
[8] S. Kim and D. Petz, A new proof to construct multivariable geometric means by symmetrization, J. Appl. Math. & Informatics 33 (2015), 379-386. https://doi.org/10.14317/jami.2015.379
[9] F. Kubo and T. Ando, Means of positive linear operators, Math. Ann. 246 (1979/80), 205-224. https://doi.org/10.1007/BF01371042
[10] S. Kum and Y. Lim, Nonexpansiveness of the resolvent average, J. Math. Anal. Appl. 432 (2015), 918-927. https://doi.org/10.1016/j.jmaa.2015.07.005
[11] J. Lawson and Y. Lim, Monotonic properties of the least squares mean, Math. Ann. 351 (2011), 267-279. https://doi.org/10.1007/s00208-010-0603-6
[12] J. Lawson and Y. Lim, Karcher means and Karcher equations of positive definite operators, Trans. Amer. Math. Soc. Series B 1 (2014), 1-22. https://doi.org/10.1090/S2330-0000-2014-00003-4
[13] J. Lawson, H. Lee, and Y. Lim, Weighted geometric means, Forum Math. 24 (2012), 1067-1090. https://doi.org/10.1515/form.2011.096
[14] Y. Lim and M. Pálfia, Matrix power mean and the Karcher mean, J. Functional Analysis 262 (2012), 1498-1514. https://doi.org/10.1016/j.jfa.2011.11.012
[15] Y. Lim and M. Pálfia, Weighted deterministic walks and no dice approach for the least squares mean on Hadamard spaces, Bull. London Math. Soc. 46 (2014), 561-570. https://doi.org/10.1112/blms/bdu008
[16] M. Sagae and K. Tanabe, Upper and lower bounds for the arithmetic-geometric-harmonic means of positive definite matrices, Linear and Multilinear Algebra 37 (1994), 279-282. https://doi.org/10.1080/03081089408818331
[17] K.-T. Sturm, Probability measures on metric spaces of nonpositive curvature, in: Heat Kernels and Analysis on Manifolds, Graphs, and Metric Spaces, Eds. P. Auscher et al., Contemp. Math. 338, Amer. Math. Soc. (AMS), Providence, 2003.

