1. Trang chủ
  2. » Khoa Học Tự Nhiên

Báo cáo hóa học: " Research Article Trace-Inequalities and Matrix-Convex Functions" ppt

12 279 0

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 12
Dung lượng 494,88 KB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

Since the subpace of Hermitian matrices is provided with the order structure induced by the cone of positive semidefinite matrices, one can consider convexity of this map.. Introduction

Trang 1

Volume 2010, Article ID 241908, 12 pages

doi:10.1155/2010/241908

Research Article

Trace-Inequalities and Matrix-Convex Functions

Tsuyoshi Ando

Hokkaido University (Emeritus), Shiroishi-ku, Hongo-dori 9, Minami 4-10-805, Sapporo 003-0024, Japan

Correspondence should be addressed to Tsuyoshi Ando,ando@es.hokudai.ac.jp

Received 8 October 2009; Accepted 30 November 2009

Academic Editor: Anthony To Ming Lau

Copyrightq 2010 Tsuyoshi Ando This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited

A real-valued continuous function ft on an interval α, β gives rise to a map X → fX via functional calculus from the convex set of n × n Hermitian matrices all of whose eigenvalues

belong to the interval Since the subpace of Hermitian matrices is provided with the order structure induced by the cone of positive semidefinite matrices, one can consider convexity of this map

We will characterize its convexity by the following trace-inequalities: TrfB − fAC − B ≤ TrfC − fBB − A for A ≤ B ≤ C A related topic will be also discussed

1 Introduction and Theorems

Let ft be a real-valued continuous function defined on an open interval α, β of the real line The function ft is said to be convex if

f λa  1 − λb ≤ λfa  1 − λfb 0≤ λ ≤ 1; α < a, b < β. 1.1

We referee to 1 for convex functions Under continuity the requirement 1.1 can be

restricted only to the case λ  1/2, that is,

f



a  b

2



f a  fb

2



α < a, b < β

It is well known that when ft is a C1-function, its convexity is characterized by the condition on the derivative

f b − fb − t

t ≤ fb ≤ f b  t − fb

t



α < b − t < b  t < β

Trang 2

and, further when ft is a C2-function, by the condition on the second derivative

fb ≥ 0 α < b < β

On the other hand, it is easy to see that1.1 is equivalent to the following requirement

on the divided difference:

f b − fa

b − af c − fb

c − b



α < a < b < c < β

or even to the inequality



f b − fac − b ≤f c − fbb − a α < a ≤ b ≤ c < β

LetMn be the linear space of n × n complex matrices, and H nitsreal subspace of n×n Hermitian matrices The identity matrix I will be denoted simply by 1, and correspondingly,

a scalar λ will represent λI For Hermitian A, B the order relation A ≤ B means that B − A is

positive-semidefinite, or equivalently

will mean that B − A is positive definite; that is, B ≥ A and B − A is invertible see 2 for basic facts about matrices.

Notice that for scalars α, β and Hermitian X the order relation α < X < β is equivalent

to the condition that every eigenvalue of X is in the interval α, β Denote by H n α, β the convex set of Hermitian matrices X such that α < X < β A continuous function ft, defined

onα, β, induces a nonlinear map X → fX from H n α, β to H n through the familiar

functional calculus, that is,

f X : U diagf λ1, , fλ nU∗ 1.8

with a unitary matrix U which diagonalizes X as

UXU  diag λ1, , λ n . 1.9

The function ft is said to be matrix-convex of order n, or simply n-convex on the interval α, β

if the map X → fX is convex on H n α, β or more exactly

f λA  1 − λB ≤ λfA  1 − λfB 0≤ λ ≤ 1; A, B ∈ H n



α, β

1.10

see 3,4 This is a formal matrix-version of 1.1 In view of 1.7 this convexity means that

n

Trang 3

Just as in the scalar case for the matrix-convexity the following matrix-version of1.2

is sufficient:

2fB ≤ fB  X  fB − X B ± X ∈ H n



α, β

This means that ft is n-convex if and only if the map t → fB  tX is convex on −1, 1 when B ± X ∈ H n α, β.

The 1-convexity is nothing but the usual convexity of the function ft It is easy to see that n-convexity implies m-convexity for all 1 ≤ m ≤ n.

It is knownsee 3 that if ft is 2-convex then it is already a C2-function, andsee

5,6 that for each n there is an n-convex function which is not n  1-convex.

It should be mentioned here that in his original definition of n-convexity Kraus 3 restricted the requirement1.11 only for X ≥ 0 with rankX  1 We will return to this point

later

The corresponding matrix-versions of 1.5 and 1.6 have no definite meaning becausefB − fAC − B or fB − fAB − A−1is no longer Hermitian

On the spaceMn the most useful linear functional is the Trace, in symbol, TrX, which

is defined as the sum of diagonal entries of X with respect to any orhonormal basis The useful properties of the trace are commutativity, TrXY  TrYX, and positivity, that is, X ≤

Y ⇒ TrX ≤ TrY .

We will use a characterization of positive semidefiniteness X ≥ 0 in terms of trace:

X ≥ 0 ⇐⇒ Tr XY ≥ 0 0 ≤ Y of rank-one. 1.12

Notice in this connection that if both X, Y are Hermitian, then TrXY  is a real number.

Our main aim is to establish trace-versions of 1.5 and 1.6 The trace-version for

1.6 is quite natural

Theorem 1.1 A continuous function ft on an interval α, β is n-convex if and only if

Tr

f B − fAC − B ≤ Trf C − fBB − A A ≤ B ≤ C in H n



α, β

. †n

On the other hand, the trace-version for1.5 turns out quite restrictive.

Theorem 1.2 Let n ≥ 2 A continuous function ft on an interval α, β satisfies the condition

Tr

f B − fAB − A−1≤ Trf C − fBC − B−1 

A < B < C in H n



α, β

‡n

if and only if it is of the form ft  at2 bt  c with a ≥ 0, and b, c ∈ R.

Trang 4

2 Preliminary

In order to prove theorems, we use a well-established regularization techniquesee 7 I-4

Take a nonnegative symmetric C-function ϕt on −∞, ∞ such that

ϕ t  0 |t| ≥ 1,

∞

and for  > 0 let ϕ : ϕt−1−1 Then ϕ  t is a nonnegative, symmetric C∞-function such that

ϕ  t  0 |t| ≥ ,

∞

Given a continuous function ft on an interval α, β, setting ft  0 outside of the

intervalα, β, define f  t as the convolution of this extended function f with ϕ , that is,

f  t :f  ϕ 

t 

∞

The following is well known

Lemma 2.1 The function f  is a C-function, in fact,

d k

dt k f   f  d k

and f  t converges to ft uniformly on each compact subset of the interval α, β as  → 0.

Lemma 2.2 Let ft be a continuous function on an interval α, β.

i ft satisfies n  on α, β if and only if for small  > 0 the function f  t satisfies n  on

α  , β − .

ii ft is n-convex on α, β if and only if for small  > the function f  t is n-convex on

α  , β − .

Proof i Let ft satisfy n  on α, β Suppose that α   < A ≤ B ≤ C < β − , then

Tr

f  C − f  BB − A − Trf  B − f  AC − B





−



Tr

f C − s − fB − s {B − s − A − s}

−Trf B − s − fA − s {C − s − B − s} ϕ  sds ≥ 0,

2.5

Trang 5

α < A − s ≤ B − s ≤ C − s < β |s| ≤ . 2.6

The converse statement is clear by the second half ofLemma 2.1

The proof ofii is also easy and omitted

In a similar way we have the following

Lemma 2.3 A continuous function ft satisfies ‡ n  on α, β if and only if for small  > 0 the

functin f  t satisfies n  on α  , β − .

When ft is a C1-function onα, β, B ∈ H n α, β and X ∈ H n , the map t → fB  tX

is defined for small|t| and differentiable at t  0 The derivative of this map at t  0 will be

denoted byDfB; X, that is,

DfB; X : d

dt

t0

When B is daigonal as B  diagλ1, , λ n, it is known see 2 V-3 and 8 6-6 that

DfB; X  f1

λ i , λ j n i,j1◦x ij n i,j1 for X 

x ij n i,j1 , 2.8

where ◦ denotes the Schur product  entrywise product and f1s, t is the first divided

difference of f, defined as

f1s, t 

f s − ft

s − t , if s /  t,

fs, if s  t.

2.9

Notice thatf1λ i , λ jn i,j1is a real symmetric matrix

In a similar way when ft is a C2-function, the second derivative of the map t → fB 

tX at t  0 is written as see 2 V-3 and 8 6-6

d2

dt2

t0

f B  tX  2

n

k1

f2

λ i , λ k , λ j



x ik x kj

n

i,j1 for X 

x ij n i,j1 , 2.10

where f2s, t, u is the second divided difference of f, defined as

f2s, t, u  f1s, t − f1t, u

s − u 

f1s, t − f1t, u

s − u , if s /  u,

fs − f1t, s

s − t , if s  u /  t,

fs

2.11

Trang 6

Since the functional calculus is invariant for unitary similarity, that is, fVXV  

VfXV , the formulas 2.8 and 2.10 well determine the forms of derivatives

Lemma 2.4 If ft is a C1-function on an interval α, β, then

Tr

DfB; X · Y TrDfB; Y · X B ∈ H n



α, β

; X, Y ∈ H n



Proof We may assume that B  diagλ1, , λ n, then by 2.8 for X  x ijn

i,j1 and Y 

y ijn

i,j1

Tr

DfB; X · Y n

i,j1

f1

λ i , λ j



x ij y ji

 n

i,j1

f1

λ j , λ i



y ji x ij TrDfB; Y · X.

2.13

3 Proofs of Theorems

By Lemmas2.3and2.4we may assume that ft in theorems is a C∞-function

Proof of Theorem 1.1 Suppose that the function ft satisfies n  on α, β Take B ∈ H n α, β

and 0 ≤ X, Y of rank-one such that C : B  X and A : B − tY for small t > 0 belong to

Hn α, β Since A ≤ B ≤ C, by assumption n we have

Tr



f B − fB − tYX

hence by2.7

Tr

Then it follows fromLemma 2.4that

Tr

Since 0≤ X, Y of rank-one are arbitrary, it follows from 3.3 and 1.12 that for any 0 ≤ X of rank one such that α < B ± X < β

and similarly

Trang 7

2fB ≤ fB  X  fB − X B ± X ∈ H n



α, β and 0≤ X of rank-one. 3.6

This means that the matrix-valued function t → fB  tX is convex under the condition that

0≤ X is of rank-one.

At this point we proved the n-convexity in the sense of Kraus 3 as mentioned in

Section 1 The remaining part is essentially the same as Kraus’ approach3

Since for 0≤ X of rank-one and small t > 0 by 3.6

0≤ f B  tX  fB − tX − 2fB

dt2

t0

we can conclude from2.10 that for 0 ≤ X  x ijn

i,j1of rank-one

n

k1

f2

λ i , λ k , λ j



x ik x kj

n i,j1

For each t > 0, consider a positive semidefinite matrix of rank-one

0≤ X x ij n i,j1: t ij−2 n

Then by3.8

0≤

 n



k1

f2

λ i , λ k , λ j



x ik x kj

n

i,j1

 diag1, t, , t n−1n

k1

f2

λ i , λ k , λ j



t2k−1

n

i,j1

diag

1, t, , t n−1

,

3.10

which implies

n

k1

f2

λ i , λ k , λ j



t2k−1

n i,j1

Letting t → 0, we have

C1 : f2

λ i , λ1, λ j

n

Trang 8

In a similar way we can see that

C k: f2

λ i , λ k , λ j

n i,j1 ≥ 0 k  1, 2, , n. 3.13

Now since for any X  x ijn

i,j1 ∈ Hn each matrixx ik x kjn

i,j1 k  1, 2, , n is

positive semidefinite and of rank-one, it follows from3.8 that

d2

dt2

t0

f B  tX  2

n

k1

f2

λ i , λ k , λ j



x ik x kj

n i,j1

 2n

k1

C k◦x ik x jk n i,j1 ≥ 0.

3.14

Here we used the well-known fact that the Schur product of two positive semidefinite matrices is again positive semidefinitesee 2 I-6 Therefore

d2

dt2

t0

f B  tX ≥ 0 B ∈ H n



α, β

; X ∈ H n



which implies the convexity of the map t → fB  tX whenever B ± X ∈ H n α, β This completes the proof of the n-convexity of the function ft.

Suppose conversely that ft is n-convex on the interval α, β, then by 1.3

f C − fB  fB  C − B − fB ≥ d

dt

t0

f B  tC − B

 DfB; C − B,

3.16

so that by1.12

Tr

f C − fBB − A ≥ TrDfB; C − B · B − A, 3.17 and similarly

Tr

f B − fAC − B ≤ TrDfB; B − A · C − B. 3.18 Now byLemma 2.4we can conclude

Tr

f B − fAC − B ≤ Trf C − fBB − A, 3.19

which shows that the function ft satisfies n This completes the whole proof of

Theorem 1.1

In the above proof we really showed the following

Trang 9

Theorem 3.1 A continuous function ft on an interval α, β is n-convex if and only if TrfB −

fAC − B ≤ TrfC − fBB − A, whenever A ≤ B ≤ C in H n α, β and rankB − A 

rankC − B  1

Notice that Kraus3 cf 8, Theorem 6.6.52 really showed, for n ≥ 2, that ft is

n-convex on α, β if and only if it is a C2-function and

f2

λ i , λ1, λ j

n i,j1 ≥ 0 ∀λ1, , λ n∈α, β

For the proof ofTheorem 1.2, let us start with an easy lemma

Lemma 3.2 If condition ‡ n  for ft is valid on α, β, so is condition (‡ m ) for 1 ≤ m < n.

Proof Given A < B < C in H m α, β, take λ ∈ α, β and small  > 0 and consider the n × n

matrices



A :



0 λ − I n−m





B 0

0 λI n−m



, C :



0 λ  I n−m



where I n−mis then − m × n − m identity matrix Then since  A <  B <  C in H n α, β and

Tr

f

B− fA

B − A−1

 Trf B − fAB − A−1 n − m f λ − fλ − 

Tr

f



C

− fBC −  B−1

 Trf C − fBB − A−1 n − m f λ   − fλ

3.22

by letting  → 0 it follows from n that

Tr

f B − fAB − A−1≤ Trf C − fBC − B−1. 3.23 This completes the proof

In view of Lemma 3.2 the essential part of the proof of Theorem 1.2 is in the next lemma

Lemma 3.3 If a C1- function ft satisfies (‡2) on α, β, then

fs  ft  2f1s, t α < s, t < β

Trang 10

Proof Take B  diagt1, t2 with t1, t2 ∈ α, β Then for any 2 × 2 positive definite X, Y > 0 and small  > 0 we have by assumption

Trf B − fB − X

−1≤ Trf B  Y − fB

Letting  → 0 by 2.8 this leads to the inequality

Tr

f1

t i , t j

2

i,j1 ◦ X



· X−1≤ Tr

f1

t i , t j

2

i,j1 ◦ Y



Replacing X and Y we have also

Tr

f1

t i , t j

2

i,j1 ◦ Y



· Y−1≤ Tr

f1

t i , t j

2

i,j1 ◦ X



Those together show that

Tr

f1

t i , t j

2

i,j1 ◦ X



· X−1 constant 0 < X ∈ H2. 3.28

It is easy to see that a 2× 2 positive definite matrix X with TrX  1 is of the form

X 



a u

a 1 − a

u

a 1 − a 1− a



0 < a < 1; |u| < 1. 3.29 Now it follows3.28 and 3.29 that

Tr

f1

t i , t j

2

i,j1 ◦ X



· X−1

 ft1  ft2 − 2|u|2

f1t1, t2

1− |u|2  constant |u| < 1,

3.30

which is possible only when3.24 is valid

Proof of Theorem 1.2 Suppose that a C2-function ft satisfies n  on α, β By Lemmas3.2

and3.3ft satisfies the identity 3.24 Therefore we have



ft  fs t − s  2f t − fs α < s, t < β

Twice differentiating both sides with respect to t we arrive at

ftt − s  0 α < s, t < β

3.32

Trang 11

which is possible only when ft is a quadratic function

Finally a ≥ 0 follows from the usual convexity of ft.

Suppose conversely that ft is of the form 3.33 with a ≥ 0 Take A < B < C in H n Then

Tr

f B − fAB − A−1 aTrB2− A2

and correspondingly

Tr

f C − fBC − B−1 aTrC2− B2

Since

we have

Tr

B2− A2

and correspondingly

Tr

C2− B2

Therefore we arrive at the inequality

Tr

f C − fBC − B−1− Trf B − fAB − A−1 a{TrC − TrA} ≥ 0. 3.39

This shows that ft satisfies n  for any n and on any interval α, β.

Acknowledgment

The author would like to thank Professor Fumio Hiai for his valuable comments on the original version of this paper

References

1 A W Roberts and D E Varberg, Convex Functions, vol 57 of Pure and Applied Mathematics, Academic

Press, New York, NY, USA, 1973

2 R Bhatia, Matrix Analysis, vol 169 of Graduate Texts in Mathematics, Springer, New York, NY, USA, 1997.

...

Trang 9

Theorem 3.1 A continuous function ft on an interval α, β is n-convex if and only if TrfB... 3.3 and 1.12 that for any ≤ X of rank one such that α < B ± X < β

and similarly

Trang 7

2fB... j

n

Trang 8

In a similar way we can see that

C k: f2

Ngày đăng: 21/06/2014, 20:20

TỪ KHÓA LIÊN QUAN

TÀI LIỆU CÙNG NGƯỜI DÙNG

TÀI LIỆU LIÊN QUAN