After an introduction reviewing the properties of the Wasserstein space and corresponding subdifferential calculus, applications are given to evolutionary partial differential equations.
Trang 2EQUATIONS, 3
Edited by
C Dafermos, Brown University, Providence, USA
Eduard Feireisl, Mathematical Institute AS CR, Prague, Czech Republic
Description
The material collected in this volume reflects the active present of this area of mathematics, ranging from the abstract theory of gradient flows to stochastic representations of non-linear parabolic PDE's Articles will highlight the present as well as expected future directions of development of the field with particular emphasis on applications The article by Ambrosio and Savare discusses the most recent development in the theory of gradient flow of probability measures After an introduction reviewing the properties of the Wasserstein space and corresponding subdifferential calculus, applications are given to evolutionary partial differential equations The contribution of Herrero provides a description of some mathematical approaches developed to account for quantitative as well as qualitative aspects
of chemotaxis Particular attention is paid to the limits of cell's capability to measure external cues on the one hand, and to provide an overall description of aggregation models for the slim mold Dictyostelium discoideum on the other The chapter written by Masmoudi deals with a rather different topic - examples of singular limits in hydrodynamics This is nowadays a well-studied issue given the amount of new results based on the development of the existence theory for rather general systems of equations in hydrodynamics The paper by DeLellis addreses the most recent results for the transport equations with regard to possible applications in the theory of hyperbolic systems of conservation laws Emphasis is put on the development of the theory in the case when the governing field is only a BV function The chapter by Rein represents a comprehensive survey of results on the Poisson-Vlasov system in astrophysics The question of global stability of steady states is addressed in detail The contribution of Soner is devoted to different representations of non-linear parabolic equations in terms of Markov processes After a brief introduction on the linear theory, a class of non-linear equations is investigated, with applications to stochastic control and differential games The chapter written by Zuazua presents some of the recent progresses done on the problem of controllabilty of partial differential equations The applications include the linear wave and heat equations,parabolic equations with coefficients
of low regularity, and some fluid-structure interaction models
Trang 33.N Masmoudi: Examples of singular limits in
hydrodynamics
195
4 C DeLellis: Notes on hyperbolic systems of conservation
laws and transport equations
277
5 G Rein: Collisionless kinetic equations from astrophysics
- the Vlasov-Poisson system
383
6 H.M Soner: Stochastic representations for non-linear
parabolic PDE's
477
7 E Zuazua Controllability and observability of partial
differential equations: Some results and open problems
Trang 4The original aim of this series of Handbook of Differential Equations was to acquaint the
interested reader with the current status of the theory of evolutionary partial differentialequations, with regard to some of its applications in physics, biology, chemistry, economy,among others The material collected in this volume reflects the active present of this area
of mathematics, ranging from the abstract theory of gradient flows to stochastic tations of nonlinear parabolic PDEs
represen-The aim here is to collect review articles, written by leading experts, which will light the present as well as expected future directions of development of the field withparticular emphasis on applications The contributions are presented in alphabetical orderaccording to the name of the first author The article by Ambrosio and Savaré discusses themost recent development in the theory of gradient flow of probability measures After anintroduction reviewing the properties of the Wasserstein space and corresponding subdif-ferential calculus, applications are given to evolutionary partial differential equations Thecontribution of Herrero provides a description of some mathematical approaches developed
high-to account for quantitative as well as qualitative aspects of chemotaxis Particular attention
is paid to the limits of cell’s capability to measure external cues on the one hand, and
to provide an overall description of aggregation models for the slim mold Dictyostelium
discoideum on the other The chapter written by Masmoudi deals with a rather different
topic – examples of singular limits in hydrodynamics This is nowadays a well-studied sue given the amount of new results based on the development of the existence theory forrather general systems of equations in hydrodynamics The chapter by De Lellis addressesthe most recent results for the transport equations with regard to possible applications inthe theory of hyperbolic systems of conservation laws Emphasis is put on the develop-
is-ment of the theory in the case when the governing field is only a BV function The chapter
by Rein represents a comprehensive survey of results on the Poisson–Vlasov system inastrophysics The question of global stability of steady states is addressed in detail Thecontribution of Soner is devoted to different representations of nonlinear parabolic equa-tions in terms of Markov processes After a brief introduction on the linear theory, a class
of nonlinear equations is investigated, with applications to stochastic control and tial games The chapter written by Zuazua presents some of the recent progresses done onthe problem of controllability of partial differential equations The applications include thelinear wave and heat equations, parabolic equations with coefficients of low regularity, andsome fluid–structure interaction models
differen-v
Trang 5We firmly believe that the fascinating variety of rather different topics covered by thisvolume will contribute to inspiring and motivating researchers in the future.
Constantine DafermosEduard Feireisl
Trang 6Herrero, M.A., Departamento de Matemática Aplicada, Facultad de CC Matemáticas,
Universidad Complutense de Madrid, Avda Complutense s/n, 28040 Madrid, Spain
Soner, H.M., Koç University, Istanbul, Turkey (Ch 6)
Zuazua, E., Departamento de Matemáticas, Universidad Autónoma, 28049 Madrid, Spain
(Ch 7)
vii
Trang 7Gradient Flows of Probability Measures
Luigi Ambrosio
Scuola Normale Superiore di Pisa, Piazza dei Cavalieri 7, 56126 Pisa, Italy
E-mail: l.ambrosio@sns.it
Giuseppe Savaré
Dipartimento di Matematica, Università di Pavia, Pavia via Ferrata 1, 27100 Pavia, Italy
E-mail: giuseppe.savare@unipv.it
Contents
Introduction 3
Notation 7
1 Notation and measure-theoretic results 8
1.1 Transport maps and transport plans 9
1.2 Narrow convergence 10
1.3 The change of variables formula 11
2 Metric and differentiable structure of the Wasserstein space 13
2.1 Absolutely continuous maps and metric derivative 13
2.2 The quadratic optimal transport problem 14
2.3 Geodesics in P2(Rd ) 16
2.4 Existence of optimal transport maps 17
2.5 The continuity equation with locally Lipschitz velocity fields 19
2.6 The tangent bundle to the Wasserstein space 29
3 Convex functionals in P2(Rd ) 38
3.1 λ-geodesically convex functionals inP2(Rd ) 39
3.2 Examples of convex functionals in P2(Rd ) 40
3.3 Relative entropy and convex functionals of measures 47
3.4 Log-concavity and displacement convexity 50
4 Subdifferential calculus in P2(Rd ) 55
4.1 Definition of the subdifferential for a.c measures 58
4.2 Subdifferential calculus in Pa(Rd ) 60
4.3 The case of λ-convex functionals along geodesics 62
4.4 Regular functionals 65
4.5 Examples of subdifferentials 68 HANDBOOK OF DIFFERENTIAL EQUATIONS
Evolutionary Equations, volume 3
Edited by C.M Dafermos and E Feireisl
© 2007 Elsevier B.V All rights reserved
1
Trang 85 Gradient flows of λ-geodesically convex functionals inP2(Rd ) 84
5.1 Characterizations of gradient flows, uniqueness and contractivity 85
5.2 Main properties of gradient flows 89
5.3 Existence of gradient flows by convergence of the “minimizing movement” scheme 95
5.4 Bibliographical notes 104
6 Applications to evolution PDEs 107
6.1 Gradient flows and evolutionary PDEs of diffusion type 107
6.2 The linear transport equation for λ-convex potentials 111
6.3 Kolmogorov–Fokker–Planck equation 113
6.4 Nonlinear diffusion equations 129
6.5 Drift diffusion equations with nonlocal terms 132
6.6 Gradient flow of−W2/2 and geodesics 133
References 133
Trang 9In a finite-dimensional smooth setting, the gradient flow of a function φ :Md→ R defined
on a Riemannian manifoldMd simply means the family of solutions u :R → Md of theCauchy problem associated to the differential equation
d
dt u(t ) = −∇φu(t )
Thus, at each time t ∈ R equation (0.1), which is imposed in the tangent space T u(t )Md
ofMd at the moving point u(t), simply prescribes that the velocity vector v t:= d
dt u(t )of
the curve u equals the opposite of the gradient of φ at u(t).
The extension of the theory of gradient flows to suitable (infinite-dimensional) stract/functional spaces and its link with evolutionary PDEs is a wide subject with a longhistory
ab-One of its first main achievement, going back to the pioneering papers by Komura [61],Crandall and Pazy [33], Brézis [21] (we refer to the monograph [22]), concerns an Hilbert
space H and nonlinear contraction semigroups generated by a proper, convex, and lower semicontinuous functional φ : H → (−∞, +∞] Since in general φ admits only a sub-
differential ∂φ in a (possibly strict) subset D(∂φ) ⊂ D(φ) := {u ∈ H: φ(u) < +∞} and
each tangent space of H can be identified with H itself, it turns out that (0.1) should be rephrased as a subdifferential inclusion on the positive real line
u(t ) ∈ −∂φu(t )
, t >0; u(0) = u0∈ D(φ), (0.2)and it provides a general framework for studying existence, uniqueness, stability, asymp-totic behavior, and regularizing properties of many PDEs of parabolic type
The possibility to work in a more general metric space (E, d) and/or with smooth perturbations of a convex functional φ : E → (−∞, +∞] has been exploited by
non-De Giorgi and his collaborators in a series of papers originating from [37] and nating in [64] (see also the presentation of [6] and our recent book [9]) One of thenice features of this approach is the so-called “minimizing movement” approximationscheme [36]: it suggests a general variational procedure to approximate and construct gra-dient flows by a recursive minimization algorithm For, one introduces a uniform partition
culmi-0 < τ < 2τ < · · · < nτ < · · · of the positive real line, τ > 0 being the step size, and
start-ing from the initial value U τ0:= u0one looks for a suitable approximation U τ n of u at the time nτ by iteratively solving the minimum problems
((n − 1)τ, nτ] can be constructed Limit points (possibly after extracting a suitable
subse-quence) of U τ (t ) as τ ↓ 0 can be considered as good candidates for gradient flows of φ and
Trang 10in many circumstances it is, in fact, possible to give differential characterizations of theirtrajectories.
One of the most striking application of this variational point of view has been introduced
by Otto [57,74] (also in collaboration with Jordan and Kinderlehrer): he showed that theFokker–Planck equation
∂ t u − ∇ · (∇u + u∇V ) = 0 in R d × (0, +∞) (0.4)and nonlinear diffusion equations of porous media type
for a suitable choice of the nonlinearity F and of the reference measure γ in Rd Here
the solutions u t of (0.4) and (0.5) yield a corresponding family of evolving measures
μ t∈ P2(Rd ) through the identification μ t = u tLd
One of the main novelties of Otto’s approach relies in the particular distance d
on P2(Rd )which should be used to recover the above mentioned PDEs in the limit: it
is the so-called Kantorovich–Rubinstein–Wasserstein distance between two measures μ,
The minimum in (0.7) is thus evaluated on all probability measures γ on the product
Rd× Rd whose marginals π#1γ , π#2γ are μ and ν, respectively, π1, π2:Rd× Rd→ Rd
denote the canonical projections on the first and the second factor
By applying the “minimizing movement” scheme inP2(Rd )with the above choice (0.6)
of φ and with d := W2, it is, in fact, possible to show that its discrete trajectories converge
to the solution of a suitable evolution PDE Moreover, Otto introduced a formal mannian” structure in the spaceP2(Rd )in order to guess first, and then prove rigorouslythe form of the limit PDEs and their gradient flow structure like in (0.1)
“Rie-The aim of this chapter is to present, in a simplified form, the general and rigoroustheory developed in our book [9] (written with N Gigli), giving quite general answers tothe following questions:
1 Give a rigorous meaning to the concept of gradient flow inP2(Rd )
2 Find general conditions on φ in order to guarantee the convergence of the
“minimiz-ing movement” scheme inP2(Rd )
Trang 113 Characterize the limit trajectories and study their properties, applying them to classes
of specific and relevant examples
In comparison with [9], the simplification comes from the fact that we mostly restrict selves to absolutely continuous measures, in finite-dimensional spaces, while in [9] none
our-of these restrictions is present
Concerning the first point, it is clear from the heuristic arguments of Otto and from (0.1)that one should make precise:
(1a) the notion of velocity vector field of a curve (μ t ) t ∈(0,T )of measures inP2(Rd ),
(1b) the notion of tangent space Tan μP2(Rd )ofP2(Rd ) at a given measure μ, (1c) the notion of gradient of a functional φ (like (0.6)) at μ.
The investigations about velocity and tangent space are, in fact, strictly related to a deep
analysis of the continuity equation
∂ t μ t + ∇ · (v t μ t )= 0 in Rd × (0, T ).
It is carried out in Section 2.6 after some basic preliminaries of measure theory (recalled
in Section 1), a brief outline on optimal transportation and Wasserstein distance (presented
in Sections 2.1–2.4), and a more detailed review on the classical representation formulasfor solutions of the continuity equation, which is discussed in Section 2.5 Starting formthe general definition of absolutely continuous curves in a (arbitrary) metric space, we will
show that every absolutely continuous family of measures (μ t ) t ∈(0,T ) inP2(Rd )satisfiesthe continuity equation
∂ t μ t + ∇ · (v t μ t )= 0 in the distribution sense of D
SinceP2(Rd )is a length space (i.e., the infimum of the distance between any two points
is the infimum of the lengths of all curves connecting the two points), one recovers also theBenamou–Brenier [15] formula
W2(μ, ν)= min
1 0
sider vt as the velocity vector of the curve (μ t ) and the squared L2(μ t; Rd )-norm as themetric tensor inP2(Rd )
Trang 12It turns out that in general the set spanned by all the possible velocity vector fields of a
curve through a measure μ is a proper subset of L2(μ; Rd ) For, vtcan be strongly
approx-imated in L2(μ t; Rd )by gradients of smooth functions (and this approximability property
is equivalent to (0.9)); moreover, gradients of smooth functions are always velocity vectors
(in the above sense) of smooth curves These facts suggests the definition of the tangent
space as
TanμP2
Rd:= ∇ϕ: ϕ ∈ C∞
for the squared Wasserstein distance from a given measure ν Here t ν μ t are the optimal
transport maps between μ t and ν (provided they exist, as it happens whenever μ t are
absolutely continuous) and i is the identity map.
Concerning (1c), any reasonable definition of gradient in infinite-dimensional spacesshould be sufficiently general to fit with various classes of nonsmooth functionals For easy
of exposition, in this chapter we decided to focus our attention on the case of geodesically
convex (or, more generally, λ-convex) functionals (we refer to [9] for more general results).
Geodesics inP2(Rd )play a crucial role and their characterization is briefly discussed inSection 2.3 Section 3 is thus devoted to the analysis of convex functionals inP2(Rd )and
to some particularly important examples, discovered by McCann [66]
Having at our disposal a nice Hilbertian structure at the level of each tangent space and
a significant notion of convexity, it is natural to develop a subdifferential theory modeled
on the well-known linear one We deal with this program in Section 4: first of all we define
the (Fréchet) subdifferential ∂φ(μ) of φ at a measure μ Even if it is a multivalued map,
it is possible to perform a natural minimal selection ∂φ◦(μ)among its values, which joys nice features and always belongs to the tangent space TanμP2(Rd ) Sections 4.2–4.4present the basic calculus properties of the subdifferential: they precisely reproduce theanalogous ones of the linear framework and justify the interest for this notion Section 4.5contains the main characterizations of the subdifferential of the most relevant function-als (internal, potential and interaction energies, and the negative squared Wasserstein dis-tance)
en-Combining all these notions, we end up with the rigorous definition of the gradient flow
of a functional φ in Section 5: it always has the structure of the continuity equation
Trang 13linking vt to μ t through the functional φ When φ has the structure of (0.6) and μ t = ρ t γ,(0.14) is equivalent (in a suitable weak sense) to
vt= −∇F(ρ
The remaining part of the section is devoted to study the main properties of the gradient
flows, obtained independently from the existence issue, i.e directly from the definition We
conclude the section providing an answer to the second question we raised before, i.e., the
construction of the gradient flow by means of the variational approximation scheme.
Even in this case (λ-geodesic) convexity plays a crucial role and we are able to obtain
the same well-known results of the theory in flat linear spaces Here we only mention the
generation of a contracting and regularizing semigroup satisfying, when λ > 0, nice
as-ymptotic convergence estimates In comparison with other papers ([29,76], for the porousmedium equation on Riemannian manifolds), where similar goals are pursued, our ap-
proach is totally independent of the specific form of the functional φ and of the PDE that
it induces: it is ultimately based on the one hand on monotonicity inequalities (ensured by
the λ-convexity of φ), and on the other hand on (0.12), whose validity is a purely
geomet-rical fact Furthermore, as shown in [9], it extends also to the case whenRd is replaced by
a separable Hilbert space and/or singular (e.g., concentrated) measures are allowed.The last section illustrates our main examples and applications A particular emphasis
is devoted to the linear Fokker–Planck equation (0.4) associated to a convex potential V
with arbitrary growth at infinity: as showed by Otto, it is the gradient flow inP2(Rd )ofthe relative entropy functional
absolutely continuous measures w.r.t γ coincides with the Markov semigroup generated
by the natural Dirichlet form associated to γ
Applications to the case of nonlinear diffusion equations and to more complicateddifferential–integral equations are also considered
Notation
B r (x) open ball of radius r centered at x in a metric space
B(X) Borel sets in a separable metric space X
Cb0(X) space of continuous and bounded real functions defined on X
C∞
c (Rd ) space of smooth real functions with compact support inRd
P (X) probability measures in a separable metric space X
P2(X) probability measures with finite quadratic moment, see (1.3)
Trang 14P2a(Rd ) measures inP2(Rd )absolutely continuous w.r.t.Ld
L p (μ; Rd ) L p space of μ-measurableRd-valued maps
π i projection operators on a product space X, see (1.8)
Γ (μ1, μ2) 2-plans with given marginals μ1, μ2
Γo(μ1, μ2) optimal 2-plans with given marginals μ1, μ2
W2(μ, ν) Wasserstein distance between μ and ν, see (2.6)
tν μ optimal transport map between μ and ν given by Theorem 2.3
TanμtP2(Rd ) tangent bundle toP2(Rd ), see (2.42)
μ1→2
t geodesic curve connecting μ1to μ2, see (3.1)
|u|(t) metric derivative of u : (a, b) → E, see (2.2)
AC p ((a, b) ; E) absolutely continuous u : (a, b) → E with |u| ∈ L p (a, b), see (2.3)
Lip(φ, A) Lipschitz constant of the function φ in the set A
∂φ (v) Fréchet subdifferential of φ in Hilbert (4.2) or Wasserstein spaces, see
Definition 4.1 and (4.20)
|∂φ|(v) metric slope of φ, see (4.4) and (4.29)
∂◦φ (μ) minimal selection in the subdifferential, see Lemma 4.10
M τ (t ) piecewise constant interpolation of M n
τ, see (5.54)
MM(Φ ; u0) minimizing movement of φ, see the definition before (5.55)
1 Notation and measure-theoretic results
In this section we recall the main notation used in this chapter and some basic
measure-theoretic terminology and results Given a separable metric space (X, d), we denote
by P (X) the set of probability measures μ : B(X) → [0, 1], where B(X) is the Borel
σ -algebra The support of μ ∈ P (X) is the closed set
Trang 15We denote byLdthe Lebesgue measure inRdand set
P2a(X):= μ∈ P2(X) : μ Ld
,
whenever X ∈ B(R d )
1.1 Transport maps and transport plans
If μ ∈ P (X1) , and r : X1 → X2 is a Borel (or, more generally, μ-measurable) map, we
denote by r#μ ∈ P (X2) the push-forward of μ through r, defined by
(r◦ s)#μ= r#(s#μ) where s : X1 → X2, r : X2→ X3, μ ∈ P (X1). (1.7)
We denote by π i , i = 1, 2, the projection operators defined on a product space
X:= X1× X2, defined by
π1: (x1 , x2) → x1∈ X1, π2: (x1 , x2) → x2∈ X2. (1.8)
If X is endowed with the canonical product metric and the Borel σ -algebra and μ ∈ P (X),
the marginals of μ are the probability measures
Trang 16To each couple of measures μ1∈ P (X1) , μ2= r#μ1∈ P (X2)linked by a Borel transport
map r : X1 → X2we can associate the transport plan
μ := (i × r)#μ1∈ Γμ1, μ2
If μ is representable as in (1.12) then we say that μ is induced by r Each transport plan μ concentrated on a μ-measurable graph in X1 × X2 admits the representation (1.12) for
some μ1-measurable map r, which therefore transports μ1to μ2(see, e.g., [7])
for every function f ∈ C0
b(X), the space of continuous and bounded real functions defined
on X.
THEOREM1.1 ([39], III-59) If a set K ⊂ P (X) is tight, i.e.,
∀ε > 0 ∃K ε compact in X such that μ(X \ K ε ) ε ∀μ ∈ K, (1.14)
then K is relatively compact in P (X).
When one needs to pass to the limit in expressions like (1.13) w.r.t unbounded or lower
semicontinuous functions f , the following two properties are quite useful The first one is
a lower semicontinuity property,
for every sequence (μ n ) ⊂ P (X) narrowly convergent to μ and any l.s.c function g : X →
( −∞, +∞] bounded from below: it follows easily by a monotone approximation argument
of g by continuous and bounded functions Changing g in −g one gets the corresponding
“lim sup” inequality for upper semicontinuous functions bounded from above In particular,
choosing as g the characteristic functions of open and closed subset of X, we obtain
lim inf
n→∞ μ n (G) μ(G) ∀G open in X, (1.16)lim sup
n→∞ μ n (F ) μ(F ) ∀F closed in X. (1.17)
Trang 17The statement of the second property requires the following definitions: we say that a Borel
function g : X → [0, +∞] is uniformly integrable w.r.t a given set K ⊂ P (X) if
we say that the set K ⊂ P (X) has uniformly integrable p-moments The following
lemma (see, for instance, Lemma 5.1.7 of [9] for its proof) provides a characterization
of p-uniformly integrable families, extending the validity of (1.13) to unbounded but with
p -growth functions, i.e., functions f : X→ R such that
f (x) A + Bd p ( ¯x, x) ∀x ∈ X, (1.20)
for some A, B 0 and ¯x ∈ X.
LEMMA 1.2 Let (μ n ) ⊂ P (X) be narrowly convergent to μ ∈ P (X) If f : X → R is
continuous, g : X → (−∞, +∞] is lower semicontinuous, and |f | and g− are uniformly
integrable w.r.t the set {μ n}n∈N, then
then f is uniformly integrable w.r.t {μ n}n∈N
In particular, a family {μ n}n∈N⊂ P (X) has uniformly integrable p-moments iff (1.21b)
holds for every continuous function f : X → R with p-growth.
1.3 The change of variables formula
Let r : A⊂ Rd→ Rd be a Borel function, with A open Then, denoting by Σr= D(∇r)
the Borel set where r is differentiable, there is a sequence of sets Σ n ↑ Σr such that r|Σ n
is a Lipschitz function for any n (see [45], Section 3.1.8) Therefore the well-known area
Trang 18formula for Lipschitz maps (see, for instance, [44,45]) extends to this general class of mapsand reads as follows:
for any Borel function h :Rd → [0, +∞] This formula leads to a simple rule for
comput-ing the density of the push-forward of measures absolutely continuous w.r.t.Ld
LEMMA1.3 (Density of the push-forward) Let ρ ∈ L1(Rd ) be a nonnegative function and assume that there exists a Borel set Σ ⊂ Σr such that r|Σ is injective and the difference
{ρ > 0} \ Σ is L d -negligible Then r#(ρLd ) Ld if and only if | det ∇r| > 0 L d -a.e.
on Σ and in this case
| det ∇r(r−1(y))|dy.
Conversely, if there is a Borel set B ⊂ Σ with L d (B) >0 and| det ∇r| = 0 on B, the area
formula givesLd ( r(B))= 0 On the other hand,
for any Borel function F : [0, +∞) → [0, +∞] with F (0) = 0 Notice that in this formula
the set Σr does not appear anymore (due to the fact that F (0) = 0 and ρ = 0 out of Σ),
so it holds provided r is differentiable ρLd -a.e., it is ρLd-essentially injective (i.e., there
exists a Borel set Σ such that r|Σ is injective and ρ= 0 Ld -a.e out of Σ ) and | det ∇r| > 0
ρLd-a.e inRd
We will apply mostly these formulas when r is the gradient of a convex function
g : Ω → R, Ω being an open subset of R d In this specific case it is well known that
Trang 19the (multivalued) subdifferential ∂g(x) of g (we will recall its definition at the beginning
of Section 4) is nonempty for every x ∈ Ω and it is reduced to a single point ∇g(x) when
g is differentiable at x: this happen forLd -a.e x ∈ Ω.
In the following result (see, for instance, [4,44]) we are considering an arbitrary Borel
selection r : Ω→ Rd such that
r(x) ∈ ∂g(x) for every x ∈ Ω. (1.25)THEOREM1.4 (Aleksandrov) Let Ω⊂ Rd be a convex open set and let g : Ω → R be a
convex function Then g is a locally Lipschitz function, (every extension r satisfying (1.25)
of ) ∇g is differentiable at L d -a.e point of Ω, its gradient∇2g(x) is a symmetric matrix, and g has the second-order Taylor expansion
and that the above inequality is strict if g is strictly convex: in this case, it is immediate
to check that∇g is injective on D(∇g), and that | det ∇2g | > 0 on the differentiability set
of∇g if g is uniformly convex.
2 Metric and differentiable structure of the Wasserstein space
In this section we look atP2(Rd )first from the metric and then from the differentiableviewpoints
2.1 Absolutely continuous maps and metric derivative
Let (E, d) be a metric space.
DEFINITION 2.1 (Absolutely continuous curves) Let I ⊂ R be an interval and let
u : I → E We say that u is absolutely continuous if there exists m ∈ L1(I )such that
d
u(s), u(t )
t s
m(τ ) dτ ∀s, t ∈ I, s t. (2.1)
Trang 20Any absolutely continuous curve is obviously uniformly continuous, and therefore it
can be uniquely extended to the closure of I It is not difficult to show (see, for instance, Theorem 1.1.2 in [9] or [11]) that the metric derivative
u(t ):= lim
h→0
d(u(t + h), u(t))
exists atL1-a.e t ∈ I for any absolutely continuous curve u(t) Furthermore, |u| ∈ L1(I )
and is the minimal m fulfilling (2.1) (i.e., |u| fulfills (2.1) and m |u| L1-a.e in I for any m with this property) For p ∈ [1, +∞] we also set
AC p (I ; E)
:= u : I → E: u is absolutely continuous andu ∈L p (I )
2.2 The quadratic optimal transport problem
Let X, Y be complete and separable metric spaces and let c : X × Y → [0, +∞] be a
Borel cost function Given μ ∈ P (X), ν ∈ P (Y ) the optimal transport problem, in Monge’s
This problem can be ill posed because sometimes there is no transport map t such that
t#μ = ν (this happens for instance when μ is a Dirac mass and ν is not a Dirac mass).
circumvents this problem (as μ × ν ∈ Γ (μ, ν)) The existence of an optimal transport plan,
when c is l.s.c., is provided by (1.15) and by Theorem 1.1, taking into account that Γ (μ, ν)
is tight (this follows easily by the fact that the marginals of the measures in Γ (μ, ν) are
fixed, and by the fact that according to Ulam’s theorem any finite measure in a completeand separable metric space is tight, see also Chapter 6 in [9] for more general formulations)
The problem (2.5) is truly a weak formulation of (2.4) in the following sense: if c is bounded and continuous, and if μ has no atom, then the “min” in (2.5) is equal to the “inf”
in (2.4), see [7,47] This result can also be extended to classes of unbounded cost functions,see [79]
In the sequel we consider the case when X = Y and c(x, y) = d2(x, y) , where d is the distance in X, and denote by Γo (μ, ν) the optimal plans in (2.5) corresponding to
this choice of the cost function In this case we use the minimum value to define theKantorovich–Rubinstein–Wasserstein distance
Trang 21THEOREM2.2 Let X be a complete and separable metric space Then W2 defines a tance inP2(X) andP2(X) , endowed with this distance, is a complete and separable metric
dis-space Furthermore, for a given sequence (μ n )⊂ P2(X) we have
(μ n ) has uniformly integrable 2-moments. (2.7)
PROOF We just prove that W2is a distance The complete statement is proved for instance
in Proposition 7.1.5 of [9] or, in the locally compact case, in [86]
Let μ, ν, σ∈ P2(X) and let γ ∈ Γo(μ, ν) and η ∈ Γo(ν, σ ) General results of
probabil-ity theory (see the above mentioned references) ensure the existence of λ ∈ P (X × X × X)
continuous function f with at most quadratic growth.
Working with Monge’s formulation the proof above is technically easier, as an
admissi-ble transport map between μ and σ can be obtained just composing transport maps between
μ and ν with transport maps between ν and σ However, in order to give a complete proof
one needs to know either that optimal plans are induced by maps, or that the infimum inMonge’s formulation coincides with the minimum in Kantorovich’s one, and none of theseresults is trivial, even in Euclidean spaces
Trang 22Although in many situations that we consider in this chapter the optimal plans are duced by maps, still the Kantorovich formulation of the optimal transport problem is quite
in-useful to provide estimates from above on W2 For instance,
Actually only the inequality d(γ (s), γ (t)) T−1(t −s)d(γ (0), γ (T )) needs to be checked
for all 0 s t T Indeed, if the strict inequality occurs for some s < t, then the triangle
Trang 23It has been proved in Theorem 7.2.2 of [9] that any constant speed geodesic joining μ to ν
can be built in this way We discuss additional regularity properties of the geodesics in the
next section Here we just mention that, in the case when γ is induced by a transport map t (i.e., γ = (i, t)#μ), then (2.9) reduces to
μ t=(1− t)i + tt#μ, t ∈ [0, 1]. (2.11)
2.4 Existence of optimal transport maps
The following basic result of [20,48,60] provides existence and uniqueness of the optimal
transport map in the case when the initial measure μ belongs toP2a(Rd )
THEOREM 2.3 (Existence and uniqueness of optimal transport maps) For any μ∈
P2a(Rd ) , ν ∈ P2(Rd ) Kantorovich’s optimal transport problem (2.5) with c(x, y)=
|x − y|2has a unique solution γ Moreover:
(i) γ is induced by a transport map t, i.e., γ = (i, t)#μ In particular t is the unique
solution of Monge’s optimal transport problem (2.4).
(ii) The map t coincides μ-a.e with the gradient of a convex function ϕ :Rd →
( −∞, +∞], whose finiteness domain D(ϕ) has nonempty interior and satisfies
PROOF We are presenting here the proof of the last statement (iii) Since (i, t)#μ
and (s, i)# ν are both optimal plans between μ and ν, they coincide Testing this identity
Trang 24between plans on|s(t(x)) − x| (resp |t(s(y)) − y|) we obtain that s ◦ t = i μ-a.e in R d
s(y) − s(y)dν(y) = 0.
The formula for the density of ν with respect to Ld follows by Lemma 1.3, taking into
In the following we shall denote by tν μthe unique optimal map given by Theorem 2.3
Notice that t= ∇ϕ is uniquely determined only μ-a.e., hence ϕ is not uniquely determined,
not even up to additive constants, unless μ = ρL d with ρ > 0Ld-a.e inRd However,the existence proof (at least the one achieved through a duality argument), yields some
“canonical” ϕ, given by the duality formula
ϕ(x)= sup
y ∈supp ν
for a suitable function ψ : supp ν → (−∞, +∞] This explicit expression is sometimes
technically useful: for instance, it shows that when supp ν is bounded we can always find
a globally convex and Lipschitz map ϕ whose gradient is the optimal transport map.
The following result shows that optimal maps along geodesics enjoy nicer properties(see also [17])
THEOREM2.4 (Regularity in the interior of geodesics) Let μ, ν∈ P2(Rd ) and let
PROOF (i) The necessary optimality conditions at the level of plans (see, for instance,
Section 6.2.3 of [9], or [86]) imply that the support of γ is contained in the graph
(x, y) : y ∈ Γ (x)
of a monotone operator Γ (x) On the other hand, the same argument used in the proof
of (2.10) shows that the plan γ := (π1, (1− t)π1+ tπ2)#γ is optimal between μ and μ t
Trang 25The support of γ t is contained in the graph of the monotone operator (1 − t)I + tΓ , whose
inverse
Γ−1(y):= x∈ Rd
: y ∈ Γ (x)
is single-valued and 1/(1 − t)-Lipschitz continuous Therefore the graph of Γ−1 is the
graph of a 1/(1 − t)-Lipschitz map s t pushing μ t to μ The uniqueness of this map, even
at the level of plans, is proved in Lemma 7.2.1 of [9]
(ii) If A ∈ B(R d ) is Ld-negligible, then st (A) is also Ld-negligible, hence
μ-negligible The identity st◦ tt = i μ-a.e then gives
μ t (A) = μt−1
t (A)
μst(A)
= 0.
2.5 The continuity equation with locally Lipschitz velocity fields
In this section we collect some results on the continuity equation
∂ t μ t + ∇ · (v t μ t )= 0 in Rd × (0, T ), (2.14)
which we will need in the sequel Here μ tis a Borel family of probability measures onRd
defined for t in the open interval I := (0, T ), v : (x, t) → v t (x)∈ Rd is a Borel velocityfield such that
REMARK2.5 (More general test functions) By a simple regularization argument via
con-volution, it is easy to show that (2.16) holds if ϕ ∈ C1
c(Rd × (0, T )) as well Moreover,
under condition (2.15), we can also consider bounded test functions ϕ, with bounded dient, whose support has a compact projection in (0, T ) (that is, the support in x need not be compact): it suffices to approximate ϕ by ϕχ R , where χ R ∈ C∞
gra-c (Rd ), 0 χ R 1,
|∇χ R | 2 and χ R = 1 on B R ( 0).
First of all we recall some technical preliminaries
Trang 26LEMMA2.6 (Continuous representative) Let μ t be a Borel family of probability measures
satisfying (2.16) for a Borel vector field v t satisfying (2.15) Then there exists a narrowly
continuous curve t ∈ [0, T ] → ˜μ t ∈ P (R d ) such that μ t = ˜μ t forL1-a.e t ∈ (0, T )
If L ζ is the set of its Lebesgue points, we know that L1(( 0, T ) \ L ζ )= 0 Let us now
take a countable set Z which is dense in Cc1(Rd ) with respect the usual C1normζ C1=
supRd ( |ζ |, |∇ζ |) and let us set L Z:=ζ ∈Z L ζ The restriction of the curve μ to L Z
pro-vides a uniformly continuous family of bounded functionals on C1
Trang 27It is not restrictive to suppose that ζ k ∈ Z Applying the previous formula (2.18), for
k=1a k < +∞ For a fixed s ∈ L Z and ε > 0, being μ s tight, we can find k∈ N such
that μ s (ζ k ) >1− ε/2 and a k < ε/2 It follows that
Passing to the limit as ε vanishes and invoking the continuity of ˜μ t, we get (2.17)
LEMMA 2.7 (Time rescaling) Let t: s ∈ [0, T] →t(s) ∈ [0, T ] be a strictly increasing
absolutely continuous map with absolutely continuous inverses:=t−1 Then (μ
t ,vt) is a distributional solution of (2.14) if and only if ˆμ := μ ◦t, ˆv :=tv◦t, is a distributional
Trang 28When the velocity field vt is more regular, the classical method of characteristics vides an explicit solution of (2.14) First we recall an elementary result of the theory ofordinary differential equations.
pro-LEMMA 2.8 (The characteristic system of ODE) Let v t be a Borel vector field such that for every compact set B⊂ Rd
admits a unique maximal solution defined in an interval I (x, s) relatively open in [0, T ]
and containing s as (relatively) internal point.
Furthermore, if t → |X t (x, s) | is bounded in the interior of I (x, s) then I (x, s) = [0, T ];
finally, if v satisfies the global bounds analogous to (2.20)
For simplicity, we set X t (x) := X t (x, 0) in the particular case s= 0 and we denote by
τ (x) := sup I (x, 0) the length of the maximal time domain of the characteristics leaving
from x at t= 0
REMARK 2.9 (The characteristics method for first-order linear PDEs) Characteristicsprovide a useful representation formula for classical solutions of the backward equation(formally adjoint to (2.14))
∂ t ϕ t , ∇ϕ = ψ in R d × (0, T ); ϕ(x, T ) = ϕ T (x), x∈ Rd , (2.24)
when, e.g., ψ ∈ C1
b(Rd × (0, T )), ϕ T ∈ C1
b(Rd )and v satisfies the global bounds (2.22),
so that maximal solutions are always defined in[0, T ] A direct calculation shows that
Trang 29solve (2.24) For X s (X t (x, 0), t) = X s (x, 0) yields
Since x (and then X t (x, 0)) is arbitrary we conclude that (2.31) is fulfilled.
Now we use characteristics to prove the existence, the uniqueness, and a representation
formula of the solution of the continuity equation, under suitable assumption on v.
LEMMA 2.10 Let v t be a Borel velocity field satisfying (2.20), (2.15), let μ0∈ P (R d ),
and let X t be the maximal solution of the ODE (2.21) (corresponding to s = 0) Suppose
that for some ¯t ∈ (0,T ]
τ (x) > ¯t for μ0-a.e x∈ Rd (2.26)
Then t → μ t := (X t )#μ0is a continuous solution of (2.14) in [0, ¯t].
PROOF The continuity of μt follows easily since lims →t X s (x) = X t (x) for μ0-a.e.
x∈ Rd : thus for every continuous and bounded function ζ :Rd→ R the dominated
con-vergence theorem yields
c (Rd × (0, ¯t)) and for μ0-a.e x∈ Rd the maps t → ϕ t (x) := ϕ(X t (x), t )
are absolutely continuous in (0, ¯t), with
Trang 30We want to prove that, under reasonable assumptions, in fact any solution of (2.14) can
be represented as in Lemma 2.10 The first step is a uniqueness theorem for the continuityequation under minimal regularity assumptions on the velocity field Notice that the only
global information on vtis (2.27) The proof is based on a classical duality argument (see,for instance, [7,19,41])
PROPOSITION2.11 (Uniqueness and comparison for the continuity equation) Let σ t be a narrowly continuous family of signed measures solving
Trang 31We define wt so that wt= vt on B2R ( 0) × [0, T ], w t = 0 if t /∈ [0, T ] and
Let wε t be obtained from wt by a double mollification with respect to the space and time
variables: notice that wε t satisfy
We now build, by the method of characteristics described in Remark 2.9, a smooth
solu-tion ϕ ε:Rd × [0, T ] → R of the PDE
We insert now the test function ϕ ε χ R in the continuity equation and take into account
that σ0 0 and ϕ ε 0 to obtain
Trang 32PROPOSITION 2.12 (Representation formula for the continuity equation) Let μ t,
t ∈ [0, T ], be a narrowly continuous family of Borel probability measures solving the
con-tinuity equation (2.14) w.r.t a Borel vector field v t satisfying (2.20) and (2.15) Then for
μ0-a.e x∈ Rd the characteristic system (2.21) admits a globally defined solution X t (x)
PROOF Let Es = {τ > s} and let us use the fact, proved in Lemma 2.10, that t →
X t # (χ Es μ0)is a solution of (2.14) in[0, s] By Proposition 2.11 we get also
Trang 33glob-Now we observe that the differential quotient D h (x, t ) := h−1(X
Since we already know that D h is pointwise converging to vt ◦ X t μ0× L1-a.e in
Rd × (0, T ), we obtain the strong convergence in L2(μ0× L1), i.e., (2.34)
Finally, we can consider t → X t ( ·) and t → v t (X t ( ·)) as maps from (0, T ) to L2(μ0;
and it shows that t → X t ( ·) belongs to AC2( 0, T ; L2(μ0; Rd )) General results for
ab-solutely continuous maps with values in Hilbert spaces yield that X t is differentiable
LEMMA 2.13 (Approximation by regular curves) Let μ t be a time-continuous solution
of (2.14) w.r.t a velocity field satisfying the integrability condition
Let (ρ ε ) ⊂ C∞(Rd ) be a family of strictly positive mollifiers in the x variable (e.g., ρ ε (x)=
(2πε) −d/2 exp( −|x|2/ 2ε)), and set
μ ε t := μ t ∗ ρ ε , E t ε := (v t μ t ) ∗ ρ ε , vε t :=E ε t
Trang 34Then μ ε t is a continuous solution of (2.14) w.r.t v ε t , which satisfies the local regularity
assumptions (2.20) and the uniform integrability bounds
t and its densityw.r.t.Ld by the same symbol Notice first that|E ε |(t, ·) and its spatial gradient are uni-
formly bounded in space by the product ofvtL1(μ t ) with a constant depending on ε, and
the first quantity is integrable in time Analogously,|μ ε
t |(t, ·) and its spatial gradient are
uniformly bounded in space by a constant depending on ε Therefore, as v ε t = E ε
t /μ ε t, thelocal regularity assumptions (2.20) is fulfilled if
inf
|x|R,t∈[0,T ] μ
ε
t (x) >0 for any ε > 0, R > 0.
This property is immediate, since μ ε
t are continuous w.r.t t and equi-continuous w.r.t x,
and therefore continuous in both variables
Lemma 2.14 shows that (2.38) holds Notice also that μ ε t solve the continuity equation
∂ t μ ε t + ∇ ·vε t μ ε t
= 0 in Rd × (0, T ), (2.40)because, by construction,∇ ·(v ε
t μ ε t ) = ∇ ·((v t μ t ) ∗ρ ε ) = (∇ ·(v t μ t )) ∗ρ ε Finally, generallower semicontinuity results on integral functionals defined on measures of the form
2dμ
(see, for instance, Theorem 2.34 and Example 2.36 in [8]) provide (2.39)
LEMMA2.14 Let μ ∈ P (R d ) and let E be anRm -valued measure inRd with finite total variation and absolutely continuous with respect to μ Then
for any convolution kernel ρ.
PROOF We use Jensen inequality in the following form: if Φ :Rm+1→ [0, +∞] is
con-vex, l.s.c and positively 1-homogeneous, then
Trang 35for any Borel map ψ :Rd→ Rm+1and any positive and finite measure θ inRd(by
rescal-ing θ to be a probability measure and lookrescal-ing at the image measure ψ# θ the formula
reduces to the standard Jensen inequality) Fix x∈ Rdand apply the inequality above with
2.6 The tangent bundle to the Wasserstein space
In this section we endowP2(Rd )with a kind of differential structure, consistent with themetric structure introduced in Section 2.2 Our starting point is the analysis of absolutely
continuous curves μ t : (a, b)→ P2(Rd ): recall that this concept depends only on the metricstructure ofP2(Rd ), by Definition 2.1 We show in Theorem 2.15 that this class of curvescoincides with (distributional) solutions of the continuity equation
∂
∂t μ t + ∇ · (v t μ t )= 0 in Rd × (a, b).
More precisely, given an absolutely continuous curve μ t, one can find a Borel
time-dependent velocity field vt:Rd→ Rdsuch thatvtL2(μt ) |μ|(t) for L1-a.e t ∈ (a, b)
and the continuity equation holds Here |μ|(t) is the metric derivative of μ t, defined
in (2.2) Conversely, if μ t solve the continuity equation for some Borel velocity field wt
withb
awtL2(μ t ) dt < +∞, then μ t is an absolutely continuous curve andwtL2(μ t )
|μ|(t) for L1-a.e t ∈ (a, b).
As a consequence of Theorem 2.15 we see that among all velocity fields wt which
produce the same flow μ t , there is a unique optimal one with smallest L2(μ t; Rd )-norm,
equal to the metric derivative of μ t; we view this optimal field as the “tangent” vector field
to the curve μ t To make this statement more precise, one can show that the minimality of
Trang 36the L2norm of wt is characterized by the property
wt∈ ∇ϕ: ϕ ∈ C∞
c
RdL2(μt;Rd )
The characterization (2.41) of tangent vectors strongly suggests to consider the ing tangent bundle toP2(Rd )
follow-TanμP2
Rd:= ∇ϕ: ϕ ∈ C∞
endowed with the natural L2metric Moreover, as a consequence of the characterization
of absolutely continuous curves inP2(Rd ), we recover the Benamou–Brenier (see [15],where the formula was introduced for numerical purposes) formula for the Wassersteindistance:
Conversely, since we know thatP2(Rd ) is a length space, we can use a geodesic μ t and
its tangent vector field vt to obtain equality in (2.43) We also show that optimal transportmaps belong to TanμP2(Rd )under quite general conditions
In this way we recover in a more general framework the Riemannian interpretation of the
Wasserstein distance developed by Otto in [74] (see also [57,73]) and used to study the longtime behavior of the porous medium equation In the original paper [74], (2.43) is derived
using formally the concept of Riemannian submersion and the family of maps φ → φ#μ
(indexed by μ Ld) from Arnold’s space of diffeomorphisms into the Wasserstein space
In Otto’s formalism tangent vectors are rather thought as s= d
dt μ t and these vectors areidentified, via the continuity equation, with−D · (v s μ t ) Moreover vs is chosen to be the
gradient of a function ψ s , so that D · (∇ψ s μ t ) = −s Then the metric tensor is induced by
the identification s → ∇φ s as follows:
s, s
μt :=
Rd s , ∇ψ s dμ t
As noticed in [74], both the identification between tangent vectors and gradients and the
scalar product depend on μ t, and these facts lead to a nontrivial geometry of the
Wasser-stein space We prefer instead to consider directly vtas the tangent vectors, allowing them
to be not necessarily gradients: this leads to (2.42)
Another consequence of the characterization of absolutely continuous curves is a result,given in Proposition 2.20, concerning the infinitesimal behavior of the Wasserstein distance
Trang 37along absolutely continuous curves μ t: given the tangent vector field vt to the curve, weshow that
lim
h→0
W2(μ t +h , (i+ hv t )#μ t )
|h| = 0 for L1-a.e t ∈ (a, b).
Moreover, the rescaled optimal transport maps between μ t and μ t +hconverge to the
trans-port plan (i× vt )#μ t associated to vt (see (2.56)) As a consequence, we will obtain in
Theorem 2.21 a key formula for the derivative of the map t → W2
2(μ t , ν)
THEOREM 2.15 (Absolutely continuous curves in P2(Rd ) ) Let I be an open interval
in R, let μ t : I → P2(Rd ) be an absolutely continuous curve and let |μ| ∈ L1(I ) be its
metric derivative, given by (2.2) Then there exists a Borel vector field v : (x, t)→ vt (x)
gener-Conversely, if a narrowly continuous curve μ t : I → P2(Rd ) satisfies the continuity
equation for some Borel velocity field w t withwtL2(μt;Rd ) ∈ L1(I ) then μ t : I→ P2(Rd )
is absolutely continuous and |μ|(t) w tL2(μt;Rd ) forL1-a.e t ∈ I
In particular equality holds in (2.44).
PROOF Taking into account that any absolutely continuous curve can be reparametrized
by arc length (see, for instance, [11]) and Lemma 2.7, we will assume with no loss ofgenerality that|μ| ∈ L∞(I ) in the proof of the first statement To fix the ideas, we also
assume that I = (0, 1).
First of all we show that for every ϕ ∈ C∞
c (Rd ) the function t → μ t (ϕ)is absolutely
continuous, and its derivative can be estimated with the metric derivative of μ t Indeed,
for s, t ∈ I and μ st ∈ Γo(μ s , μ t )we have, using the Hölder inequality,
Trang 38whence the absolute continuity follows In order to estimate more precisely the derivative
of μ t (ϕ)we introduce the upper semicontinuous and bounded map
If t is a point where s → μ s is metrically differentiable, using the fact that μ h → (x, x)#μ t
narrowly (because their marginals are narrowly converging, any limit point belongs
to Γo (μ t , μ t )and is concentrated on the diagonal ofRd× Rd) we obtain
Set Q= Rd × I and let μ =μ t dt ∈ P (Q) be the measure whose disintegration
is{μ t}t ∈I For any ϕ ∈ C∞
where J ⊂ I is any interval such that supp ϕ ⊂ J × R d If V denotes the closure
in L2(μ; Rd ) of the subspace V := {∇ϕ, ϕ ∈ C∞(Q)}, the previous formula says that the
Trang 39linear functional L : V→ R defined by
Setting vt (x) = v(x, t) and using the definition of L we obtain (2.46) Moreover, choosing
a sequence ( ∇ϕ n ) ⊂ V converging to v in L2(μ; Rd ), it is easy to show that forL1-a.e
t ∈ I there exists a subsequence n(i) (possibly depending on t) such that ∇ϕ n(i) ( ·, t) ∈
Now we show the converse implication We apply the regularization Lemma 2.13, finding
approximations μ ε t, wε t satisfying the continuity equation, the uniform integrability tion (2.15) and the local regularity assumptions (2.20) Therefore, we can apply Proposi-
condi-tion 2.12, obtaining the representacondi-tion formula μ ε
t = (T ε
t )#μ ε0, where T ε
t is the maximalsolution of the ODE ˙T t ε= wε
t (T t ε ) with the initial condition T ε = x (see Lemma 2.8).
Trang 40Now, taking into account Lemma 2.14, we estimate
Since, for every t ∈ I , μ ε
t converges narrowly to μ t as ε→ 0, a compactness argument
(see Lemma 5.2.2 or Proposition 7.1.3 of [9]) gives
for some optimal transport plan γ between μ t1 and μ t2 Since t1 and t2 are arbitrary
this implies that μ t is absolutely continuous and that its metric derivative is less than
Notice that the continuity equation (2.45) involves only the action of vt on∇ϕ with
ϕ ∈ C∞
c (Rd ) Moreover, Theorem 2.15 shows that the minimal norm among all possible
velocity fields wt is the metric derivative and that vt belongs to the L2closure of gradients
c
RdL2(μ;Rd )
.
This definition is motivated by the following variational selection principle
LEMMA 2.17 (Variational selection of the tangent vectors) A vector v ∈ L2(μ; Rd ) longs to the tangent space Tan μP2(Rd ) iff
be-v + wL2(μ;Rd ) vL2(μ;Rd )
∀w ∈ L2
μ; Rd