Vers un théorème de la limite centrale dans l'espace de Wasserstein ?/ Towards a central limit theorem in the Wasserstein space?

(1)

Citation for this paper:

Agueh, M. & Carlier, G. (2017). Vers un théorème de la limite centrale dans

l’espace de Wasserstein?/Towards a central limit theorem in the Wasserstein

space?. Comptes Rendus Mathematique, 355(7), 812-818.

https://doi.org/10.1016/j.crma.2017.05.010

UVicSPACE: Research & Learning Repository

_____________________________________________________________

Faculty of Science

Faculty Publications

_____________________________________________________________

Vers un théorème de la limite centrale dans l’espace de Wasserstein? /Towards a

central limit theorem in the Wasserstein space?

Martial Agueh, Guillaume Carlier

July 2017

©2017 Académie des sciences. Publié par Elsevier Masson SAS. Cet article est publié en

Open Access sous licence CC BY-NC-ND (

http://creativecommons.org/licenses/by-nc-nd/4.0/

).

This article was originally published at:

http://dx.doi.org/10.1016/j.crma.2017.05.010

(2)

Contents lists available atScienceDirect

C. R.

Acad.

Sci.

Paris,

Ser. I

www.sciencedirect.com

Analyse fonctionnelle

Vers

un

théorème

de

la

limite

centrale

dans

l’espace

de

Wasserstein ?

Towards

a

central

limit

theorem

in

the

Wasserstein

space?

Martial Agueh

a

,

Guillaume Carlier

b

,

c

a_University_of_Victoria,_Victoria,_BC,_PO_Box₃₀₆₀_STN_CSC_Victoria,_BC,_V8W_3R4,_Canada b_Université_{Paris-Dauphine,}_PSL_Research_University,_CNRS,_CEREMADE,₇₅₀₁₆_Paris,_France c_INRIA,_Centre_de_Paris,_équipe_MOKAPLAN,_France

i

n

f

o

a

r

t

i

c

l

e

r

é

s

u

m

é

Historiquedel’article :

Reçule19février2017

Acceptéaprèsrévisionle29mai2017 DisponiblesurInternetle7juin2017 Présentéparlecomitéderédaction

Lesbarycentresdansl’espacedeWassersteinconstituentunemanièrenaturelled’interpoler entreplusieursmesuresdeprobabillité,utiledansdifférentsdomainesappliquéscommele traitementd’imagesoul’apprentissagestatistique.Nousconjecturonsquecesbarycentres obéissentàun théorèmede lalimite centrale quenous démontrons dansquelques cas (très)particuliers.

a

b

s

t

r

a

c

t

The notion of Wasserstein barycenters is anatural way to interpolatebetween several probabilitymeasures,usefulinvariousappliedsettingslikeimageprocessingormachine learning.Weconjecturethatsuchbarycentersobeyacentrallimittheoremwhichweprove insome(very)particularcases.

AbridgedEnglishversion

The 2-Wasserstein space(on

R

d),

P

2

(

R

d

)

isby deﬁnitionthespaceofBorelprobability measures on

R

d havingﬁnite

secondmomentsandendowedwiththedistanceW2

W₂2

(

μ

,

ν

)

:=

inf

⎧

⎪

⎨

⎪

⎩

Rd_×Rd

|

x

−

y

|

2d

γ

(

x

,

y

),

γ

∈ (

μ

,

ν

)

⎫

⎪

⎬

⎪

⎭

,

∀(

μ

,

ν

)

∈

P

2

(

R

d

₎

2 ₍₁₎

Adressese-mail :agueh@math.uvic.ca(M. Agueh),carlier@ceremade.dauphine.fr(G. Carlier).

(3)

where

(

μ

,

ν

)

denotesthesetoftransportplansbetween

μ

and

ν

i.e.thesetofprobabilitymeasureson

R

d

× R

dhaving

μ

and

ν

asmarginals.Givenaninteger N,

ν

1

,

. . .

ν

N elements of

P

2

(

R

d

)

andpositiveweights

λ

= (λ

1

,

. . . ,

λ

N

)

∈ R

₊N with

N

i=1

λ

i

=

1,aWassersteinbarycenterofthemeasures

ν

iwithweights

λ

iisaminimizerof

Jλ

(

μ

)

:=

N

i=1

λ

iW22

(

ν

i

,

μ

).

(2)

Wassersteinbarycenterswereintroducedinapreviouswork[1],whereitwasobservedthat:

– theminimizerisunique(itsexistenceisobvious)assoonasoneofthemeasures

ν

doesnotgivemasstosmallsets(i.e. BorelsetsofHausdorff dimensionatmostd

−

1),inthiscase,wemaycallthisminimizerthe Wassersteinbarycenter ofthemeasures

ν

iwithweights

λ

i,

– ifforsome p

∈ (

1

,

+∞)

,all themeasures

ν

i areLp (withrespecttotheLebesguemeasure)thensoistheirbarycenter

(thisfollowsfromadisplacementconvexitytypeargumentinthespiritofMcCann’sseminalwork[11]),thisalsoholds forthelimitcasep

=

1 andforp

= ∞

,itisenoughthatoneofthemeasures

ν

i isL∞.

Wassersteinbarycenters foundvariousapplicationsinstatistics,imageprocessingandmachinelearning,andtherearefast solverstocomputethem,seeCuturi[5]andBenamouet al.[2].TheconceptofWassersteinbarycenterhasbeensigniﬁcantly extendedby BigotandKlein [3],Le GouicandLoubes[10] tothe caseofa quitegeneralBorelprobabilitymeasure m on

(P

2

(

R

d

),

W2

)

andby Kim and Pass [9] tothe Riemannian setting. AWasserstein barycenter isthen a minimizer ofthe

Wassersteinvariancefunctional:

Jm

(

μ

)

:=

P2(Rd)

W₂2

(

ν

,

μ

)

dm

(

ν

).

(3)

Theexistenceofabarycentercaneasilybeestablishedbythedirectmethodofthecalculusofvariations(see[10])assoon as

P2(Rd)

Rd

|

x

|

2d

ν

(

x

)

dm

(

ν

) <

+∞.

(4)

Uniqueness alsoholdsprovided thatm givesastrictlypositivemasstothe setofmeasures thatvanishonsmallsets.We shallalwaysmaketheseassumptionsandthendeﬁneunambiguously:

bar

(

m

)

:=

argmin_P

2(Rd)Jm

.

(5)

If,inaddition,forsome p

∈ (

1

,

∞)

,m issupportedby

P

2

(

R

d

)

∩

Lp

(

R

d

)

and

P2(Rd)

ν

p

Lp₍_Rd₎dm

(

ν

) <

+∞

(6)

thesameconvexityargumentasin[1]givesthatbar

(

m

)

∈

Lp

(

R

d

)

.Ofcourse,(2)isaspecialcaseof(3)correspondingtoa discrete m:m

=

N_i₌₁

λ

i

δ

νi,andifone ofthemeasures

ν

i vanisheson smallsets, thebarycenterofthisdiscretemeasure

willbedenotedas:

bar

(

ν

1

, λ

1

. . . ,

ν

N

, λ

N

)

=

bar

N i=1

λ

i

δ

νi

:=

argmin_P₂₍_Rd₎ N

i=1

λ

iW22

(

ν

i

, .).

(7)

FollowingBigotandKlein [3],we areinterested intheasymptoticbehaviorofempirical Wassersteinbarycenters.More precisely,letusconsiderm aBorelprobabilitymeasurem on

P

2

(

R

d

)

suchthatforsome p

∈ (

1

,

∞)

andsome C

>

0

Rd

|

x

|

2d

ν

(

x

)

+

Rd

ν

(

x

)

pdx

≤

C for m-a.e.

ν

.

(8)

Consideringani.i.d.sample of(random)probability measures

ν

ˆ

1

,

. . . ,

ν

ˆ

n drawnaccordingtotheprobabilitymeasurem on

P

2

(

R

d

)

, theempirical Wassersteinbarycenter of thissample is the random (a.s.well-deﬁned sincem gives full massto

measuresthatvanishonsmallsets)measure

ˆ

μ

n

:=

bar

ˆ

ν

1

,

1 n

. . . ,

ν

ˆ

n

,

1 n

.

(9)

(4)

In the compactlysupported case, Bigot andKlein [3]proved a lawof largenumber, extended toa generalsetting by Le GouicandLoubes[10],fortheempiricalbarycenter:

lim

n→∞W

2

(

μ

ˆ

n

,

μ

¯

)

→

0 a.s., where

μ

¯

:=

bar

(

m

).

(10)

Since,thanksto(8)both

μ

¯

:=

bar

(

m

)

and

μ

ˆ

nhaveLpdensities,thereexistsauniqueoptimaltransportmapforW2between

thetruebarycenter

μ

¯

andtheempiricalbarycenter

μ

ˆ

n,wedenoteby T

ˆ

n

:=

Tμ¯→ ˆμn thismap,whichmaybeviewedasan L2

(

μ

¯

,

R

d

)

-valuedrandom variable. Since W₂2

(

μ

ˆ

n

,

μ

¯

)

= ˆ

Tn

−

id

2_L2₍_μ_¯₎, thanks to (8)and toa simple convexity argument

(see[1]),onehasW2

2

(

μ

ˆ

n

,

μ

¯

)

≤

2C .Thanksto(10)andLebesgue’sdominatedconvergencetheorem,wethushave

ˆ

Tn

−

id

2_L2₍_μ_¯₎

→

0 a.s. and

E

ˆ

Tn

−

id

2_L2₍_μ_¯₎

→

0

.

(11)

ItisnaturalthentoinvestigatewhetheronecangoonestepfurtherintheconvergenceofempiricalWasserstein barycen-ters throughsome centrallimit theorem.Thelawoflargenumbers ofBigotandKlein givesthat T

ˆ

n convergesa.s.to the

identity map, we believe thatan L2 _estimate _in _O

₍

₁

_/

√

_n

₎

_and_asymptotic_normality _seem_reasonable _to _conjecture_for_a

wide classofmeasures m.Proving sucha CLTeveninthe caseofa discretem seems adelicate issue,actually relatedto theregularityofsolutionsofan obstacleproblemforasystemofMonge–Ampèreequations.Letusmentionthatarelated questionconcerningtheasymptoticdistributionofthescalarquantity

√

nW2

(

μ

ˆ

n

,

μ

¯

)

anditsuseforstatisticaltestspurposes

hasbeenconsideredintherecentworks[6]and[7].

Bydeﬁnition,wewillsaythattheWassersteinCLTholdswhenever

√

n

( ˆ

Tn

−

id

)

convergesindistribution(inthe

separa-bleHilbertspaceL2

₍

_μ

_¯

_,

_R

d

₎

₎_to_a_centered_Gaussian

_{N (}

₀

_,

₎

_for_a_certain_positive_self-adjoint_operator

_of_trace_class_on L2

₍

_μ

_¯

_,

_R

d

₎

_._The_following_gives_a_positive_answer_in_a_few_(very)_particular_cases.

Theorem1.TheWassersteinCLTholdsinthefollowingcases: (i) m

= (

1

− λ)δ

ν0

+ λδ

ν1with

λ

∈ (

0

,

1

)

,

ν

0,

ν

1in

P

2

(

R

d

₎

_and

_ν

0vanishingonsmallsets,

(ii) d

=

1,m satisﬁes(19)andgivesfullmasstothesetofnon-atomicmeasures, (iii) m

=

N_i₌₁

λ

i

δ

νiwitheach

ν

ianon-degenerateGaussianon

R

d_.

1. Introduction

L’espace de Wasserstein(sur

R

d),

P

2

(

R

d

)

est pardéﬁnition l’ensemble desmesures de probabilité sur

R

d,de second

momentﬁni,munidelamétriqueW2 déﬁnieparleproblèmedetransportoptimalquadratique : W₂2

(

μ

,

ν

)

:=

inf

⎧

⎪

⎨

⎪

⎩

Rd_×Rd

|

x

−

y

|

2d

γ

(

x

,

y

),

γ

∈ (

μ

,

ν

)

⎫

⎪

⎬

⎪

⎭

,

∀(

μ

,

ν

)

∈

P

2

(

R

d

₎

2 ₍₁₂₎

où

(

μ

,

ν

)

estl’ensembledesplansdetransportentre

μ

et

ν

c’est-à-direl’ensembledesmesuresdeprobabilitésur

R

d

_×R

d

ayant

μ

et

ν

commemarginales(nousrenvoyonsauxlivresdeVillani[13] etSantambrogio[12] pouruneprésentationde lathéoriedutransportoptimal).SoitN unentierplusgrandque1,

ν

1

,

. . .

ν

N deséléments

P

2

(

R

d

)

et

λ

= (λ

1

,

. . . ,

λ

N

)

∈ R

₊N

despoidspositifsnormaliséspar

N_i₌₁

λ

i

=

1,unbarycentredansl’espacedeWassersteindesmesures

ν

iaveclespoids

λ

i

estunminimiseurde Jλ

(

μ

)

:=

N

i=1

λ

iW22

(

ν

i

,

μ

).

(13)

Cettenotionaétéintroduitedansnotretravailprécédent[1],danslequelnousavionsobservéque :

– le minimiseurest unique (l’existence estquant à elleévidente) dèslorsque l’une desmesures

ν

i necharge pas les

ensemblespetits (i.e.lesBoréliensde dimensiondeHausdorffauplusd

−

1) ;danscecas,onappelleceminimiseurle

barycentredansl’espacedeWassersteindesmesures

ν

i aveclespoids

λ

i;

– sitoutes lesmesures

ν

i sontdans Lp (parrapport àlamesurede Lebesgue)pourun certain p

∈ (

1

,

+∞)

,alorsilen

estdemêmedeleurbarycentre(cecidécouled’unargumentdetypeconvexitépardéplacementdansl’espritdutravail pionnierdeMcCann[11]), c’estencorevalablepourlecaslimite p

=

1 etpour p

= ∞

,ilsuﬃtquel’unedesmesures

ν

i (pourlaquellelepoids

λ

i

>

0 évidemment)soitdansL∞.

(5)

l’espacedeWassersteinsontrelativementpopulairesdansdifférentscontextesapplicatifscommeletraitementd’imagesou l’apprentissagestatistiqueetpeuventsecalculernumériquementeﬃcacement–voirCuturi[5]etBenamouet al.[2].

Danscettenote,nousconjecturonsque,lorsquelesmesures

ν

isontdesmesurestiréesaléatoirementetdemanièrei.i.d.

selonunecertain loide probabilitém sur

P

2

(

R

d

)

vériﬁantdesconditionsadéquates,alorsl’écartentre levrai barycentre

delamesurem (voirparagraphesuivant)etlebarycentreempiriqueobéitàunthéorèmedelalimitecentrale.Celaestun raﬃnementquinoussemblenatureld’unrésultatdetypeloidesgrandsnombresdeBigotetKlein[3].Nousprécisonsun peulesdéﬁnitionsdansleparagraphesuivantetétablironslethéorèmedelalimitecentraleauparagraphe3dansquelques castrèsparticuliers.

2. Barycentreempiriqueetvraibarycentre

Lanotiondebarycentredansl’espace deWassersteinaétéétenduedemanièresigniﬁcativeaucasd’unemesure boré-liennem sur

P

2assezgénéraleparBigotandKlein[3],LeGouicetLoubes[10]danslecasde

(

P

2

(

R

d

),

W2

)

etparKimet

Pass[9]danslecasdel’espacedeWassersteinsurunevariétériemannienne.UnbarycentreWassersteindem est alorsun miniseurdelafonctionnelledevariance :

Jm

(

μ

)

:=

P2(Rd)

W₂2

(

ν

,

μ

)

dm

(

ν

).

(14)

L’existence d’un tel barycentre s’obtientaisément par la méthode directedu calcul desvariations (voir notamment [10]) lorsque

P2(Rd)

Rd

|

x

|

2d

ν

(

x

)

dm

(

ν

) <

+∞.

(15)

L’unicitéest quantà elleassurée dès quem donne unemesurepositive auxmesures qui nechargent pasles ensembles petits.Sousceshypothèses,onpeutdoncdéﬁnirsansambiguité

bar

(

m

)

:=

argmin_P

2(Rd)Jm

.

(16)

Sienplus,pourun p

∈ (

1

,

∞)

,m estportéepar

P

2

(

R

d

)

∩

Lp

(

R

d

)

et

P2(Rd)

ν

p

Lp₍_Rd₎dm

(

ν

) <

+∞

(17)

lemêmeargumentde convexitéque dans[1]permetde déduire que bar

(

m

)

∈

Lp

₍

_R

d

₎

_._Évidemment,₍₁₃₎_est _un_cas

par-ticulier de(14)correspondant àlamesure discrètem

=

_iN₌₁

λ

i

δ

νi,etsi l’unedesmesures

ν

i (avecun poidsstrictement

positif)nechargepaslesensemblespetits,onnotealors :

bar

(

ν

1

, λ

1

. . . ,

ν

N

, λ

N

)

=

bar

N i=1

λ

i

δ

νi

:=

argmin_P 2(Rd) N

i=1

λ

iW22

(

ν

i

, .).

(18)

SuivantBigotetKlein[3],nousnousintéressonsaucomportementasymptotiquedesbarycentres deWasserstein empi-riques.Plusprécisément,considéronsm unemesuredeprobabilitéboréliennesur

P

2

(

R

d

)

tellequ’ilexisteC

>

0 telque

Rd

|

x

|

2d

ν

(

x

)

≤

C pour m-presque tout

ν

, (19)

etp

∈ (

1

,

∞)

telque

Rd

ν

(

x

)

pdx

≤

C pour m-presque tout

ν

. (20)

Considéronsmaintenantunéchantilloni.i.d.de mesuresaléatoires deprobabilité

ν

ˆ

1

,

. . . ,

ν

ˆ

n tiréesselonm sur

P

2

(

R

d

)

.

Le barycentrede Wassersteinempiriquede cet échantillonestlamesurealéatoire(p.s.biendéﬁnie puisquem estportée parlesmesuresquinechargentpaslesensemblespetits)

ˆ

μ

n

:=

bar

ˆ

ν

1

,

1 n

. . . ,

ν

ˆ

n

,

1 n

.

(21)

(6)

Bigot et Klein [3] ont établi, dans un cadre à support compact, une loi des grands nombres, étendue par Le Gouic et Loubes[10]àdescasbeaucoupplusgénéraux,pourlebarycentreWassersteinempirique :

lim

n→∞W

2

(

μ

ˆ

n

,

μ

¯

)

→

0 p.s., avec

μ

¯

:=

bar

(

m

).

(22)

Comme, grâceà (20)àla fois

μ

¯

:=

bar

(

m

)

et

μ

ˆ

n sont Lp,ilexiste ununique transport optimal(voir[4,13,12]) pour W2

entrelevraibarycentre

μ

¯

etlebarycentreempirique

μ

ˆ

n,notonsT

ˆ

n

:=

Tμ¯→ ˆμncetransportoptimalquel’ondoitcomprendre

comme unevariablealéatoireprenant sesvaleursdansl’espace deHilbertséparable L2

(

μ

¯

,

R

d

)

.Ona W2₂

(

μ

ˆ

n

,

μ

¯

)

= ˆ

Tn

−

id

2

L2₍_μ_¯₎,avec(19)etunargumentsimpledeconvexité(voir[1]),onaW22

(

μ

ˆ

n

,

μ

¯

)

≤

2C ,desortequ’avec(22)etlethéorème

deconvergencedominéedeLebesgue,ona

ˆ

Tn

−

id

2_L2₍_μ_¯₎

→

0 p.s. et

E

ˆ

Tn

−

id

2_L2₍_μ_¯₎

→

0

.

(23)

Il est alorsnaturel de chercherà allerun ordre plus lointaindansla convergenceau travers d’un théorème de lalimite centrale.LaloidesgrandsnombresdeBigotetKleinénonçantqueT

ˆ

nconvergep.s.etenmoyennequadratiqueversl’identié

dans L2

(

μ

¯

)

,il semble raisonnable de conjecturerune vitessede convergenceen O

(

1

/

√

n

)

etunenormalité aymptotique de

√

n

( ˆ

Tn

−

id

)

,au moins pour certainesclasses de mesuresm. Une stratégienaturelle consiste à essayer de dériver les

conditions d’optimalité par rapport à la mesure pour appliquer un théorème d’inversion locale. Cela semble néanmoins délicat,mêmedanslecasd’unemesurem discrète,carlaconditiond’optimalitécaractérisantlebarycentreprendlaforme d’unproblèmed’obstaclepourunsystèmed’équationsdeMonge–Ampère,l’aspectfrontièrelibreduproblèmecompliquant signiﬁcativementl’analyse...Mentionnonsunequestionreliéequiafaitl’objetdetravauxrécentsenstatistiques[6,7]etqui concerne l’identiﬁcation de la distributionasymptotique de la quantité scalaire

√

nW2

(

μ

ˆ

n

,

μ

¯

)

aﬁnde construire destest

d’hypothèseoud’adéquation.

Pardéﬁnition,nousdironsqueleTLCdansl’espacede Wassersteinestsatisfaitsi

√

n

( ˆ

Tn

−

id

)

convergeenloi(dansle

HilbertséparableL2

₍

_μ

_¯

_,

_R

d

₎

₎_vers_une_mesure_gaussienne_centrée

_{N (}

₀

_,

₎

_pour_un_certain_opérateur_auto-adjoint_positif_de

trace ﬁnie

sur L2

(

μ

¯

,

R

d

)

.Ondonneauparagraphe suivantquelquesexemplestrèsparticuliersde validitédece TLCau sensWasserstein.

3. QuelquescasdevaliditéduTLCWasserstein

Théorème3.1.LeTLCdansl’espacedeWassersteinestsatisfaitdanslescassuivants : (i) lorsquem estuneloideBernoulli,m

= (

1

−λ)δ

ν0

+λδ

ν1avec

λ

∈ (

0

,

1

)

,

ν

0,

ν

1dans

P

2

(

R

d

₎

_et

_ν

0nechargeantpaslesensembles

petits,

(ii) lorsqued

=

1 etquem estportéeparlesmesuresnonatomiquesetvériﬁe(19), (iii) lorsquem

=

N_i₌₁

λ

i

δ

νioùchaque

ν

iestunegaussiennenondégénérée.

Démonstration. (i) Pourdeuxmesures,

ν

0 et

ν

1 commedansl’énoncé,lanotiondebarycentreestétroitementliéeàcelle

degéodésiqueouinterpolantedeMcCannentre

ν

0 et

ν

1,letransportoptimaldeBrenierTν0→ν1 entre

ν

0et

ν

1 dérived’un

potentiel convexe

φ

(dont legradient estdéﬁni de manière unique

ν

0-presquepartout), Tν0→ν1

= ∇φ

,pour t

∈ [

0

,

1

]

,le

barycentrede

(

1

−

t

)δ

ν0

+

t

δ

ν1 coïncideavecl’interpolationdeMcCann(voir[11,1]) :

ν

(

t

)

:=

bar

((

1

−

t

)δ

ν0

+

t

δ

ν1

)

= ∇φ

t #

ν

0avec

φ

t

:= (

1

−

t

)

1 2

|

x

|

2

₊

_t

_φ

(observerque

∇φ

t apourinverse

∇φ

t∗quiestLipschitzdèsquet

∈ [

0

,

1

)

).Parailleurs,ilestfaciledevoirqueletransport

optimalentredeuxpoints

ν

(

t

)

et

ν

(

s

)

delagéodésique,avecs,t dans

[

0

,

1

]

ets

=

1 estdonnépar

Tν(s)→ν(t)

=

id

+

t

−

s

1

−

s

(

∇φ ◦ ∇φ

∗

s

−

id

).

Enparticulier,letransportoptimalT

ˆ

nentrebar

(

m

)

=

ν

(λ)

etlebarycentreemprique

μ

ˆ

n

=

ν

(ˆλ

n

)

(ici

ˆλ

ndésignelafréquence

empiriqued’apparitionde

ν

1 dansl’échantillon

ν

ˆ

1

,

. . . ,

ν

ˆ

n)satisfait

√

n

( ˆ

Tn

−

id

)

=

√

n

(ˆλ

n

− λ)

(

1

− λ)

(

∇φ ◦ ∇φ

∗ λ

−

id

)

etonconclutavecleTLCstandardpour

√

n

(ˆλ

n

− λ)

.

(ii) Endimension1,lebarycentred’unemesurem portéeparlesmesuresnonatomiquesestnonatomiqueetdonnépar laformuleexplicite(voirparexemple[1])

(7)

où

ρ

estn’importequellemesure nonatomiqueetTρ→ν l’uniquetransportcroissantde

ρ

vers

ν

.Enprenant

ρ

:=

bar

(

m

)

et

enappliquantlaformuleprécédenteaubarycentreempirique

μ

ˆ

n,ilvientqueletransportoptimal(i.e.croissant)debar

(

m

)

vers

μ

ˆ

n est

ˆ

Tn

:=

1 n n

i=1 T_bar₍_m₎_→ˆ_ν_i (25)

etcomme les variables aléatoires à valeursdans L2

(

bar

(

m

))

, Tbar(m)→ˆνi sonti.i.d. etde carré intégrable par (19), leTLC

usueldanslesespacesdeHilbertséparables(voir[8])permetdeconclure.

(iii) Sanspertedegénéralité,noussupposeronsqueles

ν

i sontdesgaussiennescentréesetnotons Si

=

Ki2 leurmatrice

de variancecovariance(Ki et Si appartiennent à

S

_d++le cônedesmatricesd

×

d symétriques,déﬁnies positives, onnote

par ailleurs

S

d l’espacedesmatricessymétriques et

S

_d+ le cônedesmatricesd

×

d symétriques, semi-déﬁnies positives).

Lepointimportanticiestquesi

α

∈

N

:= {(

α

1

,

· · · ,

α

N

)

∈ R

₊N

;

Ni=1

α

i

=

1

}

alors

ν

(

α

)

:=

bar

(

Ni=1

α

i

δ

νi

)

estelle-même

unegaussiennecentréedontlamatricedevariance-covarianceS

(

α

)

estl’uniqueracinedans

S

_d++ del’équationmatricielle (cf.[1]) : I

=

N

i=1

α

i

(

S

)

où

i

(

S

)

:=

Ki

(

KiS Ki

)

− 1 2_K_i

_.

₍₂₆₎

Envertudulemme 3.2ci-dessous,l’application

α

∈

N

→

S

(

α

)

∈

S

_d++estC∞.Levraibarycentrebar

(

ν

1

,

λ

1

. . . ,

ν

N

,

λ

N

)

est

lamesuregaussiennecentréede variance-covarianceS

(λ)

tandisquelebarycentreempirique

μ

ˆ

n estlamesuregaussienne

centréedevariance-covarianceS

(ˆλ

n

)

où

ˆλ

n estlevecteurdesfréquencesempiriques :

(ˆλ

n

)

i

=

1

n#

{

j

=

1

. . . ,

n

: ˆ

ν

j

=

ν

i

},

i

=

1

, . . . ,

N

.

(27)

LeTLCimpliqueque

√

n

(ˆλ

n

− λ)

convergeenloivers

N (

0

,

σ

)

(

σ

i j

:= λ

i

δ

i j

− λ

i

λ

j).Ilestbienconnuqueletransportoptimal

entrelesgaussiennescentréesdevariancecovarianceS

(λ)

etS

(ˆλ)

estlinéaireetexplicitementdonnéenfonctionde

ˆλ

par

T

(ˆλ)

=

S

(ˆλ)

12

₍

_S

_(ˆλ)

12_S

_(λ)

_S

_(ˆλ)

12

₎

−12_S

_(ˆλ)

12

c’estencoreuneapplicationC∞ de

ˆλ ∈

N dans

S

d++etévidemmentT

(λ)

=

id.Avecuneinégalitéd’accroissementsﬁnis,

ilvient

ˆ

Tn

=

T

(ˆλ

n

)

=

id

+

T

(λ)(ˆλ

n

− λ) +

ε

n

,

|

ε

n

| ≤ |ˆλ

n

− λ|

sup θ∈[λ,ˆλn]

|

T

(θ )

−

T

(λ)

|,

desorteque

√

n

( ˆ

Tn

−

id

)

=

T

(λ)

√

n

(ˆλ

n

− λ) +

√

n

ε

n

cequi pardesargumentsclassiques (laméthodedelta)permetaisémentde conclureque

√

n

( ˆ

Tn

−

id

)

convergeenloivers

unedistributiongaussiennecentréeetdevariance–covarianceT

(λ)

σ

T

(λ)

.

2

Nousavonsutiliséci-dessuslerésultatsuivant

Lemme3.2.L’application

α

∈

N

→

S

(

α

)

∈

S

_d++déﬁnieimplicitementparl’équation(26)estdeclasseC∞.

Démonstration. Grâce au théorème des fonctions implicites, il suﬃt de montrer que pour S

∈

S

_d++,

N_i₌₁

α

i

_i

(

S

)

est

inversible.Pour

θ

∈

S

d,Li

:=

i

(

S

)(θ )

∈

S

d estlasolutionuniquede

−

K−_i 1S−1

θ

S−1K_i−1

= (

KiS Ki

)

− 1 2_K−1 i LiK− 1 i

+

K− 1 i LiK− 1 i

(

KiS Ki

)

− 1 2

_.

₍₂₈₎ Déﬁnissant

Li

:=

K−_i 1LiK_i−1

,

Si

:=

KiS Ki

,

θ

i

:=

Ki

θ

Ki

,

ilestcommodederéécrire(28)souslaformeplusconcise

−

S−_i1

θ

i

S−_i1

=

S− 1 2 i

Li

+

Li

S −1 2 i

.

(29)

(8)

Supposonsque

θ

∈

S

d soitdanslenoyaude

iN=1

α

i

(

S

)

i.e.

N

i=1

α

iLi

=

0,ilvientdoncavec(29)etquelques

manipula-tionsélémentaires 0

=

N

i=1

α

iTr

(

Li

θ )

=

N

i=1

α

iTr

(

Li

θ

i

)

= −

2 N

i=1

α

iTr

(

S 1 2 i

Li

S 1 2 i

Li

S 1 2 i

)

(30) et comme

S 1 2 i

Li

S 1 2 i

Li

S 1 2

i

∈

S

d+, chaque terme de cette sommeest nul, de sorteque, pour

α

i

>

0,comme

Si

∈

S

d++, on a

Li

=

0 etdonc

θ

i

=

0,sibienque

θ

=

0,cequimontrel’inversibilitécherchée.

2

Note :

Cette note a été rédigée aprèsla disparition soudainedu premier auteur, Martial Agueh ; le second auteur tient à la dédieràsamémoire.

Références

[1]M.Agueh,G.Carlier,BarycentersintheWassersteinspace,SIAMJ.Math.Anal.43 (2)(2011)904–924.

[2]J.-D.Benamou,G.Carlier,M.Cuturi,L.Nenna,G.Peyré,IterativeBregmanprojectionsforregularizedtransportationproblems,SIAMJ.Sci.Comput. 37 (2)(2015)1111–1138.

[3] J. Bigot,T. Klein, Characterization ofbarycenters in the Wasserstein space byaveraging optimal transport maps,preprint, https://arxiv.org/abs/ 1212.2562,2012.

[4]Y.Brenier,Polarfactorizationandmonotone.Rearrangementofvector-valuedfunctions,Commun.PureAppl.Anal.44 (4)(1991)375–417.

[5] M.Cuturi,A.Doucet,FastcomputationofWassersteinbarycenters,in:Proceedingsofthe31stInternationalConferenceonMachineLearning(ICML-14), Beijing,China,21–26June2014,pp. 685–693.

[6] E.DelBarrio,H.Lescornel,J.-M.Loubes,AstatisticalanalysisofadeformationmodelwithWassersteinbarycenters:estimationprocedureandgoodness ofﬁttest,preprint,https://arxiv.org/abs/1508.06465,2015.

[7] J.Ebert,V.Spokoiny,A.Suvorikova,Constructionofnon-asymptoticconﬁdencesetsin2-Wassersteinspace,preprint,https://arxiv.org/abs/1703.03658, 2017.

[8]E.Giné,J.-R.Leòn,OnthecentrallimittheoreminHilbertspace,Stochastica4 (1)(1980)43–71.

[9]Y.-H.Kim,B.Pass,WassersteinBarycentersoverRiemannianmanifolds,Adv.Math.307(2017)640–683.

[10]T.LeGouic,J.-M.Loubes,ExistenceandconsistencyofWassersteinbarycenters,Probab.TheoryRelat.Fields(2016)1–17.

[11]R.-J.McCann,Aconvexityprincipleforinteractinggases,Adv.Math.128(1997)153–179.

[12]F.Santambrogio,OptimalTransportforAppliedMathematicians,Birkhäuser,2015.