Document [original]

Sums of correlated exponentials: two

types of Gaussian correlation structures

vorgelegt von

Diplom-Mathematiker

Anton Klymovskiy (Klimovsky)

aus Charkiw

von der Fakult¨

at II – Mathematik und Naturwissenschaften

der Technischen Universit¨

at Berlin

zur Erlangung des akademischen Grades

Doktor der Naturwissenschaften

Dr. rer. nat.

genehmigte Dissertation

Promotionsausschuss:

Vorsitzender: Prof. Dr. Dirk Ferus

Berichter: Prof. Dr. Anton Bovier

Berichter: Prof. Dr. Erwin Bolthausen

Tag der wissenschaftlichen Aussprache: 24. Juni 2008

Berlin 2008

D83

Abstract

In this thesis, we study the limiting behaviour of the large sums of strongly correlated exponen-

tials as the number of their summands and the effective dimension of the correlation structure

simultaneously tend to infinity. We consider two types of such sums which are generated by two

a priori very different Gaussian correlation structures. The first type is a sum of hierarchically

correlated random variables which is based on the partition function of Derrida’s generalised

random energy model (GREM) with external field. The second type is an infinitesimal sum

of genuinely non-hierarchically strongly correlated random variables which is based on the

partition function of the Sherrington-Kirkpatrick (SK) model with multidimensional spins. We

consider the asymptotic behaviour (the thermodynamic limit) of these two sums on a logarith-

mic scale (i.e., at the level of free energy) and also at a more refined level of their fluctuations

(i.e., at the level of weak limiting laws). Interestingly for the SK model with multidimensional

spins, we find traces of a hierarchical organisation in the thermodynamic limit. This supports

the conjectured in theoretical physics universal behaviour of the sums of such sort.

Concerning the SK model with multidimensional spins, we obtain the following results.

We prove upper and lower bounds on the free energy of this model in terms of variational

inequalities. The bounds are based on a multidimensional extension of the Parisi functional. We

generalise and unify the comparison scheme of Aizenman, Sims and Starr and the one of Guerra

involving the GREM-inspired processes and Ruelle’s probability cascades. For this purpose, an

abstract quenched large deviations principle of the G¨

artner-Ellis type is obtained. We derive

Talagrand’s representation of Guerra’s remainder term for the SK model with multidimensional

spins. The derivation is based on well-known properties of Ruelle’s probability cascades and

the Bolthausen-Sznitman coalescent. We study the properties of the multidimensional Parisi

functional by establishing a link with a certain class of semi-linear partial differential equations.

We embed the problem of strict convexity of the Parisi functional in a more general setting

and prove the convexity in some particular cases which, however, do not cover the original

setup of Talagrand. Finally, we prove the Parisi formula for the local free energy in the case of

multidimensional Gaussian a priori distribution of spins using Talagrand’s methodology of a

priori estimates.

Concerning the GREM in the presence of uniform external field, we obtain the following

results. We compute the fluctuations of the ground state and of the partition function in the

thermodynamic limit for all admissible values of parameters. We find that the fluctuations are

described by a hierarchical structure which is obtained by a certain coarse-graining of the ini-

tial hierarchical structure of the GREM with external field. We provide an explicit formula for

the free energy of the model. We also derive some large deviation results providing an expres-

sion for the free energy in a class of models with Gaussian Hamiltonians and external field.

Finally, we prove that the coarse-grained parts of the system emerging in the thermodynamic

limit tend to have a certain optimal magnetisation, as prescribed by strength of external field

and by parameters of the GREM.

Zusammenfassung

Diese Dissertation behandelt das Grenzwertverhalten der großen Summen der stark korrelierten

Exponenziale, w¨

ahrend die Anzahl ihren Summanden und die effektive Dimension der Korre-

lationsstruktur gleichzeitig gegen Unendlichkeit gehen. Wir betrachten zwei Arten dieser Sum-

men, die durch die a priori sehr unterschiedliche Gauß’schen Korrelationsstrukturen erzeugt

werden. Die erste Art ist eine Summe der hierarchisch korrelierten Zufallsvariablen, die auf der

Zustandssumme von Derrida’s generalised random energy model (GREM) mit externem Feld

basiert. Die zweite Art ist eine infinitesimale Summe der echt nicht-hierarchisch stark korre-

lierten Zufallsvariablen, die auf der Zustandssumme vom Sherrington-Kirkpatrick (SK) Modell

mit mehrdimensionalen Spins basiert. Wir betrachten das asymptotische Verhalten (der thermo-

dynamische Limes) dieser Summen auf der logarithmischen Skala (d.h., auf dem Niveau der

freien Energie) und außerdem auf dem pr¨

aziseren Niveau ihrer Fluktuationen (d.h., auf dem Ni-

veau der schwachen Grenzwertverteilungen). Interessanterweise finden wir auch im SK Modell

mit mehrdimensionalen Spins Spuren der hierarchischen Organisation im thermodynamischen

Limes. Dies unterst¨

utzt das hypothetische universale Verhalten dieser Summen aus der theore-

tischen Physik.

Bez¨

uglich des SK Modells mit mehrdimensionalen Spins erzielen wir die folgenden Ergeb-

nisse. Wir beweisen die oberen und unteren Schranken f¨

ur die freie Energie mittels Variati-

onsungleichungen, die auf der mehrdimensionalen Verallgemeinerung des Parisi-Funktionals

basieren. Wir setzen das Vergleichsschema von Aizenman, Sims und Starr und das von Guerra

ein, die die GREM-inspirierten Prozesse und Ruell’schen Wahrscheinlichkeitskaskaden invol-

vieren. Hierf¨

ur beweisen wir ein abstraktes “quenched” Prinzip der großen Abweichungen von

G¨

artner-Ellis Art. Mittels der Eigenschaften der Ruelle’schen Wahrscheinlichkeitskaskaden und

des Bolthausen-Sznitman Koaleszents leiten wir die Darstellung von Talagrand des Restterms

von Guerra f¨

ur das SK Modell mit mehrdimensionalen Spins ab. Wir untersuchen die Eigen-

schaften des mehrdimensionalen Parisi-Funktionals, indem wir eine Verbindung mit einer Ka-

tegorie semi-linearer partieller Differentialgleichungen herstellen. Wir betten das Problem der

strengen Konvexit¨

at des Parisi-Funktionals in einen allgemeineren Kontext ein. Wir zeigen die

Konvexit¨

at in einigen F¨

allen, welche jedoch nicht die urspr¨

ungliche Formulierung von Tala-

grand umfassen. Schließlich beweisen wir die Parisi-Formel f¨

ur die lokale freie Energie im Fall

der mehrdimensionalen Gauß’schen a priori Verteilung der Spins mit der Methodologie der a

priori Absch¨

atzungen von Talagrand.

Bez¨

uglich des GREMs in Anwesenheit des uniformen externen Felds erzielen wir die fol-

genden Ergebnisse. Wir berechnen die Fluktuationen des Grundzustandes und der Zustandss-

umme im thermodynamischen Limes f¨

ur alle zul¨

assigen Werte der Parameter. Wir finden, dass

im thermodynamischen Limes die Fluktuationen durch eine hierarchische Struktur beschrie-

ben sind. Diese Struktur ist eine Grobk¨

ornung der urspr¨

unglichen hierarchischen Struktur des

GREMs mit externem Feld. Wir stellen eine explizite Formel f¨

ur die freie Energie des Modells

zur Verf¨

ugung. Wir leiten auch einige Resultate zu den großen Abweichungen, welche einen

Ausdruck f¨

ur die freie Energie einiger Modelle mit Gauß’schen Hamiltonians und externem

Feld erm¨

oglichen, her. Schließlich beweisen wir, dass die grobk¨

ornigen Teile des Systems, die

in der thermodynamischen Limes auftauchen, dazu neigen eine optimale Magnetisierung zu ha-

ben. Diese Magnetisierung ist durch die St¨

arke des externen Felds und durch die Parameter des

GREMs vorgeschrieben.

To my mother and the memory of my father

Contents

Introduction ................................................................. 1

1 Framework .............................................................. 7

1.1 Physical roots and vision . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

1.2 Statistical mechanics of spin systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

1.2.1 Basicobjects .................................................... 8

1.2.2 Thermodynamic quantities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

1.2.3 The elementary Gibbs variational principle . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

1.2.4 Thermodynamic limit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

1.3 Spin-glasses .......................................................... 13

1.3.1 Basic mathematical objects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

1.3.2 Quenched disorder and random processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14

1.3.3 Realistic spin-glass models. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

1.3.4 Thermodynamic limit in spin-glasses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

1.4 Some mean-field Gaussian spin-glass models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

1.4.1 The Sherrington-Kirkpatrick model. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

1.4.2 The random energy model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

1.4.3 The generalised random energy model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

1.4.4 Between the GREM and SK model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

1.4.5 Other mean-field spin-glass models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

1.5 Replica symmetry breaking picture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

1.5.1 Overlap distribution function and replica symmetry breaking . . . . . . . . . . . . 22

1.5.2 Limiting Gibbs measures and pure states . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

1.5.3 Higher-level objects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

1.5.4 Ultrametricity.................................................... 26

1.5.5 Free energy in the SK model: the Parisi formula . . . . . . . . . . . . . . . . . . . . . . . 26

2 Often used tools and some existing results ................................... 29

2.1 Oftenusedingredients.................................................. 29

2.1.1 Interpolation..................................................... 29

2.1.2 Concentration of measure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31

2.1.3 Superadditivity................................................... 32

2.2 Limitingobjects ....................................................... 32

2.2.1 The Poisson-Dirichlet process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

VIII Contents

2.2.2 The Ruelle probability cascades. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36

2.2.3 The Bolthausen-Sznitman coalescent and random permutations . . . . . . . . . . 38

2.3 Some results on Gaussian mean-field spin-glasses . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

2.3.1 TheGREM...................................................... 42

2.3.2 The SK model with multidimensional spins . . . . . . . . . . . . . . . . . . . . . . . . . . . 44

2.3.3 The Aizenman-Sims-Starr comparison scheme for the SK model . . . . . . . . . 49

3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins 53

3.1 Introduction........................................................... 53

3.2 Some preliminary results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58

3.2.1 Covariance structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58

3.2.2 Concentration of measure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59

3.2.3 Gaussian comparison inequalities for free energy-like functionals . . . . . . . . 61

3.3 Quenched G¨

artner-EllistypeLDP ........................................ 64

3.3.1 Quenched LDP upper bound . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64

3.3.2 Quenched LDP lower bound . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68

3.4 The Aizenman-Sims-Starr comparison scheme . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74

3.4.1 Naive comparison scheme . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74

3.4.2 Free energy upper bound . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74

3.4.3 Free energy lower bound . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78

4 Estimates of Guerra’s remainder term in the SK model with multidimensional

spins .................................................................... 81

4.1 Guerra’s comparison scheme . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81

4.1.1 Multidimensional Guerra’s scheme . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82

4.1.2 Localcomparison ................................................ 83

4.1.3 Free energy upper and lower bounds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84

4.1.4 The filtered d-dimensional GREM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85

4.1.5 A computation of the remainder term . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86

4.2 The Parisi functional in terms of differential equations . . . . . . . . . . . . . . . . . . . . . . . 89

4.2.1 The Parisi functional . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94

4.2.2 On strict convexity of the Parisi functional and its variational representation 95

4.2.3 Simultaneous diagonalisation scenario . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96

4.3 Remainderestimates ................................................... 97

4.3.1 A sufficient condition for µk-terms to vanish . . . . . . . . . . . . . . . . . . . . . . . . . . 97

4.3.2 Upper bounds on ϕ(2): Guerra’s scheme revisited . . . . . . . . . . . . . . . . . . . . . . 98

4.3.3 Adjustment of the upper bounds on ϕ(2)..............................102

4.3.4 Talagrand’s a priori estimates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105

4.3.5 Gronwall’s inequality and the Parisi formula . . . . . . . . . . . . . . . . . . . . . . . . . . 106

5 The SK model with multidimensional Gaussian spins .........................111

5.1 Introduction...........................................................111

5.2 Proof of the local Parisi formula . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112

5.2.1 The case of positive increments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112

5.2.2 Simultaneous diagonalisation scenario . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114

5.2.3 The Crisanti-Sommers functional in 1-D . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115

Contents IX

5.2.4 Replica symmetric calculations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120

5.2.5 The multidimensional Crisanti-Sommers functional . . . . . . . . . . . . . . . . . . . . 122

5.2.6 Talagrand’s a priori estimates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122

5.2.7 The local low temperature Parisi formula . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124

6 The GREM in the presence of uniform external field ..........................125

6.1 Introduction...........................................................125

6.2 Partial partition functions, external fields and overlaps . . . . . . . . . . . . . . . . . . . . . . . 130

6.3 The REM with external field revisited . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134

6.3.1 Free energy and ground state . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134

6.3.2 Fluctuations of the ground state . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137

6.3.3 Fluctuations of the partition function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140

6.4 The GREM with external field . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140

6.4.1 Fluctuations of the ground state . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140

6.4.2 Fluctuations of the partition function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 148

6.4.3 Formula for the free energy of the GREM . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153

7 Some open problems and outlook ...........................................155

A Appendix ................................................................157

Notation Index ...............................................................159

References ...................................................................163

Introduction

In this thesis, we study the limiting behaviour of the large sums of correlated exponentials

ZM:=

∑

i=1

exp(XM(i)) (0.1)

as the number of their summands Mtends to infinity (here {XM(i)}M

i=1is an array of correlated

random variables). We shall consider the sums with the summands that have the following two

characteristic properties: a wide range of orders of magnitude and significant high-dimensional

correlations. Not surprisingly, the sums of this type emerge in a variety of applied contexts,

e.g., in models of interacting complex systems being considered in physics, biology, computer

science, social sciences, finance, etc.

We should emphasise that, for the purposes of modelling of special magnetic materials called

spin glasses, study of such sums Zwas initiated and brought to very sophisticated (though

mathematically unrigorous) levels of analysis in theoretical physics, see, e.g., M´

ezard et al.

(1987). For the most recent accounts of mathematical research on such sums Z, we refer to

a thorough monograph by Talagrand (2003), very readable introductory text-book of Bovier

(2006), and comprehensive collection of review articles (Bolthausen & Bovier, 2007).

In general, the first characteristic feature mentioned above precludes the applicability of

such standard tools of probability theory as the law of large numbers and the central limit

theorem, since Zmay well be (and actually, as we shall see, in many cases is!) dominated by a

small number of exceptionally large summands. In these cases we are forced to perform more

detailed analysis of such rare events. For this purpose, we shall invoke some techniques of large

deviations and also, perhaps more importantly, extreme value theory.

Contrary to extensively studied situations with one-dimensional correlation structures (e.g.,

Markov processes, where the involved random variables are indexed by a real time variable),

we shall deal with the high-dimensional correlation structures, where effective dimension of the

index set grows with M.

In the so far rigorously analysed sums Zof the above sort, a new class of limiting objects

appears. Moreover, contrary to what might have been expected, the limiting objects possess an

important hierarchical ultrametric structure. This structure appears to be universal, modulo the

number of levels of the hierarchy.

In this thesis, we consider two a priori very different instances of such sums of correlated

exponentials. These sums are inspired by the following mean-field spin-glass models.

2 Introduction

1. The generalised random energy model with external field. The generalised random en-

ergy model (GREM) was proposed by Derrida (1985) as a model of a random energy land-

scape with a priori ultrametric correlation structure. To define the family of correlated ran-

dom variables of interest, we first fix the ultrametric structure. Given N∈

, consider the

standard discrete hypercube ΣN:={−1;1}N. It will play the role of the index set. Define

the (normalised) lexicographic overlap between the configurations σ(1),σ(2)∈ΣNas

qL(σ(1),σ(2)):=(0,σ(1)

16=σ(2)

Nmaxnk∈[1;N]∩

:[σ(1)]k= [σ(2)]ko,otherwise. (0.2)

We equip the index set with the lexicographic distance defined as

dL(σ(1),σ(2)):=1−qL(σ(1),σ(2)).(0.3)

This distance is obviously an ultrametric, that is, for all σ(1),σ(2),σ(3)∈ΣN, we have

dL(σ(1),σ(3))≤maxndL(σ(1),σ(2)),dL(σ(2),σ(3))o.(0.4)

Let GREMN:={GREMN(σ)}σ∈ΣNbe the Gaussian random process on the discrete hyper-

cube ΣNwith a covariance of the following form

hGREMN(σ(1))GREMN(σ(2))i=ρ(qL(σ(1),σ(2))),(0.5)

where ρ:[0;1]→[0;1]is the non-decreasing right-continuous function such that ρ(0) = 0

and ρ(1) = 1. Given h∈

+, consider the Gaussian process X:=XN:={XN(h,σ)}σ∈ΣN

defined as

XN(h,σ):=GREMN(σ)+ h

√N

∑

i=1

σi,σ∈ΣN.(0.6)

The second summand in (0.6) is called the external field. The parameter hrepresents the

strength of external field. The GREM with external field is based on the following sum of

M:=2Nhierarchically correlated exponentials

ZN(β):=∑

σ∈ΣN

exphβ√NXN(h,σ)i,(0.7)

where β>0 is the real parameter called the inverse temperature. The sum (0.7) is called

the partition function. Of course, if β≈0 the summands in (0.7) are on the same scale of

magnitude and, hence, the law of large numbers applies. As we shall see below, the situation

becomes radically different and the law of large numbers breaks down, if βincreases above

a certain h-dependent threshold β0(h).

2. The Sherrington-Kirkpatrick model with multidimensional spins. The Sherrington-

Kirkpatrick (SK) model was introduced by Sherrington & Kirkpatrick (1975). This model

of a mean-field spin-glass has long been one of the most enigmatic models of statistical me-

chanics. Mathematically it boils down to the sum of correlated exponentials with a genuine

Introduction 3

non-hierarchical correlation structure. The correlations in the SK model are induced by

the Hamming distance on the discrete hypercube ΣN. In this thesis, we study the following

generalisation of the SK model. Consider arbitrary (not necessarily discrete) subset Σ⊂

and the corresponding index set ΣN:=ΣN. We define the family of Gaussian processes

X:={X(σ)}σ∈ΣNas

X(σ) = XN(σ):=1

∑

i,j=1

gi,jhσi,σji,(0.8)

where G:={gi,j}N

i,j=1consists of i.i.d. standard normal random variables and, for x,y∈

hx,yi:=∑d

u=1xuyuis the standard Euclidean scalar product. The SK model with multidi-

mensional spins is based on the following infinitesimal sum of correlated exponentials

ZN(β):=ZΣN

expβ√NX(σ)dµ⊗N(σ),(0.9)

where µ∈Mf(Σ)is some arbitrary (not necessarily uniform or discrete) finite a priori

measure. The sum (0.9) is called the partition function.

To investigate limiting behaviour of the sums (0.7) and (0.9), it is natural to ask the following

questions.

As a first approximation, we consider the limiting behaviour of ZN(β)on a logarithmic scale.

Consider the random variable

pN(β):=1

NlogZN(β),(0.10)

where ZN(β)is given either by (0.9) or by (0.7). Somewhat abusing physical terminology we

shall call (0.10) the free energy.

Question 0.0.1. Does pNconverge to some random variable p as N ↑+∞? If convergence

holds, then in what sense (i.e., in probability, almost surely or L1)? Finally, can we compute p?

Assume that we have computed p(β), for all β>β0. It is intuitively clear that being able to

compute the limit

lim

β↑+∞β−1p(β) =:m

in many cases means that

lim

N↑+∞N−1/2max

σ∈ΣN

XN(σ)=m.(0.11)

Thus, knowing p(β)for large βprovides very sharp information on extremes of the process

XNas N↑+∞. That is, the constant mnot only bounds the extremes, but gives also their exact

value. This is unusually good.

The second natural more refined approximation is to study the “fluctuations” (weak limiting

laws) of the sum ZN.

4 Introduction

Question 0.0.2. Does there exist some scaling sequence of constants {CN}∞

N=1and a nontrivial

random variable Z0so that the weak convergence CNZN

−−−→

N↑+∞Z0holds? Is the distribution of

the random variable Z0to some extent universal?

Question 0.0.2 is, of course, harder than Question 0.0.1, but it seems to be exactly the ques-

tion that is claimed to be positively answered in theoretical physics.

The main purpose of this thesis is to compute the free energy (Question 0.0.1) and the fluctu-

ations of the partition function (Question 0.0.2) in the N↑+∞limit for the GREM with external

field and the SK model with multidimensional spins. We find that in both models similar hierar-

chical ultrametric structures do emerge in the N↑+∞limit. For the GREM with external field

we are able to give a complete positive answer to both Questions 0.0.1, 0.0.2. For the general

case of the SK model with multidimensional spins we are only able to give upper and lower

bounds on the free energy. However, for the special case, where µ(cf. (0.9)) is a Gaussian

distribution, we compute the limiting free energy and give a complete answer to Question 0.0.1.

To be more specific, concerning the SK model with multidimensional spins we prove upper

and lower bounds on the free energy (0.10) in terms of variational inequalities involving the

corresponding multidimensional generalisation of the Parisi functional (Theorems 3.1.1, 3.1.2,

4.1.1, 4.1.2). For this purpose, we generalise and unify the Aizenman-Sims-Starr (AS2) and

Guerra’s schemes for the case of multidimensional spins, and employ a quenched large devia-

tions principle (LDP) which may be of independent interest (Theorems 3.3.1 and 3.3.2). Both

schemes are formulated in a unifying framework based on the same comparison functional.

The functional acts on Gaussian processes indexed by an extended configuration space as in

the original AS2scheme. As a by-product, we provide also a short derivation of the remainder

term in multidimensional Guerra’s scheme (Theorem 4.1.4) using well-known properties of the

Ruelle probability cascades (RPC) and the Bolthausen-Sznitman coalescent. This gives a clear

meaning to the remainder in terms of averages with respect to a measure changed disorder.

The change of measure is induced by a reweighting of the RPC using the exponentials of the

GREM-inspired process.

We study the properties of the multidimensional Parisi functional by establishing a link be-

tween the functional and a certain class of non-linear partial differential equations (PDEs),

see Propositions 4.2.1, 4.2.2 and Theorem 4.2.2. We extend the Parisi functional to a contin-

uous functional on a compact space (Theorems 4.2.1, 4.2.2). We show that the class of PDEs

corresponds to the Hamilton-Jacobi-Bellman (HJB) equations induced by a linear problem of

diffusion control (Proposition 4.2.4). Motivated by a problem posed by Talagrand (2006c), we

show the strict convexity of the local Parisi functional in some cases (Theorem 4.2.4).

We partially extend Talagrand’s methodology of estimating the remainder term to the mul-

tidimensional setting (Theorem 4.1.4, Proposition 4.3.1, Theorem 4.3.1). In the case of multi-

dimensional Gaussian a priori distribution of spins we prove the validity of the Parisi formula

(Theorem 5.1.1).

Concerning the GREM in the presence of uniform external field, we find the fluctuations

of the ground state and of the partition function in the thermodynamic limit for all admissible

values of the model parameters (Theorems 6.1.1, 2.3.1, 6.1.3). An explicit formula for the free

energy of this model is given (Theorem 6.1.4). We also derive some large deviation results

providing an expression for the free energy of a class of models with external field and Gaussian

Hamiltonians. (Theorem 6.2.1). We prove that the coarse-grained parts of the system tend to

Introduction 5

have a certain optimal magnetisation as prescribed by the strength of external field and by

parameters of the GREM.

The thesis is organised as follows (see Figure 0.1 for the chapter dependency graph).

1 2

Fig. 0.1. Chapter dependency graph

Chapter 1 contains a description of some modelling ideas of statistical mechanics of disor-

dered systems. We fix some basic notation and terminology, and introduce the models used

throughout the work (in particular, the random energy model (REM), the GREM and the SK

model).

Chapter 2 is a short account of mathematical results available in the literature which are either

directly used in the arguments of this thesis or give a suggestive background and useful ex-

amples of what could be achieved. In particular, we discuss in this chapter some interpolation

techniques for Gaussian random variables, comparison theorems, concentration of measure,

the Poisson-Dirichlet processes, Ruelle’s probability cascades and the Bolthausen-Sznitman

coalescents.

Chapter 3 is the first chapter of this thesis dealing with the SK model with multidimensional

spins. We extend the AS2scheme to this model. We record some basic properties of co-

variance structure of the process (0.8) and establish the relevant concentration of measure

results. The chapter contains the tools allowing to compare and interpolate between the free

energy-like functionals of two different Gaussian processes. We derive a quenched LDP of

the G¨

artner-Ellis type under measure concentration assumptions. We conclude the chapter

with the derivation (based on the AS2scheme) of upper and lower bounds on the free energy

in terms of the saddle point of a Parisi-like functional.

Chapter 4 employs the ideas of Guerra’s comparison scheme in order to obtain upper and lower

bounds on the free energy in the SK model with multidimensional spins. It contains also a

useful analytic representation of the remainder term which is used in the next chapter. We

also study some properties of the multidimensional Parisi functional such as differentiability

and convexity. The chapter is concluded by a partial extension of Talagrand’s remainder term

estimates to the case of the SK model with multidimensional spins.

Chapter 5 concerns the SK model with multidimensional Gaussian a priori distribution of spins.

The main result of this chapter is a derivation of the local Parisi formula for the SK model

with multidimensional spins.

Chapter 6 is devoted to study of the fluctuations of the ground state and of the partition function

for the REM and GREM with external field. We also provide an explicit formula for limiting

free energy of these models. We obtain some large deviation results providing an expression

for the free energy of a class of models with Gaussian Hamiltonians and external field.

6 Introduction

Chapter 7 lists some open problems related to the models studied in this thesis.

Appendix A contains a proof of the almost superadditivity of the local free energy in the SK

model with multidimensional spins. It is an application of the Gaussian comparison results

we derived in Chapter 3.

Acknowledgements

Foremost, I would like to express my sincere gratitude to my adviser Prof. Anton Bovier for

sharing his vision, erudition and understanding of the fascinating subject of disordered systems

with me. Not less important for me were his optimistic approach to resolution of the problems,

his skill to understand the subject of matter almost without any words from my side, and his

very stimulating habit of dissecting the problem at hand by asking extremely sharp questions.

I would like to thank Prof. Erwin Bolthausen for agreeing to be the coexaminer of this thesis.

I have the pleasure thank the members of the Research Groups on Probability Theory and

Mathematical Finance (Technische Universit¨

at Berlin), on Interacting Random Systems (Weier-

strass Institute for Analysis and Stochastics) and on Probability Theory (Universit¨

at Potsdam)

for giving me the opportunity to start to understand the beauty and diversity of probability the-

ory.

I am indebted to Prof. Mariya Shcherbina for giving me the initiation in research on disor-

dered systems and for sharing her vision with me. Besides, I would also like to thank my lyceum

and university lecturers and professors – the representatives of the Kharkiv mathematical school

– for giving me the start in modern mathematics.

I am grateful to Prof. Michel Talagrand and Prof. Dmitry Panchenko for important remarks.

I would like to warmly thank my colleagues Stefan Ankirchner, Andrej Depperschmidt,

Alexander Drewitz, Matthias an der Heiden, Ivan Izmestiev, Mesrop Janunts, Pierre-Yves Louis,

Martin Slowik, Michel Sortais, Cristian Vormoor, Wiebke Wittm¨

uß for greatly contributing to

the cheerful atmosphere I enjoyed both at and after work. I thank Elise Luckfiel for her help

with peculiarities of the German language.

This work would totally be unthinkable without the exceptionally wise guide in life and

science, who was and is for me my prematurely passed away father. Completion of this thesis

would simply be impossible without love and steady support of my mother and professional

psychological help of my sister.

Financial support of both the Deutsche Forschungsgemeinschaft (through the Research

Training Group “Stochastic Processes and Probabilistic Analysis”) and the Helmholz-Gemein-

schaft is gratefully acknowledged.

Framework

In this chapter, we fix the framework and notations we shall use in the present work. Besides,

we try to give some motivational background to the set of problems we shall deal with.

1.1 Physical roots and vision

Starting from the 1970’s, an increasing amount of research in statistical mechanics has been

targeted at modelling the behaviour of magnetic materials collectively called spin-glasses. This

led to a group of models usually referred to as disordered spin systems. In general, disordered

systems are complex systems with spatial inhomogeneity. On physical grounds, the typical rep-

resentatives of these systems are alloys, composite materials, porous media, polymers, to name

a few. As became clear in recent decades, advances in understanding of the models of disor-

dered systems made in theoretical physics allow for their successful application to systems well

beyond the usual scope of physics. For example, we can name the complex systems arising

in computer science (combinatorial optimisation, computational complexity, see Hartmann &

Weigt (2005); M´

ezard & Montanari (2007), information processing, see M´

ezard & Montanari

(2007); Nishimori (2001); Opper & Saad (2001)), cognitive science (Amit, 1989; Arbib, 2003;

Nishimori, 2001), finance (agent based modelling, see Challet et al. (2005); Coolen (2005); risk

modelling, see Bouchaud & Potters (2003)), social sciences (Contucci & Graffi, 2007; Durlauf,

1999). Moreover, the models of disordered systems, the vision and heuristics developed in the-

oretical physics are abstract enough to be viewed as a collection of very sharp statements about

basic purely mathematical objects. This explains the ever growing interest in bringing these

heuristics on rigorous mathematical grounds in order to understand the real domain of their

applicability.

Often the precise mechanisms of inhomogeneity in disordered systems are not known in

advance or have a very complicated description. A way to model such situation is to assume

that instead of a full deterministic description of the system we have some statistical data on the

distribution of inhomogeneities. In that case, the inhomogeneities can be modelled by random

processes. This randomness is usually referred to as a quenched disorder1. Disordered systems

are often studied in the framework of statistical mechanics which is also the viewpoint adopted

in the present thesis.

1The term “quenched” emphasises a certain analogy with materials obtained by heat treatment which is called in metallurgy

quenching. The method amounts to rapid cooling of a heated up sample of metal in order to preserve (or freeze) inhomo-

geneities in it. In contrast, annealing is a process of slow cooling which gives enough time for homogenising diffusion

processes.

8 1 Framework

The main objective of statistical mechanics according to Nishimori (2001) is “the clarifi-

cation of the macroscopic properties of many-body systems starting from the knowledge of

interactions between microscopic elements”. In turn, statistical mechanics of disordered sys-

tems is a “particularly difficult, but also particularly exciting, branch of the general subject, that

is devoted to the same problem in situations when the interactions between the components are

very irregular and inhomogeneous, and can only be described in terms of its statistical prop-

erties” (Bovier, 2006). For introductory information on disordered systems we refer to Bovier

(2006).

Highly nontrivial heuristics such as the replica symmetry breaking (RSB) have been de-

veloped in theoretical physics to explain the very complex behaviour of even the “simplest”

spin-glass models both in dynamics and equilibrium (see, e.g., the landmark monograph by

M´

ezard et al. (1987)). Interestingly, these “simple” models are actually very basic but very little

studied mathematical objects which starting from the 1980’s became the subject of increasingly

active mathematical research.

One of the main messages of physical theory of the hierarchical RSB is the following ambi-

tious but unrigorous universality hypothesis.

The main characteristic observables (order parameters) of huge disordered spin systems

with long range interactions often have ultrametric (tree) structure.

The domain of applicability of the above heuristic seem to be broader than that of the central

limit theorem (CLT). Despite considerable efforts in recent years, only rather basic predictions

of physical theories were proved rigorously for the most interesting mean-field spin-glass mod-

els. Mathematical research on spin-glasses is still concentrated on several most simple cases

with more or less ad hoc methods specific to the concrete instance. There is, however, a hope

that there exists some general, yet not discovered, unifying mathematical theory explaining all

predictions on spin-glasses made in theoretical physics and, especially, the universality hypoth-

esis.

Rigorous understanding of applicability domain of the universality hypothesis would be a

very useful result, since the tree structures are dramatically more tractable than the general

graphic structures. This allows one not only to get substantially deeper analytical insight, but

also to construct faster algorithms solving hard problems at hand.

1.2 Statistical mechanics of spin systems

In this section, we recall some basic objects, terminology and facts used in statistical mechanics

of spin systems.

1.2.1 Basic objects

Spin system consists of sites indexed by some finite index set Λ. Assume Σ⊂

d. To each site

we attach an independent copy of the configuration probability space (Σ,S,µ). In this thesis,

we shall consider the following configuration spaces: Σ={−1;1}, compact subsets of

dand,

finally, the whole

ditself. Compactness of the configuration space Σis usually assumed. In

Chapter 5, we shall, however, treat an example of a non-compact configuration space. The

1.2 Statistical mechanics of spin systems 9

elements σ∈Σare traditionally called spins2or just configurations. A configuration gives

complete microscopic description of the site. In applications, the reference measure (or a priori

distribution)µis often just uniform. The overall configuration space of the whole system is

often a product set ΣΛ:=ΣΛ. Denote µΛ:=µ⊗Λand also SΛ:=SΛ. Then (ΣΛ,SΛ,µΛ)is

the (product) a priori probability space.

Remark 1.2.1. Sometimes we shall deal with the a priori measures µ∈Mf(Σ), i.e., with the

measures without probabilistic normalisation. This will not change (modulo a constant shift)

the quantities of interest.

Having defined the configuration space, we would like to have some means to compare the

individual configurations. For that purpose we assume that we are given some (measurable) cost

functional on the configuration space H:ΣΛ→

which is called the Hamiltonian. We say that

the configuration σ(1)∈ΣΛis energetically more favourable than the configuration σ(2)∈ΣΛ,

if H(σ(1))>H(σ(2)).

Remark 1.2.2. Physical convention is that the configurations with smaller energies are more

favourable. Mathematically this is only a matter of considering the Hamiltonian ˜

HΛ:=−HΛ.

The inverse temperature is the real parameter β∈

+. A Gibbs measure (also called the

Boltzmann distribution) GΛ(β)∈M1(ΣΛ)is a measure satisfying

dGΛ(β)

dµΛ

ZΛ(β)expβHΛ,(1.1)

where necessarily

ZΛ(β):=µΛexpβHΛ.(1.2)

The normalisation constant (1.2) is referred to as the partition function.

Remark 1.2.3. Since we do not assume the compactness of Σ, we have to assume the existence

of (1.2), for all β≥0. The partition function (1.2) is a very familiar object. It is just the Laplace

transform of the random variable HΛon the a priori probability space (ΣΛ,SΛ,µΛ). This

immediately implies that the partition function is increasing and also (by the H¨

older inequality)

logarithmically convex function of β. Moreover, if Σis a compact set, then ZΛ(β)is an analytic

function of β.

A mathematical pattern behind (1.1) is as follows. At inverse temperature β=0, or equiv-

alently, at infinite temperature T :=β−1= +∞, we obviously have GΛ(β) = µΛwhich is a

sound physical fact. A system which is extremely heated up almost ignores its internal ener-

getic barriers. Hence, due to the analyticity of the Gibbs weights (with respect to β), we have

in a (sufficiently small) neighbourhood of β=0 that irrespective of the structure of the Hamil-

tonian the system looks just so as there is no energetic barriers at all.

On the other extreme, if we consider the low temperature limit, i.e., let β↑+∞(equivalently

T↓+0) the system is increasingly exposed to energetic costs. That is, the Gibbs measure is

more and more concentrated around the energetically most favourable configurations which are

2This terminology is a reminiscent of the fact that initially spin systems were developed as microscopic models of magnetism,

where each site models an atom or molecule and a spin state is an orientation of its individual magnetic moment.

10 1 Framework

the maxima of the Hamiltonian HΛ. Finally, in the low temperature limit the system “freezes”

completely at the maxima of the Hamiltonian HΛ. That is, the support of the Gibbs measure

coincides with the set argmaxHΛwhich is called the set of ground states (Remark 1.2.2 can

make this terminology more transparent). More formally, assume (for simplicity) that Σis a

compact set, HΛ∈C(Σ), and cardargmaxHΛ<∞. If we denote the uniform measure on the

set argmaxHΛby U(argmaxHΛ)∈M1(ΣΛ), then it is easy to show that

kGΛ(β)−U(argmaxHΛ)kTV −−−→

β↑+∞0.(1.3)

It is exactly this pattern relating the Gibbs measure and the maxima of the Hamiltonian HΛ

which makes the Gibbs measures attractive in applications. Often the problems of finding the

extrema of functions depending on a large number of variables are computationally hard. Gibbs

measures provide the whole family of objects indexed by the real parameter β∈

+. This allows

to approach (both analytically and algorithmically) the hard problem at β= +∞gradually.

1.2.2 Thermodynamic quantities

The observables are the measurable functions of spin configurations O:ΣΛ→

3. An important

quantity is the thermodynamic potential defined as

Λ(β):=logZΛ(β).(1.4)

Remark 1.2.3 implies that the thermodynamic potential is convex. This quantity has the follow-

ing useful property (cf. (1.3))

lim

β↑+∞

βP

Λ(β) = max

σ∈ΣΛ

HΛ(σ).(1.5)

Another useful property of this quantity is that it allows to compute averages with respect to the

Gibbs measure of other observables included in the Hamiltonian. For example, we have

dβP

Λ(β) = GΛ(β)HΛ.

More generally, given some macroscopic observable O:ΣΛ→

, fix some real parameter h∈

and define, for σ∈ΣΛ, the modified Hamiltonian HΛ(h;·):ΣΛ→

as follows

HΛ(h;σ):=HΛ(σ)+hO(σ).

Then we have

dhP

Λ(β) = GΛ(β)[O].

3In other words, the observables are the random variables on the configuration space.

1.2 Statistical mechanics of spin systems 11

1.2.3 The elementary Gibbs variational principle

Another motivation for introducing the thermodynamic potential (and the Gibbs measure) is the

fact that they solve a natural variational problem.

The functional S:M1(Σ)2→

+is called the relative entropy (or a Kullback-Leibler diver-

gence) of ν1with respect to ν2, if the following holds

S(ν1|ν2):=(ν2hlogdν1

dν2i,ν1ν2,

+∞,otherwise.

(1.6)

Define the functional ϕΛ(β;·):M1(ΣΛ)→

∪{−∞}as

ϕΛ(β;ν):=βν HΛ−S(νkµΛ).

The following proposition holds true.

Proposition 1.2.1 (elementary Gibbs variational principle).For any β∈

+, we have

Λ=ϕΛ(β;GΛ(β)) = max

ν∈M1(ΣΛ)ϕΛ(β;ν).(1.7)

Remark 1.2.4. See, e.g., Ellis (2006) for more complicated instances of the same proposition

for infinite-volume (cardΛ=∞) spin systems.

In words, (1.7) means that the Gibbs measure GΛ(β)realises the best possible balance be-

tween the relative entropy and energy. Besides, the Gibbs variational principle gives a straight-

forward way to obtain lower bounds on the thermodynamic potential by choosing various trial

measures in (1.7).

In Chapter 3, following the seminal work of Aizenman et al. (2003), we obtain another

variational scheme for the SK model with multidimensional spins. This scheme expresses the

free energy of this model as the value of the Parisi functional at its saddle point (Theorems 3.1.1

and 3.1.2).

1.2.4 Thermodynamic limit

It is often hopelessly hard to study exact behaviour of thermodynamic quantities of interest for

finite but large systems due to their enormous dimensionality and the presence of irregularities.

The receipt of statistical mechanics is to approximate the quantities of interest for the finite

systems by their counterparts from the limiting infinite system. This approximation procedure

is called the thermodynamic limit.

Limiting thermodynamic potential

Consider the sequence of finite index spaces {ΛN}N∈

and the corresponding family of the

Hamiltonians {HΛN}∞

N=1such that

lim

N↑+∞cardΛN= +∞.

In what follows, we assume that the index spaces {ΛN}N∈

are organised such that, for all

N∈

,ΛN⊂ΛN+1⊂Λ0, where Λ0is some countable set such that Λ0=SN∈

ΛN.

12 1 Framework

Question 1.2.1 (thermodynamic potential).What is the asymptotic behaviour of thermody-

namic quantities such as the thermodynamic potential P

ΛNin the thermodynamic limit, i.e.,

as N ↑+∞? Does the limit exists? What is the behaviour of the limit with respect to β?

To fix some natural scale, it is by convention assumed in statistical mechanics that thermo-

dynamic quantities grow asymptotically linearly4with size of the system, that is,

ΛN(β)∼

N↑+∞p(β)cardΛN,(1.8)

where p(β)is some real constant. Therefore, in view of Question 1.2.1 and for notational con-

venience, we define the density of thermodynamic potential which is called the free energy

pN(β):=1

cardΛN

ΛN(β).(1.9)

Note that the assumption (1.8) implies that

lim

N↑+∞pN(β) = p(β).(1.10)

Infinite-volume Gibbs measures

The following question is rather imprecise as it is stated.

Question 1.2.2 (limiting Gibbs measure).Can we describe the limiting behaviour of the se-

quence of Gibbs measures {GΛN(β)}N∈

? What are the limiting objects of this sequence?

A clean way to answer the previous question is based on the concept of the Dobrushin-

Lanford-Ruelle (DLR) states, see, e.g., Bovier (2006); Simon (1993). This approach requires,

however, more structure from the Hamiltonian than it has been assumed previously. In partic-

ular, the Hamiltonian should (1) be defined for any ΛbΛ0, (2) be the function defined on the

whole ΣΛ0, i.e., HΛ(·):ΣΛ0→

, and (3) have the following form

HΛ(σ) = ∑

A:A∩Λ6=/0

UA([σ]A),(1.11)

where, for AbΛ0, the interaction potential UA:ΣA→

is the measurable with respect to SA

function. In this case, given any index subset ΛbΛ0and any external condition σ(c) ∈ΣΛ0\Λ,

we define the local specification GΛ(β,σ(c))∈M1(ΣΛ0), for a measurable set ∆⊂ΣΛ0, as

GΛ(β,σ(c))(∆):=1

ZΛ(β,σ(c))ZΛ

∆(σqσ(c))expβHΛ(σqσ(c))dµΛ(σ),(1.12)

where

ZΛ(β,σ(c)):=ZΛexpβHΛ(σqσ(c))dµΛ(σ).

4In physics such quantities are called extensive.

1.3 Spin-glasses 13

The set G0(β)⊂M1(ΣΛ0)of all infinite-volume Gibbs measures (or states) is the set of

measures G0(β)∈G0(β)such that, for all index sets ΛbΛ0, and all external conditions

σ(c) ∈ΣΛ0\Λ, we have

G0(β){·|[σ]Λ0\Λ=σ(c)}=GΛ(β,σ(c)){·},(1.13)

G0(β)-almost surely. Equation (1.13) is called the DLR equation. Under certain regularity con-

ditions (see, e.g., (Bovier, 2006, Corollary 4.2.8)) it can be proved that the set of all infinite-

volume measures G0(β)is not empty and can be constructed by rather straightforward limiting

procedure. Namely, it can be constructed as a convex hull of a set of all weak limiting points (as

N↑+∞) for the following sequences of local specifications {GΛN(β,σ(c)

N)}N∈

with all possible

external conditions {σ(c)

N}N∈

⊂ΣΛ0\ΣΛN.

The above set of ideas and especially the condition (1.13) shows that the Gibbs measures

generated by the Hamiltonians (1.11) can be viewed as graphical (spatial) generalisations of

Markov chains (see, e.g., Preston (1976)). Such generalisations (e.g., Markov random fields)

are important in various applications such as information processing, see, e.g., Winkler (2003).

In this thesis, we shall deal with mean-field systems, where the external conditions are ir-

relevant. Moreover, the Hamiltonians of such systems are not representable in the form (1.11).

Thus, the described above framework of the DLR equations is not applicable in the context of

mean-field models. See Section 1.3.4, for a more precise formulation of the questions which

will be dealt with in the present work.

Remark 1.2.5 (infinite-volume Gibbs variational principle).An alternative to the DLR states

way of defining the infinite-volume Gibbs measure is by using the infinite-volume analogues of

Proposition 1.2.1, see, e.g., Ellis (2006).

Phase transitions

After passing to the thermodynamic limit, many kinds of interesting effects can occur at the level

of free energy (cf. (1.10)). For example, the free energy may become a non-analytic function of

β. Such situation is usually referred to as the phase transition. In the context of the DLR states,

another indicator of the phase transition at the level of the Gibbs measure is the situation, where

the set G0(β)contains more than one element.

1.3 Spin-glasses

In this section, we introduce disordered spin systems we are going to study in the present work.

These are the spin systems with strongly randomised interactions.

1.3.1 Basic mathematical objects

In this subsection, we introduce the deterministic objects that will be randomised afterwards

and used as building blocks of spin-glass models.

Deterministic instances

We start from definition of non-disordered Hamiltonians with pair interactions.

14 1 Framework

The interaction kernel is the real matrix A∈

Λ×Λ. The external field is the real vector

h= (hi)i∈Λ∈

Λ. The (pair interaction) Hamiltonian is the mapping HΛ(A,h):ΣΛ→

defined

HΛ(A,h;σ):=∑

i,j∈Λ

Ai,jhσi,σki+∑

i∈Λhhi,σii.(1.14)

Remark 1.3.1. More generally, for p ∈

, one can consider the p-spin interactions defined as

follows. Let A(p)∈

Λpbe a real tensor of rank p ∈

. We also set d =1for simplicity. Define

the p-spin Hamiltonian HΛ(A(p),h):ΣΛ→

in the following way

HΛ(A(p),h;σ):=∑

i1,...,ip∈Λ

Ai1,...,ipσi1σi2···σip+∑

i∈Λ

hiσi.(1.15)

Remark 1.3.2. The presence of external field (i.e., h 6=0) in (1.14) introduces a bias which

makes the mapping (1.14) less symmetric by excluding, e.g., the rotational symmetry

HΛ(A,0,Oσ) = HΛ(A,0,σ),

where O ∈O(d)is an arbitrary orthogonal matrix.

Remark 1.3.3. An important difference of the Hamiltonian (1.14) from the Hamiltonian (1.11)

is that the former one does not depend on spins outside Λ. For this reason, we say that the

former Hamiltonian does not interact with the exterior of Λ.

Graph of interacting sites

Note that the Hamiltonian (1.14) naturally induces the graph of interacting sites or interaction

graph G = (V,E), with vertices V:=Λand edges E:={(i,j)∈Λ2|Ai,j6=0}.

1.3.2 Quenched disorder and random processes

Disorder

In the sequel, we shall assume that the interaction matrix Ais the random matrix on the proba-

bility space of disorder (Ω,F,

). This probability space is always assumed to be large enough

to contain all random variables we shall be dealing with (except those explicitly defined on the

spin configuration space ΣΛ). We shall be interested in typical behaviour of systems governed

by the random Hamiltonians (1.14).

Random processes

Due to the presence of disorder, the spin-glass Hamiltonian (1.14) can be viewed as a random

process on the probability space of disorder that is indexed by the configuration space ΣΛ(cf.

Definition 2.1.1). This “Hamiltonian-as-a-random-process” point of view we shall adopt in what

follows. In the sequel, we shall consider not only the pair interaction random Hamiltonians but

also the “abstract” random processes HΛ:={HΛ(σ)}σ∈ΣΛN

on the probability space of disorder.

Such Hamiltonians induce the random free energy and the random Gibbs measure as specified

by (1.9) and (1.1).

To lighten the notation, we shall use the following simplifications. Given some random inter-

action matrix A, we shall write simply HΛ(h;σ)instead HΛ(A,h;σ). Given β∈

+and h∈

the random Gibbs measure induced by HΛ(h;σ)at inverse temperature βwill be denoted by

GΛ.

1.3 Spin-glasses 15

1.3.3 Realistic spin-glass models

We start from two motivating examples of the physical relevance.

The Ruderman-Kittel-Kasuya-Yoshida (RKKY) model

Physically, spin-glasses are amorphous magnetic alloys. We can imagine the following mathe-

matical model. Given some dimension d∈

, let XN⊂

dbe the box XN:= [−N;N]d. Suppose

we have a family of i.i.d. random variables {ri}Nd

i=1uniformly distributed on XN. Realisations

of the family rmodel the quenched positions of impurities in the sample of material XN. Thus,

we are dealing with quenched inhomogeneities5. Put ΛN:={1,...,Nd}. Let further J:

→

be an even function decaying to zero at infinity. Assume that the spin configuration space is the

unit sphere Σ:=

d−1. Given the direction of external field h ∈

d, we define the Hamiltonian

HRKKY

N(h):ΣΛN→

HRKKY

N(h;σ):=∑

i<j

i,j∈ΛN

J(ri−rj)hσi,σji+hh,∑

i∈Λ

σii,σ∈ΣΛN.(1.16)

A physical example of the function Jis the RKKY exchange potential

J(r):=1

krk3cos(krk).(1.17)

Physical reasons for this form of the exchange potential Jcan be found, e.g., in Fischer & Hertz

(1991). The Hamiltonian (1.16) becomes a rather complex expression with the above choice

of the exchange potential (1.17). The summands involving the exchange potential decay (albeit

slowly) as the distance between the interacting sites grows. Hence, the interaction between the

spins i,j∈ΛNwith large distance kri−rjkbetween them gives only a small contribution to

(1.16). Moreover, the exchange potential oscillates taking positive and negative values. Due

to these oscillations, there are summands in (1.16) with Ji,j:=J(ri−rj)>0 (ferromagnetic

interaction). In this case collinear spins σiand σjare energetically the most favourable. There

are, however, nonnegligible summands in (1.17) with Ji,j<0 (antiferromagnetic interaction).

In the latter case anticollinear spins are energetically the most favourable. These two cases may

impose contradicting requirements on the spin configuration σ, since, for the triple i,j,k∈ΛN

such that i<j<kwith non negligible probability we have Ji,j>0, Jj,k>0 and Ji,k<0. This

conflicting non transitive situation is called frustration.

Unfortunately, even on the heuristic qualitative level of theoretical physics the thermody-

namic limit of the RKKY model seems to be very hard to analyse, see, however, Zegarli´

nski

(1998).

The Edwards-Anderson (EA) model

The EA model (Edwards & Anderson, 1975) is an attempt to simplify the complicated picture

of the previous model, while retaining the most important features of it, namely the frustration,

quenched disorder and locality of interactions.

Let ΛN:=

d∩[−N;N]dand also let Σ:={−1;1}⊂

. Let now {gi,j}i,j∈ΛNbe a family of

i.i.d. standard Gaussian random variables. Consider the following EA Hamiltonian

5An example of such material is the Cu-Mg alloy (with 1% of Mg).

16 1 Framework

HEA

N(h,σ):=∑

ki−jk2=1

i,j∈ΛN

gi,jσiσj+h∑

i∈Λ

σi.(1.18)

At the time of writing, there seems to be no consensus (except for the cases d=1 and

“d=∞”6) in the physical literature about the qualitative picture of the EA model in the thermo-

dynamic limit (see Newman & Stein (2007) for a recent review).

1.3.4 Thermodynamic limit in spin-glasses

To lighten the notation in what follows, we shall simply write Ninstead of ΛNin quantities

depending on ΛN. We shall sometimes drop the subscript index Nin occurrences, where it stays

fixed and its value is clear from the context.

Given the random interaction matrix A, we can naturally extend Questions 1.2.1 and 1.2.2 to

the case of quenched disorder and ask a few additional ones.

The extension of Question 1.2.1 to the case of quenched disorder is Question 0.0.1 from the

introduction. This is the first main question being addressed in the present work.

Question 1.3.1 (maximization algorithm for the average-case scenaio).Can one produce an

efficient (for the average-case scenario) algorithm for finding the maximiser of (1.14) or at

least an “almost maximiser” of it?

For algorithmic purposes of Question 1.3.1, it would be interesting to know an answer to the

quenched version of Question 1.2.2 and, in particular, to the following.

Question 1.3.2 (structure of the maximisers).What is the typical structure of the (almost) max-

imisers of (1.14)? How many maximisers are there? Are they very different?

Since the information about the almost maximisers can be gained from the geometry of the

Gibbs measure, it is natural to ask also the following.

Question 1.3.3 (structure of the random Gibbs measure).Does the random Gibbs measure look

almost surely uniform or it concentrates around certain random domains in the thermodynamic

limit? Can we construct the limiting random object that encodes the structure of the random

Gibbs measure in the thermodynamic limit?

Question 1.3.3 is certainly very informal. The partial excuse is that at the writing time there

is no general framework which gives a proper limiting description of the random Gibbs mea-

sure. See, however, Bovier (2006); Newman & Stein (2007), and references therein (cf. also

Section 1.5.3).

In this thesis, we shall instead of Question 1.3.3 deal with closely related Question 0.0.2

from the introduction.

To make Questions 1.3.2 and 1.3.3 more sensible, we have to specify some metric structure

on the configuration space. This metric structure depends, of course, crucially on the choice of

the Hamiltonian. In the next section we shall consider some representative examples.

6The latter case is the alias for the SK model on which more is said below.

1.4 Some mean-field Gaussian spin-glass models 17

1.4 Some mean-field Gaussian spin-glass models

In this section, we define the main models considered in the present work. They fall into the

class of mean-field spin-glass models which Hamiltonians are Gaussian processes indexed by

high dimensional configuration spaces.

The term mean-field refers to the idea to approximate complex interaction geometries (as in

the EA model) by certain effective, in some sense averaged, interactions with “simpler” geome-

tries. One way to make this approximation rigorous is Proposition 1.2.1, where one restricts the

maximisation to product measures, see, e.g., Opper & Saad (2001); Simon (1993).

1.4.1 The Sherrington-Kirkpatrick model

The celebrated Sherrington-Kirkpatrick model is based on a very simple instance of the Hamil-

tonian (1.14) on a fully connected graph G. The model ignores completely the locality of inter-

actions but retains high degrees of frustration and quenched disorder. A large body of heuristics

concerning spin-glasses were developed in theoretical physics during the analysis of the SK

model and then were applied to other models. In a sense, the SK model is the most understood

spin-glass model in theoretical physics. Some of heuristics concerning Question 0.0.1 have been

rigorously confirmed for the SK model in the series of papers by Aizenman et al. (2003); Guerra

(2003); Guerra & Toninelli (2002); Talagrand (2006b), see also Section 1.5.5 for some details.

But still, as of the time of writing, the model contains quite a few puzzles, especially concern-

ing Questions 1.3.1, 1.3.2 and certainly the hardest Question 1.3.3. For a recent short account

of mathematical results on this model see Bolthausen (2007) and references therein.

To define the SK model, we make the following assumption.

Assumption 1.4.1 (Ising spins).Let the index space be ΛN:= [1;N]∩

and let the spin config-

uration space be the Ising one, that is Σ:=ΣIsing :={−1;1}. Therefore, the spin configuration

space ΣΛNof the whole system is the discrete hypercube {−1;1}N. We also consider the uniform

a priori distribution µon the configuration space Σso that the individual spin configurations

are the Rademacher random variables on the a priori probability space.

Let {gi,j}N

i,j=1be a family of i.i.d. standard Gaussian random variables (the same as in

(1.18)). We set Ai,j:=1

√Ngi,jand hi:=h∈

. Hence, the Hamiltonian (1.14) assumes the

form of the SK Hamiltonian

HSK

N(σ):=1

√N

∑

i,j=1

gi,jσiσj+h∑

i∈Λ

σi.(1.19)

The reason for the normalisation factor 1/√Nin (1.19) is to satisfy our assumption (1.8).

Remark 1.4.1. The Gaussian character of the interaction matrix is not very crucial, at least, if

Question 0.0.1 is concerned, see Carmona & Hu (2006). However, it makes the analysis easier.

The SK Hamiltonian (1.19) is, thus, a Gaussian process on the spin configuration space ΣN.

For convenience, we shall also use the following centred Gaussian process

SKN(σ):=1

∑

i,j=1

gi,jσiσj.(1.20)

18 1 Framework

The distribution of this process is completely determined by its covariance structure

hSKN(σ(1))SKN(σ(2))i= 1

∑

i=1

σ(1)

iσ(2)

i!2

The overlap is the mapping R:Σ2

N→[−1;1]defined as

R(σ(1),σ(2)):=1

∑

i=1

σ(1)σ(2).(1.21)

In words, the quantity (1.21) measures the number of matching coordinates of the vectors

σ(1)and σ(2). In fact, if we denote the Hamming distance between σ(1)and σ(2)through

dH(σ(1),σ(2)), then we have

R(σ(1),σ(2)) = 1−2dH(σ(1),σ(2)).(1.22)

Remark 1.4.2 (canonical L2-distance).The Hamming distance can also be obtained (modulo

a constant factor) as the canonical L2-distance induced by the Gaussian process SKNon the

configuration space ΣN

dH(σ(1),σ(2)):=1

√2q

(SKN(σ(1))−SKN(σ(2)))2.

The model (1.20) can be seen as a particular instance of the models which have the covari-

ance structure of the following form

hXN(σ(1))XN(σ(2))i=f(R(σ(1),σ(2))),(1.23)

for some function f:[−1;1]→

is such that f(0) = 0 and f(1) = 1.

In particular, the Hamiltonian (1.15) with Ai1,...,ip:=N−p/2gi1,...,ip, where {gi1,...,ip}N

i1,...,ip=1

is the family of i.i.d. standard Gaussian random variables, has the covariance structure given by

(1.23) with f(q):=qp. This model is called Derrida’s p-spin model.

1.4.2 The random energy model

Perhaps, the simplest possible model in the framework of Section 1.3.2 is the random en-

ergy model. It was introduced by Derrida (1980, 1981), who considered the Gaussian process

{HREM(σ)}σ∈ΣNwithout any correlations. More formally, consider

HREM

N(σ):=√NREMN(σ),(1.24)

where {REM(σ)}σ∈ΣNis the family of 2Ni.i.d. standard Gaussian random variables. We work

under Assumption 1.4.1.

As opposed to pair interaction Hamiltonians (cf. (1.14)), the REM and, as well as its gen-

eralised version from the next section, lack a description in terms of interacting microscopic

spin variables σi. Nevertheless, while being a “toy model”, the REM serves quite surprisingly

as a building block of exact hierarchical structure arising in the thermodynamic limit of the SK

1.4 Some mean-field Gaussian spin-glass models 19

model (Section 2.2.1). Note that the SK model is a model with the genuine interacting spins

description (1.19). It is conjectured that the above mentioned hierarchical structure emerges

universally also for other mean-field spin-glass models (see Section 1.5.2). Related universality

facts on the level of random Hamiltonians (as opposed to the level of free energy and Gibbs mea-

sure, cf. Section 1.3.4) have been established in the series of papers by Bauke & Mertens (2004);

Ben Arous et al. (2006); Borgs et al. (2001, 2005a,b); Bovier & Kurkova (2004c, 2006a,b).

Remark 1.4.3. The covariance of the process REMN(·)can also be obtained as the limit of the

covariance of Derrida’s p-spin model as p ↑+∞.

Remark 1.4.4. The canonical L2-distance generated by the process REMN(·)induces, obvi-

ously, the discrete topology on ΣN.

It turns out that already this simple model without correlations demonstrates a phase transi-

tion. Indeed, the limiting free energy is (see Chapter 6) almost surely (and in Lp) given by

p(β) = (β2

2,β≤βc:=√2log2

β2

2+βc(β−βc),β>βc.(1.25)

The two cases in (1.25) can easily be explained by the breakdown of the law of large numbers

(LLN) in the partition function of the REM for β>βc.

The questions listed in Section 1.3.4 have been considered and settled for the REM by Bovier

et al. (2002). The above game of i.i.d. processes on the hypercube in the framework of Section

1.3.2 can also be played for non-Gaussian distributions, see Ben Arous et al. (2005).

1.4.3 The generalised random energy model

The generalised random energy model was proposed by Derrida (1985). A recent account of

mathematical results on the GREM can be found in Bovier & Kurkova (2007). The results

are substantially more elaborate than for the SK model. Even the hardest Question 1.3.3 can

be answered for this model completely (Bovier & Kurkova, 2003a). However, perhaps most

significantly, the importance of the GREM stems from the fact that it emerges in a variational

formula for the free energy of the a priori much harder SK model. The variational principle was

proposed by Aizenman et al. (2003) (see also Section 2.3.3). This principle allows, in particular,

to obtain upper bounds on the quenched free energy of the SK model. These bounds have been

proved to be exact in the thermodynamic limit (Talagrand, 2006b), as prescribed by the Parisi

formula.

The GREM is based on the following random Hamiltonian

HGREM

N(σ):=√NGREMN(σ),

where {GREMN(σ)}σ∈ΣNis the Gaussian random process satisfying (0.5). (The model uses

Assumption 1.4.1.)

Consider the space of discrete order parameters

n:={q:[0;1]→[0;1]|q(0) = 0,q(1) = 1,qis non-decreasing,

piece-wise constant with njumps}.(1.26)

20 1 Framework

Fig. 1.1. An example of discrete order parameter (overlap distribution function)

Recall the function ρfrom (0.5). Assume that ρ∈Q0

n. In what follows, we shall refer to ρas

the discrete order parameter. In this case, it is possible to construct the process GREMNas a

finite sum of independent Gaussian processes. Assume that

ρ(x) =

∑

k=1

[xk;xk+1)(x),(1.27)

where

0=:x0<x1< ... < xn=1,(1.28)

0=:q0<q1< ... < qn=1.(1.29)

See Figure 1.1 for an example graph of discrete order parameter.

Let {ak}n

k=1⊂

be such that a2

k=qk−qk−1. We assume that, for all k∈[1;n]∩

, we have

xkN∈

7and also ak6=0. Denote ∆xl:=xl−xl−1.

Consider the family of i.i.d standard Gaussian random variables

{X(σ(1),σ(2),...,σ(k))|k∈[1;n]∩

,σ(1)∈Σx1N,...,σ(k)∈ΣxkN}.

Using these ingredients, for σ=σ(1)qσ(2)q...qσ(n)∈ΣN, we have

GREMN(σ)∼

∑

k=1

akX(σ(1),σ(2),...,σ(k)).(1.30)

Equivalence (1.30) is easily verified by computing the covariance of the right hand side. The

computation gives, for σ,τ∈ΣN

CovGREMN(σ)GREMN(τ)=qNqL(σ,τ).

Figure 1.2 shows the tree structure corresponding to the GREM with the piece-wise constant

function ρsuch that n=3. This leads to the tree with three “REM levels”.

7This condition is for notational simplicity. It means that we actually consider instead of Nthe increasing sequence

{Nα}α∈

⊂

such that Nα↑+∞as α↑+∞, satisfying Nαxk∈

, for all α∈

and all k∈[1;n]∩

1.4 Some mean-field Gaussian spin-glass models 21

q3=1

q0=0

2(x3−x2)Nbranches

2(x2−x1)Nbranches

2x1Nbranches

X(σ(1),σ(2))

X(σ(1))

X(σ(1),σ(2),σ(3))

Fig. 1.2. Structure of the GREM with the three levels of hierarchy

1.4.4 Between the GREM and SK model

We now review shortly the results on generalisations of the GREM available in the literature.

We classify these results according to the set of questions proposed above.

The GREM is generated by the lexicographic distance (0.3) on the discrete hypercube. Der-

rida & Gardner (1986) considered the GREM with “lexicographic” external field which is par-

ticularly well adapted to the natural lexicographic distance generated by the GREM Hamilto-

nian. In contrast to the work of Derrida & Gardner (1986), in Chapter 6, we consider the GREM

with the uniform (magnetic) field (0.6). This external field is the same as in the SK model, cf.

(1.19). We answer both Questions 0.0.1 and 0.0.2 for this model in Chapter 6.

Bolthausen & Kistler (2006) introduced an a priori non-hierarchical generalisation of the

GREM and analysed it on the level of Question 0.0.1. In a recent preprint, Bolthausen & Kistler

(2008) prove that the Gibbs measure of the non-hierarchical GREM hierarchically decomposes

in the thermodynamic limit which leads to ultrametricity (Section 1.5.4). This result corre-

sponds to the level of geometry of the Gibbs measure (Question 1.3.3). Due to its a priori

non-hierarchical character, the model of Bolthausen and Kistler is closer to the SK model than

the GREM.

The model of Bolthausen & Kistler (2006) was studied also by Jana (2007); Jana & Rao

(2006) at the level of Question 0.0.1 with a generalisation to non-Gaussian distributions of

22 1 Framework

the REM summands in (1.30). Moreover, Jana (2007) analysed the GREM with randomised

branching rates (1.28) at the level of Question 0.0.1.

1.4.5 Other mean-field spin-glass models

For the sake of completeness, we mention briefly some mean-field spin-glass models which are

not considered in this thesis. Among them are the neural network memory capacity models:

the perceptron and Hopfield models, see Bovier (2006); Shcherbina (2005); Talagrand (2003).

Important are also the models of neural network learning, e.g., Klimovsky (2005); Shcherbina

& Tirozzi (2002).

Very important for applications are diluted mean-field spin-glass models. These models are

based on the Hamiltonians with random interaction graphs (e.g., Erd˝

os-R´

enyi ones). A promi-

nent example is the satisfiability problem on random graphs M´

ezard & Montanari (2007); Ta-

lagrand (2003), random codes M´

ezard & Montanari (2007); Nishimori (2001). Naturally, other

combinatorial problems such as the assignment one can be formulated as mean-field spin-glass

models (Talagrand, 2003, Chapter 8). The main distinguishing feature of these models from the

ones considered in this thesis is that their Hamiltonians (1.19) are not Gaussian. The impor-

tant tool in this case seems to be the rigorous versions of the cavity method, see, e.g., Bovier

(2006); Talagrand (2003). The cavity method is also quite compatible with Gaussian process

techniques, as shown by Aizenman et al. (2003), see also Section 2.3.3 and Chapter 3.

1.5 Replica symmetry breaking picture

In this section, we shall sketch the heuristic picture of the thermodynamic limit in mean-field

spin-glasses developed in theoretical physics (Dotsenko, 2001; M´

ezard & Montanari, 2007;

M´

ezard et al., 1987; Nishimori, 2001; Parisi, 2007). This picture suggests how the list of ques-

tions stated in Section 1.3.4 should be answered.

In landmark series of papers, Parisi and collaborators developed the heuristic Ansatz called

the replica symmetry breaking, see M´

ezard et al. (1987) and references therein. This Ansatz very

plausibly explained effects occurring in a large class of disordered mean-field spin systems. We

shall shortly review this Ansatz in the following subsection.

1.5.1 Overlap distribution function and replica symmetry breaking

Assume that ΣN={−1;+1}N. The overlap between configurations σ(1),σ(2)∈ΣNis defined

RN(σ(1),σ(2)):=1

N∑

i∈ΛN

σ(1)

iσ(2)

The overlap can be viewed as the random variable on the probability space (Σ2

N,S⊗2

N,G⊗2

N)(i.e.,

the overlap is an observable for such a two-times replicated system). To explain the subsequent

terminology, we note that one can think of the latter probability space as of two independent

replicas of the initial spin glass configuration space. However, the marginals of the random

measure G⊗2

Nare not independent random measures on the probability space of disorder, since

they depend on the same realisations of disorder.

Let us consider the (finite volume) overlap distribution function FN(β;·):[0;1]→[0;1]

defined as

1.5 Replica symmetry breaking picture 23

FN(β;q):=

hG⊗2

N(β)nRN(σ(1),σ(2))≤qoi.(1.31)

We note that the overlap distribution function is indeed a distribution function of some random

variable. We shall denote this random variable by q. Assume that the thermodynamic limit

F(β;q):=lim

N↑+∞FN(β;q)(1.32)

exists. The behaviour of the function (1.32) with respect to βis conjectured (M´

ezard et al.,

1987) to be as follows. There exists βc∈

+such that:

1. Replica symmetric (RS) regime: for β<βc, we have q∼δq(β), for some constant q(β)∈

2. RSB region: for β>βc, we have cardsuppq>1. Then either of the following two subcases

holds

a) Discrete RSB:

i. Single-step RSB (1-RSB): there exist some β(1)

c∈

and β(2)

c∈

+such that β(2)

β(1)

c>βcand, for all β∈(β(1)

c;β(2)

c), we have

q∼w(β)δq1(β)+(1−w(β))δq2(β),

for some constants q1(β),q2(β)∈

and w(β)∈(0;1). This regime has been rigor-

ously proven to hold for the REM (Bovier et al., 2002) with β(2)

c= +∞, and also for

the p-spin model (Talagrand, 2000b, 2003), where β(2)

c<+∞. Traces of this regime

at the level of free energy (Question 0.0.1) were confirmed for the spherical model

by Talagrand (2006a). In this thesis, we show an analogous result for the SK model

with multidimensional Gaussian spins, see Chapter 5.

ii. Finite-step RSB (n-RSB): the 1-RSB regime is a particular case of the following sit-

uation

q∼

n+1

∑

k=1

wk(β)δqk(β),

for some real constants q1(β)< ... < qn+1(β)and some vector of weights w(β)∈

(0;1)n+1such that ∑n+1

k=1wk(β) = 1. See Figure 1.1 for the typical graph of the over-

lap distribution function. This regime has been rigorously proven for the GREM by

Bovier & Kurkova (2004a). In Chapter 6 we prove an analogous result for the GREM

with external field at the level of Question 0.0.2.

b) Continuous RSB or full RSB (FRSB): there exists β(f)

c>0 such that for β∈(β(f)

c;+∞),

there exist some deterministic weights wm,wM∈[0;1), deterministic constants qm,qM∈

, and a continuous random variable qcont with suppqcont = [qm(β);qM(β)] such that we

have the following decomposition

q∼wm(β)δqm(β)+wM(β)δqM(β)+(1−wm−wM)qcont.

This decomposition is believed to hold for the SK model and Derrida’s p-spin model for

large enough β(f)

c(M´

ezard et al., 1987). Probably the only (at the time of writing) rigor-

ous hint that this regime might hold for the SK model is obtained by Talagrand (2006b)

at the level of free energy (which in our classification corresponds to Question 0.0.1).

Bovier & Kurkova (2004b) have considered this regime for the continuous version of the

GREM (CREM) at the level of Question 1.3.3.

24 1 Framework

Remark 1.5.1. The behaviour of the random overlap distribution function

FN(β;q):=G⊗2

N(β)nRN(σ(1),σ(2))≤qo

can be very different from the behaviour of (1.31), see Parisi & Talagrand (2004).

1.5.2 Limiting Gibbs measures and pure states

The RSB described in the previous section is believed to have its roots in the structure of the

Gibbs measure.

In classical statistical mechanics of spin systems, Gibbs measures often form a convex com-

pact set G0(β), see, e.g., (Simon, 1993, Chapter III.5). The Krein-Milman theorem then assures

that there exists the set of extremal points corresponding to the set of all limiting Gibbs mea-

sures. These extremal points are called pure states.

Somewhat similar but substantially more complicated picture is conjectured (M´

ezard et al.,

1987) to be true for mean-field spin-glass systems in the thermodynamic limit. It is conjectured

that in the FRSB case the Gibbs measure decomposes into a countable convex combination

of “random pure states”, where the coefficients of the convex combination are also random

and independent of the pure states. Moreover, it is conjectured that the distribution of these

coefficients is universal and the coefficients are hierarchically clustered. See Section 2.2.2, for

the random weights which seem to suit this picture. The pure states are claimed (at least for the

SK model) to be completely characterised by the “limiting overlaps” which, in turn, are also

believed to possess hierarchical structure.

Unfortunately, at the time of writing there is no general mathematical definition of the con-

cept of limiting Gibbs measure for mean-field spin-glass systems which reflects the picture

predicted in theoretical physics. However, there are proper definitions and results in several

concrete cases, see the next subsection.

Remark 1.5.2 (metastates).The limiting objects called metastates comprehensively encode the

behaviour of the limiting Gibbs measures in the case of non-mean-field (lattice) disordered

systems (e.g., the EA model). See Bovier (2006); Newman & Stein (2007) and references therein

for precise definitions and discussion.

1.5.3 Higher-level objects

As emphasised, e.g., in (Ellis, 2006), study of higher-level objects such as empirical distribu-

tion functions and their large deviations is fruitful for understanding the properties of classical

(non-disordered) spin systems. This ideas are closely related to the Gibbs variational principle

(Proposition 1.2.1).

Large-deviations of empirical measures

We refer to Bolthausen (2007) for a suggestion of yet another higher-level object approach in

the spirit of large deviations of empirical measures in the context of the perceptron model. See,

e.g., Comets (1989) for some earlier results in this spirit.

Comparison schemes

Recent breakthroughs in understanding of mean-field spin-glasses can be viewed as rewards

on the quest for “proper” higher-level objects. These breakthroughs were initiated by the com-

parison schemes of Guerra (2003) and Aizenman et al. (2003). See Section 2.3.3 for a short

1.5 Replica symmetry breaking picture 25

review. See Chapters 3, 4 for an extension of the comparison approach to the SK model with

multidimensional spins.

Empirical overlap distribution functions

Bovier & Kurkova (2004a,b) have obtained a comprehensive description of the limiting Gibbs

measure for the REM, GREM and CREM using the empirical overlap distribution function.

To define this object, we need the following ingredients. Let Q0⊂[0;1][0;1]be the set of all

non-decreasing right-continuous piece-wise constant functions x:[0;1]→[0;1]with x(0) = 0,

x(1) = 1 such that xhas only a finite number of jumps (Figure 1.1). We will refer to the set Q0

as the set of discrete order parameters. We equip it with the L1topology. The compactification

of this space we denote by Qand call the set of order parameters. The latter space is obviously

isometric to the space of all non-decreasing c`

adl`

ag functions x:[0;1]→[0;1]such that x(0) = 0

and x(1) = 1 equipped with the L1topology.

Now suppose that the Gibbs measure GN(β)is induced by the GREM Hamiltonian. Consider

the “random overlap distribution field” xN(β;·):ΣN×[0;1]→[0;1]defined as

xN(β;σ(1),q):=GN(β){σ(2)∈ΣN:qL(σ(1),σ(2))≤q}.(1.33)

Note that given σ(1)∈ΣN, the above field (1.33) induces the random order parameter

xN(β;σ(1),·)∈Q0.

In words, (1.33) is the distribution of the overlap between given configuration σ(1)∈ΣNand the

“equilibrated” (i.e., Gibbs-distributed) configuration σ(2)∈ΣN. Finally, we define the empirical

overlap distribution KN(β)of M1(X0)as

M1([0;1]) 3KN(β):=ZΣN

δxN(σ(1),·)GN(β;dσ(1)).

The precise relation between the empirical object KN(β)and the overlap distribution function

(1.31) is the following

ZX

xKN(β;dx)(q) = FN(β;q).

It is shown by Bovier & Kurkova (2003b, 2004b) that

KN(β)w

−−−→

N↑+∞

K0(β),(1.34)

where K0(β)is the random element in M1(Q0). Moreover, the explicit distribution of the

random measure K0(β)is provided. In particular, this random measure K0(β)is the “countable

convex combination” (with random weights) of Dirac measures. In turn, the Dirac measures are

supported by the random overlap distributions corresponding to the atomic measures of the

following form

(1−w1)δ0+(w2−w1)δ¯x1+...+wl(β)δ¯xl(β),

26 1 Framework

where {wk}l(β)

k=1⊂[0;1]are certain random weights (0 ≤w1≤...≤wl(β)≤1) with hierarchical

organisation, {¯xk}l(β)

k=1⊂[0;1]is a certain coarse-graining of {xk}n

k=1, and l(β)≤n. This result

gives a precise mathematical meaning to the “convex-combination-of-the-pure-states” heuris-

tics mentioned in Section 1.5.2. Besides, this result comprehensively displays the hierarchical

“geometry” of the Gibbs measure in the thermodynamic limit. We refer to Bovier & Kurkova

(2007) for the extensive presentation. Essentially, the limiting empirical overlap distribution ob-

ject K0(β)is equivalent to the Bolthausen-Sznitman coalescent see Section 2.2.3 and references

therein.

Gibbs measure of the SK-like models

We refer to Talagrand (2000b, 2003) for the results on the structure of the Gibbs measure in the

thermodynamic limit for Derrida’s p-spin model (with large enough p) in the 1-RSB regime.

See also Talagrand (2007a) for some related results and conjectures for the SK model in the

FRSB regime.

1.5.4 Ultrametricity

In physical literature, it is believed that the canonical metric structure of the SK model weighted

by the Gibbs measure becomes ultrametric in the thermodynamic limit. One possible way to

formalise this (Talagrand, 2007b) is the following assertion about the three replicas which is

formulated quite in the spirit of (1.31) as

lim

N↑+∞

hG⊗3

N(β)hR(σ(1),σ(3))≥R(σ(1),σ(2))∧R(σ(2),σ(3))−εii=1 (1.35)

which is required to hold for each ε>0.

The (unproved) assertion (1.35) is believed to be caused by the hierarchical organisation of

the pure states.

Remark 1.5.3. The definition of ultrametric (0.4) implies that the metric space (Σ,dU)has the

following property. For any triangle with vertices σ(k), k ∈{1,2,3}, we have that the lengths

of least two of the three sides dU(σ(1),σ(2)),dU(σ(2),σ(3)), and dU(σ(1),σ(3))are the same.

Using the language of balls this means that, given two generic balls, the balls are either disjoint

or one of them contains the other.

The assertion (1.35) states roughly that the canonical L2distance generated by the SK model

weighted with the corresponding Gibbs measure is almost ultrametric in the thermodynamic

limit.

1.5.5 Free energy in the SK model: the Parisi formula

Using mathematically unrigorous heuristics of the “zero replica limit”, Parisi (see, e.g., M´

ezard

et al. (1987)) proposed a formula which answers Question 0.0.1 for the SK model. As we shall

see below, this formula seems to be in line with the general RSB picture. Hence, the Parisi

formula can be treated as an evidence of the RSB picture. This formula was rigorously proved

by Talagrand (2006b) for a class of models which essentially reduces to Derrida’s p-spin model

with even p≥2. This proof was extended by Panchenko (2005b) to cover the case which in

our nomenclature corresponds to the SK model with multidimensional spins with d=1 and

a priori measure with bounded support, cf. (0.9). See Section 2.3.2 for a short review of the

1.5 Replica symmetry breaking picture 27

available results in the case d>1, Chapter 3 for partial results in the case d>1, and Chapter 5

for extension to the case of multidimensional Gaussian spins.

Using the definitions of Section 1.5.3, we may state the following result.

Theorem 1.5.1 (the Parisi formula, Guerra (2003); Talagrand (2006b)).Given x ∈Q0, let the

function fx(q,y):[0;1]×

→

be a unique solution of the following backward semilinear

terminal value problem

(∂qf(q,y)+ 1

2∂2

yf(q,y)+x(q)(∂yf)2(q,y)=0,q∈[0;1),y∈

f(1,y) = logcoshβ√2(y+h),y∈

.(1.36)

Define the Parisi functional as

P(β,x):=fx(0,0)−β2

2Z1

qx(q)dq.

Then, for any β>0and any h ∈

, the following Parisi formula holds

p(β,h) = inf

x∈Q0

P(β,x),(1.37)

where p(β,h)is the limiting free energy of the SK model with the Hamiltonian (1.24).

Remark 1.5.4. It is easy to see (Guerra, 2003; Talagrand, 2006c) that the Parisi functional is

Lipschitz continuous with respect to x ∈Q0(recall that the space Q0is equipped with the L1

topology). Hence, it can be extended by continuity onto the whole compact space Q. Then the

extended Parisi functional attains its infimum on Q.

The Parisi formula (1.37) can more intuitively be reformulated in terms of the AS2scheme.

The scheme expresses the Parisi functional in terms of difference between the free energies of

two GREM-like models, see Section 2.3.3 for details. This shows explicitly the relevance of the

hierarchical structure of the GREM for the SK model.

In this thesis, we partially extend Theorem 1.5.1 to the case of the SK model with multidi-

mensional spins, see Chapters 3, 4 and 5.

Often used tools and some existing results

The present chapter consists of three parts. In Section 2.1, we record some tools and techniques

that are often used in the context of mean-field spin-glasses. In Section 2.2, we record some

known limiting objects and their basic properties that are extensively used in the subsequent

chapters. Finally, in Section 2.3, we give a short review of the known results on Gaussian mean-

field spin-glasses which are closely related to or are special cases of the results obtained in this

thesis.

2.1 Often used ingredients

In this section, we record for the future reference some basic results on Gaussian processes.

In what follows, whenever we speak about Gaussian random variables (or processes), we

assume by default that they are centred.

We also accept the following.

Definition 2.1.1 (random process).Let (Ω,F,

)be a probability space. Suppose also that

(S,S)is a measure space. Let I be an abstract index space. The collection of the measurable

mappings

X:={X(i):Ω→S}i∈I

is called the random process X.

In particular, we shall be interested in Gaussian random processes indexed by high-dimensional

index spaces. For a general discussion of Gaussian random processes we refer, e.g., to mono-

graphs by Adler (1990); Adler & Taylor (2007); Bogachev (1998); Dudley (1999); Fernique

(1997); Ledoux & Talagrand (1991); Lifshits (1995).

2.1.1 Interpolation

This subsection is devoted to a short account of comparison results between the functionals of

Gaussian processes.

Integration by parts

We begin by recalling well-known integration by parts formula which is the source of many

comparison results for functionals of Gaussian processes.

Let F:X→

be a functional on the linear space X. Given x∈Xand e∈X, the directional

(Gˆ

ateaux) derivative of Fat xalong the direction eis

30 2 Often used tools and some existing results

∂x eF(x):=∂tF(x+te)t=0.(2.1)

We now ready to state the following.

Lemma 2.1.1. Let {g(i)}i∈Ibe a real-valued Gaussian process (the set I is an arbitrary index

set), and let h be some Gaussian random variable. Define the vector e ∈

Ias e(i):=

[hg(i)],

i∈I. Let F :

I→

be such that, for all f ∈

I, the function

3t7→F(f+te)∈

(2.2)

is either locally absolute continuous or everywhere differentiable on

. Moreover, assume that

the random variables hF(g)and ∂g eF(g)are in L1.

Then

[hF(g)] =

[∂g eF(g)].(2.3)

The previous lemma coincides with (Panchenko, 2005b, Lemma 4) (modulo the differentia-

bility condition on (2.2) and the integrability assumptions which are needed, e.g., for (Bogachev,

1998, Theorem 5.1.2)).

Corollary 2.1.1 (integration by parts for finite dimensional Gaussian vectors).Using the nota-

tions of Lemma 2.1.1, suppose that cardI<∞. Assume, further, that the function F :

I→

has

the first derivatives of moderate growth.

Then

[hF(g)] = ∑

i∈I

[hgi]

∂iF(g).(2.4)

See (Talagrand, 2003, formula (A.41)).

Quadratic interpolation paths method

The following is a simple but powerful equality which is a consequence of the Gaussian inte-

gration by parts. It can be considered as a tool of comparison between two Gaussian processes

that are seen through the prism of expected value of some functional of them.

Proposition 2.1.1. Consider two independent Gaussian random vectors X ={Xi}n

i=1, Y =

{Yi}n

i=1and the n-variate function F :

n→

. We require that F has second-order derivatives

of moderate growth. Define

Z(t):=√tX +√1−tY.(2.5)

Then

[F(Z(t))] = 1

∑

i,j=1

(

[XiXj]−

[YiYj])

∂2

i,jF(Z(t)).(2.6)

In particular,

[F(X)] =

[F(Y)]+ 1

∑

i,j=1Z1

(

[XiXj]−

[YiYj])

∂2

i,jF(Z(t))dt.(2.7)

2.1 Often used ingredients 31

See, e.g., the proof of (Talagrand, 2003, Proposition 2.4.3). Analogous ideas can be found

in Kahane (1986). An interesting generalisation of the previous proposition to more general

interpolation paths than (2.5) is (Aizenman et al., 2007, Corollary A.2).

Remark 2.1.1. Definition (2.5) and the fact that Z(0) = Y and Z(1) = X may explain the name

“quadratic interpolation”.

Corollary 2.1.2 (generalised Slepian’s lemma).Let X and Y be two independent d-dimensional

Gaussian vectors. Let D1and D2be some subsets of {1,...,d}×{1,...,d}. Assume that











[XiXj]≤

[YiYj],(i,j)∈D1

[XiXj]≥

[YiYj],(i,j)∈D1

[XiXj] =

[YiYj],(i,j)/∈(D1∪D2).

Let F :

d→

be the function with the second derivatives of moderate growth and such that

the following holds

(∂2

i,jF(x)≥0,(i,j)∈D1

∂2

i,jF(x)≤0,(i,j)∈D1.

Then

[F(X)] ≤E[F(Y)].(2.8)

See (Ledoux & Talagrand, 1991, Theorem 3.11). A similar result can be extracted from

Joag-Dev et al. (1983).

2.1.2 Concentration of measure

We refer to the monograph of Ledoux (2001) for an extensive account of results on concentra-

tion of measure. We state here a typical result of concentration of measure for i.i.d. Gaussian

random variables.

Theorem 2.1.1 (Gaussian concentration measure of bound).Equip

nwith the Euclidean norm.

Suppose F :

n→

is Lipschitzian with some constant L >0. Further, let {gi}n

i=1be a family

of i.i.d. standard normal random variables.

Then, for any t ≤0, we have

{|F(g)−

[F(g)]|≥t}≤2exp−t2

2L2.(2.9)

See, e.g., (Ledoux, 2001, Corollary 2.6) for a proof.

Proposition 2.1.2 (a measure conentration bound for the SK free energy).Using the notations

of Chapter 1, suppose Σ⊂

is a bounded set. Further, assume Ω⊂ΣN. Let {H(σ)}σ∈ΣNbe

the Gaussian process with the correlation structure satisfying the following condition

sup

σ(1),σ(2)∈ΣN

hH(σ(1))H(σ(2))i−ξ(R(σ(1),σ(2)))≤c(N),

32 2 Often used tools and some existing results

where ξ:

→

is some continuous function and c(N)−−−−→

N→+∞0. Define

X:=logZΩexpβHN(σ)dµN(σ).

Then, for any t ≥0, we have

{|X−

[X]|>2t√LN}≤2exp−t2,

where L :=max{ξ(σ2):σ∈Σ}+c(N).

See (Panchenko, 2005b, Lemma 12) for a proof.

2.1.3 Superadditivity

The following is a classical theorem which is usually attributed to Fekete with an extension due

to Aizenman et al. (2003).

Theorem 2.1.2 (superadditivity).Consider a real sequence {an}∞

n=1⊂

. Suppose it is super-

additive, i.e., for all n,m∈

, we have

an+m≥an+an.

Then there exists the following limit

lim

n→∞

n=:a0=sup

n∈

Moreover,

a0=inf

n∈

sup

n∈

an

n=lim

m→∞

mlim

n→∞lim

n→∞

[an+m−am].(2.10)

See, e.g., (Aizenman et al., 2007, Lemma B.1) for a proof.

Subadditivity-based arguments were historically used to prove the existence of free energy in

the thermodynamic limit, cf. Question 1.2.1. In the context of the SK model, the superadditivity

in conjunction with the quadratic interpolation method was used for the first time by Guerra &

Toninelli (2002).

2.2 Limiting objects

In this section, we shall shortly review several limiting objects that arise in the context of mean-

field spin-glass models with Gaussian Hamiltonians.

In Sections 2.2.1 and 2.2.2, we consider limiting higher-level objects for the exponentials

of the REM and GREM Hamiltonians, respectively. In Section 2.2.3, we consider a limiting

higher-level object for the GREM overlap structure.

2.2 Limiting objects 33

2.2.1 The Poisson-Dirichlet process

In this subsection, we shall deal with a limiting structure for the REM. It was first suggested by

Ruelle (1987). The limiting structure is closely related to the Poisson-Dirichlet process.

Using the notation of Pitman & Yor (1997), the object we shall deal with is the Poisson point

process PD(x,0). The generic two-parameter Poisson-Dirichlet processes PD(x,a)can also be

obtained within framework of the REM exponentials, see Talagrand (2007a).

Definition 2.2.1 (limiting REM exponentials, Ruelle (1987)).Assume x ∈(0;1). The limiting

REM exponentials ξ(x)is the Poisson point process on

+with the following intensity density

+3t7→xt−x−1.(2.11)

We refer, e.g., to Daley & Vere-Jones (2003); Kallenberg (1983); Leadbetter et al. (1983);

Resnick (1987) for the discussion of random measures and the Poisson point processes.

A limiting higher-level object

To motivate the Definition 2.2.1, we need the following linear scaling function

uN(x):=x

+bN,(2.12)

where

aN:= (2Nlog2)1/2,

bN:=aN−√2

2aN

log(aN√2π).

Consider the rescaled REM process

REM(σ):=γu−1

N(REM(σ)),(2.13)

where γ:= (2log2)−1/2. The relation between the rescaled REM process (2.13) and the Poisson

point process ξ(x)is established through the convergence of the corresponding “higher-level ob-

ject” (cf. Section 1.5.3). Namely, define the empirical process of the REM exponentials WN(β)

– a random element in Mf(

+)– as

WN(β):=∑

σ∈ΣN

δexp[βREM(σ)].(2.14)

The following “low temperature behaviour” then holds.

Theorem 2.2.1 (the limit of the REM exponentials).Suppose β>√2log2. Let x(β):=

√2log2/β∈(0;1). The point process (2.14) converges to ξ(x(β)) in distribution. More for-

mally, we have

WN(β)w

−−−→

N↑+∞ξ(x(β)).

See, e.g., Bolthausen & Sznitman (2002); Bovier et al. (2002) for a proof.

By a slight abuse of notation, let {ξ(α)}α∈

be the enumeration of all atom positions for a

given realisation of ξon

34 2 Often used tools and some existing results

Properties

The particular form of the density (2.11) implies the following result.

Theorem 2.2.2 (some basic properties of the REM exponentials).

1. For any ε>0, we have

[card{α∈

:ξ(α)>ε}] = ε−x.(2.15)

2. Almost surely, it is possible to reorder the atoms {ξ(α)}α∈

={ξ(i)}i∈

such that, for all

i∈

, we have

ξ(i)>ξ(i+1).

3. The atom positions have the following “profile”

ξ(i)i1/x−−−→

i↑+∞1.

4. The atoms have the following integrability property. The sum ∑∞

i=1ξ(i)vconverges if and

only if v >x.

5. The partition function Z :=∑∞

i=1ξ(i)is almost surely finite and has an infinitely divisible

distribution satisfying Z ∼2−1/x(Z+Z0), where Z0is an independent copy of Z.

6. The v-th moment of the partition function Z is finite, i.e.,

[Zv]<∞if and only if u <x.

For the proofs, see (Aizenman et al., 2007, Theorem 5.1).

Remark 2.2.1. Theorem 2.2.2 implies that the sequence {ξ(α)}α∈

can be treated as a random

element of Mf(

In the sequel, we shall assume that the random sequence {ξ(i)}i∈

is decreasing.

Random permutations and their distributions

To state the next property of the limiting REM we need some additional ingredients. Let Ybe

the non-negative random variable independent of ξsatisfying the following moment condition

λ(Y):=

[Yx]1/x<+∞.

Denote by νthe distribution of Y. Let, further, {Y(i)}i∈

be the i.i.d. copies of Y. Consider the

following multiplicative deformation of the limiting REM process

{ξ(i)}∞

i=17→{ξ(i)Y(i)}∞

i=1.(2.16)

Due to Theorem 2.2.2, the resulting sequence {ξ(i)Y(i)}i∈

can also be reordered into the

decreasing sequence {˜

ξ(i)}i∈

, i.e.,

ξ(i)>˜

ξ(i+1),i∈

Let π:

→

be the corresponding random permutation, i.e., ˜

ξ(i) = ξ(π(i))Y(π(i)). Define

also ˜

Y(i):=Y(π(i)). Then the following result holds.

2.2 Limiting objects 35

Theorem 2.2.3 (the averaging property of the REM, the “quasi-stationarity”).The limiting REM

process satisfies

1. The averaging (or the “quasi-stationarity”) property:

∑

α∈

δ˜

ξ(α)∼∑

α∈

δλ(Y)ξ(α).(2.17)

2. The reordering effect: the sequence {˜

Y(i)}i∈

consists of i.i.d. random variables which are

independent of ˜

ξ. Moreover, the distribution of, e.g., ˜

Y(1)is absolute continuous with re-

spect to νand has the following density

+3˜

t7→ ˜

λ(Y).

For a proof see (Ruzmaikina & Aizenman, 2005, Proposition 3.1) and (Aizenman et al.,

2007, Theorem 5.2).

Define the normalisation operation N:Mf(

)→M1(

)as follows

N(η){i}:=η{i}

η(

).(2.18)

Note, that Theorem 2.2.3 implies that

∑

α∈A

δN(ξ){α}∼∑

i∈

δN(˜

ξ){i}.

Ruzmaikina & Aizenman (2005) show that under some mild regularity conditions the “quasi-

stationarity” property (2.17) characterises the distribution of the point process ξ(x)completely.

The limiting Gibbs measure

The above results are precise enough to give an answer to Question 1.3.3 in the REM case.

It is shown by Bovier (2001); Bovier & Kurkova (2004a) (see also (Talagrand, 2003, Theo-

rem 1.2.1)) that, in particular, for β>√2log2, we have the following weak convergence

∑

σ∈ΣN

δGN(β){σ}−−−→

N↑+∞

N(ξ(x(β))),

where x(β):=√2log2/β.

For β∈[0;√2log2), the situation is substantially easier. Given an arbitrary M∈

and

η∈ΣM, we have

GN(β){σ∈ΣN:[σ]M=η}−−−→

N↑+∞

2M,

almost surely, see, e.g., (Bovier, 2006, Section 9.3) for more details.

36 2 Often used tools and some existing results

2.2.2 The Ruelle probability cascades

After the seminal work of Derrida (1985) who introduced the GREM, the corresponding limit-

ing probability structure – probability cascades – was identified by Ruelle (1987). The RPC is

the point process, which atoms possess certain hierarchical correlations.

More formally, we shall need the following index spaces. Given n∈

, define A0:=/0,

Ak:=

kand A:=An. The latter index space can be seen as an (infinitely wide) tree with

n-levels of hierarchy.

Define also the set of discrete order parameters with n jumps Q0

n⊂Q0which consists of the

functions of the following form

x(q) =

∑

i=0

[qi;qi+1)(q),(2.19)

where, naturally,

0=:x0<x1< ... < xn<xn+1:=1,

0=:q0<q1< ... < qn<qn+1:=1,(2.20)

see Figure 1.1 for a sketch of x.

Definition 2.2.2 (Ruelle’s probability cascade, Ruelle (1987)).Given some x ∈Q0

n, let, for all

k∈[1;n]∩

and all α∈Ak, the point processes ξk,[α]k−1

be the independent ones with

ξk,[α]k−1∼PPP(

+3q7→xkq−xk−1∈

).(2.21)

The Ruelle probability cascade then is the point process ξ=ξ(x1,...,xn)with the atom posi-

tions {ξ(α)}α∈Andefined as follows

ξ(α):=

∏

k=1

ξk,[α]k−1(αk).(2.22)

A limiting higher-level object

Assume that the order parameter x∈Q0

nis such that, for all k= [1;n]∩

, the following holds

xk=k

∆qk>∆qk+1.(2.23)

Define

γk:=∆qk

(2log2)∆xk1/2

To motivate Definition 2.2.2, consider the following rescaled GREM process

GREM(σ):=

∑

k=1

γku−1

∆xkN(X(σ(1),σ(2),...,σ(k))).(2.24)

2.2 Limiting objects 37

Define also the correspondent point process of the rescaled GREM exponentials

W(n)

N(β):=∑

σ∈ΣN

δexp[βGREM(σ)].(2.25)

The following theorem then describes the “very low temperature regime” of the limiting GREM.

Theorem 2.2.4 (the limit of the GREM exponentials, Bovier & Kurkova (2004a)).Put {βk:=

p2log2∆xk/∆qk}n

k=1⊂

+. Fix an arbitrary β>βn. For k ∈[1;n]∩

, define xk(β):=βk/β∈

(0;1).

Then the point process (2.25) converges to ξ(x1(β),...,xn(β)) in distribution. More for-

mally, we have

W(n)

N(β)w

−−−→

N↑+∞ξ(x1(β),...,xn(β)).

See (Bovier & Kurkova, 2004a, Theorem 1.7) for a proof of more elaborate result which

covers also all remaining intermediate regimes β∈[0;βn].

Remark 2.2.2. Condition (2.23) implies, in particular, that all jumps of x are the extreme points

of the their concave hull. See Theorem 2.3.2 for similar results in the case of generic order

parameters.

Properties

Theorem 2.2.5 (some elementary properties of the RPC).

1. The partition function Z :=∑α∈Anξ(α)is almost surely finite and, moreover,

Z∼Z(x1)

∏

k=2

Z(xk)xn−11/xn−1.

In particular, we have

[logZ]<∞.

2. Almost surely, it is possible to reorder the atoms {ξ(α)}α∈

={ξ(i)}i∈

such that, for all

i∈

, we have

ξ(i)>ξ(i+1).

That is, there exists the corresponding random permutation π:

→Anwhich satisfies

ξ(k) = ξ(π(k)).

See, e.g., (Aizenman et al., 2007, Theorem 5.3) for a proof.

The limiting Gibbs measure

As already mentioned above, Question 1.3.3 can completely be answered for the GREM. It is

shown by (Bovier & Kurkova, 2004a, Theorem 1.9) that, in particular, for β>βn, we have

∑

σ∈ΣN

δGN(β){σ}

−−−→

N↑+∞

N(ξ(xn(β))),

where ξ(xn(β)) ∼PD(xn(β),0). Moreover, at the level of limiting structures the normalisation

(2.18) of the RPC is equidistributed with the normalisation of the Poisson-Dirichlet process.

More precisely,

38 2 Often used tools and some existing results

N(ξ(x1,...,xn)) ∼N(ξ(xn)),

where ξ(xn)∼PD(xn,0). See, e.g., (Bolthausen & Sznitman, 1998, Lemma 2.1) for a proof.

Hence, interestingly, the hierarchical RPC structure flattens down to the Poisson-Dirichlet struc-

ture, if the point process of limiting Gibbs weights is concerned.

The hierarchical structure is, however, important for encoding the RSB picture (cf. Sec-

tion 1.5). It is this structure which is captured by the limiting object introduced in the following

section.

2.2.3 The Bolthausen-Sznitman coalescent and random permutations

A deep insight of Bolthausen & Sznitman (1998) was to identify the dynamical system gen-

erated by the RPC with remarkable properties which allow to perform many spin-glass cal-

culations in a more transparent way. In what follows, we shall use this dynamical system in

computations involving the replicas of the system (see Chapter 4).

A contribution of Bovier & Kurkova (2004a,b) was, in particular, to relate the RPC, the

continuous time coalescent process of Bolthausen & Sznitman (1998), and its dual – the contin-

uous time branching processes of Bertoin & Le Gall (2000) – with the original setup of Derrida

(1985).

Definition

Let ξ=ξ(x1,...,xn)be the RPC process. Theorem 2.2.5 guarantees that there exists the re-

arrangement ξ={ξ(i)}i∈

of ξ’s atoms in a decreasing order. Recall (0.2) and define the

(random) limiting ultrametric overlap qL:

2→[0;n]∩

qL(i,j):=max{k∈[0;n]∩

:[π(i)]k= [π(j)]k},(2.26)

where we use the convention that max /0 =0. This overlap valuation (2.26) induces the sequence

of random partitions of

into equivalence classes. Namely, given a k∈

∩[0;n], we define,

for any i,j∈

, the Bolthausen-Sznitman equivalence relation as follows

i∼

kjdef

⇐⇒qL(i,j)≥k.(2.27)

Note the conceptual similarity between the equivalence (2.27) and the limiting empirical

overlap distribution K0(β)from (1.34).

In what follows, we shall, by a slight abuse of the language, not distinguish between the

concepts of the equivalence relation and the partition of a set into the equivalence classes.

Partitions induced by equivalence (2.27) have a nice property that the partitions with smaller

indices kare the coarsenings of the partitions with the larger k’s. Moreover, these random parti-

tions are induced by a Markovian structure and can be seen as the states of the continuous-time

coalescent process at certain deterministic times introduced by Bolthausen & Sznitman (1998).

Indeed, for an arbitrary index set I, let E(I)be the set of all equivalence relations on I. The

sets E(I)are compact, if seen as the subsets of the product spaces {0,1}I×I(we equip the set

{0,1}with the discrete topology). Finally, let Γbe the Markov process

Γ:={Γ(t):Ω→E(

)}t∈

with the following properties.

2.2 Limiting objects 39

1. The initial state is Γ(0) = Π0, where Π0∈E(

)is the partition of

into singleton sets.

2. Coalescence property: for any s,t∈

+, whenever s<t, we have Γ(t)Γ(s), i.e., the

partition Γ(s)is not finer than Γ(t).

3. Finite-dimensional traces: for any Ib

, let ΓIbe the trace of the process Γon the set

E(I). Let aI:={aI(Π,Π0)}Π,Π0∈E(I)be the transition rate matrix (Q-matrix) of the Markov

process Γ(I)defined as follows. For Π,Π0∈E(I), denoting N:=cardI, define

aIΠ,Π0:=









(N−2)N−2

k−2−1,Π0is obtained by gluing k∈[2;N]∩

classes

of Πtogether;

−∑E6=E0aI(E,E0),Π=Π0;

0,otherwise.

Definition 2.2.3 (the Bolthausen-Sznitman coalescent).As is shown in (Bolthausen & Sznitman,

1998, Theorem 1.2), the above properties uniquely identify the distribution of the continuous-

time pure-jump Markov process Γwhich is then called the Bolthausen-Sznitman coalescent.

See Figure 2.1 for a realisation of the trace of the Bolthausen-Sznitman coalescent on a finite

set.

Fig. 2.1. A realisation of the trace of the Bolthausen-Sznitman coalescent on I:={1,2,3,4}

Remark 2.2.3. An explicit representation of the Markov semigroup of the Bolhausen-Sznitman

coalescent is certainly also available, see (Bolthausen & Sznitman, 1998, Proposition 1.4).

40 2 Often used tools and some existing results

Properties

For k∈[0;n]∩

, let the partition Πk∈E(

)be the partition generated by the Bolthausen-

Sznitman equivalence “∼

k”. The following theorem motivates Definition 2.2.3 by relating the

distributions of the limiting ultrametric overlap distribution function with the Bolthausen-

Sznitman coalescent.

Theorem 2.2.6 (limiting ultrametric overlap and the Bolthausen-Sznitman coalescent).

1. The law of the “limiting ultrametric overlaps” vector Πn−1,...,Π1coincides with the law

of the following Bolthausen-Sznitman vector

Γ(t1),Γ(t2),...,Γ(tn−1),

where, for k ∈[1;n−1]∩

tk:=logxn

xk∈

2. The coalescent Γand the point process N(ξ(x1,...,xn)) are independent.

See (Bolthausen & Sznitman, 1998, Theorem 2.2) for a proof.

The limiting overlaps and certain random permutations

So far, in the present chapter, we were dealing with an important ingredient of the RSB picture,

namely, with the scaling limits of the (unnormalised) GREM exponentials at the level of point

processes. This lead us to the RPC. The second important ingredient of the Parisi RSB picture

are the overlaps (see Section 1.4.5).

Recall (0.2). Given x∈Q0

n, define the limiting GREM overlap q :A2→[0;1]as

q(α1,α2):=qqL(α1,α2).(2.28)

We shall need also the randomised limiting GREM overlap q :

2→[0;1]as

q(i,j):=qqL(i,j).

Note that the above definitions allow also for another representation of the reordered limiting

GREM overlap

q(i,j) = q(π(i),π(j)).

Finally, define the filtered limiting GREM process {Y(α,t):α∈A,t∈

+}as follows

Y(α;t):=

∑

k=0

[qk;1](t)Wk([α]k;t∧qk+1−qk),(2.29)

where ∆qk:=qk+1−qkand {{Wk([α]k;t)}t∈

+}k∈[0;n]∩

,α∈Ais the family of independent (for

different indices αand k) standard 1-D Wiener processes. Definition (2.29) readily implies

CovhY(α(1),t),Y(α(2),s)i=t∧s∧q(α(1),α(2)).

2.2 Limiting objects 41

Now, similarly to (2.16) with the help of the filtered limiting GREM we reweight the initial

RPC weights ξas

ξ(α;t):=ξ(α)exp(f(t,Y(α,t))),(2.30)

where we assume, for simplicity, that f:[0;1]×

→

is such that, for any t∈[0;1],f(t,·)∈

C(2)(

), and any c>0 we have R

expf(t,x)−cx2dx<∞and, moreover, that

sup

x∈

|∂xf(t,x)|+|∂2

xx f(t,x)|<+∞.

Further, the point process defined in (2.30) can also be reordered in a decreasing way, i.e., there

exists the mapping ˜

πt:

→Asuch that, for all i∈

, the following holds

ξ(˜

πt(i);t)>˜

ξ(˜

πt(i+1);t).(2.31)

In what follows, we shall use the short-hand notations ˜

ξ(i;t):=˜

ξ(˜

πt(i);t),˜

Y(i,t):=Y(˜

πt(i),t)

and ˜qt:={˜qt(i,j):=q(˜

πt(i),˜

πt(j))}i,j∈

Theorem 2.2.7 (some limiting GREM properties).Given a discrete order parameter x ∈Q0

we have

1. Independence #1: the normalised RPC point process N(ξ)is independent from the corre-

sponding randomised limiting GREM overlaps q.

2. Independence #2: the reordered filtered limiting GREM ˜

Y is independent from the corre-

sponding reordered weights ˜

ξ.

3. The change of measure: given I b

, let µI(·|q)be the conditional distribution of {Y(i,1)}i∈I,

and ˜

µI(·|q)be the conditional distribution of {˜

Y(i,1)}i∈Iboth conditional on q. Then

d˜

µI(·|q)

dµI(·|q)=

∏

k=0

∏

i∈I/∼

kexpxkf(xk+1,Y(i,xk+1))−f(xk,Y(i,xk)),

where the innermost product is taken over all equivalence classes on the index set I induced

by the equivalence ∼

4. The averaging property: the function f :[0;1]×

→

satisfies (1.36), if and only if

{ξ(α)exp(f(t,Y(α;t)))}α∈A,˜qt∼{ξ(α)exp(f(s,Y(α;s)))}α∈A,˜qs,

for all s,t∈[0;1].

See, e.g., Arguin (2007); Bolthausen & Sznitman (1998) for a proof.

Proposition 2.2.1 (expected mutial overlap distribution, Bolthausen & Sznitman (1998); Ruelle

(1987)).For any k ∈[1;n+1]∩

, we have

hN(ξ)⊗N(ξ)n(α(1),α(2))∈A2:qL(α1,α2)≤koi=xk.

See Bolthausen & Sznitman (1998); Ruelle (1987) for a proof.

42 2 Often used tools and some existing results

Some applications

We would like to point out that the Poisson-Dirichlet distribution and Bolthausen-Sznitman

coalescents appear in various applied contexts: in combinatorial stochastic structures, see, e.g.,

Arratia et al. (2003); Basdevant (2006); Bertoin (2006); Pitman (2006), in statistics, see, e.g.,

Teh et al. (2006). The structures are recently proved to emerge as equilibrium distributions for

certain systems of interacting diffusions, see, e.g., Chatterjee & Pal (2007).

2.3 Some results on Gaussian mean-field spin-glasses

2.3.1 The GREM

Recall the definition of the GREM from Section 1.4.3 and the decomposition (1.30) in particular.

Let ρ∈Q0

n. Assume that the process {GREM(σ):σ∈ΣN}has ρas its order parameter.

Assume that ρsatisfies (1.27). Define, for j,k∈[1;n+1]∩

,j<k, the “slopes” Aj,kof the

discrete order parameter xas

Aj,k:=qk−qj−1

2log2(xk−xj−1).

Define, further, the increasing sequence of indices {Jl}m

l=0⊂[0;n+1]∩

as follows. Start from

J0:=0, and define iteratively

Jl:=minnJ∈[Jl−1;n+1]∩

:AJl−1,J>AJ+1,k,for all k>Jo.

This subsequence of indices induces the following coarse-graining of the initial GREM. Define

the following (possibly) “coarse-grained” parameters

¯ql:=qJl−qJl−1,

¯xl:=xJl−xJl−1,

γl:=s¯ql

(2log2)¯xl

Recall (2.12) and define the GREM scaling function uJ,N:

→

uJ,N(x):=

∑

l=1q(2log2)N¯xl¯ql−1

2√N¯

γllog(2log2)¯xl+x

N−1/2.

Define the rescaled GREM process as

GREM(σ):=u−1

J,N(GREM(σ)).(2.32)

Remark 2.3.1. Note that (2.32) is compatible with (2.24).

Define the point process of the rescaled GREM energies ENas

EN:=∑

σ∈ΣN

δGREM(σ).

2.3 Some results on Gaussian mean-field spin-glasses 43

The limiting object we are about to define now is a “logarithm” of the RPC point process. Given

K∈

+, assume that the point process P(1)(K)on

satisfies

P(1)(K)∼PPP(Kexp(−x)dx).

Consider the following collection of independent point processes

{P(k)

α1,...,αl−1(K)|α1,...,αl−1∈

;l∈[1;m]∩

}

such that

P(k)

α1,...,αk−1(K)∼P(1)(K).

Given K∈

+, define the limiting GREM point process Pm(K)on

mas follows

Pm(K):=∑

α∈Am

δ(P(1)(K1;α1),P(2)

α1(K2;α2),...,P(m)

α1,α2,...,αm−1(Km;αm)).

Now we are ready to state the following generalisation of Theorem 2.2.4.

Theorem 2.3.1 (the limit of the GREM point process).Assume that the “slopes” {γl}lform a

decreasing sequence, i.e., for all l ∈[1;m]∩

, we have

γl>γl+1.

Then there exists some K =K(x)∈(0;1]msuch that

EN−−→

N↑∞Z

mδγ1e1+...+γmemPm(K;de1,...,dem).

Moreover, Kl=1if and only if, for all x ∈(xl−1;xl), we have

[0;1]23(x,ρ(x)) /∈∂convΓ(ρ),

where Γ(ρ)is the sub-graph of the function ρ, i.e.,

Γ(ρ):=(x0,ρ(x)) ∈[0;1]2:x0≤x,x∈[0;1].

See (Bovier & Kurkova, 2004a, Theorem 1.5) for a proof.

Remark 2.3.2. Theorem 2.2.4 is a simple particular case of Theorem 2.3.1.

Theorem 2.3.1 allows for a complete characterisation of the limiting distribution of the

GREM’s partition function. To formulate the result, we need the β-dependent threshold l(β)∈

[0;m]∩

such that above this threshold (l>l(β)) all coarse grained levels lof the limiting

GREM are in a “frozen state” (“low temperature regime”). Below this threshold (l≤l(β)) the

levels are in the “high temperature regime”. Namely, define

l(β):=max{l∈[1;m]∩

:γlβ>1},

and let l(β):=0, if ¯

γ1β≤1.

44 2 Often used tools and some existing results

Theorem 2.3.2 (the limiting distribution of the GREM partition function ).Given β∈

+, there

exists the constant C(β;x)∈(0;1]such that one of the following two cases holds.

1. If l(β) = 0, then

ZN(β)

2Nexp(β2N/2)

−−−→

N↑+∞C(β,x).

2. If l(β)>0, then

exp l(β)

∑

l=1−βNq(2log2)¯ql¯xl+β¯

γllog((8log2)πN¯xl)

−N∑

k=Jl(β)+1β2qk/2+(2log2)¯xl

ZN(β)w

−−−→

N↑+∞

C(β,x)Z

l(β)exphβ¯

γ1x1+¯

γ2x2+...+¯

γl(β)xl(β)i

×Pl(β)(K(x);de1,...,del(β)).

Moreover, C(β;x) = 1, if and only if β¯

γl(β)+1<1.

See (Bovier & Kurkova, 2004a, Theorem 1.7) for a proof.

2.3.2 The SK model with multidimensional spins

Mean-field spin-glass models with multidimensional (Heisenberg) spins were considered in

the theoretical physics literature, see, e.g., Sherrington (2007) and references therein. Rigorous

results are, however, rather scarce. An early example is the result of Fr¨

ohlich & Zegarli´

nski

(1987), where the bounds on the free energy in the high temperature regime are obtained. Meth-

ods of stochastic analysis and large deviations were used by Toubol (1998) to identify the limit-

ing distribution of the partition function and also to obtain some information about the geometry

of the Gibbs measure for small β. More recent treatments of the high temperature regime using

very different methods are due to Talagrand (2000a), see also (Talagrand, 2003, Section 2.13).

The importance of the SK model with multidimensional spins for understanding the ultrametric-

ity of the original model of Sherrington & Kirkpatrick (1975) (which corresponds to d=1 and µ

being the Rademacher measure in the above notations) was emphasised by Talagrand (2007b).

The most advanced recent study of spin-glass models with multidimensional spins was at-

tempted by Panchenko & Talagrand (2007b), where the multidimensional spherical spin-glass

model was considered. The authors combined the techniques of Panchenko (2005b); Talagrand

(2006b) to obtain partial results on the ultrametricity and also get some information on the local

free energy for their model.

Multidimensional spins with compact support

Equip

dwith the Euclidean norm. Assume Σ:=B(0,√d)⊂

d. Given σ(1),σ(2)∈ΣN, define

the matrix R(σ(1),σ(2))∈

d×dwith the following entries

R(σ(1),σ(2))u,v:=1

N∑

i=1

σ(1)

i,uσ(2)

i,v.

2.3 Some results on Gaussian mean-field spin-glasses 45

Theorem 2.3.3 (existence of the RS order parameter).There exists L >0such that if Lβd≤1,

then there exist matrices Q(1),Q(2)∈

d×dsuch that

hGN(β)hkR(σ,σ)−Q(2)k2

2ii≤K(d)

hGN(β)⊗GN(β)hkR(σ(1),σ(2))−Q(1)k2

2ii≤K(d)

See (Talagrand, 2003, Theorem 2.13.1) for a proof.

The following theorem provides additional information on the matrices Q(1),Q(2). Assume

Q(2)is a symmetric non-negative definite matrix. There exists the Gaussian family {Yu}d

u=1with

covariance

[YuYv] = β2Q(2)

u,v.

Define the function E:

d→

as follows

E(x):=exp√2hx,Yi+β2h(Q(2)−Q(1))x,xi.

Theorem 2.3.4 (representation of the RS order parameters).Assume that βLd ≤1. We have

Q(1)

u,v=

1

Z2ZxuE(x)dµ(x)ZxvEdµ(x),(2.33)

Q(2)

u,v=

1

ZZxuxvE(x)dµ(x),(2.34)

where

Z:=ZE(x)dµ(x).

Moreover,

p(β) =

logZΣ

E(x)dµ(x)−β2

2kQ(2)k2

2−kQ(1)k2

2.

See (Talagrand, 2003, Theorems 2.13.2 and 2.13.3) for proofs.

One-dimensional spins with compact support

Let d=1. Given u∈

+, let us generalise slightly the objects of Section 1.5.3, namely let

Q0(u)⊂[0;1][0;u]be the set of all non-decreasing right-continuous piece-wise constant func-

tions x:[0;u]→[0;1]with x(0) = 0, x(u) = 1, such that they have only a finite number of

jumps. Let also Q0

n(u)⊂Q0(u)be the subset of order parameters with exactly n∈

jumps.

Assume the real sequence {cN}N∈

satisfies cN−−−→

N↑+∞0. Let {HN(σ)}σ∈ΣNbe the family of

Gaussian processes such that, for all σ(1),σ(2)∈ΣN, we have



hHN(σ(1))HN(σ(2))i−ξR(σ(1),σ(2))≤cN,(2.35)

46 2 Often used tools and some existing results

where the function ξ:

→

, for all x>0, satisfies the following (symmetry and convexity)

conditions

ξ(0) = 0,ξ(x) = ξ(−x),ξ00(x)>0.

A particular example of the Hamiltonian satisfying (2.35) is Derrida’s p-spin interaction Hamil-

tonian (cf. (1.15), p∈

)

HN,p(σ):=p−1/2N−(p−1)/2N

∑

i1,...,ip=1

gi1,...,ipσi1···σip,(2.36)

where {gi1,...,ip}N

i1,...,ip=1are i.i.d. standard normal random variables.

Let [d;D]⊂

be the smallest interval such that

µσ2∈[d;D]=1.

Assume {εN}∞

N=1⊂

+is such that εN−−−→

N↑+∞0. Define the set of all configurations with ap-

proximately the same self-overlap

UN(u):=σ∈ΣN:|R(σ,σ)−u|≤εN.

Define the corresponding local free energy

pN(u,εN):=1

logZN(u,εN),

where

ZN(u,εN):=ZUN(u)

expHN(σ)dµ⊗N(σ).

Given u∈[d;D], consider an arbitrary x∈Q0

n(u)and λ∈

. Consider, further, the family of

independent Gaussian random variables {zk}n

k=0such that

Varzk=ξ0(qk+1)−ξ0(qk).

Define the sequence of functionals {Xk:Q0

n(u)×

→

}n+1

k=0iteratively as follows. Start from

Xn+1(x,λ):=logZΣexp σ

∑

k=0

zn+λσ2!dµ(σ),

and continue, for k∈{n,n−1,...,0}, recursively

Xk(x,λ):=1

log

expxkXk+1(x,λ).

Note that the functional X0(x,λ)is deterministic. Define the functional P:Q0

n(u)×

→

follows

2.3 Some results on Gaussian mean-field spin-glasses 47

P(x,λ):=−λu+X0(x,λ)−1

∑

k=1

xkθ(qk+1)−θ(qk),

where

θ(q):=qξ0(q)−ξ(q).(2.37)

Define the local Parisi free energy as

P(ξ,u):=inf

(x,λ)∈Q0(u)×

P(x,λ).

Finally, consider the (global) Parisi free energy

P(ξ):=sup

u∈[d;D]

P(ξ,u).

Theorem 2.3.5 (the saddle point Parisi formula).Given u ∈[d;D], there exists the vanishing

sequence {εN}N∈

such that

lim

N→+∞pN(u,εN) = P(ξ,u).

Moreover,

p(β) = P(ξ).

See (Panchenko, 2005b, Theorems 1 and 2) for proofs.

Multiple spherical spin-glass models

In this paragraph, deviating from the rest of the thesis, we shall consider the model with spins

σ∈ΣN, where ΣNis a non-product space (namely, a suitably chosen Euclidean sphere). Con-

sider the Gaussian Hamiltonian of the spherical spin-glass model1{HN(σ)}σ∈S(0,√N)satisfying

condition (2.35). Recall that (2.36) is a particular example of this situation. Define the free en-

ergy of the spherical spin-glass model as follows

pN(β,h):=1

logZS(0,√N)

expβHN(σ)+h

∑

i=1

σidλN(σ),(2.38)

where λN∈M1S(0,√N)is the uniform distribution on S(0,√N).

Consider x∈Q0

n, where Q0

nis defined in (1.26). The limiting free energy for (2.38) was

computed by Crisanti & Sommers (1992) and then the computation was made rigorous by

Talagrand (2006a) using the methods of Talagrand (2006b).

Given a real parameter b>1, define the sequences {dl}n

l=1and {Dl}n

l=1as

dk:=

∑

k=l

xkξ0(qk+1)−ξ0(qk),

Dl:=b−dl.

1Sometimes also referred to as “the spherical SK model”

48 2 Often used tools and some existing results

Define the Parisi functional P(β,h):Q0

n×(1;+∞)→

for the spherical spin-glass model as

P(β,h;x,b):=1

2 b−1−logb+1

(h2+ξ0(q1))+

∑

k=1

log Dl+1

−

∑

k=1

xlθ(ql+1)−θ(ql)!.

(See (2.37) for the definition of θ:[0;1]→

.) Also let us introduce the Crisanti-Sommers

functional C S (β,h):Q0

n→

as follows. Set δk:=∑n

l=kxl(ql+1−ql)and define

C S (β,h;x):=1

2 h2δ1+q1

δ1

n−1

∑

k=1

log δk

δk+1

+logδn+

∑

k=1

xk(ξ(qk+1)−ξ(qk))!.

The following result is obtained by Talagrand (2006a).

Theorem 2.3.6 (the Parisi formula for the spherical spin-glass model).We have

p(β,h) = P(β,h):=inf

x∈Q0,b>1

P(β,h;x,b)

=inf

x∈Q0

C S (β,h;x).

See Talagrand (2006a) for a proof.

Now, we are ready to consider multiple copies of the spherical model defined above. Let

Q∈Sym(d)be the non-negative definite matrix with elements Qu,v∈[−1;1]and Qu,u=1, for

all u,v∈[1;d]∩

. Given ε>0, consider the following set of configurations having almost the

same self-overlap given by Q, namely

U(ε):=nσ∈S(0,√N)d:|R(σ,σ)u,v−Qu,v|≤ε,for all u,v∈[1;d]∩

Consider β1,...,βd>0 and h1,...,hd∈

. Denote β:={βu}d

u=1and h:={hu}d

u=1. Define the

local free energy on U(ε)of the dcopies of the spherical model as follows

p(d)

N(β,h,U(ε)) :=1

"ZU(ε)

exp d

∑

u=1

βuHN(σ·,u)+

∑

u=1hu

∑

u=1

σi,u!#.

It is easy to see that

p(d)

N(β,h,U(ε)) ≤

∑

u=1

pN(βu,hu)

which implies that

lim

N→∞p(d)

N(β,h,U(ε)) ≤

∑

u=1

P(βu,hu).

Consider the following inequality

2.3 Some results on Gaussian mean-field spin-glasses 49

lim

ε→+0lim

N→∞p(d)

N(β,h,U(ε)) <

∑

u=1

P(βu,hu).(2.39)

An interesting result of Panchenko & Talagrand (2007b) indicates that, in general, inequality

(2.39) does not hold, as one can infer from the following theorem.

Theorem 2.3.7 (local Parisi formula for the multidimensional spherical model (Panchenko &

Talagrand, 2007b)).Assume we are in the situation of the Hamiltonian (2.36) with p =2, i.e.,

the classical SK case. Suppose βu:=β>1, for all u ∈[1;d]∩

, and minru≥1, where {ru}d

u=1

are the eigenvalues of the self-overlap matrix Q.

Then

lim

ε→+0lim

N→+∞p(d)

N(β,0,U(ε)) = d·P(β,0).

See (Panchenko & Talagrand, 2007b, Theorem 2) for a proof.

2.3.3 The Aizenman-Sims-Starr comparison scheme for the SK model

The AS2scheme (Aizenman et al., 2003, 2007) gives an intrinsic way to obtain variational

upper bounds on the free energy in the SK model. The scheme is also based on a comparison

between two Gaussian processes. The first process is the sum of the original SK Hamiltonian

Xand a GREM-inspired process indexed by additional index space A:=

n. The second one

is another GREM-inspired process indexed by the extended configuration space ΣN×A. The

scheme uses a comparison functional defined on Gaussian processes indexed by the extended

configuration space equipped with the product measure between the original a priori measure

and Ruelle’s probability cascade. The role of the comparison functional in the AS2scheme is

played by a free energy functional acting on the Gaussian processes indexed by the extended

configuration space.

It is interesting to note that Panchenko & Talagrand (2007a) have reexpressed Guerra’s

scheme for the SK model using the RPC.

Increments of the free energy of the SK model

Suppose HN(σ)is the SK Hamiltonian given by (1.19) and PN(β,h)is given by (1.4).

Theorem 2.3.8 (superadditivity of the free energy).For any N,M∈

PN(β,h)+PM(β,h)≤PN+M(β,h).(2.40)

See Guerra & Toninelli (2002) for a proof.

Theorem 2.3.8 immediately gives (by Theorem 2.1.2) the existence of the thermodynamic

limit (cf. Question 0.0.1). The next theorem builds upon more subtle consequences of superad-

ditivity, namely on relation (2.10).

Theorem 2.3.9 (incremental representation of the free energy).We have

lim

N→∞lim

M→∞

logZN+M(β,h)

ZN(β,h)=p(β,h).

See, e.g., (Aizenman et al., 2007, Corollary 3.5) for a proof.

50 2 Often used tools and some existing results

Remark 2.3.3. Note that the free energy pN(β,h)is sharply concentrated around its expec-

tation – a fact that was first established by a martingale argument by Pastur & Shcherbina

(1991) and then made more precise by the general machinery of concentration of measure, see,

e.g., (Talagrand, 2003, Corollary 2.2.5). Any of these two facts together with Theorem 2.3.8 is

sufficient to conclude that

pN(β,h)−−−→

N↑+∞p(β,h),almost surely and in L1.

Let {C(α)}α∈ΣM,{B(α)}α∈ΣM, and {A(α,σ)}σ∈ΣN,α∈ΣMbe three independent Gaussian

processes with the following correlation structures2

CovhC(α(1)),C(α(2))i=M

N+MRM(α(1),α(2))2

CovhA(σ(1),α(1)),A(σ(2),α(2))i=2N

N+MRN(σ(1),σ(2))RM(α(1),α(2)),

CovhB(α(1)),B(α(2))i=N

N+MRM(α(1),α(2))2

As a short but crucial computation shows, Theorem 2.3.9 implies the following result.

Theorem 2.3.10 (second incremental representaion of the free energy).We have

lim

N→∞lim

M→∞



log

∑(σ,α)∈ΣN×ΣMη(α)expβ√MA(σ,α)+h∑N

i=1σi

∑α∈ΣNη(α)expβ√MB(α)





=p(β,h),(2.41)

where

η(α):=expβ√MC(α)+h

∑

i=1

σi.(2.42)

See, e.g., (Bovier, 2006, Section 11.3.1) or (Aizenman et al., 2007, Theorem 4.1) for a proof.

Theorem 2.3.10 suggests that in (2.41) instead of (2.42) one can consider arbitrary random

weights η:={η(α)}α∈A, where Ais a countable set and ∑α∈Aη(α)<∞almost surely.

Assume also that the Gaussian processes A:={A(σ,α)}σ∈ΣN

α∈A

and B:={B(α)}α∈Ahave the

following correlation structures

CovhA(σ(1),α(1)),A(σ(2),α(2))i=2RN(σ(1),σ(2))q(α(1),α(2)),

CovhB(α(1)),B(α(2))i=q(α(1),α(2))2,(2.43)

where q:={q(α(1),α(2))}α(1),α(2)∈Ais the non-negative definite kernel with q(α,α) = 1, for

all α∈A. Assume, further, that the following functional

GN(β,h,η,q):=1

"∑(σ,α)∈ΣN×Aη(α)expβ√NA(σ,α)+h∑N

i=1σi

∑α∈Aη(α)expβ√NB(α)#

is well defined.

2It is easy prove the existence of such processes.

2.3 Some results on Gaussian mean-field spin-glasses 51

Theorem 2.3.11 (the AS2variational bound).For any N ∈

and any η, q as above, we have

pN(β,h)≤GN(β,h,η,q).

Moreover,

GN(β,h,η,q)−pN(β,h)

=β2

2Z1

G(β,h,t,η,q)⊗G(β,h,t,η,q)q(α(1),α(2))−R(σ(1),σ(2))2dt,

where G(β,h,t,η,q)∈M1ΣN×Ais the Gibbs measure induced by the Hamiltonian H(t):

Σ×A→

defined as follows

H(t;σ,α):=√t(HN(σ)+B(α))+√1−tA(σ,α)+h

∑

i=1

σ,

where t ∈[0;1]. More precisely, for a measurable f :ΣN×A→

, we have

G(β,h,t,η,q)[ f] = 1

Z∑

(σ,α)∈Σ×A

f(σ,α)η(α)expβ√NH(t;σ,α),

where Z is the usual probabilistic normalisation factor.

See (Aizenman et al., 2007, Theorem 4.1 ) for a proof.

Comparison with the GREM, relation with the Parisi functional

Consider the following comparison functional

Φ(η)[T]:=



log ∑

(σ,α)∈ΣN×A

η(α)expβ√NT (σ,α)+h

∑

i=1

σ

,(2.44)

where T:={T(σ,α)}σ∈ΣN

α∈A

is an arbitrary Gaussian process such that (2.44) exists.

We then obviously have

GN(β,h,η,q) = Φ(η)[A]−Φ(η)[B].

Given x∈Q0

n, let η:=ξ(x1,...,xn)be the corresponding RPC. Let, further, q=q(x)be the

limiting GREM overlap generated by x(cf. (2.28)). Let A=A(q)and B=B(q)be the corre-

sponding Gaussian processes with covariances given by (2.43).

Theorem 2.3.12 (comaprison with the GREM, relation with the Parisi functional).Let the func-

tion fx(q,y):[0;1]×

→

be the solution of (1.36).

Then we have

Φ(η)[A] = fx(0,0),

Φ(η)[B] = β2

2Z1

qx(q)dq,

and consequently

GN(β,h,η,q) = P(β;x).

See, e.g., (Aizenman et al., 2007, Lemma 6.2) for a proof.

Note that Theorems 2.3.12 and 2.3.11 immediately imply the upper bound in (1.37), i.e.,

p(β,h)≤inf

x∈Q0

P(β,x).(2.45)

The Aizenman-Sims-Starr scheme for the SK model with

multidimensional spins

In this chapter, we are mainly concerned with the question of the validity of the Parisi formula

in the case where spins take values in a d-dimensional Riemannian manifold. We address the

issue of extending the approach of Aizenman, Sims and Starr to the multidimensional spins.

3.1 Introduction

The recent rigorous proof of the celebrated Parisi formula for the free energy of the SK model,

due to Talagrand (2006b), based on the ingenious interpolation schemes of Guerra (2003) and

Aizenman et al. (2003) constitutes one of the major recent achievements of probability theory.

Recently, these results have been generalised to spherical SK-models (Talagrand, 2006a) and to

models with spins taking values in a bounded subset of

(Panchenko, 2005b).

A particular case (d=1, µwith bounded support) of the model we are considering here was

treated by Panchenko (2005b). He used the techniques of Talagrand (2006b) to prove that in the

case d=1 upper and lower bounds on the free energy coincide (cf. (3.13) and (3.20) in this

chapter). However, the results of (Panchenko, 2005b, Section 5 and the proofs of Theorems 2,

5 and 9) are based on relatively detailed differential properties of the optimal Lagrange multi-

pliers in the saddle point optimisation problem of interest. These properties are harder to obtain

in multidimensional situations such as that we are dealing with here. In fact, as we show in

Theorems 3.1.1 and 3.1.2, one can obtain the same saddle point variational principles without

invoking the detailed properties of the optimal Lagrange multipliers. This is achieved using a

quenched LDP of the G¨

artner-Ellis type.

Definition of the model

We refer to the introduction of this thesis for the definition of the SK model with multidimen-

sional spins. (See, in particular, (0.8) and (0.9).)

Throughout the chapter we assume that we are given a large enough probability space

(Ω,F,

)such that all random variables under consideration are defined on it. Without further

notice we shall assume that all Gaussian random variables (vectors and processes) are centred.

We shall be interested mainly in the free energy

pN(β):=1

NlogZΣN

expβ√NX(σ)dµ⊗N(σ),(3.1)

where β≥0 is the inverse temperature and µ∈Mf(Σ)is some arbitrary (not necessarily uni-

form or discrete) finite a priori measure. We assume that the a priori measure µis such that

54 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

(3.1) is finite. We shall be interested in proving bounds on the thermodynamic limits of these

quantities, e.g., on

p(β):=lim

N↑+∞pN(β).(3.2)

Remark 3.1.1. Note that there is no need to include the additional external field terms into the

Hamiltonian (0.8), since they could be absorbed into the a priori measure µ.

Main results

Below we introduce the notations, assumptions and formulate the main results of this chapter.

Assumption 3.1.1. Suppose that the configuration space Σis bounded and such that 0∈

intconvΣ, where convΣdenotes the convex hull of Σ.

The examples listed below verify this assumption:

1. Multicomponent Ising spins. Σ={−1;1}d– the discrete hypercube.

2. Heisenberg spins. Σ=σ∈

d:kσk2=1– the unit Euclidean sphere.

3. Σ=σ∈

d:kσk2≤1– the unit Euclidean ball.

Remark 3.1.2. The boundedness assumption can be relaxed and replaced by concentration

properties of the a priori measure. In Section 5.2 we will exemplify this in the case of a Gaussian

a priori distribution. In general a subgaussian distribution will suffice.

Consider the space of all symmetric matrices Sym(d):=Λ∈

d×d|Λ=Λ∗. Denote

Sym+(d):={Λ∈Sym(d)|Λ0},

where the notation Λ0 means that the matrix Λis non-negative definite. We equip the space

Sym(d)with the Frobenius (Hilbert-Schmidt) norm

kMk2

F:=

∑

u,v=1

u,v,M∈Sym(d).

We shall also denote the corresponding (tracial) scalar product by h·,·i. For r>max{kσk2

σ∈Σ}, define

U:=U∈Sym(d)|U0,kUk2≤r.

We will call the set Uthe space of the admissible self-overlaps. In analogy to the usual overlap

in the standard SK model, we define, for two configurations, σ(i)= (σ(i)

1,σ(i)

2,...,σ(i)

N)∈ΣN,

i=1,2, the (mutual) overlap matrix RN(σ(1),σ(2))∈

d×dwhose entries are given by

RN(σ(1),σ(2))u,v:=1

∑

i=1

σ(1)

i,uσ(2)

i,v,u,v∈[1;d]∩

.(3.3)

Fix an overlap matrix U ∈U. Given a subset V⊂U, define the set of the local configurations,

3.1 Introduction 55

ΣN(V):=σ∈ΣN|RN(σ,σ)∈V.

Next, define the local free energy

pN(V):=1

NlogZΣN(V)

eβ√NX(σ)dµ⊗N(σ).(3.4)

We also define

p(V):=p(β,V):=lim

N↑+∞pN(V),(3.5)

where the existence of the limit follows from a result of (Guerra & Toninelli, 2003, Theorem 1).

Consider a sequence of matrices Q:={Q(k)∈Sym(d)}n+1

k=0such that

0=:Q(0)≺Q(1)≺...≺Q(n+1):=U,(3.6)

where the ordering is understood in the sense of the corresponding quadratic forms. Consider

in addition a partition of the unit interval x:={xk}n+1

k=0, i.e.,

0=:x0<x1< ... < xn+1:=1.(3.7)

Let {z(k)}n

k=0be a sequence of independent Gaussian d-dimensional vectors with

Covhz(k)i=Q(k+1)−Q(k).

Given Λ∈Sym(d), define

Xn+1(x,Q,U,Λ):=logZΣexp√2βn

∑

k=0

zk,σ+hΛσ,σidµ(σ).(3.8)

Define, for k∈{n,...,0}, by a descending recursion,

Xk(x,Q,U,Λ):=1

log

z(k)expxkXk+1(x,Q,U,Λ) (3.9)

with

X0(x,Q,U,Λ):=

z(0)X1(x,Q,U,Λ),(3.10)

where

z(k)[·]denotes the expectation with respect to the σ-algebra generated by the random

vector z(k).

Remark 3.1.3. Section 4.1.4 contains the more general framework of dealing with the recursive

quantities (3.10) which in particular brings to light the links with certain non-linear parabolic

PDEs. For these PDEs the recursion (3.1.2) is closely related to an iterative application of the

well-known Hopf-Cole transformation, see, e.g., Evans (1998).

Define the local Parisi functional as

f(x,Q,U,Λ):=−hΛ,Ui−β2

∑

k=1

xkkQ(k+1)k2

F−kQ(k)k2

F+X0(x,Q,U,Λ).(3.11)

56 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

Assumption 3.1.2 (Hadamard squares).We shall say that a sequence, {Q(i)}n

i=1, of matrices

satisfies Assumption 3.1.2, if

Q(1)2≺...≺Q(n)2≺Q(n+1)2.(3.12)

Remark 3.1.4. The above assumption on the matrix order parameters Qis necessary only

to employ the AS2scheme. In contrast, Guerra’s scheme (Theorems 4.1.1 and 4.16) does not

require the above assumption.

One may verify that the matrices qand ρin (Talagrand, 2003, Theorems 2.13.1 and 2.13.2)

correspond to the matrices Q(1)and Q(2)of this chapter (n=1). (See also (3.23) below.) Further-

more, a straightforward application of the Cauchy-Schwarz inequality shows that the matrices

qand ρactually satisfy Assumption 3.1.2. We also note that in the simultaneous diagonalisa-

tion scenario in which the matrices in (3.6) are diagonalisable in the same orthogonal basis (see

Sections 4.2.3 and 5.2.2) this assumption is also satisfied.

The first main result of the present chapter uses the AS2scheme to establish the upper bound

on the limiting free energy p(β)in terms of the saddle point problem for the local Parisi func-

tional (3.11).

Theorem 3.1.1. For any closed set V⊂Sym(d), we have

p(V)≤sup

U∈V∩U

inf

(x,Q,Λ)f(x,Q,Λ,U),(3.13)

where the infimum runs over all x satisfying (3.7), all Qsatisfying both (3.6) and Assump-

tion 3.1.2, and all Λ∈Sym(d).

We were not able to prove in general that the r.h.s. of (3.13) gives also the lower bound to

the thermodynamic free energy. See, however, Theorem 5.1.1 for a positive example.

To formulate the lower bound on (3.2) we need some additional definitions.

Let the comparison index space be A:=

n. Given α(1),α(2)∈A, define

Q(α(1),α(2)):=Q(qL(α(1),α(2))),(3.14)

where qL(α(1),α(2))is defined in (0.2) Given a d×d-matrix Mand p∈

, we denote by Mp

the d×d-matrix with entries

Mpu,v:= (Mu,v)p.

The matrix valued lexicographic overlap (3.14) can be used to construct the multidimensional

(d≥1) versions of the GREM (see, e.g., Bovier & Kurkova (2007) and references therein

for a review of the results on the one-dimensional case of the model). Here we shall need the

following two GREM-inspired real-valued Gaussian processes: A:={A(σ,α)}σ∈ΣN,α∈Aand

B:={B(α)}α∈Awith covariance structures

hA(σ(1),α(1))A(σ(2),α(2))i=2hR(σ(1),σ(2)),Q(α(1),α(2))i,

hB(α(1))B(α(2))i=kQ(α(1),α(2))k2

3.1 Introduction 57

Note that the process Acan be represented in the following form:

A(σ,α) = 2

N1/2N

∑

i=1hAi(α),σii,(3.15)

where {Ai:={Ai(α)}α∈A}N

i=1are the i.i.d. (for different indices i) Gaussian

d-valued pro-

cesses with the following covariance structure: for i∈[1;N]∩

, for all α(1),α(2)∈Aand all

u,v∈[1;d]∩

assume that the following holds

hAi(α(1))uAi(α(2))vi=Q(α(1),α(2))u,v.

Given t∈[0;1], we define the interpolating AS2Hamiltonian

Ht(σ,α):=√t(X(σ)+B(α))+√1−tA(σ,α).(3.16)

Next, we define the random probability measure πN∈M1(ΣN×A)through

πN:=µ⊗N⊗ξ,

where ξ=ξ(x)is the RPC. We denote by {ξ(α)}α∈Athe enumeration of the atom locations

of the RPC and consider the enumeration as a random measure on A(independent of all other

random variables around). Define the local AS2Gibbs measure GN(t,x,Q,U,V)by

GN(t,x,Q,U,V)[ f]:=1

ZN(t,V)ZΣN(V)×A

f(σ,α)e√NβHt(σ,α)dπN(σ,α),(3.17)

where f:ΣN×A→

is an arbitrary measurable function for which the right-hand side of

(3.17) is finite. For V⊂U, define the AS2remainder term as

RN(x,Q,U,V)

:=−1

2Z1

hGN(t,x,Q,U,V)⊗GN(t,x,Q,U,V)hkRN(σ(1),σ(2))−Q(α(1),α(2))k2

Fiidt.

(3.18)

We define also the limiting AS2remainder term

R(x,Q,U):=lim

ε↓+0lim

N↑∞

RN(x,Q,B(U,ε)) ≤0,(3.19)

where B(U,ε)is the ball with centre Uand radius ε. (The existence of the limiting remainder

term is proved in Theorem 3.1.2.)

The second main result of this chapter uses the AS2scheme to establish a lower bound

on (3.2) in terms of the same saddle point Parisi-type functional as in the upper bound which

includes, however, the non-positive remainder term (3.19). In one-dimensional situations Tala-

grand (2006b) and Panchenko (2005b), respectively, have shown that the corresponding error

term vanishes on the optimiser of the Parisi functional.

58 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

Theorem 3.1.2. For any open set V⊂Sym(d), we have

p(V)≥sup

U∈V∩U

inf

(x,Q,Λ)[f(x,Q,Λ,U)+R(x,Q,U)],(3.20)

where the infimum runs over all x satisfying (3.7), all Λ∈Sym(d), and all Qsatisfying both

(3.6) and Assumption 3.1.2.

Remark 3.1.5. The comparison scheme of Guerra (2003) (see also more recent accounts Tala-

grand (2007a), Guerra (2005) and Aizenman et al. (2007)) is also applicable to our model and

is covered by our quenched LDP approach, see Theorems 4.1.1 and 4.16 for the formal state-

ments. Guerra’s scheme seems to be more amenable (compared to the Aizenman-Sims-Starr

one) for Talagrand’s remainder estimates (Talagrand, 2006b), see Section 4.3. The scheme is

based on the following interpolation

Ht(σ,α):=√tX(σ)+ √1−tA(σ,α)(3.21)

which induces the corresponding local Gibbs measure (3.17) and remainder term (3.18) by

substituting (3.16) with (3.21). Guerra’s scheme does not include the process B and, hence,

does not require Assumption 3.1.2. Recovering the terms corresponding to ΦN(x,U)[B](see,

(3.105)) in the Parisi functional requires then a short additional calculation (Lemma 4.1.1).

Note that the results of (Talagrand, 2003, Theorems 2.13.2 and 2.13.3) imply that at least in

the high temperature region (i.e., for small enough β) the Parisi formula for the SK model with

multidimensional spins is valid with n=1

p(β) = f(x,Q∗,0,U∗) = sup

U∈U

inf

(Q,Λ)f(x,Q,Λ,U),(3.22)

where the matrices Q∗(2)=U∗and Q∗(1)solve the following system of equations:







∂Q(2)

u,v

f(x,Q∗,0,U∗) = 0,u,v∈[1;d]∩

∂Q(1)

u,v

f(x,Q∗,0,U∗) = 0,u,v∈[1;d]∩

.(3.23)

Note that the system (3.23) coincides with the mean-field equations obtained in Theorem 2.3.4.

3.2 Some preliminary results

3.2.1 Covariance structure

Our definition of the overlap matrix in (3.3) is motivated by the fact that, as can be seen from a

straightforward computation

hXN(σ(1))XN(σ(2))i=

∑

u,v=1RN(σ(1),σ(2))u,v2=kRN(σ(1),σ(2))k2

2,(3.24)

that is, the the covariance structure of the process XN(σ)is given by the square of the Frobenius

(Hilbert-Schmidt) norm of the matrix RN(σ(1),σ(2)). The basic properties of the overlap matrix

are summarised in the following proposition.

3.2 Some preliminary results 59

Proposition 3.2.1. We have, for all σ(1),σ(2),σ∈ΣN,

1. Matrix representation. RN(σ(1),σ(2)) = 1

Nσ(1)∗σ(2).

2. Symmetry #1. Ru,v

N(σ(1),σ(2)) = Rv,u

N(σ(2),σ(1)).

3. Symmetry #2. Ru,v

N(σ,σ) = Rv,u

N(σ,σ).

4. Non-negative definiteness #1. RN(σ,σ)0.

5. Non-negative definiteness #2.

RN(σ(1),σ(1))RN(σ(1),σ(2))

RN(σ(1),σ(2))∗RN(σ(2),σ(2))0.

6. Suppose U :=RN(σ(1),σ(1)) = RN(σ(2),σ(2)), then

kR(σ(1),σ(2))k2

F≤kUk2

Proof. The proof is straightforward. ut

3.2.2 Concentration of measure

The following concentration of measure result for the free energy is standard.

Proposition 3.2.2. Let (Σ,S)be a Polish space. Suppose µis a random finite measure on Σ.

Suppose, moreover, that X(σ),σ∈Σis the family of Gaussian random variables independent

of µwhich possesses a bounded covariance, i.e.,

there exists K >0such that sup

σ(1),σ(2)∈Σ|Cov(X(σ(1)),X(σ(2)))|≤K.(3.25)

Assume that

f(X):=logZΣeX(σ)dµ(σ)<∞.

Then

{|f(X)−

[f(X)]|≥t}≤2exp−t2

4K.

Remark 3.2.1. An analogous result was given in a somewhat more specialised case in Panchenko

(2005b).

Proof. This is an adaptation of the proof of (Talagrand, 2003, Theorem 2.2.4). We can not apply

the comparison Theorem 3.2.5 directly, so we resort to the basic interpolation argument as stated

in Proposition 2.1.1. For j=1,2, let the processes Xj(·)be the two independent copies of the

process X(·). For t∈[0;1], let

Xj,t:=√tXj+√1−tX

and

Fj(t):=logZΩexpXj,t(σ)dµ(σ).

60 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

For s∈

, let

ϕs(t):=

exps(F1−F2).

Hence, differentiation gives

ϕs(t) = s

exps(F1−F2)(˙

F1−˙

F2)(3.26)

(the dots indicate the derivatives with respect to t) and also

Fj(t) =1

2ZΣexpXj,t(σ)dµ(σ)−1

×ZΣt−1/2Xj(σ)−(1−t)−1/2X(σ)expXj,t(σ)dµ(σ).(3.27)

Now, we substitute (3.27) back to (3.26) and apply Corollary 2.1.1 to the result. After some

tedious but elementary calculations we get

ϕs(t) =s2

"exps(F1−F2)ZΣexpX1,t(σ)dµ(σ)ZΣexpX2,t(σ)dµ(σ)−1

ZΣCov(X(σ(1)),X(σ(2)))expX1,t(σ(1))+ X2,t(σ(2))dµ(σ(1))dµ(σ(2)).

Thus, thanks to (3.25), we obtain

ϕs(t)≤Ks2ϕs(t).

The conclusion of the theorem follows now exactly as in the proof of (Talagrand, 2003, Theo-

rem 2.2.4).

We now apply this general result to the our model and also to the free energy-like functional

of the GREM-inspired process A.

Proposition 3.2.3. Suppose Σ⊂B(0,r), for r >0. For Ω⊂ΣN, denote

PSK

N(β,Ω):=logZΩexp√NβXN(σ)dµ⊗N(σ),

and

PGREM

N(β,Ω):=logZΩ×A

expβ√2

∑

i=1hAi(α),σiidπN(σ,α).

Then, for all Ω⊂ΣN, we have

1. For any t >0,

PSK

N(β,Ω)−

PSK

N(β,Ω)>t≤2exp−t2

4β2r4N.(3.28)

3.2 Some preliminary results 61

2. For any t >0,

PGREM

N(β,Ω)−

PGREM

N(β,Ω)>t≤2exp−t2

8β2r4N.(3.29)

Proof. 1. We would like to use Proposition 3.2.2. By (3.24) and the Cauchy-Bouniakovsky-

Schwarz inequality, we have, for all N∈

,σ(1),σ(2)∈ΣN, that

Cov(XN(σ(1),σ(2))) = kRN(σ(1),σ(2))k2

F=1

∑

i,j=1hσ(1)

i,σ(1)

jihσ(2)

i,σ(2)

ji≤r4.(3.30)

Hence, for all N∈

and all subsets Ωof ΣN, we obtain

sup

σ(1),σ(2)∈Σ|Cov(X(σ(1)),X(σ(2)))|≤r4.

Thus (3.28) is proved.

2. We fix an arbitrary N∈

,σ(1),σ(2)∈ΣN,α(1),α(2)∈A. We have

Cov(A(σ(1),α(1)),A(σ(2),α(2))) =

hA(σ(1),α(1))A(σ(2),α(2))i

∑

i=1hQ(α(1),α(2))σ(1)

i,σ(2)

ii.

Bound (3.30) implies that, for any U∈U, we have kUk2≤r2. Since Q(α(1),α(2))∈U,

we obtain

|hQ(α(1),α(2))σ(1)

i,σ(2)

ii|≤kQ(α(1),α(2))k2kσ(1)

ik2kσ(2)

ik2

≤kQ(α(1),α(2))k2r2≤r4.

Therefore, using Proposition 3.2.2, we obtain (3.29).

3.2.3 Gaussian comparison inequalities for free energy-like functionals

We begin by recalling well-known integration by parts formula which is the source of many

comparison results for functionals of Gaussian processes.

The following proposition connects the computation of the derivative of the free energy with

respect to the parameter that linearly occurs in the Hamiltonian with a certain Gibbs average for

a replicated system.

Proposition 3.2.4. Consider a Polish measure space (Σ,S)and a random measure µon it. Let

X={X(σ)}σ∈Σand Y :={Y(σ)}σ∈Σbe two independent Gaussian real-valued processes. For

u∈

, we define

Hu(σ):=uX(σ)+Y(σ).

Assume that, for all u ∈[a,b]b

, we have

62 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

Zexp(Hu(σ))dµ(σ)<∞,ZX(σ)exp(Hu(σ))dµ(σ)<∞

almost surely, and also that

logZexp(Hu(σ))dµ(σ)<∞.

Then we have

logZeHu(σ)dµ(σ)=u

[G(u)⊗G(u)[VarX(σ)−

[X(σ),X(τ)]]],

where G(u)is the random element of M1(Σ)which, for any measurable f :Σ→

, satisfies

G(u)[ f] = 1

Z(u)Zf(σ)exp(Hu(σ))dµ(σ).

Proof. We write

dulogZeHu(σ)dµ(σ) = ZX(σ)eHu(σ)

Zu(β)dµ(σ),(3.31)

where Zu(β):=ReβHu(σ)dµ(σ). The main ingredient of the proof is the Gaussian integration

by parts formula. Denote, for τ∈Σ,e(τ):=

[X(σ)Hu(τ)]. By (2.3), we have

"X(σ)eHu(σ)

Zu(β)#=

"∂X eHu(σ)

ReHu(τ)dµ(τ)!(X;e)#.(3.32)

Due to the independence, we have

[X(σ)Hu(τ)] = u

[X(σ),X(τ)].

Henceforth, the computation of the directional derivative in (3.32) amounts to

∂

∂t"eHu(σ)+tuVar(σ)

ReHu(τ)+tuCov(σ,τ)dµ(τ)#

=ZeHu(σ)dµ(σ)−2uVarX(σ)eHu(σ)ZeHu(τ)dµ(τ)

−eHu(σ)ZuCov[X(σ),X(τ)]eHu(τ)dµ(τ).(3.33)

Substituting the r.h.s. of (3.33) into (3.31), we obtain the assertion of the proposition. ut

The following proposition gives a short differentiation formula, which is useful in getting

comparison results between the (free energy-like) functionals of Gaussian processes.

3.2 Some preliminary results 63

Proposition 3.2.5. Let (X(σ))σ∈Σ,(Y(σ))σ∈Σbe two independent Gaussian processes as be-

fore. Set

Ht(σ):=√tX(σ)+ √1−tY(σ).

Assume that

ZeHt(σ)dµ(σ)<∞,ZX(σ)eHt(σ)dµ(σ)<∞,

ZY(σ)eHt(σ)dµ(σ)<∞

almost surely, and also that, for all t ∈[0;1],

logZeHt(σ)dµ(σ)<∞.

Then we have

logZeX(σ)dµ(σ)=

logZeY(σ)dµ(σ)

−1

2Z1

G(t)⊗G(t)hVarX(σ(1))−VarY(σ(1))

−CovhX(σ(1)),X(σ(2))i−CovhY(σ(1)),Y(σ(2))iidt,(3.34)

where G(t)is the random element of M1(Σ)which, for all measurable f :Σ→

, satisfies

G(t)[ f] = 1

Z(t)ZΣ

f(σ)exp(Ht(σ))dµ(σ).(3.35)

Proof. Let us introduce the process

Wu,v(σ):=uX(σ)+vY(σ).

Hence,

Ht(σ) = W√t,√1−t(σ).(3.36)

Thus

logZeHt(σ)dµ(σ)=1

21

√t

∂

∂u

logZeWu,v(σ)dµ(σ)

−1

√1−t

∂

∂v

logZeWu,v(σ)dµ(σ)u=√t,v=√1−t

Applying Proposition 3.2.4 and R1

0·dtto the previous formula, we conclude the proof. ut

64 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

3.3 Quenched G¨

artner-Ellis type LDP

In this section, we derive a quenched LDP under measure concentration assumptions. Theo-

rems 3.3.1 and 3.3.2 give the corresponding LDP upper and lower bounds, respectively. The

proofs of the LDP bounds will be adapted to get the proofs of the upper and lower bounds

on the free energy of the SK model with multidimensional spins. However, they may be of

independent interest.

Note that the existing “level-2” quenched large deviation results of Comets (1989) are ap-

plicable only to a certain class of mean-field random Hamiltonians which are required to be

“macroscopic” functionals of the joint empirical distribution of the random variables represent-

ing the disorder and the independent spin variables. The SK Hamiltonian can not be repre-

sented in such form, since the interaction matrix consists of i.i.d. random variables. Moreover,

it is assumed in Comets (1989) that the Hamiltonian has the form HN(σ) = NV(σ), where

{V(σ)}σ∈ΣNis a random process taking values in some fixed bounded subset of

. Since the

Hamiltonian of our model is a Gaussian process, this assumption is also not satisfied, due to the

unboundedness of the Gaussian distribution.

3.3.1 Quenched LDP upper bound

Throughout this section we impose the following.

Assumption 3.3.1. Suppose {QN}∞

N=1is a sequence of random measures on a Polish space

(X,X). Assume that there exists some L >0such that for any QN-measurable set A ⊂Xwe

have

logQN(A)−

logQN(A)>t≤exp−t2

LN .

Note that this assumption will hold in the cases we are interested in due to Proposition 3.2.2.

Lemma 3.3.1. Suppose {QN}∞

N=1is a sequence of random measures on a Polish space (X,X)

and {Ar⊂X:r∈{1,...,p}}is a sequence of QN-measurable sets such that, for some absolute

constant L >0, we have

logQN(Ar)−

logQN(Ar)>t≤exp−t2

LN .(3.37)

Then we have

lim

N↑+∞

logQNp

[

r=1

Ar−max

r∈{1,...,p}

logQN(Ar)=0.(3.38)

Proof. First, (3.37) gives

max

r∈{1,...,p}logQN(Ar)−

logQN(Ar)≥t≤2pexp−t2

LN .

Since, for a,b∈

p, the following elementary inequality holds

max

rar−max

rbr≤max

r|ar−br|,

3.3 Quenched G¨

artner-Ellis type LDP 65

we get

max

r∈{1,...,p}logQN(Ar)−max

r∈{1,...,p}

logQN(Ar)≥t≤2pexp−t2

LN .

The last equation in turn implies that

max

r∈{1,...,p}logQN(Ar)−max

r∈{1,...,p}

logQN(Ar)≤2pZ+∞

exp−Nt2

Ldt,(3.39)

and the r.h.s. of the previous formula vanishes as N↑∞.

Let QN∈M(X),N∈

be a family of random measures on (X,X). Define the Laplace

transform

LN(Λ):=ZX

eNhx,ΛidQN(x).

Suppose that, for all Λ∈

d, we have

I(Λ):=lim

N↑∞

logLN(Λ)∈

∪{−∞,+∞}.(3.40)

Define the Legendre transform

I∗(x):=inf

Λ[−hx,Λi+I(Λ)].(3.41)

Define, for δ>0,

I∗

δ(x):=maxI∗(x)+δ,−1

δ.(3.42)

Lemma 3.3.2. Suppose

0∈intD(I):=int{Λ:I(Λ)<+∞}.(3.43)

Then

1. The mapping I∗(·):X→

is upper semi-continuous and concave.

2. For all M >0,

{x∈X:I∗(x)≤M}is a compact.

Proof. 1. Since, for all Λ∈D(I), the linear mappings

x7→−hΛ,xi+I(Λ)

are obviously concave, the infimum of this family is upper semi-continuous and concave.

2. See, e.g., den Hollander (2000) for the proof.

66 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

Theorem 3.3.1. Suppose that

1. The family {QN}satisfies condition (3.38).

2. Condition (3.40) is satisfied.

3. Condition (3.43) is satisfied.

Then, for any closed set V⊂

d, we have

lim

N↑∞

logQN(V)≤sup

x∈V

I∗(x).(3.44)

Proof. 1. Suppose at first that Vis a compact.

Thanks to (3.41), for any x∈X, there exists Λ(x)∈Xsuch that

−hx,Λ(x)i+I(Λ(x)) ≤I∗

δ(x).(3.45)

For any x∈X, there exists a neighbourhood A(x)⊂Xof xsuch that

sup

y∈A(x)hy−x,Λ(x)i≤δ.

By compactness, the covering Sx∈YA(x)⊃Vhas the finite subcovering, say Sp

r=1A(xr)⊃

V. Hence,

NlogQN(V)≤1

Nlog p

[

r=1

QN(A(xr))!.(3.46)

Applying condition (3.38), we get

lim

N↑∞

max

r∈{1,...,p}logQN(A(xr))−max

r∈{1,...,p}

1

NlogQN(A(xr))≤0.(3.47)

By the Chebyshev inequality,

QN(A(x)) ≤QN{y∈X:hy−x,Λ(x)i≤δ}

≤e−δNZX

eNhy−x,Λ(x)idQN(y)

=e−δNe−Nhx,Λ(x)iLN(Λ(x)).(3.48)

Hence, (3.48) together with (3.45) yields

lim

N↑+∞

logQN(A(xr))≤lim

N↑+∞−hxr,Λ(xr)i+1

NlogLN(Λ(xr))−δ

=−hxr,Λ(xr)i+I(Λ(xr))−δ

≤I∗

δ(xr)−δ.(3.49)

Combining (3.46), (3.47), (3.49), we obtain

lim

N↑+∞

logQN(V)≤max

r∈{1,...,p}I∗

δ(xr)−δ

≤sup

x∈V

I∗

δ(x)−δ.

Taking δ↓+0 limit, we get the assertion of the theorem.

3.3 Quenched G¨

artner-Ellis type LDP 67

2. Let us allow now the set Vto be unbounded. We first prove that the family QNis quenched

exponentially tight. For that purpose, let

RN(M):=1

hlogQN(X\[−M;M]d)i,

and denote

R(M):=lim

N↑+∞RN(M).

We want to prove that

lim

M↑+∞R(M) = −∞.(3.50)

Fix some u∈{1,...,d}. Suppose δu,p∈{0,1}is the standard Kronecker symbol. Let eu∈

dbe an element of the standard basis of

d, i.e., for all p∈{1,...,d}, we have

(eu)p:=δu,p.

Thanks to the Chebyshev inequality, we have

QN{xu≤−M}≤e−NM Z

de−Nhx,euidQN(x),a.s. (3.51)

Now, we get

de−Nhx,euidQN(x) = 1

LN(Λe)Z

deNhx,Λe−euidQN(x)

=LN(Λe−eu)

LN(Λe),a.s. (3.52)

Hence, combining (3.51) and (3.52), we obtain

logQN{xu≤−M}≤−M+IN(Λe−eu)−IN(Λe).(3.53)

Using the same argument, we also get

logQN{xu≥M}≤−M+IN(Λe+eu)−IN(Λe).(3.54)

We obviously have

RN(M)≤1

hlogQNd

[

u=1

({xu≤−M}∪{xu≥M})i.(3.55)

Applying condition (3.38) to (3.55), we get

lim

N↑+∞

hlogQNd

[

u=1{xu≤−M}∪{xu≥M}

68 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

−max

u∈{1,...,d}maxn

[logQN({xu≤−M})],

[logQN({xu≥M})]oi≤0.(3.56)

Applying (3.53) and (3.54) in (3.56), we get

lim

N↑+∞

hlogQNd

[

u=1

({xu≤−M}∪{xu≥M})i

≤−M−I(Λe)+ max

u∈{1,...,d}max{I(Λe−eu),I(Λe+eu)}.(3.57)

The bound (3.57) assures (3.50). Now, since we have (with the help of (3.38) and (3.44))

lim

N↑+∞

logQN(V)≤lim

N↑+∞

hlogQN((V∩[−M;M]d)∪(X\[−M;M]d))i

≤maxsup

x∈(V∩[−M;M]d)

I∗(x),R(M),(3.58)

the assertion of the theorem follows from (3.50) by taking the limM↑+∞in the bound (3.58).

3.3.2 Quenched LDP lower bound

Suppose that, for some Λ∈

dand all N∈

, we have

eNhy,ΛidQN(y)<+∞.

Let e

QN,Λ∈M(X)be the random measure defined by

QN,Λ(A) = ZA

eNhy,ΛidQN(y),(3.59)

for any QNmeasurable A⊂X.

Lemma 3.3.3. Suppose the family of random measures QNsatisfies the following assumptions.

1. Measure concentration. There exists some L >0such that, for any QN-measurable set A ⊂

X, we have

logQN(A)−

logQN(A)>t≤exp−t2

LN .

2. Tails decay condition. Let

C(M):={x∈X:kxk<M}.

There exists p ∈

such that

lim

K↑+∞lim

N↑∞Z+∞

1

Nlog e

QN,Λ(X\C(Np)) >−K+tdt=0.

3.3 Quenched G¨

artner-Ellis type LDP 69

3. Non-degeneracy. The family of the sets nBj⊂X:j∈{1,...,q}osatisfies the following

condition

there exists some j0∈{1,...,q}such that lim

N↑∞

hlog e

QN,Λ(Bj0)i>−∞.(3.60)

Then, for any Λ∈

d, we have

lim

N↑∞

log e

QN,Λq

[

j=1

Bj−max

j∈{1,...,q}

hlog e

QN,Λ(Bj)i≤0.(3.61)

Remark 3.3.1. The polynomial growth choice of M =MN:=Npmade in assumption (2) of

the above lemma corresponds certainly to the concrete form of the exponential concentration

in assumption (1). Both assumptions together with the proof can be adapted to the weaker

concentration bounds.

Proof. We fix some j∈ {1,...,q}. Take an arbitrary ε>0, M>0 and denote JM,ε:=

∩

[−kΛkM/ε;kΛkM/ε]. Consider, for i∈JM,ε, the following closed sets

Ai,j:={x∈Bj:(j−1)ε≤hΛ,xi≤ jε}.

We get

Nlog e

QN,Λq

[

j=1

Bj≤1

Nlog e

QN,Λq

[

j=1

Bj∩C(M)∪(X\C(M))

≤1

Nmaxnmax

j∈{1,...,q}log e

QN,Λ(Bj∩C(M)),

log e

QN,Λ(X\C(M))o+log(q+1)

N.(3.62)

We have

Nlog e

QN,Λ(Bj∩C(M)) ≤1

Nlog ∑

i∈JM,ε

eNiεQN(Ai,j)

≤max

i∈{1,...,p}iε+1

NlogQN(Ai,j)+log(cardJM,ε)

N.(3.63)

Denote

αN(ε):=max

j∈{1,...,q}max

i∈JM,εiε+1

NlogQN(Ai,j),

and

βN:=max

j∈{1,...,q}

hlog e

QN,Λ(Bj)i,

βN(ε):=max

j∈{1,...,q}

"max

i∈JM,εiε+1

NlogQN(Ai,j)#,

70 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

γN(M):=1

Nlog e

QN,Λ(X\C(M)).

We also have

Nlog e

QN,Λ(Bj)≥1

Nlog e

QN,Λ(Bj∩C(M))

≥max

i∈JM,ε(i−1)ε+1

NlogQN(Ai,j)

=max

i∈JM,εiε+1

NlogQN(Ai,j)−ε.(3.64)

Due to condition (1), we have

nαN(ε)−e

βN(ε)>to≤cardJM,εqexp−Nt2

L.(3.65)

We put M:=MN:=Np, and we get

cardJM,ε≤2kΛkM/ε+1

≤2kΛkNp/ε+1.(3.66)

Let

XN(M,ε):=max{γN(M),αN(ε)}−βN,

then we have

{XN(K,ε)>t}≤

{γN(M)>βN+t}+

{αN(ε)>βN+t}.(3.67)

Due to property (3.60), there exists K>0 such that we have

{γN(M)>βN+t}≤

{γN(M)>−K+t}.(3.68)

Thanks to (3.64), we have

{αN(ε)>βN+t}≤

{αN(ε)>e

βN(ε)+t−ε}.(3.69)

For t>ε, we apply (3.65) and (3.66) to (3.69) to obtain

{αN(ε)>βN+t}≤(2kΛkNp/ε+1)qexp−Nt2

L.(3.70)

Combining (3.62) and (3.63), we get

log e

QN,Λq

[

j=1

Bj−max

j∈{1,...,q}

hlog e

QN,Λ(Bj)i

≤

XN(M,ε)+log(q+1)

N+log(2kΛkNp/ε+1)

N.(3.71)

3.3 Quenched G¨

artner-Ellis type LDP 71

Now, (3.67), (3.68) and (3.70) imply

XN(M,ε)≤Z+∞

XN(M,ε)>tdt

≤Z+∞

XN(M,ε)>tdt+ε

≤Z+∞

{γN(M)>−K+t}dt

+(2kΛkNp/ε+1)qZ+∞

εexp−Nt2

Ldt+ε.(3.72)

Therefore, taking sequentially limN↑+∞, limK↑+∞and limε↑+0in (3.72), we arrive at

lim

N↑∞

XN(M,ε)≤0.(3.73)

The bound (3.73) together with (3.71) implies the assertion of the lemma.

Let ˆ

QN,Λbe the (random) probability measure defined by

QN,Λ:=e

LN(Λ).

Lemma 3.3.4. Suppose that the measure QNsatisfies the assumptions of the previous lemma.

Then (3.61) is valid also for ˆ

QN,Λ.

Proof. Similar to the one of the previous lemma. ut

Remark 3.3.2. Recall that a point x ∈Xis called an exposed point of the concave mapping I∗

if there exists Λ∈

dsuch that, for all y ∈X\{x}, we have

I∗(y)−I∗(x)<hy−x,Λi.(3.74)

Theorem 3.3.2. Suppose

1. The family QN:N∈

⊂M(

d)satisfies the assumptions of Lemma 3.3.3.

2. G⊂Xis an open set.

3. /0 6=E(I∗)⊂D(I∗)is the set of the exposed points of the mapping I∗.

4. Condition (3.43) is satisfied.

Then

lim

N↑+∞

logQN(G∩E)≥sup

x∈G

I∗(x).(3.75)

Proof. Let B(x,ε)be a ball of radius ε>0 around some arbitrary x∈X. It suffices to prove

that

lim

ε↓+0lim

N↑∞

logQN(B(x,ε))≥I∗(x).(3.76)

72 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

Indeed, since we have

QN(G)≥QN(B(x,ε)),(3.77)

applying 1

Nlog(·), taking the expectation, taking limN↑+∞,ε↓+0 and taking the supremum over

x∈Gin (3.77), we get (3.75).

Take any x∈G∩E. Then we can find the corresponding vector Λe=Λe(x)∈

dorthog-

onal to the exposing hyperplane at the point x, as in (3.74). Define the new (“tilted”) random

probability measure ˆ

QNon

dby demanding that

dˆ

dQN

(y) = 1

LN(Λe)eNhy,Λei.(3.78)

Moreover, we have

logQN(B(x,ε))=1

logZB(x,ε)

dQN(y)

logLN(Λe)+1

ZB(x,ε)

e−Nhy,Λeidˆ

QN(y)

≥1

logLN(Λe)−hx,Λei−εkΛek2+1

log ˆ

QN(B(x,ε)).

Hence,

lim

ε↓+0lim

N↑∞

logQN(B(x,ε))≥[−hx,Λei+I(Λe)]+ lim

ε↓+0lim

N↑∞

log ˆ

QN(B(x,ε)).

Since we have

−hx,Λei+I(Λe)≥I∗(x),

in order to show (3.76) it remains to prove that

lim

ε↓+0lim

N↑∞

log ˆ

QN(B(x,ε))=0.(3.79)

The Laplace transform of ˆ

QNis

LN(Λ) = LN(Λ+Λe)

LN(Λe).

Hence, we arrive at

I(Λ) = I(Λ+Λe)−I(Λe).

Moreover, we have

I∗(x) = I∗(x)+hx,Λei−I(Λe).(3.80)

3.3 Quenched G¨

artner-Ellis type LDP 73

By the assumptions of the theorem, the family QNsatisfies the assumptions of Lemma 3.3.3.

Hence, due to Lemma 3.3.4, the family ˆ

QNsatisfies (3.38). Thus we can apply Theorem 3.3.1

to obtain

lim

N↑+∞

hlog ˆ

QN(

d\B(U,ε))i≤sup

y∈U\B(x,ε)

I∗(y).(3.81)

Lemma 3.3.2 implies that there exists some x0∈X\B(x,ε)(note that x06=x) such that

sup

y∈X\B(x,ε)

I∗(y) = ˆ

I∗(x0).

Since Λeis an exposing hyperplane, using (3.80), we get

I∗(x0) = I∗(x0)+hx0,Λei−I(Λe)

≤[I∗(x0)+hx0,Λei]−[I∗(x)+hx,Λei]<0,(3.82)

and hence, combining (3.81) and (3.82), we get

lim

N↑+∞

hlog ˆ

QN(

d\B(x,ε))i<0.

Therefore, due to the concentration of measure, we have almost surely

lim

N↑+∞

Nlog ˆ

QN(

d\B(x,ε)) <0

which implies that, for all ε>0, we have almost surely

lim

N↑+∞

QN(

d\B(x,ε)) = 0,

and (3.79) follows by yet another application of the concentration of measure.

Corollary 3.3.1. Suppose that in addition to the assumptions of previous Theorem 3.3.2 we

have

1. I(·)is differentiable on intD(I).

2. Either D(I) = Xor

lim

Λ→∂D(I)k∇I(Λ)k= +∞.

Then E(I∗) =

d, consequently

lim

N↑+∞

logQN(G)≥sup

x∈G

I∗(x).

Proof. The proof is the same as in the classical G¨

artner-Ellis theorem (see, e.g., den Hollander

(2000)).

74 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

3.4 The Aizenman-Sims-Starr comparison scheme

In this section, we shall extend the AS2scheme to the case of the SK model with multidi-

mensional spins and prove the Theorems 3.1.1 and 3.1.2, as stated in Section 3.1. We use the

Gaussian comparison results of Section 3.2.3 in the spirit of AS2scheme in order to relate

the free energy of the SK model with multidimensional spins with the free energy of a certain

GREM-inspired model. Comparing to Aizenman et al. (2003), due to more intricate nature of

spin configuration space, some new effects occur. In particular, the remainder term of the Gaus-

sian comparison non-trivially depends on the variances and covariances of the Hamiltonians

under comparison. To deal with this obstacle, we use the idea of localisation to the configura-

tions having a given overlap (cf. (3.4)). This idea is formalised by adapting the proofs of the

quenched G¨

artner-Ellis type LDP obtained in Section 3.3.

3.4.1 Naive comparison scheme

We start by recalling the basic principles of the AS2comparison scheme (see, e.g., (Bovier,

2006, Chapter 11)). It is a simple idea to get the comparison inequalities by adding some ad-

ditional structure into the model. However, the way the additional structure is attached to the

model might be suggested by the model itself. Later on we shall encounter a real-world use of

this trick. Let (Σ,S)and (A,A)be Polish spaces equipped with measures µand ξ, respec-

tively. Furthermore, let

X:={X(σ)}σ∈Σ,A:={A(σ,α)}σ∈Σ,

α∈A

,B:={B(σ)}α∈A

be independent real-valued Gaussian processes. Define the comparison functional

Φ[C]:=

logZΣ×A

eC(σ,α)d(µ⊗ξ)(σ,α),(3.83)

where C:={C(σ,α)}σ∈Σ

α∈A

is a suitable real-valued Gaussian process. Theorem 4.1 of Aizen-

man et al. (2007) is easily understood as an example of the following observation. Suppose

Φ[X]is somehow hard to compute directly, but Φ[A]and Φ[B]are manageable. We always

have the following additivity property

Φ[X+B] = Φ[X]+Φ[B].(3.84)

Assume now that

Φ[X+B]≤Φ[A](3.85)

which we can obtain, e.g., from Proposition 3.2.5. Combining (3.84) and (3.85), we get the

bound

Φ[X]≤Φ[A]−Φ[B].(3.86)

3.4.2 Free energy upper bound

Let V⊂Sym(d)be an arbitrary Borell set.

3.4 The Aizenman-Sims-Starr comparison scheme 75

Remark 3.4.1. Note that Uis closed and convex.

Let

ΣN(V):=σ∈ΣN:RN(σ,σ)∈V

=σ∈ΣN:RN(σ,σ)∈V∩U.(3.87)

Let us define the local comparison functional ΦN(x,V)as follows (cf. (3.83))

ΦN(x,V)[C]:=1

hlogπNh

ΣN(V)expβ√NCii,(3.88)

where C:={C(σ,α)}σ∈Σ

α∈A

is a suitable Gaussian process. Let us consider the following family

(N∈

) of random measures on the Borell subsets of Sym(d)generated by the SK Hamiltonian,

PN(V):=ZΣN(V)

eβ√NXN(σ)dµ⊗N(σ),

and consider also the following family of the random measures generated by the Hamiltonian

A(σ,α)

PN(V):=e

Px,Q,U

N(V):=ZΣN(V)×A

expβ√N

∑

i=1hAi(α),σiidπN(σ,α),(3.89)

where the parameters Qand Uare taken from the definition of the process A(α)(cf. (3.6)).

The vector xdefines the random measure ξ∈M(A)(cf. (3.7)), and, hence, also the measure

πN∈M(Σ×A).

Remark 3.4.2. To lighten the notation, most of the time we shall not indicate explicitly the

dependence of the following quantities on the parameters x, Q, U.

Consider (if it exists) the Laplace transform of the measure (3.89)

LN(Λ):=ZU

eNhU,Λide

PN(U).(3.90)

Let (if it exists)

I(Λ):=lim

N↑∞

hloge

LN(Λ)i.(3.91)

Define the following Legendre transform

I∗(U):=inf

x∈Q0(1,1),

Q∈Q0(U,d),

Λ∈Sym(d)

h−hU,Λi−ΦN(x,V)[B]+ e

I(Λ)i.(3.92)

Denote, for δ>0,

I∗

δ(U):=maxe

I∗(U)+δ,−1

δ.

Let

p(V):=lim

N↑+∞

logPN(V).(3.93)

76 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

Remark 3.4.3. Note that the result of Guerra & Toninelli (2003) assures the existence of the

limit in the previous formula.

Lemma 3.4.1. We have

1. The Laplace transform (3.90) exists. Moreover, for any Λ∈Sym(d), we have

eNhU,ΛidPN(U)

=ZΣN(V)

expNhΛ,RN(σ,σ)i+β√NX(σ)dµ⊗N(σ),(3.94)

eNhU,Λide

PN(U)

=ZΣN(V)×A

expNhΛ,RN(σ,σ)i+β√N

∑

i=1hAi(α),σiidπN(σ,α).(3.95)

2. The quenched cumulant generating function (3.91) exists in the N ↑∞limit, for any Λ∈

Sym(d). Moreover, for all N ∈

, we have

IN(Λ):=1

logLN(Λ)=X0(x,Q,Λ,U),(3.96)

that is IN(·)in fact does not depend on N.

Proof. 1. We prove (3.95), the proof of (3.94) is similar. Since Uis a compact, it follows that,

for arbitrary ε>0, there exists the following ε-partition of U

N(ε) = {Vr⊂U:r∈{1,...,K}}

such that SrVr=U,Vr∩Vs=/0, diamVr≤εand pick some Vr∈intVr, for all r6=s.

We denote

LN(Λ,ε):=

∑

r=1

eNhΛ,VriZΣN(Vr)×A

expβ√N

∑

i=1hAi(α),σiidπN(σ,α).

For small enough ε, we have

(1−2NkΛkε)eNhΛ,RN(σ,σ)i≤eNhΛ,Ui≤eNhΛ,RN(σ,σ)i(1+2NkΛkε).

Therefore, if we denote

LN(V,Λ):=ZΣN(V)×A

expNhΛ,RN(σ,σ)i+β√N

∑

i=1hAi(α),σiidπN(σ,α),

we get

(1−2NkΛkε)

∑

r=1b

LN(Vr,Λ)≤e

LN(Λ,ε)≤(1+2NkΛkε)

∑

r=1b

LN(Vr,Λ).

Hence,

(1−2NkΛkε)b

LN(U,Λ)≤e

LN(Λ,ε)≤(1+2NkΛkε)b

LN(U,Λ).(3.97)

Let ε↓+0 in (3.97) and we arrive at

LN(Λ) = b

LN(U,Λ).

That is, the existence of LN(Λ)and the representation (3.95) are proved.

3.4 The Aizenman-Sims-Starr comparison scheme 77

2. For all N∈

, we have, by the RPC averaging property (see, e.g., (Aizenman et al., 2007,

Theorem 5.4) or Theorem 4.1.3, property (4) below), that

hloge

LN(U,Λ)i=ΦN(x,U)A+NhΛ,RN(σ,σ)i=X0(x,Q,Λ,U).

Proof of Theorem 3.1.1. In essence, the proof follows almost literally the proof of Theorem

3.3.1. The notable difference is that we apply the Gaussian comparison inequality (Proposi-

tion 3.2.5) in order to “compute” the rate function in a somewhat more explicit way.

Due to (3.87), we can without loss of generality suppose that Vis compact. For any δ>

0 and U∈V, by (3.92), there exists Λ(U,δ)∈Sym(d),x(U,δ)∈Q0(1,1)and Q(U,δ)∈

Q0(U,d)such that

−hU,Λ(U)i+e

I(Λ(U)) ≤e

I∗

δ(U).(3.98)

For any U∈V, there exists an open neighbourhood V(U)⊂Sym(d)of Usuch that

sup

V∈V(U)hV−U,Λ(U)i≤δ.

Fix some ε>0. Without loss of generality, we can suppose that all the neighbourhoods satisfy

additionally the condition diamV(U)≤ε. By compactness, the covering SU∈VV(U)⊃V

has a finite subcovering, say Sp

r=1V(U(r))⊃V. We denote the corresponding to this covering

approximants in (3.98) by {x(r)∈Q0(1,1)}p

r=1and {Q(r)∈Q0(U(r),d)}p

r=1. We have

NlogPN(V)≤1

Nlogp

[

r=1

PN(V(U(r))).(3.99)

Due to the concentration of measure Proposition 3.2.3, we can apply Lemma 3.3.1 and get

lim

N↑+∞

logPNp

[

r=1

V(U(r))−max

r∈{1,...,p}

hlogPN(V(U(r)))i=0.(3.100)

In fact, since we know that (3.93) exists, (3.100) implies that

lim

N↑+∞

hlogPNp

[

r=1

V(U(r))i=max

r∈{1,...,p}lim

N↑+∞

hlogPN(V(U(r)))i.(3.101)

For U(r),x=x(r),Q=Q(r), Proposition 3.2.5 gives

hlogPN(V(U(r)))i=1

hlog e

PN(V(U(r)))i−ΦN(x,U)[B]

+RN(x(r),Q(r),U(r),V(U(r)))+O(ε)

≤1

hlog e

PN(V(U(r)))i−ΦN(x,U)[B]+Kε,(3.102)

where K>0 is an absolute constant.

By the Chebyshev inequality and Lemma 3.4.1, we have

78 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

PN(V(U)) ≤e

PN{V∈U:hV−U,Λ(U)i≤δ}

≤e−δNZU

eNhV−U,Λ(U)ide

PN(V)

=e−δNe−NhU,Λ(U)ie

LN(Λ(U)).

Thus, using (3.102) and (3.98), we get

lim

N↑+∞

hlogPN(V(U(r)))i≤lim

N↑+∞−hU(r),Λ(U(r))i−Φ[B]+ 1

Nloge

LN(Λ(U(r)))−δ+Kε

=−hUr,Λ(Ur)i−Φ[B] + e

I(Λ(Ur))−δ+Kε

≤e

I∗

δ(Ur)−δ+Kε.(3.103)

Combining (3.99), (3.47), (3.103), we obtain

p(V) = lim

N↑+∞

logPN(V)≤max

r∈{1,...,p}lim

N↑+∞

hlogPN(V(U(r)))i

≤max

r∈{1,...,p}e

I∗

δ(U(r))+Kε−δ

≤sup

U∈Ve

I∗

δ(V)+Kε−δ.

Taking δ↓+0 and ε↓+0 limits, we get

p(V)≤sup

V∈Ve

I∗(U).(3.104)

The averaging property of the RPC (see, e.g., (Aizenman et al., 2007, Theorem 5.4) or property

(4) of Theorem 4.1.3) gives

ΦN(x,U)[B] = β2

∑

k=1

xkkQ(k+1)k2

F−kQ(k)k2

F.(3.105)

To finish the proof it remains to show that, for any fixed Λ∈Sym(d), we have

I(Λ) = X0(x,Q,Λ,U)

which is assured by Lemma 3.4.1. ut

3.4.3 Free energy lower bound

In this subsection, we return to the notations of Section 3.4.2.

Lemma 3.4.2. For any B⊂Sym(d)such that intB∩intU6=/0 there exists ∆⊂Σwith int∆6=

/0 such that

lim

N↑∞

ZΣN(B)×A

expNhΛ,RN(σ,σ)i+

∑

i=1hAi(α),σiidπN(σ,α)

≥logZ∆exph(β2U+Λ)σ,σidµ(σ)>−∞.(3.106)

3.4 The Aizenman-Sims-Starr comparison scheme 79

Proof. In view of (3.9), iterative application of the Jensen inequality with respect to

z(k)leads

to the following

Xn+1(x,Q,Λ,U)≤X0(x,Q,Λ,U).

Performing the Gaussian integration, we get

Xn+1(x,Q,Λ,U)≥logZ∆exph(β2U+Λ)σ,σidµ(σ),

where ∆⊂Σis such that µ(∆)>0 and {R(σ,σ):σ∈∆N}⊂B.ut

Define the following Legendre transform

I∗(U):=inf

x∈Q0(1,1),

Q∈Q0(U,d),

Λ∈Sym(d)

h−hU,Λi−Φ[B]+ e

I(Λ)+R(x,Q,U)i.(3.107)

Proof of Theorem 3.1.2. As it is the case with the proof of Theorem 3.1.1, this proof also fol-

lows in essence almost literally the proof of Theorem 3.3.2. The notable difference is that we

apply the Gaussian comparison in order to “compute” the rate function in a somewhat more

explicit way.

In notations of Theorem 3.3.2, we are in the following situation: X:=Sym(d)and Xis the

topology induced by any norm on Sym(d).

Let B(U,ε)be the ball (in the Hilbert-Schmidt norm) of radius ε>0 around some arbitrary

U∈V. Let us prove at first that

lim

ε↓+0lim

N↑∞

logPN(B(U,ε))≥b

I∗(U).(3.108)

Similarly to (3.102), for any (x,Q), we have

1

NlogPN(B(U,ε))

hlog e

PN(B(U,ε))i−Φ[B]+RN(x,Q,U,B(U,ε))+O(ε).(3.109)

The random measure e

PNsatisfies the assumptions of Corollary 3.3.1. Indeed:

1. Due to representation (3.96), mapping I(·)is differentiable with respect to Λ. Henceforth

assumption (1) of the corollary is also fulfilled.

2. Let us note at first that, thanks to Proposition 3.2.3, we have D(I) =

d. Thus, the assump-

tion (2) of Corollary 3.3.1 is satisfied, as is condition (3.43).

Moreover, the assumptions of Lemma 3.3.3 are satisfied:

1. The concentration of measure condition is satisfied due to Proposition 3.2.3.

2. The tail decay is obvious since the family {e

PN:N∈

}has compact support. Namely, for

all N∈

, we have supp e

PN=U. Thus the measure e

QN,Λ(cf. (3.59)) generated by e

PNhas

the same support. Thus, supp e

QN,Λ=U.

80 3 The Aizenman-Sims-Starr scheme for the SK model with multidimensional spins

3. The non-degeneracy is assured by Lemma 3.4.2.

Hence, due to (3.109), arguing in the same way as in Theorem 3.3.2, we arrive at (3.108). Note

that the N↑+∞limit of RN(x,Q,U,B(U,ε)) exists, since in (3.109) the limits of the other two

N-dependent quantities exist due to Guerra & Toninelli (2003). The subsequent ε↓+0 limit of

the remainder term exists due to the monotonicity.

Finally, taking the supremum over U∈Vin (3.108), we get (3.20). ut

Estimates of Guerra’s remainder term in the SK model with

multidimensional spins

In this chapter, we extend the comparison scheme of Guerra (2003) to the case of the SK model

with multidimensional spins. We provide a short derivation of the remainder term which is

a by-product of this scheme (Theorem 4.1.4). We use the well-known properties of the RPC

and Bolthausen-Sznitman coalescent. This gives the remainder term a clear meaning in terms

of the averages with respect to the disorder induced by a change of measure. The change of

measure is induced by the reweighting of the RPC by means of the GREM-inspired process (cf.

Theorem 2.2.7).

We study the properties of the multidimensional Parisi functional by establishing a link

between the (multidimensional) Parisi functional and a certain class of non-linear PDEs, see

Propositions 4.2.1, 4.2.2 and Theorem 4.2.2. We extend the Parisi functional to a continuous

functional on a compact space (Theorems 4.2.1, 4.2.2). We show that the class of PDEs cor-

responds to the Hamilton-Jacobi-Bellman equations induced by a linear problem of diffusion

control (Proposition 4.2.4). Motivated by a problem posed by Talagrand (2006c), we show the

strict convexity of the local Parisi functional in some cases (Theorem 4.2.4).

We partially extend Talagrand’s methodology of estimating the remainder term to the multi-

dimensional setting (Proposition 4.3.1, Theorem 4.3.1).

4.1 Guerra’s comparison scheme

For the SK model, Guerra’s scheme gave historically the first way to obtain the variational

upper bound on the free energy in terms of the Parisi functional. The scheme is based on the

comparison between two Gaussian processes: the first one being the original SK Hamiltonian

(0.8) and the other one being a carefully chosen GREM inspired process indexed by σ∈ΣN. The

second important ingredient is a recursively defined non-linear comparison functional acting on

the Gaussian processes indexed by σ∈ΣN.

Talagrand (2006b) using Guerra’s scheme and the wealth of other ingenious analytical in-

sights showed that the variational upper bound is also the lower bound for the free energy in the

SK model. This established, hence, the remaining half of the Parisi formula.

In this section, we shall apply Guerra’s comparison scheme (see the recent accounts by

Aizenman et al. (2007); Guerra (2005); Talagrand (2007a)) to the SK model with multidimen-

sional spins. However, we shall use also the ideas (and the language) of Aizenman et al. (2003).

In particular, we shall use the same local comparison functional (3.88) as in the AS2scheme,

see (4.2). The section contains the proofs of the upper (4.14) and lower (4.16) bounds on the free

energy without Assumption 3.1.2. The proofs use the GREM-like Gaussian processes, RPCs as

82 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

in the AS2scheme. We also obtain an analytic representation of the remainder term (which is

an artifact of this scheme) using the properties of the Bolthausen-Sznitman coalescent.

4.1.1 Multidimensional Guerra’s scheme

Given n∈

, assume that xand Qsatisfy (3.7) and (3.6), respectively. Recall the definitions of

the Gaussian processes Xand Awhich satisfy (0.8) and (3.15), respectively. We consider, for

t∈[0;1], the following interpolating Hamiltonian on the configuration space ΣN×A

Ht(σ,α):=√tX(σ)+ √1−tA(σ,α).(4.1)

Given U⊂Sym+(d), the Hamiltonian (4.1) in the usual way induces the following local free

energy

ϕN(t,x,Q,U):=ΦN(x,U)[Ht],(4.2)

where we use the same local comparison functional (3.88) as in the AS2scheme. Using (3.4),

we obtain then

ϕ(0,x,Q,U) = ΦN(x,U)[A]and ϕ(1,x,Q,U) = ΦN(x,U)[X] = pN(U).

Now, we are going to disintegrate the Gibbs measure defined on U×Ainto two Gibbs mea-

sures acting on Uand Aseparately. For this purpose we define the correspondent (random)

local free energy on Uas follows

ψ(t,x,Q,α,U):=logZΣN(U)

exphβ√NHt(σ,α)idµ⊗N(σ).(4.3)

For α∈A, we can define the (random) local Gibbs measure G(t,Q,α,U)∈M1(ΣN)by

demanding that the following holds

dG(t,x,Q,α,U)

dµ⊗N(σ):=

ΣN(U)(σ)exphβ√NHt(σ,α)−ψ(t,x,Q,U,α)i.

Let us define a certain reweighting of the RPC ξwith the help of (4.3). We define the random

point process {˜

ξ}α∈Ain the following way

ξ(α):=ξ(α)exp(ψ(t,x,Q,U,α)).

We also define the normalisation operation N:Mf(A)→M1(A)as

N(ξ)(α):=ξ(α)

∑α0∈Aξ(α0).

We introduce the local Gibbs measure G(t,x,Q,U)∈M1(U×A), for any V⊂U×A, as

follows

G(t,x,Q,U)[V]:=∑

α∈An

N(˜

ξ)(α)G(t,x,Q,α,U)[V].(4.4)

Finally, we introduce, what shall call Guerra’s remainder term:

R(t,x,Q,U):=−β2

G(t,x,Q,U)⊗G(t,x,Q,U)kR(σ1,σ2)−Q(α1,α2)k2

F.(4.5)

Note that (4.5) coincides with (3.18) after substituting (3.16) with (3.21).

4.1 Guerra’s comparison scheme 83

4.1.2 Local comparison

The results of Section 3.4 can be straightforwardly generalised to the comparison scheme based

on (4.1). Given ε,δ>0 and Λ∈Sym(d), define

V(Λ,U,ε,δ):={U0∈Sym(d):kU0−UkF<ε,hU0−U,Λi<δ}.(4.6)

We now specialise to the case U=ΣN(V(Λ,U,ε,δ)).

Lemma 4.1.1. We have

∂

∂tϕN(t,x,Q,V(Λ,U,ε,δ)) =R(t,x,Q,ΣN(A(Λ,U,ε,δ)))

−β2

∑

k=1

xkkQ(k+1)k2

F−kQ(k)k2

F+O(ε).(4.7)

Proof. This is an immediate consequence of Proposition 3.2.5. Indeed, recalling that Q(α1,α1) =

U, and setting U:=Σ(B(U,ε)), we have

∂

∂tϕ(t,x,Q,U)

=β2

G(t,x,Q,U)⊗G(t,x,Q,U)kR(σ1,σ1)−Uk2

F−kR(σ1,σ2)−Q(α1,α2)k2

−kUk2

F−kQ(α1,α2)k2

F

=−β2

G(t,x,Q,U)⊗G(t,x,Q,U)kR(σ1,σ2)−Q(α1,α2)k2

F

−β2

G(t,x,Q,U)⊗G(t,x,Q,U)kUk2

F−kQ(α1,α2)k2

F+O(ε).(4.8)

Using the Proposition 2.2.1, we get

β2

G(t,x,Q,U)⊗G(t,x,Q,U)kUk2

F−kQ(α1,α2)k2

F

=β2



N(ξ)⊗N(ξ)

n

∑

k=qL(α(1),α(2))kQ(k+1)k2

F−kQ(k)k2

F





=β2

∑

k=1kQ(k+1)k2

F−kQ(k)k2

F

hN(ξ)⊗N(ξ){k≥qL(α(1),α(2))}i

=β2

∑

k=1

xkkQ(k+1)k2

F−kQ(k)k2

F.(4.9)

Combining (4.8) and (4.9), we get (4.7)

Lemma 4.1.2. We have

pN(ΣN(B(U,ε))) =ΦN(x,ΣN(B(U,ε)))[A]−β2

∑

k=1

xkkQ(k+1)k2

F−kQ(k)k2

F

+Z1

R(t,x,Q,ΣN(B(U,ε))dt+O(ε).(4.10)

84 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

Remark 4.1.1. Note that the above lemma also holds if we substitute B(U,ε)with the smaller

set V(Λ,U,ε,δ).

Proof. The claim follows from (4.7) by integration. ut

Proposition 4.1.1. There exists C =C(Σ,µ)>0such that, for all U ∈Sym+(d)as above, and

all ε,δ>0, there exists an δ-minimal Lagrange multiplier Λ=Λ(U,ε,δ)∈Sym(d)in (3.11)

such that, for all t ∈[0;1], and all (x,Q), we have

pN(ΣN(V(Λ,U,ε,δ))) ≤inf

Λ∈Sym(d)f(x,Q,U,Λ)+C(ε+δ)(4.11)

and

lim

N↑+∞pN(ΣN(B(U,ε))) ≥inf

Λ∈Sym(d)f(x,Q,U,Λ)+ lim

N↑+∞Z1

R(t,x,Q,ΣN(B(U,ε)))dt

−C(ε+δ).(4.12)

Remark 4.1.2. The following upper bound also holds true. There exists C =C(Σ,µ)>0, such

that, for any Λ∈Sym(d),

pN(ΣN(B(U,ε))) ≤f(x,Q,U,Λ)+CkΛkFε.(4.13)

Proof. The result follows from Lemma 4.1.2 by the same arguments as in the proofs of Theo-

rems 3.1.1 and 3.1.2. ut

4.1.3 Free energy upper and lower bounds

Similarly to the quenched LDP bounds for the AS2scheme in the SK model with multidimen-

sional spins (see Section 3.3), we get the quenched LDP bounds for Guerra’s scheme in the

same model without Assumption 3.1.2 on Q.

Recall the definition of the local Parisi functional f(3.11).

Theorem 4.1.1. For any closed set V⊂Sym(d), we have

p(V)≤sup

U∈V∩U

inf

(x,Q,Λ)f(x,Q,Λ,U),(4.14)

where the infimum runs over all x satisfying (3.7), all Qsatisfying (3.6) and all Λ∈Sym(d).

Proof. The proof is identical to the one of Theorem 3.1.1. ut

Define the local limiting Guerra remainder term R(x,Q,U)as follows

R(x,Q,U):=−lim

ε↓+0lim

N↑+∞Z1

R(t,ΣN(B(U,ε)))dt≤0.(4.15)

The existence of the limits in (4.15) is proved similar to the case of the AS2scheme, see the

proof of Theorem 3.1.2.

4.1 Guerra’s comparison scheme 85

Theorem 4.1.2. For any open set V⊂Sym(d), we have

p(V)≥sup

U∈V∩U

inf

(x,Q,Λ)[f(x,Q,Λ,U)+R(x,Q,U)],(4.16)

where the infimum runs over all x satisfying (3.7); all Qsatisfying (3.6) and all Λ∈Sym(d).

Proof. The proof is identical to the one of Theorem 3.1.2. The only new ingredient is Lemma 4.1.1

needed to recover Guerra’s remainder term (4.5). ut

4.1.4 The filtered d-dimensional GREM

Given U∈Sym+(d)non-negative definite, denote by Q(U,d)the set of all c`

adl`

ag (right con-

tinuous with left limits) Sym+(d)-valued non-decreasing paths which end in matrix U, i.e.,

Q(U,d):={ρ:[0;1]→Sym+(d)|ρ(0) = 0;ρ(1) = U;ρ(t)ρ(s),for t≤s;ρis c´

adl´

ag}.

(4.17)

Definition (4.17) is a multidimensional generalisation of (1.26). Define the natural inverse ρ−1:

Imρ→[0;1]as

ρ−1(Q):=inf{t∈[0;1]|ρ(t)Q},

where Imρ:=ρ([0;1]). Let x:=ρ−1◦ρ∈Q(1,1).

Let also Q0(U,d)⊂Q(U,d)be the space of all piece-wise constant paths in Q(U,d)with

finite (but arbitrary) number of jumps with an additional requirement that they have a jump at

x=1. Given some ρ∈Q0(U,d), we enumerate its jumps and define the finite collection of

matrices {Q(k)}n+1

k=0:=Imρ⊂

d. This implies that there exist {xk}n+1

k=0⊂

such that

0=:x0<x1< ... < xn<xn+1:=1,

0=:Q(0)Q(1)Q(2)···Q(n+1):=U,

where ρ(xk) = Q(k). Let us associate to ρ∈Q0(U,d)a new path ˜

ρ∈Q(U,d)which is obtained

by the linear interpolation of the path ρ. Namely, let

ρ(t):=Q(k)+(Q(k+1)−Q(k))t−xk

xk+1−xk

,t∈[xk;xk+1).

Let g:

d→

be a function satisfying Assumption 4.1.1. Let us introduce the filtered d-

dimensional GREM process W. Let

W:=n{Wk(t,[α]k)}t∈

+:α∈A,k∈[0;n]∩

be the collection of independent (for different αand k)

d-valued correlated Brownian motions

satisfying

Wk(t,[α]k)∼(Q(k+1)−Q(k))1/2Wt−xk

xk+1−xk,

86 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

where {W(t)}t∈

+is the standard (uncorrelated)

d-valued Brownian motion. Now, for k∈

[0;n]∩

, we define the

d-valued process {Y(t,α)|α∈A,t∈[0;1]}by

Y(t,α):=

∑

k=0

[xk;1](t)Wk(t∧xk+1,[α]k).

Lemma 4.1.3. For α(1),α(2)∈A, we have

CovhY(t1,α(1)),Y(t2,α(2))i=˜

ρt1∧t2∧xqL(α(1),α(2)).

Proof. The proof is straightforward. ut

Assumption 4.1.1. Suppose that the function g :

d→

satisfies g ∈C(2)(

d)and, for any

c>0, we have R

dexpg(y)−ckyk2

2dy<∞and also

sup

y∈

dk∇g(y)k2+k∇2g(y)k2<+∞,(4.18)

where ∇2g(y)denotes the matrix of second derivatives of the function g at y ∈

Assume gsatisfies the above assumption. Let f:=fρ:[0;1]×

d→

be the function satis-

fying the following (backward) recursive definition

f(t,y):=(g(y),t=1,

xklog

expxkf(xk+1,y+Y(xk+1,α)−Y(t,α)),t∈[xk;xk+1),(4.19)

where k∈[0;n]∩

,α∈Ais arbitrary and fixed.

Remark 4.1.3. It is easy to recognise that the definition of f is a continuous “algorithmisation”

of (3.10). Namely, Xk(x,Q,Λ,U) = f(xk,0), where

f(1,y) = g(y):=logZΣexp√2βhy,σi+hΛσ,σidµ(σ).(4.20)

4.1.5 A computation of the remainder term

Recall the equivalence relation (2.27). In words, the equivalence i∼

kjmeans that the atoms of

the RPC ξwith ranks iand jhave the same ancestors up to the k-th generation. Varying the kin

(2.27), we get a family of equivalences on

which possesses important Markovian properties,

see Bolthausen & Sznitman (1998).

Lemma 4.1.4. For all k ∈[0;n−1]∩

, we have





∑

i∼

i

k+1j

N(ξ)(i)N(ξ)(j)



=xk+1−xk,(4.21)

and also

h∑

N(ξ)(i)2i=1−xn.(4.22)

4.1 Guerra’s comparison scheme 87

Proof. 1. To prove (4.21) we notice that





∑

i∼

i

k+1j

N(ξ)(i)N(ξ)(j)



=





∑

i

k+1j

N(ξ)(i)N(ξ)(j)−∑

i

N(ξ)(i)N(ξ)(j)





=xk+1−xk,

where the last equality is due to Proposition 2.2.1.

2. Similarly, (4.22) follows from the following observation

"∑

N2(ξ)(i)#=



∑

i,j

N(ξ)(i)N(ξ)(j)−∑

i

N(ξ)(i)N(ξ)(j)



=1−xn,

where the last equality is due to Proposition 2.2.1.

Note that, using the above notations, we readily have

A(σ,α)∼2

N1/2N

∑

i=1hY(i)(1,α),σii,

where {Y(i):={Y(i)(1,α)}α∈A}N

i=1are i.i.d. copies of {Y(1,α)}α∈A. Consider the following

weights

ξ(t)(α):=ξ(α)exp(f(t,Y(t,α))).

As in (Bolthausen & Sznitman, 1998), the above weights induce the permutation ˜

π(t):

→A

such that, for all i∈

, the following holds

ξ(t)(˜

π(t)(i)) >˜

ξ(t)(˜

π(t)(i+1)).(4.23)

In what follows, we shall use the short-hand notations ˜

ξ(t)(i):=˜

ξ(t)(˜

π(t)(i)),˜

Y(t)(s,i):=

Y(s,˜

π(t)(i)) and ˜

Q(t):={˜

Q(t)(i,j):=Q(˜

π(t)(i),˜

π(t)(j))}i,j∈

Theorem 4.1.3. Given a discrete order parameter x ∈Q0(1,1), we have

1. Independence #1. The normalised RPC point process N(ξ)is independent from the corre-

sponding randomised limiting GREM overlaps q.

2. Independence #2. The reordered filtered limiting GREM ˜

Y is independent from the corre-

sponding reordered weights ˜

ξ.

3. The reordering change of measure. Given I b

, let νI(·|Q)be the joint distribution of

{Y(1,i)}i∈I, and ˜

νI(·|Q)be the joint distribution of {˜

Y(1)(1,i)}i∈Iboth conditional on Q.

Then

88 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

d˜

νI(·|Q)

dνI(·|Q)=

∏

k=0

∏

i∈I/∼

kexpxkf(xk+1,Y(xk+1,i))−fk(xkY(xk,i)),(4.24)

where the innermost product in the previous formula is taken over all equivalence classes

on the index set I induced by the equivalence ∼

4. The averaging property. For all s,t∈[0;1], we have

nξ(t)(α)oα∈A,˜

Q(t)∼nξ(s)(α)oα∈A,˜

Q(s).(4.25)

Proof. The proof is the same as in the case of the one-dimensional SK model, see Arguin

(2007); Bolthausen & Sznitman (1998). ut

Keeping in mind (4.24), we define, for k∈[0;n−1]∩

, the following random variables

Tk(α):=expxkf(xk+1,Y(xk+1,α))−f(xk,Y(xk,α)).

Given k∈[1;n]∩

, assume that α(1),α(2)∈Asatisfy qL(α(1),α(2)) = k. We introduce, for no-

tational convenience, the (random) measure µk(t,U)– an element of M1(ΣN)– by demanding

the following

µk(t,U)[g]:=

T1(α1)···Tk(α1)Tk+1(α1)Tk+1(α2)···Tn(α1)Tn(α2)

G(t,α(1),U)⊗G(t,α(2),U)[g]i,(4.26)

where g:U2→

is an arbitrary measurable function such that (4.26) is finite. Using this

notation, we can state the following lemma.

Lemma 4.1.5. For any i,j∈

, satisfying i ∼

kj, i 

k+1j, we have

G(t,i,U)⊗G(t,j,U)kR(σ1,σ2)−Q(i,j)k2

F=µk(t,U)hkR(σ1,σ2)−Q(k)k2

Fi.

(4.27)

Proof. This is a direct consequence of (4.24) and the fact that under the assumptions of the

theorem Q(i,j) = Q(k).

Remark 4.1.4. It is obvious from the previous theorem that µkis a probability measure.

The main result of this subsection is an “analytic projection” of the probabilistic RPC repre-

sentation which integrates out the dependence on the RPC. Comparing to (3.18), it has a more

analytic flavor which will be exploited in the remainder estimates (Section 4.3). This is also a

drawback in some sense, since the initial beauty of the RPCs is lost.

Theorem 4.1.4. In the case of Guerra’s interpolation (3.21), we have

R(t,x,Q,ΣN(B(U,ε))) =1

n−1

∑

k=0

(xk+1−xk)µk(t,ΣN(B(U,ε)))hkR(σ1,σ2)−Q(k)k2

+O(ε)+O(1−xn),(4.28)

as ε→0and xn→1.

4.2 The Parisi functional in terms of differential equations 89

Proof. Recalling (4.5) and (4.4), we write

R(t,x,Q,Σ(U,ε)) = β2

h∑

i,j

N(˜

ξ)(i)N(˜

ξ)(j)

×G(t,x,Q,i,U)⊗G(t,x,Q,j,U)hkR(σ1,σ2)−Q(i,j)k2

Fii.

Using the Theorem 4.1.3, we arrive to

R(t,x,Q,Σ(U,ε)) = β2

2∑

i,j

hN(˜

ξ)(i)N(˜

ξ)(j)i

G(t,x,Q,i,U)⊗G(t,x,Q,j,U)kR(σ1,σ2)−Q(i,j)k2

F.

(We can interchange the summation and expectation since all summands are non-negative.) The

averaging property (see Theorem 4.1.3) then gives

R(t,Σ(U,ε)) = β2

2∑

i,j

[N(ξ)(i)N(ξ)(j)]

G(t,i,U)⊗G(t,j,U)kR(σ1,σ2)−Q(i,j)k2

F.

(4.29)

For each k∈[1;n−1]∩

, we fix any indexes i0,i(k)

0,j(k)

0∈

such that i∼

kjand i

k+1j.

Rearranging the terms in (4.29), we get

R(t,Σ(U,ε)) = β2

∑

k=1

hG(t,i(k)

0,U)⊗G(t,j(k)

0,U)hkR(σ1,σ2)−Q(k)k2

Fii

×∑

i∼

i

k+1j

[N(ξ)(i)N(ξ)(j)]

+β2

G(t,i0,U)⊗G(t,i0,U)kR(σ1,σ2)−Uk2

F∑

N(ξ)(i)2.

(4.30)

Finally, applying Lemmata 4.1.4 and 4.1.5 to (4.30), we arrive at (4.28).

4.2 The Parisi functional in terms of differential equations

In this section, we study the properties of the multidimensional Parisi functional. We derive the

multidimensional version of the Parisi PDE. This allows to represent the Parisi functional as

a solution of a PDE evaluated at the origin. We also obtain a variational representation of the

Parisi functional in terms of a HJB equation for a linear problem of diffusion control. As a by-

product, we arrive at the strict convexity of the Parisi functional in 1-D which settles a problem

of uniqueness of the optimal Parisi order parameter posed by Panchenko (2005a); Talagrand

(2006c).

90 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

Lemma 4.2.1. Consider the function B :

d×

+→

defined as

B(y,t):=1

xlog

[exp{x f (y+z(t))}],

where f :

d→

satisfies Assumption 4.1.1 and {z(t)}t∈[0;1]is a Gaussian

d-valued process

with Cov[z(t)] :=Q(t)∈Sym(d)such that Q(t)u,vis differentiable, for all u,v. Then

∂tB(y,t) = 1

∑

u,v=1

Qu,v(t)∂2

yuyvB(y,t)+x∂yuB(y,t)∂yvB(y,t),(t,y)∈(0;1)×

d.(4.31)

In particular, the function B is differentiable with respect to the t-variable on (0;1)and C2(

with respect to the y-variable.

Proof. Denote Z:=

hex f (y+z(t))i. By (Aizenman et al., 2007, Lemma A.1), we have

∂tB(y,t) = 1

2x 1

∑

u,v=1

Qu,v(t)∂2

zuzvex f (z)|z=y+z(t)i!.

A straightforward calculation then gives

∂tB(y,t) = 1

2x 1

∑

u,v=1

Qu,v(t)x2∂zuf(z)∂zvf(z)+x∂2

zuzvf(z)ex f (z)|z=y+z(t)i!.(4.32)

We also have

∂yuB(y,t) = 1

hxex f (z)∂zuf(z)|z=y+z(t)i,(4.33)

and

∂2

yuyvB(y,t) = 1

x1

hex f (z)x2∂zuf(z)∂zvf(z)+∂2

zuzvf(z)|z=y+z(t)i

−1

hxex f (z)∂zuf(z)|z=y+z(t)i

hxex f (z)∂zvf(z)|z=y+z(t)i.(4.34)

Combining (4.32), (4.33) and (4.34), we get (4.31). ut

Proposition 4.2.1. Denote D :=Sn

k=0(xk;xk+1). The function f =fρdefined in (4.19) satisfies

the final-value problem for the controlled semi-linear parabolic Parisi-type PDE











∂tf(y,t)+ 1

2∑d

u,v=1d

dt˜

ρu,v(t)∂2

yuyvf(y,t)+x(t)∂yuf(y,t)∂yvf(y,t)=0,(t,y)∈D×

f(1,y) = g(y),y∈

f(y,xk−0) = f(y,xk+0),k∈[1;n]∩

,y∈

(4.35)

Note that d

dt˜

ρ(t) = Q(k+1)−Q(k)

xk+1−xk, for t ∈(xk;xk+1).

4.2 The Parisi functional in terms of differential equations 91

Proof. A successive application of Lemma 4.2.1 to (4.19) on the intervals Dstarting from (xn;1)

gives (4.35). ut

Remark 4.2.1. Note that a straightforward inspection of (4.19), using (4.32),(4.33) and (4.34),

shows that the function f defined in (4.19) is C1(D)∩C([0;1]) with respect to the t-variable and

C2(

d)with respect to the y-variable.

Lemma 4.2.2. Given ρ∈Q0(U,d), the function (4.19) satisfies the following:

fρ(0,0) =

hlog ∑

α∈A

ξ(α)exp{g(Y(1,α))}i.(4.36)

Proof. This is an immediate consequence of the RPC averaging property (4.25). ut

Lemma 4.2.3.

1. Given k ∈[1;n]∩

and a non-negative definite matrix Q ∈Sym(d), we have

∂Q(k) Qfρ(0,0) = −1

2(xk−xk−1)

[hQ,Mi],(4.37)

where M ∈

d×dis defined as

Mu,v:=T1(α(1))···Tk(α(1))Tk+1(α(1))Tk+1(α(2))···Tn(α(1))Tn(α(2))

∂zug(z)|z=Y(1,α(1))∂zvg(z)|z=Y(1,α(2))

with qL(α(1),α(2)) = k. Moreover, (4.37) does not depend on the choice of α(1),α(2)∈A

but only on k.

2. Given a non-negative definite matrix Q ∈Sym(d), we have

∂U Qfρ(0,0) = 1

hQ,M0i,(4.38)

where M0∈Sym(d)is satisfies

u,v=T1(α)···Tn(α)∂2

zuzvg(z)+∂zug(z)∂zvg(z)z=Y(1,α)+O(1−xn),

as xn→1. Note that (4.38) obviously does not depend on the choice of α∈A.

Proof. Applying (Aizenman et al., 2007, Lemma A.1) to (4.36), we obtain

∂s

hlog ∑

α∈A

exp{g(Y(1,α))}Q(k)=Q(k)+sQi

∑

u,v=1

N(˜

ξ)⊗N(˜

ξ)h∂sQ(α(1),α(2))u,v|Q(k)=Q(k)+sQ

α(1)=α(2)(α(1),α(2))∂2

zuzvg(z)+∂zug(z)∂zvg(z)z=Y(1,α(1))

−∂zug(z)|z=Y(1,α(1))∂zvg(z)|z=Y(1,α(2))oQ(k)=Q(k)+sQii.

Note that

∂sQ(α(1),α(2))u,v|Q(k)=Q(k)+sQ=(Qu,v,qL(α(1),α(2)) = k,

0,qL(α(1),α(2))6=k.

92 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

1. Define M(α(1),α(2))∈

d×das

M(α(1),α(2))u,v:=∂zug(z)|z=Y(1,α(1))∂zvg(z)|z=Y(1,α(2)).

Hence, we arrive at

∂Q(k) Qfρ(0,0) = −1

h∑

α(1)α(2)∈A

qL(α(1),α(2))=kξ(α(1))ξ(α(2))(α(1),α(2))hQ,M(α(1),α(2))ii.

The proof is concluded similarly to the proof of Theorem 4.1.4 by using the properties of

the RPC (Theorem 4.1.3 and Lemma 4.1.4).

2. The proof is the same as in (1).

The following is a multidimensional version of (Talagrand, 2006b, Lemma 4.3).

Lemma 4.2.4. For any α∈A, we have

∂xkfρ(0,0)|xk=xk−1=1

xk−1

hT1(α)···Tk−2(α)Tk−1(α)|xk=xk−1



hf(xk+1,Y(xk+1,α))Tk(α)|xk=xk−1i−f(xk,Y(xk,α))i.

2. Let M ∈Sym(d)with Mu,v:=∂zuf(xk,Y(xk,α))∂zvf(xk,Y(xk,α)), then

∂2

Q(k) Q,xk

fρ(0,0) = 1

hT1(α)···Tk−2(α)hQ,Mii.

Proof. This proof is the same as in (Talagrand, 2006b). ut

We now generalise the PDE (4.35). Given a piece-wise continuous x∈Q(1,1)and Q∈

Q(U,d), consider the following terminal value problem

(∂tf+1

2h˙

Q,∇2fi+xh˙

Q∇f,∇fi=0,(y,t)∈

d×(0,1),

f(y,1) = g(y).(4.39)

We say that f∈C([0;1]×

d→

)is a piece-wise viscosity solution of (4.39), if there exists

the partition of the unit segment 0 =:x0<x1< ... < xn+1:=1 such that, for each k∈[0,n]∩

f:(xk;xk+1)×

d→

is a viscosity solution (see, e.g., Briand & Hu (2007)) of











∂tf+1

2h˙

Q,∇2fi+xh˙

Q∇f,∇fi=0,(y,t)∈

d×(xk,xk+1),

f(y,xk+1+0) = f(y,xk+1−0),

f(y,1) = g(y).

Proposition 4.2.2. For any ρ(1),ρ(2)∈Q0(U,d), we have

|fρ(1)(0,0)−fρ(2)(0,0)|≤ C

2Z1

0kρ(1)(t)−ρ(2)(t)kFdt,

where C =C(Σ):=

kMkF.

4.2 The Parisi functional in terms of differential equations 93

Proof. This is an adaptation of the proof of (Talagrand, 2006c, Theorem 3.1) to the multidimen-

sional case. Assume without loss of generality that the paths ρ(1)and ρ(2)have same jump times

{xk}n+1

k=0. Denote the corresponding overlap matrices as {Q(1,k)}n+1

k=0and {Q(2,k)}n+1

k=0. Given s∈

[0;1], define the new path ρ(s)∈Q0(U,d)by assuming that it has the same jump times {xk}n+1

k=0

as the paths ρ(1),ρ(2)and defining its overlap matrices as Q(k)(s):=sQ(1,k)+(1−s)Q(2,k). On

the one hand, we readily have

0kρ(1)(t)−ρ(2)(t)kFdt=

∑

k=1

(xk−xk−1)kQ(1,k)−Q(2,k)kF.

On the other hand, using Lemma 4.2.3, we have

|∂sfρ(s)(0,0)|≤ C

∑

k=1

(xk−xk−1)kQ(1,k)−Q(2,k)kF.

Finally, we have

|fρ(1)(0,0)−fρ(2)(0,0)|≤Z1

0|∂sfρ(s)(0,0)|ds.

Combining the last three formulae, we get the theorem. ut

Remark 4.2.2. Note that using the same argument and notations as in the previous theorem we

get that, for any (y,t)∈

d×[0;1],

|fρ(1)(y,t)−fρ(2)(y,t)|≤ C(Σ)

2Z1

tkρ(1)(s)−ρ(2)(s)kFds.

Remark 4.2.3. Note that we can associate to each ρ∈Q(U,d)aSym+(d)-valued countably

additive vector measure νρ∈M([0;1],Sym+(d)) by the following standard procedure. Given

[a;b)⊂[0;1], define

νρ([a;b)) :=ρ(b)−ρ(a)

and then extend the measure, e.g., to all Borell subsets of [0;1].

Theorem 4.2.1. Given U ∈Sym+(d), we have

1. The set Q(U,d)is compact under the topology induced by the following norm

kρk:=Z1

0kρ(t)kFdt,ρ∈Q(U,d).(4.40)

2. The functional Q0(U,d)3ρ7→ fρ(0,0)is Lipschitzian and can be uniquely extended by

continuity to the whole Q(U,d).

Proof. 1. The topology induced by the norm (4.40) coincides with the topology of weak con-

vergence of the above-defined vector measures. Since Q(U,d)is a bounded set, it is compact

in the weak topology.

2. This is an immediate consequence of Proposition 4.2.2.

94 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

In the next result, we summarise some results on the PDE (4.39) for the non-discrete param-

eters, cf. Proposition 4.2.1.

Theorem 4.2.2.

1. Existence. Assume that Q is in Q(U,d)and is piece-wise C(1). Assume also that x is in

Q(1,1)and is piece-wise continuous. Then the terminal value problem (4.39) has a unique

continuous, piece-wise viscosity solution fQ,x∈C([0;1]×

d).

2. Monotonicity with respect to x.Assume Q ∈Q(U,d). Assume also that x(1),x(2)∈Q(1,1)

are such that x(1)(t)≤x(2)(t), almost everywhere for t ∈[0;1]. Let fQ,x(1)and fQ,x(2)be the

corresponding solutions of (4.39). Then fQ,x(1)≤fQ,x(2).

3. Monotonicity with respect to g.Assume g1,g2:

d→

satisfy Assumption 4.1.1 and also

g1≤g2almost everywhere. Let fg1,fg2:

d×[0;1]→

be the corresponding solutions of

(4.39) with g =g1, g =g2, respectively. Then fg1≤fg2.

Proof. 1. Due to the assumptions, the diffusion matrix ˙

Q(t) = ˙

ρ(t)in (4.39) is non-negative

definite. Applying (Briand & Hu, 2007, Proposition 8) to the PDE (4.39) successively on

the intervals [xk;xk+1), where the ˙

ρis continuous, gives the existence of the solutions in

viscosity sense and, moreover, gives their continuity. Uniqueness is ensured by (Da Lio &

Ley, 2006, Theorem 1.1).

2. By the approximation argument (cf. Theorem 4.2.1), it is enough to assume that x(1),x(2)∈

Q0(1,1)and Q∈Q0(U,d). Then Proposition 4.2.1 gives the existence of the corresponding

piece-wise classical solutions of (4.39): fQ,x(1),fQ,x(2). These solutions are obviously also

the (unique) piece-wise viscosity solutions of (4.39). The comparison result (Briand & Hu,

2007, Theorem 5) and the non-linear Feynman-Kac formula (Briand & Hu, 2007, Proposi-

tion 8 ) give then the claim.

3. This can be seen either from the representation (4.36) and an approximation argument, or

exactly as in (2) by invoking the results of Briand & Hu (2007).

4.2.1 The Parisi functional

We consider now a specific terminal condition in the system (4.35) given in (4.20).

Given ρ∈Q(U,d), let fρ:[0;1]×

d→

be the value of (the continuous extension onto

Q(U,d)of) the solution of (4.35) with the specific terminal condition given by (4.20). Fol-

lowing the ideas in the physical literature, we now define the Parisi functional P(β,ρ,Λ):

+×Q0(U,d)×Sym+(d)×Sym(d)→

in as

P(β,ρ,Λ):=fρ(0,0)−β2

2Z1

x(t)dkρ(t)k2

F−hU,Λi.(4.41)

The integral in (4.41) is understood in the usual Lebesgue-Stiltjes sense.

Remark 4.2.4. Note that the path integral term in (4.41) equals f (0,0), where f (t,y)is the

solution of (4.39) with the following boundary condition

g(y):=βhy,

i=β

∑

u=1

yu,y∈

4.2 The Parisi functional in terms of differential equations 95

Obviously Q0(d)is dense in Q(d).

Theorem 4.2.3. We have

p(β)≤sup

U∈Sym+(d)

inf

ρ∈Q0(U,d)

Λ∈Sym(d)

P(β,ρ,Λ).(4.42)

Proof. The bound (4.42) is a straightforward consequence of Theorem 4.1.1. ut

4.2.2 On strict convexity of the Parisi functional and its variational representation

In this subsection, we derive a variational representation for Parisi’s functional. As a conse-

quence, for d=1, we prove that the functional is strictly convex with respect to the x∈Q(1,1),

if the terminal condition g(cf. (4.39)) is strictly convex and increasing. This result is related to

the problem of strict convexity of the Parisi functional in the case of the SK model.

Let W:={W(s)}s∈

+be the standard

d-valued Brownian motion and let {Ft}t∈

+be the

correspondent filtration. Define

U[t;T]:={u:[t;T]→

d|uis {Ft}t∈

+progressively measurable}.

Given u∈U[t;1],Q∈Q(U,d)and x∈Q(1,1), consider the following

d-valued and adapted

to {Ft}t∈

+diffusion

Y(Q,x,u,t,y)(s):=y−Zs

tx(s)˙

Q(s)1/2u(s)ds+Zs

t˙

Q(s)1/2dW(s),s∈[t;1].

Given some function g:

d→

satisfying Assumption 4.1.1, define fQ,x:

d×[0;1]→

fQ,x(y,t):=sup

u∈U[t;1]

g(Y(Q,x,u,t,y)(1))−1

2Z1

tku(s)k2

2ds.(4.43)

Proposition 4.2.3. Let d =1. If g is strictly convex and increasing, then the functional Q(1,1)3

x7→ fQ,xis strictly convex.

Proof. We have

Y(Q,x,u,t,y)(1) = y−Z1

tx(s)˙

Q(s)1/2u(s)ds+Z1

t˙

Q(s)1/2W(s).

By an approximation argument, it is enough to prove the strict convexity for the continuous

x1,x2∈Q(1,1)(x16=x2). For any γ∈(0;1), we have

Y(Q,γx1+(1−γ)x2,u,t,y)(1) = −Z1

tγx1+(1−γ)x2˙

Q(s)1/2u(s)ds+Z1

t˙

Q(s)1/2W(s)

<−γZ1

tx1˙

Q(s)1/2u(s)ds−(1−γ)Z1

tx2˙

Q(s)1/2u(s)ds

+Z1

t˙

Q(s)1/2W(s)

=γY(Q,x1,u,t,y)(1)+(1−γ)Y(Q,x2,u,t,y)(1),(4.44)

96 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

where the strict inequality above is due to the strict concavity of the square root function. The

strict convexity and monotonicity of gcombined with the representation (4.44) implies that

(4.43) is strictly convex as a function of x, since a supremum of a family of convex functions is

convex. ut

Proposition 4.2.4. Given a piece-wise continuous x ∈Q(1,1)and a Q ∈Q(U,d)which is

piece-wise in C1(0;1), the function fQ,x:

d×[0;1]→

defined by (4.43) is a unique, contin-

uous, piece-wise viscosity solution of the following terminal value problem

(∂tf+1

2h˙

Q,∇2fi+xh˙

Q∇f,∇fi=0,(y,t)∈

d×(0,1),

f(y,1) = g(y).

Proof. In a way similar to the proof of Theorem 4.2.2, we successively use (Da Lio & Ley,

2006, Theorem 2.1) on the intervals (xk;xk+1), where the data of the PDE are continuous. ut

Theorem 4.2.4. Assume d =1. Suppose also that g satisfies the assumptions of Proposi-

tion 4.2.3. For any u ∈

, the generalised Parisi functional given by (4.41) with fρ(0,0)corre-

sponding to the terminal condition g is strictly convex on Q(u,1). Consequently, there exists a

unique optimising order parameter.

Proof. In 1-D, we can choose the coordinates such that Q:=Ut, on [0;1]. Consequently, ˙

Q:=

U:=const on [0;1]. Hence, it is enough check the strict convexity with respect to x∈Q(1,1).

The result follows by approximation in the norm (4.40) of an arbitrary pair of different elements

of Q(U,d)by a pair of elements of Q0(U,d)and Propositions 4.2.1, 4.2.3 and 4.2.4. ut

Remark 4.2.5. Due to the monotonicity assumption on g, Theorem 4.2.4 does not cover the

case of the SK model, where the terminal value g is given by (4.20).

4.2.3 Simultaneous diagonalisation scenario

In the setups with highly symmetric state spaces ΣN(such as the spherical spin models of

Panchenko & Talagrand (2007b) or the Gaussian spin models, see Section 5.2 below), less

complex order parameter spaces as Q(U,d)suffice.

Given some orthogonal matrix O∈O(d), we briefly discuss the case ρ∈Qdiag(U,O,d),

where

Qdiag(U,O,d):={ρ∈Q(U,d)|for all t∈[0;1], the matrix Oρ(t)O∗is diagonal}.

The space Qdiag(U,O,d)is obviously isomorphic to the space of “paths” with the non-

decreasing coordinate functions in

d, starting from the origin and ending at u, i.e.,

Q(u,d):={ρ:[0;1]→

d|¯

ρ(0) = 0; ¯

ρ(1) = u;¯

ρ(t)¯

ρ(s),for t≤s;¯

ρis c´

adl´

ag},

where u=OUO∗∈

d. The isomorphism is then given by

Q(u,d)3¯

ρ7→OρO∗∈Qdiag(U,O,d).(4.45)

4.3 Remainder estimates 97

4.3 Remainder estimates

In this section, we partially extend Talagrand’s remainder estimates to the multidimensional set-

ting. Due to Proposition 4.1.1, to prove the validity of Parisi’s formula it is enough to show that

all the µkterms in (4.28) almost vanish for the almost optimal parameters of the optimisation

problem in (4.14). This can be done if the free energy of two coupled replicas of the system

(4.48) is strictly smaller than twice the free energy of the uncoupled single system (4.2), see

inequality (4.3.2). However, the systems involved in (4.3.2) are effectively at least as complex

as the SK model itself. In Section 4.3.2, we again apply Guerra’s scheme to obtain the upper

bounds on (4.48) in terms of the free energy of the corresponding comparison GREM-inspired

model. One might then hope that by a careful choice of the comparison model one can prove

inequality (4.3.2). In Sections 4.3.3 and 4.3.4, we formulate some conditions on the comparison

system which would suffice to get inequality (4.3.2), giving, hence, the conditional proof of the

Parisi formula, see Theorem 4.3.1.

4.3.1 A sufficient condition for µk-terms to vanish

In this subsection, we are going to establish a sufficient condition for the measures µkto vanish.

This condition states roughly the following. Whenever the free energy of a certain replicated

system uniformly in Nstrictly less then twice the free energy of the single system, the measure

µkvanishes in N→+∞limit (see Lemma 4.3.2).

Keeping in mind the definition of µk(cf. (4.26)) and of the Hamiltonian Ht(σ,α)(cf. (4.1)),

we define, for α(1),α(2)∈A(2),k, the corresponding replicated Hamiltonian as

H(2)

t(σ(1),σ(2),α(1),α(2)):=Ht(σ(1),α(1))+ Ht(σ(2),α(2)).(4.46)

Remark 4.3.1. We note here that the distribution of the Hamiltonian Ht(k,σ(1),σ(2))depends

only on k and not on the choice of the indices α(1),α(2)∈A(2),k.

Remark 4.3.2. The superscript (2)in (4.46) (and in what follows) indicates that the quantity is

related to the twice replicated objects.

Define

A(2),k:={(α(1),α(2))∈A2:qL(α(1),α(2)) = k}.

Additionally, for any V⊂Σ(B(U,ε))2and any suitable Gaussian process,

{F(σ(1),σ(2),α(1),α(2)):σ(1),σ(2)∈ΣN,α(1),α(2)∈A},

we define the local remainder comparison functional as

Φ(2),k,x

V[F]:=1

hlogZZVZZA(2),kexpnβ√NF(σ(1),σ(2),α(1),α(2))o

dµ⊗N(σ(1))dµ⊗N(σ(2))dξ(α(1))dξ(α(2))i.(4.47)

Define

ϕ(2)

N(k,t,x,Q,V):=Φ(2),k

VhH(2)

ti.(4.48)

98 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

Lemma 4.3.1. Recalling the definition (4.2), for any V⊂Σ(B(U,ε))2, we have

ϕ(2)

N(k,t,x,Q,V)≤ϕ(2)

N(k,t,x,Q,Σ(B(U,ε))2) = 2ϕN(t,x,Q,Σ(B(U,ε))).(4.49)

Proof. The first inequality in (4.49) is obvious, since the expression under the integral in (4.47)

is positive. The equality in (4.49) is an immediate consequence of the RPC averaging property

(4.25). ut

In what follows, we shall be looking for the sharper (in particular, strict) versions of the

inequality (4.49) because of the following observation due to Talagrand (2006b).

Lemma 4.3.2. Fix an arbitrary V⊂ΣN(B(U,ε))2. Suppose that, for some ε>0, the following

inequality holds

ϕ(2)

N(k,t,x,Q,V)≤2ϕN(t,x,Q,ΣN(B(U,ε)))−ε.(4.50)

Then, for some K >0, we have

µk(V)≤Kexp−N

K.

Proof. The proof is based on Theorem 3.2.2 and follows the lines of (Panchenko, 2005b,

Lemma 7). ut

4.3.2 Upper bounds on ϕ(2): Guerra’s scheme revisited

In this subsection, we shall develop a mechanism to obtain upper bounds on ϕ(2)defined in

(4.48). This will be achieved in the full analogy to Guerra’s scheme by using a suitable Gaussian

comparison system.

Given U∈Sym+(d), we say that V∈

d×dis an admissible mutual overlap matrix for U, if

U:=U V

V∗U∈Sym+(2d).(4.51)

Furthermore, define

V(U):={V∈

d×d:Vis an admissible mutual overlap matrix for U}.

Hereinafter without further notice we assume that U∈Sym+(2d)has the form (4.51), where

Vis some admissible mutual overlap matrix for U.

Let Q∈Q(U,2d). Let x:={xl∈[0;1]}n

l=1be the “jump times” of the path ρ. We assume

that the “times” are increasingly ordered, i.e.,

0=x0<x1< ... < xn<xn+1=1.

Consider the following collection of matrices

Q:={Ql:=Q(xl)⊂Sym+(2d)}n+1

l=0.

We obviously then have

4.3 Remainder estimates 99

0=Q(0)≺Q(1)≺...≺Q(n)≺Q(n+1)=U.(4.52)

Such a path Qinduces in the usual way the “doubled” GREM overlap kernel Q:={Q(α(1),α(2))∈

Sym+(2d)|α(1),α(2)∈An}, defined as

Q(α(1),α(2)):=Q(qL(α(1),α(2))).

We also need the d×dsubmatrices of the above overlap such that

Q(α(1),α(2)) = Q|11(α(1),α(2))Q|12(α(1),α(2))

Q|12(α(1),α(2))∗Q|22(α(1),α(2)).(4.53)

Remark 4.3.3. For σ(1)σ(2)∈ΣN, we shall use the notation σ(1)qσ(2)∈

2dNto denote the

vector obtained by the following concatenation of the vectors σ(1)and σ(2)

σ(1)qσ(2):=σ(1)

iσ(2)

i∈Σ×Σ⊂

2dN

i=1.

Let us observe that the process

X(2):=nX(2)(τ) = X(σ(1))+ X(σ(1))|τ=σ(1)qσ(2);σ(1),σ(2)∈ΣNo

is actually an instance of the 2d-dimensional Gaussian process defined in (0.8). Hence, it has

the following correlation structure, for τ1,τ2∈Σ(2)

CovhX(2)(τ1),X(2)(τ2)i=kR(2)(τ1,τ2)k2

The path ρinduces also the following two new (independent of everything before) comparison

process Y(2):=nY(2)(α)∈

2d|α∈Ano, with the following correlation structures

CovhY(2)(α(1)),Y(2)(α(2))i=Q(α(1),α(2))∈Sym+(d).

As usual, let {Y(2)

i}N

i=1be the independent copies of Y(2). For the purposes of new Guerra’s

scheme we define a GREM-like process (cf. (3.15))

A(2)={A(2)(τ,α):τ=σ(1)qσ(2);σ(1),σ(2)∈ΣN;α∈An}

A(2)(τ,α):=2

N1/2N

∑

i=1hY(2)

i(α),τii.

We fix some t∈[0;1]. We would now like to apply Guerra’s scheme to the comparison func-

tional (4.47) and the following two processes

nH(2)

t(σ(1),σ(2),α)oσ(1),σ(2)∈ΣN,α∈A,n√tA(2)(σ(1)qσ(2),α)oσ(1),σ(2)∈ΣN,α∈A.

100 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

These two processes are, respectively, the counterparts of the processes X(σ)and A(σ,α)in

Guerra’s scheme.

Consider a path e

Q∈Q0(U,d)with the following jumps

0=:e

Q(0)≺e

Q(1)≺...≺e

Q(n)≺e

Q(n+1).

Let e

A:=ne

A(σ,α):σ∈ΣN;α∈Anobe a Gaussian process (independent of all random ob-

jects around) with the following covariance structure

A(σ(1),α(1))e

A(σ(2),α(2))i=2hR(σ(1),σ(2)),e

Q(α(1),α(2))i.

For notational convenience, we introduce also the following process

A(2)(σ(1)qσ(2),α(1),α(2)):=e

A(σ(1),α(1))+ e

A(σ(2),α(2)).(4.54)

Recalling the replicated Hamiltonian (4.46) and following Guerra’s scheme, we introduce, for

s∈[0;1], the following interpolating Hamiltonian

H(2)

t,s(σ(1),σ(2),α(1),α(2)):=√stX(2)(σ(1)qσ(2))+p(1−s)tA(2)(σ(1)qσ(2),α(1))

+√1−te

A(2)(σ(1)qσ(2),α(1),α(2)).(4.55)

Given ε,δ>0 and L∈Sym(2d), define (cf. (4.6))

V(2)(L,U,ε,δ):={U0∈Sym+(2d):kU0−UkF<ε,hU0−U,Li<δ}.

We consider the following set of the local configurations

Σ(2)

N(L,U,ε,δ):=n(σ(1),σ(2))∈ΣN×ΣN:R(2)

N(σ(1)qσ(2),σ(1)qσ(2))∈V(2)(L,U,ε,δ)o.

(4.56)

Note that Σ(2)

N(L,U,ε,δ)⊂ΣN(B(U,ε))2. We consider also the RPC ζ=ζ(x)generated by the

vector xand, for any suitable Gaussian process

F:={F(σ(1),σ(2),α(1),α(2))|σ(1),σ(2)∈ΣN;α(1),α(2)∈An},

define the corresponding local comparison functional (cf. (4.47)) as follows

Φ(2),k,x

V[F]:=1

hlogZZVZZA(2),kexpnβ√NF(σ(1),σ(2),α(1),α(2))o

dµ⊗N(σ(1))dµ⊗N(σ(2))dζ(α(1))dζ(α(2))i.

Define the corresponding local free energy-like quantity as (cf. (4.2))

χ(s,t,k,x,Q,e

Q,Σ(2)

N(L,U,ε,δ)) :=Φ(2),k,x

Σ(2)

N(L,U,ε,δ)hH(2)

t,si.(4.57)

To lighten the notation, we indicate hereinafter only the dependence of χon s. Denote

Bx,Q:=tβ2

∑

l=1

xlkQ(l+1)k2

F−kQ(l)k2

F.

4.3 Remainder estimates 101

Lemma 4.3.3. There exists C =C(Σ)>0such that, for any Uas above, we have

∂

∂sχ(s,t,k,x,Q,e

Q,Σ(2)

N(L,U,ε,δ)) ≤−Bx,Q+Cε,(4.58)

Consequently,

ϕ(2)

N(k,t,x,Q,Σ(2)

N(L,U,ε,δ)) ≤Φ(2),k,x

Σ(2)

N(L,U,ε,δ)h√tA(2)(σ(1)qσ(2),α(1))

+√1−te

A(2)(σ(1)qσ(2),α(1),α(2))i−Bx,Q+Cε.(4.59)

Proof. The idea is the same as in the proof of Theorem 4.1.1 and is based on Proposition 3.2.5.

Since we are considering the localised free energy-like quantities (4.57), the variance terms

induced by the interpolation (4.55) in (3.34) cancel out (up to the correction O(ε)) and we are

left with the non-positive contribution of the covariance terms. ut

Given L∈Sym(2d), we consider the following stencil of the Legendre transform

Φ(2),k,x,L[F]:=−hL,Ui−Bx,Q+1

[logZZΣ2

NZZA(2),kexp{β√NF(σ(1),σ(2),α(1),α(2))

+hL(σ(1)qσ(2)),σ(1)qσ(2)i}

dµ⊗N(σ(1))dµ⊗N(σ(2))dζ(α(1))dζ(α(2))i.

(4.60)

Definition 4.3.1. Let F : Sym(2d)→

. Given δ>0, we call L(0)∈Sym(2d)δ-minimal for

F, if

F(Λ(0))≤inf

Λ∈Sym(2d)F(Λ)+δ.

Lemma 4.3.4. There exists C =C(Σ)>0such that, for all Uand Q∈Q0(U,2d)as above,

all ε,δ>0, there exists a δ-minimal Lagrange multiplier L=L(U,ε,δ)∈Sym(2d)for (4.60)

such that, for all k ∈[1;n]∩

, all t ∈[0;1], and all (x,Q), we have

ϕ(2)

N(k,t,x,Q,Σ(2)

N(L,U,ε,δ)) ≤inf

L∈Sym(2d)e

Φ(2),k,x,Lh√tA(2)(σ(1)qσ(2),α(1))

+√1−te

A(2)(σ(1)qσ(2),α(1),α(2))i

+C(ε+δ).(4.61)

Proof. The argument is the same as in the proof of Theorem 3.1.1. ut

Consider the family of matrices e

Q:=ne

Q(l)∈Sym+(2d)|l∈[0;n+1]∩

o, defined as

Q(l):=e

Q(l)e

Q(l)

Q(l)e

Q(l),(4.62)

for l∈[0;k]∩

, and as

102 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

Q(l):=e

Q(l)e

Q(k)

Q(k)e

Q(l),(4.63)

for l∈[k+1;n+1]∩

. Additionally we define, for l∈[0;n+1], the matrices

Q(l)(t):=tQ+(1−t)e

Let b

Z(l)∈

2d×2d, for l∈[0;x], be independent Gaussian vectors with

Covhb

Z(l)i=2β2b

Q(l+1)(t)−b

Q(l)(t).

Given b

y∈

2d,L∈Sym(2d), consider the random variable

X(2)

n+1(b

y,x,b

Q(t),L):=logZΣZΣexphb

y,σ(1)qσ(2)i+hL(σ(1)qσ(2)),σ(1)qσ(2)idµ(σ(1))dµ(σ(2)).

(4.64)

Define recursively, for l∈[n;0]∩

, the following quantities

X(2)

l(b

y,k,x,b

Q(t),L):=1

log

Z(l)hexpxlX(2)

l+1(b

y+b

Z(l),k,x,b

Q(l)(t),L)i.(4.65)

Lemma 4.3.5. We have

Φ(2),k,x,Lh√tA(2)(σ(1)qσ(2),α(1))+√1−te

A(2)(σ(1)qσ(2),α(1),α(2))i

=−hL,Ui+X(2)

0(0,x,b

Q(l)(t),L).(4.66)

Proof. This is an immediate consequence of the RPC averaging property (4.25). ut

Proposition 4.3.1. Under the conditions of Lemma 4.3.4, we have

ϕ(2)

N(k,t,x,Q,Σ(2)

N(L,U,ε,δ)) ≤inf

L∈Sym(2d)−hL,Ui+X(2)

0(0,x,b

Q(t),L)−Bx,Q+C(ε+δ).

Remark 4.3.4. Similarly to (4.13), there exists C =C(Σ,µ)>0, such that, for any L∈

Sym(2d),

ϕ(2)

N(k,t,x,Q,Σ(2)

N(B(U,ε)) ≤−hL,Ui−Bx,Q+X(2)

0(0,x,b

Q(t),L)+CkLkFε.

Proof. Immediately follows from Lemmata 4.3.4 and 4.3.5. ut

4.3.3 Adjustment of the upper bounds on ϕ(2)

Proposition 3.2.1 implies that there exists r∈[1;n]∩

such that

kQ(r−1)k2

F<kVk2

F<kQ(r)k2

F.(4.67)

Assume r=k. (Other cases are similar or easier as shown for 1-D in Talagrand (2006b).) We

make the following tuning of the upper bounds of the previous subsection. Set n:=n+1. Let

w∈[xr−1/2;xr]. Define

4.3 Remainder estimates 103

xl:=xl(w):=









2,l∈[0;k−1]∩

w,l=k,

xl,l∈[k+1;n+1]∩

(4.68)

Let

Q(l):=(Q(l),l∈[0;k−1]∩

Q(l−1),l∈[k;n+2]∩

Moreover, suppose Q:={Q(l)}n+2

l=0satisfy

kQ(l)k2

F=









4kQ(l)k2

F,l∈[0;k−1]∩

4kVk2

F,l=k,

2kQ(l−1)k2

F+kVk2

F,l∈[k+1;n+2]∩

(4.69)

Such Qexists due to (4.67). Moreover, if d≥2, then it is obviously non-unique.

Lemma 4.3.6. In the above setup, we have

Bx,Q:=tβ2n(w−xl−1)kQ(k)k2

F−kVk2

F+

∑

l=1

xlkQ(l+1)k2

F−kQ(l)k2

Fo.

Proof. The claim is a straightforward consequence of (4.68) and (4.69). ut

Define the matrix D(n+1)∈Sym+(2d)block-wise as

D(n+1)|11 :=β2t(U−Q(n+1)|11)+β2(1−t)(U−Q(n))+L|11,

D(n+1)|12 :=β2t(V−Q(n+1)|12)+L|12,

D(n+1)|21 :=β2t(V−Q(n+1)|12)∗+L|∗

12,

D(n+1)|22 :=β2t(U−Q(n+1)|22)+β2(1−t)(U−Q(n))+L|22.

Furthermore, we define

Sym+(2d)3e

D(n+1):=β2(U−Q(n))+Λ0

0β2(U−Q(n))+Λ.

Lemma 4.3.7. We have

X(2)

n+1(b

y,k,x,b

Q(t),L):=logZΣZΣexphb

y,σ(1)qσ(2)i+hD(n+1)(σ(1)qσ(2)),σ(1)qσ(2)i

×dµ(σ(1))dµ(σ(2)).

Proof. Since xn+2=1, the result follows from a straightforward calculation of the Gaussian

integrals in (4.65) for l=n+1. ut

Define

L:=Λ0

0Λ,e

U:=U Q(k)

Q(k)U.

104 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

Lemma 4.3.8. For any y ∈

d, l ∈[0;n+2]∩

, we have

X(2)

l(yqy,x(w),e

Q,e

L)|w=xk−1=(2Xl−1(y,x,Q,U,Λ),l∈[k;n+2]∩

2Xl(y,x,Q,U,Λ),l∈[0;k−1]∩

Proof. A straightforward (decreasing) induction argument on lgives the result. Indeed: for l=

n+2, an inspection of (4.64) and (3.8) immediately yields

X(2)

n+2(y(1)qy(2),x(w),e

Q,e

L) = Xn+1(y(1),x,Q,U,Λ)+Xn+1(y(2),x,Q,U,Λ),

where y(1),y(2)∈

d. Let b

Z(l)be a Gaussian 2d-dimensional vector with

Covhe

Z(l)i=2β2(e

Q(l+1)−e

Q(l)).

Define two Gaussian d-dimensional vectors e

Z(l),1and e

Z(l),2by demanding that

Z(l)=e

Z(l),1qe

Z(l),2.

Due to (4.62) and (4.63), the vectors e

Z(l),1and e

Z(l),2are independent, for l∈[k;n+1]. We have

Z(l),1∼e

Z(l),2, for l∈[0;k−1]. Assume that l∈[k;n+1]∩

and

X(2)

l+1(y(1)qy(2),x(w),e

Q,e

L) = Xl(y(1),x,Q,U,Λ)+Xl(y(2),x,Q,U,Λ).

By definition (4.64), we have

X(2)

l(y(1)qy(2),k,x,e

Q,e

L) = 1

log

Z(l)hexpxlX(2)

l+1(y(1)qy(2)+b

Z(l),k,x,e

Q,L)i

log

Z(l)hexpnxlXl(y(1)+e

Z(l),1,x,Q,U,Λ)

+Xl(y(2)+e

Z(l),2,x,Q,U,Λ)oi

=Xl−1(y(1),x,Q,U,Λ)+Xl−1(y(2),x,Q,U,Λ).

By the construction and previous formula, for l=k−1, we have

X(2)

k−1(y(1)qy(2),k,x,e

Q,e

L)|w=xk−1=X(2)

k(y(1)qy(2),k,x,e

Q,e

=Xk−1(y(1),x,Q,U,Λ)+Xk−1(y(2),x,Q,U,Λ).

Finally, for l∈[0;k−2], we recursively obtain

X(2)

l(y(1)qy(1),k,x,e

Q,e

L)|w=xk−1=1

log

Z(l)hexpxlX(2)

l+1(y(1)qy(1)+b

Z(l),k,x,e

Q,L)|w=xk−1i

log

Z(l),1hexpnxl

2Xl+1(y(1)+e

Z(l),1,x,Q,U,Λ)

+Xl+1(y(1)+e

Z(l),1,x,Q,U,Λ)oi

=2Xl(y(1),x,Q,U,Λ).

4.3 Remainder estimates 105

Remark 4.3.5. Motivated by Lemmata 4.3.2 and 4.3.8 (see also Section 4.3.4), we pose the

following problem. Is it true that, as in 1-D (see Panchenko (2005b); Talagrand (2006b)), there

exists Q∈Q0(U,2d)satisfying the assumption (4.69) such that the following inequality holds

inf

L∈Sym(2d)−hL,Ui+X(2)

0(0,x(w),b

Q(t),L)|w=xk−1

≤2 inf

Λ∈Sym(d)−hΛ,Ui+X0(0,x,Q,U,Λ)? (4.70)

Similar problems have at first been posed in Talagrand (2007b). The resolution of the above

problem seems to require more detailed information on the behaviour of the Parisi functional

(4.41) or, equivalently, of the solution of (4.39) as a function of Q ∈Q(U,d).

4.3.4 Talagrand’s a priori estimates

We start from defining a class of the almost optimal paths for the optimisation problem in (4.42).

Recall the following convenient definition from Panchenko (2005b).

Definition 4.3.2. Given U ∈Sym+(d), we shall call the triple (n,ρ∗,Λ∗)∈

×Q0

n(U,d)×

aθ-optimiser of the Parisi functional (4.41), if it satisfies the following two conditions

P(β,ρ∗,Λ∗)≤inf

ρ∈Q0(U,d)

Λ∈Sym(d)

P(β,ρ,Λ)+θ.(4.71)

P(β,ρ∗,Λ∗) = inf

ρ∈Q0

n(U,d)

Λ∈Sym(d)

P(β,ρ,Λ).(4.72)

Remark 4.3.6. It is obvious that for any θ>0such a θ-optimiser exists. The main convenient

feature of this definition (as pointed out in Talagrand (2006b)) is that n (the number of jumps of

ρ∗) is finite and fixed.

Recalling (4.11), we set

φ(x,Q,Λ)(t):=−hU,Λi−tβ2

∑

k=1

xkkQ(k+1)k2

F−kQ(k)k2

F+X0(x,Q,U,Λ).(4.73)

Under the following assumption (at first proposed in 1-D in Talagrand (2006b)), we shall ef-

fectively prove that remainder term almost vanishes on the θminimisers of (4.41), see Theo-

rem 4.3.1.

Assumption 4.3.1. Let U∈Sym+(2d)be defined by (4.51). We fix arbitrary t0∈[0;1),ε>0

and δ>0. There exists K =K(t0,ε,δ,U)>0,θ(t0,ε,δ,U)>0, and N0=N0(t0,ε,δ,U)∈

and L∗∈Sym(2d)with the following property:

If (n,ρ∗,Λ∗)is a θ-optimiser, for some θ∈(0;θ(t0,ε,δ,U)], then uniformly, for all t ∈

[0;t0), N >N0and all k ∈[1;n]∩

, we have

ϕ(2)

N(k,t,x∗,Q∗,Σ(2)

N(L∗,U,ε,δ)) ≤2φ(x∗,Q∗,Λ)(t)−1

KkQ∗(k)−Vk2

F+C(ε+δ).(4.74)

106 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

Remark 4.3.7. The validity of the above assumption for general a priori measures is an open

problem. However, in the particular case of the Gaussian a priori distribution the assumption

is indeed effectively satisfied. See Section 5.2 and Theorem 5.2.1, in particular. This gives a

complete proof of the Parisi formula for the case of Gaussian spins.

Remark 4.3.8. If the bound (4.70) holds then Lemma 4.3.6 with w =xr−1would imply that

ϕ(2)

N(k,t,Σ(2)

N(L∗,U,ε,δ)) ?

≤2φ(x∗,Q∗,Λ∗)(t)+C(ε+δ).(4.75)

The above inequality would then be a starting point for the a priori estimates in the spirit of

Talagrand (2006b) which might lead to the proof of Assumption 4.3.1.

4.3.5 Gronwall’s inequality and the Parisi formula

Theorem 4.3.1. Suppose Assumption 4.3.1 holds.

Then we have

lim

N↑+∞pN(β) = sup

U∈Sym+(d)

inf

ρ∈Q0(U,d)

Λ∈Sym(d)

P(β,ρ,Λ).

Proof. The proof follows the argument of Talagrand (2006b) (see also Panchenko (2005b)) with

the adaptations to the case of multidimensional spins. The main ingredients are the Gronwall

inequality and Lemma 4.3.2. Theorem 4.1.1 implies that

lim

N↑+∞pN(β)≤sup

U∈Sym+(d)

inf

ρ∈Q0(U,d)

Λ∈Sym(d)

P(β,ρ,Λ).

We now turn to the proof of the matching lower bound. As in the proof of Theorem 3.1.2, it is

enough to show that

lim

ε↓+0lim

N↑+∞ϕN(1,x,Q,B(U,ε)) ≥inf

ρ∈Q0(U,d)

Λ∈Sym(d)

P(β,ρ,Λ).(4.76)

1. We fix an arbitrary U∈Sym+(d). Fix also some t0∈[0;1). By Assumption 4.3.1, we can

find the corresponding θ(t0,V,U)>0 with the properties listed in the assumption. We pick

any θ∈(0;θ(t0,V,U)] and let (n,ρ∗,Λ∗)be a correspondent θ-optimiser. Note that, by

definition (4.73), we have

φ(x∗,Q∗,Λ∗)(1) = P(β,ρ∗,U,Λ∗)

and, by Definition 4.3.2,

|φ(x∗,Q∗,Λ∗)(1)−inf

ρ∈Q0(U,d)

Λ∈Sym(d)

P(β,ρ,U,Λ)|≤θ.(4.77)

2. We denote

∆N(t):=φ(x∗,Q∗,Λ∗)(t)−ϕN(t,x∗,Q∗,B(U,ε)).

4.3 Remainder estimates 107

Note that, due to (4.10), we obviously have

∆N(t)≥−Cε.(4.78)

Define

∆(t):=lim

N↑+∞∆N(t).

The definition (4.73) and Theorem 4.1.4 yield

dt∆N(t)≤1

n−1

∑

k=0

(xk+1−xk)µkhkRN(σ(1),σ(2))−Q(k)k2

Fi+Cε.(4.79)

3. Let us set D:=supσ∈Σkσk2. We note that, for any σ(1),σ(2)∈ΣN, we have

R(σ(1),σ(2))∈[−D2;D2]d×d.

Given the constant Kfrom (4.74), for any c>0, we define the set

Σ(2),k

N(U,ε):=n(σ(1),σ(2))∈ΣN(B(U,ε))2:kR(σ(1),σ(2))−Q(k)k2

F≥2K∆N(t)+co.

(4.80)

It is easy to see that by compactness we can find a finite covering of Σ(2),k

N(U,ε)by the

neighbourhoods (4.56) with centres, e.g., in the corresponding set of admissible overlap

matrices

V(k)

N(U,ε):=nR(σ(1),σ(1))∈[−D2;D2]d×d:(σ(1),σ(2))∈Σ(2),k

N(U,ε)o.

That is, there exists M=M(ε,δ)∈

and the finite collections of matrices {V(i)}M

i=1⊂

V(k)

N(U,ε)and {U(i)}M

i=1⊂B(U,ε)∩Sym+(d)such that

Σ(2),k

N(U,ε)⊂

[

i=1

Σ(2)

N(L∗(i),U(i),ε,δ),(4.81)

where

U(i):=U(i)V(i)

V∗(i)U(i)∈Sym+(2d),

and L∗(i)is the corresponding δ-minimal Lagrange multiplier.

4. Given i∈[1;M]∩

, let (n(i),x∗(i),Q∗(i),Λ∗(i)) be the corresponding toU(i)θ(i)-optimisers.

Due to Lipschitzianity of the Parisi functional (Proposition 4.2.2) and the fact that U(i)∈

B(U,ε)we can assume that n(i) = n. Using the bound (4.74) and the definition (4.80), we

obtain

ϕ(2)

N(k,t,x∗

i,Q∗

i,Σ(2)

N(L∗(i),U(i),ε,δ)) ≤2φ(x∗(i),Q∗(i),Λ∗(i))(t)−1

KkQ(k)−V(i)k2

F+C(ε+δ)

108 4 Estimates of Guerra’s remainder term in the SK model with multidimensional spins

≤2ϕN(t,x∗,Q∗,B(U,ε))−c+C(ε+δ),

where the last inequality is again due to Lipschitzianity of the Parisi functional (Propo-

sition 4.2.2) which allows to approximate functional’s value at (x∗(i),Q∗(i),Λ∗(i)) by the

value at (x∗,Q∗,Λ∗)paying the cost of at most Cε. Choose c>C(ε+δ). Then Lemma 4.3.2

implies that there exists L=L(ε,δ,c)>0 such that

µkΣ(2)

N(L∗,U,ε,δ)≤Lexp−N

L.

Therefore, the inclusion (4.81) gives

µkΣ(2),k

N(U,ε)≤LM exp−N

L.(4.82)

Hence, for each k∈[1;n]∩

, we have

µkhkRN(σ(1),σ(2))−Q(k)k2

Fi=µkhkRN(σ(1),σ(2))−Q(k)k2

Σ(2),k

N(U,ε)(σ(1),σ(2))i

+µkhkRN(σ(1),σ(2))−Q(k)k2

F1−

Σ(2),k

N(U,ε)(σ(1),σ(2))i

=: I +II.(4.83)

For all (σ(1),σ(2))∈ΣN(B(U,ε))2\Σ(2),k

N(U,ε,δ), we have by definition

kR(σ(1),σ(2))−Q(k)k2

F<2K∆N(t)+c.

Therefore, using Remark 4.1.4, we arrive to

II ≤2K∆N(t)+c.(4.84)

The bound (4.82) assures that

I≤LM exp−N

L.(4.85)

5. Combining (4.84) and (4.85) with (4.83) and (4.79), we obtain

dt∆N(t)≤2K∆N(t)+c+LM exp−N

L+C(ε+δ).

Hence,

dt(∆N(t)+c)exp(−2Kt)=exp(−2Kt)d

dt(∆N(t)+c)−2K(∆N(t) +c)

≤exp(−2Kt)d

dt(LM exp−N

L+C(ε+δ).

Integrating the above inequality and noting that due to (4.10) |∆N(0)|≤Cε, we arrive to

4.3 Remainder estimates 109

∆N(t)+c≤(Cε+c)exp(−2Kt)+ LM exp−N

L

+C(ε+δ)(exp(−2Kt)−1)+C(ε+δ).

Passing consequently to the limits N↑+∞,ε↓+0, δ↓+0 and finally c↓+0 in the above

inequality, we get

lim

ε↓+0∆(t)≤0,for all t∈[0;t0].

The existence of the N↑+∞limits is guaranteed by the general result of Guerra & Toninelli

(2003). The limits ε↓+0, δ↓+0 exist due to monotonicity. Finally, combining the above

inequality with (4.78), we get

lim

ε↓+0∆(t) = 0,for all t∈[0;t0]. (4.86)

6. Now it is easy to extend the validity of (4.86) onto the whole interval [0;1]. Indeed, due to

the boundedness of the derivatives of ϕNand φ, we have, for any t∈[0;1],

∆N(t)≤Z1

dt∆N(t)dt

=Zt0

+Z1

t0d

dt∆N(t)dt

≤∆N(t0)−∆N(0)+Z1

t0

dt∆N(t)dt

≤∆N(t0)+L(1−t0).(4.87)

Passing to the N↑+∞limit, applying (4.86), and then to t0→1 limit in (4.87), we get

lim

ε↓+0∆(t) = 0,for all t∈[0;1].

7. In particular, the previous formula yields

0=lim

ε↓+0∆(1) = φ(x∗,Q∗,Λ∗)(1)−lim

ε↓+0ϕN(1,x∗,Q∗,B(U,ε)).

Note that ϕN(1,x,Q,B(U,ε)) does not depend on the choice of xand Q. Hence, by (4.77),

we obtain

|lim

ε↓+0ϕN(1,x∗,Q∗,B(U,ε))−inf

ρ∈Q0(U,d)

Λ∈Sym(d)

P(β,ρ,U,Λ)|≤θ.

The proof of (4.76) is finished by noticing that the θcan be made arbitrary small.

The SK model with multidimensional Gaussian spins

In this chapter, we prove the local Parisi formula (Theorem 2.3.10) for the SK model with

multidimensional Gaussian spins.

5.1 Introduction

Let Σ:=

dand fix some vector h∈

d. Let µ∈Mf(Σ)be the finite measure with the following

density (with respect to the Lebesgue measure λon Σ)

dµ

dλ(σ) = detC

(2π)d!1/2

exp−1

2hCσ,σi+hh,σi,(5.1)

where C∈Sym+(d). Note that, given m∈

dand C∈Sym+(d)such that detC6=0, the density

(5.1) with h:=Cm coincides (up to the constant factor exp−1

2hCm,mi) with the Gaussian

density with covariance matrix C−1and mean m.

Remark 5.1.1. It turns out that only matrices C with sufficiently large eigenvalues will result

in finite global free energy, cf. Lemma 5.2.8. The local free energy is, in contrast, always finite,

see Lemma 5.2.7 and Theorem 5.1.1.

Consider the function f:(0 : +∞)2→

given by

f(c,u) = (β2u2+logcu −cu +1,u∈(0; √2

2β],

(2√2β−c)u+log c

β−1

2(1+log2),u∈(√2

2β;+∞].(5.2)

The following result shows that, at least, in the highly symmetric situation (5.1) with h=0 the

multidimensional Parisi formula indeed holds true (see Lemma 5.2.7 for an explanation why

the result is indeed a Parisi-type formula).

Theorem 5.1.1. Let µsatisfy (5.1) with h =0. Assume that the matrices U and C are simultane-

ously diagonalisable in the same basis. Denote by {cv∈

+}d

v=1and {uv∈

+}d

v=1the eigenval-

ues of the matrices C and U, respectively. Moreover, assume that minvuv>0and minvcv>0.

Then we have

lim

ε↓+0lim

N↑+∞pN(ΣN(B(U,ε))) =

∑

v=1

f(cv,uv).

112 5 The SK model with multidimensional Gaussian spins

Remark 5.1.2. Close results have previously been obtained in the case of the spherical model by

Panchenko & Talagrand (2007b), from where we borrow the general methodology of the proof of

the above theorem. As noted by Panchenko & Talagrand (2007b), another more straightforward

way to obtain the Theorem 5.1.1 is to diagonalise the interaction matrix G and use the properties

of the corresponding random matrix ensemble.

5.2 Proof of the local Parisi formula

In this section, we prove Theorem 5.1.1. The rich symmetries of the Gaussian a priori distri-

bution allow rather explicit computations of the X0terms (see (3.10)). This allows us to prove

that the analogon of Assumption 4.3.1 is satisfied, implying the Parisi formula for the local free

energy (Theorem 5.1.1).

Remark 5.2.1. The case of Gaussian spins is very tractable due to the (unusually) good sym-

metry (i.e., the rotational invariance) of the Gaussian measure. Therefore, it is not surprising

that in this case the calculus resembles the one for the spherical SK model, cf. Panchenko &

Talagrand (2007b); Talagrand (2006a).

We start from the estimates under a generic (i.e., no simultaneous diagonalisation, cf. Sec-

tion 4.2.3) scenario.

5.2.1 The case of positive increments

Let, for k∈[0;n]∩

∆Q(k):=Q(k+1)−Q(k).

We define, for Λ∈Sym(d), a family of matrices nD(l)∈

d×don+1

l=0as follows

D(n+1):=C,

and, further, for k∈[0;n]∩

D(k):=C−Λ−2β2n

∑

l=k

xl∆Q(l).(5.3)

We assume that the matrices Λand Care such that, for all l∈[1;n+1]∩

, we have

D(l)0.

We need the following two small (and surely known) technical Lemmata which exploit the

symmetries of our Gaussian setting. We include their statements for reader’s convenience.

Lemma 5.2.1. Fix some vector h ∈

dand a Gaussian random vector z ∈

dwith Varz=C−1∈

d×d.

Then we have

z[exp(hz,hi+hΛσ,σi)] =dethC(C−Λ)−1i1/2

×exp1

2(C−Λ)−1h,h.

5.2 Proof of the local Parisi formula 113

Proof. This is a standard Gaussian averaging argument.

Lemma 5.2.2. For a positive definite matrix ∆Q∈Sym(d), let z ∼N(0,∆Q). We fix also

another positive definite matrix D ∈Sym(d)such that ∆Q−1D−1.

Then we have

zexp1

2hD−1(z+h),z+hi=detD(D−∆Q)−1−1/2

×Z

dexp1

2h(D−∆Q)−1h,hi.

Proof. This is a standard Gaussian averaging argument. See, e.g., Talagrand (2006a) for an

argument in 1-D.

Now we are ready to compute the term X0(x,Q,U,Λ)(see (3.10)) corresponding to the a

priori distribution (5.1) in a rather explicit way.

Lemma 5.2.3. We have

X0(x,Q,U,Λ) = 1

2 h[D(1)]−1,∆Q(0)i+h[D(1)]−1h,hi+

∑

l=1

log detD(l+1)

detD(l)!!.

Proof. 1. We start from computing the following quantity

Xn+1:=logZ

dexp n

∑

l=0hY(l),σi+hΛσ,σi!dµ(σ),(5.4)

where Y(l)∈

dare independent Gaussian vectors with variance

VarhY(l)i=2β2∆Q(l).

We denote

eh:=h+

∑

l=0

Y(l).

Lemma 5.2.1 gives

dexp n

∑

l=0hY(l),σi+hΛσ,σi!dµ(σ) =dethC(C−Λ)−1i1/2

×exp1

2D(C−Λ)−1eh,ehE.

114 5 The SK model with multidimensional Gaussian spins

2. Next, we define, for l∈[0;n]∩

, recursively the following quantities

Xl:=1

log

YlexpxlXl+1.

Applying the Lemma 5.2.2 to (5.4) recursively, we obtain

X1:=1

2h[(D(1)]−1Y(0)+h,Y(0)+hi+1

∑

l=1

log detD(l+1)

detD(l)!.(5.5)

Recall that we have

X0=lim

x→+0

xlog

Y0expxX1

Y0X1(5.6)

and note that

Y0hh[D(1)]−1(Y(0)+h),Y(0)+hii=2β2h[D(1)]−1,∆Q(0)i+h[D(1)]−1h,hi.(5.7)

Hence, combining (5.6) and (5.7) with (5.5), we obtain the theorem.

5.2.2 Simultaneous diagonalisation scenario

In what follows, we employ the simultaneous diagonalisation scenario introduced in Sec-

tion 4.2.3. Suppose that, for l∈[0;n+1]∩

, and some matrix O∈O(d), we have

D(l):=O∗d(l)O,

where the vectors d(l)∈

d, for l∈[0;n]∩

, satisfy

0≺d(l)≺d(l+1).

That is, the vectors d(l)are (component-wise) increasingly ordered and non-negative.

Lemma 5.2.4. We have

X0(x,Q,U,Λ) = 1

∑

v=1 2β2q(1)

v+h2

d(1)

∑

l=1

log d(l+1)

d(l)

v!!,(5.8)

β2

∑

k=1

xkkQ(k+1)k2

F−kQ(k)k2

F=β2

∑

k=1

xlkq(k+1)k2

2−kq(k)k2

2.(5.9)

Proof. This is a standard argument which relies on the standard invariance properties of the

determinant and the matrix trace.

5.2 Proof of the local Parisi formula 115

Define the 1-D Parisi functional for the case (5.1) as

P(ρ,λ):=−λu+2β2q(1)+h2

d(1)+

∑

l=1

log d(l+1)

d(l)!

−β2n

∑

l=1

xl[q(l+1)]2−[q(l)]2.(5.10)

Proposition 5.2.1. There exists C =C(Σ)>0such that, for all u ∈

+and all ε,δ>0, there

exists an δ-minimal Lagrange multiplier λ=λ(U,ε,δ)∈

din (3.11) such that, for all t ∈[0;1]

and all (x,ρ), we have

pN(ΣN(V(Λ,U,ε,δ))) ≤1

2inf

ρ,λ d

∑

v=1

P(ρv,λv)!+C(ε+δ)(5.11)

and

lim

N↑+∞pN(ΣN(B(U,ε))) ≥1

2inf

ρ,λ d

∑

v=1

P(ρv,λv)+ lim

N↑+∞Z1

R(t,x,Q,ΣN(B(U,ε)))dt!

+C(ε+δ).(5.12)

Proof. We combine (5.8) and (5.9) and the Proposition 4.1.1 to get (5.11) and (5.12). ut

5.2.3 The Crisanti-Sommers functional in 1-D

In this subsection, we adapt the proof of Talagrand (2006a) to obtain the equivalence between

the (very tractable) Crisanti-Sommers functional (Crisanti & Sommers, 1992) and the Parisi one

(5.10) in the case of the Gaussian a priori measure (5.1). Similar ideas based on the symmetry

of the a priori measure were exploited in the case of the spherical models by Ben Arous et al.

(2001); Panchenko & Talagrand (2007b).

We restrict the consideration to 1-D situation for a moment. Given u≥0, consider ρ∈

n(u,1),λ∈

,h∈

and let {d(l)∈

}n+1

l=1be the scalars playing the role of matrices D(l)(cf.

(5.3)). That is,

d(l):=c−λ−2β2n

∑

k=l

xkq(k+1)−q(k),

d(n+1):=c.

We define, for k∈[1;n]∩

, the family of vectors {s(k)∈

d}n

k=0by

s(k):=

∑

l=k

xlq(l+1)−q(l).(5.13)

We also define the Crisanti-Sommers functional as follows

C S (ρ):=1−cu +h2s(1)+q(1)

s(1)+

n−1

∑

l=1

log s(l)

s(l+1)!+loghc(u−q(n))i

+β2n

∑

l=1

xl[q(l+1)]2−[q(l)]2.(5.14)

116 5 The SK model with multidimensional Gaussian spins

Lemma 5.2.5. If (ρ,λ)is an optimiser for (5.10), that is,

P(ρ,λ) = inf

(ρ0,λ0)

P(ρ0,λ0),(5.15)

then, for all k ∈[1;n]∩

, the pair (ρ,λ)satisfies

q(k)=h2+2β2q(1)

[d(1)]2+

k−1

∑

l=1

xl1

d(l)−1

d(l+1).(5.16)

Moreover,

λ=c−2β2(u−q(n))−(u−q(n))−1,(5.17)

and, for all k ∈[1;n]∩

, we have

s(k+1)−1

s(k)=2β2xkq(k+1)−q(k),(5.18)

and also

s(k)=1

d(k).(5.19)

Remark 5.2.2. In the formulation of the theorem (as well as elsewhere), it is implicit that

d(k)=d(k)(ρ,λ)and s(k)=s(k)(ρ,λ).

Proof. 1. Rearranging the terms in (5.10), we observe that

P(ρ0,λ0) =−λu+2β2q(1)+h2

d(1)+

∑

l=2

logd(l)1

xl−1−1

xl+1

logd(n+1)−1

logd(1)

−β2n

∑

l=1

xl[q(l+1)]2−[q(l)]2.(5.20)

We compute, for k,l∈[1;n]∩

∂d(l)

∂q(k)=









0,k<l,

2β2xk,l=k,

2β2xk−xk−1,k>l.

(5.21)

Using (5.21) and the representation (5.20), we compute the necessary condition for (q,λ)

satisfy (4.72), for k∈[2;n]∩

0=∂

∂q(k)P(q,λ) =2β2xk−xk−1"−2β2q(1)+h2

[d(1)]2+

k−1

∑

l=2

d(l)1

xl−1−1

xl

d(k)xk−1−1

x1d(1)+qk#.(5.22)

5.2 Proof of the local Parisi formula 117

We also have (for k=1)

0=∂

∂q(1)P(q,λ) = 2β2

d(1)−x1q(1)+h2

[d(1)]2−x1

x1d(1)+x1q(1)



=2β2x1

q(1)−q(1)+h2

[d(1)]2

.(5.23)

Relations (5.22) and (5.23) then imply (5.16).

2. Using the fact that

∂d(l)

∂λ =−1,

we obtain

∂

∂λ P(q,λ) = −u+h2+2β2q(1)

[d(1)]2+

n−1

∑

l=1

xl1

d(l)−1

d(l+1)+1

d(n).(5.24)

Applying (5.16) with k=nin (5.24), we obtain that the necessary condition for λto satisfy

(5.15) is as follows

0=∂

∂λ P(q,λ) = −u+q(n)+1

xn1

d(n)−1

d(n+1)

=−u+q(n)+1

d(n)=−u+q(n)+c−λ−2β2(u−q(n))−1

(5.25)

which implies (5.17).

3. Relation (5.18) is proved as follows. Subtracting the relations (5.16), we obtain, for k∈

[1;n−1]∩

xkq(k+1)−q(k)=1

d(k)−1

d(k+1).(5.26)

By (5.25), we have

xnq(n+1)−q(n)=u−q(n)=1

d(n).

(That is, (5.26) is valid also for k=n.) Combining the previous two relations, we get, for

k∈[1;n]∩

s(k)=1

d(k).(5.27)

Using (5.27) and (5.26), we get

2β2xkq(k+1)−q(k)=d(k+1)−d(k)

118 5 The SK model with multidimensional Gaussian spins

(by (5.26)) =d(k+1)d(k)xkq(k+1)−q(k)=d(k+1)d(k)s(k)−s(k+1)

(by (5.27)) =1

s(l+1)−1

s(l)

which is (5.18).

Lemma 5.2.6. If ρis an optimiser of (5.14), that is,

C S (ρ) = inf

ρ0

C S (ρ0),

then, for all l ∈[1;n]∩

,(5.18) holds.

Proof. The strategy is the same as in the previous lemma. We rearrange the summands in (5.14)

to get

C S (ρ) =h2s(1)+q(1)

s(1)+logs(1)

x1−logs(n)

xn−1

n−1

∑

l=21

xl−1

xl+1logs(l)

+logc(u−q(n))+β2n

∑

l=1

xl[q(l+1)]2−[q(l)]2.(5.28)

We have, for k,l∈[1;n]∩

∂s(l)

∂q(k)=









0,k<l,

−xk,k=l,

xk−1−xk,k>l.

(5.29)

1. Relation (5.29) implies, for k∈[2;n−1]∩

∂

∂q(k)C S (ρ) =h2(xk−1−xk)−q(1)

[s(1)]2(xk−1−xk)+ xk−1−xk

x1s(1)

k−1

∑

l=2

xk−1−xk

s(l)1

xl−1

xl−1−xk

s(k)1

xk−1

xk−1

+2β2q(k)xk−1−xk=0.

Hence,

2β2q(k)=−h2+q(1)

[s(1)]2−1

x1s(1)+1

xk−1s(k)−

k−1

∑

l=2

s(l)1

xl−1

xl−1

=−h2+q(1)

[s(1)]2−

k−1

∑

l=1

xl1

s(l)−1

s(l+1).(5.30)

2. To handle the case k=n, we note that

log1+c(u−q(n))=1

log s(n)

s(n+1)!,

and, hence, the argument in the previous item shows that (5.30) is also valid for k=n.

5.2 Proof of the local Parisi formula 119

3. Differentiating the representation (5.28) with respect to q(1)and using (5.29), we obtain

∂

∂q(1)C S (ρ) = −x1h2+1

s(1)+x1q(1)

[s(1)]2−x1

x1s(1)−2β2x1q(1)=0.

Therefore,

2β2q(1)=−h2+q(1)

[s(1)]2

which is (5.30), for k=1.

4. Subtracting equations (5.30), we arrive to (5.18), for all k∈[1;n]∩

Proposition 5.2.2. The functionals (5.14) and (5.10) are equivalent in the following sense

inf

ρ0,λ0

P(ρ0,λ0) = inf

ρ0

C S (ρ0).

Proof. 1. Let (ρ,λ)be the solutions of equations (5.18) and (5.17). Lemma 5.2.6 guarantees

that ρis the optimiser of the Crisnati-Sommers functional and Lemma 5.2.5 assures that

(ρ,λ)is the optimiser of the Parisi functional.

2. We have

P(ρ,λ)−C S (ρ) =−λu+2β2q(1)s(1)−q(1)

s(1)+cu −1

−2β2n

∑

l=1

xl[q(l+1)]2−[q(l)]2.(5.31)

We can simplify the Φ[B]-like term (that is the summation) in (5.31), using (5.18) and (5.17).

Indeed,

2β2n−1

∑

l=1

xl[q(l+1)]2−[q(l)]2=2β2n−1

∑

l=1

xlq(l+1)[q(l+1)−q(l)]+ q(l)[q(l+1)−q(l)]

(by (5.18) and (5.13)) =

n−1

∑

l=12β2q(l+1)hs(l)−s(l+1)i+q(l)1

s(l+1)−1

s(l).

(5.32)

Regrouping the summands in (5.32), we get

(5.32) =2β2n−1

∑

l=1

s(l)q(l+1)−q(l)+2β2q(1)s(1)−q(n)s(n)

n−1

∑

l=1

q(l)−q(l+1)

s(l+1)+ q(n)

s(n)−q(1)

s(1)!.(5.33)

Due to (5.18), we have

120 5 The SK model with multidimensional Gaussian spins

2β2q(l+1)−q(l)=s(l)−s(l+1)

xls(l)s(l+1)=q(l+1)−q(l)

s(l)s(l+1).

Applying the previous relation, we get that the both summations in (5.33) cancel out and we

end up with

(5.33) =2β2q(1)s(1)−q(n)s(n)+q(n)

s(n)−q(1)

s(1).

Now, turning back to (5.31), we get

P(ρ,λ)−C S (ρ) = −λu−2β2u2−[q(n)]2+2β2q(n)s(n)−q(n)

s(n)+cu −1

(by (5.17)) and (5.13) =−uc−2β2(u−q(n))−(u−q(n))−1−2β2u2−[q(n)]2

−q(n)

u−q(n)+2β2q(n)u−q(n)+cu −1

=0.

5.2.4 Replica symmetric calculations

In this subsection, we shall consider the one dimensional case of the a priori measure (5.1) with

h=0. We shall also restrict the computations to the case n=1 which is often referred to in

physical literature as the replica symmetric scenario. It is indeed the right scenario under the

above assumptions, as shows Theorem 5.1.1.

Lemma 5.2.7. Let µsatisfy (5.1) with h =0. Assume d =1, n =1and c >0. Given u ≥0, we

have

inf

ρ∈Q(u,1)

C S (ρ) = inf

q∈[0;u]1−cu +log(c(u−q))+ q

u−q+β2u2−q2=f(c,u),(5.34)

where f (c,u)is defined in (5.2).

Proof. Using the definitions, we obtain

∂

∂qC S (ρ) = ∂

∂qlog(u−q)+ q

u−q+β2u2−q2=q

(u−q)2−2β2q.

Hence, the critical points of q7→C S (q,u)are

q0=0,q1,2=u±√2

2β.

Furthermore, we also have

∂2

∂q2C S (q,u) = 1

(u−q)2+2q

(u−q)3−2β2.

5.2 Proof of the local Parisi formula 121

Hence, as a simple calculation shows, the infima in (5.34) are attained on

q∗=(0,u≤√2

2β,

u−√2

2β,u>√2

2β

(5.35)

which implies (5.34). ut

Lemma 5.2.8. Under the assumptions of Lemma 5.2.7, we have

1. For c ≥2√2β, we have

sup

u≥0

inf

q∈[0;u]

C S (q,u) = C S (0,u∗) = β2(u∗)2+logcu∗−cu∗+1,

where

u∗:=1

4β2c−pc2−8β2.

2. For c <2√2β, we have

sup

u≥0

inf

q∈[0;u]

C S (q,u)=+∞.

Remark 5.2.3. Under the assumptions, the above theorem says that from the point of view of

the global free energy, the system can only exist in the “high temperature” scenario, cf. (5.2).

The threshold at c0=2√2βcould be easily understood from the perspective of the norms of

random matrices.

Proof. 1. Suppose c≥2√2β. Recalling (5.2), for u∈(0; √2

2β], we introduce the following func-

tion

f(u):=log(cu)+β2u2−cu +1.

We have

∂

∂uf(u) = 1

u+2β2u−c.

Hence, the critical points of the function fare

u1,2=c±pc2−8β2

4β2.

Furthermore, we have

∂2

∂u2f(u) = 2β2−1

u2.

We notice that u∗≤√2

2βand, hence, due to (5.2)

C S (0,u∗) = β2(u∗)2+logcu∗−cu∗+1.

2. If c<2√2β, then the function

u7→(2√2β−c)u+log c

β−1

2(1+log2)

is unbounded on (√2

2β;+∞).

122 5 The SK model with multidimensional Gaussian spins

5.2.5 The multidimensional Crisanti-Sommers functional

Recall the definition (4.73).

Proposition 5.2.3. Assume d =1. Given u >0, we have

2φ(x∗,Q∗,Λ∗)(t) = 



3√2β−cu+log c

β−1−log2

2−t√2uβ−1

2,u>√2

2β,

2β2(u)2+log(cu)−cu +1−tβ2(u)2,u≤√2

2β.(5.36)

Proof. Combining (5.10), (5.14) with Lemma 5.2.7 and Proposition 5.2.2, we get the claim.

5.2.6 Talagrand’s a priori estimates

In this subsection, we prove that Assumption 4.3.1 is satisfied in the case of the Gaussian a

priori distribution (5.1) with h=0.

Theorem 5.2.1. Let µsatisfy (5.1) with h =0, assume U ∈Sym+(d)is such that minvuv>√2

2β

and suppose C 0. Let Q =Q∗and Λ=Λ∗.

Then, for any t0∈(0;1)and any t ∈(0;t0], we have (cf. (4.74) with k =1)

ϕ(2)

N(1,t,x,Q,Σ(2)

N(L,U,ε,δ)) ≤2φ(x,Q,Λ)(t)−1

KkQ(1)−Vk2

F+O(ε+δ).(5.37)

Proof. 1. We employ the notations of Section 4.3.2. Let n=1. Given U∈Sym(2d)(cf. (4.51)),

choose arbitrary matrices nQ(l)∈Sym(2d)|l∈[0;2]∩

osatisfying (4.52). Define x:=x

which, in particular, implies that ζ=ξ. Finally, we set, for l∈[0;n+1]∩

Q(l):=Q(l).

2. Proposition 4.3.1 implies that, for any δ-minimal L∈

2d×2d, we have

ϕ(2)

N(1,t,x,Q,Σ(2)

N(L,U,ε,δ)) ≤−hL,Ui−tβ2

2kQ(2)k2

F−kQ(1)k2

F

+X(2)

0(1,x,b

Q(l)(t),L)+O(ε+δ).(5.38)

3. We define a matrix C∈

2d×2das follows

C:=C0

0C.

Recalling (5.3), we define also the following matrices D(2):=Cand

D(1):=C−L−b

Q(2)(t)−b

Q(1)(t).(5.39)

Applying the Proposition 5.2.1 to (5.38), we get

ϕ(2)

N(1,t,Σ(2)

N(L,U,ε,δ)) ≤1

2h−hL,Ui−tβ2kQ(2)k2

F−kQ(1)k2

F

+2β2h[D(1)]−1,b

Q(1)(t)i+log detD(2)

detD(1)!#+O(ε)

=:e

Φ(2),k,x,L+O(ε).(5.40)

5.2 Proof of the local Parisi formula 123

4. Assume that the matrices

Q(1),Q(2),D(1)∈

2d×2d(5.41)

are simultaneously diagonalisable in the same basis which is given by the orthogonal matrix

O∈

2d×2d. Let the vectors

q(1),q(2),d(1)∈

2d(5.42)

be the corresponding spectra of the matrices (5.41). That is, we assume that

Q(1)=O∗diagq(1)O,Q(2)=O∗diagq(2)O,

D(1)=O∗d(1)O,e

Q(1)=O∗diag e

Q0(1)O,

where we have introduced the matrix e

Q0(1)(t)∈Sym+(2d). By (5.35), we have, Q(2)−

Q(1)=√2

2βI, where Idenotes the unit matrix of the suitable dimension. The definitions (4.62)

and (4.63) then imply

Q(2)−e

Q(1)=√2

2βI.(5.43)

Using the definitions and the above relation, we obtain

Q(1)

v(t) = O∗tdiagq(1)+(1−t)e

Q0(1)O,

Q(2)(t)−b

Q(1)(t) = O∗tdiag(q(2)−q(1))+(1−t)√2

2βIO.(5.44)

Motivated by (5.19), we set

d(1)

v:=uv−q(1)

v−1.(5.45)

In view of (5.39), the above choice necessarily yields (cf. (5.17))

L=C−O∗diag(uv−q(1)

v)−1O−b

Q(2)(t)−b

Q(1)(t)

=C−O∗diag(uv−q(1)

v)−1+tdiag(q(2)−q(1))+(1−t)√2

2βIO.(5.46)

Applying Lemma 5.2.4 to (5.40) and using (5.46), (5.45), (5.44), we get the following diag-

onalised representation of (5.38)

ϕ(2)

N(1,t,x,Q,Σ(2)

N(L,U,ε,δ)) ≤1

2logdetC−1

2hC,Ui

∑

v=1nuvh(uv−q(1)

v)−1+2β2t(q(2)

v−q(1)

v)+(1−t)√2

2βi

+2β2(uv−q(1)

v)tq(1)

v+(1−t)eq(1)

v+log(uv−e

Q0(1)

v,v)

124 5 The SK model with multidimensional Gaussian spins

−tβ2(q(2)

v)2−(q(1)

v)2o+O(ε).(5.47)

Using the definitions, we get

hC,Ui=2hC,Ui=2

∑

v=1

cvuv,

logdetC=2logdetC=2

∑

v=1

logcv.(5.48)

Motivated by (5.43) (or by (5.35)), we define

q(1)

v:=uv−√2

2β.(5.49)

In this case, as a straightforward calculation shows, the expression in the curly brackets in

(5.47) equals

2√2βuv+β√2e

Q0(1)

v,v(1−t)−logβ−1

2(log2 −t).(5.50)

By the definitions and the general properties of matrix trace, we have

∑

v=1e

Q0(1)

v,v=

∑

v=1e

Q(1)

v,v=2

∑

v=1

Q(1)

v,v,

∑

v=1

uv=2

∑

v=1

Uv,v.(5.51)

Combining (5.47) with (5.50), (5.51) and (5.48), we obtain

ϕ(2)

N(1,t,x,Q,Σ(2)

N(L,U,ε,δ)) ≤

∑

v=1−cvuv+logcv+3√2uβ

−1

2(log2 −t)−√2βtu −logβ−1+O(ε)

∑

v=1

φ(t)|c=cv,

u=uv

+O(ε),(5.52)

where in the last line we have used the relation (5.36).

5. To get the version of the a priori bound (5.52) with the quadratic correction term as stated in

(5.37), we perturb the r.h.s of (5.38) around our choice of D(1)in (5.45), i.e.,

D(1)=Uv−Q(1)

v−1=√2βI,

where in the last equality we used (5.49).

5.2.7 The local low temperature Parisi formula

Proof of Theorem 5.1.1. The result follows from Theorem 5.2.1 and Theorem 4.3.1. Note that

the proof of Theorem 4.3.1 requires a minor modification to cope with the fact that the a priori

distribution (5.1) is unbounded. This minor problem can be fixed by considering the pruned

Gaussian distribution and using the elementary estimates to bound the tiny Gaussian tails. ut

The GREM in the presence of uniform external field

In this chapter, we find the fluctuations of the ground state and of the partition function for the

GREM with external field. We provide an explicit expression for the free energy of the model.

We also obtain some large deviation results providing an expression for the free energy for a

class of models with Gaussian Hamiltonians and external field.

6.1 Introduction

Despite the recent substantial progress due to Guerra (2003), Aizenman et al. (2003, 2007),

and Talagrand (2006b) in establishing rigorously the Parisi formula for the free energy of the

celebrated SK model, understanding of the corresponding limiting Gibbs measure is still very

limited.

Due to the above mentioned works, it is now rigorously known that Derrida’s GREM is

closely related to the SK model at the level of free energy, see, e.g., (Bovier, 2006, Section 11.3).

Recently Bovier & Kurkova (2003a, 2004a,b, 2007) have performed a detailed study of the

geometry of the Gibbs measure for the GREM. This confirmed the predicted in the theoretical

physics literature hierarchical decomposition of the Gibbs measure in rigorous terms.

As pointed out in Bovier & Kurkova (2004a) (see also Ben Arous et al. (2005)), the GREM-

like models may represent an independent interest in various applied contexts, where correlated

heavy-tailed inputs play an important role, e.g., in risk modelling.

One of the key steps in the results of Bovier & Kurkova (2004a) is the identification of the

fluctuations of the GREM partition function in the thermodynamic limit with Ruelle’s proba-

bility cascades. In this chapter we also perform this step and study the effect of external field

on the fluctuations (i.e., the weak limit laws) of the partition function of the GREM in the ther-

modynamic limit. We find that the main difference introduced by the presence of external field,

comparing to the system without external field, is that the coarse graining mechanism should be

altered. This change reflects the fact that the coarse-grained parts of the system tend to have a

certain optimal magnetisation as prescribed by the strength of external field and by parameters

of the GREM. We use the general line of reasoning suggested by Bovier & Kurkova (2004a),

i.e., we consider the point processes generated by the scaling limits of the GREM Hamiltonian.

We streamline the proof of the weak convergence of these point processes to the corresponding

Poisson point process by using the Laplace transform.

126 6 The GREM in the presence of uniform external field

Organisation of the chapter

In the following subsections of the introduction we define the model of interest and formulate

our main results on the fluctuations of the partition function of the REM and GREM with ex-

ternal field and also on their limiting free energy (Theorems 6.1.1, 2.3.1, 6.1.3 and 6.1.4). Their

proofs are provided in the subsequent sections. Section 6.2 is devoted to the large deviation

results providing an expression for the free energy for a class of models with Gaussian Hamil-

tonians and external field (Theorem 6.2.1). In Section 6.3 we resort to more refined analysis and

perform the calculations of the fluctuations of the ground state and of the partition function in

the REM with external field in the thermodynamic limit. Section 6.4 contains the proofs of the

results on the fluctuations of the ground state and of the partition function for the GREM with

external field.

Definition of the model

In contrast to the work of Derrida & Gardner (1986), we consider here the GREM with ex-

ternal field which depends linearly on the total magnetisation (i.e., we consider the uniform

magnetic field, cf. the definitions (0.6) and (0.7)). Derrida & Gardner (1986) considered the

“lexicographic” external field which is particularly well adapted to the natural lexicographic

distance generated by the GREM Hamiltonian.

The important quantities are the free energy defined as

pN(β,h):=1

NlogZN(β,h),(6.1)

and the ground state energy

MN(h):=N−1/2max

σ∈ΣN

XN(h,σ).(6.2)

In what follows, we shall think of βand has fixed parameters. We shall occasionally lighten

our notation by not indicating the dependence on these parameters explicitly.

We denote the total magnetisation (cf. the second summand in (0.6)) by

mN(σ):=1

∑

i=1

σi,σ∈ΣN.(6.3)

In this chapter, we shall mainly be interested in the weak limit theorems (i.e., fluctuations) of

the partition function (0.7) and of the ground state as N↑+∞. To be precise, the general results

on Gaussian concentration of measure imply that (6.2) and (0.10) are self-averaging. By the

fluctuations of the ground state, we mean the weak limiting behaviour of the rescaled point

process generated by the Gaussian process (0.6). This behaviour is studied in Theorems 6.1.1

and 6.1.2 below. These theorems readily imply the formulae for the limiting free energy (0.10)

and the ground state (6.2). A recent account of mathematical results on the GREM without

external field and, in particular, on the behaviour of the limiting Gibbs measure can be found in

Bovier & Kurkova (2007). The GREM with external field was previously considered by Jana &

Rao (2006) (see also Jana (2007)), where its free energy was expressed in terms of a variational

problem induced by an application of Varadhan’s lemma. In this chapter, we apply very different

methods to obtain precise control of the fluctuations of the partition function for the GREM with

6.1 Introduction 127

external field. As a simple consequence of these results, we also get a rather explicit1formula

for the limiting free energy in the GREM with external field (see Theorem 6.1.4).

Main results

Limiting objects

We now collect the objects which appear in weak limit theorems for the GREM partition func-

tion and for the ground states. We denote by I:[−1;1]→

+Cram´

er’s entropy function, i.e.,

I(t):=1

2[(1−t)log(1−t)+(1+t)log(1+t)].(6.4)

Define

µ(t):=p2(log2 −I(t)),

M(h):=max

t∈[−1;1](µ(t)+ht).(6.5)

Suppose that the maximum in (6.5) is attained at t=t∗=t∗(h). (The maximum exists and is

unique, since µ(t)+ ht is strictly concave.) Consider the following two real sequences

AN(h):=µ(t∗)√N−1,(6.6)

BN(h):=M(h)√N+AN(h)

2logAN(h)2(I00(t∗)+ h)

2π(1−t2

∗).(6.7)

Define the REM scaling function uN,h(x):

→

uN,h(x):=AN(h)x+BN(h).(6.8)

Given f:D⊂

→

+, we denote by PPP(f(x)dx,x∈D)the Poisson point process with inten-

sity f. We start from a basic limiting object. Assume that the point process P(1)on

satisfies

P(1)∼PPP(exp(−x)dx,x∈

),(6.9)

and is independent of all random variables around. The point process (6.9) is the limiting object

which appears in the REM.

Theorem 6.1.1 (fluctuations of the ground state of the REM with external field).If n =1(the

REM case), then, using the above notations, we have

∑

σ∈ΣN

δu−1

N,h(XN(h,σ))

−−−→

N→∞

P(1),(6.10)

where the convergence is the weak one of the random probability measures equipped with the

vague topology.

1In contrast to (Jana & Rao, 2006, Theorem 5.1) and (Jana, 2007, Corollary 4.3.5), who stop at the level of variational

problem.

128 6 The GREM in the presence of uniform external field

To formulate the weak limit theorems for the GREM (i.e., for the case n>1), we need

a limiting object which is a point process closely related to the Ruelle probability cascade,

(Ruelle, 1987). Define, for j,k∈[1;n+1]∩

,j<k, the “slopes” corresponding to the function

ρin (0.5) as

θj,k:=qk−qj−1

xk−xj−1

Define also the following h-dependent “modified slopes”

θj,k(h):=θj,kµ(t∗(θ−1/2

j,kh))−2.

Define the increasing sequence of indices {Jl(h)}m(h)

l=0⊂[0;n+1]∩

by the following algo-

rithm. Start from J0(h):=0, and define iteratively

Jl(h):=minnJ∈[Jl−1;n+1]∩

θJl−1,J(h)>e

θJ+1,k(h),for all k>Jo.(6.11)

Note that m(h)≤n. The subsequence of indices (6.11) induces the following coarse-graining

of the initial GREM

¯ql(h):=qJl(h)−qJl−1(h),(6.12)

¯xl(h):=xJl(h)−xJl−1(h),(6.13)

θl(h):=θJl−1,Jl.(6.14)

The parameters (6.12) induce the new order parameter ρ(J(h)) ∈Q0

min the usual way

ρ(J(h))(q):=

m(h)

∑

l=1

qJl(h)

[xJl(h);xJl+1(h))(x).

Define the GREM scaling function uN,ρ,h:

→

uN,ρ,h(x):=

m(h)

∑

l=1h¯ql(h)1/2B¯xl(h)N¯

θl(h)−1/2hi+N−1/2x.

Define the rescaled GREM process as

GREMN(h,σ):=u−1

N,ρ,h(GREMN(h,σ)).

Define the point process of the rescaled GREM energies ENas

EN(h):=∑

σ∈ΣN

δGREMN(h,σ).(6.15)

Consider the following collection of independent point processes (which are also independent

of all random objects introduced above)

6.1 Introduction 129

{P(k)

α1,...,αl−1|α1,...,αl−1∈

;l∈[1;m]∩

}

such that

P(k)

α1,...,αk−1∼P(1).

Define the limiting GREM cascade point process Pmon

mas follows

Pm:=∑

α∈

δ(P(1)(α1),P(2)

α1(α2),...,P(m)

α1,α2,...,αm−1(αm)),(6.16)

Consider the following constants

γl(h):=e

θJl−1,Jl1/2,

and define the function Eh,f:

m→

E(m)

h,ρ(e1,...,em):=¯

γ1(h)e1+...+¯

γm(h)em.

Note that due to (6.11), the constants {¯

γl(h)}m

l=1form a decreasing sequence, i.e., for all l∈

[1;m]∩

, we have

γl(h)>¯

γl+1(h).(6.17)

The cascade point process (6.16) is the limiting object which describes the fluctuations of the

ground state in the GREM.

Theorem 6.1.2 (fluctuations of the ground state of the GREM with external field).We have

EN(h)w

−−−→

N↑+∞Z

mδE(m)

h,ρ(e1,...,em)Pm(de1,...,dem)(6.18)

and

MN(h)−−−→

N↑+∞

m(h)

∑

l=1h¯ql(h)¯xl(h)1/2M¯

θl(h)−1/2hi,(6.19)

almost surely and in L1.

Theorem 6.1.2 allows for complete characterisation of the limiting distribution of the GREM

partition function. To formulate the result, we need the β-dependent threshold l(β,h)∈[0;m]∩

such that above it all coarse-grained levels l>l(β,h)of the limiting GREM are in the “high

temperature regime”. Below this threshold the levels l≤l(β,h)are in the “frozen state”. Given

β∈

+, define

l(β,h):=max{l∈[1;n]∩

:β¯

γl(h)>1}.

We set l(β,h):=0, if β¯

γ1(h)≤1. The following gives full information about the limiting

fluctuations of the partition function at all temperatures.

130 6 The GREM in the presence of uniform external field

Theorem 6.1.3 (fluctuations of the partition function of the GREM with external field).We

have

exp"−β√N

l(β,h)

∑

l=1¯ql(h)1/2B¯xl(h)N¯

θ−1/2

lh#

×exp−Nlog2 +logchβh(1−xJl(β,h))+1

2β21−qJl(β,h)ch2/3βh(1−xJl(β,h))

×ZN(β,h)w

−−−→

N↑+∞K(β,h,ρ)Z

l(β,h)exphβE(l(β,h))

h,ρ(e1,...,el(β,h))iPl(β,h)(de1,...,del(β,h)),

(6.20)

where the constant K(β,h,ρ)depends on β, h and ρonly. Moreover, K(β,h,ρ) = 1, if

βγl(β,h)+1<1and K(β,h,ρ)∈(0;1), if βγl(β,h)+1=1.

The above theorem suggests that the increasing sequence of the constants {βl:=¯

γ−1}m(h)

l=1⊂

+can be thought of as the sequence of the inverse temperatures at which the phase transitions

occur: at βlthe corresponding coarse-grained level lof the GREM with external field “freezes”.

As a simple consequence of the fluctuation results of Theorem 6.1.3, we obtain the following

formula for the limiting free energy of the GREM.

Theorem 6.1.4 (free energy of the GREM with external field).We have

lim

N↑+∞pN(β,h) =β

l(β,h)

∑

l=1h(¯xl¯ql)1/2µ(t∗(¯

θ−1/2

lh))+h¯xlt∗(¯

θ−1/2

lh)i

+log2 +logchβh(1−xJl(β,h))+1

2β21−qJl(β,h),(6.21)

almost surely and in L1.

6.2 Partial partition functions, external fields and overlaps

In this section, we propose a way to compute the free energy of disordered spin systems with

external field using the restricted free energies of systems without external field. The compu-

tation involves a large deviations principle. For gauge invariant systems, we also show that the

partition function of the system with external field induced by the total magnetisation has the

same distribution as the one induced by the overlap with fixed but arbitrary configuration. This

section is based on the ideas of Derrida & Gardner (1986).

Fix p∈

. Given some finite interaction p-hypergraph (VN,E(p)

N), where VN= [1;n]∩

and

E(p)

N⊂(VN)p, define the p-spin interaction Hamiltonian as

XN(σ):=∑

i∈E(p)

J(N,p)

iσi1σi2···σip,σ∈ΣN,(6.22)

where J(N,p):=nJ(N,p)

ioi∈E(p)

is the collection of random variables having the symmetric joint

distribution. That is, we assume that, for any ε(1),ε(2)∈{−1;+1}E(p)

N, and any t∈

E(p)

6.2 Partial partition functions, external fields and overlaps 131



exp

i∑

r∈E(p)

trε(1)

rJ(N,p)

i



=



exp

i∑

r∈E(p)

trε(2)

rJ(N,p)

i



,(6.23)

where i ∈

denotes the imaginary unit.

A particular important example of (6.22) is Derrida’s p-spin Hamiltonian given by

SK(p)

N(σ):=N−p/2N

∑

i1,...,ip=1

gi1,...,ipσi1σi2···σip,

where {gi1,...,ip}N

i1,...,ip=1is a collection of i.i.d. standard Gaussian random variables. Note that

the condition (6.23) is obviously satisfied.

Given µ∈ΣN, define the corresponding gauge transformation Tµ:ΣN→ΣNas

Tµ(σ)i=µiσi,σ∈ΣN.(6.24)

Note that the gauge transformation (6.24) is obviously an involution. We say that a d-variate

random function f:Σd

N→

is gauge invariant, if, for any µ∈ΣNand any (σ(1),...,σ(d))∈Σd

f(Tµ(σ(1)),...,Tµ(σ(d))) ∼f(σ(1),...,σ(d)),

where ∼denotes equality in distribution. Define the overlap between the configurations σ,σ0∈

ΣNas

RN(σ,σ0):=1

∑

i=1

σiσ0

i.(6.25)

Note that the overlap (6.25) and the lexicographic overlap (0.2) are gauge invariant.

Given a bounded function FN:ΣN→

, define the partial partition function as

Z(p)

N(β,q,ε,XN,FN):=∑

σ:|FN(σ)−q|≤ε

expβ√NXN(σ).(6.26)

Denote

UN:=FN(ΣN),U:=∞

[

N=1

UN.(6.27)

(The bar in (6.27) denotes closure in the Euclidean topology.) Note that for the case FN=RN

we obviously have

UN=1−2k

N:k∈[0;N]∩

,U= [−1;1].

Proposition 6.2.1 (Derrida & Gardner (1986)).Assume that XNis given either by (6.22) or

XN∼GREMN. Fix some gauge invariant bivariate function FN:Σ2

N→

, and q ∈

Then, for all σ0,τ0∈ΣN, we have

132 6 The GREM in the presence of uniform external field

Z(p)

N(β,q,ε,XN,FN(·,σ0)) ∼Z(p)

N(β,q,ε,XN,FN(·,τ0)).(6.28)

In particular, the partial partition function (6.26) with FN:=RN(·,σ0)has the same distribution

as the partial partition function which corresponds to fixing the total magnetisation (6.3), i.e.,

Z(p)

N(β,q,ε,XN,RN(·,σ0)) ∼Z(p)

N(β,m,ε,):=∑

σ:|m(σ)−q|<ε

expβ√NXN(σ).

Remark 6.2.1. The proposition obviously remains valid for the Hamiltonians XNgiven by the

linear combinations of the p-spin Hamiltonians (6.22) with varying p ∈

Proof. 1. If XNis defined by (6.22), then (6.28) follows due to the gauge invariance of (6.22)

and FN. Indeed, there exists µ∈ΣNsuch that σ0=Tµ(τ0). Define

J(N,p,µ)

i:=J(N,p)

iµi1···µip.

Due to the symmetry of the joint distribution of J(N,p), we have

{XN(σ)}σ∈ΣN∼{XN(σ)|J(N,p)=J(N,p,µ)}σ∈ΣN

which implies (6.28).

2. If XN=GREMN, then, since XNis a Gaussian process, to prove the equality in distribution,

it is enough to check that the covariance of XNis gauge symmetric. Equivalence (6.28)

follows, due to (0.5) and the fact that the lexicographic overlap (0.2) is gauge invariant.

The partial partition function (6.26) induces the restricted free energy in the usual way:

p(p)

N(β,q,ε,XN,FN):=1

NlogZ(p)

N(β,q,ε,XN,FN).(6.29)

Given σ(1),σ(2)∈ΣN, let

CN(σ(1),σ(2)):=

hXN(σ(1))XN(σ(2))i,e

CN(σ(1)):=CN(σ(1),σ(1)).

Define

VN:={CN(σ,σ):σ∈ΣN},V:=∞

[

N=1

VN.

The following result establishes a large deviations type relation between the partial free energy

and the full one.

Theorem 6.2.1. Assume XN={XN(σ)}σ∈ΣNis a centred Gaussian process and FN:ΣN→

are such that, for all N,M∈

, all σ(1),σ(2)∈ΣN, and all τ(1),τ(2)∈ΣM,

CN+M(σ(1)qτ(1),σ(2)qτ(2))≤N

N+MCN(σ(1),σ(2))+ M

N+MCM(τ(1),τ(2)),(6.30)

FN+M(σ(1)qτ(1))≤N

N+MFN(σ(1))+ M

N+MFM(τ(1)).(6.31)

Assume that CNand FNare bounded uniformly in N.

Then

6.2 Partial partition functions, external fields and overlaps 133

1. The following holds

pN(β,XN,FN):=1

Nlog ∑

σ∈ΣN

expβ√NXN(σ)+NFN(σ)

−−−→

N↑+∞p(β,X,F),almost surely and in L1.(6.32)

2. The limiting free energy p(β,X,F)is almost surely deterministic.

3. We have

lim

ε↓+0lim

N↑+∞p(p)

N(β,q,ε,XN,FN) =:p(p)(β,q,X,F)

=sup

v∈V

inf

λ∈

,γ∈

−λq−γv+p(β,X,λF+γe

C),

almost surely and in L1.(6.33)

4. Finally,

p(β,X,F) = sup

q∈Up(p)(β,q,X,F)+q.(6.34)

Remark 6.2.2.

1. If there exists {constN∈

+}∞

N=1such that, for all σ∈ΣN,

CN(σ) = constN,(6.35)

then (6.33) simplifies to

p(p)(β,q,X,F) = inf

λ∈

(−λq+p(β,X,λF)),almost surely and in L1.(6.36)

2. Inequality (6.31) can alternatively be substituted by the assumption (see (Guerra & Toninelli,

2003, Theorem 1)) that FN(σ) = f(SN(σ)), where f :

→

, f ∈C1(

), and SN:ΣN→

is the bounded function such that, for all σ∈ΣN,τ∈ΣM,

SN+M(σqτ) = N

N+MSN(σ)+ M

N+MSM(τ).

3. It is easy to check that the assumptions of Proposition 6.2.1 are fulfilled, e.g., for

XN:=c1GREMN+c2SK(p)

and

FN(·):=f1(RN(·,σ(N)))+ f2(qL(·,σ(N))),

where σ(N)∈ΣN, c1,c2∈

, and f1,f2:

→

, such that f1∈C1(

), f2is convex. Note that

in this case, due to Proposition 6.2.1, the free energies (6.32) and (6.33) does not depend on

the choice of the sequence {σ(N)}∞

N=1⊂ΣN.

134 6 The GREM in the presence of uniform external field

Proof. Similarly to (Contucci et al., 2003, Theorem 1) and (Guerra & Toninelli, 2003, Theo-

rem 1) we obtain (6.32). Then (6.32) implies that

p(β,XN,λFN+γe

CN)−−−→

N↑+∞p(β,X,λF+γe

C),almost surely and in L1.

Hence, we can apply the quenched large deviation results (Theorems 3.3.1 and 3.3.2) which

readily yield (6.34) and (6.33) (or (6.36), in the case of (6.35)). ut

Remark 6.2.3. Derrida & Gardner (1986) sketched a calculation of the free energy defined in

(6.32) in the following case

FN=qLand XN=GREMN.(6.37)

This case is easier than the case (0.10) we are considering here, since both qLand GREMN

have lexicographic nature, cf. (0.5) and (0.2).

6.3 The REM with external field revisited

In this section, we recall some known results on the limiting free energy of the REM with

external field. However, we give some new proofs of these results which illustrate the approach

of Section 6.2. Moreover, we prove the weak limit theorem for the ground state and for the

partition function of the REM with external field.

Recall that the REM corresponds to the case n=1 in (1.30). This implies that the process X

is simply a family of 2Ni.i.d. standard Gaussian random variables. To emphasise this situation

we shall write REM(σ)instead of GREM(σ).

6.3.1 Free energy and ground state

Let us start by recalling the following well-known result on the REM.

Theorem 6.3.1 (Derrida (1980); Eisele (1983); Olivieri & Picco (1984)).Assume that n =1

and let p(β,h)be given by (6.1). The following assertions hold

1. We have

lim

N→∞pN(β,0) = (β2

2+log2,β≤√2log2

β√2log2,β≥√2log2 ,almost surely and in L1.(6.38)

2. For all β≥√2log2 and N ∈

, we have

0≤

[pN(β,0)] ≤βp2log2.(6.39)

See, e.g., (Bovier, 2006, Theorem 9.1.2) for a short proof.

Given k∈[0;N]∩

, define the set of configurations having a given magnetisation

ΣN,k:={σ∈ΣN:

∑

i=1

σi=N−2k}.(6.40)

6.3 The REM with external field revisited 135

Lemma 6.3.1. Set tk,N:=N−2k

N. Given any ε>0, uniformly in k ∈[0;N]∩

such that

tk,N∈[−1+ε;1 −ε],

we have the following asymptotics

N

k=

N↑+∞r2

2Ne−NI(tk,N)

qN(1−t2

k,N) 1+1

N 1

12 +1

3(1−t2

k,N)!+O1

N2!.(6.41)

Proof. A standard exercise on Stirling’s formula. ut

Theorem 6.3.2 (Dorlas & Wedagedera (2001)).Assume that n =1(the REM case) and let

p(β,h)be given by (6.1). We have

p(β,h):=lim

N→∞pN(β,h)

=(log2 +logchβh+β2

2,β≤p2(log2 −I(t∗)) =:β0

β(p2(log2 −I(t∗))+ht∗),β≥p2(log2 −I(t∗)) ,almost surely and in L1,

(6.42)

and t∗∈(−1;1)is a unique maximiser of the following concave function

(−1;1)3t7→ht +p2(log2 −I(t)).

Proof. For the sake of completeness, we give a short proof based on (the ideas of) Theo-

rem 6.2.1. Put

Mk,N:=(blog2N

kc,k∈[1;N−1]∩

1,k∈{0,N},

where bxcdenotes the largest integer smaller than x. Consider the free energy (cf. (6.29)) of the

REM of volume Mk,N

pk,N(β):=1

Mk,N

log ∑

σ∈Σk

expβM1/2

k,NREM(σ),

where REM :={REM(σ)}σ∈ΣNis the family of standard i.i.d. Gaussian random variables. Let

pk,N(β):=Mk,N

Npk,N N

Mk,N1

2β.(6.43)

Note that (6.43) is the restricted free energy (cf. (6.29)) of the REM, where the restriction is

imposed by the total magnetisation (6.3) given by tk,N.

We claim that the family of functions P:={

pN(·)}N∈

is uniformly Lipschitzian. In-

deed, uniformly in β≥0, we have

136 6 The GREM in the presence of uniform external field

∂β

pN(β)=N−1/2

GN(β,0)XN(σ)≤N−1/2

max

σ∈ΣN

X(σ)−−−→

N↑+∞p2log2.

Hence, the family Phas uniformly bounded first derivatives.

Given t∈(−1;1)and tkN,N∈UN(cf. (6.27)) such that limN↑+∞tkN,N=t, using (6.41), we

have

lim

N↑+∞

MkN,N

N=1−I(t)log2e.(6.44)

Using (6.44) and the uniform Lipschitzianity of the family P, we get

lim

N↑+∞e

pkN,N(β)=(1−I(t)log2e)p β

p1−I(t)log2e!.(6.45)

Combining (6.45) with (6.34) and (6.36), we get

p(β,h) = max

t∈[−1;1](tβh+(1−I(t)log2e)p β

p1−I(t)log2e!).(6.46)

To find the maximum in (6.46), we consider two cases.

1. If β≤p2(log2 −I(t∗)), then according to (6.38), we have

p β

p1−I(t)log2e!=log2 +β2

2(1−I(t)log2e).

Hence, (6.46) implies

p(β,h) = max

t∈[−1;1]tβh+β2

2+log2 −I(t)=log2 +logchβh+β2

2,(6.47)

where the last equality is due to the fact that the expression in the curly brackets is concave

and, hence, the maximum is attained at a stationary point. The stationarity condition reads

t=t0(β,h):=tanhβh.(6.48)

It is easy to check that the following identity holds

I(t) = ttanh−1t−logchtanh−1t.(6.49)

Combining (6.49) and (6.48), we get (6.47).

2. If β≥p2(log2 −I(t∗)), then again by (6.38), we have

p β

p1−I(t)log2e!=β√2log2

p1−I(t)log2e.

Hence, (6.46) transforms to

p(β,h) = max

t∈[−1;1]ntβh+βp2(log2 −I(t))o=βp2(log2 −I(t∗))+ ht∗,(6.50)

where the last equality is due to the concavity of the expression in the curly brackets.

6.3 The REM with external field revisited 137

Combining (6.47) and (6.50), we get (6.42). ut

Remark 6.3.1. We note that due to the continuity of the free energy as a function of β, we have

at the freezing temperature β0

t0(β0,h) = t∗(h).(6.51)

Theorem 6.3.2 suggests that the following holds.

Theorem 6.3.3. Under the assumptions of Theorem 6.3.2, we have

lim

N↑+∞

√Nmax

σ∈ΣN

XN(h,σ) = p2(log2 −I(t∗))+ht∗,almost surely and in L1.(6.52)

Proof. We have

βpN(β)≤1

NlogNβ√Nmax

σ∈ΣN

XN(h,σ)=logN

βN+1

√Nmax

σ∈ΣN

XN(h,σ).(6.53)

In view of (6.42), relation (6.53) readily implies that

p2(log2 −I(t∗))+ht∗≤lim

N↑+∞

N−1/2max

σ∈ΣN

XN(h,σ).(6.54)

We also have

βpN(β)≥1

√Nmax

σ∈ΣN

XN(h,σ)

which combined again with (6.42) implies that

p2(log2 −I(t∗))+ht∗≥lim

N↑+∞N−1/2max

σ∈ΣN

XN(h,σ).(6.55)

Due to the standard concentration of Gaussian measure (e.g., (Ledoux, 2001, (2.35))) and the

fact that the free energy (6.1) is Lipschitzian with the constant β√Nas a function of XN(h,·)

with respect to the Euclidean topology, the bounds (6.54) and (6.55) combined with the Borell-

Cantelli lemma give the convergence (6.52). ut

6.3.2 Fluctuations of the ground state

In this subsection, we shall study the limiting distribution of the point process generated by the

properly rescaled process of the energy levels, i.e. (6.15).

Proof of Theorem 6.1.1. Let us denote

EN(h):=∑

σ∈ΣN

δu−1

N,h(XN(h,σ)).(6.56)

We treat EN(h)as a random pure point measure on

. Given some test function ϕ∈C+

)

(i.e., a non-negative function with compact support), consider the Laplace transform of (6.56)

corresponding to ϕ

138 6 The GREM in the presence of uniform external field

LEN(h)(ϕ):=

"exp(−∑

σ∈ΣN

ϕu−1

N,h(XN(h,σ)))#

∏

k=01

2πZ

exp−ϕu−1

N,h(x+h

√N(N−2k))−x2

2dx(N

.(6.57)

Introduce the new integration variables y=u−1

N,h(x+h

√N(N−2k)). We have

(6.57) =

∏

k=0AN(h)

2πZ

exp−ϕ(y)−1

2uN,h(y)−h

√N(N−2k)2dy(N

=expN

∑

k=0N

klog1−AN(h)

√2πZ

(1−e−ϕ(y))exp−1

2uN,h(y)−h

√N(N−2k)2.

(6.58)

Note that the integration in (6.58) is actually performed over y∈suppϕ, since the integrand is

zero on the complement of the support. It is easy to check that uniformly in y∈suppϕthe inte-

grand in (6.58) and, hence, the integral itself are exponentially small (as N↑+∞). Consequently,

we have

(6.58) =

N↑+∞exp−Zsuppϕ(1−e−ϕ(y))

∑

k=0N

kAN(h)

√2πexp−1

2uN,h(y)−h

√N(N−2k)2(1+o(1)).(6.59)

Denote tk,N:=N−2k

N. Using Lemma 6.3.1, we get

(6.59) =

N↑+∞exp−(1+o(1))Zsuppϕ(1−e−ϕ(y))

∑

k=0

AN(h)

πN(1−t2

k,N)1/2expNlog2 −I(tk,N)−1

2uN,h(y)−htk,N√N2.

(6.60)

Note that despite the fact that Lemma 6.3.1 is valid only for tk,N∈[−1+ε;1 −ε], we can still

write (6.60), since the both following sums are negligible:

0≤∑

k:tk,N∈([−1;−1+ε]∪[1−ε,1])

AN(h)

πN(1−t2

k,N)1/2exphNlog2 −I(tk,N)

−1

2uN,h(y)−htk,N√N2i≤KN exp(−LN),

and

0≤∑

k:tk,N∈([−1;−1+ε]∪[1−ε,1])N

kAN(h)

√2πexp−1

2uN,h(y)−htk,N√N2≤KN exp(−LN).

6.3 The REM with external field revisited 139

Consider the sum appearing in (6.60)

SN(h,y):=

∑

k=0

AN(h)

πN(1−t2

k,N)1/2expNlog2 −I(tk,N)−1

2uN,h(y)−htk,N√N2.

(6.61)

Introduce the functions fN,gN:[−1;1]→

fN(t):=I(t)+ 1

2uN,h(y)

√N−ht2−log2,

gN(t):=AN(h)

π(N(1−t2))1/2.

Note that definition (6.5) implies

I0(t∗) = hµ(t∗).(6.62)

A straightforward computation using (6.6), (6.7) and (6.62) gives

f00

N(t) = I00(t)+ h>0,(6.63)

N(t∗) = −h

(2µ(t∗)N)2y+logI00(t∗)+h

4π(1−t2

∗)(log2 −I(t∗))N=OlogN

N,(6.64)

fN(t∗) = −1

Ny+1

2logI00(t∗) +h

4π(1−t2

∗)(log2 −I(t∗))N+o1

N.(6.65)

Hence, since (6.64) vanishes even after being multiplied by √N, (6.64) is negligible for the

purposes of the asymptotic Laplace principle. This readily implies that uniformly in y∈suppϕ

SN(h,y)∼

N↑+∞

NgN(t∗)

22πf00

N(t∗)

N1/2

expN fN(t∗).(6.66)

Using (6.63), (6.64) and (6.65) in the r.h.s. of (6.66), we obtain that uniformly in y∈suppϕ

SN(h,y)∼

N↑+∞exp(−y).(6.67)

Finally, combining (6.67) and (6.60), we obtain

lim

N↑+∞LEN(h)(ϕ) = exp−Z

1−e−ϕ(y)e−ydy.(6.68)

The r.h.s. of (6.68) is the Laplace transform of PPP(e−xdx,x∈

). Then a standard result implies

the claim (6.10). ut

140 6 The GREM in the presence of uniform external field

6.3.3 Fluctuations of the partition function

In this subsection, we compute the weak limiting distribution of the partition function under the

natural scaling induced by (6.8). Define

CN(β,h):=expβM(h)N+β

2µ(t∗)logI00(t∗)+h

4π(1−t2

∗)(log2 −I(t∗))N,(6.69)

DN(β,h):=ch−2/3(βh)expNlog2 +logchβh+β2

2,(6.70)

α(β,h):=β

µ(t∗).(6.71)

Theorem 6.3.4. If β>µ(t∗), then

ZN(β,h)

CN(β,h)

−−−→

N↑+∞Z

eα(β,h)xdP(1)(x).(6.72)

If β<µ(t∗), then

ZN(β,h)

DN(β,h)

−−−→

N↑+∞1.(6.73)

Proof. This is a specialisation of Theorem 6.1.3 which is proved in Section 6.4.2. ut

6.4 The GREM with external field

In this section, we obtain the main results of this chapter concerning the GREM with external

field. We prove the limit theorems for the distribution of the partition function and that of the

ground state.

6.4.1 Fluctuations of the ground state

As in the REM, we start from the ground state fluctuations (cf. Theorem 6.1.1). The following

is the main technical result of this section which shows exactly in which situations the GREM

with external field has the same scaling limit behaviour as the REM with external field.

Proposition 6.4.1. Either of the following two cases holds

1. If, for all l ∈[2;n]∩

log2 −I(t∗(θ−1/2

l,nh))

log2 −I(t∗(h)) <θl,n,(6.74)

then we have

∑

σ∈ΣN

δu−1

N,h(XN(h,σ))

−−−→

N↑+∞PPP(e−x,x∈

d).(6.75)

6.4 The GREM with external field 141

2. If, for all l ∈[2,...,n]∩

log2 −I(t∗(θ−1/2

l,nh))

log2 −I(t∗(h)) ≤θl,n,(6.76)

and there exists (at least one) l0∈[2;n]∩

log2 −I(t∗(θ−1/2

l0,nh))

log2 −I(t∗(h)) =θl0,n,(6.77)

then there exits the constant K =K(ρ,h)∈(0;1)such that

∑

σ∈ΣN

δu−1

N,h(XN(h,σ))

−−−→

N↑+∞PPP(Ke−x,x∈

).(6.78)

Remark 6.4.1. If condition (6.76) is violated, i.e., there exists l0∈[2;n]∩

such that

log2 −I(t∗(θ−1/2

l,nh))

log2 −I(t∗(h)) >θl,n,(6.79)

then the REM scaling (cf. (6.10),(6.8)) is too strong to reveal the structure of the ground state

fluctuations of the GREM. Theorem 6.1.2 shows how the scaling and the limiting object should

be modified to capture the fluctuations of the GREM in this regime.

Proof. 1. Denote Nl:=∆xlN, for l∈[1;n]. We fix arbitrary test function ϕ∈C+

), i.e., a

nonnegative function with compact support. Consider the Laplace transform LEN(h)(ϕ)of

the random measure EN(h)evaluated on the test function ϕ.

LEN(h)(ϕ):=

"exp−∑

σ∈ΣN

(ϕ◦u−1

N,h)(XN(h,σ))#

"∏

σ∈ΣN

exp−(ϕ◦u−1

N,h)(XN(h,σ))#.(6.80)

Consider also the family of i.i.d standard Gaussian random variables

{X(σ(l),σ(2),...,σ(n))|l∈[1;n]∩

,σ(l)∈ΣNl,...,σ(n)∈ΣNn}.

Given l∈[1;n]∩

and y∈

, define

LN(l,v):=

h∏

σ(l)q...qσ(n)∈Σ(1−xl−1)N

exp−ϕ◦u−1

N,h(v+alX(σ(l))+

...+anX(σ(l),...,σ(n))+h(1−xl−1)√Nm(σ(l),...,σ(n)))i.(6.81)

We readily have

142 6 The GREM in the presence of uniform external field

LEN(h)(ϕ) = LN(1,0).(6.82)

Due to the tree-like structure of the GREM, for l∈[1;n−1]∩

, we have the following

recursion

LN(l,v) = ∏

σ(l)∈ΣNl

hLN(l+1,v+alX+h∆xl√Nm(σ(l)))i,(6.83)

where Xis a standard Gaussian random variable. Introduce the following quantities

YN(h,y,v,t,l):=uN,h(y)−h(1−xl−1)√Nt −v.

We claim that, for any l∈[1;n]∩

, uniformly in v∈

satisfying

v≤√NM(h)−δ−1−ql−1µ(h)−h(1−xl−1)t∗(θ−1/2

l,nh),

we have

logLN(l,v)∼

N↑+∞−AN(h)

q2π(1−ql−1)

(1−xl−1)N

∑

k=0(1−xl−1)N

k

×Z

1−e−ϕ(y)exph−1

2(1−ql−1)YN(h,y,v,tk,(1−xl−1)N,l)2idy.(6.84)

We shall prove (6.84) by a decreasing induction in lstarting from l=n.

2. The base of induction is a minor modification of the proof of Theorem 6.1.1. By the defini-

tion (6.81) and independence, we have

LN(n,v) =

∏

k=0

exph−(ϕ◦u−1

h,N)(anX+h∆xn√Ntk,Nn+v)i(Nn

k).(6.85)

For fixed k∈[0;Nn]∩

hexp−(ϕ◦u−1

N,h)(anX+h∆xn√Ntk,N+v)i

= (2π)−1

dxexph−x2/2−(ϕ◦u−1

N,h)(anx+h∆xn√Ntk,N+v)i.(6.86)

We introduce in (6.86) the new integration variable

y:=u−1

N,hanx+h∆xn√Ntk,Nn+v.(6.87)

Using the change of variables (6.87), we get that the r.h.s. of (6.86) is equal to

AN(h)

√2πanZ

dyexp−1

2a2

YN(h,y,v,tk,Nn,n)2−ϕ(y).(6.88)

Combining (6.85) and (6.88), we get

6.4 The GREM with external field 143

LN(n,v) =

∏

k=0AN(h)

√2πanZ

dyexph−1

2a2

YN(h,y,v,tk,Nn,n)2−ϕ(y)i(Nn

∏

k=01−AN(h)

√2πanZ

dy1−e−ϕ(y)exph−1

2a2

YN(h,y,v,tk,Nn,n)2i(Nn

k).

Define

VN(h,v,t,n):=AN(h)

√2πanZ

dy1−e−ϕ(y)exph−1

2a2

YN(h,y,v,t,n)2i.

Given any small enough δ>0, it straightforward to show that uniformly in v∈

such that

v≤√NM(h)−h∆xnt∗(hθ−1/2

n−1,n)−δ,

we have

LN(n,v) =

N↑+∞

∏

k=01−Nn

kVN(h,v,tk,Nn,n)1+O(e−CN).(6.89)

Indeed, we have

exph−1

2a2

YN(h,y,v,tk,Nn,n)2i≤exph−Nnlog2 −I(tn)i.

Next, using the fact that (1−e−ϕ(·))∈C+

), we get for some C>0

dy1−e−ϕ(y)exph−1

2a2

YN(h,y,v,tk,Nn,n)2i≤Cexph−Nnlog2 −I(tn)i.

Applying the elementary bounds

x−x2≤log(1+x)≤x,for |x|<1

2(6.90)

x:=−AN(h)

√2πanZ

dy1−e−ϕ(y)exph−1

2a2

YN(h,y,v,tk,Nn,n)2i,

and using the fact that, due to (6.41), there exists C>0 such that uniformly in k∈[1;Nn]∩

x2≤exph−2Nnlog2 −I(tn)iNn

k≤e−CN,

we get (6.89) and, consequently, (6.84) holds for l=n.

3. For simplicity of presentation, we prove only the induction step l=n l=n−1. Due to

(6.83), we have

LN(n−1,v) =

Nn−1

∏

kn−1=0

hLN(n,v+an−1X+h∆xn−1√Ntkn−1,Nn−1)i(Nn−1

kn−1).(6.91)

144 6 The GREM in the presence of uniform external field

Define

t(kn,kn−1):=1

1−xl−2∆xntkn,Nn+∆xn−1tkn−1,Nn−1.

Fix an arbitrary δ>0 and ε>0. Due to (6.84) with l=n, there exists some C>0, such

that uniformly for all kn,kn−1with

tkn,kn−1∈{t∈[−1;1]:|t∗(θ−1/2

n−1,nh)−tkn,kn−1|≤ε},

and uniformly for all v,x∈

satisfying

∆xn(log2 −I(tkn,Nn)) ≤1

2a2

nM(h)−δ−an−1x−N−1/2v

−h(∆xntkn,Nn+∆xn−1tkn−1,Nn−1)2,(6.92)

we obtain

logLN(n,v+an−1x+h∆xn−1√Ntk,Nn−1)≤CN exp(−N/C).(6.93)

Define

xN(v):=√N

an−1M(h)−δ−vN−1/2−an(2∆xn(log2 −I(tkn,Nn)))1/2

−h(∆xntkn,Nn+∆xn−1tkn−1,Nn−1).(6.94)

Using the elementary bounds

1+x≤ex≤1+x+x2,for |x|<1,(6.95)

and the bound (6.93), we obtain

{X≤xN(v)}LN(n,v+an−1X+h∆xn−1√Ntkn−1,Nn−1)i

N↑+∞

{X≤xN(v)}+

{X≤xN(v)}logLN(n,v+an−1X+h∆xn−1√Ntkn−1,Nn−1)i

+O(Nexp(−N/C)).(6.96)

Given kn−1∈[1;Nn−1]∩

, we have



{X≤xN(v)}exp−1

2a2

YN(h,y,v+an−1X+h∆xn−1√Ntkn−1,Nn−1,tkn,Nn,n)2

√2πZxN(v)

−∞

dxexp−x2

2−1

2a2

nuN,h(y)−an−1x−h√N(∆xntkn,Nn+∆xn−1tkn−1,Nn−1)−v2

=exp−1

1−qn−2

YN(h,y,v,t(kn,kn−1),n−1)2

6.4 The GREM with external field 145

×1

√2πZxN(v)

−∞

exp

−a2

n+a2

n−1

2a2

n x−an−1

n+a2

n−1

YN(h,y,v,t(kn,kn−1),n−1)!2

dx.

(6.97)

We claim that due to the strict inequalities (6.74), we have

√2πZxN(v)

−∞

exp

−a2

n+a2

n−1

2a2

n x−an−1

n+a2

n−1

YN(h,y,v,t(kn,kn−1),n−1)!2

dx

−−−→

N↑+∞

a2

n+a2

n−11/2,(6.98)

uniformly in v∈

such that

v≤√NM(h)+δ0−h(∆xntkn,Nn+∆xn−1tkn−1,Nn−1)−(a2

n+a2

n−1)µ(h)=:vmax

N,(6.99)

where 0 <δ0exists due to strict inequality (6.74), for l=n. Indeed, due to the standard

bounds on Gaussian tails, to show (6.98) it is enough to check that

an−1

n+a2

n−1

YN(h,y,v,t(kn,kn−1),n−1)+δ√N≤xN(v),(6.100)

for vsatisfying (6.99). Due to (6.74) with l=n, there exists δ3>0 such that we have

(2∆xn(log2 −I(tkn,Nn)))1/2≤µ(h)−δ3.(6.101)

Choosing a small enough δ0>0, we have

xN(v)−an−1

n+a2

n−1

YN(h,y,v,t(kn,kn−1),n−1)+δ√N

=a2

n(M(h)−vN−1/2−h∆xntkn,Nn+∆xn−1tkn−1,Nn−1)

−(a2

n+a2

n−1)an(2∆xn(log2 −I(tkn,Nn)))1/2−δ

≥

(6.99)

n(a2

n+a2

n−1)µ(h)−δ0−(a2

n+a2

n−1)an(2∆xn(log2 −I(tkn,Nn)))1/2−δ

≥

(6.101)

n(a2

n+a2

n−1)µ(h)−δ0−(a2

n+a2

n−1)a2

nµ(h)−δ

= (a2

n+a2

n−1)(δ3a2

n+δ)−a2

nδ0>0

which proves (6.100).

We claim that there exists C>0 such that uniformly in kn−1∈[1;Nn−1]∩

and in v∈

satisfying (6.99) we have

Nn−1

kn−1

{X≥xN(v)}≤exp(−N/C).(6.102)

146 6 The GREM in the presence of uniform external field

Indeed, in view of (6.41) and due to the classical Gaussian tail asymptotics, to obtain (6.102)

it is enough to show that

Nn−1(log2 −I(tkn−1,Nn−1)) ≤1

2x2

N(vmax

N).(6.103)

Using (6.99) and (6.94), we obtain

xN(vmax

N) = N1/2

an−1(a2

n+a2

n−1)µ(h)−an(2∆xn(log2 −I(tkn,Nn)))1/2+δ0−δ.(6.104)

If n>2, then due to strict inequality (6.74), for l=n−2, there exists δ00 >0 such that we

have

(a2

n+a2

n−2)µ(h)−δ00 >(log2 −I(t∗(θ−1/2

l,nh)))(a2

n+a2

n−2)(∆xn+∆xn−1)1/2

≥(2a2

n−1∆xn−1(log2 −I(tkn−1,Nn−1)))1/2

+(2a2

n∆xn(log2 −I(tkn,Nn)))1/2,(6.105)

where the last inequality may be obtained as a consequence of Slepian’s lemma (Slepian,

1962). If n=2, then (6.105) follows directly from Slepian’s lemma. Combining (6.104) and

(6.105), we get (6.103). Note that (6.102), in particular, implies that

{X≥xN(v)}≤exp(−N/C).(6.106)

Given kn−1∈[1;Nn−1]∩

, denote

LN(n−1,v,kn−1):=

hLN(n,v+an−1X+h∆xn−1√Ntkn−1,Nn−1)i(Nn−1

kn−1)

Due to (6.106) and (6.96), we have

LN(n−1,v,kn−1) =

{X≤xN(v)}+

{X>xN(v)})LN(n,v+an−1X+h∆xn−1√Ntkn−1,Nn−1)i(Nn−1

kn−1)

=1+

{X≤xN(v)}LN(n,v+an−1X+h∆xn−1√Ntkn−1,Nn−1)i

+O

{X≥xN(v)}+Nexp(−N/C)(Nn−1

kn−1).

Using (6.102) and the standard bounds (6.90) and (6.95), we get

LN(n−1,v,kn−1) = expNn−1

kn−1

{X≤xN(v)}logLN(n,v+an−1X+h∆xn−1√Ntkn−1,Nn−1)i

+O(Nexp(−N/C)).

Applying (6.98), (6.97), (6.84), for l=n, we obtain

logLN(n−1,v,kn−1) = −AN(h)

q2π(a2

n+a2

n−1)

∑

kn=0Nn

knNn−1

kn−1

6.4 The GREM with external field 147

×Z

1−e−ϕ(y)exph−1

2(a2

n+a2

n−1)YN(h,y,v,tkn,kn−1,n−1)2idy

+O(Nexp(−N/C)).

Finally, we arrive at

logLN(n−1,v) =

Nn−1

∑

kn−1=0

logLN(n−1,v,kn−1)

=−AN(h)

q2π(a2

n+a2

n−1)

∑

kn=0

Nn−1

∑

kn−1=0Nn

knNn−1

kn−1

×Z

1−e−ϕ(y)exph−1

2(a2

n+a2

n−1)YN(h,y,v,tkn,kn−1,n−1)2idy

+O(N2exp(−N/C))

=−AN(h)

q2π(a2

n+a2

n−1)

Nn+Nn−1

∑

k=0Nn+Nn−1

k

×Z

1−e−ϕ(y)exph−1

2(a2

n+a2

n−1)YN(h,y,v,tk,Nn+Nn−1,n−1)2idy

+O(N2exp(−N/C)).

4. Combining (6.82) and (6.84) for l=1, we obtain

LEN(h)(ϕ) = exp−Z

1−e−ϕ(y)SN(h,y)dy+o(1),(6.107)

where SN(h,y)is given by (6.61). Invoking the proof of Theorem 6.1.1, we get that

LEN(h)(ϕ)−−−→

N↑+∞exp−Z

1−e−ϕ(y)e−ydy

=LP(e−x)(ϕ).

This establishes (6.75).

5. The proof of (6.78) is very similar to the above proof of (6.75). The main difference is that

(6.98) does not hold. Instead, if (6.77) holds for l0=n, then we have

√2πZxN(v)

−∞

exp

−a2

n+a2

n−1

2a2

n x−an−1

n+a2

n−1

YN(h,y,v,t(kn,kn−1),n−1)!2

dx

−−−→

N↑+∞

a2

n+a2

n−11/2

X<√N

an−1qa2

n+a2

n−1hM(h)−vN−1/2−(1−xn−2)ht∗(hθ−1/2

n,n)

−(a2

n+a2

n−1)µ(h)i,(6.108)

148 6 The GREM in the presence of uniform external field

uniformly in

v≤√NM(h)−(1−xn−2)ht∗(hθ−1/2

n,n)−(a2

n+a2

n−1)µ(h)−δ0.

The subsequent applications of the recursion (6.83) to (6.108) give rise to the constant

K(h,ρ)∈(0;1)in (6.78).

Proof of Theorem 6.1.2. The existence of the r.h.s. of (6.18) follows from (Bovier & Kurkova,

2004a, Theorem 1.5 (ii)). It remains to show convergence (6.18) itself. We apply Proposi-

tion 6.4.1 to each coarse-grained block. Note that the assumption (6.74) of Proposition 6.4.1

is fulfilled, due to the construction of the blocks, cf. (6.11), (6.17). The result then follows from

(Bovier & Kurkova, 2004a, Theorem 1.2).

The representation of the limiting ground state (6.19) is proved exactly as in (Bovier &

Kurkova, 2004a, Theorem 1.5 (iii))). ut

6.4.2 Fluctuations of the partition function

In this subsection we compute the limiting distribution of the GREM partition function under the

scaling induced by (6.8). The analysis amounts to handling both the low and high temperature

regimes. The low temperature regime is completely described by the behaviour of the ground

states which is summarised in Theorem 6.1.2. The high temperature regime is considered in

Lemma 6.4.1 below.

Lemma 6.4.1. Assume l(β,h) = 0. Then

exp"−Nlog2 +logchβh+β2

2#ch2/3(βh)ZN(β,h)

−−−→

N↑+∞K(β,h),(6.109)

where K(β,h) = 1, if β¯

γ1(h)<1, and K(β,h)∈(0;1), if β¯

γ1(h) = 1.

Proof. We follow the strategy of (Bovier & Kurkova, 2004a, Lemma 3.1). By the very construc-

tion of the coarse graining algorithm (6.11), we have

θ1,k≤e

θ1,J1=¯

γ1(h)2,k∈[1;J1]∩

θ1,k<e

θ1,J1,k∈(J1;n]∩

.(6.110)

Assume β¯

γ1(h)<1. Hence, due to (6.110), we have

βe

θ1/2

1,k<1,k∈[1;n]∩

.(6.111)

Strict inequality (6.111) implies that there exists ε>0 such that, for all k∈[1;n]∩

β2−1

2(β−ε)2qk<xklog2 −I(t∗(h(xk/qk)1/2)).(6.112)

6.4 The GREM with external field 149

We have

ZN(β,h)=

∑

k=0N

kexpβhtk,NN+β2N

2=:SN(β,h).(6.113)

Note that due to (6.41)

SN(β,h)∼

N↑+∞

∑

k=0

gN(tk,N)expN f (tk,N),(6.114)

where

f(t):=log2 −I(t)+ βht +β2/2,

gN(t):=2

πN(1−t2)1/2

A straightforward computation gives

f0(t0) = βh−tanh−1(t0(β,h)) = 0

f00(t0) = −(1−t2

0)−1=−ch2(βh),

gN(t0) = 2

πN(1−t2)1/2

=2

πN1/2

ch(βh).

The asymptotic Laplace method then yields

SN(β,h)∼

N↑+∞ch−2/3(βh)expNlog2 +logchβh+β2

2.(6.115)

For p≤q, define

GREM(p,q)

N(σ(1),...,σ(q)):=

∑

k=p

akX(σ(1),...,σ(k)).

Consider the event

EN(σ):=nGREM(1,k)

N(σ(1),...,σ(k))<(β+ε)qk√N,

for all k∈[1;n]∩

Define the truncated partition function as

Z(T)

N(β,h):=∑

σ∈ΣN

EN(σ)exphβ√NXN(h,σ)i.(6.116)

The truncation (6.116) is mild enough in the following sense

hZ(T)

N(β)i=SN(β,h)

nGREM(1,k)

N(σ(1),...,σ(k))<εqk√N,for all k∈[1;n]∩

150 6 The GREM in the presence of uniform external field

∼

N↑+∞

ZN(β,h).(6.117)

We write

ZN(β)

ZN(β)=Z(T)

N(β)

hZ(T)

N(β)i×

hZ(T)

N(β)i

ZN(β)+ZN(β)−Z(T)

N(β)

ZN(β)

=: (I) ×(II) +(III).

Due to (6.117), we get

(II) ∼

N↑+∞1,(III) L1

−−−→

N↑+∞0.

To estimate (I), we fix any δ>0, and use the Chebyshev inequality

{|(I) −1|>δ}≤δ

hZ(T)

N(β)i−2

VarhZ(T)

N(β)i.(6.118)

Expanding the squares, we have

VarhZ(T)

N(β)i=

hZ(T)

N(β)2i−

hZ(T)

N(β)i2

∑

p=1

∑

σ(1)q...qσ(k)∈ΣxkN

"expn2β√NGREM(1,p)

N(σ(1),...,σ(p))

+2βhxpmxpN(σ(1),...,σ(p))√No

×∑

σ(p+1)q...qσ(n),

τ(p+1)q...qτ(n)∈Σ(1−xp)N,

σ(p+1)6=τ(p+1)

expnβ√N

×GREM(p+1,n)

N(σ(1),...,σ(n))+GREM(p+1,n)

N(τ(1),...,τ(n))

+h(1−xp)√N(m(1−xp)N(σ(p+1)q...qσ(n))+m(1−xp)N(τ(p+1)q...qτ(n))o

EN(σ(1)q...qσ(n))

EN(τ(1)q...qτ(n))#.(6.119)

Hence, due to the independence, we arrive at

VarhZ(T)

N(β)i≤

∑

p=1

xkN

∑

k=0N

k

"expn2β√NGREM(1,p)

N(σ(1),...,σ(p))

+hxptk,N√No

nGREM(1,p)

N(σ(1),...,σ(p))<(β+ε)qp√No#

× (1−xp)N

∑

k=0(1−xp)N

k

"expβ√N(GREM(p+1,n)

N(σ(1),...,σ(n))

6.4 The GREM with external field 151

+h(1−xp)tk,(1−xp)N√N#!2

.(6.120)

Assume that Xis a standard Gaussian random variable. Using the standard Gaussian tail bounds,

we have

"exp2β√N(GREM(1,p)

N(σ(1),...,σ(p))+hxptk,N√N)

nGREM(1,p)

N(σ(1),...,σ(p))<(β+ε)qp√No#

=expnN2β2qp+βhtk,No

X≥(β−ε)pqpN

≤

N↑+∞

CexpN2β2qp+βhtk,N−1

2(β−ε)2qp.(6.121)

Similarly to (6.114), using (6.41) and (6.121), we have

xkN

∑

k=0N

k

"exp2β√N(GREM(1,p)

N(σ(1),...,σ(p))

+hxptk,N√N)

nGREM(1,p)

N(σ(1),...,σ(p))<(β+ε)qp√No#

≤

N↑+∞

xkN

∑

k=0

expNxp(log2 −I(tk,xpN))+ 2β2qp+2βhxptk,xpN−1

2(β−ε)2qp=:PN(p).

(6.122)

Using (6.41), we also obtain

(1−xp)N

∑

k=0(1−xp)N

k

"expβ√N(GREM(p+1,n)

N(σ(1),...,σ(n))

+h(1−xp)√Ntk,(1−xp)N#

≤

N↑+∞

(1−xp)N

∑

k=0

expnN(1−xp)(log2 −I(tk,(1−xp)N))+ 1

2(1−qp)β2

+βh(1−xp)tk,(1−xp)No=:e

PN(p).(6.123)

Combining (6.120), (6.122) and (6.123), we get

VarhZ(T)

N(β)i≤

N↑+∞

∑

p=1

PN(p)e

N(p).(6.124)

For any p∈[1;n]∩

, we have the following factorisation

152 6 The GREM in the presence of uniform external field

hZ(T)

N(β)i=

xpN

∑

k=0xpN

k

"expβ√N(GREM(1,p)

N(σ(1),...,σ(p))+hxptk,xpN√N)

(1−xp)N

∑

k=0(1−xp)N

kexpβ√N(GREM(p+1,n)

N(σ(1),...,σ(n))

+h(1−xp)Ntk,(1−xp)N√N)

EN(σ(1)q...qσ(n))#.(6.125)

Hence, again similarly to (6.114), we obtain

hZ(T)

N(β)i∼

N↑+∞C

xpN

∑

k=0

expNxp(log2 −I(tk,xpN))+ 1

2qpβ2+βhxptk,xpN

(1−xp)N

∑

k=0

expnN(1−xp)(log2 −I(tk,(1−xp)N))+ 1

2(1−qp)β2

+βh(1−xp)tk,(1−xp)No=:QN(p)×e

PN(p).(6.126)

Denote

RN(p):=qpβ2+2xpmax

t∈[−1;1]{log2 −I(t)+βht}.

We observe that similarly to (6.115) we have

N(p)

exp(NR(p)) ∼

N↑+∞C.(6.127)

Combining (6.124), (6.126), (6.127) and (6.51), we get

(6.118) ≤

N↑+∞

∑

p=1

PN(p)

N(p)=C

∑

p=1

PN(p)/exp(NR(p))

N(p)/exp(NR(p)) ≤

N↑+∞

∑

p=1

PN(p)

exp(NR(p))

≤

N↑+∞

∑

p=1

expN(β2−1

2(β−ε)2)qp−(log2 −I(t0))xp

∑

p=1

expN(β2−1

2(β−ε)2)qp−(log2 −I(t∗(h(xp/qp)1/2)))xp−−−→

N↑+∞0,

(6.128)

where the convergence to zero in the last line is assured by the choice of εin (6.112). Finally,

combining (6.118) and (6.128), we get

(I)

−−−→

N↑+∞1.

This finishes the proof of (6.109) in the case β¯

γ1(h)<1.

The case β¯

γ1(h) = 1 is a little bit more tedious and uses the information about the low

temperature regime obtained in Theorem 6.1.2 in the spirit of the proof of (Bovier & Kurkova,

2004a, Lemma 3.1). The lemma follows. ut

6.4 The GREM with external field 153

Proof of Theorem 6.1.3. The proof is verbatim the one of (Bovier & Kurkova, 2004a, The-

orem 1.7), where the analysis of the high temperature regime (Bovier & Kurkova, 2004a,

Lemma 3.1) is substituted by Lemma 6.4.1. The low temperature regime is governed by the

fluctuations of the ground state which are summarised in Theorem 6.1.2.

6.4.3 Formula for the free energy of the GREM

Proof of Theorem 6.1.4. The L1convergence follows immediately from Theorem 6.1.3. Almost

sure convergence is a standard consequence of Gaussian measure concentration, e.g., (Ledoux,

2001, (2.35)), and the Borell-Cantelli lemma. ut

Some open problems and outlook

In this thesis, we studied two types of the large sums of strongly correlated exponentials. The

first type is a sum of hierarchically correlated random variables (the GREM with external field).

The second type is an infinitesimal sum of genuine non-hierarchically strongly correlated ran-

dom variables (the SK model with multidimensional spins). We were interested in the limiting

behaviour of such sums as the number of their summands and the effective dimension of the

correlation structure simultaneously tend to infinity.

For the GREM with external field, we provided an explicit expression for the free energy

(Theorem 6.1.4) and even obtained precise information about the fluctuations of the partition

function (Theorem 6.1.3). Our understanding of the SK model with multidimensional spins is

only at the level of free energy (cf. Question 0.0.1) and is substantially less elaborate. Below we

list some open questions concerning the latter model.

In view of the bounds on the free energy (of the SK model with multidimensional spins)

obtained using the generalised AS2scheme (Theorems 3.1.1 and 3.1.2, cf. also (3.22)), it is

natural to pose the following question.

Question 7.0.1 (Parisi-type formula, the AS2scheme version).Using the notations of Chapter 3

and, in particular, of Theorem 3.1.1, for which a priori distributions of spins do we have

p(β) = sup

U∈U

inf

(x,Q,Λ)f(x,Q,Λ,U),(7.1)

where the infimum runs over all x satisfying (3.7), all Λ∈Sym(d), all Qsatisfying both (3.6)

and “Hadamard squares” Assumption 3.1.2?

In view of the bounds on the free energy obtained using generalised Guerra’s scheme (see

Theorems 4.1.1 and 4.1.2), we pose the following version of Question 7.0.1.

Question 7.0.2 (Parisi-type formula, Guerra’s scheme version).Using the notation of Theo-

rem 4.1.1, for which a priori distributions of spins do we have (7.1), where the infimum in (7.1)

runs over all x satisfying (3.7), all Qsatisfying (3.6), and all Λ∈Sym(d)?

In view of the results on the Parisi-type formula in the case of multidimensional Gaussian a

priori distribution (Chapter 5), we pose the following question.

Question 7.0.3 (simultaneous diagonalisation scenario).Using the notations of Section 4.2.3,

for which a priori distributions of spins does (7.1) hold, where the infimum in (7.1) runs over all

x∈Q0

n(1,1), all Q∈Qdiag(U,O,d)∩Q0

n(U,d), all O ∈O(d), all n ∈

and all Λ∈Sym(d)?

156 7 Some open problems and outlook

In view of the conditional positive answer to Question 7.0.2 which we obtained by Tala-

grand’s methodology of a priori estimates (cf. Theorem 4.3.1), we pose the following question.

Question 7.0.4. For which a priori distributions of spins is Assumption 4.3.1 satisfied?

A related question (see Remark 4.3.8) seems to be the following one.

Question 7.0.5. For which a priori distributions of spins does the condition (4.75) hold?

We embedded an open problem of strict convexity of the Parisi functional posed by Panchenko

(2005a); Talagrand (2006c) into a more general setting in Section 4.2. In view of our partial re-

sult on strict convexity (see Theorem 4.2.4), this problem can be generalised and paraphrased

in the following way.

Question 7.0.6 (multidimensional Parisi functional).Given a piece-wise continuous x ∈Q(1,1)

and Q ∈Q(U,d), assume fx,Qis the solution of (4.39). For which terminal conditions g and

parameters Q is the mapping Q0(1,1)3x7→ fx,Q(0,0)∈

strictly convex?

Remark 7.0.2. The case of Question 7.0.6 with d =1essentially corresponds to the problem

posed by Panchenko (2005a); Talagrand (2006c).

Remark 7.0.3. It would, perhaps, also be useful to understand the behaviour of the mapping

Q(U,d)3Q7→ fx,Q(0,0)∈

Appendix

The general result of Guerra & Toninelli (2003) implies that the thermodynamic limit of the

local free energy (3.5) exists almost surely and in L1. The following existence of the limiting

average overlap is an immediate consequence of this.

Proposition A.0.1. We have

GN(β)⊗GN(β)VarHN(σ)−

HN(σ)HN(σ0)−−−→

N↑+∞C(β)≥0,

where C :

+→

Proof. The free energy is a convex function of β(a consequence of the H¨

older inequality).

Hence, by a result of Griffiths (1964) the following holds

lim

N↑∞

dβ

pN(β)=d

dβ

[p(β)].

Proposition 3.2.4 implies

dβ

pN(β)=β

GN(β)⊗GN(β)VarHN(σ)−

HN(σ)HN(σ0).

The following super-additivity result is an application of the Gaussian comparison inequali-

ties obtained in Subsection 3.2.3. Note that the result does not provide enough information for

the cavity-like argument of Aizenman et al. (2003).

Proposition A.0.2. For any V≡B(U,ε)⊂U, we have

pN(V)+M

[pM(V)] ≤(N+M)

pN+M(V)+(N+M)O(ε),

as ε↓+0.

Proof. Define the process YN,M:={Y(σ):σ=αqτ;α∈ΣN,τ∈ΣM}as follows

Y(αqτ):=N

N+M1/2

X(1)

N(α)+M

N+M1/2

X(2)

M(τ),

158 A Appendix

where X(1)and X(2)are two independent copies of the process X. Given some Gaussian process

{C(σ)}σ∈ΣN, let us introduce the functional ΦN(β)[C]as follows

ΦN,M(β)[C]:=

hlog µ⊗(N+M)h

ΣN(V)

ΣM(V)exp(β√N+MC)ii.

Now, set ϕ(t):=ΦN+M(β)h√tXN+M+√1−tYN,Mi. Applying Proposition 3.2.5, we get

dtϕ(t) = β2(N+M)

[G(t)⊗G(t)[

VarXN+M(σ(1))−VarYN,M(σ(1))

−CovhXN+M(σ(1)),XN+M(σ(2))i−CovhYN,M(σ(1)),YN,M(σ(2))iii.(A.1)

Note that we have

ϕ(0) = N

pN(V)+M

[pM(V)],

ϕ(1)≤(N+M)

pN+M(V),(A.2)

where the last inequality is due to the fact that, for all α∈ΣN(V)and all τ∈ΣN(V), we have

αqτ∈ΣN+M(V).

Moreover, for σ=αqτwith α∈ΣN(V)and σ∈ΣM(V)we have

VarXN+M(σ)−VarYN,M(σ) = 



N+MRN(α,α)+ M

N+MRM(τ,τ)



2−N

N+MkRN(α,α)k2

−M

N+MkRM(τ,τ)k2

2=O(ε).

Also, due to convexity of the norm, we have

CovhXN+M(σ(1)),XN+M(σ(2))i−CovhYN,M(σ(1)),YN,M(σ(2))i

=



N+MRN(α(1),α(2))+ M

N+MRM(τ(1),τ(2))



2−N

N+MkRN(α(1),α(2))k2

−M

N+MkRM(τ(1),τ(2))k2

2≤0.

Applying R1

0dtto (A.1) and using the previous two formulae, we get the claim. ut

Notation Index

Sets, spaces

natural numbers, i.e., {1,2,...}

real axis, i.e., (−∞;+∞)

+nonnegative reals, i.e., [0;+∞)

+compactified nonnegative reals, i.e., [0;+∞]

[−∞;+∞)

d−1unit Euclidean sphere in

dwith centre at 0 ∈

B(x,r)Euclidean ball with centre at xand radius r∈

d×dreal d×dmatrices

Sym(d)symmetric d×dmatrices

Sym+(d)symmetric non-negative-definite d×dmatrices

O(d)orthogonal d×dmatrices

C(k)(D)k-times continuously differentiable functions f:D⊂

d→

C+

K(D)nonnegative functions f:D⊂

d→

+with compact support

Mf(D)finite measures on the metric space D

M1(D)probability measures on the metric space D

Λa finite index set

Λ0an infinite index set

Σa single spin configuration space

ΣΛ(or ΣN) whole system configuration space

ΣIsing Ising spins, ΣIsing :={−1;1}

(Ω,F,

)probability space of disorder

G0(β)set of all infinite-volume (DLR) Gibbs measures

Q(U,d)set of all c`

adl`

ag Sym+(d)-valued non-decreasing (in the sense of

quadratic forms) “paths” which start in 0 and end in matrix U, (4.17)

Q0(U,d)set of all piece-wise constant paths in Q(U,d)with finite (but arbitrary)

number of jumps

n(U,d)set of all piece-wise constant paths in Q0(U,d)with exactly n∈

jumps

E(I)set of all equivalence relations on the set I

Relations

ΛbΛ0set Λis the finite subset of the infinite set Λ0

160 Notation Index

xα

−−→

(α)y(xα)weakly converges to yalong the direction α

xα

−−→

(α)y(xα)converges to yin probability along the direction α

xα∼

(α)yα(xα)and (yα)are asymptotically equivalent along the direction α, i.e.,

limαyα

xα=1

xα=

(α)

O(yα)“the big-O notation”, rough asymptotic domination along the direction

α, i.e., limα|xα

yα|<+∞

A0 matrix A∈

d×dis nonnegative definite

AB B −A0

X∼Yrandom variables Xand Yare equidistributed

i∼

kjBolthausen-Sznitman equivalence relation, (2.27)

Π1Π2partition Π1is not finer than Π2

Operations

x:=y(y=:x) element xequals yby definition

xqyconcatenation of the vectors xand y

[x]kprojection of the vector x∈

non its first k(with k≤n) coordinates,

i.e., (x1,...,xk)∈

[σ]Aprojection of the vector σ∈ΣΛ0on ΣA, where A⊂Λ

bxclargest integer smaller than x

x∨ymaximum of xand y

x∧yminimum of xand y

N(·)normalisation operation, (2.18)

cardΛcardinality of the set Λ

diagxdiagonal matrix in

d×dinduced by the vector x∈

detAdeterminant of the matrix A∈

d×d

∇f(x)gradient of the function f:D⊂

d→

, i.e., (∂1f(x),...,∂df(x))

∇2f(x)Hessian of the function f:D⊂

d→

, i.e., (∂2

u,vf(x))d

u,v=1

∂x eF(x)directional (Gˆ

ateaux) derivative of F:X→

at the point xalong the

direction e

hx,yiEuclidean scalar product between the vectors x,y∈

d, i.e., ∑d

u=1xuyu

hA,Bitracial scalar product between the matrices A,B∈

d×d, i.e.,

∑d

u,v=1Au,vBu,v

µ⊗νproduct measure

µ⊗Λproduct measure, i.e., Ni∈Λµ

xacoordinate-wise raising of the vector x∈

dto the power a∈

Aaentry-wise raising of the matrix x∈

d×dto the power a∈

intDinterior of the set D

convDconvex hull of the set D

∂Dborder of the set D

Norms, metrics, valuations

D(x)indicator function of the set D

[X]expectation of the random variable X

Notation Index 161

µ[X]average of the function X with respect to the measure µ, i.e., RXdµ

Var[X]variance of the random variable X

Cov[X,Y]covariance between the random variables Xand Y

kxk2Euclidean norm of the vector x∈

d, i.e., qx2

1+···+x2

kAkFFrobenius (Hilbert-Schmidt) norm of the matrix A∈

d×d

kµkTV total variation norm of the finite (signed) measure µ, i.e., kµkTV =

supD|µ(D)|

dH(σ(1),σ(2))Hamming distance,

dL(α(1),α(2))lexicographic distance, (0.3)

R(σ(1),σ(2))overlap matrix, (3.3)

Q(α(1),α(2))ultrametric overlap matrix, (3.14)

q(i,j)limiting ultrametric overlap of i,j∈

, see (2.26)

S(ν|µ)relative entropy, (1.6)

I(·)Cram`

er Entropy, (6.4)

I∗(·)large deviations (good) rate function, (3.41)

pΛfree energy

P(·)Parisi functional, (5.10)

C S (·)Crisanti-Sommers functional, (5.14)

Processes, random variables, measures

µa priori measure on Σ

µΛ(or µN) a priori product measure on ΣΛ(or ΣN)

GΛ(or GN) finite volume Gibbs measure on ΣΛ(or ΣN), (1.1)

G0(β)infinite-volume (DLR) Gibbs measure

GΛ(β,σ(c))Gibbsian local specification on ΣΛ, given the external condition σ(c) ∈

Λ0\Λ, (1.12)

probability on the space of disorder

U(D)uniform distribution on the set D⊂

REM(·)Derrida’s REM process, (1.24)

GREM(·)Derrida’s GREM process, (0.5)

SK(·)Sherrington-Kirkpatrick process, (1.20)

PPP(D3x7→ f(x)∈

)Poisson point process with the density f:D⊂

d→

PD(x,a)Poisson-Dirichlet point process

ξRuelle’s probability cascade, (2.22)

ENpoint process of the GREM energy levels, (6.15)

HΛ(A,h;σ)spin glass Hamiltonian, (1.14)

Abbreviations

i.i.d. independent identically distributed

k-D k-dimensional

LLN law of large numbers

LDP large deviations principle

CLT central limit theorem

PDE partial differential equation

162 Notation Index

BSDE backward stochastic differential equation

HJB equation Hamilton-Jacobi-Bellman equation

adl`

ag continue `

a droite, limite `

a gauche (right continuous with left limits)

RS replica symmetric

RSB replica symmetry breaking

FRSB full replica symmetry breaking

RPC Ruelle’s probability cascade

REM random energy model

GREM generalised random energy model

CREM continuous GREM

SK model Sherrington-Kirkpatrick model

EA model Edwards-Anderson model

RKKY model Ruderman-Kittel-Kasuya-Yoshida model

AS2scheme Aizenman-Sims-Starr scheme

References

ADLER, ROBERT J. 1990. An introduction to continuity, extrema, and related topics for general

Gaussian processes. Institute of Mathematical Statistics Lecture Notes—Monograph Series,

12. Hayward, CA: Institute of Mathematical Statistics.

ADLER, ROBERT J. & TAYLOR, JONATHAN E. 2007. Random fields and geometry. Springer

Monographs in Mathematics. New York: Springer.

AIZENMAN, MICHAEL; SIMS, ROBERT & STARR, SHANNON L. 2003. An Extended Varia-

tional Principle for the SK Spin-Glass Model. Phys. Rev. B,68, 214403.

AIZENMAN, MICHAEL; SIMS, ROBERT & STARR, SHANNON L. 2007. Mean-field spin glass

models from the cavity-ROSt perspective. Pages 1–30 of: Prospects in mathematical physics.

Contemp. Math., vol. 437. Providence, RI: Amer. Math. Soc. arXiv:math-ph/0607060.

AMIT, DANIEL J. 1989. Modeling brain function. Cambridge: Cambridge University Press.

The world of attractor neural networks.

ARBIB, MICHAEL A. (ed). 2003. The handbook of brain theory and neural networks. Second

edn. A Bradford Book. Cambridge, MA: MIT Press.

ARGUIN, LOUIS-PIERRE. 2007. Spin glass computations and Ruelle’s probability cascades. J.

Stat. Phys.,126(4-5), 951–976. arXiv:math-ph/0608045v1.

ARRATIA, RICHARD; BARBOUR, ANDREW D. & TAVAR ´

E, SIMON. 2003. Logarithmic com-

binatorial structures: a probabilistic approach. EMS Monographs in Mathematics. European

Mathematical Society (EMS), Z¨

urich.

BASDEVANT, ANNE-LAURE. 2006. Ruelle’s probability cascades seen as a fragmentation pro-

cess. Markov Process. Related Fields,12(3), 447–474.

BAUKE, HEIKO & MERTENS, STEPHAN. 2004. Universality in the level statistics of disordered

systems. Phys. Rev. E,70, 025102(R).

BEN AROUS, G´

ERARD; DEMBO, AMIR & GUIONNET, ALICE. 2001. Aging of spherical spin

glasses. Probab. Theory Related Fields,120(1), 1–67.

BEN AROUS, G´

ERARD; BOGACHEV, LEONID V. & MOLCHANOV, STANISLAV A. 2005.

Limit theorems for sums of random exponentials. Probab. Theory Related Fields,132(4),

579–612.

BEN AROUS, G´

ERARD; GAYRARD, VERONIQUE & KUPTSOV, ALEXEY. 2006. A new REM

conjecture. Preprint. arXiv:0706.2135v1 [math.PR].

BERTOIN, JEAN. 2006. Random fragmentation and coagulation processes. Cambridge Studies

in Advanced Mathematics, vol. 102. Cambridge: Cambridge University Press.

164 References

BERTOIN, JEAN & LEGALL, JEAN-FRANC¸OIS. 2000. The Bolthausen-Sznitman coalescent

and the genealogy of continuous-state branching processes. Probab. Theory Related Fields,

117(2), 249–266.

BOGACHEV, VLADIMIR I. 1998. Gaussian measures. Mathematical Surveys and Monographs,

vol. 62. Providence, RI: American Mathematical Society.

BOLTHAUSEN, ERWIN. 2007. Random media and spin glasses: an introduction into some

mathematical results and problems. Pages 1–44 of: Spin glasses. Lecture Notes in Math.,

vol. 1900. Berlin: Springer.

BOLTHAUSEN, ERWIN & KISTLER, NICOLA. 2006. On a nonhierarchical version of the gen-

eralized random energy model. Ann. of Appl. Prob.,16, 1.

BOLTHAUSEN, ERWIN & KISTLER, NICOLA. 2008. On a nonhierarchical version of the Gen-

eralized Random Energy Model. II. Ultrametricity. Preprint.

BOLTHAUSEN, ERWIN & SZNITMAN, ALAIN-SOL. 1998. On Ruelle’s probability cascades

and an abstract cavity method. Comm. Math. Phys.,197(2), 247–276.

BOLTHAUSEN, ERWIN & SZNITMAN, ALAIN-SOL. 2002. Ten Lectures on Random Media.

Birkh¨

auser Verlag.

BOLTHAUSEN, ERWIN (ED.) & BOVIER, ANTON (ED.). 2007. Spin glasses. Lecture Notes in

Mathematics 1900. Berlin: Springer. ix, 179 p. .

BORGS, CHRISTIAN; CHAYES, JENNIFER & PITTEL, BORIS. 2001. Phase transition and

finite-size scaling for the integer partitioning problem. Random Structures Algorithms,19(3-

4), 247–288. Analysis of algorithms (Krynica Morska, 2000).

BORGS, CHRISTIAN; CHAYES, JENNIFER; MERTENS, STEPHAN & NAIR, CHANDRA. 2005a.

Proof of the local REM conjecture for number partitioning I: Constant energy scales.

Preprint. arXiv:cond-mat/0501760v2 [cond-mat.dis-nn].

BORGS, CHRISTIAN; CHAYES, JENNIFER; MERTENS, STEPHAN & NAIR, CHANDRA. 2005b.

Proof of the local REM conjecture for number partitioning II: growing energy scales.

Preprint. arXiv:cond-mat/0508600v1 [cond-mat.dis-nn].

BOUCHAUD, JEAN-PHILIPPE & POTTERS, MARC. 2003. Theory of Financial Risk and

Derivative Pricing. Cambridge: Cambridge University Press. From statistical physics to

risk management.

BOVIER, ANTON. 2001. Statistical mechanics of disordered systems. MaPhySto Lecture Notes,

no. 10. Aarhus University.

BOVIER, ANTON. 2006. Statistical mechanics of disordered systems. Cambridge Series in

Statistical and Probabilistic Mathematics. Cambridge: Cambridge University Press. A math-

ematical perspective.

BOVIER, ANTON & KURKOVA, IRINA. 2003a. Gibbs measures of Derrida’s generalized

random energy models and the genealogy of Neveu’s continuous state branchng process.

Preprint.

BOVIER, ANTON & KURKOVA, IRINA. 2003b. Rigorous results on some simple spin glass

models. Markov Process. Related Fields,9(2), 209–242. Inhomogeneous random systems

(Cergy-Pontoise, 2002).

BOVIER, ANTON & KURKOVA, IRINA. 2004a. Derrida’s generalised random energy models.

I. Models with finitely many hierarchies. Ann. Inst. H. Poincar´

e Probab. Statist.,40(4), 439–

480.

References 165

BOVIER, ANTON & KURKOVA, IRINA. 2004b. Derrida’s generalized random energy models.

II. Models with continuous hierarchies. Ann. Inst. H. Poincar´

e Probab. Statist.,40(4), 481–

495.

BOVIER, ANTON & KURKOVA, IRINA. 2004c. Poisson convergence in the restricted k-

partitioning problem. WIAS preprint 964. arXiv:cond-mat/0504366v1 [cond-mat.dis-nn],

to appear in Random Structures & Algorithms.

BOVIER, ANTON & KURKOVA, IRINA. 2006a. Local energy statistics in disordered systems:

a proof of the local REM conjecture. Comm. Math. Phys.,263(2), 513–533.

BOVIER, ANTON & KURKOVA, IRINA. 2006b. A tomography of the GREM: beyond the REM

conjecture. Comm. Math. Phys.,263(2), 535–552.

BOVIER, ANTON & KURKOVA, IRINA. 2007. Much ado about Derrida’s GREM. Pages 81–115

of: Spin glasses. Lecture Notes in Math., vol. 1900. Berlin: Springer.

BOVIER, ANTON; KURKOVA, IRINA & L ¨

OWE, MATTHIAS. 2002. Fluctuations of the free

energy in the REM and the p-spin SK models. Ann. Probab.,30(2), 605–651.

BRIAND, PHILIPPE & HU, YING. 2007. Quadratic BSDEs with convex generators and un-

bounded terminal conditions. Probab. Theory Related Fields, –. arXiv:math/0703423.

CARMONA, PHILIPPE & HU, YUEYUN. 2006. Universality in Sherrington-Kirkpatrick’s spin

glass model. Ann. Inst. H. Poincar´

e Probab. Statist.,42(2), 215–222.

CHALLET, DAMIEN; MARSILI, MATTEO & ZHANG, YI-CHENG. 2005. Minority Games.

Oxford University Press.

CHATTERJEE, SOURAV & PAL, SOUMIK. 2007. A phase transition behavior for Brownian

motions interacting through their ranks. Preprint. arXiv:0706.3558v1 [math.PR].

COMETS, FRANCIS. 1989. Large deviation estimates for a conditional probability distribution.

Applications to random interaction Gibbs measures. Probab. Theory Related Fields,80(3),

407–432.

CONTUCCI, PIERLUIGI & GRAFFI, SANDRO (eds). 2007. How Can Mathematics Contribute

to Social Sciences. Quality and Quantity, vol. 41, no. 4.

CONTUCCI, PIERLUIGI; DEGLI ESPOSTI, MIRKO; GIARDIN `

A, CRISTIAN & GRAFFI, SAN-

DRO. 2003. Thermodynamical limit for correlated Gaussian random energy models. Comm.

Math. Phys.,236(1), 55–63.

COOLEN, A. C. C. 2005. The mathematical theory of minority games. Oxford: Oxford Uni-

versity Press. Statistical mechanics of interacting agents.

CRISANTI, ANDREA & SOMMERS, HANS-J ¨

URGEN. 1992. The spherical p-spin interaction

spin glass model: the statics. Zeitschrift f¨

ur Physik B Condensed Matter,87(3), 341–354.

DALIO, FRANCESCA & LEY, OLIVIER. 2006. Uniqueness Results for Second-Order

Bellman–Isaacs Equations under Quadratic Growth Assumptions and Applications. SIAM

J. Control Optim.,45(1), 74–106.

DALEY, DARYL J. & VERE-JONES, DAVID. 2003. An introduction to the theory of point

processes. Vol. I. Second edn. Probability and its Applications (New York). New York:

Springer-Verlag. Elementary theory and methods.

DEN HOLLANDER, FRANK. 2000. Large deviations. Fields Institute Monographs, vol. 14.

Providence, RI: American Mathematical Society.

DERRIDA, BERNARD. 1980. Random-energy model: limit of a family of disordered models.

Phys. Rev. Lett.,45(2), 79–82.

166 References

DERRIDA, BERNARD. 1981. Random-energy model: An exactly solvable model of disordered

systems. Phys. Rev. B,24(5), 2613–2626.

DERRIDA, BERNARD. 1985. A generalization of the Random Energy Model which includes

correlations between energies. Journal de physique. Lettres,46(9), 401–407.

DERRIDA, BERNARD & GARDNER, ELISABETH. 1986. Magnetic properties and the function

q(x)of the generalised random-energy model. Journal of Physics C: Solid State Physics,

19(29), 5783–5798.

DORLAS, TEUNIS C. & WEDAGEDERA, JANAK R. 2001. Large deviations and the random

energy model. Int. J. Mod. Phys. B,15, 1–15.

DOTSENKO, VIKTOR. 2001. Introduction to the Replica Theory of Disordered Statistical Sys-

tems. Cambridge University Press.

DUDLEY, RICHARD M. 1999. Uniform central limit theorems. Cambridge Studies in Advanced

Mathematics, vol. 63. Cambridge: Cambridge University Press.

DURLAUF, STEVEN N. 1999. How can statistical mechanics contribute to social science? Proc.

Natl. Acad. Sci. USA,96(19), 10582–10584 (electronic).

EDWARDS, S.F. & ANDERSON, P.W. 1975. Theory of spin glasses. Journal of Physics F:

Metal Physics,5(5), 965–974.

EISELE, THEODOR. 1983. On a third-order phase transition. Comm. Math. Phys.,90(1), 125–

159.

ELLIS, RICHARD S. 2006. Entropy, large deviations, and statistical mechanics. Classics in

Mathematics. Berlin: Springer-Verlag. Reprint of the 1985 original.

EVANS, LAWRENCE C. 1998. Partial differential equations. Graduate Studies in Mathematics,

vol. 19. Providence, RI: American Mathematical Society.

FERNIQUE, XAVIER. 1997. Fonctions al´

eatoires gaussiennes, vecteurs al´

eatoires gaussiens.

Montreal, QC: Universit´

e de Montr´

eal Centre de Recherches Math´

ematiques.

FISCHER, K.H. & HERTZ, J.A. 1991. Spin Glasses. Cambridge University Press.

FR¨

OHLICH, J ¨

URG & ZEGARLI ´

NSKI, BOGUSŁAW. 1987. Some comments on the Sherrington-

Kirkpatrick model of spin glasses. Comm. Math. Phys.,112(4), 553–566.

GRIFFITHS, ROBERT B. 1964. A proof that the free energy of a spin system is extensive. J.

Mathematical Phys.,5, 1215–1222.

GUERRA, FRANCESCO. 2003. Broken replica symmetry bounds in the mean field spin glass

model. Comm. Math. Phys.,233(1), 1–12.

GUERRA, FRANCESCO. 2005. Spin Glasses. Preprint. arXiv:cond-mat/0507581v1.

GUERRA, FRANCESCO & TONINELLI, FABIO LUCIO. 2002. The thermodynamic limit in

mean field spin glass models. Comm. Math. Phys.,230(1), 71–79.

GUERRA, FRANCESCO & TONINELLI, FABIO LUCIO. 2003. The infinite volume limit in

generalized mean field disordered models. Markov Process. Related Fields,9(2), 195–207.

HARTMANN, ALEXANDER K. & WEIGT, MARTIN. 2005. Phase transitions in combinatorial

optimization problems. Wiley-VCH Verlag, Weinheim. Basics, algorithms and statistical

mechanics.

JANA, NABIN KUMAR. 2007. Contributions to Random Energy Models. Ph.D. thesis, Indian

Statistical Institute.

JANA, NABIN KUMAR & RAO, B. V. 2006. Large Deviations and Random Energy Models.

Preprint. arXiv:math/0612836v1 [math.PR].

References 167

JOAG-DEV, KUMAR; PERLMAN, MICHAEL D. & PITT, LOREN D. 1983. Association of

normal random variables and Slepian’s inequality. Ann. Probab.,11(2), 451–455.

KAHANE, JEAN-PIERRE. 1986. Une in´

egalit´

e du type de Slepian et Gordon sur les processus

gaussiens. Israel J. Math.,55(1), 109–110.

KALLENBERG, OLAV. 1983. Random measures. Third edn. Berlin: Akademie-Verlag.

KLIMOVSKY, ANTON. 2005. Learning and generalization errors for the 2D binary perceptron.

Math. Comput. Modelling,42(11-12), 1339–1358.

LEADBETTER, M. R.; LINDGREN, GEORG & ROOTZ ´

EN, HOLGER. 1983. Extremes and re-

lated properties of random sequences and processes. Springer Series in Statistics. New York:

Springer-Verlag.

LEDOUX, MICHEL. 2001. The concentration of measure phenomenon. Mathematical Surveys

and Monographs, vol. 89. Providence, RI: American Mathematical Society.

LEDOUX, MICHEL & TALAGRAND, MICHEL. 1991. Probability in Banach spaces. Ergebnisse

der Mathematik und ihrer Grenzgebiete (3) [Results in Mathematics and Related Areas (3)],

vol. 23. Berlin: Springer-Verlag. Isoperimetry and processes.

LIFSHITS, M. A. 1995. Gaussian random functions. Mathematics and its Applications, vol.

322. Dordrecht: Kluwer Academic Publishers.

M´

EZARD, MARC & MONTANARI, ANDREA. 2007. Constraint satisfaction networks in

Physics and Computation. Oxford University Press. To appear.

M´

EZARD, MARC; PARISI, GIORGIO & VIRASORO, MIGUEL ANGEL. 1987. Spin glass theory

and beyond. World Scientific Lecture Notes in Physics, vol. 9. Teaneck, NJ: World Scientific

Publishing Co. Inc.

NEWMAN, CHARLES M. & STEIN, DANIEL L. 2007. Short-range spin glasses: results and

speculations. Pages 159–178 of: Spin glasses. Lecture Notes in Math., vol. 1900. Berlin:

Springer.

NISHIMORI, HIDETOSHI. 2001. Statistical Physics of Spin Glasses and Information Process-

ing: An Introduction. Oxford University Press, USA.

OLIVIERI, ENZO & PICCO, PIERRE. 1984. On the existence of thermodynamics for the random

energy model. Comm. Math. Phys.,96(1), 125–144.

OPPER, MANFRED & SAAD, DAVID (eds). 2001. Advanced mean field methods. Neural

Information Processing Series. Cambridge, MA: MIT Press. Theory and practice, Papers

from the workshop held at Aston University, Birmingham, 1999, A Bradford Book.

PANCHENKO, DMITRY. 2005a. A question about the Parisi functional. Electron. Commun.

Probab.,10, 155–166.

PANCHENKO, DMITRY. 2005b. Free energy in the generalized Sherrington-Kirkpatrick mean

field model. Rev. Math. Phys.,17(7), 793–857.

PANCHENKO, DMITRY & TALAGRAND, MICHEL. 2007a. Guerra’s interpolation using

Derrida-Ruelle cascades. Preprint. arXiv:0708.3641v2 [math.PR].

PANCHENKO, DMITRY & TALAGRAND, MICHEL. 2007b. On the overlap in the multiple spher-

ical SK models. Ann. Probab.,35(6), 2321–2355.

PARISI, GIORGIO. 2007. Mean field theory of spin glasses: statics and dynamics. In: Proceed-

ings of the 2006 Les Houches Summer School.

PARISI, GIORGIO & TALAGRAND, MICHEL. 2004. On the distribution of the overlaps at given

disorder. C. R. Math. Acad. Sci. Paris,339(4), 303–306.

168 References

PASTUR, LEONID & SHCHERBINA, MARIYA. 1991. Absence of self-averaging of the order

parameter in the Sherrington-Kirkpatrick model. J. Statist. Phys.,62(1-2), 1–19.

PITMAN, JIM. 2006. Combinatorial stochastic processes. Lecture Notes in Mathematics, vol.

1875. Berlin: Springer-Verlag. Lectures from the 32nd Summer School on Probability Theory

held in Saint-Flour, July 7–24, 2002, With a foreword by Jean Picard.

PITMAN, JIM & YOR, MARC. 1997. The two-parameter Poisson-Dirichlet distribution derived

from a stable subordinator. Ann. Probab.,25(2), 855–900.

PRESTON, CHRIS. 1976. Random fields. Berlin: Springer-Verlag. Lecture Notes in Mathemat-

ics, Vol. 534.

RESNICK, SIDNEY I. 1987. Extreme values, regular variation, and point processes. Applied

Probability. A Series of the Applied Probability Trust, vol. 4. New York: Springer-Verlag.

RUELLE, DAVID. 1987. A mathematical reformulation of Derrida’s REM and GREM. Comm.

Math. Phys.,108(2), 225–239.

RUZMAIKINA,A.&AIZENMAN, M. 2005. Characterization of invariant measures at the

leading edge for competing particle systems. Ann. Probab.,33(1), 82–113.

SHCHERBINA, MARIYA. 2005. Some mathematical problems of neural networks theory. Pages

425–444 of: European Congress of Mathematics. Eur. Math. Soc., Z¨

urich.

SHCHERBINA, MARIYA & TIROZZI, BRUNELLO. 2002. Generalization and Learning Error

for Nonlinear Perceptron. Mathematical and Computer Modelling,35, 259–271.

SHERRINGTON, DAVID. 2007. Spin glasses: a perspective. Pages 45–62 of: Spin glasses.

Lecture Notes in Math., vol. 1900. Berlin: Springer.

SHERRINGTON, DAVID & KIRKPATRICK, SCOTT. 1975. Solvable Model of a Spin-Glass.

Physical Review Letters,35(26), 1792–1796.

SIMON, BARRY. 1993. The statistical mechanics of lattice gases. Vol. I. Princeton Series in

Physics. Princeton, NJ: Princeton University Press.

SLEPIAN, DAVID. 1962. The one-sided barrier problem for Gaussian noise. Bell System Tech.

J.,41, 463–501.

TALAGRAND, MICHEL. 2000a. Large deviation principles and generalized Sherrington-

Kirkpatrick models. Ann. Fac. Sci. Toulouse Math. (6),9(2), 203–244.

TALAGRAND, MICHEL. 2000b. Rigorous low-temperature results for the mean field p-spins

interaction model. Probab. Theory Related Fields,117(3), 303–360.

TALAGRAND, MICHEL. 2003. Spin glasses: a challenge for mathematicians. Ergebnisse der

Mathematik und ihrer Grenzgebiete. 3. Folge. A Series of Modern Surveys in Mathemat-

ics [Results in Mathematics and Related Areas. 3rd Series. A Series of Modern Surveys in

Mathematics], vol. 46. Berlin: Springer-Verlag. Cavity and mean field models.

TALAGRAND, MICHEL. 2006a. Free energy of the spherical mean field model. Probab. Theory

Related Fields,134(3), 339–382.

TALAGRAND, MICHEL. 2006b. The Parisi formula. Ann. of Math. (2),163(1), 221–263.

TALAGRAND, MICHEL. 2006c. Parisi measures. J. Funct. Anal.,231(2), 269–286.

TALAGRAND, MICHEL. 2007a. Large Deviations, Guerra’s and A.S.S. Schemes, and the Parisi

Hypothesis. Journal of Statistical Physics,126(4), 837–894.

TALAGRAND, MICHEL. 2007b. Mean field models for spin glasses: some obnoxious problems.

Pages 63–80 of: Spin glasses. Lecture Notes in Math., vol. 1900. Berlin: Springer.

TEH, YEE WHYE; JORDAN, MICHAEL I.; BEAL, MATTHEW J. & BLEI, DAVID M. 2006.

Hierarchical Dirichlet processes. J. Amer. Statist. Assoc.,101(476), 1566–1581.

References 169

TOUBOL, ALAIN. 1998. High temperature regime for a multidimensional Sherrington-

Kirkpatrick model of spin glass. Probab. Theory Related Fields,110(4), 497–534.

WINKLER, GERHARD. 2003. Image analysis, random fields and Markov chain Monte Carlo

methods. Second edn. Applications of Mathematics (New York), vol. 27. Berlin: Springer-

Verlag. A mathematical introduction, Stochastic Modelling and Applied Probability.

ZEGARLI ´

NSKI, BOGUSŁAW. 1998. Random spin systems with long-range interactions. Pages

289–320 of: Mathematical aspects of spin glasses and neural networks. Progr. Probab., vol.

41. Boston, MA: Birkh¨

auser Boston.