scieee Science in your language
[en] (orig)
Energy-Based Model Reduction of
Transport-Dominated Phenomena
vorgelegt von
M. Sc.
Philipp Schulze
ORCID: 0000-0002-7299-4628
von der Fakultät II - Mathematik und Naturwissenschaften
der Technischen Universität Berlin
zur Erlangung des akademischen Grades
Doktor der Naturwissenschaften
- Dr. rer. nat. -
genehmigte Dissertation
Promotionsausschuss:
Vorsitzender: Prof. Dr. Wilhelm Stannat
Gutachter: Prof. Dr. Volker Mehrmann
Gutachter: Prof. Dr. Matthias Heinkenschloss
Gutachter: Prof. Dr. Benjamin Peherstorfer
Tag der wissenschaftlichen Aussprache: 17. April 2023
Berlin 2023
ii
Abstract
Transport-dominated systems are characterized by the propagation of waves
and occur in many applications such as aerodynamics and chemical engineer-
ing. To predict the dynamics of such systems, mathematical models should
ideally be fast to evaluate and at the same time sufficiently accurate. One
possibility for deriving such models is to start with a complex and accurate
full-order model (FOM) and use model order reduction (MOR) techniques to
obtain a corresponding reduced-order model (ROM). Classical MOR methods
are based on approximating the FOM state by a linear combination of ansatz
functions or modes, but such approaches are often inadequate in the context of
transport-dominated systems. This is one of the reasons why there has been an
increasing research effort in the past years to develop MOR techniques which
are based on nonlinear approximation ansatzes.
As the field of nonlinear MOR is relatively new, there are still many open
research questions to be addressed. These include for instance suitable choices
for the approximation ansatz as well as appropriate ways for the construction
of corresponding ROMs. Furthermore, nonlinear MOR approaches typically
lead to ROMs whose evaluation scales with the dimension of the FOM and thus
may be too expensive. In fact, similar issues may also occur in the context
of linear MOR approaches and, therefore, one uses so-called hyperreduction
techniques to obtain fast ROMs. However, classical hyperreduction methods
suffer from similar difficulties as classical MOR schemes when being applied to
transport-dominated systems. Another challenge is to develop nonlinear MOR
techniques which preserve important system properties such as stability.
In this thesis, we present a new nonlinear model reduction framework which
is based on approximating the state of the FOM by a linear combination of
transformed modes. The transformations may be, e.g., achieved by shift opera-
tors and are parametrized by so-called paths or shift amounts, which constitute
a part of the ROM state. The resulting class of ansatzes is well-suited for ob-
taining low-dimensional and accurate approximations of transport-dominated
systems. For the determination of the modes, we present an optimization ap-
proach based on given snapshot data of the FOM state. Furthermore, the
construction of the ROM is carried out via a residual minimization approach
and we also suggest a new hyperreduction framework to ensure that the ROM
can be efficiently evaluated. In addition, we demonstrate how to preserve sta-
bility via an energy-based formulation using the framework of so-called port-
Hamiltonian systems. Finally, we illustrate the new methodology by means of
numerical experiments for some transport-dominated test cases.
iii
Zusammenfassung
Transportdominierte Systeme sind durch die Ausbreitung von Wellen charak-
terisiert und kommen in vielen Anwendungen vor, z. B. in der Aerodynamik
und Verfahrenstechnik. Um die Dynamik solcher Systeme vorherzusagen, soll-
ten mathematische Modelle schnell auswertbar und dabei hinreichend genau
sein. Hierzu kann man z. B. ausgehend von einem sehr genauen Originalmod-
ell (FOM) mit Verfahren der Modellreduktion (MOR) ein reduziertes Modell
(ROM) herleiten. Klassische MOR-Methoden basieren auf der Approxima-
tion des FOM-Zustandes durch eine Linearkombination von Ansatzfunktionen
bzw. Moden. Solche Ansätze sind jedoch bei transportdominierten Systemen
oft unzureichend. Unter anderem deshalb wird seit ein paar Jahren vermehrt
an MOR-Verfahren geforscht, die auf nichtlinearen Ansätzen basieren.
Da das Gebiet der nichtlinearen Modellreduktion relativ neu ist, gibt es
noch viele offene Forschungsfragen. Diese betreffen z.B. eine adäquate Wahl
des MOR-Ansatzes sowie geeignete Methoden für die Erstellung von ROMs
basierend auf einem konkreten Ansatz. Ferner führen nichtlineare Ansätze
häufig zu ROMs, deren Auswertung mit der Dimension des FOMs skaliert
und dadurch zu aufwendig sein kann. Ähnliche Probleme können auch bei
linearen MOR-Ansätzen auftreten und daher werden sogenannte Hyperreduk-
tionsverfahren verwendet, um effizient auswertbare ROMs zu erhalten. Klas-
sische Hyperreduktionsmethoden sind jedoch bei der Anwendung auf trans-
portdominierte Systeme von ähnlichen Schwierigkeiten betroffen wie klassische
MOR-Verfahren. Eine weitere Herausforderung ist die Entwicklung nichtlin-
earer MOR-Methoden, welche Eigenschaften wie Stabilität erhalten.
In der vorliegenden Arbeit stellen wir eine nichtlineare MOR-Methode vor,
die auf der Approximation des FOM-Zustandes durch eine Linearkombination
von transformierten Moden basiert. Die Transformationen können z. B. durch
Translationen realisiert werden und sind durch sogenannte Pfade bzw. Transla-
tionsstrecken parametrisiert, die einen Teil des ROM-Zustandes bilden. Solche
Ansätze sind gut geeignet, um niedrigdimensionale und genaue Approxima-
tionen transportdominierter Systeme zu erhalten. Für die Modenbestimmung
stellen wir einen Optimierungsansatz vor, der auf Daten des FOM-Zustandes
basiert. Zudem erstellen wir die ROMs durch Residuumsminimierung und
gewährleisten eine effiziente Auswertung durch ein neues Hyperreduktionsver-
fahren. Des Weiteren zeigen wir, wie Stabilität durch eine energiebasierte For-
mulierung erhalten werden kann, indem wir eine sogenannte port-Hamiltonsche
Darstellung verwenden. Schließlich veranschaulichen wir die neue Methodik
anhand numerischer Experimente für transportdominierte Anwendungsfälle.
v
Acknowledgements
First, I would like to thank Volker Mehrmann for his supervision, proofreading,
valuable feedback, and continuous support over the years. I am also grateful to
Matthias Heinkenschloss and Benjamin Peherstorfer for agreeing to examine
this thesis. In addition, I acknowledge funding by the Deutsche Forschungs-
gemeinschaft via the Collaborative Research Centers 1029 and Transregio 154
as well as by the Berlin Mathematical School. Besides, I am grateful to my
colleagues and friends at TU Berlin for the good working atmosphere, mem-
orable group trips, and joyful summer and Christmas parties. Furthermore,
I thank all my co-authors for the pleasant and fruitful collaboration and in
addition Amelie Binder, Felix Black, Philipp Krah, Riccardo Morandin, Julius
Reiss, and Christoph Zimmer for the inspiring discussions and valuable hints
regarding the subjects of this thesis. Special thanks go to Benjamin Unger for
teaming up with me in many research activities and conferences as well as in
non-mathematical disciplines like the Christmas office chair races. Moreover,
I owe many thanks to Robert Altmann for the nice atmosphere in the office
as well as for his careful proofreading of this thesis and the precious feedback.
Finally, I thank Marine and my family for their unconditional and permanent
support.
vii
Contents
List of Figures xi
List of Algorithms xiii
List of Acronyms xv
1. Introduction 1
1.1. ProblemSetting........................... 3
1.2. Motivation.............................. 6
1.2.1. Transport-Dominated Systems . . . . . . . . . . . . . . 6
1.2.2. Port-Hamiltonian Systems . . . . . . . . . . . . . . . . . 14
1.3. Review of Existing Approaches . . . . . . . . . . . . . . . . . . 15
1.3.1. Model Reduction Techniques for Transport-Dominated
Systems ........................... 15
1.3.2. Structure-Preserving Model Reduction for Port-Hamiltonian
Systems ........................... 24
1.4. Contributions, Outline, and Previously Published Results . . . . 31
2. Preliminaries 37
2.1. Notation............................... 37
2.2. Nonlinear Optimization . . . . . . . . . . . . . . . . . . . . . . . 39
2.3. Functional Analysis . . . . . . . . . . . . . . . . . . . . . . . . 40
2.4. Differential Equations . . . . . . . . . . . . . . . . . . . . . . . 46
2.4.1. Abstract Evolution Equations and Semigroups . . . . . . 46
2.4.2. Finite-Dimensional Systems of Differential Equations . . 48
2.5. Parametric Model Order Reduction . . . . . . . . . . . . . . . . 53
2.5.1. Proper Orthogonal Decomposition . . . . . . . . . . . . 55
2.5.2. POD-Greedy Algorithm . . . . . . . . . . . . . . . . . . 57
2.5.3. Galerkin Projection . . . . . . . . . . . . . . . . . . . . 59
2.5.4. Hyperreduction . . . . . . . . . . . . . . . . . . . . . . . 61
2.6. Port-Hamiltonian Systems . . . . . . . . . . . . . . . . . . . . . 63
2.6.1. Formulations and Basic Properties . . . . . . . . . . . . 63
2.6.2. Structure-Preserving Model Order Reduction . . . . . . . 67
3. Mode Identification 69
3.1. Residual Minimization . . . . . . . . . . . . . . . . . . . . . . . 70
3.1.1. Solving the Full Optimization Problem . . . . . . . . . . 76
3.1.2. Using Variable Projection . . . . . . . . . . . . . . . . . 77
ix
Contents
3.2. Greedy Algorithm based on Transformed Modes . . . . . . . . . 86
3.3. Boundary Treatment . . . . . . . . . . . . . . . . . . . . . . . . 92
3.3.1. Extended Domain . . . . . . . . . . . . . . . . . . . . . . 93
3.3.2. ZeroPadding ........................ 97
3.3.3. Constant Extrapolation . . . . . . . . . . . . . . . . . . 98
3.4. Comparison with Other Approaches . . . . . . . . . . . . . . . . 100
4. Projection-Based Model Order Reduction 105
4.1. Continuously Optimal Reduced-Order Models . . . . . . . . . . 105
4.2. Relation with Symmetry Reduction . . . . . . . . . . . . . . . . 118
4.3. Hyperreduction ...........................122
4.3.1. Case of a Linear Full-Order Model . . . . . . . . . . . . 124
4.3.2. Case of a Nonlinear Full-Order Model . . . . . . . . . . 131
5. Structure-Preserving Model Reduction for Port-Hamiltonian Sys-
tems 141
5.1. Linear Approximation Ansatz . . . . . . . . . . . . . . . . . . . 144
5.2. Nonlinear Separable Approximation Ansatz . . . . . . . . . . . 155
5.3. Nonlinear Factorizable Approximation Ansatz . . . . . . . . . . 167
6. Numerical Examples 175
6.1. Linear Wave Equation . . . . . . . . . . . . . . . . . . . . . . . 176
6.2. Linear Advection–Diffusion Equation . . . . . . . . . . . . . . . 185
6.3. Nonlinear Reaction–Diffusion Equation . . . . . . . . . . . . . . 196
7. Conclusion 209
7.1. Summary ..............................209
7.2. Outlook ...............................211
A. Properties of the Periodic Shift Operator 215
B. Linear Time-Varying Port-Hamiltonian Systems 221
C. Discrete Gradient Schemes 227
C.1. Discrete Gradients for Hamiltonian Systems . . . . . . . . . . . 227
C.2. Discrete Gradients for a Special Class of Port-Hamiltonian Sys-
tems .................................229
D. Technical Details for Chapter 6 237
D.1. Discretized Shift Operators . . . . . . . . . . . . . . . . . . . . . 237
D.1.1. Periodic Shift Operator . . . . . . . . . . . . . . . . . . . 237
D.1.2. Shift Operator used in Section 6.2 . . . . . . . . . . . . . 239
D.1.3. Shift Operator used in Section 6.3 . . . . . . . . . . . . . 242
D.2. Approximation of the Integral in (6.19) . . . . . . . . . . . . . . 245
Bibliography 247
x
List of Figures
1.1. Example 1.2.1: pseudocolor plot of the analytical solution (left)
and some selected snapshots (right). . . . . . . . . . . . . . . . . 9
1.2. An exemplary mode (left) and its shifted analogue (right) using
a periodic shift operator. . . . . . . . . . . . . . . . . . . . . . . 10
1.3. Pulsed detonation combustion process. . . . . . . . . . . . . . . 13
1.4. Pseudocolor plot of the density ρfor the pulsed detonation com-
bustion (PDC) process based on simulation data from [122]. . . 14
3.1. Example 3.3.1: pseudocolor plots of the analytical solution (left)
and a corresponding approximation with one transformed mode
based on the family of periodic shift operators (right). . . . . . . 93
3.2. An exemplary mode (left) and its shifted analogue (right), where
the gray area indicates a region of undetermined values. . . . . . 94
3.3. An exemplary mode defined on the extended domain e=R
(here depicted on the domain b
e= (0.25,1), left) and its
shifted analogue on = (0,1) (right)................ 96
3.4. An exemplary mode (left) and its shifted analogue (right) using
a zero padding shift operator. . . . . . . . . . . . . . . . . . . . 98
3.5. An exemplary mode (left) and its shifted analogue (right) using
a constant extrapolation shift operator. . . . . . . . . . . . . . . 99
4.1. Example 4.3.8: initial value for the density. . . . . . . . . . . . . 129
4.2. Example 4.3.8: pseudocolor plots of the analytical solution for
the density (left) and the velocity (right). . . . . . . . . . . . . . 130
4.3. Example 4.3.12: pseudocolor plots of the traveling wave solu-
tion xof the Burgers’ equation (left) and of the corresponding
nonlinear term x∂ξx(right). ...................133
4.4. Example 4.3.12: singular value decays of the snapshot matrices
depicted in Figure 4.3. . . . . . . . . . . . . . . . . . . . . . . . 133
6.1. Linear wave equation: pseudocolor plots of the FOM solution
for the density (left) and the velocity (right). . . . . . . . . . . . 179
6.2. Linear wave equation: some selected snapshots of the FOM so-
lution for the density (left) and the velocity (right). . . . . . . . 180
6.3. Linear wave equation: singular value decay of the snapshot matrix.180
6.4. Linear wave equation: density (left) and velocity (right) com-
ponents of the determined modes. . . . . . . . . . . . . . . . . . 182
xi
List of Figures
6.5. Linear wave equation: online values of the amplitudes (left) and
thepaths(right). ..........................185
6.6. Linear wave equation: Comparison of the error in conserva-
tion of the ROM Hamiltonian using the implicit midpoint rule
and the midpoint discrete gradient pair method outlined in ap-
pendixC.2. .............................186
6.7. Linear advection–diffusion equation: pseudocolor plot of the
FOM solution (left) and some selected snapshots (right). . . . . 188
6.8. Linear advection–diffusion equation: Comparison of the discrete
time derivative of the ROM Hamiltonian and the corresponding
dissipation and supplied power with time step size t= 0.08
when using the implicit midpoint rule and the midpoint discrete
gradient pair approach from appendix C.2. The inset highlights
an energy inconsistency of the implicit midpoint rule, where the
discrete time derivative of the Hamiltonian is positive despite a
vanishing power supply. Here, t= 0.28 corresponds to the first
midpoint where the input becomes permanently zero. . . . . . 193
6.9. Linear advection–diffusion equation: Convergence of the im-
plicit midpoint rule and the midpoint discrete gradient pair
method from appendix C.2. . . . . . . . . . . . . . . . . . . . . 194
6.10. Linear advection–diffusion equation: pseudocolor plots of the
FOM solution for d= 103.5(left) and d= 102.5(right). . . . . 195
6.11. Linear advection–diffusion equation: ROM accuracy for differ-
ent values of the diffusion coefficient dwhen using a ROM with
r= 3 transformed modes based on FOM snapshots with d= 103.195
6.12. Linear advection–diffusion equation: relative online error for dif-
ferent mode numbers and values of dwhen using the greedy al-
gorithm from section 3.2 for determining the modes. The black
circles highlight the respective worst-case parameter values. . . . 197
6.13. Nonlinear reaction–diffusion equation: pseudocolor plot of the
FOM solution (left) and some selected snapshots (right). . . . . 201
6.14. Nonlinear reaction–diffusion equation: pseudocolor plot of the
FOM nonlinearity (left) and some selected snapshots (right). . . 205
xii
List of Algorithms
2.1. POD-greedy algorithm . . . . . . . . . . . . . . . . . . . . . . . 58
3.1. Evaluation of the cost function and its gradient for (3.9) . . . . 78
3.2. Evaluation of the cost function (3.22) and its gradient . . . . . . 85
3.3. Greedy algorithm based on transformed modes . . . . . . . . . . 86
xiii
List of Acronyms
ANN artificial neural network
DAE differential–algebraic equation
DDT deflagration-to-detonation transition
DEIM discrete empirical interpolation method
EIM empirical interpolation method
FEM finite element method
FOM full-order model
IRKA iterative rational Krylov algorithm
LQG linear quadratic Gaussian
MFEM moving finite element method
MOR model order reduction
ODE ordinary differential equation
PDC pulsed detonation combustion
PDE partial differential equation
pH port-Hamiltonian
POD proper orthogonal decomposition
PSD proper symplectic decomposition
ROM reduced-order model
SOBMOR structured optimization-based model order reduction
SVD singular value decomposition
xv
1. Introduction
Nowadays, in many applications design decisions are made based on numerical
simulations. The increasing demand for accuracy and complexity results in
very high-dimensional systems that need to be simulated. This, however, is in
conflict with multi-query applications like control, optimization, or uncertainty
quantification. The goal of model order reduction (MOR) techniques is to
replace the high-dimensional full-order model (FOM) by a low-dimensional
surrogate model, which is usually called a reduced-order model (ROM). The
main requirement for such a ROM is that it needs to capture the relevant part
of the system accurately enough for a reasonable solution of the task at hand,
while at the same time the computational complexity in evaluating the ROM
should be much smaller in comparison to an evaluation of the FOM. In the
past decades, model reduction methods have experienced an immense research
effort and have been employed in various applications. For an overview, we
refer to the book chapters, books, and survey papers [14, 16, 23, 29, 30, 31,
142, 144, 235, 255].
In short, model reduction aims at constructing a surrogate model which can
be evaluated much faster than the full-order model and which yields a good
approximation of the solution of the problem at hand for a desired range of
parameter or input settings. Usually, the process of constructing the reduced
model is referred to as the offline phase, while the repeated evaluation of the
reduced model, for instance for control or optimization purposes, is called the
online phase. An essential requirement is that the computation time in the
online phase is as small as possible, whereas the computational effort needed
for the offline phase should be at least significantly smaller than the solution
of the optimization, simulation, or control task using the full model. While
this aspect rather concerns the efficiency of the model reduction process, there
are additional goals regarding the approximation quality of the ROM. In the
following we summarize the most relevant requirements for ROMs in terms of
both efficiency and approximation quality:
(i) The approximation error with respect to the quantities of interest should
be smaller than a prescribed tolerance in some specified norm.
(ii) The required time for evaluating the ROM in the online phase should be
significantly smaller than the time needed for solving the corresponding
FOM.
(iii) When considering optimization tasks, the computation time needed for
1
1. Introduction
the offline phase should be considerably smaller than solving the corre-
sponding full-order problem.
(iv) The availability of cheap-to-compute, reliable, and sharp error estimators
is desirable, e.g., for supporting the fulfillment of (i) or for facilitating
techniques which allow to adaptively build the ROM.
(v) In some applications it may be necessary that the reduced model reflects
the physics in the sense that certain properties of the original model
such as conservation laws are preserved. Similarly, if the full model is
given by an interconnected system of several submodels, then it is usually
important that the reduced model maintains this modular structure as
well.
In this thesis, we develop new model reduction techniques with special em-
phasis on two different aspects. First, we aim for MOR methods which are
suitable for the effective reduction of transport-dominated systems, i.e., sys-
tems whose dynamics is dominated by the propagation of waves or similar
structures. Especially, in cases where the corresponding wave profiles exhibit
sharp fronts, classical MOR techniques are usually observed to fail in provid-
ing low-dimensional and accurate approximations. Consequently, we consider
a new model reduction framework which overcomes such difficulties by explic-
itly accounting for the wave propagation in the model reduction ansatz.
The second central aspect of this thesis is the preservation of certain sys-
tem properties such as stability and passivity. Passivity corresponds, roughly
speaking, to the inability of the system to internally generate energy out of
nowhere. In this context, we consider port-Hamiltonian (pH) systems which
generalize classical Hamiltonian systems such that they are not only useful for
describing conservative systems, but they also account for energy dissipation
and energy exchange with the environment. Such systems can be shown to
be inherently passive and in many cases even stable. Thus, this structure is
especially well-suited for preserving such system properties when performing
model reduction. More precisely, there are model reduction methods which
preserve the pH structure and, thus, also the associated properties encoded
in this structure. In this thesis, we are especially interested in investigating
how the new model reduction scheme mentioned in the last paragraph can be
applied in a way which allows to preserve the pH structure.
The remainder of the introductory chapter is structured as follows. The
subsequent section provides a brief introduction to the mathematical problem
setting which is considered in most parts of this thesis. In section 1.2, we pro-
vide a motivation for considering transport-dominated systems and structure-
preserving methods for pH systems. A literature review on these two research
areas is outlined in section 1.3, whereas in section 1.4 we list the main contri-
butions of this thesis and provide a brief overview on the subsequent chapters.
2
1.1. Problem Setting
1.1. Problem Setting
The FOMs we consider in this thesis are of the general form
˙x(t) = F(t, x(t)) for all tI:= [t0, tend], x(t0) = x0,(1.1)
with state x:IW, initial time t0R0, final time tend R>t0, initial value
x0W, right-hand side operator F:R0×WX, a real Hilbert space
X, and a subspace WX. Classical model reduction schemes are based on
first determining a suitable r-dimensional subspace of Wfor approximating
the state x, where ris much smaller than the dimension of W. This subspace
is usually represented by a corresponding basis of so-called ansatz functions or
modes φ1, . . . , φrW. The corresponding approximation ansatz then reads
x(t)ˆx(t):=
r
X
i=1
αi(t)φifor all tI,(1.2)
where α1, . . . , αr:IRare the so-called coefficients or amplitudes corre-
sponding to the respective modes φ1, . . . , φr. A common way for determin-
ing suitable ansatz functions is the proper orthogonal decomposition (POD)
method which yields an optimal approximation of given snapshot data in the
sense that the error in (1.2) is minimized, cf. section 2.5.1. Once the modes
have been determined, a usual way of constructing a corresponding ROM is a
Galerkin projection of (1.1) onto the subspace spanned by the modes, cf. sec-
tion 2.5.3. The state of the resulting reduced-order model corresponds to the
amplitudes αiin the approximation ansatz (1.2). Thus, since the latter one is
linear in the reduced-order state, (1.2) is called a linear approximation ansatz.
As mentioned in the introduction of this chapter, the power of linear ap-
proximation ansatzes as in (1.2) is often observed to be rather limited when
facing FOM dynamics which involve the propagation of waves with sharp wave
fronts. This shortcoming of linear ansatzes is subject to the discussion in sec-
tion 1.2.1. Roughly speaking, the major reason is that (1.2) allows the modes
only to change their amplitude as time evolves, but the modes themselves are
fixed and cannot follow the propagation of the wave profiles. To overcome
this issue, we add a time-dependent coordinate transformation to the approx-
imation ansatz which allows to explicitly account for the propagation. More
precisely, we extend (1.2) and consider the approximation ansatz
x(t)ˆx(t):=
r
X
i=1
αi(t)Ti(pi(t)) φifor all tI(1.3)
with suitable families of transformation operators Ti:b
Pi L(V,X), so-called
paths pi:Ib
Piwhich parametrize the transformation, finite-dimensional real
Banach spaces b
Pi, modes φiVfor i= 1, . . . , r, and a real Banach space V.
Here, L(V,X)denotes the space of linear and bounded operators from Vto
3
1. Introduction
X, cf. section 2.3. In all examples of this thesis, the considered transformation
operators are given by translation or shift operators and the paths are real-
valued functions determining by which amount the modes are shifted. In
contrast to the classical ansatz (1.2), in (1.3) the modes are allowed to be in
a general Banach space V, which does not need to be a subspace of X. This
flexibility allows us, for instance, to define the modes on an extended spatial
domain as discussed in section 3.3.1.
To ensure that the approximation ˆxlies within the domain of the right-hand
side operator Fof the FOM, it is sufficient to require that the modes satisfy
φi\
ηb
Pi
(Ti(η))1(W)
for i= 1, . . . , r. For the remainder of this thesis, we restrict ourselves for
simplicity to the case that only one transformation family T1=. . . =Tr=:T
is used and that the corresponding path space b
P1=. . . =b
Pr=:b
Pis given by
b
P=R. These restrictions lead to the more special ansatz
x(t)ˆx(t):=
r
X
i=1
αi(t)T(pi(t)) φifor all tI.(1.4)
Here, the modes are required to satisfy
φ1, . . . , φrY:=\
ηR
(T(η))1(W).(1.5)
A different setting allowing for higher-dimensional path spaces and for the
simultaneous use of different types of transformation operators is discussed in
[37], where some of the topics of this thesis have been already addressed. In
particular, the nonlinear projection framework outlined in section 4.1 has been
presented first in [37], however, in a slightly different setting with W-invariant
transformation operators Ti(η) L(X):=L(X,X)for ηb
Pi,i= 1, . . . , r.
A more extensive discussion of results, which are presented in this thesis and
have been published before, is provided in section 1.4.
While model reduction based on a linear approximation ansatz of the form
(1.2) has been studied for decades and is fairly well-understood, the use of the
new ansatz (1.4) poses new questions to be addressed, for instance:
1. What are suitable families of transformation operators T?
2. For a given transformation family Tand for given snapshot data of the
FOM state, how can we determine rand αi,φi,pifor i= 1, . . . , r such
that the approximation error in (1.4) is small?
3. For a given transformation family Tand for given modes φ1, . . . , φr,
how can we construct a ROM for determining the time evolution of the
4
1.1. Problem Setting
amplitudes α1, . . . , αrand the paths p1, . . . , pr, while at the same time
ensuring that the ROM can be evaluated in an efficient way?
4. If the FOM possesses desirable properties like stability, does an approxi-
mation ansatz of the form (1.4) allow for a model reduction scheme which
preserves these properties?
All of these questions are at least partially addressed in this thesis. While a
general answer to the first question of suitable choices of transformation opera-
tors is out of the scope of this thesis, we introduce several families of translation
operators and illustrate their usefulness via various examples, cf. sections 1.2.1
and 3.3 and chapter 6. The second question of approximating given snap-
shot data via an ansatz of the form (1.4) is addressed in chapter 3. Here
we especially focus on determining the modes and the amplitudes, whereas
ris assumed to be given and the paths may be for instance determined in
a pre-processing step. The ROM construction, which is subject of the third
question, is discussed in detail in chapter 4 and, especially, the question of how
to obtain a ROM which may be evaluated in an efficient way is addressed in
section 4.3. Finally, the fourth and last question is so general that it cannot
be completely answered in this thesis. Instead, we focus on preserving port-
Hamiltonian structures in chapter 5 and demonstrate that this often also leads
to stable ROMs.
Remark 1.1.1 (Parameter dependency).In many situations, the right-hand
side and the initial value in (1.1) may depend additionally on a parameter
vector µMRnpand, thus, so does the solution x, i.e., the FOM takes
the form
˙x(t;µ) = F(t, x(t;µ); µ)for all (t, µ)I×M,
x(t0;µ) = x0(µ)for all µM(1.6)
with F:R0×W×MX,x0:MW, and x:I×MW. In this
case, we consider instead of (1.4) the slightly modified ansatz
x(t;µ)ˆx(t;µ):=
r
X
i=1
αi(t;µ)T(pi(t;µ)) φifor all (t, µ)I×M,(1.7)
i.e., the amplitudes αiand the paths pimay also depend on the parameter
vector µ. On the other hand, the modes φiare assumed to be chosen such that
they are suitable for approximating the solution xover the complete parameter
domain M. Alternatively, one could also determine modes separately for dif-
ferent parameter values and use interpolation afterwards for constructing the
reduced-order model, see for instance [30, sec. 4] for a detailed comparison of
these two different approaches in the context of finite-dimensional linear time-
invariant control systems. In the remainder of this thesis, we usually drop the
parameter dependency for simplicity, but we discuss in several remarks how to
incorporate parameters, cf. Remarks 3.1.4, 4.1.7, 4.3.10, and 4.3.14. ¨
5
1. Introduction
Remark 1.1.2 (Ansatz based on clustered modes).In practice, it is often use-
ful to not have one transformation operator per mode, but instead to allow
for groups or clusters of modes to be transformed by the same operator. An
example, where this may be helpful, is when the dynamics exhibit multiple
traveling waves whose profiles change due to diffusion as time evolves. In such
a scenario, time-dependent translation operators may be used for describing
the propagation of the waves, whereas the diffusion can be taken into account
by considering a time-dependent linear combination of multiple modes for each
traveling wave. Such a clustering approach may be represented by an approx-
imation ansatz of the form
x(·)ˆx(·):=
nt
X
i=1 T(pi(·))
ri
X
j=1
αi,j(·)φi,j,(1.8)
where the total number of modes is r:=Pnt
i=1 riand ntNdenotes the
number of transformations. We emphasize that the mentioned example of
traveling and diffusing waves may be likewise described by the approximation
ansatz (1.4). However, the ansatz (1.8) yields more robustness for the ROM
since the modes corresponding to one traveling wave are forced to share the
same path, see for instance [37, sec. 7.1] for a numerical comparison of the
ansatzes (1.4) and (1.8). Moreover, (1.8) is often also advantageous for the
theory in terms of solvability of the reduced-order model, cf. section 4.1 or [37,
Rem. 5.8]. ¨
1.2. Motivation
In this section, we first provide a motivation for considering special MOR
approaches for transport-dominated systems in section 1.2.1. Afterwards, in
section 1.2.2 we mention several reasons for preserving a port-Hamiltonian
structure when performing model reduction.
1.2.1. Transport-Dominated Systems
In this thesis, we use the term transport-dominated for dynamical systems
whose solutions are mainly characterized by the propagation of one or several
waves or similar structures through the physical domain. Such phenomena
may be observed, for instance, in the context of hyperbolic conservation laws
like the wave equation or in combustion processes with traveling shock waves
and reaction fronts.
By means of a simple example, we illustrate why classical model reduction
methods which are based on linear subspace approximations are often ineffec-
tive when applied to transport-dominated systems.
Example 1.2.1 (Linear advection equation with periodic boundary condi-
tions).We consider the linear advection equation with periodic boundary con-
6
1.2. Motivation
ditions
tx(t, ξ) = c∂ξx(t, ξ),for all (t, ξ)I×,
x(0, ξ) = x0(ξ),for all ξ,
x(t, a) = x(t, b),for all tI,
(1.9a)
(1.9b)
(1.9c)
on a one-dimensional domain = (a, b)with aR,bR>a and time interval
I= [0, tend]with tend R>0. Furthermore, cRdenotes the advection speed.
As initial value we consider x0:Rdefined via
x0(ξ) =
4
3(ξa)37
2(ξa)2+2
(ξa)+1,if ξ[a, a +],
1
2(ξb)2+2
(ξb)+1,if ξ[b, b],
0,otherwise,
(1.10)
with small parameter (0,ba
2). This initial value is zero everywhere except
for regions of length adjacent to the boundaries. Moreover, the initial con-
dition is constructed such that it is continuously differentiable and that the
values of x0and of its first derivative coincide at the boundaries, i.e.,
x0(a) = x0(b) = 1 and x0
0(a) = x0
0(b) = 2
.
The analytical solution of (1.9) is given by x(t, ξ) = ˇx0(ξct)for all (t, ξ)
I×, where ˇx0:RRdenotes the (ba)-periodic continuation of the
initial value x0. This solution is unique, which follows from the fact that all
continuously differentiable solutions of the linear advection equation (1.9a)
have to be constant along lines which are parallel to {(t, ξ)R2|ξ=ct}.
These lines are usually referred to as the characteristic ground curves, cf. [198,
sec. 1.2.1]. For c= 1, we consider fully discrete snapshots of the analytical
solution stored in the snapshot matrix XRn,q, whose entries are given by
[X]i,j =x(tj, ξi)(1.11)
with discrete points in space ξi= (i1)∆ξand discrete points in time tj=
(j1)∆tfor i= 1, . . . , n and j= 1, . . . , q. Here, we choose n=qN>1, the
mesh width ξ= (ba)/n > , and the time step size t= ξ. Thus, the
solution is considered in a time horizon where the advected profile is in the
end just one grid point apart from its initial position.
Next, we consider the approximation of the solution by means of a linear
approximation ansatz of the form (1.2), which corresponds (in the fully discrete
case) to a low rank factorization of the snapshot matrix X. For the considered
example, the special choice of the initial value (1.10), the transport behavior
of the advection equation, and the relatively coarse sampling of the analytical
solution cause the snapshot matrix to be equal to the n×nidentity matrix.
Consequently, the singular values of Xdo not decay at all and, hence, there
7
1. Introduction
exists no low-rank factorization of Xwith a small approximation error. For
similar examples we refer to [1, 242]. We emphasize that the singular values
would decay if the spatial and the temporal sampling resolution was increased,
but nevertheless the singular value decay would be very slow, in particular for
small values of .
Considering the singular value decay of a snapshot matrix is one possibility
for judging how well-suited a linear approximation ansatz of the form (1.2)
is for a certain example. However, there are also other ways for seeing that
the considered advection equation problem is challenging for linear approxi-
mations. To this end, we set a= 0,b= 1,n= 100,= 0.005 and depict
the corresponding snapshots in a pseudocolor plot in Figure 1.1, left. Here,
the snapshot matrix contains values of the analytical solution within the range
[0,0.99] ×[0,0.99] I×, cf. (1.11). In accordance with the considerations
of the last paragraph, the traveling wave corresponds to a diagonal line in the
space time diagram. On the contrary, the classical linear approximation ansatz
(1.2) is based on a sum of dyadic products, which, roughly speaking, is well-
suited for approximating horizontal and vertical structures in the space time
diagram, but not for approximating diagonal structures as in Figure 1.1, left.
Furthermore, Figure 1.1, right, depicts some selected snapshots of the solution.
If we were aiming for approximating the snapshots within a low-dimensional
subspace, we would ideally desire them to be at least almost linearly depen-
dent, i.e., that the angles between them are as small as possible. However,
the depicted snapshots in Figure 1.1, right, have compact non-overlapping
supports and, thus, the snapshots are pairwise orthogonal. Thus, the angles
between them are maximal and this is the worst possible situation in terms
of approximability by a subspace whose dimension is smaller than the num-
ber of snapshots. Of course, if the temporal resolution were increased and we
would add more and more snapshots, then there would be also overlapping
snapshots, which are not orthogonal to each other. Nevertheless, even in that
case most snapshot pairs would be orthogonal to each other, which makes the
set of snapshots hard to approximate within a low-dimensional subspace.
Finally, let us mention that the approximability of a solution manifold by
means of a linear approximation ansatz may be also studied by considering
the so-called Kolmogorov n-widths. These are measures for the worst-case ap-
proximation error obtained by the best linear subspace of a certain dimension,
cf. [72, 167, 274]. Thus, if the Kolmogorov n-widths decay slowly, we can in
general not expect to obtain accurate approximations by a linear approxima-
tion ansatz of the form (1.2) with just a few ansatz functions. In [220, sec. 5.1]
the authors demonstrate for a linear advection test case with jump disconti-
nuity that the Kolmogorov n-widths decay slowly and they conclude that any
MOR scheme which is based on a linear approximation ansatz is condemned
to failure. l
We emphasize that Example 1.2.1 is an academic example which has been
constructed to illustrate the limitations of linear approximation ansatzes and
8
1.2. Motivation
0 0.5
0
0.5
ξ
t
0
1
x(t, ξ)
0 0.5
0
1
ξ
t= 0.1t= 0.3t= 0.5t= 0.7t= 0.9
Figure 1.1.: Example 1.2.1: pseudocolor plot of the analytical solution (left) and some
selected snapshots (right).
to motivate considering more general approximation ansatzes which go beyond
the linear one in (1.2). For instance, drawing inspiration from Example 1.2.1,
where the analytical solution is given by applying a time-dependent shift to
the initial value, a reasonable modification of the linear approximation ansatz
(1.2) is given by
x(t)
r
X
i=1
αi(t)T(p(t)) φifor all tI,(1.12)
which constitutes a special case of (1.4) with p1=. . . =pr=:p, i.e., all modes
are transformed using the same time-dependent transformation operator T(p).
For Example 1.2.1, a natural choice for Tis given by the family T=Tper of
periodic shift operators as defined in Definition 1.2.2, see also Figure 1.2 for an
illustration and appendix A for a summary of some mathematical properties
of Tper.
Definition 1.2.2 (Periodic shift operator).For given Ω=(a, b)with a
Rand bR>a, we define the family of periodic shift operators Tper :R
L(L2(Ω)) via Tper(η)f:=g, where, for given ηRand fL2(Ω),gis the
unique element in L2(Ω) satisfying
a+ζ
Za|g(ξ)f(ξ+baζ)|dξ+
b
Z
a+ζ|g(ξ)f(ξζ)|dξ= 0
with ζ:=ηmod (ba).K
Note that despite the different notation used in Example 1.2.1, we may
apply the approximation ansatz (1.12) to Example 1.2.1 by considering xas a
function mapping from Ito a suitable subspace of L2(Ω). In fact, the analytical
9
1. Introduction
0 0.5 1
3
2
1
0
ξ
φ(ξ)
0 0.5 1
3
2
1
0
ξ
(Tper(0.25)φ)(ξ)
Figure 1.2.: An exemplary mode (left) and its shifted analogue (right) using a periodic
shift operator.
solution of Example 1.2.1 can be expressed via the ansatz (1.12) with just one
mode, e.g., by setting
r= 1,T=Tper, φ1=x0, p(t) = ct, α1(t)=1
for all tI. The ansatz (1.12) also allows for low-dimensional representa-
tions of more complex systems including non-constant transport velocities and
moving structures with varying shape, see, for example, [54].
While the advection equation considered in Example 1.2.1 can be well-
described by an ansatz of the form (1.12) with just one transformation param-
etrized by a single path p(t) = ct, in practice one often encounters multiple
wave profiles with different wave speeds. This motivates using a more gen-
eral ansatz with multiple paths as in (1.4). One of the simplest examples for
the occurrence of multiple waves is given by the dynamics of the linear wave
equation.
Example 1.2.3 (Linear wave equation with periodic boundary conditions).
We consider the linear acoustic wave equation with periodic boundary condi-
tions
tρ(t, ξ) = ρref ξv(t, ξ)for all (t, ξ)I×,
tv(t, ξ) = c2
ρref
ξρ(t, ξ)for all (t, ξ)I×,
ρ(0, ξ) = ρ0(ξ)for all ξ,
v(0, ξ) = v0(ξ)for all ξ,
ρ(t, a) = ρ(t, b)for all tI,
v(t, a) = v(t, b)for all tI
(1.13)
on a one-dimensional domain Ω=(a, b)with aR, b R>a and time interval
I= [0, tend]with tend R>0. Here, the unknowns are the density variation
ρ:I×Rand the velocity variation v:I×R, whereas the reference
density ρref R>0and the velocity of sound cR>0are assumed to be given,
cf. [177, § 64]. Moreover, the initial values ρ0, v0C1(Ω) are assumed to
10
1.2. Motivation
satisfy the boundary conditions
ρ0(a) = ρ0(b), ρ0
0(a) = ρ0
0(b), v0(a) = v0(b), v0
0(a) = v0
0(b).(1.14)
By following a similar approach as for instance presented in [198, p. 23 f.], we
may derive the analytical solution
"ρ(t, ξ)
v(t, ξ)#="ρref
c#ϑr(ξct) + "ρref
c#ϑl(ξ+ct),(1.15)
where the so-called Riemann invariants ϑr:RRand ϑl:RRare given
by the (ba)-periodic continuations of
1
2 1
ρref
ρ0+1
cv0!and 1
2 1
ρref
ρ01
cv0!,(1.16)
respectively. Especially, the analytical solution (1.15) involves time-dependent
shifts of the Riemann invariants and, thus, we may expect similar difficulties
in approximating the solution by a linear approximation ansatz of the form
(1.2) as for the linear advection equation considered in Example 1.2.1. A
corresponding theoretical analysis of the decay of the Kolmogorov n-widths
for the linear wave equation has been carried out in [123] and confirms this
intuition. However, if we instead consider an approximation based on the
ansatz (1.4) with
r= 2, φ1="ρref
c#ϑr|, φ2="ρref
c#ϑl|,T="Tper 0
0Tper#,
α1(t) = α2(t)=1, p1(t) = p2(t) = ct for all tI,
(1.17)
then the analytical solution coincides with the approximation. This equality
may be established by considering the analytical solution at each time instance
as element of (L2(Ω))2. Thus, the solution allows for a description using only
two modes, which are transformed by different time-dependent transformations
T(ct)and T(ct). This motivates using an approximation ansatz as in (1.4),
which allows to incorporate more than just one time-dependent transformation
operator. l
The difficulty of standard model reduction schemes with treating transport-
dominated phenomena does not only apply to the simple academic Exam-
ples 1.2.1 and 1.2.3, but has also been observed in many applications where
the dynamics involve the propagation of wave profiles with locally large first
derivative. For instance, in [184, ch. III] the author considers a one-dimensional
nozzle flow, which involves a moving shock, and observes that many POD
modes are necessary to obtain a suitable approximation in that part of the
spatial domain which is affected by the moving shock. A different application
is addressed in [32], where the authors consider among others a batch chro-
11
1. Introduction
matography model and compare a POD-based ROM with a surrogate model
obtained via a purely data-driven approach. They observe a rather slow decay
of the singular values of the snapshot matrix and attribute this to the trans-
port within the system. The same also applies to [163], where a mathematical
model of a solidification process is considered, which leads to dynamics with
propagating fronts. In [213], the authors consider a convection-diffusion test
case and observe that the smaller the diffusion coefficient is chosen, the slower
is the corresponding singular value decay. Similarly, in [180] the authors study
the performance of the POD method for the flow around an airfoil for different
Mach numbers and note that the singular value decay is significantly slower for
higher Mach numbers, i.e., higher flow velocities. Another application where
a slow singular value decay results from some kind of transport is given by
seismic data, as for instance discussed in [174]. Here, the transport is not
explicitly depending on time, but rather results from different positions of the
seismic sources and receivers.
Another example is a pulsed detonation engine which has, for instance, been
a research subject within the Collaborative Research Center 1029 Substantial
efficiency increase in gas turbines through direct use of coupled unsteady com-
bustion and flow dynamics. A schematic depiction of the pulsed detonation
combustion (PDC) cycle is given in Figure 1.3, cf. [256]. Here, a combus-
tion tube is considered whose geometry features a convergent-divergent nozzle,
cf. [122]. In the first stage, the combustion tube is filled with a fuel-air mixture
before the mixture is ignited by a spark plug. As a result, a deflagration flame,
i.e., a subsonic diffusion-driven flame, cf. [51], is propagating through the com-
bustion tube. While the gray scale in Figure 1.3 indicates where the gas is
already burnt and where not, pressure waves are not visible in Figure 1.3. In
fact, there is a fast leading shock traveling in front of the deflagration flame,
which is for instance visible when considering corresponding snapshots of the
density, cf. Figure 1.4. As the shock approaches the nozzle, it gets partially
reflected and one observes a huge pressure increase in the focus point of the re-
flecting pressure waves. As a consequence, a small explosion is observed which
leads to a so-called deflagration-to-detonation transition (DDT). Afterwards,
a detonation wave, which is a supersonic flame coupled to a shock wave, con-
tinues to propagate through the pipe, cf. [51, 122]. In the end, the combustion
products are exhausted, the tube is purged, and afterwards the next cycle of
the PDC process begins.
From a thermodynamic perspective, a detonation wave means a higher ther-
modynamic efficiency than a deflagration flame, since it is closer to an ideal
constant-volume combustion. Thus, it is desirable to reduce the DDT length
as much as possible while still achieving a DDT in order to exploit the more
efficient detonation combustion. However, the DDT process depends on many
different parameters and extensive parameter studies are very expensive or
even infeasible, both numerically and experimentally. This motivates for us-
ing parametric model order reduction techniques to significantly reduce the
12
1.2. Motivation
Filling Stage
fuel
air
Spark Ignition
Propagation of Deflagration Flame
Deflagration-to-Detonation Transition (DDT)
Propagation of Detonation Wave
Exhaust
ξ
Figure 1.3.: Pulsed detonation combustion process.
13
1. Introduction
0 0.5 1
0
0.5
1
ξ
t
2
4
6
ρ(t, ξ)
leading shock
reaction front
reflected wave
re-reflected wave
detonation wave
Figure 1.4.: Pseudocolor plot of the density ρfor the PDC process based on simulation
data from [122].
computational effort for carrying out parameter studies. However, the PDC
process exhibits several transports within the system including a reaction front
and multiple shock waves. This is for instance visible in Figure 1.4, where
simulated density data from [122] are depicted in a pseudocolor plot over the
spatial coordinate ξand the time t. Similarly as for the academic examples
addressed in Examples 1.2.1 and 1.2.3, the transport of reaction fronts and
shock waves causes standard MOR techniques to be ineffective in constructing
low-dimensional accurate surrogate models. Thus, for these kinds of applica-
tions new methods are needed for decreasing the computation time by several
orders of magnitude to allow to perform extensive parameter studies numer-
ically. Since the performance of model reduction methods based on linear
approximation ansatzes is very limited for such transport-dominated systems,
model reduction schemes based on nonlinear approximation ansatzes as the
ones considered in this thesis provide a promising research direction to be able
to tackle such challenging problems as the PDC process. For the specific PDC
simulation data from [122], approximations of the snapshot data based on a
decomposition of the form (1.8) have been presented in [240, 259]. In partic-
ular, it has been demonstrated that this can be done by taking significantly
fewer modes than when using a linear approximation ansatz as in (1.2).
1.2.2. Port-Hamiltonian Systems
The classical model reduction objectives are to obtain low-dimensional and
accurate reduced-order models, ideally in combination with an estimate for
quantifying the approximation error a priori or a posteriori. Depending on
the application, also other objectives may be of importance, as preserving
certain qualitative properties of the original system, such as algebraic con-
straints [29, 248, 268], network or other structures [67, 108, 176, 181], passiv-
ity [15, 44, 98, 239, 267], or stability [62, 206, 234]. In this thesis, we aim
14
1.3. Review of Existing Approaches
for preserving port-Hamiltonian structures, since these come with many de-
sirable properties, see for instance [278] for a general overview, [237] for a
recent survey article on infinite-dimensional pH systems, [168] for a mono-
graph about structure-preserving discretization schemes, and [197] for a re-
cent survey article on port-Hamiltonian descriptor systems. In particular, a
port-Hamiltonian structure implies passivity and often also stability of the dy-
namical system. Moreover, pH structures are closed under power-preserving
interconnection, which makes them especially attractive for control purposes
[85, 187, 189, 222, 276, 277, 254] and for modeling networks [8, 102, 139, 275].
Furthermore, since the energy balance plays the central role within the port-
Hamiltonian framework, it may be applied to many physical systems and is
especially well-suited for the coupling of different physical domains, see for in-
stance [58, 95, 112, 173, 207, 281, 282, 289]. The flexibility of the pH modeling
framework becomes especially noticeable when looking at the wide range of
applications including acoustics [273], chemistry [146, 236, 283], electromag-
netism [69, 116, 225], fluid dynamics [7, 19, 208, 238], structural dynamics
[47, 48, 186, 284], thermodynamics [87, 182], and even economics [185].
Important properties of linear time-invariant port-Hamiltonian systems are,
for instance, their robustness with respect to perturbations in the coefficient
matrices [193, 195], the existence of efficient solvers for associated linear equa-
tion systems [128], and special algebraic properties of the associated matrix
pencil [194].
1.3. Review of Existing Approaches
As mentioned at the beginning of this chapter, the major focus of this the-
sis is on model reduction for transport-dominated systems and on structure-
preserving MOR schemes for port-Hamiltonian systems. In the following two
subsections we give an overview of the most relevant approaches proposed in
the past years within these two subtopics of model reduction.
1.3.1. Model Reduction Techniques for
Transport-Dominated Systems
In recent years, there has been an increasing effort in the model reduction com-
munity to develop new methods which are suitable for transport-dominated
systems. In this subsection, we give an overview of some of these approaches,
which we subdivide into three classes, see also [37, sec. 2], where parts of this
overview have been originally presented. The first class mainly uses time-
dependent coordinate transformations to account for the transport and these
methods are hence referred to as transformation-based methods. For instance,
in the case of a simple advection problem, a natural coordinate transformation
is given by a time-dependent translation or shift that describes the advective
15
1. Introduction
behavior, cf. section 1.2.1. These transformations are either used for trans-
forming the FOM such that it allows for a more effective reduction by stan-
dard MOR techniques or they are incorporated into the MOR approximation
ansatz. In the latter case, the approximation ansatz is typically nonlinear, for
instance if the time dependency of the coordinate transformations is implicit
via some time-dependent parameters and if these are considered as unknowns
of the ROM. This is especially true for the ansatz (1.4), where the transfor-
mations are parametrized by the paths pi, which correspond in the case of a
shift operator to the time-dependent shift amounts.
The second class summarizes methods which include a time-dependent on-
line update of the MOR basis functions, which is not based on a coordinate
transformation. We refer to these methods as adaptive basis methods. Most
of these approaches are based on a time-discrete update of the basis functions
and lead to switched reduced-order models.
As mentioned before, transformation-based methods often involve specific
nonlinear approximation ansatzes, where the nonlinearity originates from a
suitable coordinate transformation, which is typically motivated by physical
insights. The third class consists of methods employing a more generic non-
linear approximation ansatz which is not explicitly targeting the transport
within the dynamics by a suitable coordinate transformation. Therefore, we
refer to them as generic nonlinear methods. Such generic nonlinear approx-
imation ansatzes may be for instance characterized by a quadratic function
or by an artificial neural network (ANN) architecture. Especially, ANN-based
methods have received much attention in the past years and they are often
based on autoencoder architectures, which naturally lead to a low-dimensional
state space for the ROM, see for instance [135, 157, 164, 179, 244].
Remark 1.3.1 (Approximation of transport-dominated problems by time-delay
systems).Most of the methods mentioned in the remainder of this subsection
are based on some kind of projection of the FOM. A completely different
approach is based on describing the input-output dynamics of a transport-
dominated system by a time-delay system. Some first results in this direction
are obtained in [103, 233, 253, 257, 258]. ¨
Transformation-Based Methods
A common approach among the methods using coordinate transformations
consists of formulating the FOM dynamics in a new coordinate system, which
we refer to as the reference frame. The first developments in this direction are
presented within the symmetry reduction framework, cf. [36, 245, 246]. The
main idea is to approximate the solution by a composition of a time-dependent
group action and a so-called frozen solution, cf. section 4.2. Ideally, the group
action and the time-dependent group element are chosen such that the frozen
solution is almost constant over time, which supports a low-dimensional ap-
proximation. The group action can, for example, be chosen based on physical
16
1.3. Review of Existing Approaches
considerations or from snapshot data of the full-order solution. Related model
reduction techniques may be roughly divided into two categories: The first one
is based on first transforming the FOM and then applying standard MOR tech-
niques, while the second one is based on augmenting the approximation ansatz
by a suitable transformation. While both of these categories usually involve
a projection of the FOM, in [154] the authors present a purely data-driven
approach which is based on a modified version of the operator inference frame-
work introduced in [227]. The classical operator inference method constructs
a ROM directly from given snapshot data, whereas the approach introduced
in [154] involves a preceding coordinate transformation applied to the snap-
shot data in order to obtain a low-dimensional and accurate ROM even for
advection-dominated problems.
An example for the first category, where the FOM is transformed before ap-
plying classical MOR techniques, is presented in [219] and relies on Lie group
actions for transforming the original partial differential equation (PDE). In
this context it is assumed that the right-hand side of the FOM is equivariant
under the group action. The resulting transformed PDE is closed by alge-
braic equations, so-called phase conditions, which determine the group com-
ponent of the solution and which are usually chosen such that the temporal
change of the state is minimized, see also [36]. The resulting transformed
system is then reduced by a classical MOR approach based on projecting the
FOM onto a low-dimensional subspace. Moreover, the authors achieve an
efficient offline/online decomposition by employing the empirical operator in-
terpolation method, which is a hyperreduction technique originally proposed
in [84]. In [205], the authors present a model reduction framework which is
based on applying standard MOR techniques, such as the POD method, to a
one-dimensional nonlinear scalar convection-diffusion equation formulated in
a Lagrangian, i.e., co-moving, coordinate system. They also present numeri-
cal experiments for a one-dimensional Euler equations test case and observe
that considering the system in a Lagrangian coordinate system significantly
improves the performance of classical MOR methods. The combination of
their framework with hyperreduction techniques, which would be necessary to
obtain fast ROMs, is not considered in [205]. Instead, in [74] the authors ex-
tend the approach from [205] in several aspects including hyperreduction and
a time-windowing technique. The methods presented in [74, 205, 219] have in
common that they either have restrictive assumptions on the right-hand side of
the FOM or they assume that the transformed problem is already at hand. By
contrast, in [270] the author presents a general method for constructing a bijec-
tive parameter-dependent transformation of the spatial domain such that the
transformed PDE may be effectively reduced by standard MOR approaches.
The major focus is on parameter-dependent PDEs, whereas time-dependent
evolution problems are not explicitly considered, but only indirectly, for in-
stance by discretizing in time and considering time as another parameter. The
parameter-dependent transformation map is constructed in the offline phase
17
1. Introduction
based on solving a nonconvex and nonlinear optimization problem, which aims
for minimizing the distance between the transformed state and a suitably cho-
sen template function. Once this map is constructed, a ROM may be obtained
by first transforming the spatial domain of the PDE and afterwards applying a
standard MOR scheme such as projection onto POD modes. In the follow-up
work [271], the authors extend the approach from [270] by constructing trans-
formation maps acting on space and time and by combining the method with a
hyperreduction scheme to achieve considerable speed-ups in the online phase.
The application of the approach from [270] to transport-dominated problems
on two-dimensional spatial domains is discussed in [101, 272]. In [125], the
authors propose a model reduction technique for nonlinear scalar hyperbolic
PDEs by first introducing a relaxation of the original PDE. This relaxation
leads to a system of two hyperbolic partial differential equations with con-
stant wave speeds and nonlinear right-hand side. For the semi-discretization
in space, they approximate the state by a linear combination of shifted finite el-
ement basis functions, which takes care of the transport within the system and,
thus, allows the successful application of classical model reduction techniques
based on linear subspaces to the semi-discretized system. Other approaches
based on the idea to first transform the FOM and afterwards apply standard
model reduction schemes are, e.g., discussed in [183, 204, 212]. Further meth-
ods which only consider the approximation of snapshot data but not the ROM
construction are for instance presented in [152, 264, 266].
Instead of first transforming the full-order model and then reducing the
transformed system, the authors in [119] directly reduce the untransformed
FOM using an approximation ansatz that includes a translation as coordi-
nate transformation. They construct a corresponding ROM for a linear one-
dimensional advection equation by enforcing the residual to be orthogonal to
the shifted modes, whereas the update of the shift is determined based on the
advection speed of the advection equation. The question of how to achieve an
efficient offline/online decomposition has not been addressed in [119]. A simi-
lar approach has been considered in [245], but in contrast to [119] the method
from [245] may be applied to a more general class of nonlinear problems with
equivariant right-hand side. Especially, they present numerical results for the
one-dimensional Kuramoto–Sivashinsky equation with periodic boundary con-
ditions. Furthermore, the authors propose to compute the time-dependent
shift based on a reconstruction equation, which may for instance be obtained
via template fitting. The latter is based on defining a template function, for
example the initial value, and choosing the shift amount such that the dis-
tance between the shifted state and the template function is minimized. This
results in an algebraic constraint, but instead of considering the corresponding
differential–algebraic equation (DAE), the authors differentiate the algebraic
constraint with respect to time and obtain an ordinary differential equation
(ODE) system as ROM. Similarly as in [119], the question of efficiently eval-
uating the ROM has not been addressed in [245]. In [54] the authors also
18
1.3. Review of Existing Approaches
use an approximation ansatz that includes a coordinate transformation and
for the application of the method they also consider a simple translation as
transformation. For constructing the ROM, the authors first discretize in time
and then substitute the approximation ansatz into the semi-discrete full-order
model. They present an algorithm for updating the time-discrete states of
the ROM by minimizing the time-discrete residual. The evaluation of the
ROM still scales with the dimension of the FOM, but for a Burgers’ equation
test case they present an additional approximation of the ROM which allows
achieving an efficient offline/online decomposition. This hyperreduction ap-
proach is based on constructing interpolation-based approximants of the ROM
coefficient functions in the offline phase and makes use of the periodic bound-
ary conditions of the considered test case and of the quadratic nature of the
nonlinearity of the Burgers’ equation. Another online-efficient approach is the
manifold approximation via transported subspaces method which has been in-
troduced in [243] for time- and parameter-dependent scalar conservation laws.
As approximation ansatz, the authors use a linear combination of transformed
modes, where all modes are affected by the same coordinate transformation,
which is itself described by a linear combination of so-called transport modes.
Both kinds of modes are determined in the offline phase based on snapshot
data and, moreover, the authors present a time-discrete online update of the
time-dependent coefficients occurring in the approximation ansatz. To achieve
an efficient offline/online decomposition, they use the empirical interpolation
method with collocation points which are also updated as time evolves. In
[171], the authors consider one mode with fixed amplitude one for the shape of
the wave profile and a rather general coordinate transformation, which is sim-
ilarly as in [243] described by a linear combination of a few space-dependent
ansatz functions. They discuss two different methods for determining these
ansatz functions in the offline phase, one of which is based on the observation
that their approximation ansatz may be interpreted as a special autoencoder
structure, which allows to determine the ansatz functions by training a neural
network. For the ROM construction, they consider an intrusive and a non-
intrusive scheme, where the former one is the manifold Galerkin method, which
is also addressed below in the passage addressing the generic nonlinear meth-
ods. They also propose a hyperreduction method for a rather general class of
FOM nonlinearities, but they also emphasize that this method requires to eval-
uate the FOM nonlinearity at a number of sample points which is significantly
higher than the dimension of the ROM. Furthermore, the time-dependent up-
date of the sample points involves a matrix-vector product which scales with
the dimension of the FOM. Nevertheless, their method achieves speed-ups for
the considered two-dimensional test cases.
The methods mentioned in the last paragraphs have in common that they
use a single coordinate transformation to obtain low-dimensional and accurate
ROMs for transport-dominated systems. However, in some applications it may
be advantageous to apply multiple coordinate transformations, for instance
19
1. Introduction
when there are several traveling waves with different propagation speeds. A
simple example for the occurrence of more than one traveling wave with differ-
ent advection speeds is provided by the dynamics of the linear wave equation
as discussed in Example 1.2.3. To be able to also achieve low-dimensional ap-
proximations of such phenomena, recently some methods have been proposed
which use multiple transformations, see also section 3.4 for a more detailed
summary of some of these methods. For instance, the shifted POD method,
which has been introduced in [241] and enhanced in [240, 259], is based on an
approximation ansatz involving shift operators with different shift amounts for
different sets of modes. The goal of this method is to obtain a low-dimensional
approximation of a given snapshot matrix. The first version of the shifted
POD algorithm in [241] is based on transforming the snapshots into the differ-
ent reference frames and compressing the transformed data via singular value
decompositions (SVDs). To separate the dynamics corresponding to the dif-
ferent reference frames, a heuristic iterative procedure is introduced and its
performance is illustrated by various numerical examples. In contrast to this,
the shifted POD version presented in [240] is based on an optimization problem
which aims for maximizing the largest singular values in each reference frame.
Another optimization-based approach is introduced in [259] and it directly tar-
gets the minimization of the difference between the original snapshot data and
their approximation. While the methods presented in [240] and [259] consider
the coordinate transformations to be given, an extension of the method in [259]
which also optimizes the paths is provided in [40]. An alternative approach is
considered in [242], where the authors also present a method for approximating
snapshots by a linear combination of shifted modes using multiple coordinate
transformations. In contrast to the shifted POD method, the technique intro-
duced in [242] is based on a greedy-type algorithm. As a consequence, this
approach is not able to yield the two traveling wave profiles when applied
to a linear wave equation test case where the analytical solution is known.
Nevertheless, they also present some interesting extensions which have not yet
been addressed within the shifted POD framework, such as advection velocities
which may vary within the spatial domain. In [199], the authors propose an
optimization scheme for determining the advection speeds of multiple traveling
waves and afterwards compute low-rank approximations within each reference
frame. Since this method is lacking an iteration as for instance in [241], it is
similarly as the approach in [242] not able to obtain optimal decompositions
for examples like the linear wave equation. An application of the approach
in [199] in the context of a rotating detonation engine has been presented in
[200] and an extension to two-dimensional wave phenomena in [201]. A special
class of coordinate transformations where the modes are composed with lin-
ear affine mappings is considered in [124]. In contrast to the other approaches
mentioned in this paragraph, the method presented in [124] assumes the modes
to be given and focuses on the optimization of the amplitudes and of the path
variables, i.e., the quantities which parametrize the transformations.
20
1.3. Review of Existing Approaches
All of the methods mentioned in the previous paragraph have in common
that they focus on a low-dimensional decomposition of snapshot data, whereas
no ROMs are constructed based on the identified modes. On the contrary, in
[37] the authors introduce a framework that allows constructing ROMs based
on transformed modes, which can, for instance, be computed by one of the
contributions mentioned in the previous paragraph. Notably, their framework
is not restricted to the case that all modes are transformed by the same type
of coordinate transformation, but they consider the general ansatz (1.3) which
allows incorporating different families of transformation operators.
Another class of methods which is based on multiple coordinate transforma-
tions is described in [214, 251]. Especially, these methods consider transports
with respect to parameter changes instead of with respect to time increments
and they use snapshots from close-by parameter values as ansatz functions.
While the authors in [214] consider stationary problems, the authors in [251]
consider time- and parameter-dependent problems in a time-discrete setting.
Even though it has not been proposed in the context of model order re-
duction, we also mention the moving finite element method (MFEM), which
has been proposed in [202, 203] and which is also based on the idea of refer-
ence frames. The MFEM extends the classical finite element method (FEM)
by allowing the FEM basis functions to move in space as time evolves. This
method is especially suitable for problems with moving shocks and the idea is
that the basis functions follow the movements of the shocks such that a fine
resolution is only achieved where it is needed. The unknowns of the result-
ing semi-discretized system are the amplitudes and the positions of the basis
functions. The corresponding approximation ansatz is nonlinear in these un-
known state variables, which means that the approximation of the PDE state
is not restricted to a linear subspace but to a nonlinear manifold. For the time
evolution of the state variables, the authors in [203] propose to minimize the
residual with respect to the time derivative of the state variables. The resulting
semi-discretized system is inherently nonlinear and features a state-dependent
mass matrix. To ensure that this matrix is always nonsingular, the authors
propose a regularization technique which punishes the relative movement of
different basis functions. The major difference between the MFEM and the
model reduction framework presented in [37], see also section 4.1, is that the
MFEM uses generic basis functions with compact support, whereas [37] em-
ploys problem-specific basis functions which are usually non-zero on the entire
computational domain.
Adaptive Basis Methods
While transformation-based methods involve a coordinate transformation of
the FOM or of the basis functions, the adaptive basis methods considered
in the following are based on time-dependent updates of the ROM which do
not originate from a coordinate transformation. One example for such an ap-
proach is presented in [81], where the authors divide the total time interval
21
1. Introduction
into several subintervals and determine different reduced bases for the differ-
ent subintervals. The segmentation of the total time interval is carried out in
an adaptive way which ensures that a given error tolerance is met and that
the number of basis functions in a subinterval cannot exceed a certain pre-
defined value. In the online phase, the ROM is simulated step by step on
each subinterval specified in the offline phase. When an interface between two
subintervals is reached, the initial value of the ROM corresponding to the new
subinterval is determined based on projecting the current approximation of
the full-order state onto the span of the basis functions of the new subinterval.
Consequently, the ROM may in total be regarded as a switched system with
a switching condition which depends only on time.
Differently from [81], the authors in [93] propose a scheme that adapts the
reduced basis only in the online phase and only if the error estimator returns
values which are very high or very low. In the offline phase, they define a
tree structure that represents hierarchical orthogonal decompositions of the
underlying vector space of the FOM. Based on this tree structure, they are
able to adaptively increase the number of basis functions if the error of the
ROM is very high or to reduce the number of basis functions if the error is
smaller than a prescribed error threshold. As in [81], the obtained ROM is
a switched system, but in contrast to [81], the switching condition is state-
dependent, since the error estimators are based on the current state of the
ROM. Thus, the switching times are a priori not known. The work in [93] is
based on the ideas presented in [59] and extends them towards more general
refinement trees and a more general and efficient basis compression scheme.
Another online-adaptive scheme is proposed in [226], where the basis functions
are regularly modified via a low-rank update, but the number of basis functions
remains constant in contrast to [93].
In [115], the authors present an approach that also involves a reduced basis
which is adapted as time evolves. However, in contrast to the works men-
tioned in the previous two paragraphs, the rules for updating the basis are
more problem-specific. Concretely, they propose to use the eigenfunctions of
a linear Schrödinger operator associated with the initial value of the FOM as
basis functions. Then, their time evolution is performed in such a way that
the basis functions remain eigenfunctions of a linear Schrödinger operator as-
sociated with the time-dependent FOM state. Consequently, they obtain an
additional evolution equation for the basis functions. In contrast to the works
mentioned in the previous two paragraphs, the ROM is not a switched system
with time-discrete changes in the basis functions, but instead they obtain a
time-continuous equation for their evolution.
An approach which combines the idea of using time-dependent basis func-
tions with machine learning techniques is provided in [229]. To this end, the
authors use a linear combination of space-, time-, and parameter-dependent
basis functions with time- and parameter-dependent coefficients as approxima-
tion ansatz. In the offline phase, they use FOM snapshots to train two neural
22
1.3. Review of Existing Approaches
networks: one for the coefficients and one for the basis functions. Afterwards,
they project the time-discrete FOM onto the span of the basis functions to
obtain a reduced-order model. Thus, each time step of the online phase in-
volves the evaluation of the neural network for the basis functions and the
construction of the projected ROM based on the current reduced basis. Con-
sequently, the online phase still scales with the FOM dimension. Nevertheless,
they observe a computational speed-up, at least for cases where a high spatial
resolution is used for the FOM.
Generic Nonlinear Methods
The idea of the third class of methods is to approximate the solution via a
generic nonlinear approximation ansatz. In [179], the authors use an autoen-
coder, which is a type of artificial neural network, to obtain a low-dimensional
description of the FOM solution. Based on the snapshot data, a decoder and
an encoder mapping are learned, where the decoder is a mapping from the
reduced state space to the full state space and the encoder vice versa. Es-
pecially, the lifting of the reduced state to an approximation of the full-order
state is performed by the decoder mapping, which thus describes the approx-
imation ansatz. The projection of the FOM is carried out by substituting
the approximation ansatz into the FOM and then constructing the ROM via
minimization of the residual. They propose two different approaches: The
manifold Galerkin method which is based on minimizing the residual for the
time-continuous FOM and the manifold least-squares Petrov–Galerkin method
which considers the FOM in a time-discrete setting. The idea to use autoen-
coders for the purpose of model order reduction has been previously presented
in [135] and [157]. The mentioned approaches have in common that the eval-
uation of the ROM usually still scales with the dimension of the FOM, which
prevents an efficient online phase. Therefore, in [164] the authors combine
the approach from [179] with a classical hyperreduction approach for linear
subspace methods. Furthermore, in contrast to [179], the method in [164]
is based on shallow de- and encoders and, moreover, they propose to use a
sparse decoder with a sparsity structure which reflects for instance spatial lo-
cality properties in the solver used for simulating the FOM. As a consequence,
they are able to show that the evaluation of the hyperreduced ROMs does in
general not scale with the FOM dimension. Nevertheless, in their numerical
experiments for one- and two-dimensional Burgers’ equation test cases their
approach only achieves moderate speed-ups. A possible explanation could be
that the sample points used for the hyperreduction scheme are fixed and do
not move along the transport within the system. As a consequence, the re-
ported number of hyperreduction sample points is significantly larger than the
corresponding ROM dimension. A similar hyperreduction approach has been
presented in [244]. Instead of obtaining the ROM via projection onto a non-
linear manifold, in [106] the authors introduce a method which approximates
the FOM state by means of an autoencoder and describes the ROM dynamics
23
1. Introduction
by means of a deep feedforward neural network. This approach only requires
solution snapshots of the FOM state, but not access to the FOM itself. Fur-
thermore, since no projection is involved, no hyperreduction is needed and
the only computational cost in the online phase is the evaluation of the neu-
ral networks trained in the offline phase. This framework has been further
developed and applied to challenging test cases in [104, 105, 107]. Further
MOR approaches based on artificial neural networks are, for instance, given in
[50, 86, 191].
Instead of using an approximation ansatz based on an ANN architecture, in
[20] the authors propose a model reduction framework which is based on ap-
proximating the FOM state by a quadratic approximation ansatz. For the
offline phase, a two-step procedure is introduced where the snapshots are
first compressed using the classical POD method and, afterwards, a purely
quadratic ansatz is used for approximating the part of the dynamics which is
not captured by the POD modes. The second step involves solving ninde-
pendent linear least squares problems, where ndenotes the FOM dimension.
To obtain a ROM, the authors use a special case of the manifold least-squares
Petrov–Galerkin method for quadratic manifolds. Furthermore, they achieve
an efficient offline/online decomposition by suitably extending the energy-
conserving sampling and weighting method, which is a hyperreduction tech-
nique originally proposed in [97] and is based on a quadrature approximation
of the ROM nonlinearity. The numerical experiments presented in [20] reveal
that this new method is able to achieve considerable speed-ups in comparison
to classical POD-based ROMs. Also in [113] the authors use a quadratic ap-
proximation ansatz. However, in contrast to [20], their approach is not based
on projection, but instead corresponds to an extension of the non-intrusive
operator inference framework to quadratic approximation ansatzes.
Further model reduction techniques which are based on nonlinear approxi-
mation ansatzes are for instance presented in [13, 56, 89, 190].
1.3.2. Structure-Preserving Model Reduction for
Port-Hamiltonian Systems
Most standard model reduction schemes in general do not preserve stability
or passivity of the FOM and this shortcoming may lead to unphysical ROMs
and unbounded errors. One possibility to guarantee preservation of stability
and passivity during the model reduction is achieved by making use of a pH
representation of the governing equations. Further properties which motivate
for preserving the pH structure are mentioned in section 1.2.2.
Structure-preserving model reduction techniques for pH systems may be
roughly divided into two classes: The first class of methods aims for approxi-
mating the input-output map of a port-Hamiltonian system, whereas the sec-
ond class targets a good approximation of the state, for instance by using
a POD-based approach. As port-Hamiltonian systems arose from the control
24
1.3. Review of Existing Approaches
systems community, the former class has received considerably more attention.
In the following we provide a summary of structure-preserving MOR techniques
and begin with those ones which are based on control system strategies and
afterwards we discuss methods which aim for a good state approximation. We
also refer to [197, sec. 8] for a detailed overview on structure-preserving model
reduction schemes with particular emphasis on methods for DAE systems.
Remark 1.3.2 (Passivity-preserving MOR techniques).In the special case of
controllable and observable linear time-invariant ODE systems, it is for in-
stance shown in [26] that the subclasses of port-Hamiltonian systems and pas-
sive systems coincide, see also [68] for corresponding statements in the DAE
case. Consequently, in this case we may regard passivity-preserving MOR
schemes as structure-preserving schemes for port-Hamiltonian systems, see for
instance [44] for a more detailed discussion. ¨
Balancing-Based Methods
We start by summarizing structure-preserving MOR methods which are based
on some kind of balancing technique and a subsequent truncation. For in-
stance, in [109] the author presents a modification of the balanced truncation
approach for nonlinear control systems originally introduced in [110] for gen-
eral unstructured systems. While the original work is based on balancing
suitably defined observability and controllability functions, the approach in
[109] is based on either replacing the observability or the controllability func-
tion by an appropriately weighted version. By this modified balancing, the
method ensures that the corresponding ROM preserves the pH structure of
the FOM. The author also investigates the existence of the required weighting
matrices and derives equivalent conditions based on the column spans of the
state-dependent dissipation and input matrices. Consequently, the approach
is restricted to a certain subclass of nonlinear pH systems and for this subclass
it is shown that the weighted version of the controllability or observability
function coincides with the Hamiltonian. Furthermore, in the nonlinear case
the approach presented in [109] requires solving a Hamilton–Jacobi equation
and, thus, may be only applied to nonlinear FOMs of small size.
In [137] the authors consider linear time-invariant port-Hamiltonian FOMs
with even state space dimension and invertible structure matrix. They present
an approach which is based on balanced truncation and singular perturbation
arguments to obtain a port-Hamiltonian ROM. Furthermore, they present con-
ditions for the FOM and the ROM to be asymptotically stable and in that case
they obtain an a priori error bound as in the classical unstructured balanced
truncation method.
Similarly as in [109], in [160] the authors consider a modification of bal-
anced truncation, where they balance the controllability function and the
Hamiltonian. In contrast to [109], their approach is presented for linear time-
invariant pH systems with positive definite dissipation matrix. Furthermore,
25
1. Introduction
they present a bound for the error in the supplied energy for a given input
signal and this bound involves the sum of the truncated singular values. An
extension of this approach to nonlinear pH systems is provided in [161] and
it involves similarly as in [109] computing a solution of a Hamilton–Jacobi
equation.
Instead of considering classical balanced truncation, in [42] the authors pro-
pose structure-preserving variants of the generalized balanced truncation and
extended balanced truncation methods. The former one has been originally
introduced in [145] and is based on balancing solutions of Lyapunov inequal-
ities instead of Lyapunov equations as in the classical balanced truncation
method. Similarly, also the extended balanced truncation method introduced
in [250] relies on balancing solutions of linear matrix inequalities, which may
be regarded as extended versions of the ones considered in the generalized
balanced truncation method and provide additional degrees of freedom. The
structure-preserving variants of both methods presented in [42] are based on
the idea to simultaneously diagonalize the energy matrix of the pH system
and the solutions of the respective matrix inequalities by a suitable congru-
ence transformation. To this end, they derive for each of the two methods
sufficient conditions to ensure that such a congruence transformation exists
and demonstrate how such a transformation may be constructed. Moreover,
both methods provide a priori error bounds similarly as in the classical bal-
anced truncation method and both the theory and the numerical experiments
show that the error bound obtained via extended balanced truncation is in
general smaller than the one obtained via generalized balanced truncation.
While the balancing approaches discussed so far mainly focus on the open-
loop behavior, in [45] the authors propose a structure-preserving variant of the
linear quadratic Gaussian (LQG) balanced truncation method, which is espe-
cially suitable for constructing low-dimensional controllers. Their approach is
based on appropriately choosing the weighting matrices of a pair of algebraic
Riccati equations to ensure that the resulting system obtained after balancing
and truncation is port-Hamiltonian. Moreover, they derive an error bound in
the gap metric and demonstrate that replacing the FOM Hamiltonian by one
which is based on a maximal solution of a Kalman-Yakubovich-Popov inequal-
ity may lead to a significantly lower error bound and error. In addition, they
also propose a similar approach for classical balanced truncation and also ob-
tain an Herror bound provided that a certain condition on the dissipation
and input ports is satisfied. We note that the same condition also appears
in [42] in the context of generalized balanced truncation. An extension of
the structure-preserving LQG balanced truncation method from [45] to port-
Hamiltonian DAE systems is presented in [43]. Since the techniques presented
in [43, 45] require the solutions of quadratic matrix equations whose size scales
with the FOM dimension, the application of these methods is limited to FOMs
of moderate size.
26
1.3. Review of Existing Approaches
Methods Based on Interpolating the Transfer Function
In [230] the authors present a structure-preserving model reduction scheme
for linear time-invariant pH systems via moment matching at infinity, i.e.,
via matching the Markov parameters of the transfer function. They present
structure-preserving variants of the Arnoldi and the Lanczos method, cf. [14,
sec. 10.4], by exploiting that the considered linear time-invariant pH struc-
tures are preserved by Galerkin or suitable Petrov–Galerkin projections. For
the structure-preserving variant of the Lanczos method, they require an ad-
ditional assumption on the port-Hamiltonian system matrices to accomplish
the matching of 2rMarkov parameters with a ROM of state space dimension
r. This additional assumption is for instance satisfied if the FOM is purely
damped or purely undamped. Further structure-preserving techniques, which
achieve moment matching at other expansion points than at infinity, are pre-
sented in [231, 286].
All interpolation-based methods mentioned so far are based on matching
the first moments of the FOM transfer function at a fixed expansion point.
On the other hand, in [130] the authors present a structure-preserving MOR
scheme for linear time-invariant multiple-input/multiple-output pH systems
via tangential interpolation of the transfer function. This approach naturally
allows for interpolation at various points and, moreover, the authors propose
an algorithm for selecting the interpolation points and directions, which is
inspired by the iterative rational Krylov algorithm (IRKA) presented in [129]
for general unstructured systems. However, while the original IRKA method
yields ROMs which are optimal with respect to the H2norm, the algorithm
introduced in [130] only enforces parts of the necessary optimality conditions
and does hence in general not guarantee H2-optimal ROMs. Another structure-
preserving method which achieves moment matching at multiple expansion
points is presented in [153].
In [117], the authors present a Krylov-based structure-preserving MOR
scheme for parameter-dependent pH systems. The structure preservation is
achieved by a suitable modification of the matrix interpolation method, which
has been originally presented in [224] for the unstructured case.
While the interpolation-based methods mentioned so far consider model re-
duction for pH systems of ODEs, in [88] the authors use Krylov methods for
reducing a port-Hamiltonian DAE system, which arises from the modeling of
a transport network. The structure of the FOM allows to preserve both the
port-Hamiltonian structure and the algebraic constraints by a Galerkin pro-
jection which only acts on the differential equations and the corresponding
variables. They also achieve the preservation of further properties like mass
conservation by ensuring that the projection matrices satisfy some algebraic
compatibility conditions.
While the approach in [88] is rather specific for the considered special class
of port-Hamiltonian DAE systems, in [27] the authors present a more general
structure-preserving model reduction framework for port-Hamiltonian DAEs.
27
1. Introduction
In particular, they discuss for different classes of semi-explicit index-one and
index-two systems how to construct ROMs which satisfy given tangential in-
terpolation conditions and match the polynomial part of the FOM transfer
function, while preserving the pH structure. Furthermore, for determining
suitable interpolation points and directions, they propose an extension of the
pH-IRKA method introduced in [130] to the DAE case.
Effort-Constraint and Flow-Constraint Method
A general procedure for structure-preserving MOR of port-Hamiltonian sys-
tems is presented in [232] and it is based on the representation of a pH system
in terms of a Dirac structure. The method assumes that the FOM is already
balanced in the sense that a part of the state variables has little contribu-
tion to the input-output map of the system. Based on this assumption, the
authors propose the idea to obtain energy-consistent ROMs by cutting the en-
ergy flow related to the less relevant state variables. To this end, they present
two methods, which both remove the less relevant state variables and yield
port-Hamiltonian ROMs: the so-called effort-constraint and flow-constraint
methods. Both techniques are presented in a general setting which also in-
cludes nonlinear pH systems, but the authors also demonstrate the two ap-
proaches for the special case of a linear time-invariant pH system. While the
effort-constraint method yields a ROM which has the same structure as the
FOM, the flow-constraint method leads to an additional feedthrough term in
the ROM. Furthermore, the application of the effort-constraint method re-
quires a certain submatrix of the energy matrix to be invertible, whereas the
flow-constraint method requires a certain submatrix of the structure matrix
to be invertible. Even though it is not explicitly mentioned in [232], we note
that comparing the ROMs obtained via the effort-constraint method and those
derived in [137] via balanced truncation and singular perturbation arguments
appear to coincide. This suggests a relation between these two approaches in
the special case where the FOM is balanced via classical Lyapunov balancing.
In [138], the authors extend the effort-constraint and flow-constraint meth-
ods to linear time-invariant DAEs. For this purpose, they first transform the
port-Hamiltonian DAE such that the differential and algebraic equations are
separated and such that the differential equations do not depend on the al-
gebraic variables. Afterwards, they perform the model reduction only on the
differential part and, hence, preserve not only the pH structure but also the
algebraic constraints. In addition, they present a structure-preserving MOR
technique for port-Hamiltonian DAEs which is based on moment matching.
Optimization-Based Methods
Most of the methods discussed so far have in common that they are based
on projection-based MOR techniques. As an alternative, in [252] the author
proposes to directly determine the coefficients of a linear time-invariant asymp-
28
1.3. Review of Existing Approaches
totically stable ROM by solving an optimization problem. More precisely, it
is proposed to choose the ROM coefficients such that the error in the trans-
fer function is minimized with respect to the H2norm. To ensure that the
ROM is pH and asymptotically stable, the optimal ROM matrices are sought
within a product manifold which involves the spaces of skew-symmetric and
of symmetric positive definite matrices. To solve this optimization problem,
a Riemannian trust-region algorithm is presented, which involves solving lin-
ear matrix equations scaling with the FOM dimension in every iteration and
is, thus, only applicable for problems of moderate size. As starting value, it
is proposed to use a ROM obtained via the pH-IRKA approach from [130]
and the numerical results indicate a clear improvement in comparison to this
starting value, especially when the dimension of the ROM is very small. How-
ever, this advantage becomes smaller as the ROM dimension increases and the
ROM obtained via pH-IRKA is even partially better when considering the H
instead of the H2error.
In [209], the authors propose a more efficient way for solving the optimiza-
tion problem considered in [252] based on pole-residue formulations of the
transfer functions of the FOM and of the ROM. This avoids having to solve
high-dimensional linear matrix equations in each step and, thus, extends the
applicability to large-scale FOMs.
While [209] and [252] are based on optimization with respect to the H2
distance, in [261] the authors propose the structured optimization-based model
order reduction (SOBMOR) method using a cost function which is motivated
by Hminimization, but only requires transfer function evaluations at certain
sampling points on the imaginary axis. In their numerical experiments, they
demonstrate that minimizing this cost function is advantageous in terms of
computational cost and accuracy compared to a direct minimization of the
Herror. Furthermore, their numerical experiments reveal that this new
method outperforms other structure-preserving MOR techniques as the effort-
constraint balanced truncation method introduced in [232] and the pH-IRKA
method proposed in [130]. In addition, an adaptive sampling strategy for
the SOBMOR approach is presented in [260], whereas an extension of the
approaches from [209] and [261] to DAE systems is provided in [210, 262].
An Approach Based on POD
The methods mentioned so far are based on considering port-Hamiltonian sys-
tems as control systems and they build ROMs by approximating the input-
output map. Due to the focus on the input-output behavior, the ROMs ob-
tained from these methods do typically not lead to a good approximation of
the FOM state variables. In contrast to this, in [65] the authors present a
structure-preserving model reduction technique which is based on the POD
method for generating the projection matrices and, thus, this approach may
be used for applications where a good approximation of the FOM state is
desirable. The considered class of FOMs consists of nonlinear time-invariant
29
1. Introduction
port-Hamiltonian systems of ordinary differential equations. To ensure struc-
ture preservation, the authors propose to use a Petrov–Galerkin projection,
where the trial and test spaces are chosen such that they provide good ansatz
spaces for the FOM state and the gradient of the FOM Hamiltonian, respec-
tively, see also section 2.6.2 for more details. Those spaces may be obtained
by computing SVDs of matrices containing snapshots of the state or of the
gradient of the Hamiltonian, respectively.
Since the evaluation of the ROM involves in general the evaluation of the
FOM nonlinearity and is thus not efficient, they also present a structure-
preserving hyperreduction technique, which is based on an additional approx-
imation of the nonlinearity and thereby renders the ROM evaluation efficient.
The hyperreduction method is based on a modification of the discrete empir-
ical interpolation method (DEIM) and involves first writing the Hamiltonian
as the sum of a quadratic function and a remainder term. Afterwards, they
introduce a DEIM-inspired approximation of the remainder term such that the
gradient of the approximate Hamiltonian may be evaluated in an efficient way.
Finally, this leads to a ROM which is port-Hamiltonian and whose evaluation
does not scale with the FOM dimension.
Next to the POD-based approach, the authors also present another method,
where the linear subspace used for approximating the FOM state is determined
based on applying the pH-IRKA method from [130] to a linearized version of
the FOM.
Methods for Hamiltonian Systems
While this literature overview mainly focuses on general pH systems, we em-
phasize that there has been also much research effort to develop structure-
preserving model reduction techniques for Hamiltonian systems. In this con-
text, researchers are typically not only interested in preserving the energy
conservation properties, but also in preserving the symplectic structure. For
instance, in [228] the authors present a structure-preserving model reduction
scheme based on the proper symplectic decomposition (PSD), which is a mod-
ification of the POD method to determine a symplectic projection matrix, and
a subsequent symplectic Galerkin projection. As an alternative to the PSD, in
[3] a greedy procedure for parameter-dependent Hamiltonian systems is intro-
duced, which gradually constructs a symplectic projection matrix by adding
two basis vectors per iteration. This approach has been also applied to dissipa-
tive Hamiltonian systems in [4] by first coupling the FOM to a heat bath, which
collects the dissipated energy such that the coupled system may be regarded as
a conservative system. In [143, 223], the authors present a structure-preserving
MOR framework using time-dependent basis functions and thereby not only
achieve preservation of the Hamiltonian structure, but this approach is also
suitable for achieving an effective reduction of transport-dominated systems.
This applies also to the symplectic manifold Galerkin method, which is intro-
duced in [50] and involves a structure-preserving projection of a Hamiltonian
30
1.4. Contributions, Outline, and Previously Published Results
system onto a nonlinear manifold, which may be for instance constructed based
on an ANN.
1.4. Contributions, Outline, and Previously
Published Results
In this section, we provide an overview of the major contributions of this thesis
and briefly summarize the structure of its remainder. Moreover, since some of
the results of this thesis originate from joint work with other researchers and
have already been published, we address these previously published works at
the end of this section.
We begin by listing the main contributions of this thesis in the following.
(i) Given snapshot data of a finite-dimensional FOM of the form (1.1), we
present a new method for computing a decomposition of the form (1.4).
To this end, we assume the transformation operators and paths to be
given and propose to solve an optimization problem which aims for min-
imizing the error of the approximation (1.4). Moreover, we propose two
different optimization strategies: the optimization parameters of the first
one consist of the amplitudes and the modes, whereas the second one is
based on a reduced optimization problem, which has only the modes as
optimization parameters, cf. section 3.1.
(ii) For the case of parameter-dependent FOMs, we introduce a greedy al-
gorithm for combining the parameter sampling with a gradual determi-
nation of the modes in the offline phase. This algorithm extends the
classical POD-greedy algorithm to nonlinear approximation ansatzes of
the form (1.8) and involves the repeated application of the optimization
procedure from (i), cf. section 3.2.
(iii) While the framework in this thesis is mostly presented for a rather general
class of transformation operators, we propose various families of trans-
lation operators for one-dimensional spatial domains. Especially, these
translation operators differ in their boundary treatment and we illus-
trate their usefulness by means of some exemplary PDEs with different
boundary conditions, cf. section 3.3 and chapter 6.
(iv) Based on given modes, we present a nonlinear projection framework for
constructing dynamic reduced-order models for determining the ampli-
tudes and the paths in the online phase. In particular, the ROMs are
constructed in such a way that the residual is minimized at each time
instant and, due to the nonlinearity of the approximation ansatzes (1.4)
and (1.8), the ROMs are in general nonlinear even in the case of a linear
FOM. We also present a residual-based error bound and discuss existence
31
1. Introduction
and uniqueness of solutions of the ROM. In addition, we also discuss the
case where the state-dependent mass matrix of the ROM may become
singular and propose a regularization strategy for ensuring invertibility
of the mass matrix, see section 4.1.
(v) We show that in a special case the ROMs constructed as in (iv) are equiv-
alent to ROMs obtained via a symmetry reduction technique, provided
that the same paths are used in both frameworks. Moreover, we compare
the residual minimization strategy for fixing the paths as considered in
this thesis with two different strategies within the symmetry reduction
framework and show a relation between the corresponding minimization
problems, cf. section 4.2.
(vi) To obtain ROMs whose evaluation does not scale with the dimension
of the FOM, we introduce a hyperreduction framework, which is based
on additional approximations of the ROM nonlinearities. One source
of nonlinearity of the ROM is the nonlinearity of the approximation
ansatz and it leads to path-dependent coefficient matrices whose eval-
uation involves large-scale matrix-vector products. To avoid evaluating
these path-dependent matrices in the online phase, we propose to de-
termine corresponding interpolation- or regression-based approximants,
which may be evaluated in the online phase in an efficient way. Fur-
thermore, we present an extension of the empirical interpolation method
(EIM)/DEIM which is based on approximating the FOM nonlinearity by
an ansatz of the form (1.8), cf. section 4.3.
(vii) We discuss in a general finite-dimensional setting how a port-Hamiltonian
representation of the FOM may be exploited in the context of projection-
based model reduction to obtain port-Hamiltonian ROMs. To this end,
we consider different linear and nonlinear as well as time-invariant and
time-varying classes of port-Hamiltonian FOMs and present structure-
preserving schemes based on different linear and nonlinear as well as
time-invariant and time-varying approximation ansatzes. The consid-
ered nonlinear approximation ansatzes include in the finite-dimensional
setting (1.4) as a special case, which allows us to obtain port-Hamiltonian
ROMs which are based on the ansatz (1.4). Moreover, we provide ad-
ditional conditions which ensure that the port-Hamiltonian ROMs are
stable and we demonstrate that in many cases we are able to obtain
ROMs which are at the same time pH and optimal in the sense of resid-
ual minimization, cf. chapter 5.
(viii) We illustrate the numerical performance of the new MOR framework by
means of several transport-dominated test cases including an example
with more than one traveling wave, an example with non-periodic and
inhomogeneous boundary conditions, and a nonlinear example. In this
context, we also provide details on suitable discretizations of the involved
32
1.4. Contributions, Outline, and Previously Published Results
translation operators and on structure-preserving discretization schemes.
All in all, the numerical results illustrate that the new method not only
yields accurate ROMs of very low dimension, but it also allows for a great
flexibility in increasing the time step size compared to the corresponding
FOMs and to corresponding POD-based ROMs, cf. chapter 6.
The remainder of this thesis is subdivided into six chapters and four ap-
pendices. In chapter 2, we introduce some notation used throughout this
thesis and present a couple of preliminary results from different relevant areas
such as nonlinear optimization, model reduction, and port-Hamiltonian sys-
tems. We proceed in chapter 3 by addressing the problem of identifying modes
based on given snapshot data, whereas chapter 4 is devoted to the construc-
tion of ROMs, once the modes have been determined. Afterwards, we discuss
structure-preserving, projection-based MOR for different classes of linear and
nonlinear port-Hamiltonian systems and using different linear and nonlinear
approximation ansatzes in chapter 5. In chapter 6, we demonstrate the appli-
cation of some of the methods presented in chapters 3 to 5 by means of three
transport-dominated numerical examples. Finally, a summary of the whole
thesis and an outlook to future work are presented in chapter 7.
For the sake of a better readability of the main chapters, some of the results
are presented only at the end of this thesis in one of the appendices. In
appendix A we discuss some properties of the periodic shift operator since
this transformation operator is used in many examples of this thesis and some
of them require the properties formally proven in appendix A. Furthermore,
we analyze some properties of a class of linear time-varying port-Hamiltonian
systems in appendix B, since we use this class in chapter 5 and since it differs
from the classical definition introduced in [25]. In appendix C, we discuss time
integration schemes based on discrete gradients and propose a new scheme
for nonlinear time-invariant port-Hamiltonian systems with a state-dependent
mass matrix. The latter one is used in chapter 6 to ensure a dissipation
inequality for the ROMs after time discretization. At the end, we provide in
appendix D some details about the discretization of the shift operators used
in chapter 6 and about the spatial discretization of the nonlinear term of the
reaction–diffusion test case considered in section 6.3.
We close this section by stating which of the results of this thesis have
been obtained in joint work with other researchers and are already published
elsewhere.
The nonlinear projection framework considered in section 4.1 has been
originally developed in joint work with F. Black and B. Unger and was
first published in [37]. As mentioned in section 1.1, the setting con-
sidered in this thesis is slightly different, but nevertheless large parts
of section 4.1 are directly based on the results from [37], whereas the
regularization method presented at the end of section 4.1 is new. In
33
1. Introduction
section 4.2, we consider even the same setting as in [37] and in par-
ticular the content of section 4.2 essentially coincides with the one of
[37, sec. 6], even though section 4.2 is partially more detailed. Also the
infinite-dimensional formulation of the optimization problem considered
in section 3.1 is based on the one presented in [37] with only slight mod-
ifications. In addition, the extended domain shift operator presented in
section 3.3.1 has been originally introduced in [37]. Finally, the numer-
ical test cases considered in sections 6.1 and 6.2 are very similar to the
ones considered in sections 7.2 and 7.3 from [37]. However, the numerical
experiments are not identical, as we use for instance different initial and
boundary values as well as different discretization methods in this thesis.
A preliminary version of the optimization framework considered in sec-
tion 3.1 has been originally presented in [259] in joint work with J. Reiss
and V. Mehrmann. However, section 3.1 provides a significantly more
profound analysis and, especially, discusses in detail the connection to
the variable projection framework as introduced in [120].
The hyperreduction framework which is subject of section 4.3 has been
originally presented in a very similar fashion in [39] in joint work with
F. Black and B. Unger. A notable difference to [39] is that the method
presented in section 4.3.2 is formulated for the case where the FOM is a
PDE.
The original idea of approximating the FOM state by an augmented
approximation ansatz including multiple transformation operators has
been presented in [241] in joint work with J. Reiss, V. Mehrmann, and
J. Sesterhenn. While the results from [241] are not presented in detail in
this thesis, we emphasize that the constant extrapolation shift operator
presented in section 3.3.3 has been already used in [241]. Moreover, in
section 3.4 we compare the approach from [241] to the one outlined in
section 3.1.
In addition to the listed papers, there are also other previously published
works, which are at least mentioned in this thesis. For instance, the optimiza-
tion framework presented in [40] (joint work with F. Black and B. Unger) is
compared in section 3.4 to the approach from section 3.1 and in addition men-
tioned several times throughout this thesis. Also the study [38] (joint work
with F. Black and B. Unger), which addresses the application of the projection
framework from [37] to the linear wave equation, is mentioned a few times and
its findings are also used for the numerical test case presented in section 6.1.
Further previously published papers which are mentioned in this thesis include
contributions on port-Hamiltonian modeling as in [7] (with R. Altmann), [19]
(with H. Bansal, M. H. Abbasi, H. Zwart, L. Iapichino, W. H. A. Schilders, and
N. van de Wouw), and [69] (with K. Cherifi and V. Mehrmann), on structure-
preserving model reduction as in [45] (with T. Breiten and R. Morandin) and
34
1.4. Contributions, Outline, and Previously Published Results
[43] (with T. Breiten), and on structured data-driven modeling as in [103] (with
E. Fosong and B. Unger), [257] (with B. Unger), and [258] (with B. Unger,
C. Beattie, and S. Gugercin).
35
2. Preliminaries
In this chapter, we introduce the notation which is used throughout this thesis,
cf. section 2.1, and provide some standard results from different areas which
are relevant for the content following in the next chapters. Since we encounter
various optimization problems throughout this thesis, the most relevant re-
sults of nonlinear optimization are summarized in section 2.2. Furthermore, in
section 2.3 we recap the relevant concepts from functional analysis, whereas in
section 2.4 we state standard existence, uniqueness, and stability results for dif-
ferential equations. In section 2.5, we summarize the general idea of parametric
model order reduction and present some common techniques based on linear
approximation ansatzes. Finally, some basic properties of port-Hamiltonian
systems and a corresponding structure-preserving model reduction scheme are
addressed in section 2.6.
2.1. Notation
The sets of natural, real, and integer numbers are denoted with N,R, and Z,
respectively. Here, we use the convention that Ndoes not contain 0. Further-
more, for a general field F, we use Fm,n for the set of m×nmatrices. For
column vectors, we abbreviate Fm,1as Fmand for the special case F=R, we
write k·k for the Euclidean norm on Rm. Moreover, given a symmetric and
positive definite matrix ARm,m, we use the notation k·kAfor the associated
weighted norm defined by kxkA:=x>Ax for all xRm. In addition, the
Frobenius norm of a matrix ARm,n is denoted with kAkFand the Frobenius
inner product of two matrices A, B Rm,n with hA, BiF. Besides, we write
[A]jto refer to the jth column vector of a matrix Aand [A]i,j for the ith
entry of [A]j. Conversely, the definition of an m×nmatrix Aby its entries
a1,1, a1,2, . . . , am,n Fis denoted as A:= [ai,j]ij Fm,n. Similarly, we define a
column vector vof length nby its entries via v:= [vi]iFn. The n×niden-
tity matrix is denoted by Inand its entries can be described by the Kronecker
delta, i.e., [In]i,j =δij for i, j = 1, . . . , n or, equivalently, In= [δij]ij Fn,n.
Moreover, for a column vector in Fnwhose entries are all equal to one, we
use the symbol 1n, while 0m×ndenotes an m×nmatrix whose entries are all
equal to zero. Furthermore, we introduce the following short-hand notation
37
2. Preliminaries
for (block-)diagonal and tridiagonal matrices:
diag (A1, . . . , An):=
A10··· 0
0A2....
.
.
.
.
.......0
0··· 0An
,
tridiagn(a, b, c):=
b c 0··· 0
a b c ....
.
.
0a b ...0
.
.
..........c
0··· 0a b
Rn,n.
Here, A1, . . . , Ancan be matrices of any size, whereas a, b, c are scalars. The
Kronecker product of two matrices AFm,n,BFp,q is denoted as AB,
i.e.,
AB=
a1,1B··· a1,nB
.
.
.....
.
.
am,1B··· am,nB
Rmp,nq.
For a matrix AFm,n, we use the notations ker(A)and im(A)for the kernel
and the image of the associated linear mapping x7→ Ax from Fnto Fm, respec-
tively. Furthermore, the transpose of Ais denoted with A>and, for the special
case F=R, the Moore–Penrose pseudoinverse of Ais denoted with A+, cf. [57,
ch. 1]. Moreover, to indicate that a matrix ARm,m is positive (semi-)definite,
we write A > 0(A0). Besides, we use the notation σmax(A)and σmin(A)
for the largest and the smallest singular values of ARm,n, respectively. If
Ais invertible, we denote its condition number σmax(A)σmin(A)1with κ(A).
In the special case of a symmetric matrix A, we use λmax(A)and λmin(A)for
the largest and the smallest eigenvalues of A, respectively. If ARn,n is in
addition positive semi-definite, we call the uniquely determined symmetric and
positive semi-definite matrix BRn,n satisfying B2=Athe square root of A
and denote it with A1
2, see for instance [150, Thm. 7.2.6].
Given xRand yR\{0}, we define the corresponding modulo operation
via
xmod y:=xy$x
y%,
where b·c:RZdenotes the floor function. Besides, for real scalars x1, . . . , xn
with nN, we define max(x1, . . . , xn):=xkwhere xksatisfies xkxifor all
i= 1, . . . , n and similarly we use min(x1, . . . , xn)for the minimal value.
For metric spaces Xand Y, we use the notation C(X, Y )for the set of
continuous mappings from Xto Yand, in particular, C(X):=C(X, R)for
the special case Y=R. In addition, the closure of a subset Uof a metric space
Xis denoted by U. Besides, we use the notations [a, b],[a, b),(a, b], and (a, b)
38
2.2. Nonlinear Optimization
for intervals between the real numbers aand b>a, where a square bracket is
used if the respective boundary belongs to the interval and a round bracket if
not.
Throughout this thesis, we use Lagrange’s notation f0for the total derivative
of a function for alternatively Newton’s notation ˙
for Leibniz’s notation df
dt
in the special case where the independent variable corresponds to the time t,
cf. [55, ch. II]. Furthermore, the partial derivatives of a function depending on
several variables, e.g. f(x1, . . . , xn), are denoted by x1f:=f
x1etc. Moreover,
if one of the independent variables, e.g. x1, is an element of Rn, we use the
notation x1ffor the partial gradient of fwith respect to x1.
For a general set M, we use the notation |M|for its cardinality and P(M)
for its power set, i.e., for the set of all subsets of M. Moreover, the set of all
finite subsets of Mis denoted as Pf(M). Finally, the identity operator which
maps every element of Mto itself is denoted as IdM.
2.2. Nonlinear Optimization
In this section we recall some basic definitions and theorems from uncon-
strained nonlinear optimization, as these are relevant for most of the chapters
of this thesis. To this end, we consider an optimization problem of the form
min
xRnJ(x)(2.1)
with a cost function J:RnR. Before stating necessary and sufficient
conditions for solutions of this optimization problem, we introduce the notion
of local and global minimum points, see for instance [24, sec. 4.1].
Definition 2.2.1 (Local and global minimum point).Let nNand J:Rn
Rbe given. A point ˆxRnis called a local minimum point of Jif there exists
R>0such that J(ˆx)J(x)holds for all xRnwith kxˆxk< . If there
even exists R>0such that J(ˆx)< J(x)is satisfied for all xRn\ {ˆx}
with kxˆxk< , then ˆxis called a strict local minimum point of J. Similarly,
we define the terms global minimum point and strict global minimum point by
omitting the restriction kxˆxk< .K
The following theorem provides necessary first-order and second-order opti-
mality conditions for a local minimum point. In particular, we call each point
xRnsatisfying (2.2) a critical point of Jor of the associated minimization
problem (2.1). A sufficient condition for a critical point to be a strict local
minimum point is provided in Theorem 2.2.3. The proofs of both theorems
may for example be found in [24, sec. 4.1].
Theorem 2.2.2 (Necessary optimality conditions).For given nNand
J:RnR, let ˆxRnbe a local minimum point of (2.1) and let Jbe
39
2. Preliminaries
continuously differentiable at ˆx. Then, the derivative of Jvanishes at ˆx, i.e.,
J0(ˆx)=0.(2.2)
Furthermore, if Jis even twice continuously differentiable at ˆx, then J00(ˆx)is
positive semi-definite.
Theorem 2.2.3 (Sufficient second-order optimality condition).For given n
Nand J:RnR, let ˆxRnbe a critical point of (2.1) and let Jbe twice
continuously differentiable at ˆx. If J00(ˆx)is positive definite, then ˆxis a strict
local minimum point of J.
In the special case where Jis continuously differentiable and convex, cf. Def-
inition 2.2.4, it turns out that a simple sufficient condition may be formulated
even for global minima, see Theorem 2.2.5 and [216, p. 53] for a corresponding
proof. Furthermore, a sufficient condition for convexity is provided in Theo-
rem 2.2.6, cf. [216, p. 55].
Definition 2.2.4 (Convex continuously differentiable functions).For nN,
we call fC1(Rn)convex on Rnif it satisfies
f(y)f(x) + f0(x)(yx)for all (x, y)Rn.K
Theorem 2.2.5 (Sufficient first-order optimality condition for a convex func-
tion).Let JC1(Rn)with nNbe convex and let J0(ˆx) = 0 hold for some
ˆxRn. Then, ˆxis a global minimum point of J.
Theorem 2.2.6 (Sufficient condition for convexity).Let fC2(Rn)with
nNbe given and let f00(x)be positive semi-definite for all xRn. Then, f
is convex.
2.3. Functional Analysis
In this section, we recall some standard notions and results from functional
analysis. In particular, we commence by introducing Lebesgue and Sobolev
spaces and afterwards address linear bounded operators as well as differentiable
mappings between Banach spaces.
Lebesgue and Sobolev Spaces
We start by introducing Lebesgue spaces following [12, ch. X]. To this end,
we consider a complete σ-finite measure space (Ω,A, µ), cf. [12, ch. IX], and
first introduce the notions of µ-simple and µ-measurable functions from to
a Banach space B.
Definition 2.3.1 (µ-simple and µ-measurable functions).Let (Ω,A, µ)be a
complete σ-finite measure space and (B,k·kB)a Banach space. Then, we call
a function f: B µ-simple if each of the following conditions is satisfied:
40
2.3. Functional Analysis
(i) f(Ω) is a finite set,
(ii) for every b B, the fiber of bunder fis an element of A,
(iii) µ(f1(B\{0})) <.
Besides, we call f: B µ-measurable if there exists a sequence of µ-simple
functions converging µ-almost everywhere to f. Moreover, the set of all µ-
measurable functions from to Bis denoted with L0(Ω, µ, B).K
If f: B is µ-measurable, then also kfkp
B: R0is a µ-measurable
function for any pR1, cf. Remark X.1.2(d) and Theorem X.1.7(i) in [12].
In particular, this allows to define the integral Rkfkp
Bdµas in [12, p. 98] and
based on this we introduce the Lebesgue spaces.
Definition 2.3.2 (Lebesgue spaces).Let (Ω,A, µ)be a complete σ-finite mea-
sure space and (B,k·kB)a Banach space. For each pR1, the set of all
µ-measurable functions f: B satisfying
Z
kfkp
Bdµ <
is denoted with Lp(Ω, µ, B). Furthermore, we use the notation L(Ω, µ, B)for
the set of all µ-measurable functions f: B for which there exists rR0
with µ({s| kf(s)kB> r})=0.K
In [12, sec. X.4] it is shown that Lp(Ω, µ, B)as in Definition 2.3.2 is a vector
space for all p[1,]. Moreover, it is proven in [12, Rem. X.4.4(b)] that the
set
N:={f L0(Ω, µ, B)|f= 0 µ-almost everywhere}
is a subspace of Lp(Ω, µ, B)for all p[1,]. In particular, this allows to
introduce the corresponding quotient spaces
Lp(Ω, µ, B):=Lp(Ω, µ, B)/N
for all p[1,]. While the elements of Lp(Ω, µ, B)are functions from to B,
Lp(Ω, µ, B)consists of equivalent classes of functions which coincide µ-almost
everywhere. As is common in the literature and by abuse of notation, we do
usually not distinguish between the equivalent class [f]Lp(Ω, µ, B)and its
representatives.
If Bis a real Hilbert space with inner product ,·iB, then also L2(Ω, µ, B)
is a real Hilbert space with inner product
hf, giL2(Ω,µ,B):=Z
hf, giBdµ,
cf. [12, Thm. X.4.10(ii)]. Moreover, in most examples of this thesis, we consider
the case of being an open or a closed subset of Rnwith λn(Ω) >0, where λn
41
2. Preliminaries
denotes the n-dimensional Lebesgue measure, cf. [12, sec. IX.4]. In this case,
we use the abbreviations Lp(Ω,B):=Lp(Ω, λn,B)and Lp(Ω) :=Lp(Ω,R)for
all p[1,]. In particular, for the case that is open and pin R1, it is
stated in [2, Thm. 2.21] that Lp(Ω) is separable, i.e., it has a countable dense
subset. Sometimes, we also consider the special case where is a non-empty
finite subset of Rand then we use the counting measure H0, cf. [12, sec. 9.2],
and the notation Lp(Ω) :=Lp(Ω,H0,R).
In the following, we present a second important class of function spaces,
namely the Sobolev spaces, cf. [2]. To this end, we consider a non-empty open
set Rnand we start by introducing the notion of weak partial derivatives
of a locally integrable function. A function fwhich is almost everywhere
defined on is called locally integrable in , if it satisfies fL1(M)for all
non-empty open sets Mwhich are bounded and satisfy M. Furthermore,
the set of locally integrable functions in is denoted with L1
loc(Ω). Especially,
Lp(Ω) is contained in L1
loc(Ω) for all p[1,], cf. [2, Cor. 2.15].
We proceed with the definition of weak partial derivatives. For this purpose,
we call a vector αin (Z0)namultiindex of order |α|:=Pn
i=1 αi. Given such
a multiindex αand a function u: R, we define
Dαu:=α1
ξ1···αn
ξnu, (2.3)
where ξidenotes the partial derivative with respect to the ith component of
for i= 1, . . . , n. If there exists for a given uL1
loc(Ω) and a multiindex
α(Z0)na function vL1
loc(Ω) satisfying
Z
ϕ(ξ)v(ξ) dξ= (1)|α|Z
u(ξ)Dαϕ(ξ) dξfor all ϕC
c(Ω),
then we call vthe α-th weak partial derivative of uand write Dαu=v. Here,
C
c(Ω) denotes the space of infinitely differentiable functions with compact
support in , cf. [2, ch. 1]. In fact, it can be shown that the weak derivative, if
it exists, is uniquely determined up to a set of measure zero, cf. [94, sec. 5.2.1].
Based on the notion of weak derivatives, we introduce the Sobolev spaces.
Definition 2.3.3 (Sobolev space).For nN, a non-empty open set Rn,
p[1,], and kZ0, we define the Sobolev space Wk,p(Ω) via
Wk,p(Ω) :={uLp(Ω) |DαuLp(Ω) for all α(Z0)nwith |α| k},
where Dαudenotes the α-th weak partial derivative of u.K
In the proof of [2, Thm. 3.3] it is shown that the Sobolev spaces equipped
with suitable norms are Banach spaces. For instance, for pR1and k
and as in Definition 2.3.3, a norm for Wk,p(Ω) is given by the mapping
42
2.3. Functional Analysis
k·kWk,p(Ω) :Wk,p(Ω) Rdefined via
kukp
Wk,p(Ω) =X
|α|≤kZ
|Dαu(ξ)|pdξ.
Moreover, the Sobolev space Wk,2(Ω) is a Hilbert space with inner product
,·iWk,2(Ω) :Wk,2(Ω) ×Wk,2(Ω) Rdefined as
hu, viWk,2(Ω) :=X
|α|≤khDαu, DαviL2(Ω)
for all kZ0, cf. [2, Thm. 3.6]. Accordingly, we use in the following the
abbreviation Hk(Ω) :=Wk,2(Ω).
In this thesis, we are especially interested in H1(Ω) and often consider the
case where is given by an open interval in R. If this is the case, then every
equivalence class in H1(Ω) has a continuous representative in as the following
theorem specifies, cf. [46, Thm. 8.2].
Theorem 2.3.4 (Continuous representatives in H1(Ω)).Let fH1(Ω) with
Ω=(a, b),aR,bR>a be given. Then, there exists a function ˜
fC(Ω)
such that fand ˜
fcoincide almost everywhere on and
˜
f(ξ2)˜
f(ξ1) =
ξ2
Z
ξ1
D1f(ξ) dξ
holds for all ξ1, ξ2.
By Theorem 2.3.4, we may define pointwise evaluations for elements of
H1((a, b)) in [a, b]via the respective continuous representatives. In particu-
lar, this allows to define the subspaces
H1
0(Ω) :={fH1(Ω) |f(a) = f(b)=0},
H1
per (Ω) :={fH1(Ω) |f(a) = f(b)},
which correspond to vanishing and periodic boundary values, respectively.
Linear Operators and Differentiable Mappings
As mentioned in section 1.1, the approximation ansatz (1.4) involves a trans-
formation family T, which is pointwise a linear bounded operator. In Defini-
tion 2.3.5, we provide a formal definition of such operators, see also for instance
[11, sec. VI.2]. Especially, one may show that L(X, Y )is a vector space and
that k·kL(X,Y )is indeed a norm for non-trivial X, cf. [11, p. 13f.].
Definition 2.3.5 (Linear bounded operator).Let (X, k·kX),(Y, k·kY)be two
normed spaces. We call T:XYalinear bounded operator from Xto Y,
43
2. Preliminaries
if Tis linear and there exists cR0such that
kTxkYckxkXfor all xX.
Moreover, the set of linear bounded operators from Xto Yis denoted with
L(X, Y )and in the special case X=Ywe write L(X):=L(X, X). In
addition, for the case that Xis non-trivial, i.e., X6={0}, we introduce the
operator norm k·kL(X,Y ):L(X, Y )R0via
kTkL(X,Y ):= sup
xX\{0}
kTxkY
kxkX
.K
While Definition 2.3.5 is formulated for general normed spaces, we often
consider the special case of Hilbert spaces. In this context, the notions of
adjoint and unitary operators are provided in the following.
Definition 2.3.6 (Adjoint operator).Let (H1,,·iH1)and (H2,,·iH2)be
real Hilbert spaces and T:H1H2a linear bounded operator. Then, we call
the unique operator T L(H2, H1)satisfying
hTx, yiH2=hx, TyiH1for all xH1, y H2
the adjoint operator of T.K
Definition 2.3.7 (Unitary operator).For given real Hilbert spaces H1and H2,
we call T L(H1, H2)unitary if it satisfies TT= IdH2and TT= IdH1.K
The fact that there exists a uniquely determined adjoint for each operator
between two Hilbert spaces H1and H2is for instance shown in the proof
of [155, Thm. 2.4.2]. Moreover, a direct consequence of Definition 2.3.7 is
that a unitary operator is isometric, i.e., it satisfies kTxkH2=kxkH1and
hTx, TyiH2=hx, yiH1for all x, y H1, cf. [158, p. 257f.].
In the remainder of this section, we introduce the differentiability notions
used in this thesis and start with the differentiability of mappings between
Banach spaces, cf. [11, p. 149].
Definition 2.3.8 (Differentiability of functions between Banach spaces).Let
(X, k·kX)and (Y, k·kY)be real Banach spaces and let Ube an open subset of
X. Then, we call a mapping f:UYdifferentiable at u0Uif there exists
L L(X, Y )satisfying
lim
uu0
f(u)f(u0)L(uu0)
kuu0kX
= 0.
Furthermore, if fis differentiable at every u0U, then we say that fis
differentiable.K
44
2.3. Functional Analysis
We emphasize that the mapping Lfrom Definition 2.3.8 depends on u0and
in particular that it is uniquely determined for given u0U, provided that fis
differentiable at u0, cf. [11, Prop. VII.2.1(iii)]. If fis differentiable, then we call
the function which maps u0Uto the corresponding linear bounded operator
L, the derivative of fand denote it by f0. If additionally f0is continuous, then
we call fcontinuously differentiable and write fC1(U, Y ). Furthermore, if
a function f:UYis differentiable at a point u0Uas in Definition 2.3.8,
then fis also continuous at u0, see [11, Prop. VII.2.1(ii)]. Besides, in this
thesis we often make use of the chain rule as it is formalized in Theorem 2.3.9,
cf. [11, Thm. VII.3.3]. Another important property is the generalized product
rule stated in Theorem 2.3.11, which is a special case of [11, Cor. VII.4.7].
It uses the notion of a bounded or, equivalently, continuous bilinear form as
defined in Definition 2.3.10, cf. [11, sec. VII.4].
Theorem 2.3.9 (Chain rule).Let X,Y, and Zbe real Banach spaces and
UX, V Ybe open subsets. If f:UYwith f(U)Vis differentiable
at uUand g:VZis differentiable at f(u), then gfis differentiable at
uwith derivative
(gf)0(u) = g0(f(u)) (f0(u)).
Definition 2.3.10 (Bounded bilinear form).Let (X, k·kX)and (Y, k·kY)be
real Banach spaces. Then, we call a bilinear form a:X×YRbounded, if
there exists a constant βR0with
|a(x, y)| βkxkXkykYfor all xX, y Y. K
Theorem 2.3.11 (Generalized product rule).Let Xbe a real Banach space,
URbe an open set, a:X×XRbe a bounded bilinear form, and
f1, f2:UXbe continuously differentiable. Then, the map g:URdefined
via g(u):=a(f1(u), f2(u)) is continuously differentiable with derivative
g0=a(f0
1, f2) + a(f1, f0
2).
The differentiability definition provided in Definition 2.3.8 considers a rather
general setting with Banach spaces, but is restricted to functions defined on
an open set. In this thesis, we often also consider functions which are defined
for instance on a closed interval and for such functions we consider the dif-
ferentiability notion presented in Definition 2.3.12, cf. [10, sec. IV.1]. If fis
differentiable in the sense of Definition 2.3.12 with continuous derivative f0,
then we call fcontinuously differentiable and write fC1(U, X). Similarly as
for the differentiability notion from Definition 2.3.8, also Definition 2.3.12 leads
to a chain rule for the derivative of a composition of differentiable functions,
see [10, Thm. IV.1.7]. For the special case U= [a, b]with aR,bR>a, we
introduce the notation C1
per([a, b], X)for the subset
C1
per([a, b], X):={fC1([a, b], X)|f(a) = f(b), f0(a) = f0(b)}.
45
2. Preliminaries
Definition 2.3.12 (Differentiability at a limit point).Let Ube a subset of R,
u0Ube a limit point of U, and Xbe a real normed vector space. Then, we
call a function f:UXdifferentiable at u0if the limit
f0(u0):= lim
uu0
f(u)f(u0)
uu0X
exists. In that case we call f0(u0)the derivative of fat u0. Furthermore, if
Uis dense-in-itself, i.e., every element of Uis a limit point of U, and if fis
differentiable at every u0U, then we say that fis differentiable.K
So far, we have only addressed differentiable functions which are defined on
an open set or on a dense-in-itself subset of R. In addition, we consider also
the differentiability of functions which map from URnto Rmwith m, n N,
U=J1×J2×···×Jn, and J1, . . . , JnRbeing proper intervals. Especially,
every element of Uis a limit point of Uand one can introduce the notion of
differentiability analogously to Definition 2.3.8. Also most of the important
properties like uniqueness of the derivative, the fact that differentiability im-
plies continuity, and the chain rule may be proven following the lines of the
corresponding proofs for differentiable functions defined on open sets. Simi-
larly as for the other differentiability notions, we denote the set of continuously
differentiable functions from URnto Rmwith C1(U, Rm).
2.4. Differential Equations
In section 1.1, we have considered a differential equation as full-order model
which describes the time evolution of the X-valued state x. Such equations
are also referred to as abstract evolution equations. In section 2.4.1, we spec-
ify how classical solutions of the associated initial value problems are defined
and discuss the existence and uniqueness of such solutions for a special class
of right-hand sides. Afterwards, we address finite-dimensional systems of dif-
ferential equations in section 2.4.2, where we also introduce the notions of
equilibrium points and stability.
2.4.1. Abstract Evolution Equations and Semigroups
In the following, we consider an abstract evolution equation of the form
˙x(t) = F(t, x(t)) for all tI:= [0, tend],(2.4a)
x(0) = x0(2.4b)
with tend R>0,F:R0×WX, and x0W, where Xis a real Banach
space and WXa subspace. We call x:IXaclassical solution of the
initial value problem (2.4) if xis continuously differentiable, x(t)Wfor all
tI, and (2.4) holds.
46
2.4. Differential Equations
Before we address the question of existence and uniqueness of classical solu-
tions for a special class of right-hand sides F, we introduce strongly continuous
semigroups and their generators in Definitions 2.4.1 and 2.4.2, see also Defini-
tions I.5.1 and II.1.2 in [92].
Definition 2.4.1 (Strongly continuous semigroup).Let Bbe a Banach space.
Then, we call T:R0 L(B)astrongly continuous semigroup if it satisfies
the following properties:
(i) T(t+s) = T(t)T(s)for all s, t R0,
(ii) T(0) = IdB,
(iii) for every x B, the orbit map ξx:R0 B defined via ξx(t):=T(t)xis
continuous. K
Definition 2.4.2 (Generator of a strongly continuous semigroup).Let Bbe
a Banach space and T:R0 L(B)a strongly continuous semigroup. Then,
based on the domain
D(A):=x B | lim
h&0
1
h(T(h)xx)exists,
we define the generator A:D(A) B of Tvia
Ax := lim
h&0
1
h(T(h)xx).K
From Definition 2.4.2 it follows that the domain of a generator is a sub-
space and the generator itself is linear, cf. [92, Lemma II.1.3]. Furthermore,
a generator is bounded if and only if its domain coincides with the complete
underlying Banach space B, cf. [92, Cor. II.1.5].
In the following, we consider the initial value problem (2.4) for the special
case where the right-hand side Fsatisfies
F(t, x):=Ax +f(t)for all (t, x)R0×W,(2.5)
where Ais the generator of a strongly continuous semigroup and fis contin-
uously differentiable. A sufficient condition for the existence and uniqueness
of classical solutions is provided in Theorem 2.4.3, see [99, sec. I.5] for a cor-
responding proof.
Theorem 2.4.3 (Unique solution of (2.4) for special F).Let Xbe a Banach
space and Abe the generator of a strongly continuous semigroup T:R0
L(X)with domain D(A) =:W. Furthermore, let x0be an element of Wand
f:R0Xbe continuously differentiable on I= [0, tend]with tend R>0.
Besides, let F:R0×WXbe given by (2.5). Then, there exists a unique
47
2. Preliminaries
solution to the initial value problem (2.4), which is given by
x(t) = T(t)x0+
t
Z0
T(ts)f(s) dsfor all tI.
Theorem 2.4.4 (Growth bound for strongly continuous semigroups).Let B
be a Banach space and T:R0 L(B)be a strongly continuous semigroup.
Then, there exist ωRand MR1satisfying
kT(t)kL(B)Meωt for all tR0.
Finally, we note that the growth bound presented in Theorem 2.4.4, cf. [92,
Prop. I.5.5], allows to bound the unique solution mentioned in Theorem 2.4.3
via
kx(t)kX kT(t)x0kX+
t
Z0
T(ts)f(s) dsX
kT(t)x0kX+
t
Z0kT(ts)f(s)kXds
kT(t)kL(X)kx0kX+
t
Z0kT(ts)kL(X)kf(s)kXds
Meωt
kx0kX+
t
Z0
eωs kf(s)kXds
(2.6)
for all tI, where the second inequality follows from [12, Thm. X.2.11(i)].
2.4.2. Finite-Dimensional Systems of Differential Equations
In this subsection, we consider evolution equations on a finite-dimensional
space X=W=Rngiven by
E(t, x(t)) ˙x(t) = F(t, x(t)) for all tI,(2.7a)
x(t0) = x0(2.7b)
with time interval I= [t0,)or I= [t0, tend]with t0R0and tend R>t0,
a mass matrix E:R0×RnRn,n, a right-hand side F:R0×RnRn,
and an initial value x0Rn. In the following we only consider the case where
Eis at least invertible at (t0, x0). For the more general case of a singular
mass matrix, we refer to the differential-algebraic equations literature, see for
instance [175] and the references therein.
We call xC(I,Rn)asolution of (2.7a) if xis differentiable in Iand satisfies
(2.7a). If additionally (2.7b) holds, then we call xasolution of the initial value
48
2.4. Differential Equations
problem (2.7). Theorem 2.4.5 provides sufficient conditions for the initial value
problem (2.7) to have locally a unique solution and is based on a corresponding
result for standard ODE systems with E=In, cf. [136, Thm. 1.1]. Based on
additional assumptions on Eand F, one may also provide statements about
the maximal interval of existence as detailed in Theorem 2.4.6.
Theorem 2.4.5 (Local existence and uniqueness of solutions of (2.7)).Con-
sider the initial value problem (2.7) with I= [t0, tend],t0= 0,tend R>0, and
x0Rn, and let E:R0×RnRn,n and F:R0×RnRnbe continu-
ous. Furthermore, let E(t0, x0)be invertible and let there exist a neighborhood
of (t0, x0)in R0×Rnwhere xEand xFexist and are continuous. Then,
the initial value problem (2.7) has exactly one solution, provided that tend is
sufficiently small.
Proof. Since the set of invertible matrices is open, cf. [96, Prop. 1.2.1], and
due to the assumptions on Eand F, we infer that there exist 1, 2R>0such
that xEand xFexist and are continuous on the compact set
R:={(t, x)[0, 1]×Rn| kxx0k 2}
and such that E(t, x)is invertible for all (t, x)R. Moreover, since Eand
xEare continuous on R, this is also true for the pointwise inverse E1and
its partial derivative with respect to x, cf. [57, Prop. 10.5.1] and Theorem 2
in [178, p. 124]. Consequently, also the function ˜
F:RRndefined via
˜
F(t, x):=E1(t, x)F(t, x)is continuous as well as its partial derivative x˜
F.
Especially, we infer that ˜
Fis uniformly Lipschitz continuous with respect to
x, cf. [11, Rem. VII.3.11(b)]. Thus, by means of classical ODE theory, see for
instance [136, Thm. 1.1], we obtain that the initial value problem
˙x(t) = ˜
F(t, x)for all t˜
I,(2.8a)
x(0) = x0(2.8b)
has a unique solution on the time interval ˜
I= [0, 3]with
3:= min 1,2
M,
where MR>0is an upper found for k˜
Fkon R. In particular, this solution
also solves the original initial value problem (2.7) on the time interval [0, 3],
which proves the existence. To also show the uniqueness, let ˜xbe a solution of
the initial value problem (2.7) on the time interval [0, 3]. If k˜x(t)x0k< 2
holds for all t[0, 3], then E(t, ˜x(t)) is invertible for all t[0, 3]and ˜xhas
to coincide with the unique solution of (2.8). Otherwise, the set A:={t
[0, 3]| k˜x(t)x0k 2}is non-empty. Especially, we observe that Ais the
preimage of the closed set B:={zRn| kzx0k 2}under ˜x. Thus, since
˜xis differentiable and hence continuous, the set Ais closed as well. Since A
49
2. Preliminaries
is also bounded, it has a minimum value t1(0, 3]. Moreover, we must have
k˜x(t1)x0k=2, since otherwise the openness of {t[0, 3]| k˜x(t)x0k>
2}would imply the existence of t2(0, t1)with k˜x(t2)x0k> 2, which
would contradict the fact that t1is the minimum value in A. In particular, we
infer that E(t, ˜x(t)) is invertible for all t[0, t1]and hence ˙
˜xcoincides with
E1(·,˜x(·))F(·,˜x(·)) on [0, t1]. As a consequence, ˙
˜xis continuous on [0, t1]and
we obtain the bound
2=k˜x(t1)x0k=
t1
Z0
˙
˜x(t) dt=
t1
Z0
E1(t, ˜x(t))F(t, ˜x(t)) dtMt1,
which yields t12
M3t1and thus t1=3. Hence, we conclude that
k˜x(t)x0k 2must hold for all t[0, 3]and, consequently, ˜xcoincides
with the unique solution of (2.8), which proves the uniqueness.
Theorem 2.4.6 (Maximal solution).Consider the ODE system (2.7a) with
FC1(R0×Rn,Rn)and pointwise invertible EC1(R0×Rn,Rn,n). Then,
for each (t0, x0)R0×Rnexactly one of the following two statements is true.
(i) There exists a unique δmax R>t0such that for any tend (t0, δmax)the
initial value problem (2.7) with I= [t0, tend]and x(t0) = x0has a unique
solution, but not for any larger end time tend δmax. Furthermore, we
have lim
t%δmax kx(t)k=.
(ii) For any tend R>t0, the initial value problem (2.7) with I= [t0, tend]and
x(t0) = x0has a unique solution.
Proof. For given (t0, x0)R0×Rn, we observe that for any tend R>t0
the function x:IRnwith I= [t0, tend]is a solution of (2.7) if and only if
˜x: [0, tend t0]Rnwith ˜x(t):=x(t+t0)solves the initial value problem
˙
˜x(t)=(E(t+t0,˜x(t)))1F(t+t0,˜x(t)) for all t[0, tend t0
|{z }
=:˜
tend
],
˜x(0) = x0.
(2.9)
Furthermore, using similar arguments as in the proof of Theorem 2.4.5, we
infer that ˜
F:R0×RnRndefined via ˜
F(t, ˜x):= (E(t+t0,˜x))1F(t+t0,˜x)
is continuously differentiable. Then, Corollary 23 in [280, p. 38] implies that
either (2.9) has a unique solution for any ˜
tend R>0or that there exists
a unique ˜
δmax R>0such that (2.9) has a unique solution for any ˜
tend
(0,˜
δmax), but not for any ˜
tend ˜
δmax. Furthermore, in the latter case we
have lim
t%˜
δmax k˜x(t)k=. The claim then follows from transferring these state-
ments from the shifted initial value problem (2.9) to the original one (2.7) via
introducing δmax :=˜
δmax +t0.
50
2.4. Differential Equations
We emphasize that the right endpoint δmax of the maximal existence interval
in Theorem 2.4.6 depends in general on the initial value x0and the initial time
point t0. Moreover, we note that also Theorem 2.4.6 is based on standard ODE
theory, cf. [280, sec. 2.4].
Stability
In the following, we introduce some common stability notions for the system
(2.7a) and in particular introduce a sufficient stability criterion in terms of the
existence of a Lyapunov function. First, we introduce the term equilibrium
point in Definition 2.4.7 and associated stability concepts in Definition 2.4.8,
cf. [280, sec. 5.1].
Definition 2.4.7 (Equilibrium point).For given F:R0×RnRnand
pointwise invertible E:R0×RnRn,n, we call xRnan equilibrium point
of (2.7a) if F(t, x)=0holds for all tR0.K
Definition 2.4.8 (Stability).Consider the system (2.7a) with FC(R0×
Rn,Rn), pointwise invertible EC(R0×Rn,Rn,n), and equilibrium point
0Rn. Furthermore, assume that for each initial condition x(t0) = x0with
t0R0and x0Rn, the initial value problem (2.7) has a unique solution on
[t0,). We denote the evaluation of this solution at tRt0by s(t, t0, x0).
(i) We call the equilibrium point 0stable, if for each R>0and each
t0R0there exists a δR>0such that
ks(t, t0, x0)k< (2.10)
holds for all (t, x0)Rt0×Rnwith kx0k< δ.
(ii) We call the equilibrium point 0uniformly stable, if for each R>0there
exists a δR>0such that (2.10) holds for all t0R0,tRt0, and
x0Rnwith kx0k< δ.
(iii) We call the equilibrium point 0globally exponentially stable, if there exist
constants a, b R>0such that
ks(t, t0, x0)k akx0keb(tt0)
holds for all t0R0,tRt0, and x0Rn.K
The following definition of a Lyapunov function is inspired by standard Lya-
punov theory for ODE systems with E=In, see for instance [162, Thm. 4.10],
and by the port-Hamiltonian formulation introduced in [196]. Moreover, The-
orem 2.4.10 gives a relation between the existence of a Lyapunov function as
defined in Definition 2.4.9 and stability of the equilibrium point 0.
51
2. Preliminaries
Definition 2.4.9 (Globally quadratic Lyapunov function).We consider the
system (2.7a) with Eand Fas in Definition 2.4.7 and call V:R0×RnR
aglobally quadratic Lyapunov function of (2.7a) if the following conditions are
satisfied.
(i) The function Vis continuously differentiable. Moreover, there exist a
function z:R0×RnRnand a constant c1R0such that for all
(t, x)R0×Rnwe have
xV(t, x) = E(t, x)>z(t, x),(2.11)
and tV(t, x) + z(t, x)>F(t, x) c1kxk2.(2.12)
(ii) There exist constants c2, c3R>0with
c2kxk2V(t, x)c3kxk2for all (t, x)R0×Rn.
In addition, we call a globally quadratic Lyapunov function which satisfies (i)
for some c1R>0astrong globally quadratic Lyapunov function.K
Theorem 2.4.10 (Lyapunov’s theorem for (2.7a)).We consider the system
(2.7a) with Eand Fas in Definition 2.4.8. Then, the following assertions
hold.
(i) If there exists a globally quadratic Lyapunov function of (2.7a), then the
equilibrium point 0is uniformly stable.
(ii) If there exists a strong globally quadratic Lyapunov function of (2.7a),
then the equilibrium point 0is globally exponentially stable.
Proof. (i) Let t0R0and x0Rnbe arbitrary and let s(t, t0, x0)denote
the evaluation of the solution of the corresponding initial value problem
(2.7) at tRt0. In particular, since Fis continuous and Eis continuous
and pointwise invertible, we infer from (2.7a) that the solution s(t, t0, x0)
is continuously differentiable with respect to t. Using this solution as well
as a globally quadratic Lyapunov function Vof (2.7a), we introduce the
mapping ˜
V:Rt0Rdefined via ˜
V(t):=V(t, s(t, t0, x0)). Condition
(i) in Definition 2.4.9 implies that ˜
Vis continuously differentiable and
that its derivative satisfies
˙
˜
V(t) = tV(t, s(t, t0, x0)) + xV(t, s(t, t0, x0))>ts(t, t0, x0)
=tV(t, s(t, t0, x0)) + z(t, s(t, t0, x0))>E(t, s(t, t0, x0))ts(t, t0, x0)
=tV(t, s(t, t0, x0)) + z(t, s(t, t0, x0))>F(t, s(t, t0, x0))
0
for all tRt0. Thus, we infer
V(t, s(t, t0, x0)) = ˜
V(t)˜
V(t0) = V(t0, s(t0, t0, x0)) = V(t0, x0)
52
2.5. Parametric Model Order Reduction
for all tRt0. Using condition (ii) from Definition 2.4.9, we further
obtain that there exist constants c2, c3R>0such that for all tRt0
we have
ks(t, t0, x0)k21
c2
V(t, s(t, t0, x0)) 1
c2
V(t0, x0)c3
c2kx0k2.
This implies that (2.10) holds for instance when choosing δ=qc2
c3.
(ii) The proof follows similar arguments as the proof of (i). Since (2.12)
is satisfied with c1R>0, we may replace ˙
˜
V(t)0by the stronger
statement
˙
˜
V(t) c1ks(t, t0, x0)k2 c1
c3
V(t, s(t, t0, x0)) = c1
c3
˜
V(t)(2.13)
for all tRt0. Moreover, applying standard theory for differential
inequalities to (2.13), see for instance [9, Lem. (16.4)], we obtain
˜
V(t)˜
V(t0) exp c1
c3
(tt0)
for all tRt0. This in turn yields
ks(t, t0, x0)k21
c2
V(t, s(t, t0, x0)) = 1
c2
˜
V(t)
1
c2
˜
V(t0) exp c1
c3
(tt0)
=1
c2
V(0, x0) exp c1
c3
(tt0)
c3
c2kx0k2exp c1
c3
(tt0)
and, consequently,
ks(t, t0, x0)k sc3
c2kx0kexp c1
2c3
(tt0)
for all tRt0.
2.5. Parametric Model Order Reduction
In this section, we discuss model order reduction techniques for parametric
full-order models of the form
˙x(t;µ) = F(t, x(t;µ); µ)for all (t, µ)I×M, x(0; µ) = x0(µ),(2.14)
53
2. Preliminaries
with I= [0, tend],MRnp,F:R0×W×MX,x0:MW,x:I×
MW, real Hilbert space X, and subspace WX, cf. Remark 1.1.1.
The task of parametric model order reduction is to determine a reduced-order
model of state space dimension rdim(X), which is usually of the form
˙
˜x(t;µ) = ˜
F(t, ˜x(t;µ); µ)for all (t, µ)I×M,˜x(0; µ) = ˜x0(µ),
with ROM state ˜x:I×MRr, right-hand side ˜
F:R0×Rr×MRr, and
initial value ˜x0:MRr. An approximation of the full-order state is then
given by a mapping Vr:R0×Rr×MW, i.e., x(t;µ)Vr(t, ˜x(t;µ); µ)for
all (t, µ)I×M. Often, Vrdoes not explicitly depend on tor µand is linear
in the reduced state ˜x. This important special case is in particular subject to
the discussion in section 2.5.3.
Remark 2.5.1 (Goal-oriented model reduction).We emphasize that in many
applications one aims for an accurate approximation of some quantities of
interest or outputs of the form
y(t;µ) = C(t, x(t;µ); µ)for all (t, µ)I×M,
with output y:I×MRp, output mapping C:R0×W×MRp, and
output dimension pN. Depending on the application, one is interested in
approximating both the FOM state and the output or only in approximating
the output. For instance, in model reduction techniques for control systems
it is commonly exploited that not the complete FOM state needs to be ap-
proximated well, but typically only a few derived quantities. In this context,
one usually targets a good approximation of the input-output map, cf. [14]
and the references therein. Also when considering systems without control
input, so-called goal-oriented model reduction methods make use of the quan-
tities of interest and construct ansatz functions which are tailored for a good
approximation of these quantities, see for instance [41, 52, 265]. ¨
The whole process of parametric model order reduction is usually separated
into two phases: the offline and the online phase, see also the beginning of
chapter 1. The offline phase summarizes all steps required for constructing
a reduced-order model. Especially in the case of nonlinear full-order models,
this often involves simulations of the FOM and a subsequent determination of
modes or ansatz functions based on the simulated FOM data, cf. section 2.5.1.
The parameter samples which are used for simulating the FOM in the offline
phase may be selected via a greedy algorithm as presented in section 2.5.2.
Also the actual construction of the ROM belongs to the offline phase and may
for instance be achieved by means of a Galerkin projection, see section 2.5.3.
On the other hand, the online phase describes the actual use of the reduced-
order model, i.e., its evaluation for many different parameter configurations.
These multi-query evaluations of the ROM are, among others, often needed in
control or optimization tasks.
Since the offline phase is ideally only performed once, the requirements on its
54
2.5. Parametric Model Order Reduction
computational effort are usually less severe than for the evaluation of the ROM
in the online phase. In this context, the term efficient offline/online decompo-
sition is used if the computation of the online phase does not scale with the
dimension of the full-order model. Then, the evaluation of the reduced-order
model is typically much cheaper than the evaluation of the FOM. However, es-
pecially when the FOM is nonlinear, another approximation of the nonlinearity
is often required for achieving such an efficient offline/online decomposition,
cf. section 2.5.4.
2.5.1. Proper Orthogonal Decomposition
A common technique for determining modes as in the approximation ansatz
(1.2) is the POD method, see for instance [127, 148]. To this end, we assume
in this subsection that Xis a real separable Hilbert space, that WXis
a dense subspace, and that we have access to a solution trajectory xof the
FOM (1.1), for instance, by means of a numerical simulation. Furthermore,
we present the POD method for a solution trajectory xwhich depends only
on time but not on a parameter vector. Instead, in section 2.5.2 we present a
POD-greedy algorithm to handle the parameter-dependent case, see also [156]
for an alternative POD-based approach for parametric problems.
The POD method is based on a minimization of the approximation error by
considering the optimization problem
min 1
2
tend
Z0
x(t)
r
X
j=1
αj(t)φj
2
X
dt
s.t. φjWand hφi, φjiX=δij, i, j = 1, . . . , r
(2.15)
with given rdim(X). Since the best approximation within a subspace
is given by the orthogonal projection onto this subspace, we can replace the
coefficients αjby hx(t), φjiXfor j= 1, . . . , r. This results in the optimization
problem
min 1
2
tend
Z0
x(t)
r
X
j=1 hx(t), φjiXφj
2
X
dt
s.t. φjWand hφi, φjiX=δij, i, j = 1, . . . , r.
(2.16)
According to [127], this optimization problem may be solved by solving an
eigenvalue problem associated with the operator R:XXdefined via
Rφ:=
tend
Z0hx(t), φiXx(t) dt. (2.17)
We observe that if the solution xis in L2((0, tend),W), then Rφis in Wfor
all φX, see also the proof of [127, Lemma 1.24]. In that case, we obtain a
55
2. Preliminaries
solution of (2.16) as follows, cf. [127, Theorem 1.15].
Theorem 2.5.2 (Solution of the POD minimization problem).Let Xbe a
real separable Hilbert space, xbe in L2((0, tend),X)with tend R>0, and R
be as defined in (2.17). Then, there exist eigenvalues λiRand associated
orthonormal eigenfunctions φiXfor i I with
Rφi=λiφifor all i I :={1,...,dim(X)},
λiλi+1 0for all i I \{dim(X)},(2.18)
if Xis finite-dimensional, or
Rφi=λiφifor all i I :=N,
λiλi+1 0for all i I,(2.19)
otherwise. If xis additionally in L2((0, tend),W)with Wbeing a dense sub-
space of X, then for any r I with λr>0, the rleading eigenfunctions
φ1, . . . , φrform a solution of (2.16).
Remark 2.5.3 (Uniqueness of the POD modes).Since eigenfunctions are in
general not unique, but only the corresponding eigenspaces, we infer from
Theorem 2.5.2 that the POD modes and the solution of (2.16) are in general
not unique. However, as long as the eigenvalues are distinct, the only source
of non-uniqueness is given by the fact that we can replace φiby φifor any
i {1, . . . , r}, see also for instance [131, sec. 2.4.5]. ¨
For the actual computation of the POD modes, one usually considers a
finite-dimensional full-order model (2.14) with X=W=Rn, which is, for
example, obtained via semi-discretization of a PDE in space. This finite-
dimensional FOM is then simulated and the collected data of the state are
stored in a matrix XRn,q, i.e.,
X:=hx(t1)x(t2)··· x(tq)i.(2.20)
The columns are referred to as snapshots of the solution and the matrix Xis
called a snapshot matrix. The discrete counterpart of the minimization prob-
lem (2.16) may then be solved by computing a singular value decomposition
(SVD) of X, cf. [127, Rem. 1.10]. The leading rleft singular vectors constitute
the matrix Vrwhich can be used for obtaining a reduced-order model based on
a Galerkin projection, see section 2.5.3. Reasonable choices for the dimension
ressentially depend on the singular value decay. In particular, if the singular
values decay very fast, then a few modes are usually sufficient to obtain a
suitable approximation of the snapshot matrix. On the other hand, if the sin-
gular values decay rather slowly, then the POD method requires many modes
to achieve a reasonable approximation and, consequently, the evaluation of a
corresponding reduced-order model may be rather slow.
56
2.5. Parametric Model Order Reduction
2.5.2. POD-Greedy Algorithm
A common approach for sampling the parameter domain during the offline
phase is to use a POD-greedy algorithm as discussed in the following, see also
[131, sec. 2.4.5]. To this end, we consider a parametric FOM of the form (2.14)
with X=W=Rn. For the POD-greedy algorithm, cf. Algorithm 2.1, we
also consider a grid based on discrete time points t1, . . . , tqI, which allows
to compute a POD via an SVD as outlined at the end of section 2.5.1. Fur-
thermore, we require a function ε:Mtrain ×Pf(Rn)R0which can be used
as an error estimator and which usually involves constructing and evaluat-
ing a ROM based on the modes specified by the second input argument of ε,
see section 2.5.3 for more details on the ROM construction. Neglecting the
computational effort, an ideal choice would be the true squared online error
εtrue :Mtrain ×Pf(Rn)R0, which is defined via
εtrue (µ, {φ1, . . . , φr}):=
q
X
`=1 x(t`;µ)
r
X
i=1
˜xi(t`;µ)φi
2
,
where xand ˜xdenote the solutions of the FOM and of the ROM, respectively.
Here, we assume that the FOM and the ROM are uniquely solvable. Another
important error measure is the true squared projection error εopt :Mtrain ×
Pf(Rn)R0, which is defined as
εopt (µ, {φ1, . . . , φr}):=
q
X
`=1 x(t`;µ)Pspan{φ1,...,φr}x(t`;µ)2
and provides a lower bound for εtrue. Here, Pspan{φ1,...,φr}denotes the orthogonal
projection onto the span of φ1,. . .,φr. We note that both error measures εtrue
and εopt involve in general solving the FOM and, thus, their evaluation is
typically expensive. An alternative approach is based on residual-based error
bounds, which may be often computed in an efficient way, see [131] and the
references therein for more details.
During the iterations of the POD-greedy algorithm, the error estimator is
evaluated on a pre-defined parameter training set Mtrain Mand the algo-
rithm terminates once the estimated maximum error does not exceed the error
tolerance tol or the maximum number of iterations imax is reached. Besides,
the POD-greedy algorithm may be initialized with an empty basis or with a
given orthonormal basis Φ0.
Algorithm 2.1 is an iterative procedure and at the beginning of each itera-
tion the error estimator is evaluated on the training set Mtrain. When using
a residual-based error estimator, this step usually involves the construction
of a parametric ROM and multiple ROM evaluations. Here, an efficient of-
fline/online decomposition for the ROM and for the error estimator itself is
crucial for a reasonable computation time, especially if the training set Mtrain
is large. Afterwards, the worst-case value of the error estimator is compared to
57
2. Preliminaries
the tolerance and the algorithm terminates if the tolerance is met. Otherwise,
the FOM is solved for the worst-case parameter value to generate snapshots of
the FOM state. Here, we note that there may be some arbitrariness in line 5
of Algorithm 2.1, since the worst-case parameter value µmax is not necessarily
uniquely determined. After the FOM snapshots have been computed, they are
projected onto the orthogonal complement of the span of the current basis.
This ensures that the basis vector added in the last step of Algorithm 2.1 is
orthogonal to the other basis vectors. Consequently, the output basis is guar-
anteed to be orthonormal as well. We remark that the mode to be added in the
last step is in general not uniquely determined, especially if the multiplicity
of the largest singular value of the projected snapshot matrix is greater than
one, cf. Remark 2.5.3.
Algorithm 2.1 POD-greedy algorithm
Input:
FOM as in (2.14) with X=W=Rn,MRnp,n, npN
discrete time points t1, . . . , tqIwith 0 = t1< t2< . . . < tq=tend
and qN2
parameter training set Mtrain Mwith |Mtrain|<
initial set of ansatz vectors Φ0={φ1, . . . , φr0} Rnwith r0
{0, . . . , n}and hφi, φjiX=δij for i, j = 1, . . . , r0
error estimator ε:Mtrain ×Pf(Rn)R0
error tolerance tol R>0
maximum number of iterations imax N
Output:
set of ansatz vectors Φk={φ1, . . . , φrk} Rnwith kN{0}and
rkNr0, satisfying max
µMtrain
ε(µ, Φk)tol or k=imax
1: for i1to imax do
2: Evaluate the error estimator ε(·,Φi1)on Mtrain
3: if maxµMtrain ε(µ, Φi1)tol then return Φi1
4: end if
5: Solve the FOM for a parameter value µmax arg max
µMtrain
ε(µ, Φi1)
6: Project the FOM snapshots x(t1;µmax), . . . , x(tq;µmax)onto the orthog-
onal complement of span{Φi1}
7: Compute an SVD of the projected snapshot matrix and add the leading
left singular vector to Φi1and thereby obtain Φi
8: end for
Even without specifying a maximum number of iterations, the POD-greedy
algorithm terminates in theory after a finite number of steps, provided that
58
2.5. Parametric Model Order Reduction
the error estimator is zero whenever the true projection error is zero. This
follows from the fact that, as long as the maximum error is not small enough,
the POD-greedy algorithm continues extending the orthonormal basis until
the error criterion is met. Thus, since the FOM state is assumed to take val-
ues in a finite-dimensional space, the reduced basis is guaranteed to span this
complete state space after a finite number of steps. However, this theoreti-
cal property is in practice often of minor importance, since we usually desire
ROMs whose state space dimension is orders of magnitude smaller than the
one of the FOM. Thus, we are rather interested in the convergence speed of
the POD-greedy algorithm. In fact, under certain assumptions on the param-
eter training set Mtrain and on the error estimator ε, one can show that an
algebraic or exponential decay of the Kolmogorov n-widths, cf. Example 1.2.1,
leads to an algebraic or exponential convergence of the POD-greedy algorithm,
respectively, see for instance [131, Prop. 2.94].
2.5.3. Galerkin Projection
In sections 2.5.1 and 2.5.2, we have discussed approaches for determining suit-
able modes φibased on snapshot data of the FOM. The focus of this section
is to derive a reduced-order model which describes the time and parameter
dependency of the amplitudes αifor given modes φi. For this purpose, we as-
sume that the modes are linearly independent, as is for instance the case when
using one of the methods from sections 2.5.1 and 2.5.2. A common approach
for obtaining a reduced-order model is to perform a Galerkin projection. To
this end, we substitute the linear approximation ansatz
x(t;µ)ˆx(t;µ):=
r
X
i=1
αi(t;µ)φi
into the full-order model (2.14) to obtain the residual
r
X
i=1
˙αi(t;µ)φiF t,
r
X
i=1
αi(t;µ)φi;µ!(2.21)
at (t, µ)I×M. An evolution equation for the coefficients α1, . . . , αris
then obtained by enforcing the residual to be orthogonal to the span of the
modes φ1, . . . , φr. Moreover, we derive initial values for the coefficients via an
orthogonal projection of the FOM initial value x0onto the span of φ1, . . . , φr.
The resulting ROM reads
M˙
˜x(t;µ) = ˜
F(t, ˜x(t;µ); µ),˜x(0; µ) = ˜x0(µ)for all (t, µ)I×M(2.22)
59
2. Preliminaries
with mass matrix MRr,r, ROM state ˜x:I×MRr, right-hand side
˜
F:R0×Rr×MRr, and initial value ˜x0:MRrdefined as
M:=hhφi, φjiXiij ,(2.23a)
˜x(t;µ):= [αi(t;µ)]i,(2.23b)
˜
F(t, α;µ):=
*φi, F
t,
r
X
j=1
αjφj;µ
+X
i
,(2.23c)
˜x0(µ):=M1[hφi, x0(µ)iX]i.(2.23d)
An important property is that for given (t, µ)I×Mand for given values
α1(t;µ),. . .,αr(t;µ)R, the corresponding values of the time derivatives
˙α1(t;µ),. . .,˙αr(t;µ)Rdetermined by the Galerkin ROM (2.22) are optimal
in the sense that they minimize the norm of the residual (2.21). Since the
continuous-time residual is minimized, this property is called continuous opti-
mality in [61] to distinguish it from an alternative approach which minimizes
the residual after time discretization.
The first equation in (2.22) is a system of rdifferential equations and r
unknowns and, thus, the number of ROM equations and unknowns is reduced
in comparison to the FOM. However, the evaluation of the ROM right-hand
side ˜
Fstill involves the evaluation of the high-dimensional FOM right-hand
side Fand hence the online phase may still be expensive. Nevertheless, in
some situations we may transfer those computations which scale with the FOM
dimension from the online to the offline phase. As a consequence, the ROM
can then be evaluated with a computational cost which only scales with the
reduced dimension r. This is for instance possible if Fis linear with respect
to its second argument and if its time and parameter dependency allows for a
separation of the form
F(t, x;µ) =
K
X
k=1
θk(t;µ)Fkx(2.24)
with Kn, coefficient functions θ1, . . . , θK:R0×MR, and linear
mappings F1, . . . , FK:WX. In this case, we have
˜
F(t, α;µ) =
K
X
k=1
θk(t;µ)
hφ1, Fkφ1iX. . . hφ1, FkφriX
.
.
..
.
.
hφr, Fkφ1iX. . . hφr, FkφriX
|{z }
=:˜
Fk
α1
.
.
.
αr
and, thus, ˜
Fmay be efficiently evaluated in the online phase, once the r×r
matrices ˜
F1,..., ˜
FKhave been precomputed in the offline phase. In a similar
way, an efficient offline/online decomposition can be achieved if Fis a poly-
nomial function with respect to its second argument, see for instance [172]
60
2.5. Parametric Model Order Reduction
and the references therein. Furthermore, the treatment of piecewise polyno-
mial nonlinearities is discussed in [82] and a method for treating nonlinearities
which are linear combinations of elementary functions such as sin(·),cos(·), or
exp(·)is presented in [126].
In the case that Finvolves a more general nonlinearity, it may not be possi-
ble to sufficiently reduce the computational complexity of the ROM evaluations
just by offline precomputations. To this end, several hyperreduction methods
have been proposed in the literature and they usually involve an approximation
of ˜
Fin order to significantly decrease the computational cost for evaluating
the ROM. For instance, a commonly applied hyperreduction method is the
(discrete) empirical interpolation method, which is briefly summarized in sec-
tion 2.5.4.
We note that the Galerkin method presented at the beginning of this sub-
section is a special case of a Petrov–Galerkin scheme. In general, the Petrov–
Galerkin method is based on enforcing the residual to be orthogonal to the
span of some linearly independent test functions ζ1, . . . , ζrXwhich satisfy
the compatibility condition
span{ζ1, . . . , ζr}span{φ1, . . . , φr}={0}.(2.25)
This condition is in particular satisfied if the ansatz functions φiand test
functions ζicoincide for i= 1, . . . , r, respectively, and this special case cor-
responds to the Galerkin method. In general, a Petrov–Galerkin projection
yields a ROM of the form (2.22) where the mass matrix, right-hand side, and
initial value are given by
M:=hhζi, φjiXiij ,˜
F(t, α;µ):=
*ζi, F
t,
r
X
j=1
αjφj;µ
+X
i
,
˜x0(µ):=M1[hζi, x0(µ)iX]i.
In particular, we note that the mass matrix Mis invertible due to the compat-
ibility condition (2.25). In the special case of a finite-dimensional FOM with
X=W=Rn, the ROM is obtained via matrix multiplications using the ma-
trices Vr:= [φ1··· φr]Rn,r and Wr:= [ζ1··· ζr]Rn,r. More precisely,
the ROM mass matrix is then given by M=W>
rVr, the right-hand side by
˜
F(t, α;µ) = W>
rF(t, Vrα;µ), and the initial value by ˜x0(µ) = M1W>x0(µ).
2.5.4. Hyperreduction
As mentioned in the previous subsection, the evaluation of the ROM (2.22)
may still scale with the FOM dimension, especially if the FOM (2.14) is non-
linear. To this end, various hyperreduction techniques have been introduced
for achieving an efficient offline/online decomposition by an additional approx-
imation of the ROM, cf. [6, 17, 21, 60, 64, 97, 134, 140, 166, 217, 247]. In the
61
2. Preliminaries
following we only discuss the empirical interpolation method (EIM) and the
discrete empirical interpolation method (DEIM), since these methods are the
most relevant ones for the new approach presented in section 4.3.2.
The empirical interpolation method is introduced in [21] for approximating a
general space- and parameter-dependent function g: ×MRwith spatial
domain R2and parameter domain MRnp. Especially, the authors
assume that g(·, µ)is an element of L(Ω) for all µMand present a
procedure for approximating gby a function ˜g: ×MRwhich is affine
in the second argument, i.e., there exist KN,γ1, . . . , γK:MR, and
g1, . . . , gK: Rsatisfying
˜g(ξ, µ) =
K
X
k=1
γk(µ)gk(ξ)for all (ξ, µ)×M.
Furthermore, the authors also discuss the usefulness of such an affine approx-
imation in the context of model order reduction, see also (2.24) and the corre-
sponding discussion in section 2.5.3.
The DEIM is introduced in [64] and applies the ideas of the EIM in the con-
text of general nonlinear finite-dimensional ODE systems, which usually arise
from a semi-discretization of a PDE in space. Accordingly, in the following
we consider a ROM of the form (2.22)–(2.23) with X=Rnand mode matrix
Φ:= [φ1··· φr]Rn,r. In addition, we assume for simplicity that the right-
hand side Fdoes not explicitly depend on time, which can be achieved by mak-
ing the system autonomous, see for instance [136, sec. VII.1]. Furthermore, we
omit parameter dependencies while emphasizing that the following considera-
tions may be straightforwardly extended to the case of parameter-dependent
FOMs, cf. [64]. According to (2.23), the ROM right-hand side ˜
F:RrRr
without time and parameter dependencies is defined as ˜
F(˜x):= Φ>F˜x),
where Fis the FOM right-hand side. In particular, as mentioned in the last
subsection, the evaluation of ˜
Finvolves the evaluation of Fand this may ren-
der the online phase expensive. To circumvent this issue, the DEIM is based
on approximating Fby a suitable linear combination of appropriately chosen
ansatz vectors, i.e., we aim to find an approximation of the form
F˜x(t)) Ψβ(t)(2.26)
with DEIM mode matrix Ψ=[ψ1··· ψs]Rn,s, DEIM coefficient vector
β:IRs, and sn. Especially, in [64] the authors propose to determine
the DEIM modes ψ1, . . . , ψsin the offline phase by computing a POD of the
nonlinearity snapshots F(x(t1)), . . . , F(x(tq)). Then, in order to determine
the DEIM coefficients in the online phase, they follow the EIM and suggest
to enforce equality in ssuitably selected rows of (2.26). Thus, the coefficients
can be determined by solving the linear equation system
S>Ψβ(t) = S>F˜x(t)),
62
2.6. Port-Hamiltonian Systems
where S= [eπ(1), . . . , eπ(s)]Rn,s is a truncated permutation matrix. Here, ei
denotes the ith unit vector of the standard basis of Rnfor i= 1, . . . , n and
π:{1, . . . , s}→{1, . . . , n}is an injective mapping which is determined in the
offline phase. Corresponding algorithms for determining πbased on Ψare for
instance proposed in [64, 83]. Especially, as long as Ψhas full column rank,
the mentioned algorithms ensure the invertibility of S>Ψ. Consequently, the
DEIM results in replacing ˜
F(˜x)=Φ>F˜x)in the ROM by
Φ>Ψ(S>Ψ)1S>F˜x).(2.27)
Here, the matrix Φ>Ψ(S>Ψ)1can be precomputed in the offline phase and,
moreover, the evaluation of (2.27) only involves the evaluation of srows of the
FOM right-hand side F. As long as the Jacobian of Fis sparse, this allows to
implement the ROM evaluation in an efficient way, cf. [64].
2.6. Port-Hamiltonian Systems
In the following we provide a brief summary of port-Hamiltonian systems with
particular emphasis on the topics which are relevant for this thesis. Especially,
we present different linear and nonlinear pH representations and discuss some
related properties such as stability and passivity in section 2.6.1. Moreover,
we summarize a structure-preserving MOR approach for pH systems based on
a Petrov–Galerkin projection in section 2.6.2.
2.6.1. Formulations and Basic Properties
We start by considering linear time-invariant port-Hamiltonian systems of the
form
˙x(t)=(JR)Qx(t) + Bu(t), x(t0) = x0,(2.28a)
y(t) = B>Qx(t)(2.28b)
for all tI, with time interval I= [t0, tend],t0R0,tend R>t0, state
x:IRn,input port u:R0Rm,output port y:IRm, initial value
x0Rn,structure matrix JRn,n with J=J>,dissipation matrix RRn,n
with R=R>0,energy matrix QRn,n with Q=Q>0, and input matrix
BRn,m. Associated to this system, we introduce a quadratic Hamiltonian
H:RnRvia H(z):=1
2z>Qz. Furthermore, we emphasize that there are
alternative linear time-invariant pH representations including a leading matrix
Ein front of ˙xin (2.28a) or a feedthrough term, cf. Remark 2.6.1, Remark 2.6.4,
and chapter 5.
As a consequence of the structure in (2.28), one can show that the Hamil-
tonian may only increase if the input uand the output ydo not vanish. To
this end, let ube chosen such that (2.28a) admits a solution xin C1(I,Rn).
63
2. Preliminaries
Then, due to the symmetry and definiteness properties of Jand Ras well as
the symmetry of Q, this solution satisfies the so-called dissipation inequality
d
dt(Hx)(t) = x(t)>Q>˙x(t) = x(t)>Q>((JR)Qx(t) + Bu(t))
=x(t)>Q>RQx(t) + y(t)>u(t)y>(t)u(t)
(2.29)
for all tI. The Hamiltonian often represents the stored energy of the sys-
tem and (2.29) corresponds to a power balance, where the term x>Q>RQx
describes the internal energy dissipation and y>uthe energy exchange with
the environment or with other subsystems, see for instance [278]. Moreover,
(2.29) implies that the pH system (2.28) is passive, i.e., there exists a storage
function V:RnR0satisfying V(0) = 0 and a dissipation inequality of
the form (2.29) with V=H, see for instance [53] for a formal definition of
passivity.
In the special case where Qis even positive definite and uvanishes, the
Hamiltonian is a globally quadratic Lyapunov function in the sense of Defi-
nition 2.4.9 with H=V,z=∇H,E=In,F(t, x):= (JR)Qx,c1= 0,
c2=σmin(Q), and c3=σmax(Q). Thus, by Theorem 2.4.10(i) we infer that the
pH structure in (2.28) with the additional assumption Q > 0implies that the
system (2.28a) with u= 0 has a uniformly stable equilibrium point at 0, see
also [193].
Remark 2.6.1 (Port-Hamiltonian systems with feedthrough term).The port-
Hamiltonian structure (2.28) may be extended to systems with feedthrough
terms via
˙x(t)=(JR)Qx(t)+(GP)u(t), x(t0) = x0
y(t)=(G+P)>Qx(t)+(S+N)u(t),
for all tI, with G, P Rn,m and S, N Rm,m. In this case, we require the
coefficient matrices to satisfy Q=Q>0,J=J>,N=N>, and
"R P
P>S#="R P
P>S#>0,
cf. [276, sec. 6.1]. In a similar way, one may also extend the upcoming nonlinear
pH structure (2.30) to include feedthrough terms, cf. [196]. ¨
Remark 2.6.2 (Hamiltonian and dissipative Hamiltonian systems).In the re-
mainder of this thesis, we refer to the special case of a port-Hamiltonian sys-
tem without external port variables, i.e., with B= 0 in (2.28), as a dissipative
Hamiltonian system. Similarly, we call a dissipative Hamiltonian system with-
out dissipation, i.e, with R= 0 in (2.28), a Hamiltonian system. Moreover,
we use these terms not only in the context of linear time-invariant systems,
but also for nonlinear systems as discussed in the remainder of this section.
We emphasize that especially our use of the term Hamiltonian system is less
64
2.6. Port-Hamiltonian Systems
restrictive than it is common in the differential equations literature, where the
term is rather used for the case where Jhas a special canonical structure, see
for instance [35, Def. 11.1]. ¨
In [196] the authors introduce a class of nonlinear port-Hamiltonian descrip-
tor systems. The special case without feedthrough term and with pointwise
square Ematrix function reads
E(t, x(t)) ˙x(t) + r(t, x(t)) = (J(t, x(t)) R(t, x(t)))z(t, x(t)) + B(t, x(t))u(t),
(2.30a)
y(t) = B>(t, x(t))z(t, x(t)) (2.30b)
for all tI, with E, J, R C(R0×Rn,Rn,n),r, z C(R0×Rn,Rn), and
BC(R0×Rn,Rn,m). Here, the second term on the left-hand side of (2.30a)
reflects an explicit time dependency of the Hamiltonian. More precisely, we
consider an associated Hamiltonian H C1(R0×Rn)and require that E,J,
R,r,zsatisfy pointwise
J=J>, R =R>0, E>z=xH, z>r=tH.(2.31)
Then, by means of a similar calculation as in (2.29), cf. [196, sec. IIB], one can
show that, for any continuously differentiable solution xof (2.30a), we have a
dissipation inequality of the form
dHs
dt(t)y(t)>u(t)for all tI,(2.32)
where Hs:R0Ris defined via Hs(t):=H(t, x(t)). This dissipation in-
equality is an important property when investigating stability as well as the
existence and uniqueness of solutions of the state equation (2.30a) with u= 0
and pointwise invertible E.
Theorem 2.6.3 (Stability of (2.30a)).Consider the system (2.30a) with van-
ishing input u= 0,J, R C1(R0×Rn,Rn,n),r, z C1(R0×Rn,Rn), and
pointwise invertible EC1(R0×Rn,Rn,n). Furthermore, let (2.31) be sat-
isfied pointwise for some Hamiltonian H C1(R0×Rn), which additionally
fulfills condition (ii) in Definition 2.4.9 with H=V. Then, the following
assertions hold.
(i) For each initial value x0Rnand for any time interval I= [t0, tend]
with t0R0and tend R>t0, the initial value problem associated with
(2.30a),u= 0, and x(t0) = x0has a unique solution on I.
(ii) If r(t, 0) = (J(t, 0) R(t, 0))z(t, 0) is satisfied for all tR0, then
(2.30a) with u= 0 has a uniformly stable equilibrium point at the origin.
Proof. (i) Since the assumptions of Theorem 2.4.6 are satisfied, we conclude
that, for a given initial value x0Rnand initial time t0R0, the
65
2. Preliminaries
corresponding initial value problem associated with (2.30a) and u= 0 is
either uniquely solvable on any time interval I= [t0, tend]with tend R>t0
or there is a maximal existence interval [t0, δmax)with δmax R>t0and
lim
t%δmax kx(t)k=.(2.33)
Let us assume that for some (t0, x0)R0×Rnthe latter statement
is true. Then, since the Hamiltonian satisfies condition (ii) in Defini-
tion 2.4.9, (2.33) implies
lim
t%δmax H(t, x(t)) = .
However, this is a contradiction to the inequality H(t, x(t)) H(t0, x0),
which holds for any tt0and follows from the dissipation inequality
(2.32) in the case u= 0. Thus, we infer that for any initial value x0Rn
and initial time t0R0, the corresponding initial value problem asso-
ciated with (2.30a) and u= 0 is uniquely solvable on any time interval
I= [t0, tend]with tend R>t0.
(ii) First, we note that the equation r(·,0) = (J(·,0) R(·,0))z(·,0) implies
that 0Rnis an equilibrium point of (2.30a) with u= 0. Furthermore,
due to (i), the requirements of Theorem 2.4.10 are satisfied. Besides,
using (2.31) we infer that the Hamiltonian satisfies not only condition
(ii) in Definition 2.4.9, but also condition (i) with c1= 0. Thus, the
Hamiltonian is a globally quadratic Lyapunov function of (2.30a) with
u= 0 and, hence, the claim follows by applying Theorem 2.4.10(i).
The classes of port-Hamiltonian systems considered in (2.28) and (2.30) are
formulated in a continuous-time setting and this is also true for the corre-
sponding dissipation inequalities (2.29) and (2.32). However, the simulation
of a pH system usually involves a discretization in time and there is in general
no guarantee that the resulting time-discrete system inherits a correspond-
ing dissipation inequality. In [169, 196] it is shown that for the special case
where the Hamiltonian is independent of tand quadratic with respect to x,
Gauss–Legendre collocation methods may be used to obtain a dissipation in-
equality also on the discrete-time level. For instance, the Hamiltonians of the
full-order models considered in chapter 6 satisfy this requirement and thus
we obtain a dissipation inequality on the time-discrete level by using the im-
plicit midpoint rule, which is the simplest Gauss–Legendre collocation method,
cf. [80, sec. 6.3.2]. However, the corresponding reduced-order models are port-
Hamiltonian systems with non-quadratic Hamiltonian and, therefore, we partly
use a different time integration scheme based on discrete gradient pairs for the
ROMs, cf. chapter 6 and appendix C.
66
2.6. Port-Hamiltonian Systems
2.6.2. Structure-Preserving Model Order Reduction
In this subsection we summarize the approach presented in [65] for structure-
preserving model reduction for nonlinear port-Hamiltonian systems of the form
˙x(t) = (JR)∇H(x(t)) + Bu(t),(2.34a)
y(t) = B>∇H(x(t)) (2.34b)
for all tI. In particular, this system class is a special case of (2.30) with
E=In,H C1(Rn),z=∇H,r= 0, and constant J,R,B.
When applying a classical Galerkin projection to (2.34), cf. section 2.5.3, it is
in general not clear if the resulting reduced-order model is a port-Hamiltonian
system as well. Therefore, in [65] the authors propose a model reduction
scheme based on a suitable Petrov–Galerkin projection. For this purpose, two
subspaces of dimension rnare determined in the offline phase: The trial
space is chosen such that the full-order state xmay be well approximated within
this subspace, whereas the choice of the test space targets a good approximation
of ∇H x. Especially, both subspaces may for instance be obtained via the
POD method based on snapshots of xand ∇H x, respectively. This yields
matrices VrRn,r and WrRn,r whose columns span the trial and the test
space, respectively. Moreover, it is assumed that the two identified subspaces
satisfy a compatibility condition similar to (2.25), which allows to choose Vr
and Wrsuch that W>
rVr=Irholds.
Based on the matrices Vrand Wr, a ROM is constructed as follows. First, a
Petrov–Galerkin projection as outlined at the end of section 2.5.3 is performed,
which results in the reduced system
˙
˜x(t) = W>
r(JR)∇H(Vr˜x(t)) + W>
rBu(t),
˜y(t) = B>∇H(Vr˜x(t)) (2.35)
for all tI, with reduced state ˜x:IRrand output ˜y:IRm. Especially,
Vr˜xcorresponds to an approximation of the FOM state xand, accordingly, we
define the Hamiltonian of the reduced system ˜
H:RrRvia ˜
H(˜x):=H(Vr˜x).
Furthermore, since Vr˜xapproximates xand, by assumption, ∇H xmay be
well approximated by elements of im(Wr), it is reasonable to assume that also
∇H(Vr˜x(·)) may be well approximated by elements of im(Wr), cf. [65, sec. 2].
Consequently, we may assume that for each tIthere exists β(t)Rrwith
∇H(Vr˜x(t)) Wrβ(t).
Especially, using V>
rWr= (W>
rVr)>=Irwe obtain
∇H(Vr˜x(t)) WrV>
rWrβ(t)WrV>
r∇H(Vr˜x(t)) = Wr˜
H(˜x(t)).
67
2. Preliminaries
Then, formally replacing ∇H(Vr˜x)in (2.35) by Wr˜
H(˜x)yields the ROM
˙
˜x(t)=(˜
J˜
R)˜
H(˜x(t)) + ˜
Bu(t),
˜y(t) = ˜
B>˜
H(˜x(t)) (2.36)
for all tI, with ˜
J:=W>
rJWr,˜
R:=W>
rRWr, and ˜
B:=W>
rB. In particular,
˜
Jis skew-symmetric and ˜
Ris symmetric and positive semi-definite and, hence,
the ROM (2.36) has the same pH structure as the FOM (2.34).
Remark 2.6.4 (Structure-preserving MOR for linear time-invariant pH sys-
tems).In the special case of linear time-invariant pH systems of the form
(2.28) with Q=Q>>0and Hamiltonian H(x):=1
2x>Qx, the model reduc-
tion approach explained in this subsection greatly simplifies. In particular, we
may choose Wrvia Wr=QVr(V>
rQVr)1, see also [130]. Moreover, in this case
we have
∇H(Vr˜x) = QVr˜x=QVr(V>
rQVr)1V>
rQVr˜x=Wr˜
H(˜x)
for all ˜xRrand, thus, the ROM (2.35) obtained via Petrov–Galerkin pro-
jection coincides with the port-Hamiltonian ROM (2.36).
Another important class of linear time-invariant port-Hamiltonian systems
is given by
E˙x(t) = (JR)x(t) + Bu(t),
y(t) = B>x(t),(2.37)
for all tI, with ERn,n satisfying E=E>>0, associated Hamiltonian
H:RnRdefined via H(x):=1
2x>Ex, and J,R,Bas in (2.28). Here, the
structure-preserving model reduction problem becomes even simpler, since a
classical Galerkin projection with Wr=Vrautomatically preserves the port-
Hamiltonian structure, see for instance [230]. Furthermore, applying a state
space transformation of the form ˜x:=Ex to (2.37) leads to a port-Hamiltonian
system of the form (2.28) with positive definite Q=E1. Consequently, the
stability and passivity properties mentioned at the beginning of section 2.6.1
also apply to pH systems of the form (2.37). ¨
68
3. Mode Identification
As mentioned at the beginning of chapter 1 and outlined in section 2.5, model
reduction schemes are often separated into an offline and an online phase. The
offline phase includes all steps for constructing a ROM and, for instance when
using a POD-based approach, this typically includes the simulation of the
FOM for different input or parameter configurations as well as a subsequent
determination of suitable ansatz functions based on the snapshot data of the
FOM state. This chapter is devoted to the latter task when using approxima-
tion ansatzes based on transformed modes as in (1.4) or (1.8). To this end,
we present in section 3.1 a framework which is inspired by the POD optimiza-
tion problem, cf. section 2.5.1. In particular, given a trajectory of the FOM
state x, a suitable family of transformations T, and paths p1, . . . , pr, the new
framework aims for determining modes φ1, . . . , φrand amplitudes α1, . . . , αr
such that the error of the approximation
x(·)
r
X
i=1
αi(·)T(pi(·)) φi,(3.1)
is minimized, cf. (1.4) and (1.7) in section 1.1. Based on the framework from
section 3.1, we demonstrate in section 3.2 how the POD-greedy algorithm
discussed in section 2.5.2 may be extended to approximations based on trans-
formed modes. This yields an adaptive way of gradually sampling the param-
eter domain for cases where the FOM is parameter-dependent.
While we consider a quite general class of transformation families Tin sec-
tions 3.1 and 3.2, in most examples of this thesis we consider the special case
where Tcorresponds to a family of some kind of translation operators on a one-
dimensional spatial domain . The application of such a translation operator
is straightforward in the case = Ror for problems with periodic boundary
conditions. On the contrary, when considering problems on bounded domains
with non-periodic boundary conditions, there are several ways of defining suit-
able translation operators and some of them are presented in section 3.3. Fi-
nally, in section 3.4 we mention some other methods which construct mode
decompositions of the form (3.1) or similar and discuss their advantages and
disadvantages in comparison to the method presented in section 3.1.
69
3. Mode Identification
3.1. Residual Minimization
In this subsection, we consider a given trajectory xL2(I,W)with I= [0, tend]
and tend R>0, where Wis a subspace of a real Hilbert space X. Usually, this
trajectory xis the result of solving an evolution equation of the form (1.1), for
instance via a numerical simulation. The goal of this subsection is to present a
framework for determining an optimal approximation of xusing an ansatz of
the form (3.1). To this end, we assume that we are given a real Banach space V
and a suitable family T:R L(V,X)of transformation operators together
with suitable paths p1, . . . , prL2(I). For deriving a corresponding numerical
algorithm, the special case X=W=Rn,V=Rdφwith n, dφNplays an
important role and may for instance correspond to a numerical approximation
of x. Moreover, in many situations the trajectory xis only known at discrete
time points, which is also reflected in the algorithms presented in this section.
We note that in contrast to the parameter-dependent ansatz (1.7), in this
section we focus on trajectories which are only dependent on time, but not on
additional parameters. However, we emphasize that the presented framework
may be generalized to the parameter-dependent case and we briefly address
this in Remark 3.1.4.
As a starting point, we recall that the POD method is based on the mini-
mization problem (2.15), cf. section 2.5.1. Consequently, the POD method is
optimal in the sense that the POD modes φ1, . . . , φrand the corresponding
coefficients α1, . . . , αrare chosen such that the squared L2(I,X)norm of the
residual is minimized. To obtain a similar optimality for the decomposition
(3.1), we consider the minimization problem
min
φ1,...,φrY, α1,...,αrL2(I)
1
2
tend
Z0x(t)
r
X
i=1
αi(t)T(pi(t)) φi
2
X
dt, (3.2)
where YVis defined as in (1.5). The major two differences of the mini-
mization problem (3.2) to the one in (2.15) are the additional transformation
operators and the omission of the orthogonality constraints for the modes φi.
In the POD case, where an optimal time-independent linear subspace is sought,
we can restrict ourselves to searching for an orthonormal basis of the subspace
without affecting the approximation quality of the optimizer. On the other
hand, the approximation ansatz (3.1) is not just given by a linear combination
of modes, but instead also involves transformation operators, which act on the
modes and are parametrized by the time-dependent paths pi. Consequently,
we may not use the same arguments as in the POD case for justifying an or-
thonormality constraint for the modes. In fact, Example 3.1.1 illustrates that
even linearly dependent modes may be optimal when considering an approx-
imation ansatz of the form (3.1). This is in contrast to methods based on
time-independent linear subspaces, where a linearly dependent set of modes
corresponds to a redundancy and allows to remove modes without increasing
70
3.1. Residual Minimization
the approximation error.
Example 3.1.1 (Linear dependence of optimal modes).We revisit Exam-
ple 1.2.3 for the special case where the initial value of the velocity is zero, i.e.,
v0= 0. Furthermore, we only consider the analytical solution for the density,
which is given by
ρ(t, ξ) = ρref (ϑr(ξct) + ϑl(ξ+ct)) for all (t, ξ)I×,
cf. (1.15). Since v0is zero, the Riemann invariants ϑrand ϑlcoincide and are
given by the (ba)-periodic continuation of 1
2ρref ρ0, cf. (1.16). As pointed out
in Example 1.2.3, the analytical solution may be described by an ansatz of the
form (3.1) with
r= 2, φ1=φ2=ρref ϑr|,T=Tper,
α1(t) = α2(t)=1, p1(t) = p2(t) = ct for all tI,(3.3)
cf. (1.17). Moreover, since the approximation error is zero, the modes and
amplitudes in (3.3) are a solution of the corresponding optimization problem
(3.2) with x=ρand X=Y=L2(Ω). We emphasize that the modes are
optimal and at the same time they coincide, i.e., they are linearly dependent.
However, since they are transformed by different time-dependent transforma-
tion operators Tper(ct)and Tper(ct), this linear dependence does not imply
that we can remove one of them without introducing an error. All in all, this
example demonstrates that even linearly dependent modes may be optimal
when using an approximation ansatz of the form (3.1). l
Remark 3.1.2 (Determination of the paths).We emphasize that the cost func-
tional in (3.2) is only optimized over the modes and amplitudes, whereas the
paths piare fixed parameters. The simultaneous optimization of the modes,
amplitudes, and paths is not within the scope of this thesis, but is instead ad-
dressed in [40] in a very similar setting. Further data-driven techniques for es-
timating the paths based on snapshots of xare discussed in [199, 241, 242, 259].
Approaches which do not only determine the paths but also the family of trans-
formation operators are for instance presented in [152, 171, 243, 270]. They
are based on describing a transport map via a suitable linear combination of
ansatz functions, which are determined based on snapshot data. ¨
Remark 3.1.3 (Existence and uniqueness of solutions).Throughout this thesis,
we assume that the integral in (3.2) exists and that the minimization problem
(3.2) has a solution, whereas the analysis of the existence of solutions is not
discussed in this thesis. We refer to [37, sec. 4], where these questions have been
discussed in a very similar setting. If a solution of the minimization problem
(3.2) exists, then it is in general not unique. For instance, multiplying an
amplitude with any non-zero constant and dividing the corresponding mode
by the same constant changes the amplitude and the mode, but not the value of
the cost functional. This source of non-uniqueness can be avoided by enforcing
71
3. Mode Identification
the modes to be normalized, cf. [37]. But even then, the minimizer is in
general not unique, since we can for instance multiply an amplitude and the
corresponding mode by 1without changing the cost functional. As pointed
out in Remark 2.5.3, a similar kind of non-uniqueness also applies to the POD
optimization problem. ¨
Remark 3.1.4 (Parameter-dependent trajectories).If we consider a trajectory
which does not only depend on time but also on a parameter vector µ
M, then the cost functional in (3.2) has to be adapted to also account for
the parameter dependency. To this end, we assume that Mis the closure
of an open, non-empty subset of Rnp. Then, we search for solutions of the
minimization problem
min
φYr, α(L2(I×M))r
1
2Z
M
tend
Z0x(t;µ)
r
X
i=1
αi(t;µ)T(pi(t;µ)) φi
2
X
dtdµ.
In a fully discrete setting, the parameter dependency can be treated analo-
gously as the time dependency by using appropriate weights for the different
snapshots, cf. (3.6). A theoretical discussion of the classical POD optimization
problem for parameter-dependent elliptic PDEs is presented in [156]. An alter-
native treatment of the parameter dependency is given by a greedy procedure
as discussed in section 3.2. ¨
As illustrated in Example 3.1.1, it is advisable to omit orthogonality con-
straints for the modes when using an ansatz of the form (3.1), at least in the
case where more than one transformation operator is involved. However, the
fact that the transformed modes are in general not orthonormal prevents a
connection to the singular value decomposition as in the POD case. In the
following, we consider a special case where the minimization problem (3.2)
may be reduced to one of the form (2.16), which may be solved via the POD
method. To this end, we consider the ansatz (1.8) with nt= 1 and assume that
Vis a Hilbert space and that Tis pointwise unitary. Then, since all modes are
affected by the same transformation and since this transformation is unitary,
orthonormality of the modes is preserved by the transformation. Thus, in this
special case it is reasonable to enforce the modes to be orthonormal and the
corresponding minimization problem reads
min
φ1,...,φrY
1
2
tend
Z0x(t)
r
X
i=1 hx(t),T(p(t)) φiiXT(p(t)) φi
2
X
dt,
s.t. hφi, φjiV=δij for i, j = 1, . . . , r.
(3.4)
Here, we exploited the orthonormality of the transformed modes, which allows
to replace the coefficients αiby the optimal ones hx, T(p)φiiXobtained via
orthogonal projection, cf. section 2.5.1. Furthermore, by using again the fact
that Tis pointwise unitary, the minimization problem (3.4) can be shown to
72
3.1. Residual Minimization
be equivalent to
min
φ1,...,φrY
1
2
tend
Z0T(p(t))x(t)
r
X
i=1 hT(p(t))x(t), φiiVφi
2
V
dt,
s.t. hφi, φjiV=δij for i, j = 1, . . . , r.
(3.5)
If we additionally assume that Vis separable, YVis a dense subspace,
and the transformed trajectory T(p)xis in L2(I,Y), then Theorem 2.5.2
yields that a solution of this minimization problem is given by the first rPOD
modes of T(p)x. This relation has been used for instance in [54] and it has
been formally proven in a slightly different setting in [37, sec. 4]. An example
where the mentioned assumptions on V,Y, and Tare satisfied is given by
V=X=L2(Ω),W=H1
per (Ω) with some spatial domain = (a, b),aR,
bR>a, and T=Tper, cf. Definition 1.2.2. Due to this special choice of W, the
subspace Yas defined in (1.5) may be shown to coincide with W=H1
per (Ω).
Furthermore, H1
per (Ω) is indeed a dense subspace of V=L2(Ω), which follows
from [46, Cor. 4.23] and the fact that the space of infinitely differentiable
functions with compact support in is a subspace of H1
per (Ω).
Apart from the special case considered in the last paragraph, it is in general
not clear if the minimization problem (3.2) may be reduced to a corresponding
POD optimization problem as in (2.16). Instead, we propose to discretize
(3.2) and solve the resulting finite-dimensional nonlinear optimization problem
numerically. For this purpose, we consider the special case X=W=Rnand
assume that we have access to a finite number of samples of the trajectory
x. Furthermore, we discretize the time integral in (3.2) and consider the fully
discrete problem
min
φ1,...,φrRdφ, a1,1,...,ar,qR
q
X
i=1
ωi
2[X]i
r
X
j=1 T(gj,i)aj,iφj
2
W
,(3.6)
where we also assume a finite-dimensional space V=Y=Rdφfor the modes.
Here, the coefficients ωiR>0,i= 1, . . . , q, are weighting factors originating
from a quadrature approximation of the time integral in (3.2), for instance,
using a composite trapezoidal scheme. Furthermore, we use a weighted norm
k·kWwith symmetric and positive definite WRn,n to be able to include for
example a discretization of the L2(Ω) norm with spatial domain , cf. chap-
ter 6. Besides, XRn,q denotes a snapshot matrix which is assumed to be
given and whose columns correspond to samples of the trajectory x, cf. (2.20).
Especially, in order for these samples to be well-defined, we implicitly assume
that xis an element of a subspace of L2(I,Rn)for which point-wise evaluations
are well-defined, as for instance C(I,Rn). A similar assumption applies also to
the amplitudes and the paths, whose discrete analogues αj(ti)and pj(ti)are
denoted by aj,i Rand gj,i R, respectively, for j= 1, . . . , r and i= 1, . . . , q.
73
3. Mode Identification
Remark 3.1.5 (Alternative for discretizing the cost functional).The discretiza-
tion step leading from the infinite-dimensional problem (3.2) to the finite-
dimensional one (3.6) usually involves replacing the family of transformation
operators Tby a suitable family of matrices, which approximate the action
of Ton the discrete level. This is demonstrated in appendix D.1 for the
translation operators used in chapter 6. In [40, sec. 4], the authors present
an alternative discretization which is based on approximating the modes by
linear combinations of FEM basis functions. This method has the advantage
that it does not require a discretization of the family of transformations T,
but instead it is based on an analytical computation of the occurring inner
products of transformed FEM basis functions, cf. [40, Ex. 3]. However, first
investigations indicate that the resulting discretized cost function cannot be
simply brought into the same form as (3.6) without a significant computa-
tional overhead. Consequently, it is not clear if an efficient reduction of the
cost function as discussed in the upcoming section 3.1.2 is possible when using
the alternative discretization from [40]. This is why we instead focus here on
the discrete cost function as given in (3.6). ¨
Before we discuss strategies for solving the optimization problem (3.6), we
introduce the vectors
φ:=
φ1
.
.
.
φr
Rrdφ, a :=
a1
.
.
.
aq
Rrq, ai:=
a1,i
.
.
.
ar,i
Rr(3.7)
as well as the matrix functions K:RrdφRqn,rq and Ki:RrdφRn,r defined
via
K(φ):=
K1(φ)...
Kq(φ)
, Ki(φ):=hT(g1,i)φ1··· T (gr,i)φri,
(3.8)
for i= 1, . . . , q. This allows to reformulate the minimization problem (3.6) as
min
φRrdφ, aRrq
1
2kvec(X)K(φ)ak2
ˆ
W,(3.9)
where the weighting matrix ˆ
WRnq,nq is defined via
ˆ
W:=
ω1...
ωq
W. (3.10)
In particular, we observe that the minimization problem (3.9) can be written
74
3.1. Residual Minimization
as a weighted separable nonlinear least squares problem of the form
min
βRsRp
1
2kbA(β)γk2
˜
W
|{z }
=:Jf(β)
(3.11)
with given matrix function A:RsRk,p, symmetric and positive definite
weighting matrix ˜
WRk,k, and vector bRk. In (3.9), the quantities
φ, a,K,vec(X), and ˆ
Wcorrespond to β,γ,A,b, and ˜
W, respectively, in
the general problem formulation (3.11). The characteristic feature of such
separable nonlinear least squares problems is that the unknowns are separated
into two block components βand γand that A(β)γis linear with respect to the
γcomponent and possibly nonlinear with respect to the βcomponent. These
kinds of problems have been extensively investigated in the past decades. For
instance, in [120] the authors have introduced the variable projection method,
which we follow in section 3.1.2. Constrained separable nonlinear least squares
problems have for example been addressed in [159, 221]. Furthermore, for the
treatment of ill-conditioned problems, some regularizations of Tikhonov type
have been proposed, see e.g. [66, 70, 288]. These and other extensions like the
punishment of discretized derivatives of the modes or the amplitudes can be
added to the minimization problem (3.9) to take additional requirements for
the solution into account. However, these extensions are not within the scope
of this thesis.
In the following, we discuss two different strategies to numerically solve the
minimization problem (3.9). The first approach relies on solving the problem
simultaneously in both variables φand a, whereas the latter one uses the
variable projection method in order to reduce (3.9) to a problem which only
depends on φ.
Remark 3.1.6 (Choice of the mode number).Regardless of whether the full
optimization problem is solved as in section 3.1.1 or the variable projection
method is applied as in section 3.1.2, we usually assume the mode number rto
be given in advance. For simple examples like the linear advection or the linear
wave equation as considered in section 1.2.1, physical insights may be used to
properly choose r. However, when considering more involved problems, it may
be much more difficult or even impossible to determine a suitable value for r
just based on physical considerations. If at least the most relevant transports
have been identified, i.e., nt,T, and p1, . . . , pntin (1.8) may be assumed to be
given, then the mode numbers r1, . . . , rntcould be for instance chosen based
on a greedy algorithm as discussed in section 3.2. A similar procedure for
gradually increasing the mode numbers is proposed in [259, Alg. 1], but in
contrast to the approach in section 3.2 this is not a greedy procedure, but
instead all modes are optimized each time the mode numbers are increased. A
completely different strategy is suggested in [240], where the author proposes
a cost function based on a Schatten 1-norm and argues that the corresponding
heuristic rank minimization property may be used to remove redundant modes
75
3. Mode Identification
or even redundant transports, i.e., this approach could be used for decreasing
ntand r1, . . . , rntin case that these numbers are chosen larger than necessary
for obtaining a reasonable approximation. ¨
3.1.1. Solving the Full Optimization Problem
We start by considering the full optimization problem (3.9) and aim to solve it
directly without any reduction of the optimization parameters. To this end, we
intend to use gradient-based optimization techniques and, thus, first compute
the partial derivatives of the cost function in (3.9) with respect to φand a, see
Lemma 3.1.7. A corresponding algorithm for evaluating the cost function and
its gradient is provided in Algorithm 3.1. The expression [aJfull]ioccurring in
the last line of the for-loop denotes the ith block row of the partial gradient with
respect to a, cf. (3.13) in Lemma 3.1.7. In section 6.1 we use this algorithm
together with a gradient-based optimization solver in the context of a wave
equation test case.
Lemma 3.1.7 (Partial derivatives of cost function in (3.9)).Let q, n, dφN
and rNnas well as T:RRn,dφ,XRn,q, and g1,1, . . . , gr,q Rbe
given. Furthermore, let ˆ
WRnq,nq be as defined in (3.10) with ω1, . . . , ωq
R>0and symmetric positive definite matrix WRn,n. Besides, let K:Rrdφ
Rqn,rq be as defined in (3.8) and let Jfull :Rrdφ×Rrq Rdenote the cost
function in (3.9), i.e.,
Jfull(φ, a):=1
2kvec(X)K(φ)ak2
ˆ
W.
Then, Jfull is continuously differentiable and its partial derivatives are given by
φJfull(φ, a) =
q
X
i=1
ωi(Ki(φ)ai[X]i)>Wha1,iT(g1,i)··· ar,iT(gr,i)i,(3.12)
aJfull(φ, a) =
ω1K1(φ)>W(K1(φ)a1[X]1)
.
.
.
ωqKq(φ)>W(Kq(φ)aq[X]q)
>
,(3.13)
where we use the notation from (3.7).
Proof. Since Kdefined in (3.8) is a linear mapping, Jfull is not only a quadratic
function in abut also in φand, hence, continuously differentiable. Before we
compute its partial derivatives, we determine those of K, which are given by
K
[φk]j
(φ) =
K1
[φk]j(φ)
...
Kq
[φk]j(φ)
=
[T(gk,1)]je>
k...
[T(gk,q)]je>
k
(3.14)
76
3.1. Residual Minimization
for k= 1, . . . , r,j= 1, . . . , dφ, where ekdenotes the kth unit vector of the
standard basis of Rr. Consequently, using the product rule we obtain the first
set of partial derivatives of Jfull as
Jfull
[φk]j
(φ, a) = K
[φk]j
(φ)a!>ˆ
W(vec(X)K(φ)a)
=
[T(gk,1)]jak,1
.
.
.
[T(gk,q)]jak,q
>
ω1W...
ωqW
K1(φ)a1[X]1
.
.
.
Kq(φ)aq[X]q
=
q
X
i=1
ωi(Ki(φ)ai[X]i)>Wak,i[T(gk,i)]j
for k= 1, . . . , r,j= 1, . . . , dφ. The concatenation of these partial derivatives
into one row vector results in (3.12). Finally, for obtaining (3.13) we compute
aJfull(φ, a) = (vec(X)K(φ)a)>ˆ
WK(φ)
=
K1(φ)a1[X]1
.
.
.
Kq(φ)aq[X]q
>
ω1W...
ωqW
K1(φ)...
Kq(φ)
=
ω1K1(φ)>W(K1(φ)a1[X]1)
.
.
.
ωqKq(φ)>W(Kq(φ)aq[X]q)
>
.
3.1.2. Using Variable Projection
In the following, we summarize the variable projection method introduced in
[120] for separable nonlinear least squares problems of the form (3.11) and
afterwards discuss its application to the minimization problem (3.9). In [120]
the authors consider a special case of (3.11) where the weighting matrix ˜
W
equals the identity matrix. However, based on a decomposition of ˜
Wof the
form ˜
W=R>
˜
WR˜
Wwith R˜
WRk,k, the cost function in (3.11) can be written
as
1
2kbA(β)γk2
˜
W=1
2(bA(β)γ)>R>
˜
WR˜
W(bA(β)γ) = 1
2˜
b˜
A(β)γ2
with ˜
b:=R˜
Wband ˜
A:RsRk,p defined via ˜
A(η):=R˜
WA(η). Consequently,
we may assume without loss of generality ˜
W=Ikin (3.11) and apply the
approach from [120].
For a fixed βRs, the minimization problem (3.11) is a linear least squares
problem for γand, hence, the optimal value ˆγneeds to satisfy the necessary
77
3. Mode Identification
Algorithm 3.1 Evaluation of the cost function and its gradient for (3.9)
Inputs:
snapshot matrix XRn,q,n, q N
modes φ1, . . . , φrRdφwith rNn,dφN
amplitudes ai,j Rfor i= 1, . . . , r,j= 1, . . . , q
transformation family T:RRn,dφ
path values gi,j Rfor i= 1, . . . , r,j= 1, . . . , q
time weights ω1, . . . , ωqR>0
symmetric and positive definite spatial weighting matrix WRn,n
Outputs:
value of the cost function Jfull as in Lemma 3.1.7 evaluated at (φ, a) =
([φ>
1. . . φ>
r]>,[a1,1··· ar,1a1,2··· ar,2··· a1,q ··· ar,q]>)
gradient of the cost function Jfull evaluated at (φ, a)
1: Jfull 0
2: φJfull 0
3: for i1to qdo
4: RPr
j=1 T(gj,i)φjaj,i [X]i
5: Jfull Jfull +1
2ωiR>WR
6: φJfull φJfull +ωiha1,iT(g1,i)··· ar,iT(gr,i)i>WR
7: [aJfull]iωihT(g1,i)φ1··· T (gr,i)φri>WR
8: end for
9: Jfull [φJ>
full aJ>
full]>
optimality conditions given by the so-called normal equations
A>(β)A(β)ˆγ=A>(β)b, (3.15)
see for instance [111, sec. 6.2]. We note that the Hessian of the cost function
is A>(β)A(β)and, hence, symmetric and positive semi-definite. Thus, by
Theorem 2.2.6 the cost function is convex and by Theorem 2.2.5 every solution
of (3.15) is a global minimum point of the minimization problem (3.11) with
fixed β. If A(β)has full column rank, then the Gram matrix A>(β)A(β)is
even positive definite and the minimizer ˆγis uniquely determined. In general,
all solutions of the normal equations (3.15) can be characterized by
ˆγ=A(β)+b+v, (3.16)
where vker(A(β)) Rpcan be chosen arbitrarily and A(β)+denotes the
Moore–Penrose pseudoinverse of A(β), cf. section 2.1.
78
3.1. Residual Minimization
Substituting (3.16) into the minimization problem (3.11), we obtain
min
βRs
1
2bA(β)A(β)+b2
|{z }
=:Jr(β)
.(3.17)
Thus, by using the optimality condition (3.15) for γ, the minimization problem
(3.11) can be reduced to the minimization problem (3.17), where the only
remaining unknown is β. Theorem 3.1.8, which is a modified version of a
part of Theorem 2.1 from [120], establishes a relation between the solutions
of the original problem (3.11) and the reduced one (3.17). Theorem 2.1 from
[120] additionally contains a relation between the critical points of (3.11) and
(3.17), provided that Ahas constant rank and is continuously differentiable
with respect to βin an open set containing the critical points. While we do
not need that part of [120, Thm. 2.1] in this thesis, we exploit the formula for
the partial derivatives of the reduced cost function (3.17) provided in [120],
which is given by
Jr
βi
(β) = b>(IkA(β)A(β)+)A
βi
(β)A(β)+bfor i= 1, . . . , s. (3.18)
Theorem 3.1.8 (Relation between (3.11) and (3.17)).Consider the mini-
mization problems (3.11) and (3.17) with bRk,A:RsRk,p,˜
W=Ik, and
k, s, p N. Then, the following assertions hold.
(i) Let (ˆ
β, ˆγ)Rs×Rpbe a global minimum point of (3.11). Then, ˆ
βis a
global minimum point of (3.17).
(ii) Let ˆ
βbe a global or local minimum point of (3.17). Then, (ˆ
β, A(ˆ
β)+b)is
a global or local minimum point of (3.11), respectively.
Proof. A very similar version of the statements concerning the global mini-
mum points is included in Theorem 2.1 in [120], which is based on the as-
sumptions that Ais continuously differentiable and has constant rank in an
open set containing the minimum points. However, the part of the proof of
[120, Thm. 2.1] which concerns the global minimum points does not use these
additional assumptions on A. For the sake of self-containedness, we repeat the
main arguments in the following. Both directions are based on the property
bA(β)A(β)+b2 kbA(β)γk2for all γRp,(3.19)
which holds for any βRsand follows from the theory of linear least squares
problems, cf. the discussion before (3.17).
(i) Since (ˆ
β, ˆγ)Rs×Rpis a global minimum point of (3.11), we have
bA(ˆ
β)ˆγ2 kbA(β)γk2for all (β, γ)Rs×Rp.
79
3. Mode Identification
Let us assume that ˆ
βis not a global minimum point of (3.17), i.e., there
exists ˜
βRswith
bA(˜
β)A(˜
β)+b2<bA(ˆ
β)A(ˆ
β)+b2.
However, due to (3.19) this implies
bA(˜
β)A(˜
β)+b2<bA(ˆ
β)ˆγ2,
which is a contradiction to the assumption that (ˆ
β, ˆγ)is a global minimum
point of (3.11). Thus, ˆ
βhas to be a global minimum point of (3.17).
(ii) If ˆ
βis a local minimum point of (3.17), then there exists R>0with
bA(ˆ
β)A(ˆ
β)+b2bA(β)A(β)+b2for all βB(ˆ
β),(3.20)
where B(ˆ
β)denotes the open ball in Rswith radius and center ˆ
β. Let
us assume that (ˆ
β, A(ˆ
β)+b)is not a local minimum point of (3.11), i.e.,
for any ˜R>0there exists (˜
β, ˜γ)Rs×Rpwith
bA(˜
β)˜γ2<bA(ˆ
β)A(ˆ
β)+b2and "˜
βˆ
β
˜γA(ˆ
β)+b#<˜.
However, due to (3.19) this implies
bA(˜
β)A(˜
β)+b2<bA(ˆ
β)A(ˆ
β)+b2.
Thus, for instance by choosing ˜=, this yields a contradiction to (3.20)
since we also have
˜
βˆ
β"˜
βˆ
β
˜γA(ˆ
β)+b#<˜=.
Consequently, (ˆ
β, A(ˆ
β)+b)needs to be a local minimum point of (3.11).
The proof for global minimum points follows the same lines by omitting
the restrictions to neighborhoods.
Theorem 3.1.8 provides a theoretical justification for replacing the full opti-
mization problem (3.11) by the reduced one (3.17). The assertions (i) and (ii)
establish a one-to-one correspondence between the global minimum points of
the two optimization problems, except for a potential non-uniqueness of the
γcomponent of the original problem (3.11). Furthermore, the local minimum
point property from (ii) implies that the reduced optimization problem (3.17)
has only local minimum points which correspond to local minimum points of
the original problem (3.11). Example 3.1.9 illustrates that the converse in gen-
80
3.1. Residual Minimization
eral is not true, i.e., there may be local minimum points of the original problem
which do not correspond to local minimum points of the reduced problem. In
summary, this means that when we determine a global or local minimum point
of the reduced problem (3.17), we can be sure that it also corresponds to a
global or local minimum point of the original problem. Furthermore, as il-
lustrated in Example 3.1.9, the reduced optimization problem (3.17) may in
general have less local minimum points than the original problem. This is in
general an advantage when using for instance gradient-based methods for com-
puting a minimum point, since there might be less local minimum points where
the solver may get stuck. The only difficulty which may arise when going from
the original to the reduced problem is that the reduced problem may have in
general more points where the cost function is not differentiable, namely at
points where Achanges rank. In the context of the optimization problem (3.9)
this happens, for instance, at points where the transformed modes become lin-
early dependent, since this leads to a rank drop in K(φ). In fact, in this case
the cost function of the reduced problem may even be discontinuous as illus-
trated in Example 3.1.10, see also Example 3.1.9. This is also reflected in the
fact that the cost function in (3.17) coincides up to the prefactor 1
2with the
squared norm of the orthogonal projection of bonto the kernel of A(β)>. When
the rank of Achanges, then also the dimension of the kernel of A>changes,
which may in turn result in a discontinuity of the cost function. Thus, in order
to numerically solve the reduced nonlinear least squares problem (3.17), it may
be advantageous to use solvers which may handle non-smooth cost functions,
see for instance [75] and the references therein.
Example 3.1.9 (Local minimum points of (3.11) and (3.17)).To illustrate
that the statement (i) in Theorem 3.1.8 may in general not be extended to
local minimum points, we consider the special case given by k=s=p= 1,
b= 1,˜
W= 1, and A:RRwith A(β):=β2. The corresponding cost
function of the full optimization problem (3.11) reads
Jf(β, γ) = 1
21β2γ2.
In particular, it has a local minimum point at (0,1), since for all (β, γ)
B1(0,1), where B1(0,1) denotes the open ball in R2with radius 1and
center (0,1), we have
Jf(β, γ) = 1
21β2γ21
2=Jf(0,1).
On the other hand, the corresponding cost function of the reduced problem
(3.17) reads
Jr(β) =
1
2,if β= 0,
0,otherwise
and hence has no local minimum point at 0. This demonstrates that Theo-
81
3. Mode Identification
rem 3.1.8(i) may in general not be extended to local minimum points. l
Example 3.1.10 (Discontinuous cost function).We consider the cost function
J:R4Rdefined via
J(φ):=1
2I2K(φ)K(φ)+vec(X)2
as a special case for the reduced cost function corresponding to (3.9), where
XR2and K:R4R2,2are given by
X= vec(X) = "1
0#and K(φ):="φ1φ3
φ2φ4#.
In particular, we investigate the continuity at ˆ
φ= [0 1 0 1]>as an example
for a point where K(ˆ
φ)is rank-deficient and where Xis not contained in the
column span of K(ˆ
φ). The limit considerations
lim
0Jh1 0 1i>=1
2lim
0
I2"0
1 1#"0
1 1#+
"1
0#
2
= 0,
lim
0Jh0 1 + 0 1i>=1
2lim
0
I2"0 0
1 + 1#" 0 0
1 + 1#+
"1
0#
2
=1
2lim
0 I2"0 0
0 1#!"1
0#
2
=1
2
reveal that limφˆ
φJ(φ)does not exist and, thus, Jis not continuous at ˆ
φ.l
In the remainder of this subsection, we present an algorithm for computing
the cost function and the gradient of (3.17), as these two ingredients are usually
required in gradient-based optimization solvers. Since we aim to determine
a solution of the minimization problem (3.9) in the end, we are especially
interested in the special case
b=Rˆ
Wvec(X), A =Rˆ
WK, (3.21)
where Rˆ
WRnq,nq is given by
Rˆ
W=
ω1...ωq
RW
and RWRn,n is chosen such that W=R>
WRWis satisfied. In particular,
Rˆ
Wthen satisfies ˆ
W=R>
ˆ
WRˆ
W.
The cost function in (3.17) can be further simplified as demonstrated in
Lemma 3.1.11. In the special case (3.21), the reduced version of (3.9) is hence
82
3.1. Residual Minimization
equivalent to minimizing
JvarPro(φ):=1
2vec(X)>R>
ˆ
WRˆ
WK(φ) (Rˆ
WK(φ))+Rˆ
Wvec(X)
=1
2vec(X)>ˆ
WK(φ) (Rˆ
WK(φ))+Rˆ
Wvec(X)
=1
2
q
X
i=1
ωi[X]>
iWKi(φ)
|{z }
=:[B(φ)]>
i
(ωiRWKi(φ))+ωiRW[X]i
|{z }
=:[C(φ)]i
=1
2hB(φ), C(φ)iF.
(3.22)
Here, ,·iFdenotes the Frobenius inner product, cf. section 2.1, and moreover
we used the fact that the Moore–Penrose pseudoinverse of a block diagonal ma-
trix may be obtained by computing the pseudoinverses of the diagonal blocks,
see for instance [57, Thm. 3.4.1]. In addition, we note that the computation
of Cstill involves computing the pseudoinverses of the matrices ωiRWKi(φ),
which typically involves computing SVDs of ωiRWKi(φ)for i= 1, . . . , q,
cf. [57, ch. 12]. However, these matrices are usually tall and skinny and, thus,
using Lemma 3.1.12 allows us to significantly reduce the computational effort
by instead calculating eigenvalue decompositions of the r×rmatrices
(ωiRWKi(φ))>ωiRWKi(φ) = ωiKi(φ)>WKi(φ)
for i= 1, . . . , q. The calculation of the gradient of JvarPro is carried out based
on (3.18), which together with (3.21) and (3.14) yields
JvarPro
[φi]j
(φ)
= vec(X)>R>
ˆ
WRˆ
WK(φ) (Rˆ
WK(φ))+Rˆ
W
K
[φi]j
(φ) (Rˆ
WK(φ))+Rˆ
Wvec(X)
vec(X)>R>
ˆ
WRˆ
W
K
[φi]j
(φ) (Rˆ
WK(φ))+Rˆ
Wvec(X)
= vec(X)>R>
ˆ
WRˆ
WK(φ) (Rˆ
WK(φ))+>Rˆ
W
K
[φi]j
(φ) (Rˆ
WK(φ))+Rˆ
Wvec(X)
vec(X)>ˆ
WK
[φi]j
(φ) (Rˆ
WK(φ))+Rˆ
Wvec(X)
=(Rˆ
WK(φ))+Rˆ
Wvec(X)>K(φ)>ˆ
WK
[φi]j
(φ) (Rˆ
WK(φ))+Rˆ
Wvec(X)
vec(X)>ˆ
WK
[φi]j
(φ) (Rˆ
WK(φ))+Rˆ
Wvec(X)
=
q
X
k=1
ωk(Kk(φ)[C(φ)]k[X]k)>W[T(pi,k)]je>
i[C(φ)]k
83
3. Mode Identification
=
q
X
k=1
ωk[C(φ)]i,k[T(pi,k)]>
jW(Kk(φ)[C(φ)]k[X]k).
The details of the computational steps for evaluating JvarPro and its gradient
are summarized in Algorithm 3.2. Especially, we note that the algorithm only
requires the weighting matrix W, whereas the factor RWis not needed.
Lemma 3.1.11 (Alternative formulation of (3.17)).For given bRkand
A:RsRk,p with k, p, s N, the vector ˆ
βRsis a solution of the mini-
mization problem (3.17) if and only if it is a solution of
min
βRs1
2b>A(β)A(β)+b.(3.23)
Proof. Using the general identities (CC+)>=CC+and CC+C=C, which
hold for any real-valued matrix C, cf. [57, Thm. 1.1.1], we obtain
1
2bA(β)A(β)+b2
=1
2kbk2b>A(β)A(β)+b+1
2b>(A(β)A(β)+)>A(β)A(β)+b
=1
2kbk2b>A(β)A(β)+b+1
2b>A(β)A(β)+A(β)A(β)+b
=1
2kbk21
2b>A(β)A(β)+b.
This calculation shows that the cost functions in (3.17) and (3.23) only differ
by the constant 1
2kbk2, which yields the claim.
Lemma 3.1.12 (Pseudoinverse in terms of an eigenvalue decomposition).
Let ARm,n with m, n Nbe given with r:= rank(A). Furthermore, let
A>A=USU>be an eigenvalue decomposition of A>Awith orthogonal ma-
trix U= [U1U2]Rn,n with U1Rn,r, U2Rn,nrand diagonal matrix
S= diag(S1,0) Rn,n with S1Rr,r. Then, the Moore–Penrose pseudoin-
verse of Ais given by A+=U1S1
1U>
1A>.
Proof. By [57, Thm. 1.1.1], A+is the Moore–Penrose pseudoinverse of Aif
and only if AA+and A+Aare symmetric and AA+A=Aand A+AA+=A+
are satisfied. The first property holds since AU1S1
1U>
1A>is symmetric and
the second one follows from
U1S1
1U>
1A>A=U1S1
1U>
1U1S1U>
1=U1U>
1=U1U>
1>.
For the other two properties, we exploit the fact that the definition of U1
implies im(U1) = im(A>A) = im(A>)and, thus, obtain
AU1S1
1U>
1A>A=AU1S1
1U>
1U1S1U>
1=AU1U>
1=A,
U1S1
1U>
1A>AU1S1
1U>
1A>=U1S1
1U>
1U1S1U>
1U1S1
1U>
1A>=U1S1
1U>
1A>,
84
3.1. Residual Minimization
which concludes the proof. We note that the claim may alternatively be shown
using the construction of the Moore–Penrose pseudoinverse via an SVD, see
for instance [28, Cor. 6.2.1].
Algorithm 3.2 Evaluation of the cost function (3.22) and its gradient
Inputs:
snapshot matrix XRn,q with n, q N
modes φ1, . . . , φrRdφwith rNn,dφN
transformation family T:RRn,dφ
path values gi,j Rfor i= 1, . . . , r,j= 1, . . . , q
time weights ω1, . . . , ωqR>0
spatial weighting matrix WRn,n with W=W>>0
Outputs:
value of the cost function JvarPro evaluated at φ:= [φ>
1. . . φ>
r]>
gradient of the cost function JvarPro evaluated at φ
1: JvarPro(φ)0
2: for i1to qdo
3: ˜
KhT(g1,i)φ1··· T (gr,i)φri
4: [B]iωi˜
K>W[X]i
5: Compute an eigendecomposition ωi˜
K>W˜
K=USU>with orthogonal
URr,r and diagonal SRr,r with descending diagonal entries
6: Determine the rank of S
7: ˜
Uh[U]1··· [U]rank(S)i
8: ˜
Sdiag([S]1,1,[S]2,2,...,[S]rank(S),rank(S))
9: Solve the linear equation system ˜
S˜a=˜
U>[B]ifor ˜a
10: [C]i˜
U˜a
11: vωih[C]1,iT(g1,i)··· [C]r,iT(gr,i)i>W(˜
K[C]i[X]i)
12: JvarPro(φ) JvarPro(φ) + v
13: end for
14: JvarPro(φ) 1
2hB, CiF
Remark 3.1.13 (Alternative algorithms for solving separable least squares prob-
lems).We note that there are also efficient implementations of the variable
projection algorithm available in the literature, see for instance [221]. Never-
theless, the corresponding implementations do usually not exploit the block
structure as it occurs in the specific separable nonlinear least squares problem
(3.9). An adaptation of the algorithms available in the literature to this block-
structured setting would be certainly of interest and may have the potential
to further speed up the computations. Also contributions from the literature
about bilinear least squares problems could perhaps be used for developing
85
3. Mode Identification
more efficient algorithms, cf. [18, 91]. However, such considerations are not
within the scope of this thesis. ¨
In summary, we conclude that the variable projection approach allows to
replace the optimization problem (3.9) with r(dφ+q)unknowns by a reduced
optimization problem where the number of optimization parameters is rdφ. A
drawback of the variable projection is that it may introduce points where the
cost function is discontinuous, which may result in a need for employing opti-
mization techniques that can also handle non-smooth problems. A numerical
comparison of the two approaches is presented in section 6.1 by means of a
wave equation test case.
3.2. Greedy Algorithm based on Transformed
Modes
While we discussed in the previous section mainly the case where the trajec-
tory xonly depends on time but not additionally on a parameter vector, we
focus in this section on the case where xoriginates from a time- and parameter-
dependent FOM as in (1.6). More precisely, the goal of this section is to extend
the POD-greedy algorithm as presented in section 2.5.2 to the setting of trans-
formed modes. To this end, we use the approximation ansatz (1.8) with x,ˆx,
pi, and αi,j depending on tand on a parameter vector µMfor j= 1, . . . , ri,
i= 1, . . . , nt. Furthermore, we assume for simplicity that the considered time
interval Idoes not depend on µ. In particular, we propose Algorithm 3.3 and
discuss its properties as well as its peculiarities in comparison to the standard
POD-greedy algorithm, cf. Algorithm 2.1, in the following.
Algorithm 3.3 Greedy algorithm based on transformed modes
Inputs:
FOM as in (1.6) with X=W=Rn,MRnp,n, npN
discrete time points t1, . . . , tqIwith 0 = t1< t2< . . . < tq=tend
and qN2
parameter training set Mtrain Mwith |Mtrain|<
transformation family T:RRn,dφwith dφN
offline path estimates ˆp:{t1, . . . , tq}×Mtrain Rntwith ntN
initial ansatz vectors
Φ0= ({φ1,1, . . . , φ1,r1,0}, . . . , {φnt,1, . . . , φnt,rnt,0})(Pf(Rdφ))nt
with r1,0, . . . , rnt,0N{0}and Pnt
j=1 rj,0n
error estimator ε:Mtrain ×(Pf(Rdφ))ntR0
error tolerance tol R>0
86
3.2. Greedy Algorithm based on Transformed Modes
maximum number of iterations imax N
spatial weighting matrix W=W>>0Rn,n
time weights ω1, . . . , ωqR>0
Output:
ansatz vectors
Φk= ({φ1,1, . . . , φ1,r1,k }, . . . , {φnt,1, . . . , φnt,rnt,k })(Pf(Rdφ))nt
with kN {0}and r1,k Nr1,0,. . .,rnt,k Nrnt,0, satisfying
max
µMtrain
ε(µ, Φk)tol or k=imax
1: for i1to imax do
2: Evaluate the error estimator ε(·,Φi1)on Mtrain
3: if maxµMtrain ε(µ, Φi1)tol then return Φi1
4: end if
5: Solve the FOM for a parameter value µmax arg max
µMtrain
ε(µ, Φi1)
6: for `1to qdo
7: Project the FOM snapshot x(t`;µmax)onto the orthogonal comple-
ment of
span nT(ˆp1(t`;µmax)) φ1,1, . . . , T(ˆp1(t`;µmax)) φ1,r1,i1, . . . ,
T(ˆpnt(t`;µmax)) φnt,1, . . . , T(ˆpnt(t`;µmax)) φnt,rnt,i1o
and denote the projected snapshot as v`
8: end for
9: for `1to ntdo
10: Solve the minimization problem ˜
φ`arg min
φRdφ
J`(φ)with
J`(φ):=
q
X
j=1
ωj
2InT (ˆp`(tj;µmax)) φ(T(ˆp`(tj;µmax)) φ)+vj2
W
11: end for
12: Determine `opt arg min
`∈{1,...,nt}
J`(˜
φ`)
13: Set r`opt,i :=r`opt,i1+ 1 and r`,i :=r`,i1for all ` {1, . . . , nt}\{`opt}
14: Add φ`opt,r`opt,i :=˜
φ`opt to Φi1and thereby obtain Φi
15: end for
The major differences between Algorithm 3.3 and the POD-greedy algorithm
presented in section 2.5.2 are as follows. First, we note that the list of input
parameters in Algorithm 3.3 differs from the one in Algorithm 2.1. For ex-
ample, since the approximation ansatz (1.8) does not only involve modes and
87
3. Mode Identification
corresponding coefficients but also transformation operators parametrized by
paths, Algorithm 3.3 requires a transformation family as well as estimators
for the paths as additional input parameters. Especially, the paths may for
instance be estimated based on snapshot data as mentioned in Remark 3.1.2.
Another difference to the input parameters of Algorithm 2.1 is that the modes
in Algorithm 3.3 are clustered according to the different transformation opera-
tors, which is due to the special structure of the ansatz (1.8) and results in two
indices for the modes. Moreover, Pf(Rdφ)in Algorithm 3.3 denotes the set of
finite subsets of Rdφ, cf. section 2.1. Besides, in contrast to Algorithm 2.1, the
initial set of ansatz vectors in Algorithm 3.3 does not need to be orthonormal,
see also Example 3.1.1.
Most of the steps of Algorithm 3.3 are the same or similar as the ones in
Algorithm 2.1. As mentioned in section 2.5.2, evaluating the error estimator
usually involves evaluating the ROM and, as outlined in chapter 4, the con-
struction and evaluation of a ROM based on the approximation ansatz (1.8)
is more involved than the classical MOR approach addressed in section 2.5.
Another difference between Algorithms 2.1 and 3.3 is the projection step 7,
since the basis used for the approximation in Algorithm 3.3 depends on the
time-dependent paths and, thus, each snapshot has to be projected separately
onto a different subspace. Furthermore, the determination of the mode to be
added involves solving ntoptimization problems and is, thus, often computa-
tionally more demanding than computing an SVD as in the last step of Algo-
rithm 2.1. Especially, in lines 9–11 of Algorithm 3.3, for each reference frame
it is determined how well the projected snapshots may be approximated by
one transformed mode via solving an optimization problem as in section 3.1.2.
Afterwards, those different approximation errors are compared in line 12 and
then in line 14 only one mode is added which is the most suitable for approx-
imating the projected snapshot matrix. In principle, one may also consider
alternative algorithms where several modes are added in each iteration, for in-
stance one for each reference frame. All in all, Algorithm 3.3 may be regarded
as a generalization of Algorithm 2.1.
Remark 3.2.1 (Special cases of Algorithm 3.3).Comparing Algorithms 2.1
and 3.3 reveals that a particularly important special case of the latter al-
gorithm is given if
(i) the dimension of the modes dφcoincides with the FOM dimension n,
(ii) the transformations are trivial in the sense that T(η) = Inholds for all
ηR,
(iii) the number of paths ntequals one,
(iv) the initial ansatz vectors φ1,1, . . . , φ1,r1,0form an orthonormal basis, and
(v) the spatial weighting matrix Wis chosen as Inand the time weights are
all equal to one.
88
3.2. Greedy Algorithm based on Transformed Modes
In this special case, we note that the optimization problem in line 10 reduces
to a POD optimization problem of the form (2.16) but without normalization
constraint for the mode. Thus, a particular solution is given by the leading
left singular vector of the projected snapshot matrix and, if this solution is
chosen, then Algorithms 2.1 and 3.3 yield the same output. In this sense,
the POD-greedy algorithm is included in Algorithm 3.3 as a special case, i.e.,
Algorithm 3.3 generalizes Algorithm 2.1. Moreover, we note that (v) may be
dropped if the SVD in the last step of Algorithm 2.1 is modified such that
a spatial weighting matrix and time weights are taken into account, see for
instance [127, p. 51]. Furthermore, in [131, Rem. 2.93] it is mentioned that
the POD-greedy algorithm contains the POD method as a special case where
only one parameter sample is considered. Similarly, one may ask the question
whether Algorithm 3.3 provides a generalization of the procedure presented
in section 3.1. However, this is not the case, because in section 3.1 all modes
are optimized simultaneously and this in general is not equivalent to a greedy
procedure, where the modes are added one after another, cf. Example 3.2.2.
Thus, Algorithm 3.3 with only one parameter sample rather resembles the
greedy algorithm introduced in [242], see also section 3.4 for a brief discussion
of this approach. ¨
Example 3.2.2 (Simultaneous vs. greedy optimization).We consider a special
case of the optimization problem (3.9) with r= 2,dφ=q=n= 3, snapshot
matrix
X=
5 2 0
2 6 2
0 2 7
,(3.24)
and transformations
T(g1,1) = T(g1,2) = T(g1,3) = T(g2,2) = I3,
T(g2,1) =
1 0 0
0 0 1
0 1 0
,T(g2,3) =
0 1 0
1 0 0
0 0 1
.
Especially, we observe that Xmay be written as
X=
2
X
i=1 hT(gi,1)φiai,1T(gi,2)φiai,2T(gi,3)φiai,3i
with φ1= [0 1 0]>,φ2= [1 0 1]>,a1,1=3,a1,2= 6,a1,3=5,a2,1= 5,
a2,2= 2, and a2,3= 7. This shows that optimal modes are given by φ1and
φ2and that this leads to an approximation error of zero. In the following, we
compare this to a greedy-type optimization, where the modes are determined
89
3. Mode Identification
one after another. To this end, we first consider the optimization problem
min
φ1R3, aR3
1
2
vec(X)
T(g1,1)φ10 0
0T(g1,2)φ10
0 0 T(g1,3)φ1
a1
a2
a3
2
.
Since T(g1,i)coincides with the identity matrix for i= 1,2,3, this is a dis-
cretized version of a POD optimization problem of the form (2.15) without
normalization constraint for the mode. Hence, a solution may be obtained by
computing a truncated SVD of Xleading to the optimal mode
φ1=1
3
1
2
2
.
Next, we project the snapshot matrix onto the orthogonal complement of the
span of φ1and obtain the projected snapshot matrix
V=
4 0 2
0 2 2
22 3
.
This matrix has rank two and, hence, adding another mode to the refer-
ence frame corresponding to the transformation matrices T(g1,1) = T(g1,2) =
T(g1,3) = I3would not result in a vanishing approximation error. If we instead
consider the other reference frame, the corresponding minimization problem
reads
min
φ2R3, aR3
1
2
vec(V)
T(g2,1)φ20 0
0T(g2,2)φ20
0 0 T(g2,3)φ2
a1
a2
a3
2
.
Since T(g2,i)is orthogonal for i= 1,2,3, we may use similar arguments as
after Remark 3.1.4 to infer that this minimization problem is equivalent to
min
φ2R3, aR3
1
2
vec( ˜
V)
φ20 0
0φ20
0 0 φ2
a1
a2
a3
2
(3.25)
with transformed snapshot matrix
˜
V=hT(g2,1)>[V]1T(g2,2)>[V]2T(g2,3)>[V]3i=
4 0 2
2 2 2
02 3
.
Especially, the solution of (3.25) may be obtained by computing a truncated
SVD of ˜
V. However, since the rank of ˜
Vis two, it may not be described by just
90
3.2. Greedy Algorithm based on Transformed Modes
one mode and, thus, in total the greedy procedure does not lead to a perfect
approximation of Xwith just two modes. Especially, we emphasize that this is
also true if we would not have started by adding a mode to the reference frame
corresponding to T(g1,1) = T(g1,2) = T(g1,3) = I3but instead by adding
a mode to the other reference frame described by T(g2,1),T(g2,2),T(g2,3).
We omit the corresponding calculations here, but nevertheless point out that
this example demonstrates that a simultaneous optimization of the modes in
general is not equivalent to a greedy procedure as in Algorithm 3.3. l
Remark 3.2.3 (Choice of the training set).The quality of the basis constructed
by Algorithm 3.3 depends in particular on the training set Mtrain. If this set
has too few elements or if these elements are poorly selected, it may be not
representative for the parameter domain M. In this case, the approxima-
tion quality of a ROM, which is constructed based on the modes determined
by Algorithm 3.3, may be low for parameter values which lie outside of the
training set. On the other hand, if the parameter training set is too large,
this may result in a high computational cost for Algorithm 3.3. This may in
particular happen when the number of parameters npis large, since then a
high-dimensional space needs to be sampled. Therefore, some general strate-
gies have been proposed for modifying the training set during the greedy algo-
rithm, see for instance [132, 141, 263], or [131, Rem. 2.52] for a brief summary
of these methods. Most of these strategies are independent of the approxima-
tion ansatz and, thus, may also be applied to modify the training set used for
Algorithm 3.3. ¨
As mentioned in section 2.5.2, the classical POD-greedy algorithm can be
shown to converge algebraically or exponentially under certain assumptions.
The key assumption for this is that the Kolmogorov n-widths, cf. Exam-
ple 1.2.1, decay algebraically or exponentially. Since the latter ones are specific
to linear approximation ansatzes, the convergence theory cannot be straight-
forwardly extended to Algorithm 3.3, which is based on the nonlinear approxi-
mation ansatz (1.8). This motivates for studying extensions of the Kolmogorov
n-widths to more general approximation ansatzes, see for instance [72, 243] for
some first results in this direction. A better understanding of such extended
n-widths may lead to a rigorous convergence analysis of Algorithm 3.3, but
this is subject to future work.
We note that in general there is no guarantee that the maximum error
based on the error indicator εdecreases monotonously, i.e., there may exist an
i {0, . . . , imax 1}with
max
µMtrain
ε(µ, Φi)<max
µMtrain
ε(µ, Φi+1).
This also applies to the classical greedy algorithm, cf. [131]. However, since
ansatz vectors are only added in every iteration of Algorithm 3.3 but never
removed, we infer that at least in the special case where we choose the squared
optimal nonlinear projection error εopt :Mtrain ×(Pf(Rdφ))ntR0defined
91
3. Mode Identification
via
εopt µ, ({φ1,1, . . . , φ1,r1}, . . . , {φnt,1, . . . , φnt,rnt})
:= inf
α1,1,...,αnt,rnt, p1,...,pntL2(I)x(·;µ)
nt
X
i=1 T(pi)
ri
X
j=1
αi,jφi,j
2
L2(I,Rn)
(3.26)
as error indicator, the maximum error cannot increase between two iterations
of Algorithm 3.3. However, it is in general not clear how to compute this
optimal projection error. Moreover, there is in general no guarantee that the
reduced-order models discussed in chapter 4 yield coefficients and paths which
lead to an optimal L2(I,Rn)error as in (3.26). The investigation of special
cases where such an optimality may be proven is left for future research.
3.3. Boundary Treatment
The considerations in sections 3.1 and 3.2 are based on a rather general family
of transformation operators Tand a particular example is given by the family
of periodic shift operators Tper as introduced in Definition 1.2.2. Such periodic
shift operators are well-suited for systems with periodic boundary conditions
as demonstrated for the linear wave equation in Example 1.2.3. In principle,
a mode decomposition of the form (3.1) with T=Tper can also be used for
transport-dominated systems with arbitrary boundary conditions. However,
the following example illustrates that periodic shift operators may be inappro-
priate for systems with non-periodic boundary conditions.
Example 3.3.1 (Advection equation with homogeneous Dirichlet boundary
conditions).We consider the linear advection equation as in (1.9) with posi-
tive c, but instead of periodic boundary conditions we consider homogeneous
Dirichlet boundary conditions at the left boundary, i.e., x(t, a) = 0 for all
tI. Furthermore, instead of the initial value considered in Example 1.2.1,
we choose x0:Rdefined via
x0(ξ):=
1
2(ξξm)2+2
(ξξm)+1,if ξ[ξm, ξm],
4
3(ξξm)37
2(ξξm)2+2
(ξξm)+1,if ξ[ξm, ξm+],
0,otherwise
with ξm:= (a+b)/2and (0, ξm). Especially, x0is constructed such that it
is continuously differentiable and has compact support. By similar arguments
as in Example 1.2.1, we obtain the analytical solution x:I×Rof the
corresponding initial-boundary value problem as
x(t, ξ) =
x0(ξct),if ξct ,
0,otherwise.
92
3.3. Boundary Treatment
0 0.5 1
0
0.5
1
ξ
t
0 0.5 1
0
0.5
1
ξ
t
0
0.5
1
x(t, ξ)
Figure 3.1.: Example 3.3.1: pseudocolor plots of the analytical solution (left) and a cor-
responding approximation with one transformed mode based on the family of
periodic shift operators (right).
A pseudocolor plot of the analytical solution is depicted in Figure 3.1, left,
for = 0.1,c= 1,a= 0,b= 1, and tend = 1. It is simply given by a
time-dependent shift of the initial value to the right until the wave leaves the
computational domain through the right boundary. Based on the approach
presented in section 3.1, we compute an optimal approximation using one
transformed mode with T=Tper and p(t) = tfor all tI. The corresponding
approximation is depicted in Figure 3.1, right. Although the moving wave
profile is captured well, some parasitic structures occur at the time when the
wave reaches the boundary. This illustrates that the analytical solution cannot
be described by just one transformed mode when using the family of periodic
shift operators Tper.l
Example 3.3.1 indicates that the choice of the transformation family should
ideally depend on the boundary conditions of the problem. In the special case
of periodic boundary conditions, the family of periodic shift operators Tper is a
natural choice, but this is not necessarily true for other boundary conditions.
In general, translation or shift operators on bounded domains require a certain
rule for the treatment of values which lie outside of the computational domain.
This is schematically depicted in Figure 3.2 for an exemplary mode. After the
mode is shifted, here by an amount of 0.25, there is a region where the values
are unknown, since the mode is only defined in the computational domain
(0,1) but not outside. In the following, we propose various shift operators
which differ in their treatment of filling the gray area in Figure 3.2, right.
3.3.1. Extended Domain
Before we propose an alternative to the family of periodic shift operators, we
start by drawing inspiration from another example.
Example 3.3.2 (Linear wave equation with reflecting and outlet boundary
conditions).We consider the linear acoustic wave equation as in Example 1.2.3,
93
3. Mode Identification
0 0.25 0.5 0.75 1
3
2
1
0
ξ
φ(ξ)
0 0.25 0.5 0.75 1
3
2
1
0
ξ
φ(ξ0.25)
Figure 3.2.: An exemplary mode (left) and its shifted analogue (right), where the gray
area indicates a region of undetermined values.
but instead of periodic boundary conditions we assume
v(t, a) = 0, (t, b) = ρref v(t, b)for all tI.(3.27)
Furthermore, the initial values are assumed to satisfy the compatibility condi-
tions
v0(a) = 0, ρ0
0(a) = 0, 0(b) = ρref v0(b), 0
0(b) = ρrefv0
0(b).(3.28)
Then, similarly as in Example 1.2.3, the analytical solution is given by
x(t, ξ):="ρ(t, ξ)
v(t, ξ)#="ρref
c#ϑr(ξct) + "ρref
c#ϑl(ξ+ct),(3.29)
where ϑr:RRand ϑl:RRare functions determined via the initial
and boundary conditions. More precisely, the values of ϑrand ϑlin are
determined by the initial values via
"ϑr(ξ)
ϑl(ξ)#=1
2ρrefc"c ρref
cρref#"ρ0(ξ)
v0(ξ)#for all ξ.(3.30)
This follows directly from (3.29). Moreover, using the boundary conditions
(3.27), we obtain
ϑr(act) = ϑl(a+ct), ϑl(b+ct) = 0 for all tI.(3.31)
These considerations reveal that the boundary conditions are chosen such that
waves are reflected at the left boundary and leave the computational domain
at the right boundary. Moreover, (3.31) uniquely determines ϑlon [b, b+ctend]
and ϑron [actend, a). In addition, ϑr(a),ϑl(a), and ϑr(b)result from (3.30)
and (3.31) by enforcing ϑrand ϑlto be continuous. Especially, we note that
the compatibility conditions (3.28) ensure that ϑrand ϑlare even continuously
differentiable. Furthermore, since the considered time interval I= [0, tend]is
94
3.3. Boundary Treatment
finite, the values of ϑrin [actend, b]and ϑlin [a, b +ctend]are sufficient to
describe the analytical solution in I×, cf. (3.29). l
In Example 3.3.2 the analytical solution is given by the sum of two shifted
modes which correspond to [ρref c]>ϑrand [ρref c]>ϑl, respectively. Espe-
cially, we note that the fact that the computational domain is bounded does
not interfere with the shifting operation, since the functions ϑrand ϑlare not
only defined on the computational domain but on an extended domain. In
Example 3.3.2, this extended domain is given by R, since the considered spatial
domain is one-dimensional, but in general it may be a subset of Rdwhere d
corresponds to the dimension of the spatial domain. For simplicity we restrict
ourselves to the case d= 1 in the following.
Inspired by the observation that in Example 3.3.2 the Riemann invariants ϑr
and ϑlare defined on an extended domain, we propose the following approach,
see also [240] for a similar idea. Instead of modes which are only defined on the
computational domain = (a, b), we consider modes defined on an extended
domain, i.e.,
x(t)
r
X
i=1
αi(t)Te(pi(t)) φi
with φ1, . . . , φrL2(Ωe). To be consistent with the general family of transfor-
mation operators T, which allows input arguments in b
P=R, cf. section 1.1,
we consider in the following e=Ras extended domain. However, we empha-
size that the values of the paths are often bounded, which allows to consider
a smaller extended domain of the form
b
e= asup
(k,t)∈{1,...,rI
pk(t), b inf
(k,t)∈{1,...,rIpk(t)!,(3.32)
see section 6.2 for a corresponding numerical example. In principle, it is also
possible to define extended domains for each mode separately, which may allow
to use smaller extended domains in comparison to the one introduced in (3.32).
This is for instance used in Example 3.3.2, where it is mentioned that it would
be sufficient to define ϑrand ϑlon [actend, b]and [a, b +ctend], respectively.
The extended domain shift operator Te(η): L2(R)L2(Ω) with ηRis
defined in Definition 3.3.3 and the fact that Te(η)is a linear bounded operator
is formalized in Theorem 3.3.4. Furthermore, an example for the action of
the shift operator Te(η)with η= 0.25 is depicted in Figure 3.3. Finally, in
Example 3.3.5 the use of the family of shift operators Teis demonstrated by
means of an example with inhomogeneous Dirichlet boundary conditions.
Definition 3.3.3 (Extended domain shift operator).For given Ω=(a, b)with
aRand bR>a, we define the family of extended domain shift operators
Te:R L(L2(R), L2(Ω)) via Te(η)f=g, where, for given ηRand f
95
3. Mode Identification
0 0.5 1
3
2
1
0
ξ
φ(ξ)
0 0.5 1
3
2
1
0
ξ
(Te(0.25)φ)(ξ)
Figure 3.3.: An exemplary mode defined on the extended domain e=R(here depicted
on the domain b
e= (0.25,1), left) and its shifted analogue on = (0,1)
(right).
L2(R),gis the unique element in L2(Ω) satisfying
Z
|g(ξ)f(ξη)|dξ= 0.K
Theorem 3.3.4 (Boundedness of the extended domain shift operator).Let
ηRbe given and let and Te(η)be as in Definition 3.3.3. Then, Te(η)is a
linear bounded operator.
Proof. Let fL2(R)be arbitrary and let ˆ
f:RRbe a representative
of the equivalence class f. From Definition 3.3.3 it follows that Te(η)f
L2(Ω) is given by the equivalence class of the function ˆg: Rdefined
via ˆg(ξ):=ˆ
f(ξη). Thus, ˆgdepends linearly on ˆ
fand this property also
translates to the corresponding equivalence classes. This shows that Te(η)is
linear. Furthermore, we have
kTe(η)fk2
L2(Ω) =
b
Za|(Te(η)f)(ξ)|2dξ=
b
Za|f(ξη)|2dξ=
bη
Z
aη|f(ξ)|2dξ
Z
R|f(ξ)|2dξ=kfk2
L2(R),
which proves the boundedness of Te(η).
Example 3.3.5 (Advection equation with inhomogeneous Dirichlet boundary
conditions).We revisit the linear advection equation (1.9) from Example 1.2.1
with c > 0and, instead of periodic boundary conditions, we consider Dirichlet
boundary conditions of the form
x(t, a) = g(t)for all tI,
96
3.3. Boundary Treatment
where gC1(R0)is assumed to satisfy the consistency conditions g(0) =
x0(a)and ˙g(0) = x0
0(a). Following similar arguments as in Example 1.2.1, we
infer that the analytical solution x:I×Rof the corresponding initial-
boundary value problem is given by
x(t, ξ) =
x0(ξct),if ξct ,
gt1
c(ξa),otherwise.
Using an approximation ansatz of the form (1.4) and Teas transformation
family, we observe that the analytical solution coincides with the corresponding
approximation based on r= 1,φ1:RRdefined via
φ1(ξ) =
x0(ξ),if ξ,
g1
c(ξa),if ξ[actend, a),
0,otherwise,
α1(t)=1, and p1(t) = ct for all tI.l
3.3.2. Zero Padding
A simple solution for the problem depicted in Figure 3.2 is to just fill the
area of unknown values with zeros. The corresponding family of zero padding
shift operators T0is introduced in the following definition and illustrated in
Figure 3.4. The fact that T0(η)is indeed a linear and bounded operator for
any ηRis formally stated in Theorem 3.3.7.
Definition 3.3.6 (Zero padding shift operator).For given Ω=(a, b)with
aRand bR>a, we define the family of zero padding shift operators
T0:R L(L2(Ω)) via T0(η)f= 0 if |η| baholds or, otherwise, via
T0(η)f=g, where, for given ηRand fL2(Ω),gis the unique element in
L2(Ω) satisfying
a+max(0)
Za|g(ξ)|dξ+
b+min(0)
Z
a+max(0)|g(ξ)f(ξη)|dξ+
b
Z
b+min(0)|g(ξ)|dξ= 0.
K
Theorem 3.3.7 (Boundedness of the zero padding shift operator).Let η
Rand Ω=(a, b)with aRand bR>a be given. Then, the operator
T0(η): L2(Ω) L2(Ω) introduced in Definition 3.3.6 is linear and bounded.
Proof. In the special case |η| ba, the operator T0(η)coincides by definition
with the zero mapping and is hence linear and bounded. For the complemen-
tary case, let fL2(Ω) be arbitrary and ˆ
f: Rbe a representative of
97
3. Mode Identification
0 0.5 1
3
2
1
0
ξ
φ(ξ)
0 0.5 1
3
2
1
0
ξ
(T0(0.25)φ)(ξ)
Figure 3.4.: An exemplary mode (left) and its shifted analogue (right) using a zero padding
shift operator.
the equivalence class f. From Definition 3.3.6 it follows that T0(η)fL2(Ω)
is given by the equivalence class of the function ˆg: Rdefined via
ˆg(ξ):=
ˆ
f(ξη),if ξη,
0,otherwise.
Thus, ˆgdepends linearly on ˆ
fand this property also translates to the corre-
sponding equivalence classes, which shows that T0(η)is linear. Moreover, the
boundedness of T0(η)follows from
kT0(η)fk2
L2(Ω) =
b+min(0)
Z
a+max(0)
f(ξη)2dξ=
bη+min(0)
Z
aη+max(0)
f(ξ)2dξ kfk2
L2(Ω) .
In general, the zero padding shift operator is especially suitable for examples
with homogeneous Dirichlet boundary conditions. For instance, the solution
in Example 3.3.1 can be described by one mode shifted via the time-dependent
operator T0(ct)with tI. Furthermore, zero padding shift operators are also
employed in section 6.3 in the context of a reaction–diffusion test case with a
homogeneous Dirichlet condition at the left boundary.
3.3.3. Constant Extrapolation
Another natural way of dealing with the problem depicted in Figure 3.2 is
to extrapolate the modes outside of the computational domain. While there
are various ways of performing extrapolation, we only consider the family of
constant extrapolation shift operators Tcin Definition 3.3.8, see also Figure 3.5
for a graphical illustration. In particular, we emphasize that, in contrast to
the shift operators considered in the previous subsections, Tcis pointwise an
operator defined on H1(Ω), which ensures that the boundary values of the
98
3.3. Boundary Treatment
0 0.5 1
3
2
1
0
ξ
φ(ξ)
0 0.5 1
3
2
1
0
ξ
(Tc(0.25)φ)(ξ)
Figure 3.5.: An exemplary mode (left) and its shifted analogue (right) using a constant
extrapolation shift operator.
argument are well-defined, cf. Theorem 2.3.4. The fact that Tcis pointwise
linear and bounded is formally stated in Theorem 3.3.9. In general, constant
extrapolation shift operators appear to be especially useful in the context of
systems with constant Dirichlet boundary values. In particular, this class of
operators has been applied in [241] and [259] for deriving low-dimensional
approximations for systems with traveling shocks.
Definition 3.3.8 (Constant extrapolation shift operator).For given = (a, b)
with aRand bR>a, we define the family of constant extrapolation shift
operators Tc:R L(H1(Ω)) via
(Tc(η)f) (ξ):=
f(ξη),if ξη,
f(a),if ξηa,
f(b),otherwise.
K
Theorem 3.3.9 (Boundedness of the constant extrapolation shift operator).
Let ηRbe given and let and Tc(η)be as in Definition 3.3.8. Then, Tc(η)
is a linear bounded operator.
Proof. The linearity of Tc(η)follows directly from Definition 3.3.8 and, thus,
it remains to prove the boundedness. To this end, we consider an arbitrary
fH1(Ω) and observe that for η[0, b a)we have
kTc(η)fk2
H1(Ω) =
b
Za(Tc(η)f)(ξ)2+D(Tc(η)f)(ξ)2dξ
=ηf(a)2+
b
Z
a+ηf(ξη)2+Df(ξη)2dξ
=ηf(a)2+
bη
Zaf(ξ)2+Df(ξ)2dξ
99
3. Mode Identification
η(f(a)2+f(b)2) + kfk2
H1((a,bη)) ηkfk2
L2(Ω) +kfk2
H1(Ω) ,
where Ddenotes the weak derivative, cf. section 2.3. By the trace theorem,
cf. [94, p. 272], there exists a constant cRwith kfkL2(Ω) ckfkH1(Ω).
Thus, Tc(η)is bounded for all η[0, b a). Similarly, for ηbawe obtain
the bound kTc(η)fkH1(Ω) bakfkL2(Ω) and, hence, by the trace theorem
we infer that Tc(η)is also bounded for ηba. The proof for negative η
values follows the same lines and is therefore omitted here.
Remark 3.3.10 (Non-unitariness of shift operators).We note that the shift
operators Te(η),T0(η), and Tc(η)introduced in this section are in general not
unitary for all ηR, whereas Tper as introduced in Definition 1.2.2 is pointwise
unitary, cf. Theorem A.1(iv). Thus, even if only one reference frame with
orthonormal modes is considered as in (3.4), it is in general not clear if this
may be reduced to a POD optimization problem with transformed trajectory
as in (3.5) when using Te,T0, or Tcas transformation family. ¨
3.4. Comparison with Other Approaches
In the past years, several methods have been proposed for computing a de-
composition of given snapshot data based on an approximation ansatz of the
form (1.4) or (1.8), i.e., using a linear combination of transformed modes. In
the following, we summarize the most relevant methods in this context and
compare the main features with the framework presented in section 3.1.
The shifted proper orthogonal decomposition has been originally introduced
in [241]. It is based on a heuristic optimization method which has been suc-
cessfully applied to simple test cases like the linear advection equation or the
linear wave equation. However, also the application to more involved exam-
ples like the crossing of two shock waves governed by the Euler equations and
the emergence and transport of two vortex pairs governed by the incompress-
ible Navier–Stokes equations is demonstrated in [241]. The considered test
cases are based on a one-dimensional spatial domain except for the vortex
pair, which is based on a two-dimensional domain but transported along a
one-dimensional subspace. Furthermore, the example of crossing shock waves
involves non-periodic boundary conditions, which are treated by employing
a shift operator with constant extrapolation, cf. section 3.3.3. The iterative
shifted POD algorithm presented in [241] is based on transforming the snap-
shots into the co-moving reference frames and computing a truncated SVD in
each frame. Moreover, the approximation is updated in each iteration by en-
riching the approximation ansatz via low-rank decompositions of the residual
and performing a least squares optimization. The numerical results presented
in [241] are promising, but it remains unclear whether the algorithm always
converges and, even if it does, whether the obtained approximation is optimal
in any sense. Furthermore, there is no theory so far which states that the
100
3.4. Comparison with Other Approaches
norm of the residual monotonously decreases during the iterative procedure.
In contrast to this, the method presented in section 3.1 is based on minimizing
the residual and, thus, when using a descent method for the optimization, the
norm of the residual cannot increase during the iteration.
In [242] the authors introduce the transport reversal method, which simi-
larly as the shifted POD method aims for an approximation of given snap-
shot data by means of a linear combination of shifted or translated modes.
While the shift operator used in [242] corresponds to a discretized version of
the periodic shift operator defined in Definition 1.2.2, they combine it with
a so-called cut-off operator. The latter sets some entries of the modes to
zero to avoid spurious phenomena when applying periodic shifts for instance
to problems with absorbing or outflow boundary conditions, see also Exam-
ple 3.3.1. Furthermore, they present an algorithm which allows to construct an
approximation using different amplitudes, translations, and cut-offs for differ-
ent modes, which makes this framework also suitable for transport-dominated
systems with multiple wave profiles with different propagation speeds and di-
rections. In contrast to the shifted POD algorithm from [241], the algorithm
introduced in [242] is based on a greedy-type iteration scheme, i.e., modes
are added gradually to the approximation, but once a mode has been added
it is fixed and does not change as the iteration proceeds by adding further
modes. Besides, the algorithm involves an optimization of the shift amounts
but not of the modes, which are chosen as columns of intermediate residual
matrices instead. An application of the algorithm in [242] to a linear wave
equation problem illustrates that the greedy-type algorithm does not yield an
approximation by two shifted modes, as the analytical solution would suggest.
Instead, an approximation is obtained which is based on one mode and fifteen
different combinations of amplitudes, cut-offs, and translations applied to this
one mode. This example illustrates that the greedy-type algorithm introduced
in [242] in general does not converge to the solution expected from physical or
analytical considerations. On the contrary, in section 6.1 we demonstrate that
the residual-based optimization approach from section 3.1.2 is able to identify
the two wave profiles of a linear wave equation test case. However, for this we
assume the wave speeds to be given, since the method from section 3.1.2 does
not optimize over the paths in contrast to the approach presented in [242].
Another optimization-based method for constructing a decomposition of the
form (1.8) with X=V=Rnis proposed in [240]. However, instead of
explicitly determining such an approximate decomposition, the approach in
[240] is based on first determining a decomposition of the form
x(tj) =
nt
X
i=1 T(pi(tj)) θi,j (3.33)
with θi,j Rnfor i= 1, . . . , ntand j= 1, . . . , q. Here, the idea is to choose
the θi,j such that (3.33) is satisfied and such that each of the reference frame
101
3. Mode Identification
snapshot matrices
Θi:=hθi,1··· θi,qi
for i= 1, . . . , nthas rapidly decaying singular values. Once this is achieved,
an approximation of the form (1.8) may be obtained by replacing the θi,j
in (3.33) by corresponding approximations obtained via truncated SVDs of
the reference frame snapshot matrices Θifor i= 1, . . . , nt. To this end, the
author introduces several constrained optimization problems with different cost
functions and with the constraint (3.33). In total, three cost functions are
introduced, namely J1, J2,¯
J2: (Rn,q)ntRdefined via
J11,...,Θnt):=
nt
X
i=1
min(n,q)
X
j=1
σji),
J21,...,Θnt):=
nt
X
i=1 1Pri
j=1 σji)2
kΘik2
F!,
¯
J21,...,Θnt):=
nt
X
i=1
kΘik2
F
ri
X
j=1
σji)2
.
Here, σi(·)denotes the ith largest singular value and r1, . . . , rntcorrespond to
the prescribed numbers of modes for each reference frame. The cost functions
J2and ¯
J2have in common that they attain the minimum value zero if and
only if the ranks of the reference frame snapshot matrices Θiare equal to
ri, respectively for i= 1, . . . , nt. Furthermore, it is shown in [240] that ¯
J2
is an upper bound of the Frobenius norm of the residual, provided that T
is pointwise orthogonal. On the other hand, J1represents the sum of the
Schatten 1-norms of the reference frame snapshot matrices and is motivated
by a corresponding heuristic rank minimization property, cf. [100]. Moreover,
it is shown in [240] that J1may be written as the Schatten 1-norm of the block
diagonal matrix whose diagonal blocks are given by Θ1,...,Θntand, thus, J1
is convex, see [100]. However, the evaluation of J1is more expensive than the
one of J2or ¯
J2, since it requires to compute all singular values of the reference
frame snapshot matrices.
To solve the constrained optimization problems associated with the different
cost functions and the constraint (3.33), formulas for the gradients of J1,J2,
and ¯
J2are derived in [240]. Furthermore, to also ensure that the constraint
(3.33) is satisfied after each iteration of a gradient-based optimization method
like steepest descent, the author proposes to replace the gradient by a suitably
modified one. For instance, for J2and analogously for the other cost functions,
the partial gradients θi,j J2are replaced by
f
θi,j J2:=θi,j J21
ntT(pi(tj)) nt
X
k=1 T(pk(tj))θk,j J2!(3.34)
102
3.4. Comparison with Other Approaches
for i= 1, . . . , ntand j= 1, . . . , q. Then, straightforward calculations yield
nt
X
i=1 T(pi(tj)) f
θi,j J2= 0 for j= 1, . . . , q,
assuming that T(pi(tj))T(pi(tj)) = Inholds for i= 1, . . . , nt,j= 1, . . . , q.
Consequently, if a scalar multiple of f
J2is used as descent direction, then
the constraint (3.33) is satisfied after each iteration, provided that the start-
ing value satisfies the constraint, cf. [240]. However, the question whether the
negative modified gradient f
J2is still a descent direction despite the addi-
tional term on the right-hand side of (3.34) is not explicitly addressed in [240].
Especially, we note that if T(pi(tj))T(pi(tj)) = Inholds for i= 1, . . . , nt,
j= 1, . . . , q, then the modification in (3.34) may only be reasonable for nt2,
since otherwise f
θ1,j J2is zero for j= 1, . . . , q.
To be able to treat also problems with non-periodic boundary conditions,
a boundary treatment is presented in [240], which is based on the idea of
extending the computational domain, similarly as in section 3.3.1. In con-
trast to the residual minimization approach from section 3.1, we note that the
optimization problems proposed in [240] are based on first optimizing the ref-
erence frame snapshot matrices Θiand afterwards computing corresponding
low-rank factorizations. Consequently, the number of optimization parame-
ters in [240] amounts to ntnq, whereas the optimization problems considered
in sections 3.1.1 and 3.1.2 involve r(q+n)and rn optimization parameters,
respectively. Here we used dφ=nsince the setting considered in [240] is
based on square shift matrices. Moreover, since the number of snapshots q
and the FOM dimension nare typically much larger than the number of ref-
erence frames ntand the number of modes r, we infer that the number of
optimization parameters in [240] is usually much larger than for the optimiza-
tion problems considered in section 3.1. Furthermore, the modification of the
gradient in (3.34) is based on the assumption that T(pi(tj))T(pi(tj)) = In
holds for i= 1, . . . , nt,j= 1, . . . , q, whereas this assumption is not needed
for the framework presented in section 3.1. In addition, the approach in sec-
tion 3.1 is directly based on minimizing the error in the approximation (3.1),
whereas it is in general not clear whether low-rank decompositions obtained
by the approach from [240] also correspond to minimum points of the norm
of the residual. Nevertheless, the numerical results presented in [240] reveal a
smaller approximation error when using the same number of modes and the
same numerical example as in [259], where a residual-minimization-based ap-
proach as in section 3.1 is used. This motivates for a more thorough theoretical
comparison as well as for a more extensive numerical comparison of the ap-
proach introduced in [240] with the residual minimization technique presented
in section 3.1, which is left for future research.
In [40] the authors consider a similar minimization problem as in (3.2),
where the major difference is that in [40] also the paths are optimized. Beside
103
3. Mode Identification
an analysis of the solvability of the minimization problem, also a discretiza-
tion procedure is presented and tested by means of numerical test cases with
one-dimensional spatial domains. In all of these examples, there are either no
boundary effects considered or periodic boundary conditions are assumed. In
contrast to the approach presented in section 3.1, in [40] the cost functional
and its partial derivatives are determined on the infinite-dimensional level and
then discretized by means of a quadrature rule in time and a finite element
approach in space. This first-differentiate-then-discretize approach has the ad-
vantage that it allows to potentially exploit properties like isometry of the
transformation operators, which may no longer hold after discretization. Fur-
thermore, it provides in principle the flexibility to adapt the discretization
resolution during an iterative optimization procedure. On the other hand, it
is in general not clear if the discretized partial derivatives coincide with the
partial derivatives of the discretized cost functional and, thus, it is not clear
if the negative discretized gradient corresponds to a descent direction for the
discretized cost functional. However, the authors of [40] have not observed any
problems related to non-descent directions in their numerical experiments.
While the simultaneous optimization of the modes, amplitudes, and paths
constitutes an improvement and generalization of the optimization procedure
presented in section 3.1, the path optimization still provides some challenges.
On the one hand, the computational effort is significantly higher than in the
case where the paths are determined heuristically in a pre-processing step. On
the other hand, in [40] it is mentioned that the optimization with respect to
the paths appears to be more sensitive to the starting value than in the case
where only the modes and amplitudes are optimized. However, the latter issue
may be alleviated by choosing starting values via a heuristic pre-determination
of the paths based on the snapshot data as discussed in Remark 3.1.2 of this
thesis.
104
4. Projection-Based Model Order
Reduction
While the focus of the previous chapter has been on the determination of
suitable modes based on given snapshot data of the FOM (1.1), in this chapter
we assume the modes to be given and consider the problem of constructing
reduced-order models based on the approximation ansatzes (1.4) and (1.8).
More precisely, we present a nonlinear projection framework for constructing
ROMs which may be used for determining the amplitudes and paths in the
online phase. To this end, we discuss in section 4.1 how the ROM construction
can be done in an optimal way in the sense that the residual is minimized.
In a special case, this framework shares similarities with classical symmetry
reduction techniques and a detailed comparison is outlined in section 4.2.
The ROM obtained by the approach presented in section 4.1 usually still
scales with the dimension of the FOM, even though the number of equations
and unknowns is small. To overcome this issue, in section 4.3 we introduce a
hyperreduction framework, i.e., an additional approximation of the reduced-
order model. This ensures that the ROM can be evaluated in an efficient
way.
4.1. Continuously Optimal Reduced-Order Models
This section is devoted to the construction of reduced-order models for the
FOM (1.1) based on the approximation ansatzes (1.4) and (1.8) and assuming
that the modes are given. Especially, we derive optimal reduced-order models
in the sense that the residual is minimized, following ideas from [179] and [203].
In most of this section, we consider the approximation ansatz (1.4) and only
mention at the end how the ROM changes when instead the clustered ansatz
(1.8) is used.
When using an approximation ansatz of the form (1.4), we propose the ROM
"M11(p(t)) M12(α(t), p(t))
M12(α(t), p(t))>M22(α(t), p(t))#"˙α(t)
˙p(t)#="˜
F1(t, α(t), p(t))
˜
F2(t, α(t), p(t))#for all tI
(4.1)
with ROM state variables α, p:IRrand coefficient functions M11 :Rr
105
4. Projection-Based Model Order Reduction
Rr,r,M12, M22 :Rr×RrRr,r,˜
F1,˜
F2:R0×Rr×RrRr, defined via
M11(p):=Mα(p):=hhT(pi)φi,T(pj)φjiXiij Rr,r,
M12(α, p):=N(p)D(α),
D(α):= diag(α1, . . . , αr)Rr,r,
N(p):=hhT(pi)φi,T0(pj)φjiXiij Rr,r,
M22(α, p):=D(α)>Mp(p)D(α),
Mp(p):=hhT0(pi)φi,T0(pj)φjiXiij Rr,r,
˜
F1(t, α, p):=˜
Fα(t, α, p):=
*T(pi)φi, F
t,
r
X
j=1 T(pj)φjαj
+X
iRr,
˜
F2(t, α, p):=D(α)>˜
Fp(t, α, p)
˜
Fp(t, α, p):=
*T0(pi)φi, F
t,
r
X
j=1 T(pj)φjαj
+X
iRr.
(4.2)
Here, by abuse of notation, we use for given pRand φVthe notation
T0(p)φfor the derivative of the map T(·)φ:RXat p, while emphasizing
that the map T:R L(V,X)itself does not need to be differentiable for
T0(p)φto be well-defined, see for instance [22].
In the following, we show that solutions of the ROM (4.1)–(4.2) are optimal
in the sense of residual minimization. To this end, we first introduce the
mapping R:R0×Rr×Rr×Rr×RrXvia
R(t, η1, η2, η3, η4)
:=
r
X
i=1
[η1]iT([η4]i)φi+
r
X
i=1
[η3]iT0([η4]i)φi[η2]iF t,
r
X
i=1
[η3]iT([η4]i)φi!(4.3)
and observe that R(t, ˙α(t),˙p(t), α(t), p(t)) coincides with the residual obtained
by substituting the ansatz (1.4) into the FOM (1.1) for all tI. Furthermore,
we note that the ROM (4.1)–(4.2) may be formally obtained by enforcing the
residual to be orthogonal to the span of
T(p1(t))φ1, . . . , T(pr(t))φr,T0(p1(t))φ1α1(t), . . . , T0(pr(t))φrαr(t)(4.4)
for all tI. In a finite-dimensional setting and using a general nonlinear
approximation ansatz, it is noted in [179, Rem. 3.3] that this orthogonality
property allows to interpret the ROM as a result of projection. More precisely,
it is obtained by an orthogonal projection of the FOM right-hand side onto the
tangent space of the nonlinear manifold corresponding to the approximation
ansatz, see [179] for more details. Moreover, Theorem 4.1.1 and its proof
show that this orthogonality property corresponds to optimality in the sense
106
4.1. Continuously Optimal Reduced-Order Models
of residual minimization. Especially, ROMs satisfying a minimality property
of the form (4.5) are usually referred to as continuously optimal, since they are
optimal with respect to the residual of the time-continuous system, cf. [61].
This is in contrast to methods which aim for minimizing the residual after time
discretization, see [61] and Remark 4.1.6.
Theorem 4.1.1 (Continuous optimality).Let Xbe a real Hilbert space, Wa
subspace of X,Fa mapping from R0×Wto X, and Va real Banach space.
Furthermore, let T:R L(V,X)and φ1, . . . , φrY, r Nwith YV
as defined in (1.5) be given such that the mappings gi:RXdefined via
gi(η):=T(η)φiare continuously differentiable for i= 1, . . . , r. Then, any
solution (α, p): IRr×Rrof the ROM (4.1) with M11, M12, M22,˜
F1,˜
F2as
specified in (4.2) and time interval I= [0, tend],tend R>0satisfies
( ˙α(t),˙p(t)) arg min
(η12)Rr×Rr
1
2kR(t, η1, η2, α(t), p(t))k2
X(4.5)
for all tI, where Ris as defined in (4.3).
Proof. Let tIbe arbitrary and let (α, p): IRr×Rrbe a solution of the
ROM (4.1)–(4.2). We introduce the notation Jres :Rr×RrRwith
Jres(η1, η2):=1
2kR(t, η1, η2, α(t), p(t))k2
X
for the cost function in (4.5). Its partial derivatives are given by
Jres
[η1]`
(η1, η2) =
r
X
i=1
[η1]ihT(p`(t))φ`,T(pi(t))φiiX
+
r
X
i=1
αi(t)hT(p`(t))φ`,T0(pi(t))φi[η2]iiX
*T(p`(t))φ`, F t,
r
X
i=1
αi(t)T(pi(t))φi!+X
,
Jres
[η2]`
(η1, η2) =
r
X
i=1
[η1]iα`(t)hT0(p`(t))φ`,T(pi(t))φiiX
+
r
X
i=1
α`(t)αi(t)hT0(p`(t))φ`,T0(pi(t))φi[η2]iiX
α`(t)*T0(p`(t))φ`, F t,
r
X
i=1
αi(t)T(pi(t))φi!+X
for `= 1, . . . , r. Thus, using the notation from (4.2), the first-order necessary
optimality condition, cf. Theorem 2.2.2, may be written as
M11(p(t))η1+M12(α(t), p(t))η2=˜
F1(t, α(t), p(t)),
M12(α(t), p(t))>η1+M22(α(t), p(t))η2=˜
F2(t, α(t), p(t)).(4.6)
107
4. Projection-Based Model Order Reduction
Furthermore, the Hessian of Jres is given by
M(α(t), p(t)) :="M11(p(t)) M12(α(t), p(t))
M12(α(t), p(t))>M22(α(t), p(t))#,
which is the Gram matrix of (4.4) and, hence, positive semi-definite. Es-
pecially, it is singular if and only if the vectors in (4.4) are linearly depen-
dent. The positive semi-definiteness of M(α(t), p(t)) allows to apply Theo-
rem 2.2.6 and, thus, we infer that Jres is convex in the sense of Definition 2.2.4.
Consequently, the first-order necessary optimality condition is also sufficient,
cf. Theorem 2.2.5. Finally, since (α, p)is a solution of (4.1), we conclude that
(η1, η2) = ( ˙α(t),˙p(t)) solves (4.6) and is thus a minimizer of Jres.
Example 4.1.2 (Assumptions of Theorem 4.1.1 when using the periodic shift
operator).To present an example, where the assumptions of Theorem 4.1.1
are satisfied, we consider the linear advection equation with periodic boundary
conditions as in Example 1.2.1. This initial-boundary value problem may be
written as abstract ODE of the form (1.1), for instance, by setting X=L2(Ω),
W=C1
per(Ω), and F:WXwith F(x):=cx0. As mode space we choose
V=X=L2(Ω) and we set T=Tper, where Tper denotes the family of
periodic shift operators introduced in Definition 1.2.2. Furthermore, since W
is an invariant subspace of Tper(η)for any ηR, cf. Theorem A.2, we may
choose the modes φ1, . . . , φras elements of WY. Besides, Theorem A.3
yields that the mappings gi:RL2(Ω) defined via gi(η):=Tper(η)φiare
continuously differentiable with derivatives g0
i(η) = −Tper(η)φ0
ifor i= 1, . . . , r.
Consequently, all requirements of Theorem 4.1.1 are satisfied for this example.
l
Remark 4.1.3 (Case of a finite-dimensional FOM).In the case X=W=Rn,
the orthogonality property mentioned after (4.3) is achieved by a multiplication
of matrix-valued functions. More precisely, introducing the matrix functions
V1, V2:RrRn,r via
V1(p):=hT(p1)φ1··· T (pr)φri, V2(p):=hT0(p1)φ1··· T 0(pr)φri,
the approximation ansatz in (1.4) becomes x(t)V1(p(t))α(t)for all tI
and the definitions of Mα,N,Mp,˜
Fα, and ˜
Fpfrom (4.2) simplify to
Mα(p) = V1(p)>V1(p), N(p) = V1(p)>V2(p), Mp(p) = V2(p)>V2(p),
˜
Fα(t, α, p) = V1(p)>F(t, V1(p)α),˜
Fp(t, α, p) = V2(p)>F(t, V1(p)α),
see also [39, sec. 3.3]. ¨
It should be emphasized that the continuous optimality in Theorem 4.1.1
does not necessarily imply that the approximation error is minimized, since
the optimality is only formulated with respect to the residual. In the context
of linear equation systems, one can show that the error is small provided that
108
4.1. Continuously Optimal Reduced-Order Models
the problem is well-conditioned and the residual is small, see for instance [285,
ch. 3]. Similarly, residual-based error bounds as the one below in Theorem 4.1.9
motivate the usefulness of a residual-based optimization property as this may
result at least in a small error bound. In addition, we emphasize that the
optimality property from Theorem 4.1.1 is only local in nature, i.e., for a given
current value of the state (α(t), p(t)), the time derivative ( ˙α(t),˙p(t)) is chosen
such that the residual is minimized. This local optimality does not necessarily
lead to a minimal residual over the whole time interval, see for instance the
upcoming Example 5.1.3. On the other hand, the alternative of constructing
ROMs by minimizing the time integral of the residual has other drawbacks as
addressed in the following.
Remark 4.1.4 (Cumulative residual minimization).As an alternative to mini-
mizing the instantaneous value of the residual as in Theorem 4.1.1, we could
also construct a reduced-order model, which satisfies
(α, p)arg min
(η12)C1(I,Rr)×C1(I,Rr)
tend
Z0kR(t, ˙η1(t),˙η2(t), η1(t), η2(t))k2
Xdt. (4.7)
Such cumulative residual minimization schemes have been investigated in the
literature using different ansatzes for the approximation of the FOM state. For
instance, in [13, app. A] the authors apply such a scheme to a linear PDE using
a linear approximation ansatz. The resulting ROM is a system of second-order
ODEs which is unstable, even though the FOM and the ROMs obtained via
classical Galerkin projection are asymptotically stable. Furthermore, it was
demonstrated in [49, app. C] that the minimization of the cumulative residual
subject to a constraint on the initial value leads to a boundary value problem,
which is in general computationally more challenging than solving initial value
problems. For these reasons, we follow in this thesis the instantaneous residual
minimization approach as detailed in Theorem 4.1.1. ¨
Remark 4.1.5 (Relation to [179]).We note that in the finite-dimensional case
with X=W=Rn, the continuously optimal ROM (4.1)–(4.2) corresponds
to a special case of the ROM presented in [179]. Therein, the authors discuss a
general nonlinear model reduction ansatz xf(α)with f:RrRn, whereas
we focus in this thesis on the special nonlinear approximation ansatz (1.4). In
contrast to a general nonlinear map, which may for instance arise from a neural
network, the ansatz (1.4) allows for a physical interpretation as linear combi-
nation of transformed modes. Moreover, in section 5.2 we demonstrate that
the special nonlinear ansatz (1.4) allows for structure-preserving model order
reduction for port-Hamiltonian systems, while still ensuring residual minimiza-
tion for a special class of port-Hamiltonian FOMs. ¨
Remark 4.1.6 (Relation to [54]).In [54] the authors also propose to determine
the time evolution of the amplitudes and of the paths by minimizing the resid-
ual. However, they consider the residual on the time-discrete level, i.e., after
applying a specific time integration scheme to the original model. Furthermore,
109
4. Projection-Based Model Order Reduction
they focus on the case where all modes are transformed by the same trans-
formation operator, which corresponds to a special case of the approximation
ansatz (1.8) with nt= 1. In general, to minimize the time-continuous residual
rather than the time-discrete one has the disadvantage that the optimality
may be lost after discretizing in time. On the other hand, the minimization of
the time-discrete residual has the drawback of losing flexibility with respect to
the time discretization scheme used for the ROM. Especially, in chapter 6 we
exploit this flexibility and partly use a different time discretization scheme for
the ROM than for the FOM in order to ensure a dissipation inequality on the
time-discrete level. A more detailed analytical and numerical comparison of
our approach with the one from [54] is certainly an interesting future research
direction, but not within the scope of this thesis. An extensive comparison of
the minimization of the time-continuous residual and that of the time-discrete
residual in the context of classical linear subspace methods can be found in
[61]. ¨
Remark 4.1.7 (Parametric reduced-order models).The construction of the
ROM (4.1)–(4.2) works analogously in the case, where the FOM depends ad-
ditionally on a parameter vector as in (1.6). In that case, the functions α,p,
˜
F1,˜
F2, and Rdepend additionally on the parameter vector. ¨
As mentioned in the proof of Theorem 4.1.1, the leading matrix on the
left-hand side of the ROM (4.1) is the Gram matrix of the vectors in (4.4).
Consequently, it is invertible if and only if these vectors are linearly indepen-
dent. If that is not the case, the equations of the ROM are linearly dependent
and, thus, we cannot expect a unique solution in this case. On the other hand,
for the case that linear independence is given at t= 0, the following theorem
states sufficient conditions for local existence and uniqueness of solutions.
Theorem 4.1.8 (Existence and uniqueness of solutions of the initial value
problem associated with (4.1)).Let the assumptions of Theorem 4.1.1 be sat-
isfied and let (α0, p0)Rr×Rrbe given. Furthermore, let the following
assumptions be satisfied:
(i) The function Ffrom Theorem 4.1.1 is continuous and there exists a
neighborhood of (0,Pr
i=1 T([p0]i)φi[α0]i)in R0×Wwhere xFexists
and is continuous.
(ii) For i= 1, . . . , r, the mappings gidefined as in Theorem 4.1.1 are twice
continuously differentiable in a neighborhood of p0.
(iii) Every component of α0is non-zero.
(iv) The matrix "Mα(p0)N(p0)
N(p0)>Mp(p0)#
with Mα,N, and Mpas defined in (4.2) is invertible.
110
4.1. Continuously Optimal Reduced-Order Models
Then, the initial value problem consisting of (4.1) with I= [0, tend],tend
R>0and (α(0), p(0)) = (α0, p0)has exactly one solution, provided that tend is
sufficiently small.
Proof. First, we note that the assumptions of Theorem 4.1.1 imply that the
function ˜
E:Rr×RrR2r,2rdefined via
˜
E(α, p):="M11(p)M12(α, p)
M12(α, p)>M22(α, p)#
with M11,M12, and M22 as defined in (4.2) is continuous and that α˜
Eis con-
tinuous. Furthermore, due to assumption (ii), we have that p˜
Eis continuous
in a neighborhood of (α0, p0). Thus, in total we conclude that ˜
Eis contin-
uously differentiable in a neighborhood of (α0, p0). In addition, assumptions
(iii) and (iv) imply that ˜
E(α0, p0)is invertible. Finally, assumptions (i) and
(ii) yield that the function ˜
F:R0×Rr×RrR2rdefined via
˜
F(t, α, p):="˜
F1(t, α, p)
˜
F2(t, α, p)#
with ˜
F1and ˜
F2as defined in (4.2) is continuous and that α˜
Fand p˜
Fare
continuous in a neighborhood of (0, α0, p0)in R0×Rr×Rr. The claim follows
then from Theorem 2.4.5 in section 2.4.2.
In Theorem 4.1.8 we assume that the initial value (α0, p0)is given, without
addressing how to choose it in practice. When using a linear approximation
ansatz of the form (1.2), a natural choice for the ROM initial value is to
determine it via the orthogonal projection of the FOM initial value onto the
span of the modes, as this leads to an optimal approximation of the initial
value, cf. section 2.5.3. However, when using the more general ansatz (1.4)
with transformed modes, the determination of an optimal approximation of
the initial value is not trivial. For a very similar setting as the one considered
in this thesis, the derivation of a corresponding optimization problem and
the associated necessary first-order optimality conditions are presented in [37].
We omit a more detailed discussion here and instead emphasize that, once
the initial value p0for the path is fixed, an optimal initial value α0for the
amplitudes needs to solve the linear equation system
Mα(p0)α0=
hT([p0]1)φ1, x0iX
.
.
.
hT([p0]r)φr, x0iX
(4.8)
with Mαas specified in (4.2). This system is uniquely solvable if and only
if the transformed modes T([p0]1)φ1,...,T([p0]r)φrare linearly independent.
Otherwise, there are infinitely many solutions, which are all optimal. For the
111
4. Projection-Based Model Order Reduction
numerical experiments in chapter 6, we always use p0= 0 and determine α0
by solving (4.8).
As mentioned earlier, the usefulness of the residual minimization property
in Theorem 4.1.1 may be motivated by residual-based error bounds. In The-
orem 4.1.9, we present such an error bound for the special case where the
right-hand side of the FOM may be written as in (4.9) with continuously dif-
ferentiable fand Abeing the generator of a strongly continuous semigroup and
at the same time a linear and bounded operator. In general, the generator of a
strongly continuous semigroup Tis bounded if and only if its domain coincides
with the complete space X, cf. [92, Cor. II.1.5]. Furthermore, this is equiv-
alent to Tbeing a uniformly continuous semigroup, see [92, sec. I.3] for more
details. In addition, we note that the error bound (4.10) applies in general
to an approximation of the form (1.4) with twice continuously differentiable
amplitudes and paths, even if they do not solve the ROM (4.1).
Theorem 4.1.9 (Error bound).Let the assumptions of Theorem 4.1.1 be sat-
isfied with W=Xand let additionally the mappings gidefined in Theo-
rem 4.1.1 be twice continuously differentiable for i= 1, . . . , r. Furthermore,
let F:R0×XXbe given by
F(t, x):=Ax +f(t),(4.9)
where f:R0Xis continuously differentiable on I= [0, tend]with tend
R>0and A L(X)is the generator of a strongly continuous semigroup
T:R0 L(X). In addition, let x0Xbe given and let x:IXbe
the unique solution of the initial value problem (1.1) with t0= 0. Moreover,
let α, p C2(I,Rr)be given and let ε:IXbe the corresponding approxi-
mation error defined via
ε(t) = x(t)
r
X
i=1
αi(t)T(pi(t))φi.
Then, there exist constants ωRand MR1such that
kε(t)kXMeωt
kε(0)kX+
t
Z0
eωs kR(s, ˙α(s),˙p(s), α(s), p(s))kXds
(4.10)
holds for all tI, where Rdenotes the residual map defined in (4.3).
Proof. First, we note that the solution xis indeed uniquely determined due
to Theorem 2.4.3. Furthermore, since x,α, and pas well as the maps gifor
i= 1, . . . , r are continuously differentiable, we infer that εis also continuously
112
4.1. Continuously Optimal Reduced-Order Models
differentiable with derivative
˙ε(t) = ˙x(t)
r
X
i=1
( ˙αi(t)T(pi(t))φi+αi(t)T0(pi(t))φi˙pi(t))
=F(t, x(t)) R(t, ˙α(t),˙p(t), α(t), p(t)) F t,
r
X
i=1
αi(t)T(pi(t))φi!
=(t)R(t, ˙α(t),˙p(t), α(t), p(t))
|{z }
=:˜
R(t)
.
(4.11)
To investigate the regularity of the function ˜
R:IX, we first note that the
facts that Ais linear and bounded and that fis continuously differentiable
imply that Fis continuously differentiable. Since in addition α,p, and the
maps gifor i= 1, . . . , r are twice continuously differentiable, we infer from
(4.3) that the function ˜
Ris continuously differentiable. The claim follows
then from (4.11), Theorem 2.4.3, and from the bound (2.6).
Remark 4.1.10 (Error bound for a finite-dimensional FOM).For the special
case X=W=Rn, we note that any matrix ARn,n is the generator of
the corresponding strongly continuous semigroup T:R0Rn,n given by
T(t):=eAt, cf. [92, sec. I.2]. Especially, there are several ways of bounding the
matrix exponential to obtain the parameters Mand ωin (4.10), see for instance
[170, 215, 279] and the references therein. For example, if Ais diagonalizable
and URn,n is invertible with U1AU being diagonal, then keAtk2may be
bounded via
keAtk2κ(U)espabs(A)tfor all tR0,
where k·k2denotes the spectral norm, κ(U)the condition number of U, and
spabs(A)the spectral abscissa of A, cf. [279, sec. 2]. If Ais normal, we obtain
the simple expression keAtk2=espabs(A)tfor all tR0, see for instance [279,
eq. (2.12)].
In chapter 6, the FOMs are obtained via a finite element discretization and
especially in sections 6.1 and 6.2 the FOMs have the form
E˙x(t) = Ax(t) + f(t)for all tI(4.12)
with invertible mass matrix ERn,n. By following the lines of the proof of
Theorem 4.1.9, we may derive an associated error bound of the form
kε(t)k Meωt
kε(0)k+
t
Z0
eωs E1R(s, ˙α(s),˙p(s), α(s), p(s))ds
=Meωt
kε(0)k+
t
Z0
eωs kR(s, ˙α(s),˙p(s), α(s), p(s))k(EE>)1ds
(4.13)
for all tI, where Mand ωsatisfy keE1Atk2Meωt for all tI. Moreover,
113
4. Projection-Based Model Order Reduction
in situations where Eis even symmetric and positive definite, one may measure
the error alternatively in the E-norm, which leads to the error bound
kε(t)kEMeωt
kε(0)kE+
t
Z0
eωs kR(s, ˙α(s),˙p(s), α(s), p(s))kE1ds
(4.14)
for all tI, where Mand ωsatisfy
eE1AtE=E1
2eE1AtE1
22Meωt for all tI.
Finally, we remark that the two error bounds (4.13) and (4.14) motivate for
two different model reduction schemes based on minimizing the residual in a
weighted norm with weighting matrix (EE>)1or E1. When using a linear
approximation ansatz of the form (1.2) and by following similar steps as in the
proof of Theorem 4.1.1, we obtain that the former approach corresponds to first
multiplying the FOM (4.12) from the left by E1and afterwards performing
a Galerkin projection. On the other hand, the minimization of the residual in
the E1-norm corresponds to a direct Galerkin projection of the FOM (4.12),
see also section 5.1 where such weighted norm minimizations are used in the
context of structure-preserving MOR for pH systems. ¨
In chapter 6 we partly work with the clustered approximation ansatz
x(·)ˆx(·):=
nt
X
i=1 T(pi(·))
ri
X
j=1
αi,j(·)φi,j,(1.8)
cf. Remark 1.1.2. The construction of the ROM follows the same lines as
in the beginning of this section and the resulting ROM also has the same
structure as in (4.1). However, the dimension of the ROM state and the
coefficient functions have to be adjusted accordingly. More precisely, the state
variables α:IRrand p:IRntwith r=Pnt
i=1 rias well as the coefficient
functions M11 :RntRr,r,M12 :Rr×RntRr,nt,M22 :Rr×RntRnt,nt,
˜
F1:R0×Rr×RntRr,˜
F2:R0×Rr×RntRnthave the block structure
α(t):=hα1(t)>··· αnt(t)>i>,
αi(t):= [αi,k(t)]kRri,
(4.15a)
D(α):= diag(α1, . . . , αnt)Rr,nt,(4.15b)
M11(p):=Mα(p):=
M1,1
α(p)··· M1,nt
α(p)
.
.
.....
.
.
Mnt,1
α(p)··· Mnt,nt
α(p)
Rr,r,
Mi,j
α(p):=hhT(pi)φi,k,T(pj)φj,`iXik` Rri,rj,
(4.15c)
114
4.1. Continuously Optimal Reduced-Order Models
M12(α, p):=N(p)D(α),
N(p):=
N1,1(p)··· N1,nt(p)
.
.
.....
.
.
Nnt,1(p)··· Nnt,nt(p)
Rr,r,
Ni,j(p):=hhT(pi)φi,k,T0(pj)φj,`iXik` Rri,rj,
(4.15d)
M22(α, p):=D(α)>Mp(p)D(α),
Mp(p):=
M1,1
p(p)··· M1,nt
p(p)
.
.
.....
.
.
Mnt,1
p(p)··· Mnt,nt
p(p)
Rr,r,
Mi,j
p(p):=hhT0(pi)φi,k,T0(pj)φj,`iXik` Rri,rj,
(4.15e)
˜
F1(t, α, p):=˜
Fα(t, α, p):=
˜
F1
α(t, α, p)
.
.
.
˜
Fnt
α(t, α, p)
Rr,
˜
Fi
α(t, α, p):=
*T(pi)φi,k, F t,
nt
X
s=1
rs
X
`=1 T(ps)φs,`αs,`!+X
kRri
(4.15f)
˜
F2(t, α, p):=D(α)>˜
Fp(t, α, p),
˜
Fp(t, α, p):=
˜
F1
p(t, α, p)
.
.
.
˜
Fnt
p(t, α, p)
Rr,
˜
Fi
p(t, α, p):=
*T0(pi)φi,k, F t,
nt
X
s=1
rs
X
`=1 T(ps)φs,`αs,`!+X
kRri
(4.15g)
for i, j = 1, . . . , nt. We note that in contrast to (4.2), the matrix D(α)as
defined in (4.15b) is in general not a diagonal matrix with αon its diagonal,
but instead a block diagonal matrix, where the ith diagonal block consists
of the column vector αiRrifor i= 1, . . . , nt. Thus, D(α)as defined in
(4.15b) has full column rank if and only if each of the vectors α1,. . .,αnt
has at least one non-zero component. This allows to weaken condition (iii) in
Theorem 4.1.8 accordingly, if the ROM coefficients are constructed as in (4.15)
based on a clustered approximation ansatz of the form (1.8). This has also
been noted in [37, Rem. 5.8].
Regardless of whether we use the approximation ansatz (1.4) or the clustered
one (1.8), it may in general happen that the mass matrix in the ROM (4.1) is
singular. The issue of a singular mass matrix and a resulting non-uniqueness
of the solution occurs also within the framework of the moving finite element
method, cf. [203]. The authors propose in [203, sec. 2] to add a penalization
term to the cost function which prevents that the nodes of the finite elements
115
4. Projection-Based Model Order Reduction
get too close to each other. In particular, this penalization term leads to a
positive definite and, thus, invertible mass matrix. However, this idea cannot
be directly applied to the framework considered here, since the modes are more
general than the finite element basis functions considered in [203] and since
the transformation does not have to be a simple translation. Nevertheless,
inspired by the penalization method from [203], we propose to set up a ROM
based on the modified optimization problem
( ˙α(t),˙p(t)) arg min
(β)Rr×Rnt
1
2kR(t, β, γ, α(t), p(t))k2+σα(t)kβk2+σp(t)kγk2
(4.16)
for all tI. The cost function in (4.16) with suitable penalization param-
eters σα, σp:R0R0corresponds to a minimization of a weighted sum
of the norms of the residual, the time derivative of the amplitudes α, and
the time derivative of the paths p. By similar arguments as in the proof of
Theorem 4.1.1, the minimization problem (4.16) leads to the reduced-order
model
"M11(p(t)) + σα(t)IrM12(α(t), p(t))
M12(α(t), p(t))>M22(α(t), p(t)) + σp(t)Int#"˙α(t)
˙p(t)#="˜
F1(t, α(t), p(t))
˜
F2(t, α(t), p(t))#
(4.17)
for all tI, with M11,M12,M22,˜
F1, and ˜
F2as in (4.15). Especially, at time
instances where the penalization parameters σα(t)and σp(t)are positive, the
mass matrix on the left-hand side of (4.17) is symmetric and positive defi-
nite and, hence, invertible. Consequently, a proper choice of the penalization
parameters may help to avoid the issue of a singular mass matrix and the re-
sulting lack of uniqueness of solutions, see Example 4.1.11 for an application to
the linear wave equation. However, in general one has to be careful with this
choice since the larger the values of σαand σpare, the smaller is the influence
of the residual on the cost function in (4.16).
Example 4.1.11 (Regularization for a wave equation test case).We consider
the special case where the FOM is given by the linear wave equation with
periodic boundary conditions as considered in Example 1.2.3. For the approx-
imation we use a clustered ansatz of the form (1.8) and we choose the family of
periodic shift operators introduced in Definition 1.2.2 as transformation fam-
ily. In fact, since the FOM state consists of two components, the density and
the velocity component, the transformation is applied to each component sep-
arately. Furthermore, we set nt= 2 to account for the two traveling waves and
for each of the two traveling waves we use two modes: one for the density and
one for the velocity. For simplicity, we assume that both wave profiles have
the same shape and, thus, use modes of the form
φ1,1=φ2,1= (ψ, 0), φ1,2=φ2,2= (0, ψ),(4.18)
where ψis assumed to be in C1
per(Ω). Even though the analytical solution (1.15)
116
4.1. Continuously Optimal Reduced-Order Models
suggests that for fixed cand ρref one mode per traveling wave is sufficient,
the use of four modes like in (4.18) also allows to capture variations in the
parameters cand ρref without having to add further modes. By setting X=
V= (L2(Ω))2,W= (C1
per(Ω))2, and F:WXdefined as
F(ζ1, ζ2):= ρrefζ0
2,c2
ρref
ζ0
1!,
we construct a ROM of the form (4.1) with coefficients defined as in (4.15).
Inspired by the analytical solution, we set the initial values as p1(0) = p2(0) =
0,α1,1(0) = α2,1(0) = ρref , and α1,2(0) = α2,2(0) = c. Then, exploiting
the property Tper(0) = IdL2(Ω), the special structure (4.18) of the modes, the
periodic boundary conditions of ψ, and Theorem A.3, we obtain
Mα(p(0)) = kψk2
L2(Ω) "I2I2
I2I2#, N(p(0)) = 0,˜
Fα(α(0), p(0)) = 0,
Mp(p(0)) = kψ0k2
L2(Ω) "I2I2
I2I2#,˜
Fp(α(0), p(0)) = 2c2kψ0k2
L2(Ω) [0 1 0 1]>.
Since Mα(0) is singular, also the ROM mass matrix at t= 0 given by
"M11(0) M12(α(0),0)
M12(α(0),0)>M22(α(0),0)#="Mα(0) 0
0D(α(0))>Mp(0)D(α(0))#
is singular. Consequently, the ROM at t= 0, which reads
"Mα(0) 0
0D(α(0))>Mp(0)D(α(0))#"˙α(0)
˙p(0)#="0
D(α(0))>˜
Fp(α(0),0)#,(4.19)
does not uniquely determine ˙α(0). In fact, using the definition of Din (4.15),
straightforward calculations yield that at least ˙p(0) is uniquely determined
and given by ˙p1(0) = cand ˙p2(0) = c, which is in accordance with the fact
that the advection speeds of the traveling waves are cand c, respectively.
Furthermore, the analytical solution of the wave equation suggests that the
amplitudes of the wave profiles do not change over time and, hence, we would
expect ˙α= 0, at least if ψcoincides with the wave profiles. Thus, it appears to
be natural in this example to consider a regularized minimization problem of
the form (4.16) with σα(0) R>0and σp(0) = 0 for constructing the reduced-
order model. Such a regularized approach allows to replace (4.19) by
"Mα(0) + σα(0)I40
0M22(α(0),0)#"˙α(0)
˙p(0)#="0
˜
F2(α(0),0)#,
which uniquely determines ˙α(0) as ˙α(0) = 0 and is, thus, consistent with the
qualitative behavior of the analytical solution of the wave equation. l
117
4. Projection-Based Model Order Reduction
4.2. Relation with Symmetry Reduction
The method considered in section 4.1 shares similarities with the symmetry
reduction framework as presented for instance in [36, 219, 245, 246]. This
section is devoted to a comparison of the two approaches, see also [37, sec. 6],
where this comparison has been presented first. For this purpose, we consider
a FOM of the form (1.1) with a right-hand side Fwhich does not explicitly
depend on time. In addition, we use the approximation ansatz (1.8) with nt=
1, i.e., there is in total one transformation applied to all modes. Furthermore,
we consider the case where the spaces Vand Xcoincide and where T(η)
is W-invariant for all ηR, which allows to choose the modes in W. The
resulting approximation ansatz is given by
x(t)
r
X
i=1
αi(t)T(p(t))φifor all tI.(4.20)
In this special case nt= 1, the ROM coefficient matrices and right-hand
side defined in (4.15) simplify, since the block structures in the definitions of
α,D,Mα,N,Mp,˜
Fα, and ˜
Fpare reduced to one block, respectively. More
precisely, (4.15) simplifies to
α(t):= [αi(t)]iRr, D(α):=α,
M11(p):=Mα(p):=hhT(p)φi,T(p)φjiXiij Rr,r,
M12(α, p):=N(p)α,
N(p):=hhT(p)φi,T0(p)φjiXiij Rr,r,
M22(α, p):=α>Mp(p)α,
Mp(p):=hhT0(p)φi,T0(p)φjiXiij Rr,r,
˜
F1(α, p):=˜
Fα(α, p):=
*T(p)φi, F
r
X
j=1 T(p)φjαj
+X
iRr,
˜
F2(α, p):=α>˜
Fp(α, p),
˜
Fp(α, p):=
*T0(p)φi, F
r
X
j=1 T(p)φjαj
+X
iRr.
(4.21)
In the following, we compare the ROM given by (4.1) and (4.21) to a ROM
which is obtained by using symmetry reduction techniques and model reduc-
tion via Galerkin projection. Within the symmetry reduction framework, T
is commonly assumed to be a group action and pointwise isometric and Fis
assumed to be equivariant with respect to T, as formalized in the following.
Assumption 4.2.1 (Equivariance and transformation families as group ac-
tions).Let Xbe a real Hilbert space, Wa subspace of X, and Fa mapping
118
4.2. Relation with Symmetry Reduction
from Wto X. Furthermore, let T:R L(X)define a family of W-invariant
and isometric operators. The mapping Fis equivariant with respect to T, i.e.,
we have
F(T(η)φ) = T(η)F(φ)for all φW, η R.(4.22)
Furthermore, the mapping Tdefines a group action in the sense that
T(0) = IdXand T(˜η)T(η)φ=T(˜η+η)φfor all φX, η, ˜ηR.
(4.23)
An essential ingredient of the symmetry reduction framework is a factoriza-
tion of the FOM state as
x(t) = T(p(t))v(t)for all tI(4.24)
for some path pC1(I)with p(0) = 0. Especially, if Tdefines a group action
as in Assumption 4.2.1, this equality is equivalent to v=T(p)x. For the
special case where Tis a family of shift operators, the function vcorresponds
to a different reference frame. Moreover, in some cases xallows to choose
Tand psuch that vis constant with respect to time and this is why vis
sometimes referred to as the frozen solution, cf. [36].
Assuming that the mapping T(·)v:RXis continuously differentiable
and substituting (4.24) into the FOM (1.1), we obtain the equation
T(p(t))˙v(t) + T0(p(t))v(t) ˙p(t) = F(T(p(t))v(t)) for all tI.
If also Assumption 4.2.1 is satisfied, this simplifies to
˙v(t) = F(v(t)) T(p(t))T0(p(t))v(t) ˙p(t)for all tI.(4.25)
Especially, in [36] it is shown that vsolves (4.25) if and only if x=T(p)vis a
solution of (1.1), cf. [36, Thm. 2.6]. Moreover, if Tand pare chosen properly,
then the transformed FOM (4.25) is typically more suitable for classical model
reduction methods than the original FOM (1.1). For instance, a Galerkin
projection of (4.25) onto a suitable low-dimensional subspace ˜
WWwith
associated orthonormal basis φ1, . . . , φrleads to a ROM of the form
˙
˜vi(t) = *φi, F
r
X
j=1
˜vj(t)φj
+X
r
X
j=1 hφi,T(p(t))T0(p(t))φj˙p(t)iX˜vj(t)
(4.26)
for all tIand i= 1, . . . , r. Even though the derivation of the ROM (4.26)
is different from the one in (4.1), Lemma 4.2.2 provides conditions for which
(4.26) coincides with the first block row of (4.1). If the conditions of this
lemma are satisfied, then we can write (4.26) in matrix notation as
˙
˜v(t) = ˜
Fα(˜v(t), p(t)) N(p(t))˜v(t) ˙p(t)for all tI.(4.27)
119
4. Projection-Based Model Order Reduction
Lemma 4.2.2 (Equivalence between (4.26) and the first block row of (4.1)).
Let X,W,F, and Tbe as specified in Assumption 4.2.1 and let the latter
be satisfied. Furthermore, let φ1, . . . , φrW, r Nwith hφi, φjiX=δij
for i, j = 1, . . . , r be given such that the mappings gi:RXdefined via
gi(η):=T(η)φiare continuously differentiable for i= 1, . . . , r. In addition,
let pC1(I)with I= [0, tend],tend R>0be given and let the coefficient
functions M11 :RRr,r,M12,˜
F1:Rr×RRrbe as specified in (4.21).
Then, α= ˜vC1(I,Rr)satisfies the first block row in (4.1) if and only if it
satisfies (4.26).
Proof. First, we note that the pointwise isometry of Tand the fact that the
modes φ1, . . . , φrform an orthonormal basis imply that M11 as defined in (4.21)
satisfies M11(p) = Irfor all pR. By exploiting additionally the group action
property (4.23), we observe that the ith component of M12(˜v(t), p(t)) ˙p(t)
coincides with the second term on the right-hand side of (4.26) for i= 1, . . . , r
and all tI. Moreover, by using the equivariance property (4.22), we note
that the ith component of ˜
F1(˜v(t), p(t)) equals the first term on the right-hand
side of (4.26) for i= 1, . . . , r and all tI. Thus, in total we have shown the
equivalence of the first block row of (4.1) and (4.26).
Lemma 4.2.2 demonstrates that the ROM obtained via symmetry reduction
coincides with the first block row of the ROM obtained via residual minimiza-
tion, if the same path is used for both ROMs. As outlined in the previous
section, the residual minimization framework includes a second equation of
the form
α(t)>N(p(t))>˙α(t) + α(t)>Mp(p(t))α(t) ˙p(t) = α(t)>˜
Fp(α(t), p(t)) (4.28)
for all tI, which fixes the path variable. Here, N,Mp, and ˜
Fpare as
defined in (4.21). In contrast to (4.28), different phase conditions have been
proposed within the symmetry reduction framework for determining the path.
An example is presented in [36], where the authors propose a phase condition
by minimizing the temporal change of the solution of (4.25), i.e.,
˙p(t)arg min
ηR
1
2kF(v(t)) T(p(t))T0(p(t))v(t)ηk2
Xfor all tI.(4.29)
This way of determining the path is typically referred to as freezing. Then, a
phase condition is provided by the first-order necessary optimality condition
of (4.29), which, using Assumption 4.2.1, may be written as
hT0(p(t))v(t),T0(p(t))v(t)iX˙p(t) = hT 0(p(t))v(t), F(T(p(t))v(t))iX(4.30)
for all tI. For the semi-discretization in space, in [36] the authors propose
to replace the occurring operators and the frozen solution in (4.25) and (4.30)
by corresponding finite difference approximations. In the context of model
120
4.2. Relation with Symmetry Reduction
reduction, this idea corresponds to replacing (4.25) by the ROM (4.26) and
the phase condition (4.30) by
˜v(t)>Mp(p(t))˜v(t) ˙p(t) = ˜v(t)>˜
Fp(˜v(t), p(t)) for all tI,(4.31)
where Mpand ˜
Fpare as defined in (4.21).
Instead of first deriving a phase condition for the FOM (4.25) and then
reducing it, an alternative approach is to derive a phase condition for the
ROM (4.26) by minimizing the time derivative of the ROM state. For the case
that Assumption 4.2.1 is satisfied, we have shown in the proof of Lemma 4.2.2
that (4.26) may be written as (4.27) and, thus, minimizing the time derivative
of the ROM state means solving the minimization problem
˙p(t)arg min
ηR
1
2˜
Fα(˜v(t), p(t)) N(p(t))˜v(t)η2for all tI.
This is in general not equivalent to (4.31), but instead leads to the phase
condition
˜v(t)>N(p(t))>N(p(t))˜v(t) ˙p(t) = ˜v(t)>N(p(t))>˜
Fα(˜v(t), p(t)) for all tI.
(4.32)
In total, we have presented three different ways of deriving a phase condition
for fixing the path variable in the ROM (4.26). The first one is based on resid-
ual minimization as discussed in the previous section and leads to the phase
condition (4.28) with α= ˜v. The other two are based on the idea of freezing
the state by minimizing its temporal change and only differ by the order: The
phase condition (4.31) is based on first freezing the state of the FOM (4.25)
and reducing afterwards, whereas (4.32) is based on first reducing the FOM
and then freezing the state of the ROM (4.26). Moreover, the phase condi-
tions (4.28) and (4.32) are directly linked to minimization problems since they
coincide with the first-order necessary optimality conditions corresponding to
the problems of minimizing the residual and the temporal change of the ROM
state, respectively. On the contrary, (4.31) is obtained by approximating the
phase condition (4.30) and due to the corresponding approximation error it is
not immediately clear if (4.31) is itself optimal in some sense. The following
theorem states that the phase condition (4.31) leads indeed to optimal paths,
which minimize a cost function which coincides with the sum of the cost func-
tions corresponding to the other two phase conditions (4.28) and (4.32). Thus,
Theorem 4.2.3 does not only yield an optimality result for (4.31), but it also
provides a connection between the three phase conditions (4.28), (4.31), and
(4.32).
Theorem 4.2.3 (Optimality of the phase condition (4.31)).Let the assump-
tions of Lemma 4.2.2 be satisfied and let additionally (˜v, p)C1(I,Rr)×C1(I)
satisfy (4.27) and the phase condition (4.31). Then, the path pis optimal in
the sense that for each tIthe derivative value ˙p(t)solves the minimization
121
4. Projection-Based Model Order Reduction
problem
min
ηR1
2R(˙
˜v(t), η, ˜v(t), p(t))2
X+1
2˜
Fα(˜v(t), p(t)) N(p(t))˜v(t)η2,
(4.33)
where ˜
Fαand Nare as defined in (4.21) and the residual map R:Rr×R×
Rr×RXis defined via
R(η1, η2, η3, η4):=
r
X
i=1
([η1]iT(η4)φi+ [η3]iT0(η4)φiη2)F r
X
i=1
[η3]iT(η4)φi!.
Proof. Similarly, as in the proof of Theorem 4.1.1 and in the derivation of
(4.32), we obtain the first-order necessary optimality condition of (4.33) as
˜v(t)>N(p(t))>˙
˜v(t) + ˜v(t)>Mp(p(t))˜v(t)η˜v(t)>˜
Fp(˜v(t), p(t))
= ˜v(t)>N(p(t))>˜
Fα(˜v(t), p(t)) ˜v(t)>N(p(t))>N(p(t))˜v(t)η. (4.34)
Furthermore, the second derivative of the cost function in (4.33) is given by
˜v(t)>(Mp(p(t)) + N(p(t))>N(p(t)))˜v(t)
and, thus, non-negative. Consequently, the first-order necessary optimality
condition is also sufficient, cf. section 2.2. Moreover, since ˜vand psatisfy
(4.27) and (4.31), we have
˜v(t)>N(p(t))>˙
˜v(t) + ˜v(t)>Mp(p(t))˜v(t) ˙p(t)˜v(t)>˜
Fp(˜v(t), p(t))
= ˜v(t)>N(p(t))>˜
Fα(˜v(t), p(t)) N(p(t))˜v(t) ˙p(t),
i.e., η= ˙p(t)satisfies the first-order optimality condition (4.34), which yields
the claim.
4.3. Hyperreduction
Since the approximation ansatzes (1.4) and (1.8) are in general nonlinear with
respect to (α, p), the corresponding ROMs of the form (4.1) are also in general
nonlinear, even if the FOM is linear. This is for instance reflected in the fact
that the mass matrix in (4.1) depends on the ROM state, which prevents an
efficient offline/online decomposition as discussed in section 2.5.3. To overcome
this issue, in this section we address the problem of deriving an additional
approximation of the ROM (4.1) to achieve that the evaluation of the ROM
does not scale with the dimension of the FOM. To this end, we consider the
case that the full-order model is linear in section 4.3.1, whereas the case of a
nonlinear FOM is treated in section 4.3.2. This separation is done since the
nonlinearities in the former case only come from the nonlinear approximation
ansatz, whereas there is an additional nonlinearity when the FOM is nonlinear.
122
4.3. Hyperreduction
Since we treat these two sources of nonlinearity differently, we also discuss these
two cases separately in sections 4.3.1 and 4.3.2.
In some special cases, the ROM (4.1) may be evaluated in an efficient way
without requiring another approximation via a hyperreduction technique. This
is for instance the case in the setting considered in Theorem 4.3.1 and Re-
mark 4.3.2, see also Example 4.3.3.
Theorem 4.3.1 (ROM with path-independent coefficients).Let Xbe a real
Hilbert space and Wa subspace of X. Furthermore, let T:R L(X)
be pointwise W-invariant and isometric and let F:R0×WXbe T-
equivariant, i.e.,
F(t, T(p)x) = T(p)F(t, x)for all xW, p R, t R0.(4.35)
In addition, let φ1, . . . , φrW, r Nbe such that the mappings gi:RX
defined via gi(η):=T(η)φiare continuously differentiable for i= 1, . . . , r.
Moreover, let there exist a mapping Q:WXwhich satisfies
g0
i(η) = T(η)Q(φi)for all ηR, i {1, . . . , r}.(4.36)
Then, for nt= 1,r1:=r, and φ1,i :=φi,i= 1, . . . , r, the functions M11,M12,
M22,˜
F1, and ˜
F2defined in (4.15) are constant with respect to their respective
last argument.
Proof. Since Tis pointwise isometric and due to (4.36), we infer
hT(p)φ1,i,T(p)φ1,jiX=hφ1,i, φ1,jiX,
hT(p)φ1,i,T0(p)φ1,jiX=hT (p)φ1,i,T(p)Q(φ1,j)iX=hφ1,i,Q(φ1,j)iX,
hT0(p)φ1,i,T0(p)φ1,jiX=hQ(φ1,i),Q(φ1,j )iX
for all i, j {1, . . . , r}and pR. Consequently, the matrix functions M11,
M12, and M22 as defined in (4.15) are constant with respect to p. Similarly,
exploiting additionally the equivariance property (4.35), we obtain
T(p)φ1,i, Ft,
r
X
j=1 T(p)φ1,jα1,jX
=φ1,i, Ft,
r
X
j=1
φ1,jα1,jX
,
T0(p)φ1,i, Ft,
r
X
j=1 T(p)φ1,jα1,jX
=Q(φ1,i), Ft,
r
X
j=1
φ1,jα1,jX
for all i {1, . . . , r},tR0,α1,1, . . . , α1,r R, and pR. Thus, also the
functions ˜
F1and ˜
F2as defined in (4.15) are constant with respect to p.
Remark 4.3.2 (Special cases where no hyperreduction is needed).As demon-
strated in Theorem 4.3.1, the coefficients of the ROM (4.1) are sometimes
independent of the path p. If additionally the operator Fallows for a separa-
tion of the form (2.24), then an efficient offline/online decomposition without
123
4. Projection-Based Model Order Reduction
hyperreduction can be achieved in the same way as when using a classical
linear model reduction ansatz, cf. section 2.5.3. A corresponding observation
has been made in [37, Rem. 6.3] in a similar setting. Furthermore, even if
the ROM coefficients depend on the path p, in some special cases it may be
still possible to achieve an efficient offline/online decomposition without any
hyperreduction. This has for instance been observed in [38] when applying the
framework to the linear wave equation. ¨
Example 4.3.3 (ROM for linear advection equation with periodic boundary
conditions).We consider the linear advection equation with periodic boundary
conditions as in Example 1.2.1. As mentioned in Example 4.1.2, the corre-
sponding initial-boundary value problem may be written as abstract ODE of
the form (1.1) by setting X=L2(Ω),W=C1
per(Ω), and F:WXwith
F(x):=cx0. We construct a ROM of the form (4.1), (4.15) based on the
approximation ansatz (1.8) with nt= 1,φ1, . . . , φrW, and T=Tper with
Tper as defined in Definition 1.2.2. Especially, all assumptions of Theorem 4.1.1
are satisfied as discussed in Example 4.1.2. Furthermore, Theorem A.1(iv) and
Theorems A.2 and A.3 imply that Tper is pointwise W-invariant and isometric
as well as the existence of Qsatisfying (4.36). Besides, from Definition 1.2.2
we infer that Fis Tper-equivariant as in (4.35). Consequently, Theorem 4.3.1
yields that the ROM coefficients and right-hand side are constant with respect
to the path. Since in addition Fis linear, by following the arguments from
Remark 4.3.2 we conclude that the ROM may be evaluated in an efficient way
without the need for hyperreduction. l
4.3.1. Case of a Linear Full-Order Model
In this subsection we focus on the special case where the operator Fin (1.1)
does not depend explicitly on time and is linear, see also [39, sec. 4.1]. The
treatment of time- and parameter-dependent right-hand sides is briefly ad-
dressed in Remark 4.3.10 at the end of this subsection. Furthermore, we
mainly focus on the case where the ROM is constructed based on the approx-
imation ansatz (1.4), while emphasizing that the ROM based on the clustered
approximation ansatz (1.8) may be treated analogously.
In the case of a linear operator F:WX, the functions ˜
Fα,˜
Fp:Rr×Rr
Rrdefined in (4.2) can be written as ˜
Fα(α, p) = Aα(p)αand ˜
Fp(α, p) = Ap(p)α,
respectively, where Aα, Ap:RrRr,r are defined via
Aα(p) = hhT(pi)φi, F (T(pj)φj)iXiij ,
Ap(p) = hhT0(pi)φi, F (T(pj)φj)iXiij .(4.37)
As mentioned in the introduction of this section, the ROM coefficient matrix
functions depend on the state variables αand p, which makes (4.1) a nonlinear
system of dimension 2r. We observe especially that the evaluation of Mα,N,
124
4.3. Hyperreduction
Mp,Aα, and Apinvolves the computation of inner products in X. Since these
evaluations scale with the dimension of the high-dimensional vector space X,
these computations need to be avoided in the online phase for the sake of ef-
ficiency. To this end, we pursue the following idea. In the offline phase we
evaluate the matrix functions Mα,N,Mp,Aα, and Apfor different sample
points of the path and use these samples for constructing corresponding ap-
proximants of the matrix functions, e.g., via interpolation or regression. In the
online phase, we may then employ these approximants instead of the original
matrix functions Mα,N,Mp,Aα, and Apin order to avoid expensive computa-
tions scaling with the dimension of the FOM. This idea has also been pursued
in [54] in a less general setting which corresponds to the approximation ansatz
(1.8) with nt= 1 and T=Tper.
In the following we illustrate this idea based on the matrix function Mα,
while the treatment of the other matrix functions follows the same lines. Dur-
ing the sampling process, we need to compute inner products of transformed
modes. There are at least two possible ways of sampling Mα: We can for
instance sample the whole matrix over the complete path space Rr, or alter-
natively Rntwhen using the clustered ansatz (1.8), and build a corresponding
approximation of Mα. Since each entry of Mαis an inner product of two
transformed modes and, thus, depends on not more than two path compo-
nents, we can alternatively sample each single entry of Mαover R2and build
a corresponding approximation of each entry. In the case r > 2, the latter
version results in sampling spaces of lower dimension, but may on the other
hand result in a more expensive online phase since the approximation has to
be evaluated for each entry separately. Sometimes, the specific choice of the
family of transformation operators Tallows to further reduce the dimension
of the sampling space, see Remark 4.3.4. Another tool to decrease the di-
mension of the sampling space is the method of active subspaces, see [73] for
an overview and [39] for an application in the context of the nonlinear model
reduction framework considered in this thesis.
Remark 4.3.4 (Transformation operators which simplify the sampling proce-
dure).A particularly useful special case occurs when the family of transforma-
tion operators Tused for the approximation ansatz (1.4) satisfies the property
T(p1)T(p2) = T(τ(p1, p2)) for all p1, p2R(4.38)
for some mapping τ:R×RR. Here, T(p1)denotes the adjoint operator
of T(p1)with respect to the inner product ,·iX. If Tsatisfies (4.38), then
the definition of Mαin (4.2) implies that it is sufficient to sample each entry of
Mαon a one-dimensional subspace. If additionally condition (4.36) is satisfied
for some Q:WX, then the same applies to the entries of Nand Mp.
Finally, if additionally Fis equivariant with respect to T, cf. (4.35), then also
each entry of Aαand Apmay be just sampled on a one-dimensional subspace.
The properties (4.38) and (4.36) are, for instance, satisfied for the family of
125
4. Projection-Based Model Order Reduction
periodic shift operators Tper with X=L2(Ω) and W=C1
per(Ω), where (4.38)
is satisfied with τ(p1, p2):=p2p1and (4.36) with Qφ:=φ0, cf. [37, Ex. 5.12]
and appendix A. However, (4.38) is not necessarily satisfied for shift operators
with other boundary treatment as demonstrated for the example of the zero
padding shift operator in Theorem 4.3.5. ¨
Theorem 4.3.5 (Zero padding shift operator and property (4.38)).Let and
T0be as in Definition 3.3.6. Then, there exists no mapping τ:R×RR
satisfying (4.38) with T=T0.
Proof. If there existed a mapping τ:R×RRsatisfying (4.38) with T=T0,
then for any (η1, η2)R2there would exist an η3Rsatisfying
hT0(η1)φ1,T0(η2)φ2iL2(Ω) =hφ1,T0(η3)φ2iL2(Ω) for all φ1, φ2L2(Ω) .
(4.39)
In the following, we show that such an η3does not exist for all (η1, η2)R2
by means of a counterexample. To this end, we consider the special choice
η1=2
3(ba),η2=1
3(ba), and φ1, φ2L2(Ω) defined via φ1(ξ) = 1 and
φ2(ξ) = sin 3π
ba(ξa)
for almost every ξ. In this case we have
hT0(η1)φ1,T0(η2)φ2iL2(Ω) =
b
ZaT02
3(ba)φ1(ξ)T0ba
3φ2(ξ) dξ
=
b
Z
a+2
3(ba)
sin 3π
baξba
3adξ=
2
3(ba)
Z
1
3(ba)
sin 3π
baξdξ
=ba
3π
2π
Zπ
sin(z) dz=ab
3πcos(z)|2π
π=2(ab)
3π<0,
i.e., η3Rneeds to satisfy
0>2(ab)
3π=hφ1,T0(η3)φ2iL2(Ω) =
b
Za
(T0(η3)φ2)(ξ) dξ. (4.40)
Thus, |η3|must be smaller than basince otherwise the right-hand side of
(4.40) would be zero. Consequently, we have
b
Za
(T0(η3)φ2)(ξ) dξ=
b+min(03)
Z
a+max(03)
sin 3π
ba(ξη3a)dξ
126
4.3. Hyperreduction
=
ba+min(03)η3
Z
max(03)η3
sin 3π
baξdξ=ab
3πcos 3π
baξba+min(03)η3
max(03)η3
=ab
3π cos 3π 1 + min(0, η3)η3
ba!!cos 3π
ba(max(0, η3)η3)!.
If η3is negative, this equals
ab
3π1cos 3πη3
baR0
and otherwise the result is
ab
3πcos 3π1η3
ba1R0.
Comparing this with (4.40) shows that for the considered choice of (η1, η2)
there exists no η3satisfying (4.39). Thus, T0does not satisfy (4.38) for any
τ:R×RR.
Regardless of whether we may reduce the dimension of the sampling domain
by using (4.38) or not, it is in general desirable for the practical implementation
to replace the unbounded sampling area by a bounded subset. For instance,
if Tis given by the family of periodic shift operators over a bounded domain
as introduced in Definition 1.2.2, the sampling domain can be restricted to
( R. In the following, we discuss possible choices for the sampling domain
by means of several examples.
Example 4.3.6 (Hyperreduction for the telegrapher’s equations).We consider
the telegrapher’s equations with periodic boundary conditions
CtV(t, ξ) = ξI(t, ξ)GV (t, ξ)for all (t, ξ)I×,
L∂tI(t, ξ) = ξV(t, ξ)RI(t, ξ)for all (t, ξ)I×,
V(0, ξ) = V0(ξ)for all ξ,
I(0, ξ) = I0(ξ)for all ξ,
V(t, a) = V(t, b)for all tI,
I(t, a) = I(t, b)for all tI
on a one-dimensional domain = (a, b)with aR, b R>a and time interval
I= [0, tend]with tend R>0. The telegrapher’s equations may for instance
be used for modeling an electrical transmission line, cf. [249, p. 259]. The
unknowns are the voltage V:I×Rand the current I:I×R, whereas
the capacitance CR>0, the inductance LR>0, the conductance GR0,
and the resistance RR0are assumed to be given. Furthermore, similarly as
for the linear wave equation problem considered in Example 1.2.3, the initial
values are assumed to be continuously differentiable functions defined on
127
4. Projection-Based Model Order Reduction
and to satisfy the boundary conditions
V0(a) = V0(b), V 0
0(a) = V0
0(b), I0(a) = I0(b), I0
0(a) = I0
0(b).
For the special case LG =CR, we may derive by similar arguments as in [249,
p. 259f.] the analytical solution
"V(t, ξ)
I(t, ξ)#=eR
Lt " 1
qC
L#ϑr ξ1
LC t!+"1
qC
L#ϑl ξ+1
LC t!!,
(4.41)
where ϑr:RRand ϑl:RRare, similarly as in Example 1.2.3, given by
the periodic continuations of
1
2
V0+sL
CI0
and 1
2
V0sL
CI0
,(4.42)
respectively. Thus, the analytical solution is given by two traveling waves
with wave speeds ±1
LC and with exponentially decreasing amplitudes. Sim-
ilarly as in Example 1.2.3, this may be described by an ansatz of the form
(1.4) with r= 2 modes and using a component-wise application of periodic
shift operators as in (1.17). Then, after constructing a ROM of the form
(4.1)–(4.2) and exploiting the equivariance of the right-hand side of the tele-
grapher’s equations with respect to the family of periodic shift operators, we
may follow the arguments of Remark 4.3.4 and only need to sample the ROM
coefficient matrices in a one-dimensional space. Furthermore, due to the prop-
erty Tper(η) = Tper(η+k(ba)), which holds for all ηRand kZ, it is
even sufficient to sample the ROM matrices on an interval of length ba.l
Example 4.3.7 (Hyperreduction for the advection equation with homoge-
neous Dirichlet boundary conditions).We revisit Example 3.3.1, where a lin-
ear advection equation with homogeneous boundary conditions is considered.
As mentioned in section 3.3, the family of zero padding shift operators T0
is well-suited for describing the analytical solution. However, since T0(η)is
not an isometry for ηR\ {0}, the coefficient matrices of a corresponding
reduced-order model constructed as in section 4.1 depend in general on the
path p. Nevertheless, in this example we have just one scalar path and, thus,
an efficient offline/online decomposition requires sampling the path only within
a one-dimensional sampling domain in the offline phase. Furthermore, when
considering the spatial domain (0,1), it is sufficient to reduce the sampling
domain to [1,1], since all shifts pwhose absolute values are greater than 1
lead to T0(p)φ= 0 and T0
0(p)φ= 0, independently from φ.l
Example 4.3.8 (Hyperreduction for the wave equation with outlet boundary
conditions).We consider the linear wave equation (1.13) on the spatial domain
128
4.3. Hyperreduction
0 0.2 0.4 0.6 0.8 1
0
0.5
1
1.5
2
ξ
ρ0(ξ)
Figure 4.1.: Example 4.3.8: initial value for the density.
= (0,1) with c= 1,ρref = 1, initial values
v0(ξ)=0, ρ0(ξ) =
1cos 10πξ2
5,if ξh2
5,3
5i,
0,otherwise,
for all ξ[0,1], cf. Figure 4.1, and outlet boundary conditions
ρ(t, a) = v(t, a), ρ(t, b) = v(t, b)for all tI.
The analytical solution of this initial-boundary value problem is given by
x(t, ξ):="ρ(t, ξ)
v(t, ξ)#="1
1#f(ξct) + "1
1#f(ξ+ct)
with f:RRbeing defined as the zero extension of 1
2ρ0, i.e., f=1
2ρ0on
(0,1) and f= 0 on R\(0,1), cf. Figure 4.2 for a depiction of this solution. We
observe that xcan be represented by the approximation ansatz (1.4) with two
modes by using for instance the zero padding shift operator as introduced in
Definition 3.3.6. Since such an approximation is based on two path variables,
one for the left-going and one for the right-going wave, applying the hyperre-
duction scheme presented in this subsection would involve the sampling of the
two-dimensional path space R2. Since the zero padding shift operator does in
general not satisfy (4.38) for any τ:R×RR, cf. Theorem 4.3.5, we can-
not use this property for reducing the dimension of the path space. However,
following similar arguments as in Example 4.3.7, we may reduce the sampling
domain to [1,1]2. Moreover, we may exploit the fact that all relevant path
values for this example satisfy p1=p2, which allows to further reduce the
sampling domain to the line segment {(p1, p2)[1,1]2|p1=p2}. This
last reduction step is closely linked to the method of active subspaces, cf. the
discussion right before Remark 4.3.4. l
129
4. Projection-Based Model Order Reduction
0 0.5 1
0
0.2
0.4
0.6
ξ
t
0
1
2
ρ(t,ξ)
0 0.5 1
0
0.2
0.4
0.6
ξ
t
1
0
1
v(t,ξ)
Figure 4.2.: Example 4.3.8: pseudocolor plots of the analytical solution for the density
(left) and the velocity (right).
Example 4.3.9 (Hyperreduction for the linearized Euler equation).We con-
sider the linearized Euler equation
tρ(t, ξ)
tv(t, ξ)
tp(t, ξ)
=
vref ρref 0
0vref 1
ρref
0γpref vref
ξρ(t, ξ)
ξv(t, ξ)
ξp(t, ξ)
for all (t, ξ)I×(4.43)
on the spatial domain = (0,1) and time interval I= [0, tend]with tend R>0.
Here, ρ, v, p:I×Rdenote the density, velocity, and pressure fluctuations,
respectively. The constants vref R,ρref , pref R>0refer to the reference
velocity, density, and pressure, respectively, and γR>0is the adiabatic ex-
ponent. To obtain a unique solution, (4.43) is closed by appropriate initial and
boundary conditions, cf. [33]. Using a similar technique as for the linear wave
equation considered in Example 1.2.3, one may derive the general analytical
solution of (4.43) as
ρ(t, ξ)
v(t, ξ)
p(t, ξ)
=
1
0
0
f1(ξvreft)+
ρref
c
ρrefc2
f2(ξ(vref +c)t)+
ρref
c
ρrefc2
f3(ξ(vref c)t)
for all (t, ξ)I×, where c:=qγpref
ρref denotes the speed of sound and
f1, f2, f3:RRare determined based on the initial and boundary conditions.
Thus, the analytical solution is given by a linear superposition of three traveling
waves with the three different wave speeds vref and vref ±c. Hence, it appears
to be natural to use an approximation ansatz of the form (1.8) with nt= 3
reference frames and using appropriate shift operators whose choice depends
on the boundary conditions, cf. section 3.3. As a consequence, the ROM
coefficient matrices depend in general on three path variables, whereas each
single matrix entry only involves two of the path variables, cf. (4.15). Thus,
for this example it may be advantageous to construct an approximant for each
matrix entry separately, which only involves sampling a two-dimensional space
130
4.3. Hyperreduction
instead of a three-dimensional one. l
Remark 4.3.10 (Hyperreduction for a linear, non-autonomous, and parame-
ter-dependent FOM).If the right-hand side Fof the FOM (1.1) is linear with
respect to xand depends on time and on a parameter vector µwith a sep-
arability property of the form (2.24), then this separability is also inherited
in the reduced-order model. Thus, also in this case an efficient offline/online
decomposition can be achieved, but the offline phase becomes in general more
expensive since the number of matrix functions which need to be sampled
increases with increasing Kin (2.24). ¨
Remark 4.3.11 (Efficient computation of the residual).For evaluating the error
bound from Theorem 4.1.9 in the online phase, it is important to be able to
evaluate the residual in an efficient way which does not scale with the FOM
dimension. In the case of a linear autonomous FOM as considered in this
subsection, straightforward calculations reveal that the squared norm of the
residual at time tIis given by
kR( ˙α(t),˙p(t), α(t), p(t))k2
X
= ˙α(t)>Mα(p(t)) ˙α(t) + ˙p(t)>D(α(t))>Mp(p(t))D(α(t)) ˙p(t)
+ 2 ˙α(t)>N(p(t))D(α(t)) ˙p(t)2 ˙α(t)>Aα(p(t))α(t)
2 ˙p(t)>D(α(t))>Ap(p(t))α(t) + α(t)>AR(p(t))α(t),
see (4.3) for the definition of R. Here, AR:RrRr,r is defined via
AR(p):=hhF(T(pi)φi), F (T(pj)φj)iXiij .
Especially, we observe that the norm of the residual can be evaluated in an
efficient way, once the path-dependent matrices Mα,Mp,N,Aα,Ap, and AR
are replaced by corresponding approximants as outlined at the beginning of
this subsection. ¨
4.3.2. Case of a Nonlinear Full-Order Model
In section 4.3.1, we considered the case where the FOM (1.1) is linear and
the only source of nonlinearity in the ROM comes from the nonlinear approx-
imation ansatz. The resulting nonlinearity is special since it arises from inner
products involving the transformed modes and their derivatives with respect
to the path. This is due to the fact that we employ the special nonlinear ansatz
(1.4) which represents a linear combination of transformed modes which de-
pend on the paths. In this section, we consider the case where the FOM is
nonlinear, i.e., in addition to the nonlinearity which originates from the ap-
proximation ansatz (1.4), we have to deal with the nonlinearity of the FOM.
Since the latter nonlinearity can have various forms, we cannot treat it in the
same way as in section 4.3.1 and, thus, the case of a nonlinear FOM is treated
131
4. Projection-Based Model Order Reduction
separately in this section. As it appears to be more convenient, we use the
approximation ansatz (1.8) in this subsection, i.e., the modes are assumed to
be clustered in reference frames. Furthermore, we consider the special case
where Xis a function space over some spatial domain Rdand we assume
that Fallows for pointwise evaluations of F(η)Xfor all ηW, which is
for instance the case if F(η)is in C(Ω). These assumptions are crucial for the
following considerations, since the pointwise evaluations of F(η)are required
for the method presented in the following. Nevertheless, in section 6.3 we also
demonstrate how the framework may be applied in the finite-dimensional set-
ting with X=Rn. For simplicity, we assume in the following that the FOM
right-hand side Fdoes not explicitly depend on time and we briefly address
the more general case of a time- and parameter-dependent Fin Remark 4.3.14.
The assumptions on Xand Fallow us to use a hyperreduction framework
which is based on ideas of the EIM/DEIM, cf. section 2.5.4. In principle, it
is also possible to directly apply EIM or DEIM for approximating the FOM
nonlinearity. However, since both techniques are based on approximations
via suitable linear subspaces, they suffer from the same shortcomings as for
instance the POD method when applied to transport-dominated systems. This
is at least true if the transport also affects the snapshots of the nonlinearity,
which is usually the case, see for instance the following example.
Example 4.3.12 (Viscous Burgers’ equation).We consider the viscous Burg-
ers’ equation
tx(t, ξ) = x(t, ξ)ξx(t, ξ) + ξξx(t, ξ)for all (t, ξ)I×,(4.44)
with spatial domain = R, time interval I= [0, tend],tend R>0, and initial
condition
x(0, ξ) = x0(ξ):= 20 19 tanh 19
2ξfor all ξ.
As for instance shown in [118], this initial value problem admits a traveling
wave solution x(t, ξ) = x0(ξ20t). Indeed, this is also the only regular solu-
tion, cf. [149, Thm. 1]. The analytical solution and the corresponding nonlinear
term x∂ξxare depicted in Figure 4.3. Especially, we observe that not only
the solution xbut also the nonlinearity snapshots exhibit the propagation of a
sharp traveling wave. This indicates that both the solution xand the nonlin-
earity x∂ξxcannot be approximated very accurately by a linear combination
of a few spatial ansatz functions. This observation is also supported by the
corresponding slow singular value decays depicted in Figure 4.4, especially for
the nonlinearity snapshots. We note that the reason for the significantly faster
singular value decay for the snapshots of xis caused by the fact that the corre-
sponding traveling wave does not have compact support. Instead, large parts
of the space time diagram in Figure 4.3, left, are occupied by a constant value
of 39 and this appears to be less challenging for linear subspace approximations
132
4.3. Hyperreduction
Figure 4.3.: Example 4.3.12: pseudocolor plots of the traveling wave solution xof the Burg-
ers’ equation (left) and of the corresponding nonlinear term x∂ξx(right).
50 100 150 200 250 300
105
104
103
102
101
100
i
σi
σ1
Snapshots of x
Snapshots of x∂ξx
Figure 4.4.: Example 4.3.12: singular value decays of the snapshot matrices depicted in
Figure 4.3.
than the traveling wave with compact support depicted in Figure 4.3, right.
We note that the singular value decays shown in Figure 4.4 are based
on snapshot matrices obtained via sampling the corresponding continuous
functions with t= 0.002 and ξ= 0.01 on the bounded set {(ξ, t)
[1,21] ×[0,1]}. Further decreasing tand ξdoes not significantly change
the picture in Figure 4.4. l
In the following, we make use of the idea of the EIM/DEIM as outlined in
section 2.5.4 and adapt it for obtaining a method which is more suitable for
transport-dominated problems. To this end, we approximate the nonlinearity
by a linear combination of transformed modes, i.e.,
F
nt
X
i=1 T(pi(t))
ri
X
j=1
αi,j(t)φi,j
nt
X
i=1 T(pi(t))
si
X
j=1
βi,j(t)ψi,j,(4.45)
where ψi,j ZVare the EIM ansatz functions or modes and βi,j :IR
133
4. Projection-Based Model Order Reduction
are the corresponding coefficients for j= 1, . . . , si,i= 1, . . . , nt. Here, the
subspace ZVis chosen such that T(η)ψi,j is in F(W)for all ηR,
j= 1, . . . , si,i= 1, . . . , nt. Moreover, the total number of EIM modes is
s:=Pnt
i=1 siand they are obtained in the offline phase in the same way as the
modes φi,j, but based on snapshots of F(x(·)), cf. section 2.5.4 and chapter 3.
The coefficients βi,j are a priori unknown and, similarly as in section 2.5.4,
we propose to determine them by enforcing the approximation in (4.45) to be
exact at a set of scollocation points. In the classical DEIM, these collocation
points are typically determined based on the DEIM mode matrix, which occurs
as leading matrix on the right-hand side of the DEIM ansatz (2.26), cf. sec-
tion 2.5.4. Thus, in our setting it appears natural to choose the collocation
points based on the transformed EIM modes T(pi(t)) ψi,j. Since these depend
on the path p, we propose to also consider path-dependent collocation points
pi,j :Rntfor j= 1, . . . , si,i= 1, . . . , nt. We first consider them to be
given and discuss suitable choices at the end of this subsection.
Remark 4.3.13 (Transformations and paths used for the nonlinearity approxi-
mation).We emphasize that by using the approximation ansatz (4.45) for the
nonlinearity, we implicitly assume that the transformation operators and paths
used for approximating the FOM state are also suitable for approximating the
FOM nonlinearity. We observe that this assumption is especially valid for the
numerical example considered in section 6.3, see also [39] for the application of
the method to a wildland fire model. In general, there might exist nonlinear-
ities and transformation operators for which this assumption is not valid and
in that case one may have to use different transformation operators or paths
for approximating the FOM nonlinearity. ¨
Based on the given collocation points, the EIM coefficients are determined
by the linear equation system
nt
X
k=1
sk
X
`=1
βk,`(t) (T(pk(t)) ψk,`) (pi,j (p(t)))
=F nt
X
k=1 T(pk(t))
rk
X
`=1
αk,`(t)φk,`!(pi,j(p(t)))
for j= 1, . . . , si,i= 1, . . . , nt,tI. This can be written in the compact
form AEIM(p(t))β(t) = b(α(t), p(t)), where β:IRs,AEIM :RntRs,s, and
b:Rr×RntRsare defined via
β(t):=hβ1(t)>··· βnt(t)>i,
βi(t):= [βi,k(t)]kRri,(4.46a)
134
4.3. Hyperreduction
AEIM(p):=
A1,1
EIM(p)··· A1,nt
EIM(p)
.
.
.....
.
.
Ant,1
EIM(p)··· Ant,nt
EIM (p)
,
Ai,j
EIM(p):= [(T(pj)ψj,`)(pi,k(p))]k` Rsi,sj,
(4.46b)
b(α, p):=hb1(α, p)>··· bnt(α, p)>i>,
bi(α, p):=
F
nt
X
q=1 T(pq)
rq
X
`=1
αq,`φq,`
(pi,k(p))
kRsi
(4.46c)
for i, j = 1, . . . , nt. If AEIM(p(t)) is nonsingular for all tI, then the EIM
coefficient vector βis uniquely determined. A necessary condition for this
is that the collocation points p1,1(p(t)),...,pnt,snt(p(t)) are pairwise distinct
for all tI, since otherwise some rows of AEIM(p(t)) would coincide. The
treatment of the singular case is briefly addressed in Remark 4.3.17.
In summary, after employing the approximation (4.45), the resulting ROM
reads
M11(p(t)) ˙α(t) + M12(α(t), p(t)) ˙p(t) = ˆ
Aα(p(t))β(t),(4.47a)
M12(α(t), p(t))>˙α(t) + M22(α(t), p(t)) ˙p(t) = D(α(t))>ˆ
Ap(p(t))β(t),(4.47b)
AEIM(p(t))β(t) = b(α(t), p(t)),(4.47c)
for all tI, where ˆ
Aα,ˆ
Ap:RntRr,s are defined via
ˆ
Aα(p):=
ˆ
A1,1
α(p)··· ˆ
A1,nt
α(p)
.
.
.....
.
.
ˆ
Ant,1
α(p)··· ˆ
Ant,nt
α(p)
,ˆ
Ap(p):=
ˆ
A1,1
p(p)··· ˆ
A1,nt
p(p)
.
.
.....
.
.
ˆ
Ant,1
p(p)··· ˆ
Ant,nt
p(p)
,
ˆ
Ai,j
α(p):=hhT(pi)φi,k,T(pj)ψj,`iXik` Rri,sj,
ˆ
Ai,j
p(p):=hhT0(pi)φi,k,T(pj)ψj,`iXik` Rri,sj
(4.48)
for i, j = 1, . . . , ntand M11, M12, M22, D are as specified in (4.15). The eval-
uation of (4.47) scales in general still with the dimension of the full-order
model. However, the left-hand side as well as the path-dependent matrices on
the right-hand side of (4.47a) and (4.47b) can be approximated as detailed
in section 4.3.1 and afterwards evaluated in an efficient way without scaling
with the FOM dimension. Furthermore, provided that we have access to the
functions p1,1,...,pnt,snt,AEIM only depends on the path p, cf. (4.46), and thus
it may be also treated as discussed in section 4.3.1. The most critical part is
the evaluation of b(α(t), p(t)) which still involves the evaluation of the FOM
nonlinearity F, cf. (4.46). However, the nonlinearity is only to be evaluated at
the collocation points. Let us consider the special case where Xis a space of
scalar-valued functions and where Fis given by the pointwise evaluation of a
135
4. Projection-Based Model Order Reduction
function ˆ
F:RR, i.e., (F(x))(ξ) = ˆ
F(x(ξ)) for all (ξ, x)×W. Then,
we have
bi(α(t), p(t)) =
F
nt
X
q=1 T(pq(t))
rq
X
`=1
αq,`(t)φq,`
(pi,k(p(t)))
k
=
ˆ
F
nt
X
q=1
rq
X
`=1
αq,`(t)(T(pq(t)) φq,`)(pi,k(p(t)))
k
(4.49)
for all i {1, . . . , nt}and tI, which can be evaluated in an efficient way
once the matrix function P:RntRs,r, defined via
P(p):=
P1,1(p)··· P1,nt(p)
.
.
.....
.
.
Pnt,1(p)··· Pnt,nt(p)
,
Pi,q(p):= [(T(pq)φq,`)(pi,k(p))]k` Rsi,rq
(4.50)
for i, q = 1, . . . , nt, has been approximated in the offline phase analogously
to the treatment of AEIM. Similarly, one can treat the case where (F(x))(ξ)
only depends on xand its spatial derivatives evaluated at ξ, see the upcoming
Example 4.3.16. In many applications, the nonlinearity satisfies such a locality
property, which allows an efficient evaluation of the ROM. For general functions
F, the evaluation of the hyperreduced ROM may still depend on the FOM
dimension. This issue also applies to the standard EIM/DEIM framework, see
section 2.5.4.
Remark 4.3.14 (Hyperreduction for a FOM with nonlinear, time- and parame-
ter-dependent right-hand side).We note that the approximation (4.45) of the
nonlinearity as well as the derivation of the hyperreduced ROM (4.47) follows
the same lines when considering a FOM right-hand side which additionally
depends explicitly on time. The presented method may also be straightfor-
wardly extended to the case where the FOM state and right-hand side depend
additionally on a parameter vector as in Remark 1.1.1. This is achieved by re-
placing all time-dependent quantities like α,p, or βby corresponding functions
of both tand µ.¨
Example 4.3.15 (Fisher’s equation).We consider Fisher’s equation
tx(t, ξ) = νξξx(t, ξ) + gx(t, ξ)11
κx(t, ξ),(t, ξ)I×,
which is used, among other things, to model the wave propagation of genes
in a population, cf. [78, sec. 8.8]. Here, ν, g, κ R>0denote the diffusion
coefficient, linear growth rate, and the carrying capacity of the environment,
respectively. We consider this partial differential equation on the bounded
domain = (0,1) with homogeneous Dirichlet boundary conditions for the
state x. Formally, by considering xas a function from Ito some function
136
4.3. Hyperreduction
space over , we can write this problem as an abstract differential equation of
the form
˙x(t) = Ax(t) + F(x(t)) for all tI(4.51)
with linear operator A:H2(Ω) H1
0(Ω) L2(Ω) and nonlinear operator
F:H1
0(Ω) L2(Ω) defined via
Av :=νD2v+gv and F(v):=g
κv2.
Here, D2vdenotes the second weak derivative of v, cf. section 2.3. Moreover,
since is a bounded interval, we have the embeddings H1(Ω) W1,1(Ω)
L4(Ω), cf. [2, Thm. 2.14] and [46, Thm. 8.8]. Thus, F(v)is indeed in L2(Ω)
for all vH1
0(Ω) and, hence, the considered operator equation fits into the
setting of the FOM (1.1) with X=L2(Ω) and W=H2(Ω) H1
0(Ω) X.
Furthermore, by [46, Cor. 8.10] we infer that F(v)is even in H1(Ω) for all
vH1
0(Ω), which ensures that pointwise evaluations of F(v)in are well-
defined, cf. Theorem 2.3.4.
For obtaining a ROM which can be evaluated in an efficient way, the linear
part characterized by Acan be treated as explained in section 4.3.1, whereas
the nonlinear part Fcan be treated as detailed in this subsection. Especially,
we observe that the nonlinearity is defined pointwise in the sense that we
have the equality (F(v))(ξ) = ˆ
F(v(ξ)) for all vH1
0(Ω) and ξ, where
ˆ
F:RRis defined via ˆ
F(u):=gκ1u2. Thus, the nonlinearity can be
treated as explained after (4.49). l
Example 4.3.16 (Nonlinearity of the Burgers’ equation).We consider again
the viscous Burgers’ equation (4.44) as in Example 4.3.12, but this time on the
spatial domain = (0,1) and with homogeneous Dirichlet boundary condi-
tions. Similarly as in Example 4.3.15, we may write the corresponding initial-
boundary value problem as an abstract initial value problem of the form
˙x(t) = Ax(t) + F(x(t)) for all tI, x(0) = x0
with initial value x0H2(Ω) H1
0(Ω), linear operator A:H2(Ω) H1
0(Ω)
L2(Ω) and nonlinear operator F:H1
0(Ω) L2(Ω) defined via
Av :=D2vand F(v):=vD1v.
By [46, Cor. 8.10], F(v)may be written as 1
2D1(v2)and is in L2(Ω) for
all vH1
0(Ω). Consequently, the considered operator equation fits into the
setting of the FOM (1.1) with X=L2(Ω) and W=H2(Ω) H1
0(Ω) X.
Furthermore, using again [46, Cor. 8.10] we infer that F(v)is in H1(Ω) for all
vW, which ensures that pointwise evaluations of F(v)in are well-defined.
In particular, we observe that these pointwise evaluations may be written
as (F(v))(ξ) = ˆ
F(v(ξ), D1v(ξ)) for all vWand ξ, where ˆ
F:R×
RRis defined via ˆ
F(u1, u2):=u1u2. Thus, an efficient offline/online
137
4. Projection-Based Model Order Reduction
decomposition of the nonlinearity may be achieved similarly as explained after
(4.49). However, in contrast to (4.49) and Example 4.3.15, the nonlinearity
Fof the Burgers’ equation also involves the derivative of the argument of F.
Hence, it is not sufficient to build only an approximant of the matrix function P
as in (4.50), but additionally we need to construct a matrix function involving
the derivatives of the transformed modes, i.e., b
P:RntRs,r defined via
b
P(p):=
b
P1,1(p)··· b
P1,nt(p)
.
.
.....
.
.
b
Pnt,1(p)··· b
Pnt,nt(p)
,
b
Pi,q(p):=hD1(T(pq)φq,`)(pi,k(p))ik` Rsi,rq
for i, q = 1, . . . , nt.l
Remark 4.3.17 (Case of singular AEIM).As mentioned before, the EIM coeffi-
cient vector β(t)is uniquely determined by the linear equation system (4.47c),
provided that AEIM(p(t)) is nonsingular for all tI. If there exists a tI
where AEIM(p(t)) is singular, the linear equation system (4.47c) may be not
solvable. In this case, we could for instance search for a solution in a least
squares sense, i.e., we search for β(t)Rswhich minimizes
kb(α(t), p(t)) AEIM(p(t))β(t)k2.
A solution of this minimization problem can be expressed in terms of the
Moore–Penrose pseudoinverse of AEIM(p(t)) and is given by
β(t) = AEIM(p(t))+b(α(t), p(t)).
As an additional computational cost, this procedure usually involves the com-
putation of an SVD of AEIM(p(t)) Rs,s, which however does not scale with
the dimension of the FOM. ¨
A question which has not been addressed so far is the choice of the collocation
points p1,1,...,pnt,sntfor a given path value. We address this question here
for the special case that Tdescribes pointwise a translation or shift on a one-
dimensional spatial domain, as for instance in the Definitions 1.2.2, 3.3.3, 3.3.6,
and 3.3.8. Drawing inspiration from Example 4.3.12, a natural way of choosing
the collocation points is given by the rule
pi,j(p) = pi,j(0) + pi(4.52)
for j= 1, . . . , si,i= 1, . . . , nt, i.e., the collocation points are chosen such that
they travel along with the shifted modes. The values of the collocation points
at p= 0 may be, for instance, chosen by applying one of the point selection
algorithms from the classical MOR literature to the EIM ansatz functions, see
e.g. [21, 64, 83]. Furthermore, to ensure that the collocation points are in ,
138
4.3. Hyperreduction
the update rule (4.52) may be only used as long as pi,j(0) + piis in for
j= 1, . . . , si,i= 1, . . . , nt. If this condition is satisfied for all path values
encountered in the online phase, the simple rule (4.52) allows for an efficient
implementation, especially for the special case considered in Remark 4.3.18. Its
performance is demonstrated in section 6.3 by means of a numerical example.
The update rule (4.52) comes with a couple of shortcomings, which need
to be addressed in the future. Although Example 4.3.12 suggests that (4.52)
may provide reasonable path-dependent collocation points, there is in gen-
eral no guarantee that this is the case. Furthermore, (4.52) may in general
yield collocation points which are outside of and in these cases one cannot
evaluate the transformed modes at these points. This may for instance hap-
pen when a wave profile leaves the computational domain. In this situation,
one workaround could be to remove the EIM basis functions and associated
collocation points corresponding to the exiting wave during the online phase.
However, this needs further investigation, especially how this effects the ap-
proximation error. Also the situation where two collocation points cross, for
instance caused by the crossing of two traveling waves, needs to be investigated
in more detail.
Remark 4.3.18 (Special case nt= 1).In the special case where Tis a family of
translation operators and only one path p:=p1is considered, the collocation
point update rule (4.52) leads to some simplifications for AEIM and bdefined
in (4.46). Especially, we may write AEIM(p)Rs,s as
AEIM(p) = [(T(p)ψ1,`)(p1,k(p))]k` = [(T(p)ψ1,`)(p1,k(0) + p)]k` ,
for all pRwhich satisfy p1,k(0) + pfor k= 1, . . . , s. Furthermore, we
note that the evaluations of the shifted EIM modes T(p)ψ1,` at ξare given
by ψ1,`(ξp)for `= 1, . . . , s, as long as ξpis in and Tis one of the
translation operators considered in this thesis. Thus, if p1,k(0)+pp=p1,k(0)
is in for k= 1, . . . , s, we may further simplify AEIM(p)to
AEIM(p) = [ψ1,`(p1,k(0))]k` .
Especially, we observe that AEIM is constant in this special case and may
hence be simply precomputed in the offline phase. By similar arguments and
additionally assuming that Fis such that (4.49) holds, we infer that b(α, p)
Rsdefined in (4.46) may be simplified to
b(α, p) = "F T(p)
r
X
`=1
α1,`φ1,`!(p1,k(p))#k
="ˆ
F r
X
`=1
α1,`(T(p)φ1,`)(p1,k(0) + p)!#k
="ˆ
F r
X
`=1
α1,`φ1,`(p1,k(0))!#k
139
4. Projection-Based Model Order Reduction
for all αRrand pRwhich satisfy p1,k(0) + pfor k= 1, . . . , s. Hence,
for this special case, bis constant with respect to pand may be efficiently
evaluated in the online phase once the matrix [φ1,`(p1,k(0))]k` Rs,r has been
precomputed in the offline phase. ¨
Remark 4.3.19 (Relation to [243]).In the context of scalar hyperbolic PDEs,
the authors in [243] use a hyperreduction scheme which is also based on up-
dating the collocation points according to the transport map which is used
for transforming the modes. Since their approximation ansatz is based on
one invertible coordinate transformation which is applied to all modes, their
collocation point update achieves the same accuracy as in the case where the
EIM/DEIM point selection algorithm would be applied in every time step to
the transformed modes. However, this statement may not be directly applied
to the more general setting considered in this subsection, since the ansatz
(1.8) may involve multiple transformations which are not necessarily invert-
ible. Nevertheless, also in this setting, we expect the path-dependent update
of the collocation points as in (4.52) to be advantageous in comparison to fixed
collocation points, at least for transport-dominated problems. ¨
Remark 4.3.20 (Relation to [251]).In [251] the authors use a different method
for selecting the collocation points, but they also propose to update them in
the online phase based on a suitable coordinate transformation. Similarly as
in [243] they consider the special case where only one time- and parameter-
dependent invertible transformation is involved. ¨
Remark 4.3.21 (Alternative approach for updating the collocation points).In
[39] it is proposed to determine the path-dependent collocation points by em-
ploying for instance the point selection algorithm from [83] for different values
of the path p. This can be done in the offline phase by sampling the path space
and performing the algorithm from [83] for each sample point. Afterwards, in
the spirit of section 4.3.1, one can construct approximants for all quantities
which depend on the collocation points, such as AEIM in (4.46b) or Pin (4.50).
This procedure has the advantage that a singular AEIM matrix can be avoided
in the online phase, especially when wave profiles are crossing, cf. [39, Rem. 6].
On the other hand, this method comes with a higher offline cost, since a point
selection algorithm has to be evaluated for each path sample. ¨
140
5. Structure-Preserving Model
Reduction for Port-Hamiltonian
Systems
In the last chapter, we have discussed a model reduction framework based on
a nonlinear projection of the original model. In this chapter, we are especially
interested in the reduction of port-Hamiltonian FOMs, where we restrict our-
selves to the case of finite-dimensional FOMs, i.e., the state xmaps from a
time interval I= [t0, tend]with t0R0and tend R>t0to X=W=Rn,
cf. section 1.1. Four classes of port-Hamiltonian systems are in the main fo-
cus of this chapter: linear time-invariant, linear time-varying, nonlinear time-
invariant, and nonlinear time-varying systems. These are presented one after
another in the following, see also section 2.6.
First, we consider linear time-invariant port-Hamiltonian systems of the form
E˙x(t)=(JR)Qx(t) + Bu(t),
y(t) = B>Qx(t)(5.1)
for all tI, with input port u:R0Rm, output port y:IRm, and
coefficient matrices E, J, R,QRn,n,BRn,m satisfying
J=J>, R =R>0,and E>Q=Q>E0.(5.2)
We note that this class is more general than the ones in (2.28) and (2.37), which
correspond to special cases with E=Inand Q=In, respectively. Associated
with (5.1), we consider the quadratic Hamiltonian H:RnRdefined via
H(x):=1
2x>E>Qx.
As a generalization of (5.1) and inspired by the pH formulation for descriptor
systems introduced in [25], we consider linear time-varying port-Hamiltonian
systems of the form
E(t) ˙x(t) + K(t)x(t)=(J(t)R(t))Q(t)x(t) + B(t)u(t),(5.3a)
y(t) = B(t)>Q(t)x(t)(5.3b)
for all tI, with E, Q, J, R, K C(R0,Rn,n)and BC(R0,Rn,m)satisfy-
141
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
ing E>QC1(R0,Rn,n)and pointwise
E>Q=Q>E0, Q>RQ =Q>R>Q0,
and d
dt(Q>E) = Q>(KJQ)+(KJQ)>Q. (5.4)
Note that in contrast to [25] we consider for simplicity the case where E>Qis
pointwise positive semi-definite on the whole space Rnand where there is no
feedthrough term. Furthermore, the authors of [25] use E(t)K(t)x(t)instead
of K(t)x(t)on the left-hand side of the state equation (5.3a). A reason for
us deviating from [25] in this respect is that using a structure with K(t)x(t)
instead of E(t)K(t)x(t)appears to be more convenient in terms of structure-
preserving model order reduction, cf. the upcoming Remark 5.1.6. The last
difference between the formulation considered here and the one presented in
[25] is that we only require E>Qto be continuously differentiable, whereas in
[25] it is assumed that Eand Qare continuously differentiable. As a conse-
quence, the corresponding structure is invariant with respect to a more general
class of state space transformations, as detailed in appendix B. We emphasize
that (5.3)–(5.4) still results in a dissipation inequality of the associated Hamil-
tonian H C1(R0×Rn)defined via
H(t, x):=1
2x>E(t)>Q(t)x, (5.5)
see also appendix B.
The next considered class of pH systems is a special case of the port-
Hamiltonian DAE systems introduced in [196] and consists of nonlinear time-
invariant systems of the form
E(x(t)) ˙x(t)=(J(x(t)) R(x(t)))z(x(t)) + B(x(t))u(t),(5.6a)
y(t) = B(x(t))>z(x(t)) (5.6b)
for all tI, with associated Hamiltonian H C1(Rn)and coefficient func-
tions E, J, R C(Rn,Rn,n),zC(Rn,Rn), and BC(Rn,Rn,m)satisfying
pointwise
J=J>, R =R>0,and E>z=∇H.(5.7)
Finally, we also consider nonlinear time-varying port-Hamiltonian systems
of the form
E(t, x(t)) ˙x(t) + r(t, x(t)) = (J(t, x(t)) R(t, x(t)))z(t, x(t)) + B(t, x(t))u(t),
(5.8a)
y(t) = B(t, x(t))>z(t, x(t)) (5.8b)
for all tI, with E, J, R C(R0×Rn,Rn,n),r, z C(R0×Rn,Rn), and
BC(R0×Rn,Rn,m). Here, E,J,R,r, and zare required to satisfy
142
pointwise
J=J>, R =R>0, E>z=xH,and z>r=tH,(5.9)
where H C1(R0×Rn)is the associated Hamiltonian, cf. [196]. We note
in particular that the pH structure given by (5.8)–(5.9) generalizes the non-
linear time-invariant structure (5.6)–(5.7), but not the linear time-varying pH
structure (5.3)–(5.4), since for instance Jdoes not need to be pointwise skew-
symmetric to fulfill (5.4), see also [197, sec. 4.2]. This incompatibility issue is
addressed in more detail in section 5.2, where we also propose weaker condi-
tions than (5.9), which are especially satisfied if (5.4) holds.
From the point of view of structure-preserving model order reduction, an
important special case of (5.8)–(5.9) is when z(t, x(t)) can be written as
Q(t, x(t))x(t)with QC(R0×Rn,Rn,n), i.e.,
E(·, x) ˙x+r(·, x)=(J(·, x)R(·, x))Q(·, x)x+B(·, x)u, (5.10a)
y=B(·, x)>Q(·, x)x, (5.10b)
where the coefficients satisfy
E(t, x)>Q(t, x)x=xH(t, x), x>Q(t, x)>r(t, x) = tH(t, x)
J(t, x) = J(t, x)>, R(t, x) = R(t, x)>0for all (t, x)R0×Rn.
(5.11)
Remark 5.0.1 (Preservation of algebraic constraints).In this thesis, we only
focus on preserving a port-Hamiltonian structure, whereas the preservation
of algebraic constraints is not considered here. The treatment of algebraic
constraints in the context of model reduction methods for general unstructured
DAEs is for instance discussed in [29], whereas structure-preserving model
reduction for port-Hamiltonian DAEs is addressed in [27, 43, 88, 138, 197,
210, 262], see also section 1.3.2. ¨
In the remainder of this chapter, we discuss structure-preserving model re-
duction approaches for port-Hamiltonian systems based on different linear and
nonlinear model reduction ansatzes. In particular, classical linear model re-
duction is addressed in section 5.1, whereas we consider two nonlinear ap-
proximation ansatzes in sections 5.2 and 5.3. We emphasize that even the
most general ansatz considered in section 5.3 is based on a special nonlin-
earity, whereas more general nonlinear approximation ansatzes are not within
the scope of this thesis. However, in the following remark we briefly address
one possibility for ensuring a dissipation inequality for the ROM Hamiltonian
when using a general nonlinear ansatz of the form xf(˜x).
Remark 5.0.2 (Ensuring a dissipation inequality for the ROM Hamiltonian
via constrained residual minimization).In the following we demonstrate how
constrained residual minimization may be used to obtain a ROM based on a
general nonlinear ansatz of the form xf(˜x)with fC1(Rr,Rn), while
143
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
ensuring that the associated ROM Hamiltonian satisfies a dissipation inequal-
ity. To this end, we consider a FOM of the form (5.1)–(5.2). We propose to
construct the ROM by choosing the time derivative ˙
˜x(t)of the reduced state
for all tIsuch that it solves the constrained minimization problem
min
ηRr
1
2kR(η, ˜x(t), u(t))k2s.t. f(˜x(t))>Q>R(η, ˜x(t), u(t)) = 0.(5.12)
Here, R:Rr×Rr×RmRnis defined via
R(η1, η2, η3):=Ef0(η2)η1(JR)Qf(η2)Bη3,
where f0denotes the derivative of f, cf. section 2.1. Thus, Ris defined such
that R(˙
˜x(t),˜x(t), u(t)) coincides with the residual at tI. We emphasize that
the constraint in (5.12) is chosen such that the ROM Hamiltonian ˜
H C1(Rr)
defined via
˜
H(z):=H(f(z)) = 1
2f(z)>E>Qf(z)
satisfies the dissipation inequality
d
dt(˜
H ˜x)(t) = f(˜x(t))>Q>Ef0(˜x(t)) ˙
˜x(t)u(t)>˜y(t)for all tI
with ˜y:=B>Qf(˜x). Furthermore, applying standard theory for optimiza-
tion problems with equality constraints to (5.12) leads to a DAE system for
the reduced state ˜xand a Lagrange multiplier λ. Moreover, the dissipation
inequality for ˜
His directly encoded in the algebraic constraint of this DAE
system. A detailed investigation of this approach including an explicit port-
Hamiltonian representation of the ROM and an analysis of its solvability will
be subject to future research. ¨
5.1. Linear Approximation Ansatz
In this section we consider structure-preserving model order reduction methods
based on linear approximation ansatzes. In particular, most of this section is
devoted to the special case of a linear time-invariant ansatz of the form xVr˜x
with VrRn,r, whereas the more general case with time-dependent Vris briefly
addressed at the end of this section. In the following, we discuss corresponding
structure-preserving MOR schemes for different classes of port-Hamiltonian
FOMs. Especially, all ROMs proposed in the following are based on enforcing
the residual to be orthogonal to the span of QVr, where Qmay be time- or state-
dependent according to the corresponding FOM. In the special case Q=In
this corresponds to a classical Galerkin projection, see also Remark 2.6.4. We
begin with considering a linear time-invariant full-order model of the form
(5.1), cf. Theorem 5.1.1. The first assertion of Theorem 5.1.1 is well-known,
see for instance [197, Rem. 8.3], whereas the second assertion regarding the
144
5.1. Linear Approximation Ansatz
residual minimization property is new. Especially, since Eand Qare assumed
to be invertible and E>Qto be symmetric and positive semi-definite, we have
that E−>Q>=E−>(Q>E)E1is symmetric and positive definite and, thus,
k·kE−>Q>in (5.15) is indeed a norm. We omit the proof of Theorem 5.1.1,
since it is a special case of the upcoming Theorem 5.1.5.
Theorem 5.1.1 (Structure-preserving MOR for (5.1) using a linear time-in-
variant approximation ansatz).Consider the port-Hamiltonian system (5.1)
with E, J, R, Q satisfying (5.2), i.e.,
J=J>, R =R>0, E>Q=Q>E0,
and let Eand Qbe invertible. Furthermore, let VrRn,r with rNnbe a
matrix with full column rank and let
˜
E˙
˜x(t)=(˜
J˜
R)˜
Q˜x(t) + ˜
Bu(t),(5.13a)
˜y(t) = ˜
B>˜
Q˜x(t)(5.13b)
for all tIbe a corresponding ROM with coefficient matrices
˜
E=V>
rQ>EVr,˜
J=V>
rQ>JQVr,˜
R=V>
rQ>RQVr,
˜
Q=Ir,˜
B=V>
rQ>B. (5.14)
Besides, we introduce the mapping R:Rr×Rr×RmRnvia
R(η1, η2, η3):=EVrη1(JR)QVrη2Bη3,
i.e., Ris defined such that R(˙
˜x(t),˜x(t), u(t)) coincides with the residual at
tI. Then, the following assertions hold.
(i) The ROM coefficient matrices satisfy
˜
J=˜
J>,˜
R=˜
R>0,and ˜
E>˜
Q=˜
Q>˜
E > 0,
i.e., the ROM (5.13) inherits the pH structure from the FOM.
(ii) The ROM (5.13) is optimal in the sense that any solution ˜xof (5.13a)
satisfies
˙
˜x(t)arg min
η1Rr
1
2kR(η1,˜x(t), u(t))k2
E−>Q>(5.15)
for all tIand for any input signal u:R0Rmwhich admits a
solution of the ROM state equation (5.13a).
Remark 5.1.2 (Another motivation for using the E−>Q>-norm in (5.15)).In
Theorem 5.1.1 it is stated that the ROM obtained by minimizing the residual in
the E−>Q>-norm inherits the port-Hamiltonian structure from the correspond-
ing FOM. An alternative motivation for the use of the E−>Q>-norm is given by
145
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
residual-based error bounds as addressed in Theorem 4.1.9 and Remark 4.1.10.
To this end, we first note that in the case where Eand Qare invertible and
E>Qis symmetric and positive semi-definite, the Hamiltonian coincides up
to the prefactor 1
2with the squared E>Q-norm, i.e., H(x) = 1
2kxk2
E>Qfor
all xRn. If we choose this norm for measuring the error, then by similar
arguments as in the proof of Theorem 4.1.9, see also Remark 4.1.10, we obtain
the error bound
kε(t)kE>QMeωt
kε(0)kE>Q+
t
Z0
eωs R(˙
˜x(s),˜x(s), u(s))E−> Q>ds
for all tI, where Mand ωsatisfy eE1(JR)QtE>QMeωt for all t
I. Hence, this error bound, where the error is measured in the E>Q-norm,
motivates for minimizing the residual in the E−>Q>-norm. ¨
We note that the structure of the ROM (5.13) in Theorem 5.1.1 is the
same as in (2.37) and, thus, the ROM is stable and passive. Furthermore, we
emphasize that it is essential in Theorem 5.1.1 that a weighted norm of the
residual is minimized with weighting matrix E−>Q>. The following example
illustrates that the dynamics of the resulting ROM can be completely different
when we instead use another norm for the residual minimization.
Example 5.1.3 (Standard vs. structure-preserving residual minimization).
We demonstrate the importance of the special choice of the weighted norm
in Theorem 5.1.1 by means of a simple mass-spring system presented in [278,
Ex. 2.1]. The state is given by x= [q p]>R2, where qdenotes the elongation
of the spring and pthe momentum of the mass. The dynamics is described by
˙x=JQx with J="0 1
1 0#, Q ="k0
01
2m#,(5.16)
where kR>0denotes the spring constant and mR>0the mass. Since
Jis skew-symmetric and Qsymmetric and positive definite, this system has
a Hamiltonian structure with Hamiltonian H:R2Rdefined via H(x):=
1
2x>Qx and, thus, the system is stable, cf. section 2.6.1.
With the objective of comparing different residual-minimization-based MOR
approaches, we use the special linear approximation ansatz
x1
2"1
1#˜x. (5.17)
In the following, we consider two different variants of constructing a ROM
based on this ansatz: The first one is based on a Galerkin projection, which
corresponds to minimizing the residual in the Euclidean norm, cf. section 2.5.3.
The second approach is based on a Petrov–Galerkin projection as in Theo-
rem 5.1.1, which corresponds to minimizing the residual in the norm k·kQ. In
146
5.1. Linear Approximation Ansatz
the Galerkin projection case, we obtain the ROM
˙
˜xG=1
21
2mk˜xG,(5.18)
whereas the Petrov–Galerkin projection yields the ROM
˙
˜xPG = 0.(5.19)
We observe that the Hamiltonian structure is preserved by the Petrov–Galerkin
projection, whereas this is in general not true for the Galerkin projection. In
fact, the Galerkin ROM is Hamiltonian if and only if k=1
2m, cf. (5.18). In that
case, Qis a multiple of the identity matrix, see (5.16), and, hence, in that case
the ROMs obtained by the Galerkin and by the Petrov–Galerkin projection
coincide. For all other combinations of the spring constant kand the mass
m, the ROMs (5.18) and (5.19) have in general different solutions. Moreover,
if 2km < 1holds, then the Galerkin ROM (5.18) is unstable and, thus, ˜xG
grows exponentially, provided that the initial value is non-zero. Consequently,
also the residual and the error associated with ˜xGgrow exponentially. On
the other hand, the solution ˜xPG of the Petrov–Galerkin ROM (5.19) remains
constant independently of kand mand, hence, the corresponding residual
and error remain bounded. Thus, this example illustrates that the choice of
the norm for the residual minimization may play an important role not only
for structure preservation, but also for the error. Furthermore, this example
shows that a ROM derived via minimizing the residual locally in time does
not necessarily yield a trajectory which leads to a small residual over the
complete time interval, see also the related discussion in Remark 4.1.4 about
the difference between instantaneous and cumulative residual minimization.
l
While Theorem 5.1.1 is formulated based on constant FOM coefficient ma-
trices, we emphasize that its assertions are still valid even if J,R, or Bde-
pend on time or on the state. An example for such a system is provided
in Example 5.1.4, where we consider a dissipative Hamiltonian system with
state-dependent Jand R, which arises from the modeling of a wildland fire.
However, even though the structure preservation may be achieved by a Petrov–
Galerkin projection as in Theorem 5.1.1, the state dependency of the coefficient
matrices may require to also apply a hyperreduction method to the resulting
ROM to achieve an efficient offline/online decomposition. Structure-preserving
hyperreduction methods are not within the scope of this thesis, but instead
we refer to [65] for a modification of the DEIM which allows to preserve a pH
structure, see also section 1.3.2.
Example 5.1.4 (Wildfire model).As example for a dissipative Hamiltonian
system with state-dependent Jand R, we consider a semi-discretized wild-
land fire model as used for the numerical experiments in [39]. The governing
147
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
equations on the infinite-dimensional level and on a one-dimensional spatial
domain are given by
tT=kξξTwξT+α(Sv(T, β)γT),
tS=ζSv(T, β)(5.20)
with unknowns T, S :I×R, spatial domain Ω=(a, b)with aR,
bR>a, time interval I= [0, tend]with tend R>0,v:R×RRdefined via
v(T, β):=
exp β
T,if T > 0,
0,otherwise,
and given constants k, α, β, γ, ζ R>0,wR. Furthermore, the system (5.20)
is closed via appropriate initial conditions and periodic boundary conditions,
cf. [39]. For the physical meaning of the coefficients, unknowns, and model
equations we refer to [39] as well as to [188] where the model has been originally
proposed.
After a semi-discretization of (5.20) in space by a central finite difference
scheme using an equidistant grid with grid size h=ba
N+1 with NN, we
obtain a finite-dimensional system of the form
"˙x1(t)
˙x2(t)#="kD2wD1αγIN+1 αV (x1(t), β)
0ζV (x1(t), β)#"x1(t)
x2(t)#,(5.21)
where x1, x2:IRN+1 correspond to approximations of Tand Sat the
spatial grid points ih for i= 1, . . . , N +1, while D1=D>
1and D2=D>
20
are finite difference approximations of the first and second spatial derivative,
respectively. Furthermore, the function V:RN+1 ×RRN+1,N+1 is given by
V(x1, β):= diag (v([x1]1, β), . . . , v([x1]N+1, β)) .
By introducing η:=α
4γζ , we obtain that (5.21) is equivalent to
"IN+1 0
0ηIN+1#
|{z }
=:E
"˙x1(t)
˙x2(t)#
=
"wD10
0 0#
|{z }
=:J1
+α
2"0V(x1(t), β)
V(x1(t), β) 0 #
|{z }
=:J2(x1(t))
"x1(t)
x2(t)#
k"D20
0 0#
|{z }
=:R1
+"αγIN+1 α
2V(x1(t), β)
α
2V(x1(t), β)ηζV (x1(t), β)#
|{z }
=:R2(x1(t))
"x1(t)
x2(t)#.
(5.22)
148
5.1. Linear Approximation Ansatz
The facts that Eis symmetric and positive definite, J1is skew-symmetric, J2is
pointwise skew-symmetric, and R1is symmetric and positive semi-definite fol-
low from the symmetry and definiteness properties of D1,D2, and V(x1(t), β)
as well as from the positive signs of ηand k. To investigate the definiteness
properties of R2, we consider for arbitrary vectors z= [p>q>]>R2(N+1)
with p, q RN+1 and uRN+1 the product
z>R2(u)z=αγp>p+
N+1
X
i=1 ηζv(ui, β)q2
iαv(ui, β)piqi
=α
N+1
X
i=1 γp2
iv(ui, β)piqi+1
4γv(ui, β)q2
i
|{z }
=:si
.
We continue by investigating the sign of the summands si. If uiis smaller
than or equal to 0for some i {1, . . . , N + 1}, then v(ui, β)is zero and siis
non-negative. If on the other hand uiis positive, we have
si=γp2
iexp β
ui!piqi+1
4γexp β
ui!q2
i
γp2
iexp β
ui!piqi+1
4γexp 2β
ui!q2
i
=γp2
iexp β
ui!piqi+1
4γ exp β
ui!!2
q2
i
= γpi1
2γexp β
ui!qi!2
0.
Thus, in total we infer that R2is pointwise symmetric and positive semi-
definite and, thus, (5.22) is a dissipative Hamiltonian system with J:=J1+J2
and R:=R1+R2depending on the state. l
Next, we consider the case, where the FOM has a linear time-varying port-
Hamiltonian structure as in (5.3)–(5.4). Also in this case, we may obtain a
ROM which is pH and optimal in the sense of weighted residual minimization.
Theorem 5.1.5 (Structure-preserving MOR for (5.3) using a linear time-in-
variant approximation ansatz).Consider the port-Hamiltonian system (5.3)
with E, K, J, R, Q satisfying pointwise (5.4), i.e.,
E>Q=Q>E0, Q>RQ =Q>R>Q0,
d
dt(Q>E) = Q>(KJQ)+(KJQ)>Q
and let Eand Qbe pointwise invertible. Furthermore, let VrRn,r with
149
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
rNnbe a matrix with full column rank and let
˜
E(t)˙
˜x(t) + ˜
K(t)˜x(t)=(˜
J(t)˜
R(t)) ˜
Q˜x(t) + ˜
B(t)u(t),(5.23a)
˜y(t) = ˜
B(t)>˜
Q˜x(t)(5.23b)
for all tIbe a corresponding ROM with coefficients ˜
E, ˜
K, ˜
J, ˜
R:R0Rr,r,
˜
QRr,r, and ˜
B:R0Rr,m defined as
˜
E(t):=V>
rQ(t)>E(t)Vr,˜
K(t):=V>
rQ(t)>K(t)Vr,
˜
J(t):=V>
rQ(t)>J(t)Q(t)Vr,˜
R(t):=V>
rQ(t)>R(t)Q(t)Vr,
˜
Q:=Ir,˜
B(t):=V>
rQ(t)>B(t).
(5.24)
Besides, we introduce the residual mapping R:R0×Rr×Rr×RmRnvia
R(t, η1, η2, η3):=E(t)Vrη1+ (K(t)(J(t)R(t))Q(t))Vrη2B(t)η3.
Then, the following assertions hold.
(i) The ROM matrices satisfy pointwise
˜
E>˜
Q=˜
Q>˜
E > 0,d
dt(˜
Q>˜
E) = ˜
Q>(˜
K˜
J˜
Q)+(˜
K˜
J˜
Q)>˜
Q,
and ˜
Q>˜
R˜
Q=˜
Q>˜
R>˜
Q0,
i.e., the ROM (5.23) inherits the pH structure from the FOM (5.3).
(ii) The ROM state equation (5.23a) is optimal in the sense that any solution
˜xsatisfies
˙
˜x(t)arg min
η1Rr
1
2kR(t, η1,˜x(t), u(t))k2
E(t)−>Q(t)>(5.25)
for all tIand for any input signal u:R0Rmwhich admits a
solution of the ROM state equation.
Proof. (i) The fact that ˜
Q>˜
E=˜
Edefined in (5.24) is pointwise symmetric
and positive definite follows from the assumptions that E>Qis pointwise
symmetric and positive semi-definite, that Eand Qare pointwise in-
vertible, and that Vrhas full column rank. Furthermore, using the third
equation in (5.4), we obtain
d
dt(˜
Q>˜
E) = V>
r
d
dt(Q>E)Vr=V>
rQ>(KJQ)+(KJQ)>QVr
=˜
Q>(˜
K˜
J˜
Q)+(˜
K˜
J˜
Q)>˜
Q.
Finally, the pointwise symmetry and positive semi-definiteness of ˜
Q>˜
R˜
Q
follow from the corresponding properties of Q>RQ.
150
5.1. Linear Approximation Ansatz
(ii) For fixed tI, the first-order necessary optimality condition for (5.25)
reads
V>
rQ(t)>E(t)Vrη1+V>
rQ(t)>K(t)Vr˜x(t)
=V>
rQ(t)>(J(t)R(t))Q(t)Vr˜x(t) + V>
rQ(t)>B(t)u(t)
and this condition is even sufficient since the corresponding Hessian
V>
rQ(t)>E(t)Vr=˜
E(t)does not depend on η1and is symmetric and
positive definite, cf. section 2.2. Finally, comparing the first-order opti-
mality condition with (5.23a) yields the claim.
Remark 5.1.6 (Structure-preserving MOR for the linear time-varying pH struc-
ture from [25]).The structure preservation and the residual minimization
stated in Theorem 5.1.5 may also be shown when considering a linear time-
varying port-Hamiltonian system of the form (5.3)–(5.4) where Kis replaced
by EK, as considered in [25]. In this case, one obtains ˜
Eas specified in (5.24)
and ˜
Kas ˜
E1V>
rQ>EKVr. On the other hand, an advantage of the port-
Hamiltonian formulation (5.3)–(5.4) as considered in Theorem 5.1.5 is that
the invertibility of ˜
Eis formally not required for the structure preservation,
i.e., Theorem 5.1.5(i) with ˜
Q>˜
E0instead of ˜
Q>˜
E > 0holds even if Eor Q
are not invertible or if Vrhas not full column rank. ¨
Stability of linear time-varying port-Hamiltonian systems of the form (5.3) is
discussed in appendix B, in particular in Theorem B.3. The following corollary
states that if the FOM satisfies the assumptions in Theorem B.3, then the
ROM state equation (5.23a) with u= 0 has a uniformly stable equilibrium
point in the origin. Hence, in this case the model reduction scheme considered
in Theorem 5.1.5 is not only structure-preserving, but also stability-preserving.
We omit the proof of Corollary 5.1.7, since it is a special case of the upcoming
Corollary 5.3.5 by choosing Vrto be constant.
Corollary 5.1.7 (Stability of (5.23a)).Let the assumptions of Theorem 5.1.5
be satisfied and let additionally E,K,J,R, and Qbe continuously differen-
tiable. Furthermore, let there exist constants ˜c1,˜c2R>0with
σmax(E(t)>Q(t)) ˜c1and σmin(E(t)>Q(t)) ˜c2for all tR0.(5.26)
Then, (5.23a) with u= 0 has a uniformly stable equilibrium point at 0Rr.
We proceed by considering structure-preserving MOR for nonlinear time-
varying pH systems of the form (5.10), cf. Theorem 5.1.8. The residual min-
imization properties of the structure-preserving ROMs presented in Theo-
rems 5.1.1 and 5.1.5 rely on the assumption that E>Qis (pointwise) positive
definite. Since E>Qcoincides with the Hessian of the Hamiltonian with respect
to the state, the (pointwise) positive definiteness of E>Qcorresponds to the
assumption that the Hamiltonian is equivalent to a squared norm of the state,
cf. Remark 5.1.2. Also when considering a nonlinear port-Hamiltonian FOM
151
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
of the form (5.10), one could derive a residual minimization property, provided
that E>Qis pointwise symmetric and positive definite. However, in contrast to
the linear case, the matrix function E>Qdoes in general not coincide with the
Hessian of the Hamiltonian associated with the nonlinear pH system (5.10).
Consequently, assuming E>Qto be pointwise positive definite in the context
of the nonlinear pH structure (5.10)–(5.11) appears to be less natural than in
the context of linear pH systems of the forms (5.1) or (5.3). For this reason, we
do not make this assumption in Theorem 5.1.8 and only focus on the structure
preservation without addressing the question whether the ROM is optimal in
some sense. Furthermore, we omit the proof of Theorem 5.1.8, since it is a
special case of the upcoming Theorem 5.3.6 by choosing Vrto be constant.
Accordingly, we may use Corollary 5.3.7 for obtaining sufficient conditions for
the stability of the ROM state equation (5.27a) with u= 0.
Theorem 5.1.8 (Structure-preserving MOR for (5.10) using a linear time-in-
variant approximation ansatz).Consider the port-Hamiltonian system (5.10)
with E, r, J, R, Q and the associated Hamiltonian Hsatisfying (5.11), i.e.,
E(t, x)>Q(t, x)x=xH(t, x), x>Q(t, x)>r(t, x) = tH(t, x)
J(t, x) = J(t, x)>, R(t, x) = R(t, x)>0for all (t, x)R0×Rn.
Furthermore, let VrRn,r be a given matrix with rNnand let
˜
E(t, ˜x(t)) ˙
˜x(t) + ˜
r(t, ˜x(t)) = ( ˜
J(t, ˜x(t)) ˜
R(t, ˜x(t))) ˜
Q˜x(t) + ˜
B(t, ˜x(t))u(t),
(5.27a)
˜y(t) = ˜
B(t, ˜x(t))>˜
Q˜x(t),(5.27b)
for all tIbe a corresponding ROM with coefficients ˜
E, ˜
J, ˜
R:R0×RrRr,r,
˜
r:R0×RrRr,˜
QRr,r, and ˜
B:R0×RrRr,m defined as
˜
E(t, ˜x):=V>
rQ(t, Vr˜x)>E(t, Vr˜x)Vr,
˜
J(t, ˜x):=V>
rQ(t, Vr˜x)>J(t, Vr˜x)Q(t, Vr˜x)Vr,
˜
R(t, ˜x):=V>
rQ(t, Vr˜x)>R(t, Vr˜x)Q(t, Vr˜x)Vr,˜
Q:=Ir
˜
r(t, ˜x):=V>
rQ(t, Vr˜x)>r(t, Vr˜x),˜
B(t, ˜x):=V>
rQ(t, Vr˜x)>B(t, Vr˜x).
(5.28)
Moreover, we define the associated ROM Hamiltonian ˜
H:R0×RrRvia
˜
H(t, ˜x):=H(t, Vr˜x). Then, ˜
His continuously differentiable and the ROM
coefficients satisfy
˜
J(t, ˜x) = ˜
J(t, ˜x)>,˜
R(t, ˜x) = ˜
R(t, ˜x)>0,
˜
E(t, ˜x)>˜
Q˜x=˜x˜
H(t, ˜x),˜x>˜
Q>˜
r(t, ˜x) = t˜
H(t, ˜x)
for all (t, ˜x)R0×Rr, i.e., the ROM (5.27) inherits the port-Hamiltonian
structure from the FOM (5.10).
152
5.1. Linear Approximation Ansatz
Remark 5.1.9 (Structure-preserving MOR for pH systems of the form (5.6)).
The structure preservation in Theorem 5.1.8 relies on the special structure of
the FOM (5.10)–(5.11), which is a special case of (5.8)–(5.9) where zmay be
factorized as z(t, x) = Q(t, x)x. Similarly, one may exploit such a factorization
in the time-invariant case (5.6). However, if such a factorization of zis not
available, we cannot readily apply the Petrov–Galerkin projection techniques
considered in this section. To still obtain structure-preserving schemes in this
case, one could for instance make use of the ideas of [65], see also section 2.6.2.
The model reduction method presented in [65] considers the special case E=In
and an extension to the case of a more general state-dependent Ewould be
desirable to treat general nonlinear pH systems of the form (5.6). However,
such an extension appears not to be straightforward, especially in the case
of a singular Ematrix. Moreover, since zis a linear function of the state in
all examples considered in this thesis, we refrain from discussing the general
nonlinear case in detail here. ¨
While classical model reduction schemes are based on ansatz functions which
do not depend on time, the most general linear approximation ansatz allows
also for a time dependency in the modes. Accordingly, we close this section by
considering an approximation ansatz of the form
x(t)Vr(t)˜x(t),(5.29)
where the mapping Vr:R0Rn,r is assumed to be given. This ansatz
is particularly important for transport-dominated systems as considered in
the previous chapters. Especially, in the case where the paths piare known
functions of time, the approximation ansatz (1.4) corresponds in the finite-
dimensional setting to a special case of (5.29).
In the following, we restrict ourselves to structure-preserving MOR for linear
time-varying port-Hamiltonian FOMs of the form (5.3), cf. Theorem 5.1.10.
The case of linear time-invariant FOMs is obtained as a special case of this
theorem and the case of nonlinear FOMs may be treated analogously as out-
lined in section 5.3. Moreover, we may use the upcoming Corollary 5.3.5 to
obtain sufficient conditions for the stability of the ROM state equation (5.30a).
Especially, the assumption on the invertibility of ˜
Ein Corollary 5.3.5 is auto-
matically satisfied if Vrdoes not depend on the ROM state.
Theorem 5.1.10 (Structure-preserving MOR for (5.3) using a linear time–
varying approximation ansatz).Consider the port-Hamiltonian system (5.3)
with E, K, J, R, Q satisfying pointwise (5.4), i.e.,
E>Q=Q>E0, Q>RQ =Q>R>Q0,
d
dt(Q>E) = Q>(KJQ)+(KJQ)>Q
and let Eand Qbe pointwise invertible. Furthermore, let VrC1(R0,Rn,r)
153
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
with rNnhave pointwise full column rank and let
˜
E(t)˙
˜x(t) + ˜
K(t)˜x(t)=(˜
J(t)˜
R(t)) ˜
Q˜x(t) + ˜
B(t)u(t),(5.30a)
˜y(t) = ˜
B(t)>˜
Q˜x(t),(5.30b)
for all tIbe a corresponding ROM with coefficients ˜
E, ˜
K, ˜
J, ˜
R:R0Rr,r,
˜
QRr,r, and ˜
B:R0Rr,m defined as
˜
E:=V>
rQ>EVr,˜
K:=V>
rQ>KVr+E˙
Vr,˜
J:=V>
rQ>JQVr,
˜
R:=V>
rQ>RQVr,˜
Q:=Ir,˜
B:=V>
rQ>B.
(5.31)
Besides, we introduce the residual mapping R:R0×Rr×Rr×RmRnvia
R(t, η1, η2, η3)
:=E(t)Vr(t)η1+E(t)˙
Vr(t)+(K(t)(J(t)R(t))Q(t))Vr(t)η2B(t)η3.
Then, the following assertions hold.
(i) The ROM matrices satisfy pointwise
˜
E>˜
Q=˜
Q>˜
E > 0,d
dt(˜
Q>˜
E) = ˜
Q>(˜
K˜
J˜
Q)+(˜
K˜
J˜
Q)>˜
Q,
and ˜
Q>˜
R˜
Q=˜
Q>˜
R>˜
Q0,
i.e., the ROM (5.30) inherits the port-Hamiltonian structure from the
FOM (5.3).
(ii) The ROM state equation (5.30a) is optimal in the sense that any solution
˜xsatisfies
˙
˜x(t)arg min
η1Rr
1
2kR(t, η1,˜x(t), u(t))k2
E(t)−>Q(t)>(5.32)
for all tIand for any input signal u:R0Rmwhich admits a
solution of the ROM state equation.
Proof. (i) The pointwise symmetry and definiteness properties of ˜
Q>˜
Eand
˜
Q>˜
R˜
Qfollow by similar arguments as in the proof of Theorem 5.1.5.
Furthermore, exploiting (5.4) and the definitions of the ROM coefficient
matrices provided in (5.31), we obtain
d
dt(˜
Q>˜
E) = ˙
Vr>Q>EVr+V>
r
d
dt(Q>E)Vr+V>
rQ>E˙
Vr
=˙
Vr>Q>EVr+V>
rQ>(KJQ)+(KJQ)>QVr+V>
rQ>E˙
Vr
=˜
Q>(˜
K˜
J˜
Q)+(˜
K˜
J˜
Q)>˜
Q.
154
5.2. Nonlinear Separable Approximation Ansatz
(ii) The proof follows along the lines of the proof of Theorem 5.1.5, where
the major difference is that the first-order necessary optimality condition
of the minimization problem (5.32) for fixed tIis given by
Vr(t)>Q(t)>E(t)Vr(t)η1+Vr(t)>Q(t)>E(t)˙
Vr(t) + K(t)Vr(t)˜x(t)
=Vr(t)>Q(t)>(J(t)R(t))Q(t)Vr(t)˜x(t) + Vr(t)>Q(t)>B(t)u(t).
5.2. Nonlinear Separable Approximation Ansatz
In this section we consider a nonlinear approximation ansatz of the form
x(t)Vr(p(t))α(t),(5.33)
where the mapping Vr:RrpRn,rαis assumed to be given and where the
reduced state
˜x="α
p#(5.34)
consists of the variables p:IRrpand α:IRrαwith r:=rα+rp. Since
the ansatz (5.33) is linear in αand possibly nonlinear in p, we call this a sepa-
rable approximation ansatz, since it is the same kind of nonlinearity as in the
separable nonlinear least-squares problem (3.11). In fact, this class of nonlin-
ear ansatzes is of particular importance for this thesis since the ansatz (1.4) is
a special case of (5.33) in the finite-dimensional setting, cf. Remark 4.1.3.
We start by considering structure-preserving MOR for the case of a linear
time-invariant FOM of the form (5.1)–(5.2) with invertible Eand Q. By using
the approximation ansatz (5.33) and minimizing the residual in the weighted
E−>Q>-norm as in the previous section, we obtain the ROM
˜
E(˜x(t)) ˙
˜x(t) = ( ˜
J(˜x(t)) ˜
R(˜x(t))) ˜
Q˜x(t) + ˜
B(˜x(t))u(t),(5.35a)
˜y(t) = ˜
B(˜x(t))>˜
Q˜x(t),(5.35b)
for all tI, where ˜
J, ˜
R, ˜
E:RrRr,r,˜
QRr,r, and ˜
B:RrRr,m are
defined as1
˜
J(˜x):="˜
J11(p)˜
J21(α, p)>
˜
J21(α, p) 0 #,
˜
J11(p):=Vr(p)>Q>JQVr(p)Rrα,rα,
˜
J21(α, p):=c
Vr(p)α>Q>JQVr(p)Rrp,rα,
(5.36a)
1I owe special thanks to Riccardo Morandin who made me aware that the ROM may be
written in the form (5.35) by using a singular ˜
Qmatrix and by introducing a suitable
second block column in the definitions of ˜
Jand ˜
R. In an earlier version, I used a more
general definition of pH systems to show that the ROM is pH, but this is not necessary.
155
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
˜
R(˜x):="˜
R11(p)˜
R21(α, p)>
˜
R21(α, p)˜
R22(α, p)#,
˜
R11(p):=Vr(p)>Q>RQVr(p)Rrα,rα,
˜
R21(α, p):=c
Vr(p)α>Q>RQVr(p)Rrp,rα,
˜
R22(α, p):=c
Vr(p)α>Q>RQc
Vr(p)αRrp,rp
(5.36b)
˜
E(˜x):="˜
E11(p)˜
E12(α, p)
˜
E12(α, p)>˜
E22(α, p)#,
˜
E11(p):=Vr(p)>Q>EVr(p)Rrα,rα,
˜
E12(α, p):=Vr(p)>Q>Ec
Vr(p)αRrα,rp,
˜
E22(α, p):=c
Vr(p)α>Q>Ec
Vr(p)αRrp,rp,
(5.36c)
˜
Q:="Irα0
0 0#,(5.36d)
˜
B(˜x):="˜
B1(p)
˜
B2(α, p)#,
˜
B1(p):=Vr(p)>Q>BRrα,m,
˜
B2(α, p):=c
Vr(p)α>Q>BRrp,m.
(5.36e)
Here, we use the notation for the block components of ˜xas in (5.34) and,
besides, c
Vr:Rrp L(Rrα,Rn,rp)is defined via
c
Vr(η1)(η2)η3:=V0
r(η1)(η3)η2for all (η1, η2, η3)Rrp×Rrα×Rrp,(5.37)
where V0
rdenotes the derivative of Vr, cf. section 2.1. Moreover, we note that
(5.35)–(5.36) is obtained by enforcing the residual to be orthogonal to the
column span of Q[Vr(p)c
Vr(p)α]. As a consequence, the ROM (5.35) is port-
Hamiltonian and optimal in the sense of residual minimization as stated in the
following.
Theorem 5.2.1 (Structure-preserving MOR for (5.1) using a separable ap-
proximation ansatz).Consider the pH system (5.1) with E, J, R, Q satisfying
(5.2), i.e.,
J=J>, R =R>0, E>Q=Q>E0,
and let Eand Qbe invertible. Furthermore, let VrC1(Rrp,Rn,rα)with
rα, rpNand r:=rα+rpnbe given. We consider the corresponding
reduced-order model (5.35) with coefficients ˜
QRr,r,˜
E, ˜
J, ˜
R:RrRr,r,
and ˜
B:RrRr,m as defined in (5.36). Besides, we define the Hamiltonian
156
5.2. Nonlinear Separable Approximation Ansatz
˜
H:RrRassociated with (5.35) via
˜
H(˜x):=1
2α>Vr(p)>E>QVr(p)α, (5.38)
where we use the notation from (5.34) for the block components of ˜x. Then,
the following assertions hold.
(i) The ROM Hamiltonian ˜
His continuously differentiable and the ROM
coefficients satisfy
˜
J(˜x) = ˜
J(˜x)>,˜
R(˜x) = ˜
R(˜x)>0,˜
E(˜x)>˜
Q˜x=˜
H(˜x)
for all ˜xRr, i.e., the ROM (5.35) has a nonlinear time-invariant port-
Hamiltonian structure as in (5.6)(5.7).
(ii) The ROM state equation (5.35a) is optimal in the sense that any solution
˜x= [α>p>]>satisfies
˙
˜x(t) = "˙α(t)
˙p(t)#arg min
(η12)Rrα×Rrp
1
2kR(η1, η2, α(t), p(t), u(t))k2
E−>Q>
(5.39)
for all tIand for any input signal u:R0Rmwhich admits a
solution of the ROM state equation (5.35a). Here, the residual mapping
R:Rrα×Rrp×Rrα×Rrp×RmRnis defined via
R(η1, η2, η3, η4, η5)
:=EVr(η4)η1+EV 0
r(η4)(η2)η3(JR)QVr(η4)η3Bη5.
Proof. (i) The pointwise skew-symmetry of ˜
Jfollows from J=J>and
(5.36a). Regarding ˜
R, we observe that it may be factorized as
˜
R(˜x) = hVr(p)c
Vr(p)αi>Q>RQ hVr(p)c
Vr(p)αi
for all ˜xRrand, thus, its pointwise symmetry and positive semi-
definiteness follow from the corresponding properties of R. For the Hamil-
tonian ˜
H, we note that it is continuously differentiable due to the con-
tinuous differentiability of Vrand we compute its partial derivatives as
α˜
H(˜x) = α>Vr(p)>E>QVr(p),
p˜
H(˜x)ζ=α>Vr(p)>E>QV 0
r(p)(ζ)α=α>Vr(p)>E>Qc
Vr(p)(α)ζ
for all (˜x, ζ)Rr×Rrp. Consequently, we obtain the gradient of ˜
Has
˜
H(˜x) =
Vr(p)>Q>EVr(p)α
c
Vr(p)α>Q>EVr(p)α
=˜
E(˜x)>˜
Q˜x.
157
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
(ii) For fixed tIand by computing the partial derivatives of the cost func-
tion in (5.39), we obtain the first-order necessary optimality conditions
Vr(p(t))>Q>EVr(p(t))η1+Ec
Vr(p(t))(α(t))η2
=Vr(p(t))>Q>((JR)QVr(p(t))α(t) + Bu(t)) ,
c
Vr(p(t))α(t)>Q>EVr(p(t))η1+Ec
Vr(p(t))(α(t))η2
=c
Vr(p(t))α(t)>Q>((JR)QVr(p(t))α(t) + Bu(t)) ,
(5.40)
where we used the definition of c
Vrin (5.37). Moreover, the corresponding
Hessian is given by
Vr(p(t))>Q>EVr(p(t)) Vr(p(t))>Q>Ec
Vr(p(t))α(t)
c
Vr(p(t))α(t)>Q>EVr(p(t)) c
Vr(p(t))α(t)>Q>Ec
Vr(p(t))α(t)
|{z }
=˜
E(˜x(t))
,
which is independent of (η1, η2)and symmetric positive semi-definite due
to the assumptions on E>Q. Thus, the necessary conditions (5.40) are
also sufficient, cf. section 2.2. The claim then follows from observing that
(5.40) coincides with the ROM state equation (5.35a) with η1= ˙α(t)and
η2= ˙p(t).
In contrast to the setting considered in Theorem 5.1.1 from section 5.1, the
ROM Hamiltonian in Theorem 5.2.1 in general is not a quadratic function of
the ROM state, cf. (5.38). This prevents us from using similar arguments as in
section 5.1 to show that the ROM state equation (5.35a) is stable. Moreover,
whenever the first block component αof the ROM state ˜xattains the value 0,
the second block row of (5.35a) reduces to the trivial equation 0 = 0. Conse-
quently, an initial value problem associated with (5.35a), u= 0, and an initial
value (α(t0), p(t0)) with α(t0)=0is not uniquely solvable. Thus, since the
stability notions introduced in Definition 2.4.8 require unique solvability of the
initial value problem for any initial condition, the ROM state equation (5.35a)
cannot possess any stable equilibrium point in the sense of Definition 2.4.8. In
order to still obtain at least a statement about the boundedness of solutions, we
observe that the ROM Hamiltonian ˜
Hdefined in (5.38) is a quadratic function
with respect to α. Hence, under additional assumptions on Vr, we may at least
show that αremains bounded by exploiting the dissipation inequality for ˜
H.
This is detailed in Corollary 5.2.2. Similarly as in the proof of Corollary 5.2.2,
one can show that even if the condition (5.42) on the singular values of Vris
not satisfied, we have at least the bound
kVr(p(t))α(t)k qκ(E>Q)kVr(p(t0))α(t0)kfor all tI(5.41)
for the approximation Vr(p)αof the FOM state, where κ(E>Q)denotes the
158
5.2. Nonlinear Separable Approximation Ansatz
condition number of E>Q, cf. section 2.1.
Corollary 5.2.2 (Boundedness of part of the state vector in (5.35a)).Let
the assumptions of Theorem 5.2.1 be satisfied and let there additionally exist
constants ˆc1,ˆc2R>0with
σmax(Vr(η)) ˆc1and σmin(Vr(η)) ˆc2for all ηRrp.(5.42)
Furthermore, let ˜x= [α>p>]>C1(I,Rrα+rp)satisfy pointwise the ROM state
equation (5.35a) with u= 0 on the time interval I= [t0, tend]with t0R0
and tend R>t0. Then, there exists a constant cR>0which is independent
of t0and tend and satisfies
kα(t)k ckα(t0)kfor all tI.
Proof. By Theorem 5.2.1(i), the ROM Hamiltonian ˜
Hdefined in (5.38) is
continuously differentiable and, since also ˜xis continuously differentiable, we
infer that ˜
H ˜xC1(I,R)holds. Furthermore, the fact that the ROM (5.35)
has a port-Hamiltonian structure as stated in Theorem 5.2.1(i), yields the
dissipation inequality
d
dt(˜
H ˜x)(t)0for all tI,
where we have used u= 0. Consequently, we obtain
(˜
H ˜x)(t)(˜
H ˜x)(t0)for all tI.
Using this inequality and the bounds (5.42) as well as [34, Lem. 8.4.3] and [34,
Fact 9.13.1], we arrive at
kα(t)k21
σmin(Vr(p(t)))2kVr(p(t))α(t)k2
1
σmin(E>Q)ˆc2
2
α(t)>Vr(p(t))>E>QVr(p(t))α(t)
=2
σmin(E>Q)ˆc2
2
˜
H(˜x(t)) 2
σmin(E>Q)ˆc2
2
˜
H(˜x(t0))
κ(E>Q)
ˆc2
2kVr(p(t0))α(t0)k2κ(E>Q)σmax(Vr(p(t0)))2
ˆc2
2kα(t0)k2
κ(E>Q)ˆc2
1
ˆc2
2kα(t0)k2
for all tI, which yields the assertion.
Next, we focus on deriving a structure-preserving ROM using a separable
nonlinear approximation ansatz of the form (5.33) for the case where the FOM
has a linear time-varying port-Hamiltonian structure as in (5.3). Again, based
159
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
on a suitably weighted residual minimization approach, we propose the ROM
˜
E(t, ˜x(t)) ˙
˜x(t) + ˜
r(t, ˜x(t)) = ( ˜
J(t, ˜x(t)) ˜
R(t, ˜x(t))) ˜
Q˜x(t) + ˜
B(t, ˜x(t))u(t),
(5.43a)
˜y(t) = ˜
B(t, ˜x(t))>˜
Q˜x(t),(5.43b)
for all tI, where ˜
E, ˜
J, ˜
R:R0×RrRr,r,˜
r:R0×RrRr,˜
QRr,r,
and ˜
B:R0×RrRr,m are defined as
˜
E(t, ˜x):="˜
E11(t, p)˜
E12(t, α, p)
˜
E12(t, α, p)>˜
E22(t, α, p)#,
˜
E11(t, p):=Vr(p)>Q(t)>E(t)Vr(p)Rrα,rα,
˜
E12(t, α, p):=Vr(p)>Q(t)>E(t)c
Vr(p)αRrα,rp,
˜
E22(t, α, p):=c
Vr(p)α>Q(t)>E(t)c
Vr(p)αRrp,rp,
(5.44a)
˜
J(t, ˜x):="˜
J11(t, p) 0
˜
J21(t, α, p) 0#,
˜
J11(t, p):=Vr(p)>Q(t)>J(t)Q(t)Vr(p)Rrα,rα,
˜
J21(t, α, p):=c
Vr(p)α>Q(t)>J(t)Q(t)Vr(p)Rrp,rα,
(5.44b)
˜
R(t, ˜x):="˜
R11(t, p) 0
˜
R21(t, α, p) 0#,
˜
R11(t, p):=Vr(p)>Q(t)>R(t)Q(t)Vr(p)Rrα,rα,
˜
R21(t, α, p):=c
Vr(p)α>Q(t)>R(t)Q(t)Vr(p)Rrp,rα,
(5.44c)
˜
r(t, ˜x):="˜
r1(t, ˜x)
˜
r2(t, ˜x)#,
˜
r1(t, ˜x):=Vr(p)>Q(t)>K(t)Vr(p)αRrα,
˜
r2(t, ˜x):=c
Vr(p)α>Q(t)>K(t)Vr(p)αRrp,
(5.44d)
˜
Q:="Irα0
0 0#,(5.44e)
˜
B(t, ˜x):="˜
B1(t, p)
˜
B2(t, α, p)#,
˜
B1(t, p):=Vr(p)>Q(t)>B(t)Rrα,m,
˜
B2(t, α, p):=c
Vr(p)α>Q(t)>B(t)Rrp,m.
(5.44f)
Here, we use the notation from (5.34) for the block components of ˜x, whereas
160
5.2. Nonlinear Separable Approximation Ansatz
c
Vris as defined in (5.37).
The residual minimization property which leads to the ROM state equation
(5.43a) is stated in Theorem 5.2.3(ii), whereas some structural properties of
the ROM coefficients are addressed in (i). Especially, we emphasize that the
nonlinear time-varying ROM (5.43) does in general not fit into the class of
port-Hamiltonian systems of the form (5.8)–(5.9), for instance, since ˜
Jis in
general not pointwise skew-symmetric. It appears that the reason for this
incompatibility of the pH structure (5.8)–(5.9) and the ROM (5.43) originates
from the fact that the nonlinear time-varying pH structure (5.8)–(5.9) is not a
consistent generalization of the linear time-varying pH structure (5.3)–(5.4) of
the corresponding FOM. The resulting structure of the ROM (5.43) motivates
to generalize the port-Hamiltonian structure (5.8)–(5.9) via replacing (5.9) by
the less restrictive conditions
z(t, x)>R(t, x)z(t, x)0,xH(t, x) = E(t, x)>z(t, x),
tH(t, x) = z(t, x)>(r(t, x)J(t, x)z(t, x)) (5.45)
for all (t, x)R0×Rn. First, we note that these conditions are indeed less
restrictive than (5.9), since the first property in (5.45) holds especially if Ris
pointwise positive semi-definite and since the last equation in (5.45) is satisfied
if Jis pointwise skew-symmetric and tH=z>rholds. However, we emphasize
that (5.8) with (5.45) is not a generalization of the port-Hamiltonian structure
introduced in [196]. Instead, one would also have to include a feedthrough
term and allow for over- and underdetermined systems to obtain a proper
generalization of the structure in [196], but this is not within the scope of this
thesis.
Nevertheless, the nonlinear time-varying pH structure (5.8) with (5.45) is a
consistent generalization of the linear time-varying pH structure (5.3)–(5.4).
In fact, the latter one is a special case of (5.8), (5.45) with z(t, x) = Q(t)x,
r(t, x) = K(t)x, and the Hamiltonian as in (5.5), which follows from the cal-
culation
z(t, x)>R(t)z(t, x) = x>Q(t)>R(t)Q(t)x0,
xH(t, x) = E(t)>Q(t)x=E(t)>z(t, x)
as well as
tH(t, x) = 1
2x>d
dt(E>Q)(t)x=1
2x>d
dt(Q>E)(t)x
=1
2x>Q(t)>(K(t)J(t)Q(t)) + (K(t)J(t)Q(t))>Q(t)x
=x>Q(t)>(K(t)J(t)Q(t))x=z(t, x)>(r(t, x)J(t)z(t, x))
for all (t, x)R0×Rn. Moreover, we note that the nonlinear time-varying
pH structure (5.8) with (5.45) leads to a dissipation inequality for the corre-
161
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
sponding Hamiltonian. More precisely, for a given solution xC1(I,Rn)of
(5.8a), we obtain that the function Hs:IRdefined via Hs(t):=H(t, x(t))
satisfies
dHs
dt(t) = tH(t, x(t)) + xH(t, x(t))>˙x(t)
=z(t, x(t))>(r(t, x(t)) J(t, x(t))z(t, x(t))) + z(t, x(t))>E(t, x(t)) ˙x(t)
=z(t, x(t))>R(t, x(t))z(t, x(t)) + z(t, x(t))>B(t, x(t))u(t)
=z(t, x(t))>R(t, x(t))z(t, x(t)) + y(t)>u(t)y(t)>u(t)
for all tI. Finally, we note that Theorem 5.2.3(i) implies that the ROM
(5.43) has a port-Hamiltonian structure of the form (5.8) with (5.45) and,
hence, its structure implies a dissipation inequality for the associated Hamil-
tonian ˜
Hdefined in (5.46). Similarly as in Corollary 5.2.2, this allows to derive
a bound for the αcomponent of the ROM state, cf. Corollary 5.2.4. Further-
more, similarly as in (5.41) we also obtain a bound for the approximation of
the FOM state, even if the condition (5.42) on the singular values of Vris not
satisfied. Especially, we obtain the bound
kVr(p(t))α(t)k s˜c1
˜c2kVr(p(t0))α(t0)kfor all tI
with ˜c1and ˜c2as in Corollary 5.2.4.
Theorem 5.2.3 (Structure-preserving MOR for (5.3) using a separable ap-
proximation ansatz).Consider the pH system (5.3) with E, K, J, R, Q satisfy-
ing pointwise (5.4), i.e.,
E>Q=Q>E0, Q>RQ =Q>R>Q0,
d
dt(Q>E) = Q>(KJQ)+(KJQ)>Q
and let Eand Qbe pointwise invertible. Furthermore, let VrC1(Rrp,Rn,rα)
with rα, rpNand r:=rα+rpnbe given. Moreover, we consider
the reduced-order model (5.43) with coefficients ˜
E,˜
r,˜
J, ˜
R, ˜
Q, ˜
Bas defined in
(5.44). Besides, we define the Hamiltonian ˜
H:R0×RrRassociated with
(5.43) via
˜
H(t, ˜x):=1
2α>Vr(p)>E(t)>Q(t)Vr(p)α, (5.46)
where we use the notation from (5.34) for the block components of ˜x. Then,
the following assertions hold.
(i) The ROM Hamiltonian ˜
His continuously differentiable and the ROM
coefficients satisfy
˜
Q>˜
R(t, ˜x)˜
Q=˜
Q>˜
R(t, ˜x)>˜
Q0,˜
E(t, ˜x)>˜
Q˜x=˜x˜
H(t, ˜x),
162
5.2. Nonlinear Separable Approximation Ansatz
t˜
H(t, ˜x) = ˜x>˜
Q>˜
r(t, ˜x)˜
J(t, ˜x)˜
Q˜x
for all (t, ˜x)R0×Rr.
(ii) The ROM state equation (5.43a) is optimal in the sense that any solution
˜x= [α>p>]>satisfies
˙
˜x(t) = "˙α(t)
˙p(t)#arg min
(η12)Rrα×Rrp
1
2kR(t, η1, η2, α(t), p(t), u(t))k2
E(t)−>Q(t)>
(5.47)
for all tIand for any input signal u:R0Rmwhich admits a
solution of the ROM state equation (5.43a). Here, the residual mapping
R:R0×Rrα×Rrp×Rrα×Rrp×RmRnis defined via
R(t, η1, η2, η3, η4, η5):=E(t) (Vr(η4)η1+V0
r(η4)(η2)η3)
+ (K(t)(J(t)R(t))Q(t))Vr(η4)η3B(t)η5.
Proof. (i) The pointwise symmetry and positive semi-definiteness of ˜
Q>˜
R˜
Q
with ˜
Rand ˜
Qas defined in (5.44) follow from the assumption that Q>RQ
is pointwise symmetric and positive semi-definite. For the ROM Hamil-
tonian ˜
H, we first observe that it is continuously differentiable due to the
continuous differentiability of Vrand E>Qand by following the lines of
the proof of Theorem 5.2.1 we obtain the equation
˜x˜
H(t, ˜x) =
Vr(p)>
c
Vr(p)α>
Q(t)>E(t)Vr(p)α=˜
E(t, ˜x)>˜
Q˜x
for all (t, ˜x)R0×Rr. For the partial derivative of the ROM Hamilto-
nian with respect to t, we calculate
t˜
H(t, ˜x) = 1
2α>Vr(p)>d
dt(E>Q)(t)Vr(p)α
=α>Vr(p)>Q(t)>(K(t)J(t)Q(t)) Vr(p)α
= ˜x>˜
Q>˜
r(t, ˜x)˜
J(t, ˜x)˜
Q˜x
for all (t, ˜x)R0×Rr.
(ii) The proof follows along the lines of the proof of Theorem 5.2.1, where
the major difference is that the first-order necessary optimality condition
of the minimization problem (5.47) for fixed tIis given by
Vr(p(t))>Q(t)>E(t)Vr(p(t))η1+c
Vr(p(t))(α(t))η2
=Vr(p(t))>Q(t)>(((J(t)R(t))Q(t)K(t)) Vr(p(t))α(t)B(t)u(t)) ,
c
Vr(p(t))α(t)>Q(t)>E(t)Vr(p(t))η1+c
Vr(p(t))(α(t))η2
163
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
=c
Vr(p(t))α(t)>Q(t)>((J(t)R(t))Q(t)K(t)) Vr(p(t))α(t)
c
Vr(p(t))α(t)>Q(t)>B(t)u(t).
Corollary 5.2.4 (Boundedness of part of the state in (5.43a) with (5.44)).
Let the assumptions of Theorem 5.2.3 be satisfied and let there additionally
exist constants ˜c1,˜c2,ˆc1,ˆc2R>0such that the singular value bounds (5.26)
and (5.42) are satisfied. Furthermore, let ˜x= [α>p>]>C1(I,Rrα+rp)be a
solution of the ROM state equation (5.43a) with u= 0 and coefficients as in
(5.44) on the time interval I= [t0, tend]with t0R0and tend R>t0. Then,
there exists a constant cR>0which is independent of t0and tend and satisfies
kα(t)k ckα(t0)kfor all tI.
Proof. From the continuous differentiability of ˜xand ˜
H, cf. Theorem 5.2.3(i),
we infer that the function ˜
Hs:IRdefined via ˜
Hs(t):=˜
H(t, ˜x(t)) is contin-
uously differentiable as well. Furthermore, as mentioned before Theorem 5.2.3,
˜
Hssatisfies the dissipation inequality
˙
˜
Hs(t)0for all tI,
where we have used u= 0. Consequently, we obtain
˜
H(t, ˜x(t)) = ˜
Hs(t)˜
Hs(t0) = ˜
H(t0,˜x(t0)) for all tI.
Using this inequality, the bounds (5.26) and (5.42), and similar arguments as
in the proof of Corollary 5.2.2, we arrive at
kα(t)k ˆc1
ˆc2v
u
u
tσmax(E(t0)>Q(t0))
σmin(E(t)>Q(t)) kα(t0)k ˆc1
ˆc2s˜c1
˜c2kα(t0)k
for all tI.
At the end of this section, we discuss the case of a nonlinear time-varying
FOM of the form (5.10) with coefficients satisfying (5.11). Similarly as in the
previous section, we only consider the task of structure preservation, whereas
it is in general not clear how to achieve residual minimization at the same time,
unless E>Qhappens to be pointwise symmetric and positive definite, cf. the
discussion before Theorem 5.1.8. Especially, based on enforcing the residual
to be orthogonal to the column span of
Q(t, Vr(p(t))α(t))[Vr(p(t)) c
Vr(p(t))α(t)]
for all tI, we propose a ROM of the form (5.43) where the coefficients are
164
5.2. Nonlinear Separable Approximation Ansatz
not specified as in (5.44), but instead given by
˜
E(t, ˜x):="˜
E11(t, ˜x)˜
E12(t, ˜x)
˜
E21(t, ˜x)˜
E22(t, ˜x)#,
˜
E11(t, ˜x):=Vr(p)>Q(t, Vr(p)α)>E(t, Vr(p)α)Vr(p)Rrα,rα,
˜
E12(t, ˜x):=Vr(p)>Q(t, Vr(p)α)>E(t, Vr(p)α)c
Vr(p)αRrα,rp,
˜
E21(t, ˜x):=c
Vr(p)α>Q(t, Vr(p)α)>E(t, Vr(p)α)c
Vr(p)αRrα,rp,
˜
E22(t, ˜x):=c
Vr(p)α>Q(t, Vr(p)α)>E(t, Vr(p)α)c
Vr(p)αRrp,rp,
(5.48a)
˜
J(t, ˜x):="˜
J11(t, ˜x)˜
J21(t, ˜x)>
˜
J21(t, ˜x) 0 #,
˜
J11(t, ˜x):=Vr(p)>Q(t, Vr(p)α)>J(t, Vr(p)α)Q(t, Vr(p)α)Vr(p)Rrα,rα,
˜
J21(t, ˜x):=c
Vr(p)α>Q(t, Vr(p)α)>J(t, Vr(p)α)Q(t, Vr(p)α)Vr(p)Rrp,rα,
(5.48b)
˜
R(t, ˜x):="˜
R11(t, ˜x)˜
R21(t, ˜x)>
˜
R21(t, ˜x)˜
R22(t, ˜x)#,
˜
R11(t, ˜x):=Vr(p)>Q(t, Vr(p)α)>R(t, Vr(p)α)Q(t, Vr(p)α)Vr(p)Rrα,rα,
˜
R21(t, ˜x):=c
Vr(p)α>Q(t, Vr(p)α)>R(t, Vr(p)α)Q(t, Vr(p)α)Vr(p)Rrp,rα,
˜
R22(t, ˜x):=c
Vr(p)α>Q(t, Vr(p)α)>R(t, Vr(p)α)Q(t, Vr(p)α)c
Vr(p)αRrp,rp,
(5.48c)
˜
r(t, ˜x):="˜
r1(t, ˜x)
˜
r2(t, ˜x)#,
˜
r1(t, ˜x):=Vr(p)>Q(t, Vr(p)α)>r(t, Vr(p)α)Rrα,
˜
r2(t, ˜x):=c
Vr(p)α>Q(t, Vr(p)α)>r(t, Vr(p)α)Rrp,
(5.48d)
˜
Q:="Irα0
0 0#,(5.48e)
˜
B(t, ˜x):="˜
B1(t, ˜x)
˜
B2(t, ˜x)#,
˜
B1(t, ˜x):=Vr(p)>Q(t, Vr(p)α)>B(t, Vr(p)α)Rrα,m,
˜
B2(t, ˜x):=c
Vr(p)α>Q(t, Vr(p)α)>B(t, Vr(p)α)Rrp,m.
(5.48f)
Here, we use the notation from (5.34) for the block components of ˜x, whereas c
Vr
is as defined in (5.37). We note in particular that in contrast to the definitions
165
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
in (5.36) and (5.44), the matrix function ˜
Edefined in (5.48a) is not necessarily
pointwise symmetric.
In Theorem 5.2.5 it is stated that the ROM (5.43) with coefficients as de-
fined in (5.48) has a nonlinear time-varying pH structure as the corresponding
FOM. Under additional assumptions on the FOM Hamiltonian and Vr, one can
show that the αcomponent of the ROM state is bounded, cf. Corollary 5.2.6.
Furthermore, following the same arguments as in the proof of Corollary 5.2.6,
we may infer the bound
kVr(p(t))α(t)k sc3
c2kVr(p(t0))α(t0)kfor all tI
for the approximation Vr(p)αof the FOM state, even if the singular values of
Vrare not uniformly bounded as in (5.42). Here, the constants c2and c3are
as specified in Corollary 5.2.6 and Definition 2.4.9.
Theorem 5.2.5 (Structure-preserving MOR for (5.10) using a separable ap-
proximation ansatz).Consider the pH system (5.10) with E, r, J, R, Q and the
associated Hamiltonian Hsatisfying (5.11), i.e.,
E(t, x)>Q(t, x)x=xH(t, x), x>Q(t, x)>r(t, x) = tH(t, x)
J(t, x) = J(t, x)>, R(t, x) = R(t, x)>0for all (t, x)R0×Rn.
Furthermore, let VrC1(Rrp,Rn,rα)with rα, rpNand r:=rα+rpnbe
given. Moreover, we consider the reduced-order model (5.43) with coefficients
˜
E,˜
r,˜
J, ˜
R, ˜
Q, ˜
Bas defined in (5.48) and associated Hamiltonian ˜
H:R0×Rr
Rdefined via ˜
H(t, ˜x):=H(t, Vr(p)α). Then, ˜
His continuously differentiable
and the ROM coefficients satisfy
˜
J(t, ˜x) = ˜
J(t, ˜x)>,˜
R(t, ˜x) = ˜
R(t, ˜x)>0,˜
E(t, ˜x)>˜
Q˜x=˜x˜
H(t, ˜x),
t˜
H(t, ˜x) = ˜x>˜
Q>˜
r(t, ˜x)for all (t, ˜x)R0×Rr,
i.e., the ROM inherits the port-Hamiltonian structure from the FOM.
Proof. The properties of ˜
Jand ˜
Rfollow by similar arguments as in the proof
of Theorem 5.2.1. To check the properties associated with ˜
H, we first note
that ˜
His continuously differentiable due to the continuous differentiability of
Hand Vr. Furthermore, its partial derivatives with respect to the ROM state
variables are given by
α˜
H(t, ˜x) = xH(t, Vr(p)α)Vr(p),
p˜
H(t, ˜x)ζ=xH(t, Vr(p)α)V0
r(p)(ζ)α=xH(t, Vr(p)α)c
Vr(p)(α)ζ
for all (t, ˜x, ζ)R0×Rr×Rrpand, hence, we obtain
˜x˜
H(t, ˜x) = hα˜
H(t, ˜x)p˜
H(t, ˜x)i>=hVr(p)c
Vr(p)αi>xH(t, Vr(p)α)
166
5.3. Nonlinear Factorizable Approximation Ansatz
=hVr(p)c
Vr(p)αi>E(t, Vr(p)α)>Q(t, Vr(p)α)Vr(p)α=˜
E(t, ˜x)>˜
Q˜x
for all (t, ˜x)R0×Rr. Finally, we compute the partial derivative of ˜
Hwith
respect to tand arrive at
t˜
H(t, ˜x) = tH(t, Vr(p)α) = α>Vr(p)>Q(t, Vr(p)α)>r(t, Vr(p)α) = ˜x>˜
Q>˜
r(t, ˜x)
for all (t, ˜x)R0×Rr.
Corollary 5.2.6 (Boundedness of part of the state in (5.43a) with (5.48)).Let
the assumptions of Theorem 5.2.5 be satisfied and let there additionally exist
constants ˆc1,ˆc2R>0such that the singular value bounds in (5.42) hold. Fur-
thermore, let the FOM Hamiltonian Hsatisfy condition (ii) in Definition 2.4.9
with V=H. In addition, let ˜x= [α>p>]>C1(I,Rrα+rp)be a solution of
the ROM state equation (5.43a) with u= 0 and coefficients as in (5.48) on the
time interval I= [t0, tend]with t0R0and tend R>t0. Then, there exists a
constant cR>0which is independent of t0and tend and satisfies
kα(t)k ckα(t0)kfor all tI.
Proof. By using similar arguments as in the proof of Corollary 5.2.4, we infer
that the function ˜
Hs:IRdefined via ˜
Hs(t):=˜
H(t, ˜x(t)) is continuously
differentiable and satisfies
˜
H(t, ˜x(t)) = ˜
Hs(t)˜
Hs(t0) = ˜
H(t0,˜x(t0)) for all tI.
Using this inequality, the singular value bounds (5.42), the assumption that
Hsatisfies condition (ii) in Definition 2.4.9 with constants c2, c3R>0, and
similar arguments as in the proof of Corollary 5.2.2, we arrive at
kα(t)k21
σmin(Vr(p(t)))2kVr(p(t))α(t)k21
c2ˆc2
2H(t, Vr(p(t))α(t))
=1
c2ˆc2
2
˜
H(t, ˜x(t)) 1
c2ˆc2
2
˜
H(t0,˜x(t0)) = 1
c2ˆc2
2H(t0, Vr(p(t0))α(t0))
c3
c2ˆc2
2kVr(p(t0))α(t0)k2c3σmax(Vr(p(t0)))2
c2ˆc2
2kα(t0)k2
c3ˆc2
1
c2ˆc2
2kα(t0)k2,
for all tI, which yields the assertion.
5.3. Nonlinear Factorizable Approximation Ansatz
In section 5.2, we have considered a special nonlinear approximation ansatz
which is given by a linear combination of state-dependent ansatz vectors corre-
167
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
sponding to the columns of Vr. In particular, this ansatz includes a splitting of
the ROM state into two block components αand p, where the αcomponent rep-
resents the coefficients of the linear combination, whereas the pcomponent is
used for parametrizing the ansatz vectors. As demonstrated in Theorems 5.2.1
and 5.2.3, this special structure still allows to obtain structure-preserving and
residual-minimizing ROMs. In this section, we consider a more general non-
linear approximation ansatz, which is based on a linear combination of ansatz
vectors which depend on time and on the ROM state, but in contrast to the
previous section we do not assume a separation of the ROM state variables.
Thus, we consider a so-called factorizable approximation ansatz of the form
x(t)Vr(t, ˜x(t))˜x(t)(5.49)
with Vr:R0×RrRn,r. Unfortunately, an extension of the residual mini-
mization property as in Theorems 5.2.1 and 5.2.3 to the more general ansatz
(5.49) appears to be challenging and is left for future research. Instead, we
only focus on the structure preservation in the following. In particular, this is
similarly as in section 5.1 achieved by enforcing the residual to be orthogonal
to the span of QVr, where Qmay be time- or state-dependent according to
the corresponding FOM. For the special case of a linear time-invariant port-
Hamiltonian FOM, a corresponding statement is provided in Theorem 5.3.1.
Its proof is omitted since it is a special case of the upcoming Theorem 5.3.6,
as the linear time-invariant pH structure (5.1)–(5.2) is a special case of the
nonlinear time-varying pH structure (5.10)–(5.11).
Theorem 5.3.1 (Structure-preserving MOR for (5.1) using a factorizable ap-
proximation ansatz).Consider the pH system (5.1) with E, J, R, Q satisfying
(5.2), i.e.,
J=J>, R =R>0, E>Q=Q>E0.
Furthermore, for given VrC1(R0×Rr,Rn,r)we consider the ROM
˜
E(t, ˜x(t)) ˙
˜x(t) + ˜
K(t, ˜x)˜x(t)=(˜
J(t, ˜x(t)) ˜
R(t, ˜x(t))) ˜
Q˜x(t) + ˜
B(t, ˜x(t))u(t),
(5.50a)
˜y(t) = ˜
B(t, ˜x(t))>˜
Q˜x(t),(5.50b)
for all tI, with coefficients ˜
E, ˜
K, ˜
J, ˜
R:R0×RrRr,r,˜
QRr,r, and
˜
B:R0×RrRr,m defined via
˜
E(t, ˜x):=Vr(t, ˜x)>Q>EVr(t, ˜x) + Vr(t, ˜x)>Q>Ec
Vr(t, ˜x)˜x, (5.51a)
˜
K(t, ˜x):=Vr(t, ˜x)>Q>EtVr(t, ˜x),˜
J(t, ˜x):=Vr(t, ˜x)>Q>JQVr(t, ˜x),
(5.51b)
˜
R(t, ˜x):=Vr(t, ˜x)>Q>RQVr(t, ˜x),˜
Q=Ir,˜
B(t, ˜x):=Vr(t, ˜x)>Q>B.
(5.51c)
168
5.3. Nonlinear Factorizable Approximation Ansatz
Here, c
Vr:R0×Rr L(Rr,Rn,r)is defined via
c
Vr(t, η1)(η2)η3:=˜xVr(t, η1)(η3)η2(5.52)
for all (t, η1, η2, η3)R0×Rr×Rr×Rr. Moreover, we define the associated
Hamiltonian ˜
H C1(R0×Rr)via ˜
H(t, ˜x):=1
2˜x>Vr(t, ˜x)>E>QVr(t, ˜x)˜x.
Then, ˜
His continuously differentiable and the ROM coefficients satisfy
t˜
H(t, ˜x) = ˜x>˜
Q>˜
K(t, ˜x)˜x, ˜x˜
H(t, ˜x) = ˜
E(t, ˜x)>˜
Q˜x,
˜
J(t, ˜x) = ˜
J(t, ˜x)>,˜
R(t, ˜x) = ˜
R(t, ˜x)>0
for all (t, ˜x)R0×Rr, i.e., the ROM (5.50)(5.51) has a nonlinear port-
Hamiltonian structure as in (5.10)(5.11).
In section 5.1, we have argued that the state equation (5.13a) of the port-
Hamiltonian ROM considered in Theorem 5.1.1 is stable by exploiting that
Vrhas full column rank and that Eand Qare invertible and satisfy E>Q=
Q>E0. Especially, these assumptions allow to infer that ˜
Edefined in
(5.14) is symmetric and positive definite. However, we may not use the same
arguments for the ROM (5.50) presented in Theorem 5.3.1. This is mainly
due to the second term in the definition of ˜
Ein (5.51), which results in ˜
E
not necessarily being pointwise positive definite even in the case where E>Q
is positive definite and Vrhas pointwise full column rank. Nevertheless, if
˜
Eis at least invertible, we may obtain a stability result for the ROM (5.50)
as detailed in Corollary 5.3.2. Its proof is omitted, since it is a special case
of Corollary 5.3.7, where especially the assumption that the origin 0Rn
is an equilibrium point of the FOM with u= 0 is automatically satisfied.
Moreover, the invertibility of Eand Qimplies together with (5.2) that E>Q
is positive definite and, hence, the FOM Hamiltonian satisfies condition (ii) in
Definition 2.4.9.
Corollary 5.3.2 (Stability of (5.50a) with coefficients as in (5.51)).Let the
assumptions of Theorem 5.3.1 be satisfied and let additionally Eand Qbe
invertible. Furthermore, let Vrbe twice continuously differentiable and let Vr,
E, and Qbe such that ˜
Eas defined in (5.51) is pointwise invertible, cf. Re-
mark 5.3.3. Besides, let there exist constants ˆc1,ˆc2R>0with
σmax(Vr(t, ˜x)) ˆc1and σmin(Vr(t, ˜x)) ˆc2for all (t, ˜x)R0×Rr.
(5.53)
Then, the ROM state equation (5.50a) with u= 0 and coefficients as in (5.51)
has a uniformly stable equilibrium point at 0Rr.
Remark 5.3.3 (Invertibility of ˜
E).In the special case where Vris constant
with respect to its second argument, we have that ˜
Eas defined in (5.51)
is pointwise symmetric and positive definite, provided that E>Qis positive
definite and Vrhas pointwise full column rank, cf. section 5.1. However, as
169
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
mentioned before Corollary 5.3.2, this is not necessarily true if ˜xVrdoes not
vanish. Nevertheless, if E>Qis positive definite and Vrhas pointwise full
column rank, then we have at least that ˜
E(t, 0) is invertible for all tR0,
since the second term on the right-hand side of (5.51a) vanishes for ˜x= 0. To
derive general conditions for the pointwise invertibility of ˜
E, we observe that
it may be factorized as
˜
E(t, ˜x):=Vr(t, ˜x)>Q>EVr(t, ˜x) + c
Vr(t, ˜x)˜xfor all (t, ˜x)R0×Rr.
Thus, using [34, Fact. 2.10.14], we infer
rank ˜
E(t, ˜x)
= rank(Vr(t, ˜x)) dim ker (Vr(t, ˜x)) im Vr(t, ˜x) + c
Vr(t, ˜x)˜x>E>Q,
= rank Vr(t, ˜x) + c
Vr(t, ˜x)˜x
dim ker Vr(t, ˜x)>Q>Eim Vr(t, ˜x) + c
Vr(t, ˜x)˜x
for all (t, ˜x)R0×Rr. Consequently, ˜
Eis pointwise invertible if and only if
for all (t, ˜x)R0×Rrwe have
(i) Vr(t, ˜x)has full column rank and
(ii) ker (Vr(t, ˜x)) im Vr(t, ˜x) + c
Vr(t, ˜x)˜x>E>Q={0}
or, equivalently,
(a) Vr(t, ˜x) + c
Vr(t, ˜x)˜xhas full column rank and
(b) ker Vr(t, ˜x)>Q>Eim Vr(t, ˜x) + c
Vr(t, ˜x)˜x={0}.
Especially, we note that these conditions are rather general and do not require
Eor Qto be invertible or E>Qto be symmetric or positive semi-definite. In
the special case where E>Qis symmetric and positive definite, we have for
instance that (b) is equivalent to the condition that im(Vr(t, ˜x) + c
Vr(t, ˜x)˜x)
has a trivial intersection with the orthogonal complement of im(Vr(t, ˜x)) with
respect to the weighted E>Qinner product. In accordance with the beginning
of this remark, this is for instance satisfied if c
Vror ˜xVrvanishes. Finally, we
note that similar arguments allow to obtain analogous conditions for the case,
where Eand Qdepend on tand ˜x.¨
We continue by considering the case of a linear time-varying pH full-order
model of the form (5.3) and present a corresponding port-Hamiltonian ROM
in Theorem 5.3.4. In particular, we emphasize that the ROM coefficients
defined in (5.54) do in general not satisfy the conditions (5.11), but instead
less restrictive conditions of the form (5.45), cf. the discussion after (5.44).
Under additional assumptions including the pointwise invertibility of ˜
E, one
170
5.3. Nonlinear Factorizable Approximation Ansatz
can show that the ROM state equation with u= 0 has a uniformly stable
equilibrium point in the origin, cf. Corollary 5.3.5.
Theorem 5.3.4 (Structure-preserving MOR for (5.3) using a factorizable ap-
proximation ansatz).Consider the pH system (5.3) with E, K, J, R, Q satisfy-
ing pointwise (5.4), i.e.,
E>Q=Q>E0, Q>RQ =Q>R>Q0,
d
dt(Q>E) = Q>(KJQ)+(KJQ)>Q.
Furthermore, for given VrC1(R0×Rr,Rn,r)we consider a ROM of the form
(5.50) with coefficient matrix functions ˜
E, ˜
K, ˜
J, ˜
R:R0×RrRr,r,˜
QRr,r,
and ˜
B:R0×RrRr,m defined via
˜
E(t, ˜x):=Vr(t, ˜x)>Q(t)>E(t)Vr(t, ˜x) + c
Vr(t, ˜x)˜x,
˜
K(t, ˜x):=Vr(t, ˜x)>Q(t)>(E(t)tVr(t, ˜x) + K(t)Vr(t, ˜x)) ,
˜
J(t, ˜x):=Vr(t, ˜x)>Q(t)>J(t)Q(t)Vr(t, ˜x),
˜
Q:=Ir,
˜
R(t, ˜x):=Vr(t, ˜x)>Q(t)>R(t)Q(t)Vr(t, ˜x),
˜
B(t, ˜x):=Vr(t, ˜x)>Q(t)>B(t)
(5.54)
with c
Vras defined in (5.52). Moreover, we introduce the ROM Hamiltonian
˜
H:R0×RrRvia
˜
H(t, ˜x):=H(t, Vr(t, ˜x)˜x) = 1
2˜x>Vr(t, ˜x)>E(t)>Q(t)Vr(t, ˜x)˜x.
Then, ˜
His continuously differentiable and the ROM coefficients satisfy
t˜
H(t, ˜x) = ˜x>˜
Q>˜
K(t, ˜x)˜
J(t, ˜x)˜
Q˜x,
˜x˜
H(t, ˜x) = ˜
E(t, ˜x)>˜
Q˜x,
˜
Q>˜
R(t, ˜x)˜
Q=˜
Q>˜
R(t, ˜x)>˜
Q0
(5.55)
for all (t, ˜x)R0×Rr, i.e., the ROM (5.50) with coefficients as in (5.54)
has a nonlinear time-varying port-Hamiltonian structure as in (5.8),(5.45).
Proof. The fact that ˜
Q>˜
R˜
Q=˜
Rdefined in (5.54) is pointwise symmetric
and positive semi-definite follows from the corresponding properties of Q>RQ.
Furthermore, ˜
His continuously differentiable due to the continuous differen-
tiability of E>Qand Vr. Moreover, the partial derivative of ˜
Hwith respect to
˜xis given by
˜x˜
H(t, ˜x)ζ=xH(t, Vr(t, ˜x)˜x) (Vr(t, ˜x)ζ+˜xVr(t, ˜x)(ζ)˜x)
171
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
= ˜x>Vr(t, ˜x)>Q(t)>E(t)Vr(t, ˜x) + c
Vr(t, ˜x)˜xζ
for all (t, ˜x, ζ)R0×Rr×Rrand, hence, we obtain
˜x˜
H(t, ˜x) = Vr(t, ˜x) + c
Vr(t, ˜x)˜x>E(t)>Q(t)Vr(t, ˜x)˜x=˜
E(t, ˜x)>˜
Q˜x
for all (t, ˜x)R0×Rr. Finally, for the partial derivative of ˜
Hwith respect
to t, we compute
t˜
H(t, ˜x) = tH(t, Vr(t, ˜x)˜x) + xH(t, Vr(t, ˜x)˜x)tVr(t, ˜x)˜x
=1
2˜x>Vr(t, ˜x)>d
dt(E>Q)(t)Vr(t, ˜x)˜x+ ˜x>Vr(t, ˜x)>Q(t)>E(t)tVr(t, ˜x)˜x
= ˜x>Vr(t, ˜x)>Q(t)>((K(t)J(t)Q(t)) Vr(t, ˜x) + E(t)tVr(t, ˜x)) ˜x
= ˜x>˜
Q>˜
K(t, ˜x)˜
J(t, ˜x)˜
Q˜x
for all (t, ˜x)R0×Rr.
Corollary 5.3.5 (Stability of (5.50a) with coefficients as in (5.54)).Let the
assumptions of Theorem 5.3.4 be satisfied and let there additionally exist con-
stants ˜c1,˜c2,ˆc1,ˆc2R>0such that the singular value bounds (5.26) and (5.53)
hold. Furthermore, let E, Q, J, R, K be continuously differentiable, Vrbe twice
continuously differentiable, and Vr,E, and Qbe such that ˜
Eas defined in
(5.54) is pointwise invertible, cf. Remark 5.3.3. Then, the ROM state equa-
tion (5.50a) with u= 0 and coefficients as in (5.54) has a uniformly stable
equilibrium point at 0Rr.
Proof. By Theorem 5.3.4 and due to the differentiability assumptions on Vrand
on the FOM coefficients, we obtain that ˜
J,˜
R,˜
K,˜
H, and ˜
Eare continuously
differentiable. Furthermore, due to (5.26) and (5.53) and by using similar
arguments as in the proof of Corollary 5.2.2, we obtain the bounds
˜
H(t, ˜x) = 1
2˜x>Vr(t, ˜x)>E(t)>Q(t)Vr(t, ˜x)˜x1
2σmax(E(t)>Q(t)) kVr(t, ˜x)˜xk2
1
2σmax(E(t)>Q(t))σmax(Vr(t, ˜x))2k˜xk2ˆc2
1˜c1
2k˜xk2=:c3k˜xk2
as well as
˜
H(t, ˜x)1
2σmin(E(t)>Q(t)) kVr(t, ˜x)˜xk2
1
2σmin(E(t)>Q(t))σmin(Vr(t, ˜x))2k˜xk2ˆc2
2˜c2
2k˜xk2=:c2k˜xk2
for all (t, ˜x)R0×Rr. Thus, ˜
Hsatisfies condition (ii) from Definition 2.4.9.
Moreover, 0Rris an equilibrium point of (5.50a) with u= 0 and the claim
follows then by using the same arguments as in the proof of Theorem 2.6.3.
172
5.3. Nonlinear Factorizable Approximation Ansatz
In fact, we note that the only reason which prevents us from directly applying
Theorem 2.6.3 is that ˜
Jis in general not pointwise skew-symmetric, but the
proof of Theorem 2.6.3 works analogously by exploiting (5.55) instead of (2.31).
We close this chapter by considering structure-preserving MOR for nonlin-
ear pH systems of the form (5.10) using a factorizable approximation ansatz
of the form (5.49). A corresponding port-Hamiltonian ROM is presented in
the following theorem and under some additional assumptions including the
pointwise invertibility of ˜
E, we may infer that the ROM state equation (5.56a)
with vanishing input has a uniformly stable equilibrium point at 0, see Corol-
lary 5.3.7.
Theorem 5.3.6 (Structure-preserving MOR for (5.10) using a factorizable
approximation ansatz).Consider the pH system (5.10) with E, r, J, R, Q and
the associated Hamiltonian Hsatisfying (5.11), i.e.,
E(t, x)>Q(t, x)x=xH(t, x), x>Q(t, x)>r(t, x) = tH(t, x)
J(t, x) = J(t, x)>, R(t, x) = R(t, x)>0for all (t, x)R0×Rn.
Furthermore, let Vr:R0×RrRn,r with rNnbe continuously differen-
tiable and let
˜
E(t, ˜x(t)) ˙
˜x(t) + ˜
r(t, ˜x(t)) = ( ˜
J(t, ˜x(t)) ˜
R(t, ˜x(t))) ˜
Q˜x(t) + ˜
B(t, ˜x(t))u(t),
(5.56a)
˜y(t) = ˜
B(t, ˜x(t))>˜
Q˜x(t),(5.56b)
for all tIbe a corresponding ROM with coefficient functions ˜
E, ˜
J, ˜
R:R0×
RrRr,r,˜
r:R0×RrRr,˜
QRr,r, and ˜
B:R0×RrRr,m defined as
˜
E(t, ˜x):=Vr(t, ˜x)>Q(t, Vr(t, ˜x)˜x)>E(t, Vr(t, ˜x)˜x)Vr(t, ˜x) + c
Vr(t, ˜x)˜x,
˜
J(t, ˜x):=Vr(t, ˜x)>Q(t, Vr(t, ˜x)˜x)>J(t, Vr(t, ˜x)˜x)Q(t, Vr(t, ˜x)˜x)Vr(t, ˜x),
˜
R(t, ˜x):=Vr(t, ˜x)>Q(t, Vr(t, ˜x)˜x)>R(t, Vr(t, ˜x)˜x)Q(t, Vr(t, ˜x)˜x)Vr(t, ˜x),
˜
r(t, ˜x):=Vr(t, ˜x)>Q(t, Vr(t, ˜x)˜x)>(r(t, Vr(t, ˜x)˜x) + E(t, Vr(t, ˜x)˜x)tVr(t, ˜x)˜x),
˜
Q:=Ir,
˜
B(t, ˜x):=Vr(t, ˜x)>Q(t, Vr(t, ˜x)˜x)>B(t, Vr(t, ˜x)˜x).
(5.57)
Here, c
Vris defined as in (5.52) and, moreover, we introduce the ROM Hamilto-
nian ˜
H:R0×RrRvia ˜
H(t, ˜x):=H(t, Vr(t, ˜x)˜x). Then, ˜
His continuously
differentiable and the ROM coefficients satisfy
˜
J(t, ˜x) = ˜
J(t, ˜x)>,˜
R(t, ˜x) = ˜
R(t, ˜x)>0,
˜
E(t, ˜x)>˜
Q˜x=˜x˜
H(t, ˜x),˜x>˜
Q>˜
r(t, ˜x) = t˜
H(t, ˜x),
173
5. Structure-Preserving Model Reduction for Port-Hamiltonian Systems
for all (t, ˜x)R0×Rr, i.e., the ROM (5.56) inherits the port-Hamiltonian
structure from the FOM (5.10).
Proof. The properties of ˜
Jand ˜
Rfollow from the corresponding properties of
Jand R, respectively. Furthermore, ˜
His continuously differentiable due to the
continuous differentiability of Hand Vr. Moreover, the relations concerning
the partial derivatives of ˜
Hfollow from
t˜
H(t, ˜x) = tH(t, Vr(t, ˜x)˜x) + xH(t, Vr(t, ˜x)˜x)tVr(t, ˜x)˜x
= ˜x>Vr(t, ˜x)>Q(t, Vr(t, ˜x)˜x)>(r(t, Vr(t, ˜x)˜x) + E(t, Vr(t, ˜x)˜x)tVr(t, ˜x)˜x)
= ˜x>˜
Q>˜
r(t, ˜x)
and
˜x˜
H(t, ˜x)ζ=xH(t, Vr(t, ˜x)˜x) (Vr(t, ˜x)ζ+˜xVr(t, ˜x)(ζ)˜x)
= ˜x>Vr(t, ˜x)>Q(t, Vr(t, ˜x)˜x)>E(t, Vr(t, ˜x)˜x)Vr(t, ˜x) + c
Vr(t, ˜x)˜xζ
=˜
E(t, ˜x)>˜
Q˜x>ζ
for all (t, ˜x, ζ)R0×Rr×Rr.
Corollary 5.3.7 (Stability of the ROM state equation (5.56a)).Let the as-
sumptions of Theorem 5.3.6 be satisfied and let additionally E,J,R,Q, and r
be continuously differentiable and Vrbe twice continuously differentiable. Fur-
thermore, let the FOM Hamiltonian Hsatisfy condition (ii) in Definition 2.4.9
with V=Hand let there exist constants ˆc1,ˆc2R>0such that the singular
values of Vrsatisfy (5.53). Besides, let E,Q, and Vrbe such that ˜
Eas defined
in (5.57) is pointwise invertible, cf. Remark 5.3.3, and let 0Rnbe an equi-
librium point of the FOM state equation (5.10a) with u= 0. Then, the ROM
state equation (5.56a) with u= 0 has a uniformly stable equilibrium point at
0Rr.
Proof. First, we note that the differentiability assumptions on the FOM co-
efficient functions and on Vrimply that ˜
E,˜
J,˜
R, and ˜
ras defined in (5.57)
are continuously differentiable. Furthermore, since Hsatisfies condition (ii) in
Definition 2.4.9 with constants c2, c3R>0and since the singular values of Vr
are bounded as in (5.53), we infer that also the ROM Hamiltonian ˜
Hsatisfies
condition (ii) in Definition 2.4.9, which follows from the calculation
˜
H(t, ˜x) = H(t, Vr(t, ˜x)˜x)c3kVr(t, ˜x)˜xk2c3σmax(Vr(t, ˜x))2k˜xk2c3ˆc2
1k˜xk2
and ˜
H(t, ˜x)c2kVr(t, ˜x)˜xk2c2σmin(Vr(t, ˜x))2k˜xk2c2ˆc2
2k˜xk2
for all (t, ˜x)R0×Rr. In addition, the fact that 0Rnis an equilibrium
point of (5.10a) with u= 0 implies r(t, 0) = 0 for all tR0. Thus, we also
have ˜
r(t, 0) = 0 for all tR0and the claim follows from Theorem 2.6.3.
174
6. Numerical Examples
In this chapter, we illustrate some of the methods discussed in the previous
chapters by means of numerical test cases with a one-dimensional spatial do-
main . In particular, we start in section 6.1 by considering the linear wave
equation with periodic boundary conditions as in Example 1.2.3. This test
case allows for an explicit analytical solution on the PDE level, cf. (1.15), and
this solution may be described by two shifted modes. Moreover, the modes
are shifted in opposite directions and thus we use an approximation ansatz of
the form (1.4) with two distinct path variables.
In section 6.2 we consider a linear advection–diffusion problem with an inho-
mogeneous Robin condition at the left boundary and a homogeneous Neumann
condition at the right boundary. Especially, the left boundary condition re-
sults in a wave entering the computational domain after a certain time period.
In order to reflect this entering wave in the approximation ansatz, we use a
discretized version of the extended domain shift operator introduced in sec-
tion 3.3.1, cf. Example 3.3.5. Furthermore, we demonstrate the performance
of the greedy algorithm presented in section 3.2.
Finally, we consider a nonlinear test case in section 6.3 given by a nonlinear
reaction–diffusion equation with mixed Dirichlet/Neumann boundary condi-
tions. In particular, the nonlinearity originates from the reaction term and
an efficient ROM evaluation is achieved by using the hyperreduction method
introduced in section 4.3.2.
For all three test cases, the space discretization is carried out by a standard
Galerkin finite element scheme based on an equidistant mesh and piecewise
linear basis functions. Moreover, the initial value of the semi-discrete system
is obtained by evaluating the corresponding PDE initial value at the FEM
grid points. For the time discretization, we use the implicit midpoint rule
and an equidistant time grid. Since each of the considered FOMs allows for
a port-Hamiltonian representation with quadratic Hamiltonian, the implicit
midpoint rule guarantees that a dissipation inequality is satisfied after time
discretization, cf. section 2.6.1. Besides, whenever a nonlinear equation system
needs to be solved during the simulation of the FOM or the ROM, we use
MATLAB’s fsolve function with default settings, except for the numerical
experiments in section 6.3 where the parameter OptimalityTolerance is set
to 108.
In the offline phase, the determination of the modes follows the approach
presented in section 3.1. More precisely, we solve minimization problems of the
form (3.9)–(3.10), where the time weights are chosen based on the composite
175
6. Numerical Examples
trapezoidal rule as ω1=ωq=1
2,ω2=. . . =ωq1= 1 and the spatial weighting
matrix Wis chosen as the corresponding FEM mass matrix. Unless stated oth-
erwise, we employ the variable projection approach outlined in section 3.1.2 for
reducing the optimization problems and we use the non-commercial software
GRANSO (version 1.6.4) as optimization solver, cf. [75].
The ROMs are constructed based on residual minimization as outlined in
section 4.1, where we use the finite-dimensional systems obtained after space
discretization as full-order models. However, in contrast to section 4.1, we
use a weighted norm for the residual as in Theorem 5.2.1(ii) to preserve the
port-Hamiltonian structure of the FOM. The time integration of the ROMs is
carried out using either the implicit midpoint rule or a time integration scheme
based on discrete gradient pairs. The latter is detailed in appendix C.2 and
guarantees that a dissipation inequality is satisfied after time discretization,
even if the Hamiltonian is not quadratic.
Unless stated otherwise, the error values specified in the following sections
correspond to the relative error in a discretized L2(I×Ω) norm. To this end,
we use the same discretization as for the mode determination, i.e., the relative
error between the FOM state x:IRnand a corresponding approximation
ˆx:IRnis computed as
v
u
u
tPq
i=1 ωikx(ti)ˆx(ti)k2
W
Pq
i=1 ωikx(ti)k2
W
,
where t1, . . . , tqIare the grid points of the time integration scheme and the
weights ω1, . . . , ωqand Ware chosen as for the mode determination.
All numerical experiments have been conducted on a laptop with 2.7 GHz
Dual-Core Intel Core i5 processor and 8 GB RAM. Furthermore, we have used
MATLAB R2020b.
6.1. Linear Wave Equation
We revisit Example 1.2.3 and consider the linear acoustic wave equation
tρ(t, ξ) = ρref ξv(t, ξ)for all (t, ξ)I×,
tv(t, ξ) = c2
ρref
ξρ(t, ξ)for all (t, ξ)I×,
ρ(0, ξ) = ρ0(ξ)for all ξ,
v(0, ξ) = v0(ξ)for all ξ,
ρ(t, 0) = ρ(t, 1) for all tI,
v(t, 0) = v(t, 1) for all tI,
(6.1)
with spatial domain = (0,1), time interval I= [0, tend], constants c, ρref
R>0, initial values ρ0, v0C1
per(Ω), and unknowns ρ, v:I×R. Fur-
176
6.1. Linear Wave Equation
thermore, we introduce the associated Hamiltonian H:L2(Ω) ×L2(Ω) R
via
H(ˆρ, ˆv):=1
2Z
c2
ρref
ˆρ(ξ)2+ρref ˆv(ξ)2!dξ, (6.2)
which is defined such that Hs:IRwith Hs(t):=H(ρ(t, ·), v(t, ·)) corre-
sponds to the sound energy, cf. [177, § 65]. It can be shown that Hsis constant,
i.e., the sound energy is a conserved quantity of (6.1), cf. the upcoming calcu-
lation in (6.4).
In the following, we perform a semi-discretization of (6.1) in space using
a finite element approach and, for this purpose, we start by deriving a corre-
sponding weak formulation. We are interested in obtaining a finite-dimensional
system which is port-Hamiltonian and ideally allows for a pH representation
with an Ematrix as in (2.37), since this is particularly convenient for model
reduction as mentioned in Remark 2.6.4. For this reason and taking into ac-
count the coefficients of the Hamiltonian in (6.2), it turns out to be beneficial
to multiply the first and second equation in (6.1) by c2
ρref and ρref, respectively.
After that, by applying integration by parts similarly as in [211], we obtain a
corresponding weak formulation as follows: Find (ρ, v): I×R2such that
(i) for all tI,ρ(t, ·)and v(t, ·)are in H1
per (Ω) and satisfy
c2
ρref hψρ, tρ(t, ·)iL2(Ω) +ρref hψv, tv(t, ·)iL2(Ω)
=c2
2hψρ, ξv(t, ·)iL2(Ω) Dv(t, ·), ψ0
ρEL2(Ω)
+hψv, ξρ(t, ·)iL2(Ω) hρ(t, ·), ψ0
viL2(Ω)
(6.3)
for all ψρ, ψvH1
per (Ω), where ψ0
ρand ψ0
vdenote the weak derivatives of
ψρand ψv, respectively, and ξρand ξvthe weak partial derivatives of
ρand vwith respect to ξ, respectively,
(ii) for all ξ, we have ρ(0, ξ) = ρ0(ξ)and v(0, ξ) = v0(ξ).
Before we proceed with the FEM discretization, we use (6.3) to demonstrate
that the sound energy is indeed a conserved quantity. For this purpose, let
(ρ, v): I×R2be a solution of the weak formulation of (6.1) and let it be
continuously differentiable with respect to time in the sense that the mapping
t7→ (ρ(t, ·), v(t, ·)) is in C1(I, L2(Ω) ×L2(Ω)). In addition, we note that Hs
as defined after (6.2) represents the Hamiltonian along the solution trajectory.
Then, using the chain rule, [11, Prop. VII.4.6] for the derivative of H, and
177
6. Numerical Examples
(6.3), we obtain
dHs
dt(t) = H0(ρ(t, ·), v(t, ·))(tρ(t, ·), tv(t, ·))
=c2
ρref hρ(t, ·), tρ(t, ·)iL2(Ω) +ρref hv(t, ·), tv(t, ·)iL2(Ω)
=c2
2hρ(t, ·), ξv(t, ·)iL2(Ω) hv(t, ·), ξρ(t, ·)iL2(Ω)
+hv(t, ·), ξρ(t, ·)iL2(Ω) hρ(t, ·), ξv(t, ·)iL2(Ω)
= 0
(6.4)
for all tI, i.e., the Hamiltonian is preserved along the solution trajectories.
Here, we exploited that the right-hand side of (6.3) may be described via a
skew-adjoint operator and that (6.3) may be formulated as a Hamiltonian sys-
tem. However, we omit further details here since infinite-dimensional Hamil-
tonian or port-Hamiltonian systems are not within the scope of this thesis.
Instead, we demonstrate the Hamiltonian structure on the finite-dimensional
level after semi-discretization in space.
Applying a standard Galerkin finite element scheme in space based on an
equidistant mesh with mesh size h=1
N+1 ,NN, and globally continuous and
piecewise linear ansatz and test functions yields the semi-discretized system
Eh˙xh(t) = Jhxh(t)for all tI,(6.5)
where xh:IR2(N+1) contains the coefficients corresponding to the FEM
ansatz functions and Eh, JhR2(N+1),2(N+1) are given by
Eh:=h
6diag c2
ρref
, ρref!M, M :=
4 1 0 ··· 0 1
1 4 1 ....
.
.0
0 1 4 ...0.
.
.
.
.
..........1 0
0··· 0 1 4 1
1 0 ··· 0 1 4
RN+1,N+1,
Jh:=c2
2"0 1
1 0#P, P :=
0 1 0 ··· 01
1 0 1 ....
.
.0
01 0 ...0.
.
.
.
.
..........1 0
0··· 01 0 1
1 0 ··· 01 0
RN+1,N+1.
(6.6)
We emphasize that Ehis symmetric and positive definite and Jhis skew-
symmetric. Consequently, (6.5) is a Hamiltonian system with Hamiltonian
178
6.1. Linear Wave Equation
0 1
0
1
ξ
t
0
0.2
0.4
ρ(t, ξ)
0 1
0
1
ξ
t
0.2
0
0.2
v(t, ξ)
Figure 6.1.: Linear wave equation: pseudocolor plots of the FOM solution for the density
(left) and the velocity (right).
Hh:R2(N+1) Rdefined via Hh(xh):=1
2x>
hEhxhand, in particular, stable,
cf. section 2.6.
For the following numerical experiments, we choose the PDE parameters as
c= 1 and ρref = 1, the final time as tend = 1, and the initial values as v0= 0
and
ρ0(ξ) =
1
2exp 11
1(20(ξ1
2))2,if ξ(0.45,0.55),
0,otherwise (6.7)
for all ξ, cf. Figure 6.2, left. Furthermore, we divide the spatial domain
into N+ 1 = 2000 equidistant intervals, which corresponds to a mesh size
of h= 5 ·104, and for the time discretization we use a step size of 104.
This results in a relative error of about 7·104when comparing the numerical
solution of the fully discretized system with the analytical solution provided
in Example 1.2.3. Figures 6.1 and 6.2 depict the numerical solution in the
form of pseudocolor plots and by means of plotting some selected snapshots,
respectively. As is typical for the linear wave equation, we observe traveling
waves both in the density and in the velocity snapshots. We note that the
velocity snapshots at t= 0 and t= 0.5in Figure 6.2, right, are zero everywhere
and thus covered by the other graphs.
Before we proceed with constructing a ROM based on transformed modes,
we first consider the singular value decay of the snapshot matrix in Figure 6.3.
As outlined in section 2.5.1, the singular value decay reflects the approxima-
tion quality which may be expected from a classical POD-based approach. We
observe a rather slow singular value decay in Figure 6.3, which is not unusual
considering the transport-dominated nature of the problem and especially the
sharp front profiles of the two traveling waves, cf. section 1.2.1. For instance,
for obtaining a relative offline error of 103with respect to the spectral norm
of the snapshot matrix, the POD requires more than 150 modes. While this
still corresponds to a dimension reduction of more than one order of magni-
tude, a description based on more than 150 degrees of freedom appears to be
unnecessarily complex in light of the simple structure of the analytical solution
179
6. Numerical Examples
0 0.5 1
0
0.2
0.4
ξ
ρ(t, ξ)
t= 0 t= 0.1t= 0.3t= 0.5t= 0.8
0 0.5 1
0.2
0
0.2
ξ
v(t, ξ)
Figure 6.2.: Linear wave equation: some selected snapshots of the FOM solution for the
density (left) and the velocity (right).
100 200 300 400 500 600 700 800 900 1,000
107
105
103
101
i
σi
σ1
Figure 6.3.: Linear wave equation: singular value decay of the snapshot matrix.
provided in (1.15).
In the following, we pursue a nonlinear model reduction approach based
on the approximation ansatz (1.4). To this end, we first determine suitable
modes by following the residual minimization approach discussed in section 3.1,
where we use the spatial weighting matrix W=h
6I2Mwith Mas in (6.6).
Furthermore, for reducing the computational effort, we consider only every
fifth snapshot for the mode determination, which corresponds to an effective
time step size of t= 5 ·104. Moreover, we set the number of modes to
r= 2 and the mode dimension to dφ= 2(N+ 1). For the transformations we
employ a discretized version of the family of periodic shift operators defined in
Definition 1.2.2 based on cubic spline interpolation. More precisely, for given
shift value pR, we use I2 Tper,h(p)as shift operator, where Tper,h(p)
180
6.1. Linear Wave Equation
L(RN+1)is defined via
Tper,h(p)φ:=T(q(p))a(φ)ζ(p)b(φ) + ζ(p)2c(φ)ζ(p)3d(φ)
with q(p):=$pmod 1
h%, ζ(p):=pmod h, T(q):="0Iq
IN+1q0#.
Here, a(φ), b(φ), c(φ), d(φ)RN+1 are vectors containing the coefficients of the
spline interpolant of φ, see appendix D.1.1 for their definition and a derivation
of Tper,h from Tper. For the choice of the path values in the offline phase, we
exploit the fact that the wave speeds of (6.1) are known and set p1(t) = ct
and p2(t) = ct for all tI. Alternatively, they could be determined based
on the snapshot data as illustrated in [241, sec. 2.3] or by using one of the
other methods mentioned in Remark 3.1.2. For determining the modes, we
use the variable projection approach discussed in section 3.1.2 and solve the
corresponding optimization problem by means of the GRANSO software with
default settings. As starting values we use the respective first left singular
vector of the shifted snapshot matrices
h(I2Tper,h(p1(t1))) xh(t1)··· (I2Tper,h(p1(tq))) xh(tq)i
and h(I2Tper,h(p2(t1))) xh(t1)··· (I2Tper,h(p2(tq))) xh(tq)i.
Here, the application of the negative shift corresponds to shifting the snapshots
into the respective co-moving reference frame.
The GRANSO solver terminates after nine iterations, since the line search
algorithm bracketed a minimizer, but fails to satisfy the Wolfe conditions,
see [75] for more details about the employed line search algorithm. It is also
stated that this often indicates that a stationary point has been reached. The
resulting relative offline error is about 3.4·104and the determined modes are
depicted in Figure 6.4. On the other hand, we omit a plot of the corresponding
amplitudes, since they are nearly constant with respect to time, which is in
accordance with the corresponding discussion in Example 1.2.3 based on the
analytical solution. Considering the modes depicted in Figure 6.4, we observe
that they agree well with the profiles of the traveling waves except for some
constant offsets in the velocity component. These offsets are actually already
present in the corresponding starting values and the fact that they are not
removed by the optimization procedure may be attributed to a non-uniqueness
of the optimal solution. In fact, since constant functions are invariant under
shifting operations, we may add constant offsets to each of the modes without
changing the value of the cost function, as long as the offsets sum up to zero.
Indeed, by inspecting Figure 6.4, right, we note that the magnitude of the
offsets in the first and the second mode is the same, but they have opposite
signs. A similar observation is made in [241, sec. 3], where such offsets are
encountered in the context of a two-dimensional vortex pair test case.
181
6. Numerical Examples
0 0.5 1
0
0.04
0.08
ξ
φ(ξ)
ρcomponent of φ1
ρcomponent of φ2
0 0.5 1
0.1
0
0.1
ξ
φ(ξ)
vcomponent of φ1
vcomponent of φ2
Figure 6.4.: Linear wave equation: density (left) and velocity (right) components of the
determined modes.
The results stated in the last paragraph are based on solving a reduced op-
timization problem, which is obtained via variable projection as outlined in
section 3.1.2. In order to assess the benefit of solving the reduced problem
instead of the full one discussed in section 3.1.1, we also solve the full opti-
mization problem via GRANSO, where we use the same settings as before.
Especially, for the starting values we take the same modes as for the reduced
problem, whereas the starting values of the amplitudes are determined based
on solving the corresponding linear least squares problem with fixed modes,
cf. the discussion before (3.17). The total number of unknowns of the full
problem is r(dφ+ 1 + 1
t) = 12,002, whereas the reduced problem involves
rdφ= 8000 optimization parameters. For the full problem, the GRANSO
solver terminates after 105 iterations, since the norm of the gradient becomes
smaller than the default tolerance of 108. The resulting offline error is the
same as the one obtained using the reduced optimization problem, while the
number of iterations is more than one order of magnitude higher for the full
optimization problem. However, this comparison of the iteration numbers is of
little significance, since for the full problem the solver has reached the tolerance
for the norm of the gradient, whereas for the reduced problem the solver has
terminated prematurely due to a failure of the line search algorithm. In order
to obtain a more meaningful comparison, we specify a target value of 103for
the relative error of the resulting snapshot matrix approximation. Solving the
reduced problem, the GRANSO procedure attains an error of approximately
3.6·104after three iterations, which involve nine evaluations of the cost func-
tion and its gradient, and requires a computation time of about 16 seconds.
On the other hand, when solving the full problem, GRANSO needs four itera-
tions involving ten function evaluations, takes roughly 23 seconds, and attains
182
6.1. Linear Wave Equation
an error of about 7.1·104. The advantage of using the reduced problem
may also be observed when decreasing the target value for the relative error
to 3.5·104: Solving the reduced problem, GRANSO needs four iterations,
ten function evaluations, and approximately 19 seconds, whereas it requires
twelve iterations, 31 function evaluations, and about 67 seconds when solving
the full problem. All in all, the numerical experiments indicate an advantage
of using the reduced optimization problem discussed in section 3.1.2.
We proceed to construct a ROM similarly as in (4.1)–(4.2) based on the
determined modes and using the derivative of Tper,h provided in (D.4) in ap-
pendix D.1.1. In fact, we use a slightly modified version of the framework
presented in section 4.1 by minimizing a weighted norm of the residual with
weighting matrix E1
h. Especially, since (1.4) is a special case of a separable
approximation ansatz, it follows from the considerations in section 5.2 that
the resulting ROM is port-Hamiltonian, cf. Theorem 5.2.1. Actually, since the
FOM is even a Hamiltonian system, this is also true for the ROM, i.e., the
ROM Hamiltonian is a conserved quantity.
Since the FOM is linear, the resulting ROM has the form
"Mα(p)N(p)D(α)
D(α)>N(p)>D(α)>Mp(p)D(α)#"˙α
˙p#="Aα(p)
D(α)>Ap(p)#α, (6.8)
where Dis constructed as in (4.2) and Aαand Apas in (4.37). Also Mα,
N, and Mpare obtained as in (4.2), but the occurring inner products need
to be replaced by weighted inner products with weighting matrix Eh. As
discussed in section 4.3, the evaluation of the ROM still scales with the FOM
dimension, since each entry of Mα,N,Mp,Aα, and Apinvolves products
of high-dimensional matrices and vectors. However, in [38] it is shown for the
linear wave equation that, even though the ROM coefficients depend in general
on the paths, the solution of the ROM remains the same when ignoring the
path dependency of the coefficient matrices. Admittedly, the result in [38] is
only stated for the case where the ROM is directly obtained by projecting the
infinite-dimensional problem (6.1), whereas the ROM (6.8) has been derived
based on the finite-dimensional FOM (6.5). Nevertheless, we have observed
in our numerical experiments that the solution of the ROM (6.8) does not
significantly change when ignoring the path dependencies of Mα,N,Mp,Aα,
and Ap. Hence, we may precompute the evaluations of these matrix functions
at p(0) = 0 and use the resulting constant matrices in the online phase. In
particular, this hyperreduction approach, which consists in approximating the
path-dependent coefficient matrices by constant ones, ensures that the online
phase does not scale with the FOM dimension.
For a comparison of the ROMs with and without hyperreduction, we solve
them numerically using the implicit midpoint rule and the same parameter
values which have been used for generating the FOM snapshot data in the
offline phase. The relative speed-up of the hyperreduced ROM in comparison
to the ROM without hyperreduction is roughly 6, while the corresponding
183
6. Numerical Examples
errors differ only slightly: the ROM without hyperreduction yields a relative
online error of about 3.4·104and the error of the hyperreduced ROM is
approximately 3.7·104. Consequently, in both cases the online error is in the
same order of magnitude as the offline error and as the error of the FOM with
respect to the analytical solution. Furthermore, both ROMs are faster than
the FOM, which takes about 80 seconds and hence roughly 20 seconds more
than the ROM without hyperreduction. One reason for the ROM without
hyperreduction to be faster than the FOM is that the ROM evaluation only
involves matrix vector products which scale with the FOM dimension, whereas
evaluating the FOM also requires solving a high-dimensional linear equation
system in each time step, since we use an implicit time integration scheme.
Next, we compare the ROM (6.8) and its hyperreduced analogue with a
ROM obtained by using a classical POD approach and a subsequent Galerkin
projection, cf. sections 2.5.1 and 2.5.3. To this end, we construct a ROM based
on 280 POD modes, which results in an online error of about 3.6·104, i.e., the
accuracy is comparable to that of the two ROMs addressed in the preceding
paragraph. Even though the number of POD modes is fairly large, the evalua-
tion of the resulting ROM requires only ten seconds, which is about the same
computation time as for the hyperreduced ROM based on two transformed
modes. One reason for this appears to be that the POD-based ROM is linear
and therefore its evaluation only requires to solve a linear equation system of
moderate size in each time step. On the other hand, the ROM (6.8) and its
hyperreduced counterpart are nonlinear and, hence, each time step involves
solving a nonlinear equation system via an iterative procedure. However, we
emphasize that all computation times mentioned so far are based on a time
step size of 104, i.e., we have used the same time step size for the ROMs as
for the FOM. When increasing the time step size, the accuracy of the POD-
based ROM decreases significantly. For instance, an increment of the time
step size from 104to 2.5·104leads to an online error of about 2.7·103. In
contrast to this, the accuracy of the hyperreduced ROM based on the nonlin-
ear ansatz (1.4) appears to be largely independent of the time step size. For
example, increasing the time step size to 0.02 still yields an online error of
around 3.7·104, while the computation time reduces to less than a second.
This flexibility in choosing the time step size is in fact not surprising when
considering the time dependency of the amplitudes and the paths. Based on
the analytical solution and in accordance with the discussion in Example 1.2.3,
we expect the amplitudes to be nearly constant and the paths to be almost
linear functions of time. This expectation is also confirmed by the numerical
experiments, cf. Figure 6.5, which explains the flexibility in choosing the time
step size, since the implicit midpoint rule is exact for constant and linear func-
tions, see for instance [80, sec. 6.3]. We note that in [37, sec. 7.3] a similar
freedom of choosing the time step size has been observed, while using a time
integration scheme with adaptive time stepping.
The main reason for us choosing the implicit midpoint rule as time inte-
184
6.2. Linear Advection–Diffusion Equation
0 0.5 1
3.3
3.35
3.4
t
α(t)
α1
α2
0 0.5 1
1
0
1
t
p(t)
p1
p2
Figure 6.5.: Linear wave equation: online values of the amplitudes (left) and the paths
(right).
gration scheme is that the Hamiltonian of the FOM is a quadratic function
and, thus, the implicit midpoint rule ensures that the Hamiltonian is also a
conserved quantity of the time-discrete system, see for instance [80, sec. 6.3.4].
However, due to the nonlinearity of the approximation ansatz (1.4), the Hamil-
tonian of the ROM (6.8) is in general not a quadratic function, cf. section 5.2.
Consequently, there is no guarantee that the ROM Hamiltonian is a conserved
quantity of the time-discrete ROM when using the implicit midpoint rule. In
fact, the ROM Hamiltonian is slightly increasing as illustrated in Figure 6.6.
On the contrary, when using a time discretization scheme based on the mid-
point discrete gradient pair, cf. appendix C.2, the ROM Hamiltonian is guaran-
teed to be a conserved quantity of the time-discrete ROM. This is also reflected
in the numerical experiments, where the largest occurring deviation from the
initial value of the ROM Hamiltonian is about 2·1016, see also Figure 6.6.
These results indicate that discrete gradient pair methods as discussed in ap-
pendix C.2 may be a useful tool in applications with state-dependent mass
matrix and non-quadratic Hamiltonian, especially when an exact conservation
of the Hamiltonian is required.
6.2. Linear Advection–Diffusion Equation
In contrast to the preceding section, we consider in the following a test case
with non-periodic boundary conditions. More precisely, we focus on a linear
advection–diffusion equation on the spatial domain = (0,1) with an inhomo-
geneous Robin condition on the left boundary and a homogeneous Neumann
condition on the right boundary. The corresponding governing equations for
185
6. Numerical Examples
0 0.2 0.4 0.6 0.8 1
0
0.5
1
1.5·1011
t
˜
H(˜x(t)) ˜
H(˜x(0))
implicit midpoint rule
discrete gradient pair method
Figure 6.6.: Linear wave equation: Comparison of the error in conservation of the ROM
Hamiltonian using the implicit midpoint rule and the midpoint discrete gra-
dient pair method outlined in appendix C.2.
the unknown x:I×Rare given by
tx(t, ξ) = c∂ξx(t, ξ) + d∂ξξx(t, ξ)for all (t, ξ)I×,
cx(t, 0) d∂ξx(t, 0) = cg(t)for all tI,
ξx(t, 1) = 0 for all tI,
x(0, ξ) = x0(ξ)for all ξ
(6.9)
with time interval I= [0, tend],tend R>0, advection speed cR>0, diffusion
coefficient dR>0, Robin boundary value g:R0R, and initial value
x0:R. The combination of Robin and Neumann boundary conditions
as used in (6.9) is sometimes referred to as Danckwerts boundary conditions,
cf. [5, 77].
Similarly as in section 6.1, we use integration by parts and obtain the fol-
lowing weak formulation: Find x:I×Rsuch that
(i) for all tI,x(t, ·)is in H1(Ω) and satisfies
hψ, tx(t, ·)iL2(Ω) =(0)g(t)c
2hψ, ξx(t, ·)iL2(Ω) hψ0, x(t, ·)iL2(Ω)
dhψ0, ξx(t, ·)iL2(Ω) c
2(ψ(1)x(t, 1) + ψ(0)x(t, 0))
for all ψH1(Ω),
(ii) for all ξ, we have x(0, ξ) = x0(ξ).
Furthermore, we introduce the associated Hamiltonian H:L2(Ω) Rvia
H(x):=1
2kxk2
L2(Ω) and derive in the following a corresponding dissipation
inequality. To this end, let x:I×Rbe a solution of the weak formulation
186
6.2. Linear Advection–Diffusion Equation
of (6.9) and let it be continuously differentiable with respect to time in the
sense that the mapping t7→ x(t, ·)is in C1(I, L2(Ω)). In addition, we consider
the function Hs:IRwhich is defined via Hs(t):=H(x(t, ·)) and represents
the Hamiltonian along the solution trajectory. Then, by similar arguments as
in (6.4), we obtain the dissipation inequality
dHs
dt(t) = H0(x(t, ·))(tx(t, ·)) = hx(t, ·), tx(t, ·)iL2(Ω)
=dkξx(t, ·)k2
L2(Ω) c
2x(t, 1)2+x(t, 0)2+cx(t, 0)g(t)
cx(t, 0)g(t)
for all tI. Thus, in the special case g= 0 the Hamiltonian does not
increase along solution trajectories. This property is also reflected in a port-
Hamiltonian structure on the semi-discretized level as shown in the following.
For the semi-discretization in space, we use the Galerkin FEM with mesh size
h=1
N+1 ,NN, analogously as in section 6.1. The resulting semi-discretized
system takes the form
Eh˙xh(t)=(JhRh)xh(t) + Bhu(t)for all tI,(6.10)
where xh:IRN+2 contains the coefficients corresponding to the FEM ansatz
functions, the input u:R0Ris given by u=g, and Eh, Jh, RhRN+2,N+2,
BhRN+2 are defined as
Eh:=h
6
2 1 0 ··· 0 0
1 4 1 ....
.
..
.
.
0 1 4 ...0 0
.
.
..........1 0
0··· 0 1 4 1
0··· 0 0 1 2
, Bh:=c
1
0
0
.
.
.
0
,(6.11a)
Jh:=c
2tridiagN+2(1,0,1),(6.11b)
Rh:=d
h
11 0 ··· 0 0
1 2 1....
.
..
.
.
01 2 ...0 0
.
.
..........1 0
0··· 01 2 1
0··· 0 0 1 1
+c
2diag(1,0,...,0,1).(6.11c)
Here, Ehis symmetric and positive definite, Jhis skew-symmetric, and Rhis
symmetric and positive semi-definite. Hence, the semi-discrete system (6.10)
corresponds to the state equation of a port-Hamiltonian system of the form
187
6. Numerical Examples
0 0.5 1
0
0.5
1
1.5
ξ
t
0
0.2
0.4
x(t, ξ)
0 0.5 1
0
0.2
0.4
ξ
t= 0 t= 0.15 t= 0.3t= 0.6t= 0.9
Figure 6.7.: Linear advection–diffusion equation: pseudocolor plot of the FOM solution
(left) and some selected snapshots (right).
(2.37) with Hamiltonian Hh(xh) = 1
2x>
hEhxh. In particular, we infer that
(6.10) is stable for u= 0, cf. section 2.6. Moreover, since the Hamiltonian is
quadratic, a time discretization based on the implicit midpoint rule ensures a
dissipation inequality also on the time-discrete level, see section 2.6.1.
For the following numerical experiments, we choose the PDE parameters as
c= 1 and d= 103, the final time as tend = 1.5, the boundary value as
g(t) = u(t) =
1
2exp 11
1(20(t0.15·tend))2,if t(0.175,0.275),
0,otherwise,(6.12)
for all tR0, and the initial value as x0=ρ0with ρ0as in (6.7). Besides,
we divide the spatial domain into N+ 1 = 1000 equidistant intervals, which
corresponds to a mesh size of h= 103. For the time discretization, we use a
step size of 103as well. Figure 6.7 depicts the numerical solution by means of a
pseudocolor plot and some selected snapshots. We observe that the initial wave
profile is transported to the right, while its shape and amplitude changes due
to the diffusion. After a certain time, a second wave enters the computational
domain via the left boundary and is also transported to the right. Eventually,
both waves leave the computational domain via the right boundary.
We proceed by determining suitable modes based on the snapshot data of the
FOM and, for this purpose, we use again the residual minimization approach
from section 3.1. The spatial weighting matrix is chosen as W=Eh, where
Ehdenotes the FEM mass matrix as defined in (6.11a). Furthermore, for
reducing the computational effort, we only use every second snapshot for the
mode determination, which corresponds to an effective time step size of t=
2·103. Moreover, we set the number of modes to r= 3 and we use a family
of transformation operators which is inspired by the extended domain shift
operator introduced in Definition 3.3.3. More precisely, we first introduce an
188
6.2. Linear Advection–Diffusion Equation
extended domain similar to (3.32), where we use one common shift pfor all
three modes. In particular, for the offline phase we exploit the fact that the
advection speed of (6.9) is known and set p(t) = ct for all tI. The resulting
extended domain is given by b
e= (ctend,1), i.e., defining the modes φion
b
eensures the shifted modes φi(ξct)to be well-defined for all (t, ξ)I×
and i {1,2,3}. For the discretized modes, this leads to a dimension of
dφ=1 + ctend
h+ 1 = 2501,
where each entry corresponds to a point of the grid which is obtained by di-
viding b
einto equidistant intervals of length h. However, we emphasize that,
even though the extended domain is sufficiently large for the offline phase
where the shift is prescribed, there is in general no guarantee that the shift
computed in the online phase remains in the interval [0, ctend]. This is due to
the fact that the shift pconstitutes a part of the ROM state, which is a priori
unknown. In order to avoid defining the modes on an even larger domain, we
use a family of transformation operators which combines the extended domain
shift operator introduced in Definition 3.3.3 with the constant extrapolation
shift operator defined in Definition 3.3.8. The resulting family of transfor-
mation operators essentially uses an extended domain shift operator, as long
as the shift values are within the range [0, ctend], and constant extrapolation
otherwise, see appendix D.1.2 for more details. A discretized version of this
transformation family is obtained via cubic spline interpolation and given by
Tce,h :R L(Rdφ,RN+2)with
Tce,h(η)φ:=T1(q(η))φ+T2(q(η))a(φ)ζ(η)b(φ) + ζ(η)2c(φ)ζ(η)3d(φ),
(6.13a)
q(η):=η
h, ζ(η):=ηmod h, (6.13b)
T1(q):=
1N+2+min(q+1dφ,0) 0
0 0
RN+2,dφ,if qdφ(N+ 2),
0 0
01min(N+2,max(0,q))
RN+2,dφ,otherwise,
(6.13c)
189
6. Numerical Examples
T2(q):=
0 0
Imax(dφ1q,0) 0
RN+2,dφ1,if qdφ(N+ 2),
0z1(q)×z2(q)Iz1(q)0
0 0 0
RN+2,dφ1,otherwise,
z1(q):= max(N+ 2 + min(0, q),0),
z2(q):=dφ1max(0, N +2+q).
(6.13d)
The unspecified block sizes in the definitions of T1and T2result from the sizes
of the other blocks with the understanding that a block row or column does not
occur if the corresponding number of rows or columns is zero. Furthermore,
the vectors a(φ), b(φ), c(φ), d(φ)Rdφ1contain the spline coefficients of φ,
see appendix D.1.2 for their definition and a derivation of Tce,h.
For the actual computation of the modes, we use the variable projection
approach discussed in section 3.1.2 and solve the corresponding optimization
problem by means of the GRANSO software with default settings. Regarding
the choice of the starting values, we cannot use the same approach as for the
wave equation in section 6.1, since we use a different family of transformation
operators. In fact, our choice for the starting values in section 6.1 is motivated
by the pointwise invertibility of the family of periodic shift operators Tper,
cf. appendix A. On the other hand, the family of extended domain shift opera-
tors Teis not pointwise invertible and neither is Tce,h, which is pointwise a non-
square matrix. However, one can show that Tesatisfies Te(η)Te(η)= IdL2(Ω)
for all ηRand we observed that also Tce,h(η)Tce,h(η)>IN+2 holds, at least
within the considered range of shift values. Accordingly, we propose to use the
first three left singular vectors of the transformed snapshot matrix
hTce,h(p(t1))>xh(t1)··· Tce,h(p(tq))>xh(tq)i
as starting values for the modes. Based on these starting values, the GRANSO
solver terminates after almost 5000 iterations and the resulting relative offline
error is about 7.6·103.
Next, we construct a ROM based on the determined modes and using the
derivative of Tce,h provided in (D.10) in appendix D.1.2. Analogously as in
section 6.1, we slightly differ from the setting in section 4.1 and use a weighted
norm for the residual minimization, which ensures that the ROM is port-
Hamiltonian. The resulting ROM has the form
"Mα(p)N(p)α
α>N(p)>α>Mp(p)α#"˙α
˙p#="Aα(p)
α>Ap(p)#α+"Bα(p)
α>Bp(p)#u, (6.14)
where Aα, Ap:RRr,r are constructed as in (4.37) with p1=. . . =pr=p
190
6.2. Linear Advection–Diffusion Equation
and F:=JhRh, while Bα, Bp:RRrare defined via
Bα(p):=hφ>
iTce,h(p)>Bii, Bp(p):=hφ>
iT0
ce,h(p)>Bii.
Moreover, Mα,N, and Mpare constructed as in (4.15c)–(4.15e), but the oc-
curring inner products need to be replaced by weighted inner products with
weighting matrix Eh.
As discussed in section 4.3, the evaluation of the ROM still scales with the
FOM dimension, since each entry of Mα,N,Mp,Aα,Ap,Bα, and Bpinvolves
products of high-dimensional matrices and vectors. In order to achieve an
efficient offline/online decomposition, we follow the approach outlined in sec-
tion 4.3.1 by sampling these path-dependent matrices and constructing asso-
ciated interpolants. More precisely, we construct piecewise linear interpolants
for each of these matrix functions. For this purpose, we compute correspond-
ing samples within the range [0, ctend]and based on an equidistant grid whose
grid size coincides with the one used for the FEM discretization. Due to the
resulting large number of interpolation intervals, the ROM evaluation formally
still scales with the FOM dimension, since we need to determine the correct
interpolation interval whenever the shift pis updated. However, such a binary
search only scales logarithmically with the full dimension, see for instance [165,
sec. 6.2.1].
For a comparison of the ROMs with and without hyperreduction, we solve
them numerically using the implicit midpoint rule and the same parameter val-
ues which have been used for generating the FOM snapshot data in the offline
phase. The evaluation of the non-hyperreduced ROM takes approximately
eight seconds, whereas the hyperreduced ROM requires only a computation
time of about 4.2seconds. This amounts to a relative speed-up of roughly 1.9,
while both ROMs achieve a relative online error of 9.9·103, which is thus
slightly higher than the offline error of 7.6·103. However, even if the hyper-
reduction leads to a lower computation time of the ROM, this speed-up is still
not sufficient, since the evaluation of the FOM is already very fast and takes
only about 0.35 seconds. Furthermore, the same computation time is required
by a POD-based ROM with 32 modes, which leads to an online error of about
8.6·103. Thus, for the problem at hand, the nonlinear ROMs cannot compete
with the FOM or with the POD-based ROM in terms of the achieved speed-up
and accuracy. One reason for this is that the nonlinear ROMs require non-
linear system solves in each time step, see also the corresponding discussion
for the wave equation test case in section 6.1. Furthermore, the simulation
of the hyperreduced ROM involves the evaluation of multiple interpolants for
assembling the coefficient matrices and this constitutes a computational over-
head, which is neither present in the FOM evaluation nor in the evaluation of
the POD-based ROM. Nevertheless, the disadvantage of the nonlinear ROMs
in terms of computation time may be reduced by exploiting the fact that the
ROMs may be simulated with a larger time step size than the FOM without
191
6. Numerical Examples
a significant loss of accuracy. For instance, when increasing the time step size
tenfold, the computation time of the hyperreduced ROM is about 0.8seconds
and thus at least in the same order of magnitude as the FOM simulation time.
By contrast, the resulting increment of the online error is negligibly small.
On the other hand, this flexibility in increasing the time step size without a
significant loss of accuracy does not equally apply to the POD-based ROM,
for which the tenfold time step size results in a tenfold error.
As mentioned after (6.11), an advantage of using the implicit midpoint rule
is that it ensures a dissipation inequality for the time-discrete FOM, since the
corresponding Hamiltonian is a quadratic function of the state. However, this
is not true for the ROM Hamiltonian and consequently there is no guarantee
that the time-discrete ROM satisfies a dissipation inequality when using the
implicit midpoint rule. Nevertheless, as long as the time step size is suffi-
ciently small, we have observed in our numerical experiments that the implicit
midpoint rule still yields an energy-consistent solution in the sense that the
Hamiltonian only increases within the time interval (0.175,0.275), where the
input is non-zero, cf. (6.12). On the other hand, when using an increased
time step size of 0.08, this is no longer true and we observe a violation of
the power balance, cf. Figure 6.8. In particular, at t= 0.28 the input and
hence the supplied power are zero, whereas the discrete time derivative of the
ROM Hamiltonian obtained via the implicit midpoint rule is positive, i.e., the
Hamiltonian increases despite the vanishing input value. Moreover, we observe
that the discrete time derivative of the ROM Hamiltonian does not match the
corresponding dissipation at t= 0.28 and partly also afterwards. When we use
instead a time discretization scheme based on the midpoint discrete gradient
pair, cf. appendix C.2, Theorem C.2.3 yields that also on the time-discrete level
a dissipation inequality is satisfied. Accordingly, we observe in Figure 6.8 that
the inconsistency of the power balance does not apply to the solution obtained
by the discrete gradient pair approach. Instead, the discrete time derivative of
the Hamiltonian and the corresponding dissipation are in excellent agreement
in that part of the time interval where the supplied power is zero.
While the fact that discrete gradient pair methods lead to an exact power
balance on the time-discrete level is proven in appendix C.2 and illustrated in
Figures 6.6 and 6.8, we have not yet addressed its convergence behavior. To
study the order of convergence numerically, we consider a reference solution
obtained by solving the ROM (6.14) via the RADAU IIA method of order
five, cf. [133, p. 72ff.], with time step size 2·106. Furthermore, to diminish
the influence of the accuracy of the nonlinear equation system solver fsolve,
we set the tolerances OptimalityTolerance and FunctionTolerance to 1013
and 108, respectively. Moreover, we consider the reduced time interval [0,1.2]
instead of [0,1.5], since we have observed in our numerical experiments that
the obtained solutions of the nonlinear equation systems become quite sensitive
with respect to the starting point and the fsolve tolerances towards the end
of the time interval [0,1.5], especially when using a coarse time discretization.
192
6.2. Linear Advection–Diffusion Equation
0 0.2 0.4 0.6 0.8 1 1.2 1.4
0.05
0
0.05
0.1
t
d˜
H
dt(implicit midpoint rule)
d˜
H
dt(discrete gradient pair method)
dissipation (implicit midpoint rule)
dissipation (discrete gradient pair method)
supplied power (implicit midpoint rule)
supplied power (discrete gradient pair method)
0.26 0.28 0.3
0.01
0
0.01
Figure 6.8.: Linear advection–diffusion equation: Comparison of the discrete time deriva-
tive of the ROM Hamiltonian and the corresponding dissipation and supplied
power with time step size t= 0.08 when using the implicit midpoint rule
and the midpoint discrete gradient pair approach from appendix C.2. The
inset highlights an energy inconsistency of the implicit midpoint rule, where
the discrete time derivative of the Hamiltonian is positive despite a vanishing
power supply. Here, t= 0.28 corresponds to the first midpoint where the
input becomes permanently zero.
193
6. Numerical Examples
106105104103102101100
1011
109
107
105
103
101
t
relative error
implicit midpoint rule
discrete gradient pair method
(∆t)2
Figure 6.9.: Linear advection–diffusion equation: Convergence of the implicit midpoint
rule and the midpoint discrete gradient pair method from appendix C.2.
Presumably, this is due to the fact that the values of the FOM state become
very small, once the second wave leaves the computational domain, and hence
there may be multiple choices for the path and the amplitudes leading to a
small residual.
Based on the reference solution, we determine the relative errors of the ROM
solutions obtained via the implicit midpoint rule and the discrete gradient
pair approach for various time step sizes ranging from 2·106to 217 ·106
0.13. The specified error values correspond to the relative error with respect
to the Frobenius norm of the ROM state snapshot matrices. The resulting
error decays for both time integration methods are depicted in Figure 6.9
together with a reference line, which corresponds to a convergence order of
two. In particular, we observe that the convergence behavior of both methods
is very similar and the discrete gradient pair approach is almost as accurate
as the implicit midpoint rule. In addition, the numerical results indicate a
convergence order of two as it is to be expected for the implicit midpoint rule,
see for instance [80, sec. 6.3.2].
So far, we have only considered the case where the ROM is evaluated for the
same parameter setting as has been used in the offline phase for generating the
FOM snapshot data. At the end of this section, we also investigate the ability
of the ROM to handle parameter variations. For this purpose, we consider
variations of the diffusion coefficient dwithin the range M= [103.5,102.5],
see Figure 6.10 for corresponding pseudocolor plots of the FOM solutions for
the extreme values 103.5and 102.5. We note that the only FOM coefficient
matrix depending on dis Rh, cf. (6.11). Especially, Rhis given by the sum of
two matrices: The first one is linear with respect to d, while the second one
is independent of d. Consequently, the d-dependency of JhRhallows for a
194
6.2. Linear Advection–Diffusion Equation
0 0.5 1
0
0.5
1
1.5
ξ
t
0 0.5 1
0
0.5
1
1.5
ξ
t
0
0.2
0.4
x(t, ξ)
Figure 6.10.: Linear advection–diffusion equation: pseudocolor plots of the FOM solution
for d= 103.5(left) and d= 102.5(right).
103.4103.2103102.8102.6
0
0.05
0.1
diffusion coefficient d
relative error
Figure 6.11.: Linear advection–diffusion equation: ROM accuracy for different values of
the diffusion coefficient dwhen using a ROM with r= 3 transformed modes
based on FOM snapshots with d= 103.
separation similar to (2.24) with K= 2 and thus we may construct parameter-
dependent ROMs while achieving an efficient offline/online decomposition as
mentioned in Remark 4.3.10.
For investigating the parameter-dependent case, we start by considering the
hyperreduced ROM which has been the subject of the discussion after (6.14),
i.e., it is based on r= 3 modes, which have been determined based on snapshots
of the FOM with d= 103. As already mentioned, the corresponding relative
online error is 9.9·103when using the same parameter value as in the offline
phase. In Figure 6.11 it is depicted how the error changes as the diffusion
coefficient is varied within the range M= [103.5,102.5]. As expected, the
error is relatively small in the vicinity of d= 103but increases towards the
boundaries of M. The error attains its maximum value of about 0.14 at the
right boundary, which corresponds to the highest amount of diffusion and thus
also to the strongest deformation of the wave profiles, cf. Figure 6.10, right.
In the following, we construct an alternative parameter-dependent ROM by
195
6. Numerical Examples
employing the greedy algorithm introduced in section 3.2. To this end, an
error indicator is required to decide in each iteration which parameter value
is used for determining a new mode, cf. Algorithm 3.3. From the perspective
of computational efficiency, it would be in general desirable to use an error
indicator whose evaluation does not scale with the dimension of the FOM.
This is for instance the case when using a residual-based error bound as in
Theorem 4.1.9, provided that the residual norm is approximated by combining
the approaches outlined in Remarks 4.3.10 and 4.3.11. Accordingly, we have
implemented and tested error bounds of the form (4.10) using different methods
for bounding the matrix exponential, cf. Remark 4.1.10. However, the resulting
error bounds appear to be not useful for the problem at hand, since the error
bound is mostly orders of magnitudes larger than the actual error. Moreover,
the parameter dependency of the true error is also not well captured by the
corresponding error bound. Consequently, in the following we consider the true
relative online error as error indicator for the greedy algorithm, while exploiting
that the evaluation of the FOM is already quite fast for the problem at hand,
cf. the discussion after (6.14). Furthermore, we choose the logarithmically
equidistant grid
Mtrain ={103.5,103.4,...,102.5}
as parameter training set and an error tolerance tol = 0.05. Besides, as initial
ansatz vector we choose one mode that is determined based on snapshots of
the FOM with d= 103, which corresponds to the logarithmic middle of the
considered parameter domain.
During the greedy procedure, one mode is added after another until the
tolerance is met. For the corresponding ROMs constructed during this iterative
procedure, a plot of the relative online error over the diffusion coefficient is
provided in Figure 6.12. In particular, the black circles indicate the respective
worst-case parameter values, cf. µmax in Algorithm 3.3. The greedy algorithm
terminates after 3iterations, since the resulting ROM with r= 4 achieves a
relative online error of less than tol for all parameter values in the training set
Mtrain. Moreover, we observe that the ROM with r= 3 has a maximum error
of approximately 0.08 and thus outperforms the ROM considered in Figure 6.11
in terms of the maximum error. While both ROMs have the same dimension,
the latter has been constructed using only FOM snapshots with d= 103,
whereas the former is based on FOM snapshots with d= 103.5,d= 103, and
d= 102.5.
6.3. Nonlinear Reaction–Diffusion Equation
To illustrate the application of the hyperreduction framework for nonlinear
systems presented in section 4.3.2, we consider a nonlinear reaction–diffusion
problem on the one-dimensional spatial domain = (0,1) and on a time
interval I= [0, tend]with tend R>0. The corresponding governing equations
196
6.3. Nonlinear Reaction–Diffusion Equation
103.4103.2103102.8102.6
0
tol
0.1
0.15
0.2
0.25
0.3
0.35
diffusion coefficient d
relative error
r= 1
r= 2
r= 3
r= 4
Figure 6.12.: Linear advection–diffusion equation: relative online error for different mode
numbers and values of dwhen using the greedy algorithm from section 3.2 for
determining the modes. The black circles highlight the respective worst-case
parameter values.
read
tu(t, ξ) = d∂ξξu(t, ξ) + ζˆv(u(t, ξ), β)for all (t, ξ)I×,
u(t, 0) = 1 for all tI,
ξu(t, 1) = 0 for all tI,
u(0, ξ) = u0(ξ)for all ξ
with unknown normalized temperature u:I×Rand given diffusion
parameter dR>0, pre-exponential factor ζR>0, Arrhenius coefficient
βR>0, initial value u0:[0,1], and reaction rate function ˆv:R×RR
defined via
ˆv(u, β):=
0,if u0,
(1 u) exp(β
u),otherwise. (6.15)
This test case is based on the reaction–diffusion example presented in [118,
eq. (10.67)], but in contrast to [118] we do not explicitly constrain the state
uto only attain values between 0and 1. Nevertheless, we emphasize that
values outside of this range are not meaningful from a physical point of view.
Furthermore, in contrast to [118], we consider a diffusion coefficient dwhich
is not necessarily equal to one in order to obtain a transport-dominated test
case with a sharp reaction front. Finally, inspired by the wildland fire model
197
6. Numerical Examples
considered in Example 5.1.4, we set the threshold value in (6.15) to zero in
order to ensure that ˆvis smooth with respect to ufor any βR>0.
In order to obtain a system with equilibrium point at 0and homogeneous
boundary conditions, we introduce the new state variable x:=u1and obtain
the corresponding governing equations
tx(t, ξ) = d∂ξξx(t, ξ)ζv(x(t, ξ), β)for all (t, ξ)I×,
x(t, 0) = 0 for all tI,
ξx(t, 1) = 0 for all tI,
x(0, ξ) = x0(ξ):=u0(ξ)1for all ξ
(6.16)
with v:R×RRdefined via
v(x, β):=ˆv(x+ 1, β) =
0,if x 1,
xexp(β
x+1 ),otherwise. (6.17)
Then, using integration by parts, we obtain a corresponding weak formulation
as follows: Find x:I×Rsuch that
(i) for all tI,x(t, ·)is in V:={zH1(Ω) |z(0) = 0}and satisfies
hψ, tx(t, ·)iL2(Ω) =dhψ0, ξx(t, ·)iL2(Ω) ζhψ, v(x(t, ·), β)iL2(Ω)
for all ψ V,
(ii) for all ξ, we have x(0, ξ) = x0(ξ).
Furthermore, similarly as in section 6.2, we introduce the associated Hamil-
tonian H:L2(Ω) Rvia H(x):=1
2kxk2
L2(Ω) and derive in the following a
corresponding dissipation inequality. To this end, let x:I×Rbe a solu-
tion of the weak formulation of (6.16) such that the mapping t7→ x(t, ·)is in
C1(I, L2(Ω)). Moreover, analogously as in section 6.2, we consider the func-
tion Hs:IRdefined via Hs(t):=H(x(t, ·)) and obtain the corresponding
dissipation inequality
dHs
dt(t) = H0(x(t, ·))(tx(t, ·)) = hx(t, ·), tx(t, ·)iL2(Ω)
=dkξx(t, ·)k2
L2(Ω) ζhx(t, ·), v(x(t, ·), β)iL2(Ω) 0
for all tI, where the last inequality follows from the definition of vin (6.17).
Thus, we infer that the Hamiltonian does not increase along solution trajecto-
ries. This property is also reflected in a dissipative Hamiltonian structure on
the semi-discretized level, cf. the upcoming equation (6.24).
For the semi-discretization in space, we use the Galerkin FEM with mesh
size h=1
N+1 ,NN, analogously as in the preceding two sections. The
198
6.3. Nonlinear Reaction–Diffusion Equation
resulting semi-discretized system is given by
Eh˙xh(t) = R1,hxh(t)vh(xh(t)) for all tI,
where xh:IRN+1 contains the coefficients corresponding to the FEM ansatz
functions. Moreover, the coefficient matrices Eh, R1,h RN+1,N+1 are given by
Eh:=h
6
4 1 0 ··· 0
1 4 .......
.
.
0......1 0
.
.
....1 4 1
0··· 0 1 2
, R1,h :=d
h
21 0 ··· 0
1 2 .......
.
.
0......1 0
.
.
....1 2 1
0··· 01 1
(6.18)
and the nonlinearity vh:RN+1 RN+1 is defined via
[vh(xh)]i:=ζ*ψi, v
N+1
X
j=1
[xh]jψj, β
+L2(Ω)
=ζZ
b
Ω(xh)
ψi(ξ) exp β
1 + PN+1
k=1 [xh]kψk(ξ)!N+1
X
j=1
[xh]jψj(ξ) dξ
=ζ
N+1
X
j=1
[xh]jZ
b
Ω(xh)
ψi(ξ)ψj(ξ) exp β
1 + PN+1
k=1 [xh]kψk(ξ)!dξ
(6.19)
for i= 1, . . . , N + 1, with ψ1, . . . , ψN+1 denoting the usual hat functions with
ψi(jh) = δij for i= 1, . . . , N + 1 and j= 0, . . . , N + 1. Here, b
Ω(xh)
denotes the subset of the spatial domain where the reaction is active, i.e., the
mapping b
: RN+1 P(Ω) is defined via
b
Ω(xh):=
ξ
N+1
X
j=1
[xh]jψj(ξ)>1
,(6.20)
cf. (6.17). Since the evaluation of vhas defined in (6.19) still involves an
integral which depends on the unknown xh, we propose to approximate this
integral by means of the composite trapezoidal rule. As quadrature points, we
use the FEM grid points, while setting the weights of all grid points which lie
outside of b
Ω(xh)to zero. The resulting system reads
Eh˙xh(t) = R1,hxh(t)˜vh(xh(t)) for all tI,(6.21)
199
6. Numerical Examples
where ˜vh:RN+1 RN+1 is defined via
[˜vh(xh)]i:=ζ
N+1
X
j=1
[xh]j
N+1
X
k=1
ˆωk(xh)ψi(kh)ψj(kh) exp β
1 + PN+1
`=1 [xh]`ψ`(kh)!
=ζ
N+1
X
j=1
[xh]j
N+1
X
k=1
ˆωk(xh)δikδjk exp β
1 + PN+1
`=1 [xh]`δk` !
=ζ[xh]iˆωi(xh) exp β
1+[xh]i!,if [xh]i6=1,
[˜vh(xh)]i:= 0,otherwise
(6.22)
for i= 1, . . . , N + 1 and the weights ˆω1,...,ˆωN+1 :RN+1 R0are given by
ˆω1(xh):=
h, if ([xh]1,[xh]2)(R>1)2,
h
21 + [xh]1+1
[xh]1[xh]2,if ([xh]1,[xh]2)R>1×R≤−1,
0,otherwise,
ˆωN+1(xh):=
h
2,if ([xh]N,[xh]N+1)(R>1)2,
h
2
[xh]N+1+1
[xh]N+1[xh]N,if ([xh]N,[xh]N+1)R≤−1×R>1,
0,otherwise,
ˆωk(xh):=
h, if ([xh]k1,[xh]k,[xh]k+1)(R>1)3,
z1,k(xh),if ([xh]k1,[xh]k,[xh]k+1)R≤−1×R>1×R≤−1,
z2,k(xh),if ([xh]k1,[xh]k,[xh]k+1)(R>1)2×R≤−1,
z3,k(xh),if ([xh]k1,[xh]k,[xh]k+1)R≤−1×(R>1)2,
0,otherwise,
z1,k(xh):=h
2([xh]k+ 1) 1
[xh]k[xh]k1
+1
[xh]k[xh]k+1 !,
z2,k(xh):=h
2 1 + [xh]k+ 1
[xh]k[xh]k+1 !,
z3,k(xh):=h
2 1 + [xh]k+ 1
[xh]k[xh]k1!
(6.23)
for k= 2, . . . , N, see appendix D.2 for the corresponding derivation. Based on
(6.22) we may write (6.21) equivalently as
Eh˙xh(t) = (R1,h +R2,h(xh(t)))xh(t)for all tI,(6.24)
where R2,h :RN+1 RN+1,N+1 is pointwise diagonal and defined via
[R2,h(xh)]i,i :=
ζˆωi(xh) exp β
1+[xh]i,if [xh]i6=1,
0,otherwise (6.25)
200
6.3. Nonlinear Reaction–Diffusion Equation
0 0.5 1
0
100
200
ξ
t
1
0
x(t, ξ)
0 0.5 1
1
0
ξ
t= 0 t= 60 t= 120 t= 180 t= 240
Figure 6.13.: Nonlinear reaction–diffusion equation: pseudocolor plot of the FOM solution
(left) and some selected snapshots (right).
for i= 1, . . . , N +1. In particular, based on (6.18) and (6.25), we note that Eh
is symmetric and positive definite, R1,h is symmetric and positive semi-definite,
and R2,h is pointwise symmetric and positive semi-definite. Consequently, the
semi-discretized system (6.24) has a dissipative Hamiltonian structure with
Hamiltonian Hh:RN+1 Rdefined via Hh(xh):=1
2x>
hEhxh. Especially, as
in the preceding two sections, the Hamiltonian is a quadratic function and
hence we may apply the implicit midpoint rule for the time discretization to
ensure a dissipation inequality on the time-discrete level.
For the following numerical experiments, we choose the PDE parameters as
d= 5 ·105,ζ= 1, and β= 1, the final time as tend = 250, and the initial
value as
x0(ξ) =
0,if ξ[0,0.1],
exp(1
110ξ)
exp(1
110ξ)+exp(1
10ξ2),if ξ(0.1,0.2),
1,otherwise
for all ξ, cf. Figure 6.13, right. Furthermore, we divide the spatial domain
into N+ 1 = 1000 equidistant intervals, which corresponds to a mesh size of
h= 103, and for the time discretization we use a step size of 0.2. Figure 6.13
depicts the corresponding numerical solution by means of a pseudocolor plot
and some selected snapshots. We observe that the initial wave profile is trans-
ported to the right with an approximately constant wave speed. There is a
slight change in the shape of the wave at the beginning of the time interval,
which may be seen for instance by comparing the initial snapshot with the
one corresponding to t= 60. However, afterwards the shape remains nearly
constant.
As in the previous sections, we employ the residual minimization approach
from section 3.1 for the mode determination based on the FOM snapshot data.
201
6. Numerical Examples
In particular, we use W=Ehdefined in (6.18) as spatial weighting matrix and
we consider an approximation based on r= 1 mode with dimension dφ=N+1.
Furthermore, we use a discretized version of the constant extrapolation shift
operator defined in Definition 3.3.8 and restrict it to functions which vanish at
the left boundary. Due to this restriction, the resulting transformation family
may also be regarded as a combination of the zero padding shift operator
for non-negative shift values and the constant extrapolation shift operator for
negative shift values. This choice reflects the homogeneous Dirichlet condition
on the left boundary and the homogeneous Neumann condition on the right
boundary in (6.16). For the discretization, we use cubic splines such that
the resulting family of discretized shift operators Tc,h :R L(RN+1)may be
specified as
Tc,h(η)φ:=T1(q(η))φ+T2(q(η))a(φ)ζ(η)b(φ) + ζ(η)2c(φ)ζ(η)3d(φ)
with q(η):=jη
hk,ζ(η):=ηmod h,
T1(q):=
0RN+1,N+1,if q0,
0 0
01min(N+1,q)
RN+1,N+1,otherwise,
T2(q):=
0 0
Imax(N+1q,0) 0
RN+1,N+1,if q0,
0Imax(N+1+q,0)
0 0
RN+1,N+1,otherwise.
(6.26)
Here, a(φ), b(φ), c(φ), d(φ)RN+1 are vectors containing the spline coefficients
corresponding to φ, see appendix D.1.3 for their definition and a derivation of
Tc,h from Tc.
In contrast to the test cases considered in the previous two sections, the wave
propagation observed in Figure 6.13 is not caused by an advection term in the
corresponding PDE, but instead originates from an interplay between reaction
and diffusion. Consequently, the wave speed may not be simply read from the
PDE in (6.16), which prevents us from choosing the offline shift values in the
same way as in sections 6.1 and 6.2. Instead, we determine them by finding
the minimum of the spatial derivative of each FOM snapshot to identify the
position of the reaction front depicted in Figure 6.13, right. To this end, we
use a simple forward finite difference scheme for obtaining an approximation
of the first derivative of each snapshot.
As in the previous sections, we use the variable projection approach from sec-
tion 3.1.2 for the actual computation of the modes and solve the corresponding
optimization problem by means of the GRANSO software with default settings.
For the starting values, we use similarly as in section 6.1 the first left singular
202
6.3. Nonlinear Reaction–Diffusion Equation
vector of the transformed snapshot matrix
hTc,h(p(t1))xh(t1)··· Tc,h(p(tq))xh(tq)i.
This seems to be a natural way of aligning the FOM snapshots, cf. Figure 6.13.
Based on these starting values, the GRANSO solver terminates after 184 iter-
ations and the resulting relative offline error is about 2.7·103.
We proceed to construct a ROM of the form (4.1)–(4.2) based on the de-
termined mode φ1and using the derivative of Tc,h provided in (D.14) in ap-
pendix D.1.3. As in the previous two sections, we slightly differ from the
setting in section 4.1 and use the weighted E1
h-norm for the residual mini-
mization, cf. section 5.2. Especially, we note that although Theorem 5.2.1 is
stated for a FOM with constant coefficient matrices, the theorem and its proof
straightforwardly extend to the case of a state-dependent dissipation matrix.
Consequently, we may also apply this theorem here, which yields that the
ROM is dissipative Hamiltonian. The resulting ROM with r= 1 has the form
"Mα(p)αN(p)
αN(p)α2Mp(p)#"˙α
˙p#="Aα(p)
αAp(p)#α+"vα(α, p)
αvp(α, p)#,(6.27)
where Mα,N, and Mpare obtained as in (4.2), but the occurring inner prod-
ucts need to be replaced by weighted inner products with weighting matrix
Eh. Furthermore, Aαand Apare as specified in (4.37) with F=R1,h and
vα, vp:R×RRare defined as
vα(α, p):=φ>
1Tc,h(p)>˜vh(Tc,h(p)φ1α),
vp(α, p):=φ>
1T0
c,h(p)>˜vh(Tc,h(p)φ1α).
As discussed in section 4.3, the evaluation of the ROM still scales with the
dimension of the FOM, since Mα,N,Mp,Aα,Ap,vα, and vpinvolve prod-
ucts of high-dimensional matrices and vectors. Moreover, since the FOM is
nonlinear, this is also reflected in vαand vp, whose evaluations involve the
FOM nonlinearity ˜vh, and this is a second reason which prevents an efficient
evaluation of the ROM. As outlined in sections 4.3.1 and 4.3.2, we treat these
two issues differently. In particular, the path-dependent coefficients Mα,N,
Mp,Aα, and Apare treated as in section 6.2, i.e., we sample them and con-
struct corresponding piecewise linear interpolants. For this purpose, we use a
sampling range of [0,1] and an equidistant grid whose grid size coincides with
the one used for the FEM discretization.
To approximate the functions vαand vp, we follow the approach outlined
in section 4.3.2. We note that the framework in section 4.3.2 is presented for
a PDE as FOM and in particular requires the evaluated nonlinearity to be
a function defined on the spatial domain . Instead, here we have formally
derived the ROM (6.27) based on the finite-dimensional FOM (6.21). Never-
theless, since (6.21) is obtained via a spatial semi-discretization of (6.16) based
203
6. Numerical Examples
on piecewise linear FEM basis functions, the entries of all occurring vectors
of dimension N+ 1 correspond to evaluations of continuous functions at the
FEM grid points. This allows us to still apply the hyperreduction framework
from section 4.3.2 by interpolating between the grid points if necessary.
As the first step of the approach from section 4.3.2, we approximate the FOM
nonlinearity ˜vhbased on a linear combination of transformed EIM ansatz
vectors or modes. These are obtained based on snapshot data of the FOM
nonlinearity, see Figure 6.14 for a depiction of the corresponding snapshots
and note that ˜vhcorresponds to the term v(·,1) on the infinite-dimensional
level, cf. (6.16) with ζ= 1 and β= 1. In particular, we apply the residual
minimization approach from section 3.1 with one mode and using the same
paths and transformation operators as for the snapshots of the state. Also the
starting value is determined similarly as for the state snapshots, i.e., we choose
the first left singular vector of the transformed snapshot matrix
hTc,h(p(t1))˜vh(xh(t1)) ··· Tc,h(p(tq))˜vh(xh(tq))i.
In fact, this starting value appears to be already very close to a local minimum
point as the GRANSO optimization solver does not perform any iteration, but
instead immediately terminates, since the stationarity tolerance is already met.
The resulting relative approximation error is around 0.026.
Based on the determined EIM mode ψ1,1, we proceed to approximate the
ROM (6.27) as outlined in section 4.3.2. In particular, we replace vα(α, p)
and vp(α, p)in (6.27) by ˆ
Aα(p)βand ˆ
Ap(p)β, respectively, where ˆ
Aαand ˆ
Ap
are as specified in (4.48) and their efficient offline/online decomposition works
analogously as for Mα,N,Mp,Aα, and Ap. Here, β:IRis given by the
solution of the linear equation system
AEIM(p(t))β(t) = b(α(t), p(t)) (4.47c)
for all tI. As stated in (4.46), the definitions of AEIM and binvolve the
EIM collocation point p1,1(p). For the latter we use the update rule (4.52),
i.e., p1,1(p) = p1,1(0) + p. Furthermore, for determining p1,1(0) we apply the
Q-DEIM algorithm introduced in [83] to the EIM mode ψ1,1. We note that the
algorithm in [83] actually outputs a value b
p1,1(0) {1, . . . , N + 1}, whereas
p1,1(0) as introduced in section 4.3.2 is formally an element of . However,
since the entries of ψ1,1RN+1 correspond to the FEM grid points, we may
relate b
p1,1(0) and p1,1(0) via p1,1(0) = hb
p1,1(0). Moreover, since pis a scalar-
valued function for the problem at hand, we may follow Remark 4.3.18 to infer
that AEIM is constant with respect to pand especially given by
AEIM(p) = [ψ1,1]bp1,1(0) .
Formally, this is only valid as long as p1,1(p) = p1,1(0) + pis in , cf. Re-
mark 4.3.18, but in our numerical experiments this assumption turns out to
204
6.3. Nonlinear Reaction–Diffusion Equation
0 0.5 1
0
100
200
ξ
t
0
0.02
0.04
0.06
v(x(t, ξ),1)
0 0.5 1
0
0.02
0.04
0.06
ξ
t= 0 t= 60 t= 120 t= 180 t= 240
Figure 6.14.: Nonlinear reaction–diffusion equation: pseudocolor plot of the FOM nonlin-
earity (left) and some selected snapshots (right).
be always satisfied. Especially, since AEIM is constant with respect to p, it may
be precomputed in the offline phase.
To also derive an efficient way of evaluating the right-hand side bin (4.47c),
we first observe that the ith component of ˜vh(xh)defined via (6.22)–(6.23)
only depends on the three components [xh]i1,[xh]i, and [xh]i+1, respectively
for i= 2, . . . , N. Thus, we may write [˜vh(xh)]ias
[˜vh(xh)]i= ˆvh([xh]i1,[xh]i,[xh]i+1),(6.28)
for i= 2, . . . , N, where ˆvh:R3Ris defined via
ˆvh(η1, η2, η3):=
0,if η2=1,
ζη2ˆω(η1, η2, η3) exp β
1+η2,otherwise,
ˆω(η1, η2, η3):=
h, if (η1, η2, η3)(R>1)3,
z1(η1, η2, η3),if (η1, η2, η3)R≤−1×R>1×R≤−1,
z2(η1, η2, η3),if (η1, η2, η3)(R>1)2×R≤−1,
z3(η1, η2, η3),if (η1, η2, η3)R≤−1×(R>1)2,
0,otherwise,
z1(η1, η2, η3):=h
2(η2+ 1) 1
η2η1
+1
η2η3!,
z2(η1, η2, η3):=h
2 1 + η2+ 1
η2η3!, z3(η1, η2, η3):=h
2 1 + η2+ 1
η2η1!,
cf. (6.22)–(6.23). Moreover, using (6.28) and similar arguments as in Re-
mark 4.3.18, we infer that bis constant with respect to pand given by
b(α, p) = ˆvhα[φ1]bp1,1(0)1, α [φ1]bp1,1(0) , α [φ1]bp1,1(0)+1.
205
6. Numerical Examples
Here, we have assumed that p1,1(p) = p1,1(0) + pis in [2h, Nh](and
note that this assumption indeed turns out to be satisfied in our numerical
experiments. In particular, bmay be efficiently evaluated in the online phase,
since ˆvhdoes not scale with the FOM dimension in contrast to ˜vh.
For comparing the ROM (6.27) and its hyperreduced counterpart, we use as
in sections 6.1 and 6.2 the implicit midpoint rule for time integration and the
same parameter values which have been used for generating the FOM snapshot
data in the offline phase. In particular, the evaluation of the non-hyperreduced
ROM takes approximately four seconds, whereas the hyperreduced ROM re-
quires only a computation time of about 2.5seconds. This corresponds to a
relative speed-up of roughly 1.6, while the error of the hyperreduced ROM is
around 5.6·103and thus only slightly larger than the 5·103obtained by the
non-hyperreduced ROM. Moreover, both ROMs are significantly faster than
the corresponding FOM, which requires a computation time of approximately
five minutes. We note that, in contrast to the test cases considered in sec-
tions 6.1 and 6.2, the reaction–diffusion FOM (6.21) is nonlinear and hence
its evaluation involves solving a high-dimensional nonlinear equation system
in each time step.
We also compare the ROMs based on transformed modes with a ROM based
on a classical approach using POD and DEIM, cf. section 2.5. In particular,
a ROM based on 40 POD and 60 DEIM modes yields a relative online error
of around 6.1·103, which is thus comparable to the error obtained via the
hyperreduced ROM based on transformed modes. However, the evaluation of
the POD/DEIM-based ROM takes about 9.6seconds, which corresponds to
almost four times the computation time required for the hyperreduced ROM
based on transformed modes. Another advantage of the latter ROM is that it
allows for more flexibility of choosing the time step size, similarly as reported
in sections 6.1 and 6.2. For example, the ROMs based on transformed modes
yield a relative error of roughly one per cent when increasing the time step
size from 0.2to 50. The fact that this large increase of the time step size is
only accompanied by a rather small increase of the error, may be explained
by the observation that the path pis nearly a linear function of time and
the amplitude αis almost constant, see also the corresponding discussion for
the wave equation test case in section 6.1. Also the POD/DEIM-based ROM
allows to significantly increase the time step size without leading to a significant
increase of the error. However, this effect is less pronounced than for the ROMs
based on transformed modes. For instance, the POD/DEIM-based ROM yields
a relative error of more than 0.1when increasing the time step size to 10.
Remark 6.3.1 (Choice of the number of DEIM modes for the POD-based
ROM).The number of DEIM modes used for the hyperreduction of the POD-
based ROM has been chosen such that the relative error between the hyper-
reduced and non-hyperreduced ROM is smaller than 105. Consequently, this
error is negligible in comparison to the error between the non-hyperreduced
ROM and the full-order model. However, in our numerical experiments we
206
6.3. Nonlinear Reaction–Diffusion Equation
have observed that decreasing the number of DEIM modes may lead to a
decreasing error with respect to the FOM, even though the corresponding er-
ror with respect to the non-hyperreduced ROM increases at the same time.
For instance, a POD/DEIM-based reduced-order model with 40 POD and 40
DEIM modes yields a relative error of 2.8·103in comparison to the FOM.
This is significantly smaller than the 6.1·103obtained using 40 POD and 60
DEIM modes, even though the latter corresponds to a better approximation
of the non-hyperreduced ROM. Thus, it appears that the performance of the
POD/DEIM-based reduced-order model could be improved by exploiting this
counterintuitive behavior. However, since it is in general not clear how the
number of DEIM modes may be chosen to minimize the error with respect to
the FOM, we omit further numerical experiments in this direction. ¨
We close this section by briefly discussing the compliance with the power
balance associated with the considered reaction–diffusion test case. As pointed
out after (6.25), the FOM (6.21) allows for a dissipative Hamiltonian repre-
sentation of the form (6.24). Consequently, an energy-consistent ROM should
have the property that the associated Hamiltonian does not increase over time.
Indeed, the ROM (6.27) may be formulated as a dissipative Hamiltonian sys-
tem, which follows from the fact that we have used the weighted E1
h-norm for
the residual minimization and from the considerations in section 5.2. Thus,
the ROM is energy-consistent in the sense that the associated Hamiltonian is
non-increasing with respect to time. However, as illustrated in sections 6.1
and 6.2, this property is not necessarily preserved after discretization in time,
at least not when using the implicit midpoint rule. Nevertheless, in our nu-
merical experiments we have not observed any increase of the Hamiltonian
associated with the ROM (6.27), even when using the implicit midpoint rule
and significantly increasing the time step size. Accordingly, we omit a cor-
responding numerical comparison between the implicit midpoint rule and the
discrete gradient pair approach from appendix C.2.
207
7. Conclusion
We close this thesis by providing a summary in section 7.1 and by addressing
some future research directions in section 7.2.
7.1. Summary
In this thesis we consider a specific nonlinear model order reduction (MOR)
approach which is suitable for an effective reduction of transport-dominated
systems. To this end, we approximate the state of the full-order model (FOM)
by a linear combination of transformed modes, where the transformation op-
erators are parametrized by time-dependent path variables. In the examples
presented in this thesis, we mostly consider translation or shift operators on
one-dimensional spatial domains and in this case the paths correspond to
the respective shift amounts. As a consequence, this class of approximation
ansatzes may result in very low-dimensional and accurate reduced-order mod-
els even for transport-dominated problems which are challenging for classical
methods based on linear approximation ansatzes. Moreover, the new model
reduction framework allows to preserve important system properties such as
stability and passivity by exploiting a port-Hamiltonian (pH) representation
of the full-order model.
The determination of suitable modes based on snapshot data of the FOM
is subject of chapter 3. For this purpose, we propose to solve an optimiza-
tion problem such that the approximation error between the snapshot data
and the corresponding approximation based on transformed modes is mini-
mized. Especially, we assume the paths to be given or determined in a pre-
processing step and present two different optimization approaches. The first
one is based on directly solving the full optimization problem in terms of
the modes and the corresponding amplitudes. The second approach uses the
variable projection method, which results in a reduced problem where the opti-
mization parameters consist only of the modes. Furthermore, for the treatment
of parameter-dependent problems, we extend the classical POD-greedy algo-
rithm to nonlinear approximation approaches based on transformed modes. In
particular, the resulting algorithm determines one mode after another based
on adaptively chosen parameter samples and uses the reduced optimization
approach obtained via variable projection. While the techniques mentioned
so far are presented for a general class of transformation operators, we also
discuss some special classes of shift operators for one-dimensional problems
with different boundary conditions. In addition, we compare the proposed
209
7. Conclusion
optimization framework with some related techniques from the literature.
In chapter 4 we demonstrate how to construct a reduced-order model (ROM)
using a nonlinear approximation ansatz based on transformed modes. To this
end, we assume the modes to be given and derive a ROM by minimizing the
residual which is obtained by substituting the approximation ansatz into the
full-order model. The state of the resulting ROM consists of the amplitudes
and the paths. Moreover, the ROM is in general nonlinear and involves a
state-dependent mass matrix, which is due to the nonlinearity of the used ap-
proximation ansatz. In general, it is not guaranteed that this mass matrix is
nonsingular and thus we present a corresponding regularization approach to
ensure its invertibility. Furthermore, we discuss the solvability of the ROM and
present an a posteriori error bound for a special class of linear full-order mod-
els. In addition, we compare a special case of the presented model reduction
approach with a framework based on symmetry reduction and demonstrate a
relation between the corresponding optimization problems. Finally, since the
evaluation of the ROM scales in general with the dimension of the FOM, we
present a hyperreduction approach, which is based on an additional approxi-
mation of the ROM to ensure an efficient evaluation in the online phase. In
particular, for the case that the FOM is nonlinear, we present a technique which
is based on ideas of the (discrete) empirical interpolation method ((D)EIM)
and uses an approximation of the FOM nonlinearity via a linear combination
of transformed ansatz functions.
Chapter 5 is devoted to structure-preserving model order reduction for port-
Hamiltonian systems in a general finite-dimensional setting. We consider dif-
ferent classes of pH systems for the FOM including linear, nonlinear, time-
invariant, and time-varying systems. Moreover, we discuss not only linear
approximation ansatzes but also two classes of nonlinear ansatzes, which in
particular include the class of approximation ansatzes considered in the previ-
ous chapters of this thesis. Furthermore, we demonstrate that in many cases
structure-preserving MOR may be achieved via a residual minimization ap-
proach by using a suitable weighted norm for the residual, especially if the
corresponding FOM is linear. The weighted norm used for the residual mini-
mization may be motivated by a residual-based error bound, where the error
is measured via another weighted norm which is related to the Hamiltonian.
Besides, we also discuss the stability of the ROMs and provide corresponding
sufficient conditions.
Finally, in chapter 6 we demonstrate the application of some of the methods
presented in chapters 3 to 5 by means of three numerical test cases. First,
we consider a linear wave equation with periodic boundary conditions and
obtain a corresponding reduced-order model based on two shifted modes. In
particular, the numerical experiments reveal that the time step size may be
considerably increased for this ROM without significantly increasing the er-
ror. Furthermore, we demonstrate that the Hamiltonian of the ROM is a
conserved quantity even after time discretization, when using a discrete gra-
210
7.2. Outlook
dient pair approach as outlined in appendix C. The second test case involves
an advection–diffusion equation with mixed Robin–Neumann boundary con-
ditions, which lead to incoming and outgoing waves at the boundaries. We
demonstrate the performance of the greedy algorithm presented in chapter 3
and obtain a parameter-dependent ROM which yields a decent approximation
quality over the considered range of parameter values. For the last test case,
we consider a nonlinear reaction–diffusion equation whose solution features a
traveling reaction front. Here, we apply the hyperreduction approach from
chapter 4 and obtain a ROM which may be efficiently evaluated and especially
outperforms a corresponding ROM based on POD and DEIM.
7.2. Outlook
The research fields of nonlinear and structure-preserving model order reduction
methods currently experience a significant increase in research efforts and there
are still many open questions to be addressed in the future. In the following
we mention some of them with a special focus on those which are the most
related to the topics of this thesis.
The modal decomposition approach presented in section 3.1 is largely in-
spired by the classical POD approach and extends it to approximation ansatzes
based on transformed modes as in (1.4). The POD method exists at least for
a couple of decades and in the mean time several variants have been proposed
in the literature. One of them is the so-called goal-oriented POD method,
which belongs to the goal-oriented MOR techniques briefly addressed in Re-
mark 2.5.1. This approach does not only target a good approximation of the
FOM state, but also an accurate approximation of some quantities of interest
which are derived from the state, cf. [41]. Such a goal-oriented variant may in
principle also be derived based on the approximation ansatz (1.4) and would
result in a different cost function to be minimized. Also a splitting of the
time interval as used in [41] may be applied to the approach discussed in sec-
tion 3.1. Such an extension may render the method more useful when applying
it to problems, where an accurate approximation of one or several quantities
of interest is required.
In contrast to the method presented in [41], the goal-oriented MOR ap-
proach introduced in [52] constructs basis functions which are not only tailored
for approximating some quantities of interest, but the proposed optimization
problem also involves the ROM dynamics. Extending this idea to the setting
considered in this thesis may have the potential to improve the online perfor-
mance of the ROMs. Especially, during the application of the MOR framework
based on transformed modes to the wildland fire model in [39], we have ob-
served that a small offline approximation error is not necessarily associated
with a small online error. Thus, a goal-oriented approach as the one presented
in [52] may be very useful for determining ansatz functions which lead to an
improved online performance.
211
7. Conclusion
A challenge which has not been extensively addressed in this thesis is the au-
tomatic identification of those transports which are the most relevant ones for
the dynamics. Also an algorithm for determining the mode numbers in a smart
way would be desirable in the future. The numerical examples considered in
chapter 6 all have in common that the relevant transports are clear either
from the physics or from the snapshot data. Besides, the choice of the mode
numbers is not very challenging for these examples: For the wave equation
test case in section 6.1, the analytical solution is known and provides a clear
suggestion for the mode numbers. Moreover, each of the other two examples
only involves one wave speed and thus there is just one mode number, which
may be gradually increased until a certain error tolerance is met. The situa-
tion may be significantly more complicated when more complex examples with
possibly higher-dimensional spatial domains are considered. Even for the one-
dimensional pulsed detonation combuster example considered in section 1.2.1,
the identification of the most relevant transports and a proper adjustment
of the mode numbers is far from trivial. Some approaches for identifying the
transports based on data are mentioned in Remark 3.1.2, whereas Remark 3.1.6
addresses some methods for choosing the mode numbers, once the transports
have been determined. Nevertheless, a unifying framework, where the trans-
formation operators, the corresponding path variables, the mode numbers, the
modes themselves, and the amplitudes are all determined in an automatic fash-
ion and only based on snapshot data, is still missing. Such a framework would
be particularly useful for applications where the physics is not well-understood
and, thus, the possibilities to exploit physical insights for determining good
choices for the transformation operators, mode numbers, etc. are limited.
The setting considered in this thesis is based on a FOM right-hand side which
maps from a subspace of the state space to the state space. In the context
of PDEs this approach corresponds to a strong formulation, whose solvability
requires in general stricter regularity assumptions than the corresponding weak
problem, see for instance Chapters 19 and 23 in [287]. Furthermore, the finite
element method and the corresponding Galerkin projection, as used for the
numerical experiments considered in chapter 6 of this thesis, are based on
the weak formulation. Thus, to be able to obtain the ROMs via a residual-
minimizing approximation of the corresponding infinite-dimensional problem,
it would be desirable to derive a residual-minimizing model reduction scheme
as outlined in section 4.1 for the case where the FOM corresponds to a weak
formulation. This may be achieved by considering the FOM state equation in
the dual space of the state space, i.e., in this case the right-hand side maps
from the state space to its dual and the time derivative of the state is regarded
as an element of the dual space, see [287, Ch. 23] for more details.
The major difficulty of applying the residual minimization framework to
such a setting is that the residual itself lives in the dual space and a resid-
ual minimization scheme would involve the dual norm of the residual. Even
though there are algorithms for numerically approximating the dual norm, see
212
7.2. Outlook
for instance [269], there is in general no analytical expression for the dual norm
available, which prevents the explicit derivation of necessary optimality con-
ditions. If the state space is a Hilbert space, the Riesz representation theorem
allows to identify the residual with an element in the state space, but even
then this element may in general not be expressed analytically. These issues
have already been observed in connection with residual-minimization-based
discretization techniques for PDEs, see for instance [79, sec. 2]. In this context,
some workarounds have been presented, see for example [63, 71, 76, 79, 151].
An interesting future research direction is to investigate if some of these ideas
presented for the discretization of PDEs may be also helpful for residual-
minimization-based model reduction using nonlinear approximation ansatzes.
In the context of structure-preserving model reduction schemes for port-
Hamiltonian systems, a thorough investigation of general nonlinear approxi-
mation ansatzes is still missing. Such considerations would be especially im-
portant for the recently proposed model reduction schemes which are based
on neural network architectures. In Remark 5.0.2 we sketch one possible ap-
proach in this direction, which is based on a constrained optimization prob-
lem. Recently, a structure-preserving nonlinear MOR scheme for classical
Hamiltonian systems has been proposed in [50], which is based on project-
ing the Hamiltonian FOM onto a symplectic manifold. Furthermore, next to
the structure-preserving nonlinear model reduction itself, it would be also de-
sirable to develop corresponding hyperreduction techniques which ensure that
the hyperreduced ROM is port-Hamiltonian as well. A corresponding method
for linear approximation ansatzes has been presented in [65].
213
A. Properties of the Periodic Shift
Operator
Throughout this thesis, the family of periodic shift operators introduced in
Definition 1.2.2 is used in many examples. In this appendix, we discuss some
of its properties and, in particular, start with the following theorem, which
especially implies that Tper is a group homomorphism from Rto the automor-
phism group of L2(Ω).
Theorem A.1 (Basic properties of the periodic shift operator).Let ηR
and Ω=(a, b)with aRand bR>a be given and let Tper be as defined in
Definition 1.2.2. Then, the following assertions hold.
(i) The operator Tper(η)is linear and bounded.
(ii) The adjoint operator of Tper(η)is given by Tper(η).
(iii) For all η1, η2R, we have Tper(η1)Tper(η2) = Tper(η1+η2).
(iv) The operator Tper(η)is unitary.
Proof. (i) Let fL2(Ω) be arbitrary and let ˆ
f: Rbe a representative
of the equivalence class f. From Definition 1.2.2 it follows that Tper(η)f
L2(Ω) is given by the equivalence class of the function ˆg: Rdefined
via
ˆg(ξ):=
ˆ
f(ξ+baζ),if ξ(a, a +ζ),
ˆ
f(ξζ),otherwise (A.1)
with ζ:=ηmod (ba). Thus, ˆgdepends linearly on ˆ
fand this property
also translates to the corresponding equivalence classes, which shows that
Tper(η)is linear. Furthermore, we have
kTper(η)fk2
L2(Ω) =
b
Za
((Tper(η)f)(ξ))2dξ
=
a+ζ
Za
(f(ξ+baζ))2dξ+
b
Z
a+ζ
(f(ξζ))2dξ
=
b
Z
bζ
(f(ξ))2dξ+
bζ
Za
(f(ξ))2dξ=kfk2
L2(Ω) ,
215
A. Properties of the Periodic Shift Operator
which proves the boundedness of Tper(η).
(ii) Let f1, f2L2(Ω) be arbitrary. We need to show that the terms
hf1,Tper(η)f2iL2(Ω) =
a+ζ
Za
f1(ξ)f2(ξ+baζ) dξ+
b
Z
a+ζ
f1(ξ)f2(ξζ) dξ
=
b
Z
bζ
f1(ξ+ζ+ab)f2(ξ) dξ+
bζ
Za
f1(ξ+ζ)f2(ξ) dξ
(A.2)
with ζ:=ηmod (ba)and
hf2,Tper(η)f1iL2(Ω) =
a+ˆ
ζ
Za
f2(ξ)f1(ξ+baˆ
ζ) dξ+
b
Z
a+ˆ
ζ
f2(ξ)f1(ξˆ
ζ) dξ
(A.3)
with ˆ
ζ:= (η) mod (ba)coincide. To this end, we distinguish two
cases: If ηis an integer multiple of ba, then we have ζ=ˆ
ζ= 0 and in
this case the terms in (A.2) and (A.3) coincide. Otherwise, we have
ˆ
ζ=η(ba)η
ba=η+ (ba)η
ba
=η+ (ba)1 + η
ba=baζ,
which shows that also in this case (A.2) and (A.3) coincide. Thus, in total
we have shown hf1,Tper(η)f2iL2(Ω) =hf2,Tper(η)f1iL2(Ω), which proves
the claim.
(iii) First, we note that Definition 1.2.2 implies
Tper(ηi)f(ξ):=
f(ξ+baζi)for a.e. ξ(a, a +ζi),
f(ξζi)for a.e. ξ(a+ζi, b)(A.4)
with ζi:=ηimod (ba)for i= 1,2. In the following, we first consider
the case bζ1> a +ζ2or equivalently ba>ζ1+ζ2. Then, for given
fL2(Ω), (A.4) yields
(Tper(η1)Tper(η2)f)(ξ)
=
(Tper(η2)f)(ξ+baζ1)for a.e. ξ(a, a +ζ1),
(Tper(η2)f)(ξζ1)for a.e. ξ(a+ζ1, b)
=
f(ξ+baζ1ζ2)for a.e. ξ(a, a +ζ1+ζ2),
f(ξζ1ζ2)for a.e. ξ(a+ζ1+ζ2, b).
(A.5)
216
Similarly, for the complementary case bζ1a+ζ2or equivalently
baζ1+ζ2, we obtain
(Tper(η1)Tper(η2)f)(ξ)
=
f(ξ+ 2(ba)ζ1ζ2)for a.e. ξ(a, a +ζ1+ζ2(ba)),
f(ξ+baζ1ζ2)for a.e. ξ(a+ζ1+ζ2(ba), b).
(A.6)
The claim then follows from comparing (A.5) and (A.6) with the defini-
tion of Tper(η1+η2)fand from exploiting
(η1+η2) mod (ba)=(η1mod (ba) + η2mod (ba)) mod (ba)
= (ζ1+ζ2) mod (ba) =
ζ1+ζ2,if ζ1+ζ2< b a,
ζ1+ζ2(ba),otherwise.
(iv) The statement follows from (ii) and (iii) and from Tper(0) = IdL2(Ω).
In this thesis, we often consider the case where the argument of the periodic
shift operator Tper(η)with ηRis in a certain subspace of L2(Ω). An
important example is the space C1
per(Ω), which is even an invariant subspace
of Tper(η)as stated in Theorem A.2. Furthermore, for φC1
per(Ω) we have the
special property that Tper(η)φis continuously differentiable with respect to η,
see Theorem A.3 for the details.
Theorem A.2 (Invariant subspace of the periodic shift operator).Let ηR
and Ω=(a, b)with aRand bR>a be given and let Tper(η) L(L2(Ω)) be
as defined in Definition 1.2.2. Then, C1
per(Ω) is an invariant subspace of Tper(η)
in the sense that for any φC1
per(Ω) the equivalence class Tper(η)φL2(Ω)
has a representative in C1
per(Ω).
Proof. For given φC1
per(Ω), a representative of Tper(η)φL2(Ω) is given by
ψ:Rdefined via
ψ(ξ):=
φ(ξ+baζ),if ξ[a, a +ζ],
φ(ξζ),otherwise
with ζ:=ηmod (ba). Thus, it is sufficient to show that ψis an element of
C1
per(Ω). Since φis in C1
per(Ω), we have that ψis continuously differentiable
on [a, a +ζ)with derivative φ0(·+baζ)and on (a+ζ, b]with derivative
φ0(·ζ). Furthermore, the fact φC1
per(Ω) yields
lim
h&0
ψ(a+ζ+h)ψ(a+ζ)
h= lim
h&0
φ(a+h)φ(b)
h= lim
h&0
φ(a+h)φ(a)
h
217
A. Properties of the Periodic Shift Operator
=φ0(a) = φ0(b) = lim
h%0
φ(b+h)φ(b)
h= lim
h%0
ψ(a+ζ+h)ψ(a+ζ)
h,
i.e., ψis also differentiable at a+ζwith derivative φ0(a) = φ0(b). Thus, in
total we have shown that ψis in C1(Ω). Finally, due to the properties
ψ(a) = φ(bζ) = ψ(b)and ψ0(a) = φ0(bζ) = ψ0(b)
we infer that ψis in C1
per(Ω), which concludes the proof.
Theorem A.3 (Derivative of shifted function with respect to shift).Let
Ω=(a, b)with aRand bR>a be given and let Tper :R L(L2(Ω))
be as defined in Definition 1.2.2. Furthermore, let φbe an element of C1
per(Ω).
Then, the mapping g:RL2(Ω) defined via g(η):=Tper(η)φis continuously
differentiable with derivative g0(·) = −Tper(·)φ0.
Proof. For given ηR, we use Theorem A.1 and compute
lim
h0kg(η+h)g(η) + hTper(η)φ0kL2(Ω)
|h|
= lim
h0kTper(h)φφ+0kL2(Ω)
|h|
=v
u
u
u
tlim
h0
b
Za
(Tper(h)φ) (ξ)φ(ξ)
h+φ0(ξ)
2
dξ.
(A.7)
Furthermore, we introduce the periodic extension φper :RRof φvia
φper(ξ):=φ(a+ (ξa) mod (ba)) .
Especially, since φis in C1
per(Ω), the definition of φper implies φper C1(R).
Furthermore, using the definition of φper, (A.7) yields
lim
h0kg(η+h)g(η) + hTper(η)φ0kL2(Ω)
|h|
=v
u
u
u
tlim
h0
b
Za
φper(ξh)φper(ξ)
h+φ0
per(ξ)
2
dξ.
(A.8)
Besides, we have for all hR\{0}and ξ[a, b]the bound
φper(ξh)φper(ξ)
h+φ0
per(ξ)
2
φper(ξh)φper(ξ)
h+φ0
per(ξ)!2
4 max
ξ[a,b]φ0
per(ξ)!2
.
218
This bound allows us to apply the dominated convergence theorem, cf. [2,
Thm. 1.50], to the right-hand side of (A.8) and, hence, we obtain
lim
h0kg(η+h)g(η) + hTper(η)φ0kL2(Ω)
|h|
=v
u
u
u
t
b
Za
lim
h0
φper(ξh)φper(ξ)
h+φ0
per(ξ)
2
dξ= 0.
Consequently, gis differentiable with derivative g0(·) = −Tper(·)φ0and it re-
mains to show that the derivative is continuous. In fact, for fixed ηRand
by similar arguments as for the differentiability proof, we obtain the equation
lim
˜ηηkg0(η)g0(˜η)kL2(Ω) = lim
˜ηηk−φ0+Tper(˜ηη)φ0kL2(Ω)
= lim
h0k−φ0+Tper(h)φ0kL2(Ω) =v
u
u
u
tlim
h0
b
Zaφ0
per(ξh)φ0
per(ξ)2dξ
=v
u
u
u
t
b
Za
lim
h0φ0
per(ξh)φ0
per(ξ)2dξ= 0,
which shows that the derivative of gis continuous.
219
B. Linear Time-Varying
Port-Hamiltonian Systems
This appendix is devoted to the linear time-varying port-Hamiltonian formu-
lation introduced in (5.3), which deviates from the one originally proposed
in [25] as discussed at the beginning of chapter 5. In the following we show
that the pH formulation (5.3) still has some nice properties such as a dissi-
pation inequality and invariance with respect to a certain class of state space
transformations. The dissipation inequality is formally stated in the following
theorem.
Theorem B.1 (Dissipation inequality for linear time-varying pH systems of
the form (5.3)).Consider a pH system of the form (5.3) with time interval
I:= [t0, tend],t0R0,tend R>t0,E, Q, J, R, K C(R0,Rn,n), and B
C(R0,Rn,m)satisfying E>QC1(R0,Rn,n)and pointwise
E>Q=Q>E0, Q>RQ =Q>R>Q0,
and d
dt(Q>E) = Q>(KJQ)+(KJQ)>Q. (5.4)
Furthermore, we consider the associated Hamiltonian H C1(R0×Rn)de-
fined via (5.5) and let xC1(I,Rn)satisfy (5.3a) for a given input signal
u:R0Rm. Then, the mapping Hs:IRdefined via Hs(t):=H(t, x(t))
is continuously differentiable and satisfies the dissipation inequality
dHs
dt(t)u(t)>y(t)for all tI.
Proof. The continuous differentiability of Hsfollows from the continuous dif-
ferentiability of Hand x. Furthermore, using the chain rule and (5.4), we
obtain
dHs
dt(t) = tH(t, x(t)) + xH(t, x(t)) ˙x(t)
=1
2x(t)>d
dt(E>Q)(t)x(t) + x(t)>E(t)>Q(t) ˙x(t)
=1
2x(t)>d
dt(Q>E)(t)x(t) + x(t)>Q(t)>E(t) ˙x(t)
=1
2x(t)>Q(t)>(K(t)J(t)Q(t)) + (K(t)J(t)Q(t))>Q(t)x(t)
+x(t)>Q(t)>(((J(t)R(t))Q(t)K(t)) x(t) + B(t)u(t))
221
B. Linear Time-Varying Port-Hamiltonian Systems
=x(t)>Q(t)>R(t)Q(t)x(t) + y(t)>u(t)u(t)>y(t)
for all tI.
In Theorem B.2, it is stated that the properties which are characteristic for
the coefficient functions in (5.3) are preserved under transformations of the
form (B.1) with Vand Was specified in Theorem B.2. In particular, if V
is also pointwise invertible, then this transformation corresponds to a multi-
plication of the state equation (5.3a) from the left by W(t)>and a change of
variables of the form x(t) = V(t)ˆx(t)for all tI. Thus, Theorem B.2 implies
that the port-Hamiltonian structure (5.3)–(5.4) is invariant under such state
space transformations. Furthermore, Theorem B.2 provides a reason for the
fact that we do not assume Eand Qto be continuously differentiable, but only
E>Q. Since Wis not necessarily continuously differentiable, the continuous
differentiability of Eand Qis not necessarily preserved by a transformation
as considered in Theorem B.2. If we would include the continuous differentia-
bility of Eand Qin the definition of a linear time-varying port-Hamiltonian
system, then the invariance could be only ensured by more restrictive state
space transformations where Wis assumed to be continuously differentiable
as well. In this context, there appears to be a slight inconsistency in [25],
since the definition of a linear time-varying pH system used in [25], cf. [25,
Def. 5], explicitly includes the continuous differentiability of Eand Q, but this
property is in general not preserved by the transformations considered in [25,
Thm. 18].
An important special case of the port-Hamiltonian structure (5.3) is the case
where Qequals the identity matrix. In this case the conditions (5.4) simplify
to
E=E>0,˙
E=KJ+ (KJ)>, R =R>0.
Especially, whenever Qis pointwise invertible, we may transform the pH sys-
tem to an equivalent system with Q=In, for instance, via a transformation
of the form (B.1) with W=Qand V=In, see also [25, Rem. 14].
Theorem B.2 (Invariance of (5.3) under time-varying state space transfor-
mations).Let E, Q, J, R, K C(R0,Rn,n)and BC(R0,Rn,m)satisfy
E>QC1(R0,Rn,n)and pointwise
E>Q=Q>E0, Q>RQ =Q>R>Q0,
and d
dt(Q>E) = Q>(KJQ)+(KJQ)>Q. (5.4)
Then, for any VC1(R0,Rn,n)and pointwise invertible WC(R0,Rn,n),
the transformed matrix functions ˆ
E, ˆ
Q, ˆ
J, ˆ
R, ˆ
K:R0Rn,n and ˆ
B:R0
222
Rn,m defined via
ˆ
E:=W>EV, ˆ
Q:=W1QV, ˆ
J:=W>JW,
ˆ
K:=W>(KV +E˙
V),ˆ
B:=W>B, ˆ
R:=W>RW (B.1)
are continuous and satisfy ˆ
E>ˆ
QC1(R0,Rn,n)and pointwise
ˆ
E>ˆ
Q=ˆ
Q>ˆ
E0,ˆ
Q>ˆ
Rˆ
Q=ˆ
Q>ˆ
R>ˆ
Q0,
d
dt(ˆ
Q>ˆ
E) = ˆ
Q>(ˆ
Kˆ
Jˆ
Q)+(ˆ
Kˆ
Jˆ
Q)>ˆ
Q.
Proof. The continuity of ˆ
E,ˆ
Q,ˆ
J,ˆ
R,ˆ
K, and ˆ
Bfollows from the continuity
of E,Q,J,R,K,B,W, from the continuous differentiability of V, and from
the fact that the pointwise inverse of a pointwise invertible continuous matrix
function is itself a continuous matrix function, cf. the proof of Theorem 2.4.5.
Furthermore, we have ˆ
E>ˆ
Q=V>E>QV , which shows that ˆ
E>ˆ
Qinherits the
continuous differentiability and the pointwise symmetry and positive semi-
definiteness from E>Q. Consequently, we obtain the derivative of ˆ
Q>ˆ
Eas
d
dt(ˆ
Q>ˆ
E) = ˙
V>Q>EV +V>d
dt(Q>E)V+V>Q>E˙
V
=˙
V>Q>EV +V>Q>(KJQ)+(KJQ)>QV+V>Q>E˙
V
=V>Q>(KV +E˙
VJQV )+(KV +E˙
VJQV )>QV
=ˆ
Q>(ˆ
Kˆ
Jˆ
Q)+(ˆ
Kˆ
Jˆ
Q)>ˆ
Q.
Finally, due to the relation ˆ
Q>ˆ
Rˆ
Q=V>Q>RQV , the pointwise symmetry and
positive semi-definiteness of ˆ
Q>ˆ
Rˆ
Qfollows from the corresponding properties
of Q>RQ.
Similarly as for the nonlinear port-Hamiltonian structure considered in The-
orem 2.6.3, one can show that the unforced state equation (5.3a) with u= 0
has a uniformly stable equilibrium point in the origin, provided that the Hamil-
tonian Hsatisfies the condition (ii) in Definition 2.4.9. In the context of the
linear time-varying pH system (5.3), this is ensured if the singular values of
E>Qare uniformly bounded from below and from above, see the following
theorem and its proof for the details.
Theorem B.3 (Stability of linear time-varying port-Hamiltonian systems of
the form (5.3)).Consider the state equation (5.3a) with vanishing input u= 0
and E, K, J, R, Q C1(R0,Rn,n)and let
E>Q=Q>E0, Q>RQ =Q>R>Q0,
and d
dt(Q>E) = Q>(KJQ)+(KJQ)>Q(5.4)
223
B. Linear Time-Varying Port-Hamiltonian Systems
be satisfied pointwise. Furthermore, let there exist constants ˜c1,˜c2R>0with
σmax(E(t)>Q(t)) ˜c1and σmin(E(t)>Q(t)) ˜c2for all tR0.(B.2)
Then, the following assertions hold.
(i) For each initial value x0Rnand for any time interval I= [t0, tend]
with t0R0and tend R>t0, the initial value problem associated with
(5.3a),u= 0, and x(t0) = x0has a unique solution on I.
(ii) The point 0Rnis a uniformly stable equilibrium point of (5.3a) with
u= 0.
Proof. The proof follows similar arguments as the proof of Theorem 2.6.3 with
some modifications which exploit the fact that (5.3a) is a linear system and that
the associated Hamiltonian H:R0×RnRdefined in (5.5) is a quadratic
function with respect to its second argument.
(i) The assumption that the smallest singular value of E>Qis uniformly
bounded from below as in (B.2) implies that Eis pointwise invertible.
Consequently, the assumptions of Theorem 2.4.6 are satisfied, which al-
lows to conclude that for a given initial value x0Rnand for a given
initial time t0R0either the associated initial value problem is uniquely
solvable on I= [t0, tend]for any tend R>t0or that there is a maximal
existence interval [t0, δmax)with δmax R>t0and
lim
t%δmax kx(t)k=.(B.3)
Let us assume that for some (t0, x0)R0×Rnthe latter statement is
true. Then, the Hamiltonian H C1(R0×Rn)defined in (5.5) satisfies
lim
t%δmax H(t, x(t)) = ,(B.4)
which follows from (B.3) and from
H(t, x(t)) 1
2σmin E(t)>Q(t)kx(t)k2˜c2
2kx(t)k2
for all t[t0, δmax). However, (B.4) is a contradiction to the inequality
H(t, x(t)) H(t0, x0)which holds for any t[t0, δmax)and follows from
the dissipation inequality stated in Theorem B.1 in the case u= 0. In
fact, we may apply Theorem B.1 since the derivative of x, which satisfies
˙x(t) = E(t)1((J(t)R(t))Q(t)K(t)) x(t)
for all t[t0, δmax), is continuous on [t0, δmax)due to the continuity of
E,J,R,Q,K, and x. In total, we have shown that for any initial
224
value x0Rnand for any time interval I= [t0, tend]with t0R0and
tend R>t0, the associated initial value problem is uniquely solvable.
(ii) The linearity of (5.3a) implies that 0Rnis an equilibrium point of
(5.3a) with u= 0. Furthermore, due to (i), the assumptions of Theo-
rem 2.4.10 are satisfied. Besides, using (5.4) we obtain
xH(t, x) = E(t)>Q(t)x,
tH(t, x) + x>Q(t)>((J(t)R(t))Q(t)K(t)) x
=x> 1
2
d
dt(Q>E)(t) + Q(t)>((J(t)R(t))Q(t)K(t))!x
=x>Q(t)>(K(t)J(t)Q(t)) + Q(t)>((J(t)R(t))Q(t)K(t))x
=x>Q(t)>R(t)Q(t)x0
for all (t, x)R0×Rn, which shows that the Hamiltonian satisfies
condition (i) in Definition 2.4.9. In addition, (B.2) implies that also
condition (ii) in Definition 2.4.9 is satisfied and, thus, the Hamiltonian
is a globally quadratic Lyapunov function of (5.3a) with u= 0. Finally,
the claim follows by applying Theorem 2.4.10(i).
225
C. Discrete Gradient Schemes
The goal of this appendix is to introduce a time discretization scheme for non-
linear time-invariant port-Hamiltonian systems of the form (5.6) with pointwise
symmetric and positive definite E. This new approach is used in chapter 6
and it is based on ideas of discrete gradient methods for Hamiltonian sys-
tems, as introduced in [121] and continued for instance in [192]. We start in
appendix C.1 by considering the Hamiltonian case and then present the new
method for pH systems in appendix C.2.
C.1. Discrete Gradients for Hamiltonian Systems
In the following we explain the main ideas of discrete gradient methods by
following [192]. To this end, we consider a Hamiltonian system of the form
˙x(t) = J∇H(x(t)) for all tI,(C.1a)
x(0) = x0(C.1b)
with time interval I= [0, tend],tend R>0, state xC1(I,Rn), initial value
x0Rn, a skew-symmetric matrix JRn,n, and a Hamiltonian H C1(Rn).
We emphasize that in [192] the authors consider a more general class of sys-
tems where Jmay be state-dependent or have a non-vanishing symmetric part
which is negative semi-definite, but for explaining the main ideas it is suffi-
cient to consider (C.1). Due to the skew-symmetry of Jwe obtain that Hx
is a conserved quantity of the ODE system (C.1a), which follows from the
computation
d
dt(Hx)(t) = ∇H(x(t))T˙x(t) = ∇H(x(t))TJ∇H(x(t)) = 0 for all tI.
Discrete gradient methods are time integration schemes which allow to inherit
this important property also after discretization in time. For this purpose, we
call ∇H:Rn×RnRna discrete gradient of Hif
1. ∇H is continuous,
2. ∇H(x, x) = ∇H(x)holds for all xRn, and
3. ∇H(x, ˆx)T(ˆxx) = H(ˆx)H(x)holds for all (x, ˆx)Rn×Rn.
227
C. Discrete Gradient Schemes
An example for a discrete gradient is the midpoint discrete gradient
∇H(x, ˆx):=
∇Hˆx+x
2+H(ˆx)−H(x)−∇H(ˆx+x
2)>(ˆxx)
kˆxxk2(ˆxx),if x6= ˆx,
∇H(x),otherwise,
(C.2)
which has been originally introduced in [121]. Further examples are for in-
stance provided in [192], see also [90] for a discussion of some higher-order
discrete gradient schemes.
Discrete gradient methods may be used to obtain a time-discrete approxi-
mation of a time-continuous Hamiltonian system based on a time grid 0 = t1<
t2< . . . < tq=tend. For instance, by applying a discrete gradient scheme to
(C.1) based on a discrete gradient ∇H of H, we may obtain the time-discrete
system
˘xk+1 = ˘xk+ (tk+1 tk)J∇H˘xk,˘xk+1for k= 1, . . . , q 1,
˘x1=x0,(C.3)
where ˘xkis an approximation of the continuous-time state xat tkfor k=
1, . . . , q. The fact that His also a conserved quantity of the time-discrete
system follows from the definition of a discrete gradient and the computation
H˘xk+1H˘xk=∇H˘xk,˘xk+1>˘xk+1 ˘xk
= (tk+1 tk)∇H˘xk,˘xk+1>J∇H˘xk,˘xk+1>= 0
for k= 1, . . . , q 1. The consistency of the time integration scheme (C.3)
is addressed in Theorem C.1.1 for the case of a continuously differentiable
discrete gradient, see also [218] for a more detailed analysis based on weaker
assumptions. We omit the proof of Theorem C.1.1 since it is a special case
of the upcoming Theorem C.2.5 by considering a constant Jmatrix and by
setting E=E=In,R= 0,z=∇H, and B= 0.
Theorem C.1.1 (Consistency of (C.3)).Let nN,J=J>Rn,n,
x0Rn, and H C2(Rn)be given. Furthermore, for some tend R>0
and I= [0, tend], let x:IRnbe the solution of the initial value problem
(C.1). Besides, let ∇H C1(Rn×Rn,Rn)be a discrete gradient of Hand let
˘x: [0, )Rnfor some sufficiently small (0, tend)satisfy
˘x(h) = x0+hJ∇H(x0,˘x(h)) for all h[0, ).
Then, we have
lim
h&0
x(h)˘x(h)
h= 0.
228
C.2. Discrete Gradients for a Special Class of Port-Hamiltonian Systems
C.2. Discrete Gradients for a Special Class of
Port-Hamiltonian Systems
In this section we extend the idea of discrete gradients as considered in the
previous section to treat nonlinear time-invariant port-Hamiltonian systems of
the form
E(x(t)) ˙x(t) = (J(x(t)) R(x(t)))z(x(t)) + B(x(t))u(t),
y(t) = B(x(t))>z(x(t)),(5.6)
for all tI, with associated Hamiltonian H C1(Rn)and coefficient functions
E, J, R C(Rn,Rn,n),zC(Rn,Rn), and BC(Rn,Rn,m)satisfying point-
wise (5.7). While parts of the following discussion are valid without further
restrictions on E, we assume Eto be pointwise invertible for proving the con-
sistency of the considered time integration schemes. Furthermore, we mostly
study an abstract class of time integration schemes and we explicitly construct
a specific one only for the special case where Eis pointwise symmetric and
positive definite. On the other hand, the explicit construction of such time
integration schemes for the general case of a possibly singular Eis subject to
future research.
In contrast to the last section, the gradient of the Hamiltonian does not
explicitly appear in (5.6), but only implicitly via Eand z, which are related
to ∇H via the factorization ∇H =E>z, cf. (5.7). To account for this factor-
ization of ∇H, we extend the notion of a discrete gradient and introduce in
Definition C.2.1 the concept of a discrete gradient pair. An example is given
by the midpoint discrete gradient pair considered in Theorem C.2.2, which is
inspired by the midpoint discrete gradient (C.2) and requires Eto be pointwise
symmetric and positive definite.
Definition C.2.1 (Discrete gradient pair).Let H C1(Rn)with nNbe
given and let EC(Rn,Rn,n)and zC(Rn,Rn)satisfy
∇H(x) = E(x)>z(x)for all xRn.(C.4)
Then, we call (E, z)C(Rn×Rn,Rn,n)×C(Rn×Rn,Rn)adiscrete gradient
pair for (H, E, z)if the following conditions are satisfied.
(i) E(x, x) = E(x)for all xRn,
(ii) z(x, x) = z(x)for all xRn,
(iii) z(x, ˆx)>E(x, ˆx)(ˆxx) = H(ˆx)H(x)for all (ˆx, x)Rn×Rn.K
Theorem C.2.2 (Midpoint discrete gradient pair).Let H C1(Rn),E
C(Rn,Rn,n), and zC(Rn,Rn)with nNsatisfy (C.4). Furthermore, let
229
C. Discrete Gradient Schemes
Ebe pointwise symmetric and positive definite. Then, a discrete gradient pair
for (H, E, z)is given by E:Rn×RnRn,n and z:Rn×RnRndefined via
E(x, ˆx):=E1
2(ˆx+x),(C.5)
z(x, ˆx):=
z1
2(ˆx+x)+H(ˆx)−H(x)z(1
2(ˆx+x))>E(x,ˆx)(ˆxx)
(ˆxx)>E(x,ˆx)(ˆxx)(ˆxx),if ˆx6=x,
z(x),otherwise.
(C.6)
Proof. The fact that the conditions (i) and (ii) from Definition C.2.1 are sat-
isfied follows from the definitions of Eand z. Furthermore, a straightforward
computation exploiting the special construction of zyields that also condi-
tion (iii) is satisfied. Thus, it remains to show that Eand zare continuous.
For Ethis follows from the continuity of E. Besides, the continuity of zin
{(ˆx, x)Rn×Rn|ˆx6=x}follows from the continuity of z,E, and H, and
from the fact that Eis pointwise symmetric and positive definite, which en-
sures that the denominator in (C.6) is not zero. To prove also the continuity
in {(ˆx, x)Rn×Rn|ˆx=x}, let ˜xRnbe arbitrary and let (xk, yk)kNbe a
sequence in Rn×Rnsatisfying (xk, yk)6= (˜x, ˜x)for all kNand
lim
k→∞(xk, yk) = (˜x, ˜x).
If xk=ykholds for all kN, then we have
lim
k→∞kz(xk, yk)z(˜x, ˜x)k= lim
k→∞kz(xk)z(˜x)k= 0,
where we used the continuity of z. On the other hand, if xk6=ykholds for all
kN, then we have
kz(xk, yk)z(˜x, ˜x)k
zxk+yk
2z(˜x)
+
H(yk)H(xk)zxk+yk
2>E(xk, yk)(ykxk)
(ykxk)>E(xk, yk)(ykxk)(ykxk)
=zxk+yk
2z(˜x)
+
H(yk)H(xk)∇Hxk+yk
2>(ykxk)
(ykxk)>Exk+yk
2(ykxk)kykxkk
for all kN. Before we consider the limit k , we define for some R>0
230
C.2. Discrete Gradients for a Special Class of Port-Hamiltonian Systems
the closed ball B(˜x):={xRn| kx˜xk }and
c1:= min
xB(˜x)
λmin(E(x)).
Note that this minimum indeed exists since B(˜x)is compact, Eis continu-
ous, and the eigenvalues of a matrix are continuous functions of its entries,
cf. Theorem 6 in [178, p. 130]. Furthermore, since Eis pointwise symmetric
and positive definite, we have c1>0. We proceed by using Taylor’s formula,
cf. Theorem VII.5.8 and Remark VII.5.9 in [11], which yields
H(yk)Hxk+yk
21
2∇Hxk+yk
2>(ykxk)
kykxkk
2max
t[0,1] ∇H xk+yk+t(ykxk)
2!∇Hxk+yk
2
(C.7)
for all kN. Similarly, we obtain the bound
H(xk)Hxk+yk
2+1
2∇Hxk+yk
2>(ykxk)
kykxkk
2max
t[0,1] ∇H xk+yk+t(xkyk)
2!∇Hxk+yk
2
(C.8)
for all kN. Combining (C.7) and (C.8) yields
H(yk)H(xk)∇Hxk+yk
2>(ykxk)
kykxkkmax
t[1,1] ∇H xk+yk+t(ykxk)
2!∇Hxk+yk
2
for all kN. As a consequence, we arrive at
0lim
k→∞kz(xk, yk)z(˜x, ˜x)k
lim
k→∞zxk+yk
2z(˜x)
+ lim
k→∞
H(yk)H(xk)∇Hxk+yk
2>(ykxk)
(ykxk)>Exk+yk
2(ykxk)kykxkk
lim
k→∞
maxt[1,1] ∇Hxk+yk+t(ykxk)
2∇Hxk+yk
2
λmin Exk+yk
2
lim
k→∞
maxt[1,1] ∇Hxk+yk+t(ykxk)
2∇Hxk+yk
2
c1
= 0
231
C. Discrete Gradient Schemes
and, thus,
lim
k→∞kz(xk, yk)z(˜x, ˜x)k= 0.
We conclude the proof by observing that all other cases, where neither xk=yk
is satisfied for all kNnor xk6=ykholds for all kN, may be reduced to
the two discussed cases either by removing a finite number of elements of the
sequence or by splitting it up into two partial sequences.
Similarly as in the previous section, we aim to use the concept of discrete
gradient pairs to derive a suitable time-discrete approximation of (5.6) which
ensures a dissipation inequality on the time-discrete level. To this end, we
consider a time grid 0 = t1< t2< . . . < tq=tend and propose the discrete-
time system
E(˘xk,˘xk+1)˘xk+1 =E(˘xk,˘xk+1)˘xk+ (tk+1 tk)B ˘xk+1 + ˘xk
2!˘uk+1
2
+ (tk+1 tk) J ˘xk+1 + ˘xk
2!R ˘xk+1 + ˘xk
2!!z(˘xk,˘xk+1),
(C.9a)
˘yk+1
2=B ˘xk+1 + ˘xk
2!>z(˘xk,˘xk+1)(C.9b)
for k= 1, . . . , q1, where Eand zare assumed to form a discrete gradient pair
for (H, E, z). Furthermore, ˘xiRncorresponds to an approximation of x(ti)
for i= 1, . . . , q and, similarly, ˘yi+1
2Rmcorresponds to an approximation of
the midpoint value y(1
2(ti+ti+1)) for i= 1, . . . , q 1. Accordingly, the time-
discrete input values are chosen as ˘ui+1
2=u(1
2(ti+ti+1)) for i= 1, . . . , q 1.
We note that (C.9) involves time-discrete approximations of J(x(·)),R(x(·)),
B(x(·)), and uwhich are based on the implicit midpoint rule. This choice
appears to be natural when combining this time integration scheme with the
midpoint discrete gradient pair introduced in Theorem C.2.2, but in general
there may be alternative ways for obtaining a time discretization scheme based
on discrete gradient pairs, see for instance [218] for a corresponding discussion
in the context of classical discrete gradients.
By construction, the time-discrete system (C.9) leads to a time-discrete
analogue of the dissipation inequality as detailed in the following theorem. We
note in particular that Eis not required to be pointwise invertible, symmetric,
or positive semi-definite.
Theorem C.2.3 (Dissipation inequality for the time-discrete system (C.9)).
Consider a port-Hamiltonian system of the form (5.6) with time interval I=
[0, tend],tend R>0, associated Hamiltonian H C1(Rn), and coefficient func-
tions E, J, R C(Rn,Rn,n),zC(Rn,Rn), and BC(Rn,Rn,m)satisfying
pointwise
J=J>, R =R>0, E>z=∇H.(5.7)
232
C.2. Discrete Gradients for a Special Class of Port-Hamiltonian Systems
Furthermore, let t1, t2, . . . , tqIwith 0 = t1< t2< . . . < tq=tend be given
and let (E, z)C(Rn×Rn,Rn,n)×C(Rn×Rn,Rn)be a discrete gradient pair
for (H, E, z). Besides, let ˘u3
2,˘u5
2,...,˘uq1
2Rmbe such that there exists a
sequence (˘x1,˘x2,...,˘xq)in Rnsatisfying (C.9a) for k= 1, . . . , q 1. Then,
every such sequence satisfies the time-discrete dissipation inequality
H(˘xk+1)H(˘xk)
tk+1 tk
=z(˘xk,˘xk+1)>R ˘xk+1 + ˘xk
2!z(˘xk,˘xk+1) + ˘yk+1
2>˘uk+1
2
˘yk+1
2>˘uk+1
2
(C.10)
for k= 1, . . . , q 1, where ˘y3
2,˘y5
2,...,˘yq1
2are defined via (C.9b).
Proof. By exploiting (C.9a), (5.7), and the fact that (E, z)is a discrete gradient
pair for (H, E, z), we obtain
H(˘xk+1)H(˘xk)
tk+1 tk
=z(˘xk,˘xk+1)>E(˘xk,˘xk+1)(˘xk+1 ˘xk)
tk+1 tk
=z(˘xk,˘xk+1)> J ˘xk+1 + ˘xk
2!R ˘xk+1 + ˘xk
2!!z(˘xk,˘xk+1)
+z(˘xk,˘xk+1)>B ˘xk+1 + ˘xk
2!˘uk+1
2
=z(˘xk,˘xk+1)>R ˘xk+1 + ˘xk
2!z(˘xk,˘xk+1) + ˘yk+1
2>˘uk+1
2
˘yk+1
2>˘uk+1
2
for k= 1, . . . , q 1.
Remark C.2.4 (Discretization of (5.6) by a classical discrete gradient method).
In the special case where Eis pointwise invertible, we may formally transform
(5.6) to the equivalent system
˙x(t) = ( ˜
J(x(t)) ˜
R(x(t))∇H(x(t)) + ˜
B(x(t))u(t),
y(t) = ˜
B(x(t))>∇H(x(t)) (C.11)
for all tI, with ˜
J:=E1JE−>,˜
R:=E1RE−>, and ˜
B:=E1B. In
particular, since the gradient of the Hamiltonian appears explicitly in (C.11),
the transformed system may be treated by classical discrete gradient methods
as considered in the previous section. However, the notion of discrete gradient
pairs as introduced in Definition C.2.1 allows to obtain a time discretization
scheme as in (C.9) without having to compute the inverse of E. Furthermore,
the time-discrete dissipation inequality in Theorem C.2.3 is also valid in the
general case where Emay be singular. ¨
While Theorem C.2.3 demonstrates that the time-continuous system (5.6)
233
C. Discrete Gradient Schemes
and the time-discrete system (C.9) have in common that they both ensure
a dissipation inequality, it does not address the question whether (C.9) rep-
resents a consistent discretization of (5.6). In fact, under some additional
assumptions, which include the continuous differentiability of the discrete gra-
dient pair, we may also prove consistency in the sense that the ratio of the
local truncation error and the time step size tends to zero as the time step
size approaches zero, see Theorem C.2.5 for a corresponding formal statement.
We emphasize that Theorem C.2.5 requires Eto be pointwise invertible, but
not necessarily symmetric or positive definite. While a convergence analysis of
the time discretization scheme (C.9) is not within the scope of this thesis, we
investigate its convergence numerically in section 6.2 for an example, where
we use the midpoint discrete gradient pair from Theorem C.2.2.
Theorem C.2.5 (Consistency of (C.9)).Consider a port-Hamiltonian system
of the form (5.6) with associated Hamiltonian H C2(Rn)and coefficient
functions E, J, R C1(Rn,Rn,n),zC1(Rn,Rn), and BC1(Rn,Rn,m)
satisfying pointwise
J=J>, R =R>0, E>z=∇H.(5.7)
Furthermore, let Ebe pointwise invertible and x0in Rn. In addition, let
(E, z)C1(Rn×Rn,Rn,n)×C1(Rn×Rn,Rn)be a discrete gradient pair
for (H, E, z). Besides, for some tend R>0and I= [0, tend]and for some
input signal uC1(R0,Rm), let x:IRnbe the solution of the initial value
problem consisting of (5.6a) and the initial condition x(0) = x0. Moreover, let
˘x: [0, )Rnfor some sufficiently small (0, tend)satisfy
E(x0,˘x(h))˘x(h) = E(x0,˘x(h))x0+hB ˘x(h) + x0
2!u h
2!
+h J ˘x(h) + x0
2!R ˘x(h) + x0
2!!z(x0,˘x(h))
(C.12)
for all h[0, ). Then, we have
lim
h&0
x(h)˘x(h)
h= 0.(C.13)
Proof. As a first step, we aim to use the implicit function theorem, cf. [11,
Thm. VII.8.2], to show that ˘xis uniquely determined by (C.12) and con-
tinuously differentiable, as long as is sufficiently small. To this end, let
ˆuC1(R,Rm)be an extension of uand let f:R×RnRnbe defined via
f(h, ˘x):=E(x0,˘x)(˘xx0)hB ˘x+x0
2!ˆu h
2!
234
C.2. Discrete Gradients for a Special Class of Port-Hamiltonian Systems
h J ˘x+x0
2!R ˘x+x0
2!!z(x0,˘x).
Since E,B,ˆu,J,R, and zare continuously differentiable, we infer that also
fis continuously differentiable. Furthermore, f(0, x0)vanishes and
f
˘x(0, x0) = E(x0, x0) = E(x0)
is invertible due to the pointwise invertibility of E. Thus, we may apply
the implicit function theorem, which yields that for sufficiently small the
value ˘x(h)is uniquely determined for any h[0, )and ˘xis continuously
differentiable. Especially, we have ˘x(0) = x0.
We continue to prove (C.13). Since xis a solution of (5.6a) and since all
coefficient functions of (5.6a) are continuous and Ein particular pointwise
invertible, we infer that xis continuously differentiable. Furthermore, since
˘xis continuous, Eis continuous and satisfies E(x0, x0) = E(x0), and Eis
pointwise invertible, we conclude that E(x0,˘x(h)) is invertible for sufficiently
small h. In the following, we consider a fixed h(0, )which is sufficiently
small to ensure that E(x0,˘x(h)) is invertible. Then, we have
x(h)˘x(h)
h
=1
h
h
Z0
E(x(t))1((J(x(t)) R(x(t))) z(x(t)) + B(x(t))u(t))
|{z }
=:g(t)
dt
E(x0,˘x(h))1 J ˘x(h) + x0
2!R ˘x(h) + x0
2!!z(x0,˘x(h))
E(x0,˘x(h))1B ˘x(h) + x0
2!u h
2!.
(C.14)
Since x,E,J,R,z,B, and uare continuous, we infer that also g: [0, h]Rn
as defined in (C.14) is continuous. Thus, applying the mean value theorem
for definite integrals, cf. [11, Cor. VI.4.17], to each component of gyields the
existence of ξh,i [0, h]for i= 1, . . . , n such that
1
h
h
Z0
g(t) dt=
g1(ξh,1)
.
.
.
gn(ξh,n)
is satisfied. Moreover, since gis continuous, we obtain
lim
h&0
1
h
h
Z0
g(t) dt= lim
h&0
g1(ξh,1)
.
.
.
gn(ξh,n)
=
g1(0)
.
.
.
gn(0)
=g(0).
235
C. Discrete Gradient Schemes
Finally, exploiting (C.14) as well as the continuity of E,J,R,z,B,u, and ˘x
yields
lim
h&0
x(h)˘x(h)
h
= lim
h&0
1
h
h
Z0
g(t) dt
lim
h&0E(x0,˘x(h))1 J ˘x(h) + x0
2!R ˘x(h) + x0
2!!z(x0,˘x(h))
lim
h&0E(x0,˘x(h))1B ˘x(h) + x0
2!u h
2!
=g(0) E(x0, x0)1((J(x0)R(x0)) z(x0, x0) + B(x0)u(0)) = 0.
236
D. Technical Details for Chapter 6
In this appendix we provide more details for some of the discretization ap-
proaches used in chapter 6. Especially, we derive the discretized transfor-
mation operators in appendix D.1, whereas we focus in appendix D.2 on the
derivation of the quadrature weights used in section 6.3 for approximating the
integral in (6.19).
D.1. Discretized Shift Operators
As mentioned in chapter 6, the used model reduction ansatzes involve transfor-
mation operators which are given by discretized versions of suitable shift op-
erators. In the following subsections D.1.1 to D.1.3, we derive the discretized
shift operators used in sections 6.1 to 6.3, respectively.
D.1.1. Periodic Shift Operator
In order to derive a discretized analogue of the family of periodic shift operators
introduced in Definition 1.2.2, we consider in this subsection the following
task: For a given shift ηRand samples φ(ξ1),. . .,φ(ξn)of a function
φC1
per([0,1]) at the grid points ξi=i
n=:ih for i= 1, . . . , n, we aim
for an approximation of the samples (Tper(η)φ)(ξ1),. . .,(Tper(η)φ)(ξn)of the
shifted function Tper(η)φ. To this end, we first construct a suitable cubic
spline interpolant of φwith periodic boundary conditions. More precisely, we
construct an approximant ˜
φ: [0,1] Rwhich is piecewise a cubic polynomial
of the form
˜
φ(ξ) = ai+bi(ξξi) + ci(ξξi)2+di(ξξi)3for all ξ[ξi1, ξi]
for i= 1, . . . , n with ξ0:= 0. Here, the spline coefficients ai, bi, ci, diR
for i= 1, . . . , n are chosen such that ˜
φis in C2
per([0,1]) and interpolates φat
the grid points. In particular, these conditions uniquely determine the spline
237
D. Technical Details for Chapter 6
coefficients, which are obtained via
4 1 0 ··· 0 1
1 4 1 ....
.
.0
0 1 4 ...0.
.
.
.
.
..........1 0
0··· 0 1 4 1
1 0 ··· 0 1 4
c1
.
.
.
cn
|{z}
=:c
=3
h2
2 1 0 ··· 0 1
12 1 ....
.
.0
0 1 2...0.
.
.
.
.
..........1 0
0··· 0 1 2 1
1 0 ··· 0 1 2
φ(ξ1)
.
.
.
φ(ξn)
,
a:=
a1
.
.
.
an
=
φ(ξ1)
.
.
.
φ(ξn)
, d :=
d1
.
.
.
dn
=1
3h
1 0 0 ··· 01
1 1 0 ....
.
.0
01 1 ...0.
.
.
.
.
..........0 0
0··· 01 1 0
0 0 ··· 01 1
|{z }
=:D1
c,
b:=
b1
.
.
.
bn
=1
hD1
φ(ξ1)
.
.
.
φ(ξn)
+h
3
2 0 0 ··· 0 1
1 2 0 ....
.
.0
0 1 2 ...0.
.
.
.
.
..........0 0
0··· 0 1 2 0
0 0 ··· 0 1 2
c.
(D.1)
Especially, we note that the leading matrix of the linear equation system in the
first line is symmetric and positive definite and, hence, cis uniquely determined
and so are a,b, and d.
Next, we use the spline interpolant ˜
φto obtain approximations for the sam-
ples (Tper(η)φ)(ξ1),. . .,(Tper(η)φ)(ξn). For this purpose, we first introduce
q1:=bηc Z, q2:=$ηmod 1
h% {0, . . . , n 1}, ζ :=ηmod h[0, h),
(D.2)
which allows us to write ηas η=q1+q2h+ζ. Based on this, we obtain
(Tper(η)φ)(ξi)=(Tper(q1+q2h+ζ)φ)(ξi)=(Tper(q2h+ζ)φ)(ξi)
=
φ(ξiq2ζ),if i {q2+ 1, . . . , n}
φ(ξn+iq2ζ),otherwise
aiq2biq2ζ+ciq2ζ2diq2ζ3,if i {q2+ 1, . . . , n}
an+iq2bn+iq2ζ+cn+iq2ζ2dn+iq2ζ3,otherwise
238
D.1. Discretized Shift Operators
for i= 1, . . . , n, which may be written in matrix notation as
(Tper(η)φ)(ξ1)
.
.
.
(Tper(η)φ)(ξn)
"0Iq2
Inq20#(aζb +ζ2cζ3d) =:Tper,h(η)
φ(ξ1)
.
.
.
φ(ξn)
.
Here, we exploit for the definition of Tper,h :R L(Rn)that the spline coef-
ficients depend linearly on the samples φ(ξ1),. . .,φ(ξn), cf. (D.1), and that ζ
and q2are uniquely determined by η, see (D.2). Moreover, by construction we
have the identity
Tper,h(η)
φ(ξ1)
.
.
.
φ(ξn)
=
(Tper(η)˜
φ)(ξ1)
.
.
.
(Tper(η)˜
φ)(ξn)
,(D.3)
i.e., the approximation of the shifted version of φis essentially accomplished
by replacing φby the corresponding spline interpolant ˜
φ. Furthermore, by
similar arguments as in the proof of Theorem A.3, we infer that (Tper(η)f)(ξ)
is continuously differentiable with respect to ηfor all ξ[0,1],ηR, and
fC1
per([0,1]). Thus, since the spline interpolant ˜
φis by construction in
C2
per([0,1]) C1
per([0,1]), we conclude by exploiting (D.3) that Tper,h is contin-
uously differentiable with derivative
T0
per,h(η)
φ(ξ1)
.
.
.
φ(ξn)
="0Iq2
Inq20#(b+ 2ζc 3ζ2d).(D.4)
D.1.2. Shift Operator used in Section 6.2
In section 6.2 we consider a family of shift operators which is based on a com-
bination of the families of extended domain shift operators Teand constant
extrapolation shift operators Tcas introduced in Definitions 3.3.3 and 3.3.8,
respectively. More precisely, based on the domain = (0,1), the extended
domain b
e= (1.5,1), and a given shift ηR, we define the operator
Tce(η): H1(b
e)H1(Ω) via
(Tce(η)f) (ξ):=
f(ξη),if ξηb
e,
f(1.5),if ξη 1.5,
f(1),otherwise.
(D.5)
To derive a discretized version of Tce, we proceed similarly as in appendix D.1.1
and consider the following task: For a given shift ηRand samples φ(ξk),
. . .,φ(ξn)of a function φC1([1.5,1]) at the grid points ξi=i
n=:ih
for i=k, . . . , n, we target an approximation of the samples (Tce(η)φ)(ξ0),
239
D. Technical Details for Chapter 6
. . .,(Tce(η)φ)(ξn)of the shifted function Tce(η)φ. Here, n, k Nare chosen
such that k
n=kh = 1.5is satisfied. Furthermore, we assume that the first
derivative of φvanishes at the boundaries of b
eand proceed by constructing
a suitable cubic spline interpolant of φwith homogeneous Neumann boundary
conditions. More precisely, we construct an approximant ˜
φ: [1.5,1] R
which is piecewise a cubic polynomial of the form
˜
φ(ξ) = ai+bi(ξξi) + ci(ξξi)2+di(ξξi)3for all ξ[ξi1, ξi]
for i= 1 k, . . . , n. Here, the spline coefficients ai, bi, ci, diRfor i=
1k, . . . , n are chosen such that ˜
φinterpolates φat the grid points, is twice
continuously differentiable, and its first derivative vanishes at 1.5and 1.
In particular, these conditions uniquely determine the spline coefficients. To
obtain explicit formulas for the coefficients, we introduce the auxiliary variables
ek, . . . , enR, which correspond to the values of 1
2˜
φ00 at the grid points and
are given by ei=cifor i= 1 k, . . . , n and ek=c1k3hd1k, cf. [114,
sec. 3.3]. Then, the spline coefficients may be calculated via
2 1
1 4 1
1 4 ...
......1
1 4 1
1 2
ek
.
.
.
en
|{z }
=:e
=3
h2
1 1
12 1
12...
......1
12 1
11
φ(ξk)
.
.
.
φ(ξn)
,
a:=
a1k
.
.
.
an
=
φ(ξ1k)
.
.
.
φ(ξn)
, c :=
c1k
.
.
.
cn
=
e1k
.
.
.
en
,
d:=
d1k
.
.
.
dn
=1
3h
1 1
1 1
......
1 1
1 1
|{z }
=:D1
e,
b:=
b1k
.
.
.
bn
=1
hD1
φ(ξk)
.
.
.
φ(ξn)
+h
3
1 2
1 2
......
1 2
1 2
e.
(D.6)
In particular, we note that the leading matrix of the linear equation system
in the first line is symmetric and positive definite and, hence, eis uniquely
determined and so are a,b,c, and d.
240
D.1. Discretized Shift Operators
Next, we use the spline interpolant ˜
φto obtain approximations for the sam-
ples (Tce(η)φ)(ξ0),. . .,(Tce(η)φ)(ξn). To this end, we first introduce
q:=η
hZand ζ:=ηmod h[0, h),(D.7)
which allows us to write ηas η=qh +ζ. Based on this, we obtain
(Tce(η)φ)(ξi) = (Tce(qh +ζ)φ)(ξi)
=
φ(ξiqζ),if i {q+ 1 k, . . . , q +n},
φ(1.5),if iqk,
φ(1),otherwise
aiqbiqζ+ciqζ2diqζ3,if i {q+ 1 k, . . . , q +n},
φ(1.5),if iqk,
φ(1),otherwise
for i= 0, . . . , n. In matrix notation, this may be written as
(Tce(η)φ)(ξ0)
.
.
.
(Tce(η)φ)(ξn)
T1(q)
φ(ξk)
.
.
.
φ(ξn)
+T2(q)(aζb +ζ2cζ3d)
=:Tce,h(η)
φ(ξk)
.
.
.
φ(ξn)
,
(D.8)
where T1:ZRn+1,n+k+1 and T2:ZRn+1,n+kare defined via
T1(q):=
h01n+1i,if q < n,
0 0
01q
,if nq < 0,
0,if 0qk1,
1qk+1 0
0 0
,if k1< q n+k1,
h1n+1 0i,if q > n +k1,
241
D. Technical Details for Chapter 6
T2(q):=
0,if q < nor q > n +k1,
0In+1+q
0 0
,if nq < 0,
h0(n+1)×(kq1) In+1 0i,if 0qk1,
0 0
Ik+nq0
,if k1< q n+k1.
Here, we emphasize that one may obtain more compact expressions for T1and
T2by summarizing some of the cases in their definitions, cf. (6.13). Further-
more, we exploit for the definition of Tce,h :R L(Rn+k+1,Rn+1)in (D.8)
that the spline coefficients depend linearly on the samples φ(ξk),. . .,φ(ξn),
cf. (D.6), and that ζand qare uniquely determined by η, see (D.7). Moreover,
by construction we have the identity
Tce,h(η)
φ(ξk)
.
.
.
φ(ξn)
=
(Tce(η)˜
φ)(ξ0)
.
.
.
(Tce(η)˜
φ)(ξn)
,(D.9)
similarly as in appendix D.1.1, cf. (D.3). In addition, based on the defini-
tion of Tce in (D.5), we note that, for given ξand fC1(b
e)with
f0(1.5) = f0(1) = 0, the function ˆg:RRdefined via ˆg(η):= (Tce(η)f)(ξ)
is continuously differentiable. Especially, the corresponding derivative is given
by
ˆg0(η):=
f0(ξη),if ξηb
e,
0,otherwise.
Thus, since the spline interpolant ˜
φsatisfies by construction ˜
φC1(b
e)and
˜
φ0(1.5) = ˜
φ0(1) = 0, we infer by exploiting (D.9) that Tce,h is continuously
differentiable with derivative
T0
ce,h(η)
φ(ξk)
.
.
.
φ(ξn)
=T2(q)(b+ 2ζc 3ζ2d).(D.10)
D.1.3. Shift Operator used in Section 6.3
In section 6.3 we consider the family of constant extrapolation shift operators
Tcas introduced in Definition 3.3.8, where we restrict Tc(η)to the subspace
of functions which vanish at the left boundary of = (0,1) for all ηR.
To derive a discretized analogue of this restricted version of Tc, we proceed
similarly as in the previous two subsections and consider the following task: For
a given shift ηRand samples φ(ξ1),. . .,φ(ξn)of a function φC1([0,1]) at
the grid points ξi=i
n=:ih for i= 1, . . . , n, we aim for an approximation of the
242
D.1. Discretized Shift Operators
samples (Tc(η)φ)(ξ1),. . .,(Tc(η)φ)(ξn)of the shifted function Tc(η)φ. Here, we
assume that φsatisfies φ(0) = φ0(0) = φ0(1) = 0 and proceed by constructing
a suitable cubic spline interpolant of φwith homogeneous Neumann boundary
conditions and a homogeneous Dirichlet condition on the left boundary. More
precisely, we construct an approximant ˜
φ: [0,1] Rwhich is piecewise a cubic
polynomial of the form
˜
φ(ξ) = ai+bi(ξξi) + ci(ξξi)2+di(ξξi)3for all ξ[ξi1, ξi]
for i= 1, . . . , n with ξ0:= 0. Here, the spline coefficients ai, bi, ci, diRfor
i= 1, . . . , n are chosen such that ˜
φinterpolates φat the grid points, is twice
continuously differentiable, and satisfies ˜
φ(0) = ˜
φ0(0) = ˜
φ0(1) = 0. Especially,
these conditions uniquely determine the spline coefficients. To obtain explicit
formulas for the coefficients, we proceed similarly as in appendix D.1.2 and
introduce the auxiliary variables e0, . . . , enRvia ei=cifor i= 1, . . . , n and
e0=c13hd1. Using these auxiliary variables, the spline coefficients may be
obtained via
2 1
1 4 1
1 4 ...
......1
1 4 1
1 2
e0
.
.
.
en
|{z}
=:e
=3
h2
1
2 1
12...
......1
12 1
11
φ(ξ1)
.
.
.
φ(ξn)
,
a:=
a1
.
.
.
an
=
φ(ξ1)
.
.
.
φ(ξn)
, c :=
c1
.
.
.
cn
=
e1
.
.
.
en
,
d:=
d1
.
.
.
dn
=1
3h
1 1
1 1
......
1 1
1 1
|{z }
=:D1
e,
b:=
b1
.
.
.
bn
=1
hD1
0
φ(ξ1)
.
.
.
φ(ξn)
+h
3
1 2
1 2
......
1 2
1 2
e.
(D.11)
In particular, we note that the leading matrix of the linear equation system
in the first line is symmetric and positive definite and, hence, eis uniquely
determined and so are a,b,c, and d.
243
D. Technical Details for Chapter 6
Next, we use the spline interpolant ˜
φto obtain approximations for the sam-
ples (Tc(η)φ)(ξ1),. . .,(Tc(η)φ)(ξn). To this end, we first define qZand
ζ[0, h)as in (D.7), which allows us to write ηas η=qh +ζ. Based on this,
we obtain
(Tc(η)φ)(ξi) = (Tc(qh +ζ)φ)(ξi)
=
φ(ξiqζ),if i {q+ 1, . . . , q +n},
0,if iq,
φ(1),otherwise
aiqbiqζ+ciqζ2diqζ3,if i {q+ 1, . . . , q +n},
0,if iq,
φ(1),otherwise
for i= 1, . . . , n. In matrix notation, this may be written as
(Tc(η)φ)(ξ1)
.
.
.
(Tc(η)φ)(ξn)
T1(q)
φ(ξ1)
.
.
.
φ(ξn)
+T2(q)(aζb +ζ2cζ3d)
=:Tc,h(η)
φ(ξ1)
.
.
.
φ(ξn)
(D.12)
where T1, T2:ZRn,n are defined via
T1(q):=
h01ni,if q n,
0 0
01q
,if n < q < 0,
0,if q0,
T2(q):=
0,if |q| n,
0In+q
0 0
,if n < q < 0,
In,if q= 0,
0 0
Inq0
,if 0< q < n.
Here, we emphasize that one may derive more compact formulas for T1and
T2by summarizing some of the cases in their definitions, cf. (6.26). Besides,
we exploit for the definition of Tc,h :R L(Rn)in (D.12) that the spline
coefficients depend linearly on the samples φ(ξ1),. . .,φ(ξn), cf. (D.11), and that
ζand qare uniquely determined by η, see (D.7). Moreover, by construction
244
D.2. Approximation of the Integral in (6.19)
we have the identity
Tc,h(η)
φ(ξ1)
.
.
.
φ(ξn)
=
(Tc(η)˜
φ)(ξ1)
.
.
.
(Tc(η)˜
φ)(ξn)
,(D.13)
similarly as in the preceding two subsections. Furthermore, we use this relation
and similar arguments as at the end of appendix D.1.2 to conclude that Tc,h is
continuously differentiable with derivative
T0
c,h(η)
φ(ξ1)
.
.
.
φ(ξn)
=T2(q)(b+ 2ζc 3ζ2d).(D.14)
D.2. Approximation of the Integral in (6.19)
This section is devoted to the quadrature approximation of the integral
Z
b
Ω(xh)
ψi(ξ)ψj(ξ) exp β
1 + PN+1
k=1 [xh]kψk(ξ)!dξ, (D.15)
for i, j = 1, . . . , N + 1, which occurs in the definition of vhin (6.19). In
particular, b
is defined in (6.20), xhis a vector in RN+1,βis a scalar in R>0,
and ψ1, . . . , ψN+1 :Rwith = (0,1) denote the piecewise linear FEM
basis functions, cf. the discussion after (6.19).
As mentioned in section 6.3, we employ a composite trapezoidal rule for
approximating the integral (D.15), where we use the FEM grid points ξ`:=`h
for `= 0, . . . , N + 1 as quadrature points. In the following, we derive the
corresponding quadrature weights. To this end, we first note that the argument
of the integral (D.15) is zero at ξ0, since all FEM basis functions vanish at ξ0
by construction. Consequently, there is no need to assign a weight to ξ0.
We proceed by deriving the weights corresponding to the quadrature points
ξ2, . . . , ξN. If [xh]` 1holds for some ` {2, . . . , N}, then we have
N+1
X
j=1
[xh]jψj(ξ`) = [xh]` 1
and, thus, ξ`lies outside of b
Ω(xh). Consequently, we set the corresponding
weight ˆω`(xh)to zero in this case. For the complementary case [xh]`>1, we
infer that ξ`is an element of b
Ω(xh). If additionally [xh]`1>1and [xh]`+1 >
1hold, then the complete interval [ξ`1, ξ`+1]is contained in b
Ω(xh), which
follows from the piecewise linearity and continuity of PN+1
j=1 [xh]jψj. Thus, in
this case the corresponding weight is just given by the mesh width ˆω`(xh) = h,
which corresponds to the usual weight for the composite trapezoidal rule with
245
D. Technical Details for Chapter 6
equidistant mesh, cf. [147, eq. (6.24)]. If on the other hand [xh]`1 1or
[xh]`+1 1hold, then there must be a boundary point of b
Ω(xh)in [ξ`1, ξ`)
or in (ξ`, ξ`+1], respectively, which follows from the continuity of PN+1
j=1 [xh]jψj.
Furthermore, since PN+1
j=1 [xh]jψjis piecewise linear, the boundary points may
be explicitly calculated and are obtained as
`1+[xh]`
[xh]`[xh]`1!h, if [xh]`1 1
and `+1+[xh]`
[xh]`[xh]`+1 !h, if [xh]`+1 1.
In these cases we may determine ˆω`(xh)by using the composite trapezoidal
rule based on a general not necessarily equidistant mesh, cf. [147, eq. (6.23)],
by formally adding these boundary points as quadrature points. The weights
associated to these additional quadrature points are zero, since they lie outside
of b
Ω(xh)and hence the reaction rate vanishes at these points. By summarizing
all cases, we obtain the weight associated with ξ`via
ˆω`(xh):=
h, if ([xh]`1,[xh]`,[xh]`+1)(R>1)3,
z1,`(xh),if ([xh]`1,[xh]`,[xh]`+1)R≤−1×R>1×R≤−1,
z2,`(xh),if ([xh]`1,[xh]`,[xh]`+1)(R>1)2×R≤−1,
z3,`(xh),if ([xh]`1,[xh]`,[xh]`+1)R≤−1×(R>1)2,
0,otherwise,
z1,`(xh):=h
2([xh]`+ 1) 1
[xh]`[xh]`1
+1
[xh]`[xh]`+1 !,
z2,`(xh):=h
2 1 + [xh]`+ 1
[xh]`[xh]`+1 !, z3,`(xh):=h
2 1 + [xh]`+ 1
[xh]`[xh]`1!
for `= 2, . . . , N.
By similar arguments, we obtain formulas for the weights corresponding to
the FEM grid points ξ1=hand ξN+1 = (N+1)h= 1. Here, the special feature
of ξN+1 is that it is outside of or a boundary point of b
Ω(xh)and, consequently,
the associated weight is at most h
2. Furthermore, the weight associated with
ξ1is greater than h
2as long as [xh]1>1holds, since in that case [0, ξ1]is
completely contained in b
Ω(xh), which follows from
N+1
X
j=1
[xh]jψj(0) = 0 >1for all xhRN+1.
The resulting formulas for the weights are summarized in (6.23).
246
Bibliography
[1] R. Abgrall, D. Amsallem, and R. Crisovan. Robust model reduction by
L1-norm minimization and approximation via dictionaries: application
to nonlinear hyperbolic problems. Adv. Model. Simul. Eng. Sci., 3(1),
2016. doi: 10.1186/s40323-015-0055-3.
[2] R. A. Adams and J. J. F. Fournier. Sobolev Spaces. Elsevier, Oxford,
UK, second edition, 2003.
[3] B. M. Afkham and J. S. Hesthaven. Structure preserving model reduc-
tion of parametric Hamiltonian systems. SIAM J. Sci. Comput., 39(6):
A2616–A2644, 2017. doi: 10.1137/17M1111991.
[4] B. M. Afkham and J. S. Hesthaven. Structure-preserving model-
reduction of dissipative Hamiltonian systems. J. Sci. Comput., 81:3–21,
2019. doi: 10.1007/s10915-018-0653-6.
[5] L. Agud Albesa, M. Boix García, M. L. Pla Ferrando, and S. C. Car-
dona Navarrete. A study about the solution of convection–diffusion–
reaction equation with Danckwerts boundary conditions by analytical,
method of lines and Crank–Nicholson techniques. Math. Methods Appl.
Sci., 2022. doi: 10.1002/mma.8633.
[6] A. Alla and J. N. Kutz. Nonlinear model order reduction via dynamic
mode decomposition. SIAM J. Sci. Comput., 39(5):B778–B796, 2017.
doi: 10.1137/16M1059308.
[7] R. Altmann and P. Schulze. A port-Hamiltonian formulation of the
Navier–Stokes equations for reactive flows. Systems Control Lett., 100:
51–55, 2017. doi: 10.1016/j.sysconle.2016.12.005.
[8] R. Altmann, V. Mehrmann, and B. Unger. Port-Hamiltonian formula-
tions of poroelastic network models. Math. Comput. Model. Dyn. Syst.,
27(1):429–452, 2021. doi: 10.1080/13873954.2021.1975137.
[9] H. Amann. Ordinary Differential Equations: An Introduction to Non-
linear Analysis. De Gruyter, Berlin, Germany, 1990. doi: 10.1515/
9783110853698.
[10] H. Amann and J. Escher. Analysis I. Birkhäuser Verlag, Basel, Switzer-
land, 2005. doi: 10.1007/b137107.
247
Bibliography
[11] H. Amann and J. Escher. Analysis II. Birkhäuser Verlag, Basel, Switzer-
land, 2008. doi: 10.1007/978-3-7643-7478-5.
[12] H. Amann and J. Escher. Analysis III. Birkhäuser Verlag, Basel, Switzer-
land, 2009. doi: 10.1007/978-3-7643-7480-8.
[13] W. Anderson and M. Farazmand. Evolution of nonlinear reduced-order
solutions for PDEs with conserved quantities. SIAM J. Sci. Comput., 44
(1):A176–A197, 2022. doi: 10.1137/21M1415972.
[14] A. C. Antoulas. Approximation of Large-Scale Dynamical Systems. So-
ciety for Industrial and Applied Mathematics, Philadelphia, PA, USA,
2005. doi: 10.1137/1.9780898718713.
[15] A. C. Antoulas. A new result on passivity preserving model reduction.
Systems Control Lett., 54:361–374, 2005. doi: 10.1016/j.sysconle.2004.
07.007.
[16] A. C. Antoulas, R. Ionutiu, N. Martins, E. J. W. ter Maten, K. Mo-
haghegh, R. Pulch, J. Rommes, M. Saadvandi, and M. Striebel. Model
order reduction: Methods, concepts and properties. In M. Günther, ed-
itor, Coupled Multiscale Simulation and Optimization in Nanoelectron-
ics, pages 159–265. Springer Berlin/Heidelberg, Germany, 2015. doi:
10.1007/978-3-662-46672-8_4.
[17] P. Astrid, S. Weiland, K. Willcox, and T. Backx. Missing point esti-
mation in models described by proper orthogonal decomposition. IEEE
Trans. Automat. Control, 53(10):2237–2251, 2008. doi: 10.1109/TAC.
2008.2006102.
[18] E.-W. Bai and Y. Liu. Least squares solutions of bilinear equations.
Systems Control Lett., 55(6):466–472, 2006. doi: 10.1016/j.sysconle.2005.
09.010.
[19] H. Bansal, P. Schulze, M. H. Abbasi, H. Zwart, L. Iapichino, W. H. A.
Schilders, and N. van de Wouw. Port-Hamiltonian formulation of two-
phase flow models. Systems Control Lett., 149:104881, 2021. doi: 10.
1016/j.sysconle.2021.104881.
[20] J. Barnett and C. Farhat. Quadratic approximation manifold for miti-
gating the Kolmogorov barrier in nonlinear projection-based model order
reduction. J. Comput. Phys., 464:111348, 2022. doi: 10.1016/j.jcp.2022.
111348.
[21] M. Barrault, Y. Maday, N. C. Nguyen, and A. T. Patera. An ‘empirical
interpolation’ method: application to efficient reduced-basis discretiza-
tion of partial differential equations. C. R. Math. Acad. Sci. Paris, 339:
667–672, 2004. doi: 10.1016/j.crma.2004.08.006.
248
Bibliography
[22] T. Bárta. Two notes on eventually differentiable families of operators.
Comment. Math. Univ. Carolin., 51(1):19–24, 2010.
[23] U. Baur, P. Benner, and L. Feng. Model order reduction for linear
and nonlinear systems: a system-theoretic perspective. Arch. Comput.
Methods Eng., 21(4):331–358, 2014. doi: 10.1007/s11831-014-9111-2.
[24] M. S. Bazaraa, H. D. Sherali, and C. M. Shetty. Nonlinear Programming:
Theory and Algorithms. Wiley, Hoboken, NJ, USA, third edition, 2006.
[25] C. Beattie, V. Mehrmann, H. Xu, and H. Zwart. Linear port-Hamiltonian
descriptor systems. Math. Control Signals Systems, 30:17, 2018. doi:
10.1007/s00498-018-0223-3.
[26] C. Beattie, V. Mehrmann, and P. Van Dooren. Robust port-Hamiltonian
representations of passive systems. Automatica, 100:182–186, 2019. doi:
10.1016/j.automatica.2018.11.013.
[27] C. Beattie, S. Gugercin, and V. Mehrmann. Structure-preserving in-
terpolatory model reduction for port-Hamiltonian differential-algebraic
systems. In C. Beattie, P. Benner, M. Embree, S. Gugercin, and S. Lef-
teriu, editors, Realization and Model Reduction of Dynamical Systems:
A Festschrift in Honor of the 70th Birthday of Thanos Antoulas, pages
235–254. Springer Nature Switzerland, Cham, Switzerland, 2022. doi:
10.1007/978-3-030-95157-3_13.
[28] A. Ben-Israel and T. N. E. Greville. Generalized Inverses: Theory and
Applications. Springer-Verlag New York, NY, USA, second edition, 2003.
doi: 10.1007/b97366.
[29] P. Benner and T. Stykel. Model order reduction for differential-algebraic
equations: A survey. In A. Ilchmann and T. Reis, editors, Sur-
veys in Differential-Algebraic Equations IV, pages 107–160. Springer
International Publishing, Cham, Switzerland, 2017. doi: 10.1007/
978-3-319-46618-7_3.
[30] P. Benner, S. Gugercin, and K. Willcox. A survey of projection-based
model reduction methods for parametric dynamical systems. SIAM Rev.,
57(4):483–531, 2015. doi: 10.1137/130932715.
[31] P. Benner, M. Ohlberger, A. Cohen, and K. Willcox. Model Reduction
and Approximation. Society for Industrial and Applied Mathematics,
Philadelphia, PA, USA, 2017. doi: 10.1137/1.9781611974829.
[32] P. Benner, P. Goyal, B. Kramer, B. Peherstorfer, and K. Willcox. Op-
erator inference for non-intrusive model reduction of systems with non-
polynomial nonlinear terms. Comput. Methods Appl. Mech. Eng., 372:
113433, 2020. doi: 10.1016/j.cma.2020.113433.
249
Bibliography
[33] S. Benzoni-Gavage, J.-F. Coulombel, and S. Aubert. Boundary condi-
tions for Euler equations. AIAA J., 41(1):56–63, 2003. doi: 10.2514/2.
1913.
[34] D. S. Bernstein. Matrix Mathematics: Theory, Facts, and Formulas.
Princeton University Press, Princeton, NJ, USA, second edition, 2009.
doi: 10.1515/9781400833344.
[35] D. Betounes. Differential Equations: Theory and Applications. Springer
Science+Business Media, New York, NY, USA, 2001. doi: 10.1007/
978-1-4757-4971-7.
[36] W. J. Beyn and V. Thümmler. Freezing solutions of equivariant evolution
equations. SIAM J. Appl. Dyn. Syst., 3(2):85–116, 2004. doi: 10.1137/
030600515.
[37] F. Black, P. Schulze, and B. Unger. Projection-based model reduction
with dynamically transformed modes. ESAIM: Math. Model. Numer.
Anal., 54:2011–2043, 2020. doi: 10.1051/m2an/2020046.
[38] F. Black, P. Schulze, and B. Unger. Model order reduction with dy-
namically transformed modes for the wave equation. PAMM, 20(1):
e202000321, 2021. doi: 10.1002/pamm.202000321.
[39] F. Black, P. Schulze, and B. Unger. Efficient wildland fire simulation via
nonlinear model order reduction. Fluids, 6(8):280, 2021. doi: 10.3390/
fluids6080280.
[40] F. Black, P. Schulze, and B. Unger. Modal decomposition of flow data via
gradient-based transport optimization. In R. King and D. Peitsch, edi-
tors, Active Flow and Combustion Control 2021, pages 203–224. Springer
Nature, Cham, Switzerland, 2022. doi: 10.1007/978-3-030-90727-3_13.
[41] J. Borggaard, Z. Wang, and L. Zietsman. A goal-oriented reduced-order
modeling approach for nonlinear systems. Comput. Math. Appl., 71(11):
2155–2169, 2016. doi: 10.1016/j.camwa.2016.01.031.
[42] P. Borja, J. M. A. Scherpen, and K. Fujimoto. Extended balancing of
continuous LTI systems: a structure-preserving approach. IEEE Trans.
Automat. Control, 2021. doi: 10.1109/TAC.2021.3138645.
[43] T. Breiten and P. Schulze. Structure-preserving linear quadratic Gaus-
sian balanced truncation for port-Hamiltonian descriptor systems. Tech-
nical report, 2021. https://arxiv.org/abs/2111.05065.
[44] T. Breiten and B. Unger. Passivity preserving model reduction via
spectral factorization. Automatica, 142:110368, 2022. doi: 10.1016/j.
automatica.2022.110368.
250
Bibliography
[45] T. Breiten, R. Morandin, and P. Schulze. Error bounds for port-
Hamiltonian model and controller reduction based on system balancing.
Comput. Math. with Appl., 116:100–115, 2022. doi: 10.1016/j.camwa.
2021.07.022.
[46] H. Brezis. Functional Analysis, Sobolev Spaces and Partial Differen-
tial Equations. Springer New York, NY, USA, 2011. doi: 10.1007/
978-0-387-70914-7.
[47] A. Brugnoli, D. Alazard, V. Pommier-Budinger, and D. Matignon. Port-
Hamiltonian formulation and symplectic discretization of plate models
part I: Mindlin model for thick plates. Appl. Math. Model., 75:940–960,
2019. doi: 10.1016/j.apm.2019.04.035.
[48] A. Brugnoli, D. Alazard, V. Pommier-Budinger, and D. Matignon. Port-
Hamiltonian formulation and symplectic discretization of plate models
part II: Kirchhoff model for thin plates. Appl. Math. Model., 75:961–981,
2019. doi: 10.1016/j.apm.2019.04.036.
[49] J. Bruna, B. Peherstorfer, and E. Vanden-Eijnden. Neural Galerkin
scheme with active learning for high-dimensional evolution equations.
Technical report, 2022. https://arxiv.org/abs/2203.01360v1.
[50] P. Buchfink, S. Glas, and B. Haasdonk. Symplectic model reduction of
Hamiltonian systems on nonlinear manifolds. Technical report, 2021.
https://arxiv.org/abs/2112.10815v1.
[51] J. D. Buckmaster. The Mathematics of Combustion. Society for In-
dustrial and Applied Mathematics, Philadelphia, PA, USA, 1985. doi:
10.1137/1.9781611971064.
[52] T. Bui-Thanh, K. Willcox, O. Ghattas, and B. van Bloemen Waan-
ders. Goal-oriented, model-constrained optimization for reduction of
large-scale systems. J. Comput. Phys., 224(2):880–896, 2007. doi:
10.1016/j.jcp.2006.10.026.
[53] C. I. Byrnes, A. Isidori, and J. C. Willems. Passivity, feedback equiv-
alence, and the global stabilization of minimum phase nonlinear sys-
tems. IEEE Trans. Automat. Control, 36(11):1228–1240, 1991. doi:
10.1109/9.100932.
[54] N. Cagniart, Y. Maday, and B. Stamm. Model order reduction for prob-
lems with large convection effects. In B. N. Chetverushkin, W. Fitzgib-
bon, Y. A. Kuznetsov, P. Neittaanmäki, J. Periaux, and O. Pironneau,
editors, Contributions to Partial Differential Equations and Applications,
pages 131–150. Springer International Publishing, Cham, Switzerland,
2019. doi: 10.1007/978-3-319-78325-3_10.
251
Bibliography
[55] F. Cajori. A History of Mathematical Notations Volume II: Notations
Mainly in Higher Mathematics. The Open Court Publishing Company,
La Salle, IL, USA, 1929.
[56] J. L. Callaham, S. L. Brunton, and J.-C. Loiseau. On the role of nonlinear
correlations in reduced-order modelling. J. Fluid Mech., 938:A1, 2022.
doi: 10.1017/jfm.2021.994.
[57] S. L. Campbell and C. D. Meyer. Generalized Inverses of Linear Trans-
formations. Society for Industrial and Applied Mathematics, Philadel-
phia, PA, USA, 2009. doi: 10.1137/1.9780898719048.
[58] F. L. Cardoso-Ribeiro, D. Matignon, and V. Pommier-Budinger. A port-
Hamiltonian model of liquid sloshing in moving containers and applica-
tion to a fluid-structure system. J. Fluids Struct., 69:402–427, 2017. doi:
10.1016/j.jfluidstructs.2016.12.007.
[59] K. Carlberg. Adaptive h-refinement for reduced-order models. Internat.
J. Numer. Methods Engrg., 102(5):1192–1210, 2015. doi: 10.1002/nme.
4800.
[60] K. Carlberg, C. Bou-Mosleh, and C. Farhat. Efficient non-linear model
reduction via a least-squares Petrov–Galerkin projection and compres-
sive tensor approximations. Int. J. Numer. Methods Eng., 86:155–181,
2011. doi: 10.1002/nme.3050.
[61] K. Carlberg, M. Barone, and H. Antil. Galerkin v. least-squares Petrov–
Galerkin projection in nonlinear model reduction. J. Comput. Phys.,
330:693–734, 2017. doi: 10.1016/j.jcp.2016.10.033.
[62] R. Castañé Selga, B. Lohmann, and R. Eid. Stability preservation in
projection-based model order reduction of large scale systems. Eur. J.
Control, 18(2):122–132, 2012. doi: 10.3166/ejc.18.122-132.
[63] J. Chan, J. A. Evans, and W. Qiu. A dual Petrov–Galerkin finite element
method for the convection-diffusion equation. Comput. Math. Appl., 68
(11):1513–1529, 2014. doi: 10.1016/j.camwa.2014.07.008.
[64] S. Chaturantabut and D. C. Sorensen. Nonlinear model reduction via
discrete empirical interpolation. SIAM J. Sci. Comput., 32(5):2737–2764,
2010. doi: 10.1137/090766498.
[65] S. Chaturantabut, C. Beattie, and S. Gugercin. Structure-preserving
model reduction for nonlinear port-Hamiltonian systems. SIAM J. Sci.
Comput., 38(5):B837–B865, 2016. doi: 10.1137/15M1055085.
[66] G.-Y. Chen, M. Gan, C. L. P. Chen, and H.-X. Li. A regular-
ized variable projection algorithm for separable nonlinear least-squares
252
Bibliography
problems. IEEE Trans. Automat. Control, 64(2):526–537, 2019. doi:
10.1109/TAC.2018.2838045.
[67] X. Cheng and J. M. A. Scherpen. Model reduction methods for complex
network systems. Annu. Rev. Control Robot. Auton. Syst., 4:425–453,
2021. doi: 10.1146/annurev-control-061820-083817.
[68] K. Cherifi, H. Gernandt, and D. Hinsen. The difference between port-
Hamiltonian, passive and positive real descriptor systems. Technical
report, 2022. https://arxiv.org/abs/2204.04990.
[69] K. Cherifi, V. Mehrmann, and P. Schulze. Simulations in a
digital twin of an electrical machine. Technical report, 2022.
https://arxiv.org/abs/2207.02171.
[70] J. Chung and J. G. Nagy. An efficient iterative approach for large-scale
separable nonlinear inverse problems. SIAM J. Sci. Comput., 31(6):
4654–4674, 2010. doi: 10.1137/080732213.
[71] A. Cohen, W. Dahmen, and G. Welper. Adaptivity and variational stabi-
lization for convection-diffusion equations. ESAIM Math. Model. Numer.
Anal., 46(5):1247–1273, 2012. doi: 10.1051/m2an/2012003.
[72] A. Cohen, R. DeVore, G. Petrova, and P. Wojtaszczyk. Optimal stable
nonlinear approximation. Found. Comput. Math., 2021. doi: 10.1007/
s10208-021-09494-z.
[73] P. G. Constantine. Active Subspaces. Society for Industrial and Ap-
plied Mathematics, Philadelphia, PA, USA, 2015. doi: 10.1137/1.
9781611973860.
[74] D. M. Copeland, S. W. Cheung, K. Huynh, and Y. Choi. Reduced order
models for Lagrangian hydrodynamics. Comput. Methods Appl. Mech.
Eng., 388:114259, 2022. doi: 10.1016/j.cma.2021.114259.
[75] F. E. Curtis, T. Mitchell, and M. L. Overton. A BFGS-SQP method
for nonsmooth, nonconvex, constrained optimization and its evaluation
using relative minimization profiles. Optim. Methods Softw., 32(1):148–
181, 2017. doi: 10.1080/10556788.2016.1208749.
[76] W. Dahmen, C. Huang, C. Schwab, and G. Welper. Adaptive Petrov–
Galerkin methods for first order transport equations. SIAM J. Numer.
Anal., 50(5):2420–2445, 2012. doi: 10.1137/110823158.
[77] P. V. Danckwerts. Continuous flow systems: distribution of residence
times. Chem. Eng. Sci., 2(1):1–13, 1953. doi: 10.1016/0009-2509(53)
80001-1.
253
Bibliography
[78] L. Debnath. Nonlinear Partial Differential Equations for Scientists and
Engineers. Birkhäuser, Boston, MA, USA, second edition, 2005. doi:
10.1007/b138648.
[79] L. F. Demkowicz and J. Gopalakrishnan. An overview of the discon-
tinuous Petrov Galerkin method. In X. Feng, O. Karakashian, and
Y. Xing, editors, Recent Developments in Discontinuous Galerkin Fi-
nite Element Methods for Partial Differential Equations, pages 149–
180. Springer International Publishing, Cham, Switzerland, 2014. doi:
10.1007/978-3-319-01818-8_6.
[80] P. Deuflhard and F. Bornemann. Scientific Computing with Ordinary
Differential Equations. Springer-Verlag New York, NY, USA, 2002. doi:
10.1007/978-0-387-21582-2.
[81] M. Dihlmann, M. Drohmann, and B. Haasdonk. Model reduction of
parametrized evolution problems using the reduced basis method with
adaptive time partitioning. In International Conference on Adaptive
Modeling and Simulation, pages 156–167, 2011.
[82] N. Dong and J. Roychowdhury. General-purpose nonlinear model-
order reduction using piecewise-polynomial representations. IEEE Trans.
Comput.-Aided Design Integr. Circuits Syst., 27(2):249–264, 2008. doi:
10.1109/TCAD.2007.907272.
[83] Z. Drmač and S. Gugercin. A new selection operator for the dis-
crete empirical interpolation method improved a priori error bound
and extensions. SIAM J. Sci. Comput., 38(2):A631–A648, 2016. doi:
10.1137/15M1019271.
[84] M. Drohmann, B. Haasdonk, and M. Ohlberger. Reduced basis approx-
imation for nonlinear parametrized evolution equations based on empir-
ical operator interpolation. SIAM J. Sci. Comput., 34(2):A937–A969,
2012. doi: 10.1137/10081157X.
[85] V. Duindam, A. Macchelli, S. Stramigioli, and H. Bruyninckx. Model-
ing and Control of Complex Physical Systems. Springer-Verlag Berlin
Heidelberg, Germany, 2009. doi: 10.1007/978-3-642-03196-0.
[86] S. Dutta, P. Rivera-Casillas, B. Styles, and M. W. Farthing. Reduced or-
der modeling using advection-aware autoencoders. Math. Comput. Appl.,
27:34, 2022. doi: 10.3390/mca27030034.
[87] D. Eberard and B. Maschke. Port Hamiltonian systems extended to irre-
versible systems: the example of the heat conduction. IFAC Proceedings
Volumes, 37(13):243–248, 2004. doi: 10.1016/S1474-6670(17)31230-2.
254
Bibliography
[88] H. Egger, T. Kugler, B. Liljegren-Sailer, N. Marheineke, and
V. Mehrmann. On structure-preserving model reduction for damped
wave propagation in transport networks. SIAM J. Sci. Comput., 40(1):
A331–A365, 2018. doi: 10.1137/17M1125303.
[89] V. Ehrlacher, D. Lombardi, O. Mula, and F.-X. Vialard. Nonlinear model
reduction on metric spaces. Application to one-dimensional conservative
PDEs in Wasserstein spaces. ESAIM Math. Model. Numer. Anal., 54
(6):2159–2197, 2020. doi: 10.1051/m2an/2020013.
[90] S. Eidnes. Order theory for discrete gradient methods. BIT Numer.
Math., 2022. doi: 10.1007/s10543-022-00909-z.
[91] L. Eldén and S. Ahmadi-Asl. Solving bilinear tensor least squares prob-
lems and application to Hammerstein identification. Numer. Linear Al-
gebra Appl., 26(2):e2226, 2019. doi: 10.1002/nla.2226.
[92] K.-J. Engel and R. Nagel. One-Parameter Semigroups for Linear Evo-
lution Equations. Springer New York, NY, USA, 2000. doi: 10.1007/
b97696.
[93] P. A. Etter and K. T. Carlberg. Online adaptive basis refinement and
compression for reduced-order models via vector-space sieving. Comput.
Methods Appl. Mech. Eng., 364:112931, 2020. doi: 10.1016/j.cma.2020.
112931.
[94] L. C. Evans. Partial Differential Equations. American Mathematical
Society, Providence, RI, USA, 1998.
[95] A. Falaize and T. Hélie. Passive simulation of the nonlinear port-
Hamiltonian modeling of a Rhodes piano. J. Sound Vib., 390:289–309,
2017. doi: 10.1016/j.jsv.2016.11.008.
[96] J. Faraut. Analysis on Lie Groups: An Introduction. Cambridge Uni-
versity Press, Cambridge, UK, 2008. doi: 10.1017/CBO9780511755170.
[97] C. Farhat, P. Avery, T. Chapman, and J. Cortial. Dimensional reduction
of nonlinear finite element dynamic models with finite rotations and
energy-based mesh sampling and weighting for computational efficiency.
Int. J. Numer. Methods Eng., 98(9):625–662, 2014. doi: 10.1002/nme.
4668.
[98] H. Fassbender and P. Benner. Passivity preserving model reduction via
a structured Lanczos method. In IEEE Conference on Computer Aided
Control System Design, pages 8–13, Munich, Germany, 2006. doi: 10.
1109/CACSD-CCA-ISIC.2006.4776616.
255
Bibliography
[99] H. O. Fattorini. Second Order Linear Differential Equations in Banach
Spaces. Elsevier Science Publishers, Amsterdam, The Netherlands, 1985.
[100] M. Fazel, H. Hindi, and S. P. Boyd. A rank minimization heuristic with
application to minimum order system approximation. In Proceedings of
the 2001 American Control Conference, pages 4734–4739, Arlington, VA,
USA, 2001. doi: 10.1109/ACC.2001.945730.
[101] A. Ferrero, T. Taddei, and L. Zhang. Registration-based model reduction
of parameterized two-dimensional conservation laws. J. Comput. Phys.,
457:111068, 2022. doi: 10.1016/j.jcp.2022.111068.
[102] S. Fiaz, D. Zonetti, R. Ortega, J. M. A. Scherpen, and A. J. van der
Schaft. A port-Hamiltonian approach to power network modeling and
analysis. Eur. J. Control, 19(6):477–485, 2013. doi: 10.1016/j.ejcon.
2013.09.002.
[103] E. Fosong, P. Schulze, and B. Unger. From time-domain data
to low-dimensional structured models. Technical report, 2019.
https://arxiv.org/abs/1902.05112v1.
[104] S. Fresca and A. Manzoni. POD-DL-ROM: enhancing deep learning-
based reduced order models for nonlinear parametrized PDEs by proper
orthogonal decomposition. Comput. Methods Appl. Mech. Eng., 388:
114181, 2022. doi: 10.1016/j.cma.2021.114181.
[105] S. Fresca, A. Manzoni, L. Dedé, and A. Quarteroni. Deep learning-based
reduced order models in cardiac electrophysiology. PLoS One, 15(10):
e0239416, 2020. doi: 10.1371/journal.pone.0239416.
[106] S. Fresca, L. Dedé, and A. Manzoni. A comprehensive deep learning-
based approach to reduced order modeling of nonlinear time-dependent
parametrized PDEs. J. Sci. Comput., 87:61, 2021. doi: 10.1007/
s10915-021-01462-7.
[107] S. Fresca, A. Manzoni, L. Dedé, and A. Quarteroni. POD-enhanced
deep learning-based reduced order models for the real-time simulation of
cardiac electrophysiology in the left atrium. Front. physiol., 12:679076,
2021. doi: 10.3389/fphys.2021.679076.
[108] R. W. Freund. Recent advances in structure-preserving model order
reduction. In P. Li, L. M. Silveira, and P. Feldmann, editors, Sim-
ulation and Verification of Electronic and Biological Systems, pages
43–70. Springer Netherlands, Dordrecht, The Netherlands, 2011. doi:
10.1007/978-94-007-0149-6_3.
256
Bibliography
[109] K. Fujimoto. Balanced realization and model order reduction for port-
Hamiltonian systems. J. Syst. Des. Dyn., 2(3):694–702, 2008. doi: 10.
1299/jsdd.2.694.
[110] K. Fujimoto and J. M. A. Scherpen. Nonlinear balanced realization based
on singular value analysis of Hankel operators. In 42nd IEEE Interna-
tional Conference on Decision and Control, pages 6072–6077, Maui, HI,
USA, 2003. doi: 10.1109/CDC.2003.1272220.
[111] W. Gander, M. J. Gander, and F. Kwok. Scientific Computing. Springer
International Publishing, Cham, Switzerland, 2014. doi: 10.1007/
978-3-319-04325-8.
[112] L. Gao, W. Mei, M. Kleeberger, H. Peng, and J. Fottner. Modeling
and discretization of hydraulic actuated telescopic boom system in port-
Hamiltonian formulation. In SIMULTECH 2019: Proceedings of the 9th
International Conference on Simulation and Modeling Methodologies,
Technologies and Applications, pages 69–79, Prague, Czech Republic,
2019. doi: 10.5220/0007832100690079.
[113] R. Geelen, S. Wright, and K. Willcox. Operator inference for non-
intrusive model reduction with nonlinear manifolds. Technical report,
2022. https://arxiv.org/abs/2205.02304.
[114] C. F. Gerald and P. O. Wheatley. Applied Numerical Analysis. Pearson
Education, London, UK, seventh edition, 2004.
[115] J.-F. Gerbeau and D. Lombardi. Approximated Lax pairs for the reduced
order integration of nonlinear evolution equations. J. Comput. Phys.,
265:246–269, 2014. doi: 10.1016/j.jcp.2014.01.047.
[116] H. Gernandt, F. E. Haller, T. Reis, and A. J. van der Schaft. Port-
Hamiltonian formulation of nonlinear electrical circuits. J. Geom. Phys.,
159:103959, 2021. doi: 10.1016/j.geomphys.2020.103959.
[117] M. Giftthaler, T. Wolf, H. K. F. Panzer, and B. Lohmann. Para-
metric model order reduction of port-Hamiltonian systems by ma-
trix interpolation. Automatisierungstechnik, 62(9):619–628, 2014. doi:
10.1515/auto-2013-1072.
[118] B. H. Gilding and R. Kersner. Travelling Waves in Nonlinear Diffusion-
Convection Reaction. Springer Basel, Switzerland, 2004. doi: 10.1007/
978-3-0348-7964-4.
[119] S. Glavaski, J. E. Marsden, and R. M. Murray. Model reduction, cen-
tering, and the Karhunen–Loeve expansion. In Proceedings of the 37th
IEEE Conference on Decision and Control, pages 2071–2076, Tampa,
FL, USA, 1998. doi: 10.1109/CDC.1998.758639.
257
Bibliography
[120] G. Golub and V. Pereyra. The differentiation of pseudo-inverses and
nonlinear least squares problems whose variables separate. SIAM J.
Numer. Anal., 10(2):413–432, 1973. doi: 10.1137/0710036.
[121] O. Gonzalez. Time integration and discrete Hamiltonian systems. J.
Nonlinear Sci., 6:449–467, 1996. doi: 10.1007/BF02440162.
[122] J. A. T. Gray, M. Lemke, J. Reiss, C. O. Paschereit, J. Sesterhenn,
and J. P. Moeck. A compact shock-focusing geometry for detonation
initiation: experiments and adjoint-based variational data assimilation.
Combust. Flame, 183:144–156, 2017. doi: 10.1016/j.combustflame.2017.
03.014.
[123] C. Greif and K. Urban. Decay of the Kolmogorov N-width for wave
problems. Appl. Math. Lett., 96:216–222, 2019. doi: 10.1016/j.aml.2019.
05.013.
[124] C. Greif, P. Junk, and K. Urban. Linear/Ridge expansions: enhancing
linear approximations by ridge functions. Adv. Comput. Math., 48:15,
2022. doi: 10.1007/s10444-022-09936-4.
[125] S. Grundel and M. Herty. Model-order reduction for hyperbolic re-
laxation systems. Int. J. Nonlinear Sci. Numer. Simul., 2022. doi:
10.1515/ijnsns-2021-0192.
[126] C. Gu. QLMOR: A projection-based nonlinear model order reduction ap-
proach using quadratic-linear representation of nonlinear systems. IEEE
Trans. Comput.-Aided Design Integr. Circuits Syst., 30(9):1307–1320,
2011. doi: 10.1109/TCAD.2011.2142184.
[127] M. Gubisch and S. Volkwein. Proper orthogonal decomposition for linear-
quadratic optimal control. In P. Benner, M. Ohlberger, A. Cohen, and
K. Willcox, editors, Model Reduction and Approximation, pages 3–63.
Society for Industrial and Applied Mathematics, Philadelphia, PA, USA,
2017. doi: 10.1137/1.9781611974829.ch1.
[128] C. Güdücü, J. Liesen, V. Mehrmann, and D. B. Szyld. On non-Hermitian
positive (semi)definite linear algebraic systems arising from dissipative
Hamiltonian DAEs. SIAM J. Sci. Comput., 44(4):A2871–A2894, 2022.
doi: 10.1137/21M1458594.
[129] S. Gugercin, A. C. Antoulas, and C. Beattie. H2model reduction for
large-scale linear dynamical systems. SIAM J. Matrix Anal. Appl., 30
(2):609–638, 2008. doi: 10.1137/060666123.
[130] S. Gugercin, R. V. Polyuga, C. Beattie, and A. van der Schaft.
Structure-preserving tangential interpolation for model reduction of
258
Bibliography
port-Hamiltonian systems. Automatica, 48(9):1963–1974, 2012. doi:
10.1016/j.automatica.2012.05.052.
[131] B. Haasdonk. Reduced basis methods for parametrized PDEs A tuto-
rial introduction for stationary and instationary problems. In P. Benner,
M. Ohlberger, A. Cohen, and K. Willcox, editors, Model Reduction and
Approximation, pages 65–136. Society for Industrial and Applied Math-
ematics, Philadelphia, PA, USA, 2017. doi: 10.1137/1.9781611974829.
ch2.
[132] B. Haasdonk, M. Dihlmann, and M. Ohlberger. A training set and multi-
ple bases generation approach for parameterized model reduction based
on adaptive grids in parameter space. Math. Comput. Model. Dyn. Syst.,
17(4):423–442, 2011. doi: 10.1080/13873954.2011.547674.
[133] E. Hairer and G. Wanner. Solving Ordinary Differential Equations II:
Stiff and Differential-Algebraic Problems. Springer Berlin Heidelberg,
Germany, second edition, 1996. doi: 10.1007/978-3-642-05221-7.
[134] J. S. Hale, E. Schenone, D. Baroli, L. A. A. Beex, and S. P. A. Bordas.
A hyper-reduction method using adaptivity to cut the assembly costs of
reduced order models. Comput. Methods Appl. Mech. Eng., 380:113723,
2021. doi: 10.1016/j.cma.2021.113723.
[135] D. Hartman and L. K. Mestha. A deep learning framework for model
reduction of dynamical systems. In IEEE Conference on Control Tech-
nology and Applications (CCTA), pages 1917–1922, Kohala Coast, HI,
USA, 2017. doi: 10.1109/CCTA.2017.8062736.
[136] P. Hartman. Ordinary Differential Equations. Society for Industrial and
Applied Mathematics, Philadelphia, PA, USA, second edition, 2002. doi:
10.1137/1.9780898719222.
[137] C. Hartmann, V.-M. Vulcanov, and C. Schütte. Balanced truncation of
linear second-order systems: a Hamiltonian approach. Multiscale Model.
Simul., 8(4):1348–1367, 2010. doi: 10.1137/080732717.
[138] S.-A. Hauschild, N. Marheineke, and V. Mehrmann. Model reduction
techniques for linear constant coefficient port-Hamiltonian differential-
algebraic systems. Control Cybern., 48(1):125–152, 2019.
[139] S.-A. Hauschild, N. Marheineke, V. Mehrmann, J. Mohring,
A. Moses Badlyan, M. Rein, and M. Schmidt. Port-Hamiltonian mod-
eling of district heating networks. In Progress in Differential-Algebraic
Equations II, pages 333–355, Paderborn, Germany, 2020. doi: 10.1007/
978-3-030-53905-4_11.
259
Bibliography
[140] J. A. Hernández, M. A. Caicedo, and A. Ferrer. Dimensional hyper-
reduction of nonlinear finite element models via empirical cubature.
Comput. Methods Appl. Mech. Eng., 313:687–722, 2017. doi: 10.1016/j.
cma.2016.10.022.
[141] J. S. Hesthaven, B. Stamm, and S. Zhang. Efficient greedy algorithms
for high-dimensional parameter spaces with applications to empirical in-
terpolation and reduced basis methods. ESAIM Math. Model. Numer.
Anal., 48:259–283, 2014. doi: 10.1051/m2an/2013100.
[142] J. S. Hesthaven, G. Rozza, and B. Stamm. Certified Reduced Ba-
sis Methods for Parametrized Partial Differential Equations. Springer
International Publishing, Cham, Switzerland, 2016. doi: 10.1007/
978-3-319-22470-1.
[143] J. S. Hesthaven, C. Pagliantini, and N. Ripamonti. Rank-adaptive
structure-preserving model order reduction of Hamiltonian systems.
ESAIM Math. Model. Numer. Anal., 56(2):617–650, 2022. doi: 10.1051/
m2an/2022013.
[144] J. S. Hesthaven, C. Pagliantini, and G. Rozza. Reduced basis methods
for time-dependent problems. Acta Numer., 31:265–345, 2022. doi: 10.
1017/S0962492922000058.
[145] D. Hinrichsen and A. J. Pritchard. An improved error estimate for
reduced-order models of discrete-time systems. IEEE Trans. Automat.
Control, 35(3):317–320, 1990. doi: 10.1109/9.50345.
[146] H. Hoang, F. Couenne, C. Jallut, and Y. Le Gorrec. The port Hamil-
tonian approach to modeling and control of continuous stirred tank
reactors. Journal of Process Control, 21(10):1449–1458, 2011. doi:
10.1016/j.jprocont.2011.06.014.
[147] J. D. Hoffman. Numerical Methods for Engineers and Scientists. Marcel
Dekker, Inc., New York, NY, USA, second edition, 2001. doi: 10.1201/
9781315274508.
[148] P. Holmes, J. L. Lumley, G. Berkooz, and C. W. Rowley. Turbulence, Co-
herent Structures, Dynamical Systems and Symmetry. Cambridge Uni-
versity Press, Cambridge, UK, 2012. doi: 10.1017/CBO9780511919701.
[149] E. Hopf. The partial differential equation ut+uux=µuxx.Commun.
Pure Appl. Math., 3(3):201–230, 1950. doi: 10.1002/cpa.3160030302.
[150] R. A. Horn and C. R. Johnson. Matrix Analysis. Cambridge University
Press, New York, NY, USA, second edition, 2013.
260
Bibliography
[151] P. Houston, S. Roggendorf, and K. G. van der Zee. Eliminating Gibbs
phenomena: a non-linear Petrov–Galerkin method for the convection-
diffusion-reaction equation. Comput. Math. Appl., 80(5):851–873, 2020.
doi: 10.1016/j.camwa.2020.03.025.
[152] A. Iollo and D. Lombardi. Advection modes by optimal mass transfer.
Phys. Rev. E, 89(2):022923, 2014. doi: 10.1103/PhysRevE.89.022923.
[153] T. C. Ionescu and A. Astolfi. Families of moment matching based, struc-
ture preserving approximations for linear port Hamiltonian systems. Au-
tomatica, 49(8):2424–2434, 2013. doi: 10.1016/j.automatica.2013.05.006.
[154] O. Issan and B. Kramer. Predicting solar wind streams from the inner-
heliosphere to Earth via shifted operator inference. Technical report,
2022. https://arxiv.org/abs/2203.13372.
[155] R. V. Kadison and J. R. Ringrose. Fundamentals of the Theory of Op-
erator Algebras. Volume 1: Elementary Theory. Academic Press, New
York, NY, USA, 1983.
[156] M. Kahlbacher and S. Volkwein. Galerkin proper orthogonal decomposi-
tion methods for parameter dependent elliptic systems. Discuss. Math.
Differ. Incl. Control Optim., 27(1):95–117, 2007. doi: 10.7151/dmdico.
1078.
[157] K. Kashima. Nonlinear model reduction by deep autoencoder of noise
response data. In 55th IEEE Conference on Decision and Control
(CDC), pages 5750–5755, Las Vegas, USA, 2016. doi: 10.1109/CDC.
2016.7799153.
[158] T. Kato. Perturbation Theory for Linear Operators. Springer-Verlag
Berlin Heidelberg, Germany, 1995. doi: 10.1007/978-3-642-66282-9.
[159] L. Kaufman and V. Pereyra. A method for separable nonlinear least
squares problems with separable nonlinear equality constraints. SIAM
J. Numer. Anal., 15(1):12–20, 1978. doi: 10.1137/0715002.
[160] Y. Kawano and J. M. A. Scherpen. Structure preserving truncation for
linear port Hamiltonian systems. In 22nd International Symposium on
Mathematical Theory of Networks and Systems, pages 219–224, Min-
neapolis, MN, USA, 2016.
[161] Y. Kawano and J. M. A. Scherpen. Structure preserving truncation of
nonlinear port Hamiltonian systems. IEEE Trans. Automat. Control, 63
(12):4286–4293, 2018. doi: 10.1109/TAC.2018.2811787.
[162] H. K. Khalil. Nonlinear Systems. Prentice Hall, Upper Saddle River,
NJ, USA, third edition, 2002.
261
Bibliography
[163] P. Khodabakhshi and K. Willcox. Non-intrusive data-driven model re-
duction for differential-algebraic equations derived from lifting transfor-
mations. Comput. Methods Appl. Mech. Eng., 389:114296, 2022. doi:
10.1016/j.cma.2021.114296.
[164] Y. Kim, Y. Choi, D. Widemann, and T. Zohdi. A fast and accu-
rate physics-informed neural network reduced order model with shal-
low masked autoencoder. J. Comput. Phys., 451:110841, 2022. doi:
10.1016/j.jcp.2021.110841.
[165] D. E. Knuth. The Art of Computer Programming Volume 3: Sorting
and Searching. Addison–Wesley, Boston, MA, USA, second edition, 1998.
[166] L. Koller, W. Witteveen, F. Pichler, and P. Fischer. A general hyper-
reduction strategy for finite element structures with nonlinear surface
loads based on the calculus of variations and stress modes. Comput.
Methods Appl. Mech. Eng., 379:113744, 2021. doi: 10.1016/j.cma.2021.
113744.
[167] A. Kolmogoroff. Über die beste Annäherung von Funktionen einer
gegebenen Funktionenklasse. Ann. Math., 37(1):107–110, 1936. doi:
10.2307/1968691.
[168] P. Kotyczka. Numerical Methods for Distributed Parameter Port-
Hamiltonian Systems. TUM.University Press, Munich, Germany, 2019.
doi: 10.14459/2019md1510230.
[169] P. Kotyczka and L. Lefèvre. Discrete-time port-Hamiltonian systems: a
definition based on symplectic integration. Systems Control Lett., 133:
104530, 2019. doi: 10.1016/j.sysconle.2019.104530.
[170] B. Kågström. Bounds and perturbation bounds for the matrix exponen-
tial. BIT Numer. Math., 17:39–57, 1977. doi: 10.1007/BF01932398.
[171] P. Krah, S. Büchholz, M. Häringer, and J. Reiss. Front trans-
port reduction for complex moving fronts. Technical report, 2022.
https://arxiv.org/abs/2202.08208v1.
[172] B. Kramer and K. Willcox. Nonlinear model order reduction via lifting
transformations and proper orthogonal decomposition. AIAA J., 57(6):
2297–2307, 2019. doi: 10.2514/1.J057791.
[173] A. Krishna and J. Schiffer. A port-Hamiltonian approach to modeling
and control of an electro-thermal microgrid. IFAC-PapersOnLine, 54
(19):287–293, 2021. doi: 10.1016/j.ifacol.2021.11.092.
262
Bibliography
[174] R. Kumar, C. Da Silva, O. Akalin, A. Y. Aravkin, H. Mansour,
B. Recht, and F. J. Herrmann. Efficient matrix completion for seis-
mic data reconstruction. Geophysics, 80(5):V97–V114, 2015. doi:
10.1190/geo2014-0369.1.
[175] P. Kunkel and V. Mehrmann. Differential-Algebraic Equations Analysis
and Numerical Solution. EMS Publishing House Zürich, Switzerland,
2006. doi: 10.4171/017.
[176] S. Lall, P. Krysl, and J. E. Marsden. Structure-preserving model re-
duction for mechanical systems. Physica D, 184:304–318, 2003. doi:
10.1016/S0167-2789(03)00227-6.
[177] L. D. Landau and E. M. Lifshitz. Fluid Mechanics. Pergamon Press,
Oxford, UK, second edition, 1987. doi: 10.1016/C2013-0-03799-1.
[178] P. D. Lax. Linear Algebra and its Applications. John Wiley & Sons, Inc.,
Hoboken, NJ, USA, second edition, 2007.
[179] K. Lee and K. T. Carlberg. Model reduction of dynamical systems on
nonlinear manifolds using deep convolutional autoencoders. J. Comput.
Phys., 404:108973, 2019. doi: 10.1016/j.jcp.2019.108973.
[180] J. Li and W. Zhang. The performance of proper orthogonal decomposi-
tion in discontinuous flows. Theoretical and Applied Mechanics Letters,
6(5):236–243, 2016. doi: 10.1016/j.taml.2016.08.008.
[181] R.-C. Li and Z. Bai. Structure-preserving model reduction using a Krylov
subspace projection formulation. Comm. Math. Sci., 3(2):179–199, 2005.
doi: 10.4310/CMS.2005.v3.n2.a6.
[182] M. Lohmayer, P. Kotyczka, and S. Leyendecker. Exergetic port-
Hamiltonian systems: modelling basics. Math. Comput. Model. Dyn.
Syst., 27(1):489–521, 2021. doi: 10.1080/13873954.2021.1979592.
[183] H. Lu and D. M. Tartakovsky. Lagrangian dynamic mode decompo-
sition for construction of reduced-order models of advection-dominated
phenomena. J. Comput. Phys., 407:109229, 2020. doi: 10.1016/j.jcp.
2020.109229.
[184] D. J. Lucia. Reduced order modeling for high speed flows with moving
shocks. PhD thesis, Air Force Institute of Technology, Wright-Patterson
Air Force Base, OH, USA, 2001.
[185] A. Macchelli. Towards a port-based formulation of macro-economic sys-
tems. J. Frankl. Inst., 351(12):5235–5249, 2014. doi: 10.1016/j.jfranklin.
2014.09.009.
263
Bibliography
[186] A. Macchelli and C. Melchiorri. Modeling and control of the Timoshenko
beam. The distributed port Hamiltonian approach. SIAM J. Control
Optim., 43(2):743–767, 2004. doi: 10.1137/S0363012903429530.
[187] T. Malzer, H. Rams, and M. Schöberl. On structural invariants in the
energy-based in-domain control of infinite-dimensional port-Hamiltonian
systems. Systems Control Lett., 145:104778, 2020. doi: 10.1016/j.
sysconle.2020.104778.
[188] J. Mandel, L. S. Bennethum, J. D. Beezley, J. L. Coen, C. C. Douglas,
M. Kim, and A. Vodacek. A wildland fire model with data assimilation.
Math. Comput. Simul., 79(3):584–606, 2008. doi: 10.1016/j.matcom.
2008.03.015.
[189] B. M. Maschke and A. J. van der Schaft. Port-controlled Hamiltonian
systems: modelling origins and systemtheoretic properties. IFAC Pro-
ceedings Volumes, 25(13):359–365, 1992. doi: 10.1016/S1474-6670(17)
52308-3.
[190] H. G. Matthies and M. Meyer. Nonlinear Galerkin methods for the
model reduction of nonlinear dynamical systems. Comput. Struct., 81
(12):1277–1286, 2003. doi: 10.1016/S0045-7949(03)00042-7.
[191] R. Maulik, B. Lusch, and P. Balaprakash. Reduced-order modeling
of advection-dominated systems with recurrent neural networks and
convolutional autoencoders. Phys. Fluids, 33(3):037106, 2021. doi:
10.1063/5.0039986.
[192] R. I. McLachlan, G. R. W. Quispel, and N. Robidoux. Geometric inte-
gration using discrete gradients. Phil. Trans. R. Soc. Lond., 357(1754):
1021–1045, 1999. doi: 10.1098/rsta.1999.0363.
[193] C. Mehl, V. Mehrmann, and P. Sharma. Stability radii for linear
Hamiltonian systems with dissipation under structure-preserving per-
turbations. SIAM J. Matrix Anal. Appl., 37(4):1625–1654, 2016. doi:
10.1137/16M1067330.
[194] C. Mehl, V. Mehrmann, and M. Wojtylak. Linear algebra properties
of dissipative Hamiltonian descriptor systems. SIAM J. Matrix Anal.
Appl., 39(3):1489–1519, 2018. doi: 10.1137/18M1164275.
[195] C. Mehl, V. Mehrmann, and M. Wojtylak. Distance problems for dis-
sipative Hamiltonian systems and related matrix polynomials. Linear
Algebra Appl., 623:335–366, 2021. doi: 10.1016/j.laa.2020.05.026.
[196] V. Mehrmann and R. Morandin. Structure-preserving discretization for
port-Hamiltonian descriptor systems. In Proceedings of the 58th IEEE
264
Bibliography
Conference on Decision and Control, pages 6863–6868, Nice, France,
2019. doi: 10.1109/CDC40024.2019.9030180.
[197] V. Mehrmann and B. Unger. Control of port-Hamiltonian
differential-algebraic systems and applications. Technical report, 2022.
https://arxiv.org/abs/2201.06590v1.
[198] A. Meister and J. Struckmeier. Hyperbolic Partial Differential Equations.
Friedr. Vieweg & Sohn Verlagsgesellschaft, Wiesbaden, Germany, 2002.
doi: 10.1007/978-3-322-80227-9.
[199] A. Mendible, S. L. Brunton, A. Y. Aravkin, W. Lowrie, and J. N. Kutz.
Dimensionality reduction and reduced-order modeling for traveling wave
physics. Theor. Comput. Fluid Dyn., 34:385–400, 2020. doi: 10.1007/
s00162-020-00529-9.
[200] A. Mendible, J. Koch, H. Lange, S. L. Brunton, and J. N. Kutz. Data-
driven modeling of rotating detonation waves. Phys. Rev. Fluids, 6:
050507, 2021. doi: 10.1103/PhysRevFluids.6.050507.
[201] A. Mendible, W. Lowrie, S. L. Brunton, and J. N. Kutz. Data-driven
modeling of two-dimensional detonation wave fronts. Wave Motion, 109:
102879, 2022. doi: 10.1016/j.wavemoti.2021.102879.
[202] K. Miller. Moving finite elements. II. SIAM J. Numer. Anal., 18(6):
1033–1057, 1981. doi: 10.1137/0718071.
[203] K. Miller and R. N. Miller. Moving finite elements. I. SIAM J. Numer.
Anal., 18(6):1019–1032, 1981. doi: 10.1137/0718070.
[204] M. A. Mirhoseini and M. J. Zahr. Model reduction of convection-
dominated partial differential equations via optimization-
based implicit feature tracking. Technical report, 2021.
https://arxiv.org/abs/2109.14694v1.
[205] R. Mojgani and M. Balajewicz. Lagrangian basis method for dimen-
sionality reduction of convection dominated nonlinear flows. Technical
report, 2017. https://arxiv.org/abs/1701.04343v1.
[206] N. Monshizadeh, H. L. Trentelman, and M. K. Camlibel. Stability and
synchronization preserving model reduction of multi-agent systems. Sys-
tems Control Lett., 62:1–10, 2013. doi: 10.1016/j.sysconle.2012.10.011.
[207] L. A. Mora, J. I. Yuz, H. Ramirez, and Y. Le Gorrec. A port-
Hamiltonian fluid-structure interaction model for the vocal folds. IFAC-
PapersOnLine, 51(3):62–67, 2018. doi: 10.1016/j.ifacol.2018.06.016.
265
Bibliography
[208] L. A. Mora, Y. Le Gorrec, D. Matignon, H. Ramirez, and J. I. Yuz. On
port-Hamiltonian formulations of 3-dimensional compressible Newtonian
fluids. Phys. Fluids, 33:117117, 2021. doi: 10.1063/5.0067784.
[209] T. Moser and B. Lohmann. A new Riemannian framework for efficient
H2-optimal model reduction of port-Hamiltonian systems. In 59th IEEE
Conference on Decision and Control, pages 5043–5049, Jeju, South Ko-
rea, 2020. doi: 10.1109/CDC42340.2020.9304134.
[210] T. Moser, P. Schwerdtner, V. Mehrmann, and M. Voigt.
Structure-preserving model order reduction for index two
port-Hamiltonian descriptor systems. Technical report, 2022.
https://arxiv.org/abs/2206.03942.
[211] A. Moses Badlyan and C. Zimmer. Operator-GENERIC formulation
of thermodynamics of irreversible processes. Technical report, 2018.
https://arxiv.org/abs/1807.09822.
[212] S. Mowlavi and T. P. Sapsis. Model order reduction for stochastic dy-
namical systems with continuous symmetries. SIAM J. Sci. Comput., 40
(3):A1669–A1695, 2018. doi: 10.1137/17M1126576.
[213] L. Mu and G. Zhang. A domain decomposition model reduction method
for linear convection-diffusion equations with random coefficients. SIAM
J. Sci. Comput., 41(3):A1984–A2011, 2019. doi: 10.1137/18M1170601.
[214] N. J. Nair and M. Balajewicz. Transported snapshot model order re-
duction approach for parametric, steady-state fluid flows containing
parameter-dependent shocks. Int. J. Numer. Methods Eng., 117:1234–
1262, 2018. doi: 10.1002/nme.5998.
[215] Y. M. Nechepurenko. Bounds for the matrix exponential based on the
Lyapunov equation and limits of the Hausdorff set. Comput. Math. Math.
Phys., 42(2):125–134, 2002.
[216] Y. Nesterov. Introductory Lectures on Convex Optimization. Springer
Science+Business Media, New York, NY, USA, 2004. doi: 10.1007/
978-1-4419-8853-9.
[217] N. C. Nguyen and J. Peraire. An efficient reduced-order modeling ap-
proach for non-linear parametrized partial differential equations. Int. J.
Numer. Methods Eng., 76:27–55, 2008. doi: 10.1002/nme.2309.
[218] R. A. Norton and G. R. W. Quispel. Discrete gradient methods for
preserving a first integral of an ordinary differential equation. Discrete
Contin. Dyn. Syst., 34(3):1147–1170, 2014. doi: 10.3934/dcds.2014.34.
1147.
266
Bibliography
[219] M. Ohlberger and S. Rave. Nonlinear reduced basis approximation of
parameterized evolution equations via the method of freezing. C. R.
Math. Acad. Sci. Paris, 351(23–24):901–906, 2013. doi: 10.1016/j.crma.
2013.10.028.
[220] M. Ohlberger and S. Rave. Reduced basis methods: success, limita-
tions and future challenges. In Proceedings of the Conference Algoritmy,
Vysoké Tatry, Slovakia, 2016.
[221] D. P. O’Leary and B. W. Rust. Variable projection for nonlinear least
squares problems. Comput. Optim. Appl., 54(3):579–593, 2013. doi:
10.1007/s10589-012-9492-9.
[222] R. Ortega, A. van der Schaft, F. Castaños, and A. Astolfi. Control by in-
terconnection and standard passivity-based control of port-Hamiltonian
systems. IEEE Trans. Automat. Control, 53(11):2527–2542, 2008. doi:
10.1109/TAC.2008.2006930.
[223] C. Pagliantini. Dynamical reduced basis methods for Hamiltonian
systems. Numer. Math., 148(2):409–448, 2021. doi: 10.1007/
s00211-021-01211-w.
[224] H. Panzer, J. Mohring, R. Eid, and B. Lohmann. Parametric model
order reduction by matrix interpolation. Automatisierungstechnik, 58
(8):475–484, 2010. doi: 10.1524/auto.2010.0863.
[225] G. Payen, D. Matignon, and G. Haine. Modelling and structure-
preserving discretization of Maxwell’s equations as port-Hamiltonian sys-
tem. IFAC-PapersOnLine, 53(2):7581–7586, 2020. doi: 10.1016/j.ifacol.
2020.12.1355.
[226] B. Peherstorfer. Model reduction for transport-dominated problems via
online adaptive bases and adaptive sampling. SIAM J. Sci. Comput., 42
(5):A2803–A2836, 2020. doi: 10.1137/19M1257275.
[227] B. Peherstorfer and K. Willcox. Data-driven operator inference for nonin-
trusive projection-based model reduction. Comput. Methods Appl. Mech.
Eng., 306:196–215, 2016. doi: 10.1016/j.cma.2016.03.025.
[228] L. Peng and K. Mohseni. Symplectic model reduction of Hamiltonian
systems. SIAM J. Sci. Comput., 38(1):A1–A27, 2016. doi: 10.1137/
140978922.
[229] Z. Peng, M. Wang, and F. Li. A learning-based projection method for
model order reduction of transport problems. J. Comput. Appl. Math.,
418:114560, 2022. doi: 10.1016/j.cam.2022.114560.
267
Bibliography
[230] R. V. Polyuga and A. van der Schaft. Structure preserving model reduc-
tion of port-Hamiltonian systems by moment matching at infinity. Au-
tomatica, 46(4):665–672, 2010. doi: 10.1016/j.automatica.2010.01.018.
[231] R. V. Polyuga and A. van der Schaft. Structure preserving moment
matching for port-Hamiltonian systems: Arnoldi and Lanczos. IEEE
Trans. Automat. Control, 56(6):1458–1462, 2011. doi: 10.1109/TAC.
2011.2128650.
[232] R. V. Polyuga and A. van der Schaft. Effort- and flow-constraint
reduction methods for structure preserving model reduction of port-
Hamiltonian systems. Systems Control Lett., 61(3):412–421, 2012. doi:
10.1016/j.sysconle.2011.12.008.
[233] I. Pontes Duff, C. Poussot-Vassal, and C. Seren. Realization indepen-
dent single time-delay dynamical model interpolation and H2-optimal
approximation. In 54th IEEE Conference on Decision and Control, pages
4662–4667, Osaka, Japan, 2015. doi: 10.1109/CDC.2015.7402946.
[234] R. Pulch. Stability-preserving model order reduction for linear stochas-
tic Galerkin systems. J. Math. Ind., 9:10, 2019. doi: 10.1186/
s13362-019-0067-6.
[235] A. Quarteroni, A. Manzoni, and F. Negri. Reduced Basis Methods for
Partial Differential Equations. Springer International Publishing, Cham,
Switzerland, 2016. doi: 10.1007/978-3-319-15431-2.
[236] H. Ramirez, B. Maschke, and D. Sbarbaro. Irreversible port-Hamiltonian
systems: a general formulation of irreversible processes with application
to the CSTR. Chem. Eng. Sci., 89:223–234, 2013.
[237] R. Rashad, F. Califano, A. van der Schaft, and S. Stramigioli. Twenty
years of distributed port-Hamiltonian systems: a literature review. IMA
J. Math. Control Inf., 37(4):1400–1422, 2020. doi: 10.1093/imamci/
dnaa018.
[238] R. Rashad, F. Califano, F. P. Schuller, and S. Stramigioli. Port-
Hamiltonian modeling of ideal fluid flow: part I. Foundations and kinetic
energy. J. Geom. Phys., 164:104201, 2021. doi: 10.1016/j.geomphys.
2021.104201.
[239] T. Reis and M. Heinkenschloss. Model reduction with a-priori error
bounds for a class of nonlinear electrical circuits. In Proceedings of the
48h IEEE Conference on Decision and Control held jointly with 2009
28th Chinese Control Conference, pages 5376–5383, Shanghai, China,
2009. doi: 10.1109/CDC.2009.5400606.
268
Bibliography
[240] J. Reiss. Optimization-based modal decomposition for systems with mul-
tiple transports. SIAM J. Sci. Comput., 43(3):A2079–A2101, 2021. doi:
10.1137/20M1322005.
[241] J. Reiss, P. Schulze, J. Sesterhenn, and V. Mehrmann. The shifted proper
orthogonal decomposition: a mode decomposition for multiple transport
phenomena. SIAM J. Sci. Comput., 40(3):A1322–A1344, 2018. doi:
10.1137/17M1140571.
[242] D. Rim, S. Moe, and R. J. LeVeque. Transport reversal for model reduc-
tion of hyperbolic partial differential equations. SIAM/ASA J. Uncer-
tain. Quantif., 6(1):118–150, 2018. doi: 10.1137/17M1113679.
[243] D. Rim, B. Peherstorfer, and K. T. Mandli. Manifold approximations via
transported subspaces: model reduction for transport-dominated prob-
lems. Technical report, 2020. https://arxiv.org/abs/1912.13024v3.
[244] F. Romor, G. Stabile, and G. Rozza. Non-linear manifold ROM with
convolutional autoencoders and reduced over-collocation method. Tech-
nical report, 2022. https://arxiv.org/abs/2203.00360.
[245] C. W. Rowley and J. E. Marsden. Reconstruction equations and the
Karhunen–Loève expansion for systems with symmetry. Phys. D, 142
(1–2):1–19, 2000. doi: 10.1016/S0167-2789(00)00042-7.
[246] C. W. Rowley, I. G. Kevrekidis, J. E. Marsden, and K. Lust. Reduction
and reconstruction for self-similar dynamical systems. Nonlinearity, 16
(4):1257–1275, 2003. doi: 10.1088/0951-7715/16/4/304.
[247] D. Ryckelynck. A priori hyperreduction method: an adaptive approach.
J. Comput. Phys., 202:346–366, 2005. doi: 10.1016/j.jcp.2004.07.015.
[248] J. Saak and M. Voigt. Model reduction of constrained mechanical sys-
tems in M-M.E.S.S. IFAC-PapersOnLine., 51(2):661–666, 2018. doi:
10.1016/j.ifacol.2018.03.112.
[249] S. Salsa. Partial Differential Equations in Action. Springer-Verlag Italia,
Milan, Italy, 2009. doi: 10.1007/978-88-470-0752-9.
[250] H. Sandberg. An extension to balanced truncation with application to
structured model reduction. IEEE Trans. Automat. Control, 55(4):1038–
1043, 2010. doi: 10.1109/TAC.2010.2041984.
[251] N. Sarna and S. Grundel. Hyper-reduction for parametrized transport
dominated problems via online-adaptive reduced meshes. Technical re-
port, 2021. https://arxiv.org/abs/2003.06362v3.
269
Bibliography
[252] K. Sato. Riemannian optimal model reduction of linear port-Hamiltonian
systems. Automatica, 93:428–434, 2018. doi: 10.1016/j.automatica.2018.
03.051.
[253] G. Scarciotti and A. Astolfi. Model reduction for nonlinear systems and
nonlinear time-delay systems from input/output data. In 54th IEEE
Conference on Decision and Control, pages 7298–7303, Osaka, Japan,
2015. doi: 10.1109/CDC.2015.7403371.
[254] M. Schaller, F. Philipp, T. Faulwasser, K. Worthmann, and B. Maschke.
Control of port-Hamiltonian systems with minimal energy supply. Eur.
J. Control, 62:33–40, 2021. doi: 10.1016/j.ejcon.2021.06.017.
[255] W. H. A. Schilders, H. A. van der Vorst, and J. Rommes. Model Order
Reduction: Theory, Research Aspects and Applications. Springer Berlin
Heidelberg, Germany, 2008. doi: 10.1007/978-3-540-78841-6.
[256] E. Schultz, E. Wintenberger, and J. Shepherd. Investigation of deflagra-
tion to detonation transition for application to pulse detonation engine
ignition systems. In JANNAF 24th Airbreathing Propulsion Subcommit-
tee and 36th Combustion Subcommittee Joint Meeting, pages 175–202,
Cocoa Beach, FL, USA, 1999.
[257] P. Schulze and B. Unger. Data-driven interpolation of dynamical systems
with delay. Systems Control Lett., 97:125–131, 2016. doi: 10.1016/j.
sysconle.2016.09.007.
[258] P. Schulze, B. Unger, C. Beattie, and S. Gugercin. Data-driven
structured realization. Linear Algebra Appl., 537:250–286, 2018. doi:
10.1016/j.laa.2017.09.030.
[259] P. Schulze, J. Reiss, and V. Mehrmann. Model reduction for a pulsed
detonation combuster via shifted proper orthogonal decomposition. In
R. King, editor, Active Flow and Combustion Control 2018, pages 271–
286. Springer International Publishing, Cham, Switzerland, 2019. doi:
10.1007/978-3-319-98177-2_17.
[260] P. Schwerdtner and M. Voigt. Adaptive sampling for structure-
preserving model order reduction of port-Hamiltonian systems. IFAC-
PapersOnLine, 54(19):143–148, 2021. doi: 10.1016/j.ifacol.2021.11.069.
[261] P. Schwerdtner and M. Voigt. SOBMOR: structured optimization-
based model order reduction. Technical report, 2022.
https://arxiv.org/abs/2011.07567.
[262] P. Schwerdtner, T. Moser, V. Mehrmann, and M. Voigt. Structure-
preserving model order reduction for index one port-Hamiltonian descrip-
tor systems. Technical report, 2022. https://arxiv.org/abs/2206.01608.
270
Bibliography
[263] S. Sen. Reduced-basis approximation and a posteriori error estimation
for many-parameter heat conduction problems. Numer. Heat Transf. B:
Fundam., 54:369–389, 2008. doi: 10.1080/10407790802424204.
[264] J. Sesterhenn and A. Shahirpour. A characteristic dynamic mode de-
composition. Theor. Comput. Fluid Dyn., 33(3):281–305, 2019. doi:
10.1007/s00162-019-00494-y.
[265] M. K. Sleeman and M. Yano. Goal-oriented model reduction for param-
etrized time-dependent nonlinear partial differential equations. Comput.
Methods Appl. Mech. Eng., 388:114206, 2022. doi: 10.1016/j.cma.2021.
114206.
[266] B. Sonday, A. Singer, and I. G. Kevrekidis. Noisy dynamic simulations in
the presence of symmetry: data alignment and model reduction. Comput.
Math. Appl., 65(10):1535–1557, 2013. doi: 10.1016/j.camwa.2013.01.024.
[267] D. C. Sorensen. Passivity preserving model reduction via interpolation
of spectral zeros. Systems Control Lett., 54:347–360, 2005. doi: 10.1016/
j.sysconle.2004.07.006.
[268] D. Stadlmayr, W. Witteveen, and W. Steiner. A generalized constraint
reduction method for reduced order MBS models. Multibody Syst. Dyn.,
41:259–274, 2017. doi: 10.1007/s11044-016-9557-0.
[269] T. Taddei. An offline/online procedure for dual norm calculations
of parameterized functionals: empirical quadrature and empirical test
spaces. Adv. Comput. Math., 45:2429–2462, 2019. doi: 10.1007/
s10444-019-09721-w.
[270] T. Taddei. A registration method for model order reduction: data com-
pression and geometry reduction. SIAM J. Sci. Comput., 42(2):A997–
A1027, 2020. doi: 10.1137/19M1271270.
[271] T. Taddei and L. Zhang. Space-time registration-based model reduc-
tion of parameterized one-dimensional hyperbolic PDEs. ESAIM Math.
Model. Numer. Anal., 55(1):99–130, 2021. doi: 10.1051/m2an/2020073.
[272] T. Taddei and L. Zhang. Registration-based model reduction in complex
two-dimensional geometries. J. Sci. Comput., 88:79, 2021. doi: 10.1007/
s10915-021-01584-y.
[273] V. Trenchant, Y. Fares, H. Ramirez, and Y. Le Gorrec. A port-
Hamiltonian formulation of a 2D boundary controlled acoustic system.
IFAC-PapersOnLine, 48(13):235–240, 2015. doi: 10.1016/j.ifacol.2015.
10.245.
271
Bibliography
[274] B. Unger and S. Gugercin. Kolmogorov n-widths for linear dynami-
cal systems. Adv. Comput. Math., 45:2273–2286, 2019. doi: 10.1007/
s10444-019-09701-0.
[275] A. van der Schaft. Port-Hamiltonian systems: Network model-
ing and control of nonlinear physical systems. In H. Irschik and
K. Schlacher, editors, Advanced Dynamics and Control of Structures
and Machines, pages 127–167. Springer Vienna, Austria, 2004. doi:
10.1007/978-3-7091-2774-2_9.
[276] A. van der Schaft. L2-Gain and Passivity Techniques in Nonlinear Con-
trol. Springer International Publishing, Cham, Switzerland, third edi-
tion, 2017. doi: 10.1007/978-3-319-49992-5.
[277] A. van der Schaft. Port-Hamiltonian modeling for control. Annu.
Rev. Control Robot. Auton. Syst., 3:393–416, 2020. doi: 10.1146/
annurev-control-081219-092250.
[278] A. van der Schaft and D. Jeltsema. Port-Hamiltonian systems theory:
an introductory overview. Found. Trends Syst. Control, 1(2–3):173–378,
2014. doi: 10.1561/2600000002.
[279] C. Van Loan. The sensitivity of the matrix exponential. SIAM J. Numer.
Anal., 14(6):971–981, 1977. doi: 10.1137/0714065.
[280] M. Vidyasagar. Nonlinear Systems Analysis. Prentice-Hall, Englewood
Cliffs, NJ, USA, second edition, 1993.
[281] T. V and J. M. A. Scherpen. Port-Hamiltonian modeling of a nonlinear
Timoshenko beam with piezo actuation. SIAM J. Control Optim., 52(1):
493–519, 2014. doi: 10.1137/090774598.
[282] N. M. T. Vu, L. Lefèvre, and B. Maschke. A structured control model
for the thermo-magneto-hydrodynamics of plasmas in tokamaks. Math.
Comput. Model. Dyn. Syst., 22(3):181–206, 2016. doi: 10.1080/13873954.
2016.1154874.
[283] L. Wang, B. Maschke, and A. van der Schaft. Port-Hamiltonian modeling
of non-isothermal chemical reaction networks. J. Math. Chem., 56:1707–
1727, 2018. doi: 10.1007/s10910-018-0882-9.
[284] A. Warsewa, M. Böhm, O. Sawodny, and C. Tarín. A port-Hamiltonian
approach to modeling the structural dynamics of complex systems. Appl.
Math. Model., 89:1528–1546, 2021. doi: 10.1016/j.apm.2020.07.038.
[285] J. H. Wilkinson. Rounding Errors in Algebraic Processes. Prentice Hall,
Englewood Cliffs, NJ, USA, 1994.
272
Bibliography
[286] T. Wolf, B. Lohmann, R. Eid, and P. Kotyczka. Passivity and struc-
ture preserving order reduction of linear port-Hamiltonian systems us-
ing Krylov subspaces. Eur. J. Control, 16(4):401–406, 2010. doi:
10.3166/ejc.16.401-406.
[287] E. Zeidler. Nonlinear Functional Analysis and its Applications II/A:
Linear Monotone Operators. Springer Science+Business Media, New
York, NY, USA, 1990. doi: 10.1007/978-1-4612-0985-0.
[288] X. Zeng, H. Peng, F. Zhou, and Y. Xi. Implementation of regularization
for separable nonlinear least squares problems. Appl. Soft Comput., 60,
2017. doi: 10.1016/j.asoc.2017.07.006.
[289] W. Zhou, Y. Wu, H. Hu, Y. Li, and Y. Wang. Port-Hamiltonian modeling
and IDA-PBC control of an IPMC-actuated flexible beam. Actuators,
10:236, 2021. doi: 10.3390/act10090236.
273