Approximation of stability radii for large-scale dissipative Hamiltonian systems [original]

https://doi.org/10.1007/s10444-020-09763-5

Approximation of stability radii for large-scale

dissipative Hamiltonian systems

Nicat Aliyev1·Volker Mehrmann2·Emre Mengi3

Received: 30 August 2018 / Accepted: 17 January 2020 /

©The Author(s) 2020

Abstract

A linear time-invariant dissipative Hamiltonian (DH) system ˙x=(J −R)Qx, with

a skew-Hermitian J, a Hermitian positive semidefinite R, and a Hermitian posi-

tive definite Q, is always Lyapunov stable and under further weak conditions even

asymptotically stable. By exploiting the characterizations from Mehl et al. (SIAM

J. Matrix Anal. Appl. 37(4), 1625–1654, 2016), we focus on the estimation of two

stability radii for large-scale DH systems, one with respect to non-Hermitian per-

turbations of Rin the form R+BCHfor given matrices B,C, and another with

respect to Hermitian perturbations in the form R+BBH, =H. We propose

subspace frameworks for both stability radii that converge at a superlinear rate in

theory. The one for the non-Hermitian stability radius benefits from the DH structure-

preserving model order reduction techniques, whereas for the Hermitian stability

radius we derive subspaces yielding a Hermite interpolation property between the full

and projected problems. With the proposed frameworks, we are able to estimate the

two stability radii accurately and efficiently for large-scale systems which include a

finite-element model of an industrial disk brake.

Keywords Dissipative hamiltonian system ·Robust stability ·Stability radius ·

Eigenvalue optimization ·Subspace projection ·Structure-preserving subspace

framework ·Hermite interpolation

Mathematics Subject Classiﬁcation (2010) 65F15 ·93D09 ·93A15 ·90C26

Communicated by: Anthony Nouy

Supported by Deutsche Forschungsgemeinschaft via Project ME 40-1 of Priority Program 1897

Calm, Smooth and Smart.

Supported by Deutsche Forschungsgemeinschaft via Project A02 of Sonderforschungsbereich 910.

This article belongs to the Topical Collection: Model reduction of parametrized Systems

Guest Editors: Anthony Nouy, Peter Benner, Mario Ohlberger, Gianluigi Rozza, Karsten Urban and

Karen Willcox

Extended author information available on the last page of the article.

Published online: 6 February 2020

Adv Comput Math (2020) 46: 6

1 Introduction

Linear time-invariant Dissipative Hamiltonian (DH) systems are of the form

˙x=(J −R)Qx, (1.1)

where Q=QH∈Cn×nis a Hermitian positive definite matrix denoted by Q>

0, whereas J=−JH∈Cn×n, and the dissipation matrix R=RH∈Cn×nis

positive semidefinite, denoted by R≥0. DH systems arise as homogeneous parts of

port-Hamiltonian (PH) systems without inputs and outputs.

PH and DH systems play an essential role in many areas of science and engineer-

ing, see e.g. [18,26]. They are automatically Lyapunov stable, i.e., all eigenvalues of

A=(J −R)Q are in the closed left half of the complex plane, and any eigenvalue on

the imaginary axis is semisimple, see [21]. However, DH systems are not necessar-

ily asymptotically stable, since Amay have purely imaginary eigenvalues. Even if a

DH system is asymptotically stable, small perturbations that do not preserve the DH

structure may cause the system to become unstable so that the solution exhibits expo-

nential growth for some initial conditions. Asymptotic stability in the presence of

uncertainty can only be guaranteed when the stability radius of the system is reason-

ably large, i.e., the system is not only asymptotically stable but also reasonably away

from the set of systems with purely imaginary eigenvalues. Hence, an estimation of

the stability radius, which involves an optimization problem over all admissible per-

turbations, is an important ingredient of a proper stability analysis. One concrete real

application where the knowledge of the stability radius is invaluable is the avoidance

of brake squeals, which is presented next.

1.1 A motivating real world application

Disk brake squeal is a well-known problem in mechanical engineering. It occurs

due to self-excited vibrations caused by instability at the pad-rotor interface [1]. In

[12], FE models for disk brakes are derived in the form of second order differential

equations

M¨x+D() ˙x+K()x =f, (1.2)

with large and sparse symmetric coefficient matrices M,D(),andK() ∈Rn×n

depending on the rotational speed >0suchthatM>0, K() > 0andD() ≥0.

The function frepresents a forcing term or control; hence, it may be set as f=

0 for the stability analysis. The incorporation of gyroscopic effects, modeled by a

term G() ˙x, with G() =−G()H, and circulatory effects, modeled by a non-

symmetric term Nx, gives rise to a system

M¨x+(D() +G()) ˙x+(K() +N)x =0. (1.3)

Expressing (1.3) (with f=0) as a first order ODE followed by a straightforward

manipulation yields the system



z=(J −R)Q



z, (1.4)

Adv Comput Math (2020) 46:6

Page 2 of 28

where

J=−G() −K()

K() 0,R=D() N

,Q=M0

0K() −1

z=˙x

x,

z=Q−1z.

(1.5)

In the absence of the circulatory effects, i.e., when N=0, this is a DH system and, as

a result, it is Lyapunov stable and typically it is also asymptotically stable. However,

small circulatory effects, i.e., perturbations by a non-symmetric Nof small norm,

may result in instability. Here, we emphasize that the perturbations due to the matrix

Nare restricted to the finite element nodes on the brake pad and the associated nodes

on the disk. Moreover, the number of such nodes is several orders of magnitude

smaller than the system size.

In modern brakes, damping devices (shims) that increase the damping of the brake

pad are used to avoid resonances between the disk and the brake pad. The uncertain-

ties and the additional damping due to shims mainly affect the matrices D() and

Note that for this brake squeal example leading to the DH system in (1.4)and(1.5),

the matrix Q−1is sparse, whereas Qis usually not. This however does not cause

any difficulty in our approaches; as elaborated on later when we perform numerical

experiments, the ideas we introduce throughout the paper can be modified so that

they are based on Q−1rather than Q.

1.2 Scope

In this paper, we study the stability radii of large-scale DH systems of the form (1.1)

in the presence of perturbations. We typically assume that the DH systems at hand are

sparse; however, the approaches we propose may also be applicable to dense systems

as long as it is feasible to solve a few linear systems of size equal to the order of the

system.

Motivated by the disk brake example, we focus on perturbations in the coefficient

matrix Rof (1.1). In particular, we consider the following two stability radii for a DH

system of the form (1.1) originally introduced in [21], where the notation iRrefers

to the imaginary axis in the complex plane, (A) to the spectrum of a matrix A,and

A2to the spectral norm of A.

Definition 1 Consider a DH system of the form (1.1) and suppose that B∈Cn×m

and C∈Cp×n,n≥m, p, are given restriction matrices.

– The non-Hermitian restricted stability radius r(R;B,C) with respect to pertur-

bations of Runder the fixed restriction matrices B,Cis defined by

r(R;B,C) := inf{2|∈Cm×p,

((J −(R+BC))Q)∩iR=∅}.

Adv Comput Math (2020) 46:6 Page 3 of 28 6

– The Hermitian restricted stability radius with respect to Hermitian perturbations

of Runder the fixed restriction matrix Bis defined by

rHerm(R;B) := inf{2|=H∈Cm×m,

(J −(R+BBH))Q∩iR=∅}.(1.6)

The significance of the two stability radii defined above in the context of the disk

brake example is discussed next.

Example 1 In the brake-squeal example in Section 1.1, it is of interest to know

whether, for given , the norm of the non-symmetric matrix Nis tolerable to pre-

serve the asymptotic stability of the system in (1.4). The relevant stability radius for

a specified is given by

inf {N2|(A(N))∩iR=∅

},(1.7)

where

A(N) := −G() −K()

K() 0−D() 0

+0N

M0

0K() −1

The perturbations in Nonly act on the FE nodes associated with the brake pad.

Hence, the stability radius in (1.7) corresponds to the non-Hermitian restricted stabil-

ity radius r(R;B,C) with the restriction matrices B=Bp0Hand C=0Cp

with blocks Bp,C

passociated with these finite element nodes. On the other hand, the

Hermitian stability radius rHerm(R;B) is associated with restricted Hermitian pertur-

bations with B=Bp0H,C =BHwhen the Hermitian matrix D() is subject

to Hermitian perturbations.

Note that the Hermitian stability radius rHerm(R;B)is finite (see Theorem 3 below

and [21, Corollary 4.10]) and can be arbitrarily small. Additionally, by definition, we

have r(R;B,BH)≤rHerm(R;B). Indeed, these two stability radii can differ con-

siderably, see also [21]. Our main concern is the approximation of these two stability

radii for large-scale DH systems when nm, p. We propose subspace frameworks

that replace the large-scale DH system with a small-scale system whose stability radii

approximate those of the large-scale system usually well.

Contributions The stability radius under non-Hermitian restricted perturbations

r(R;B,C) can be characterized as the reciprocal of the H∞norm of the transfer

function of a linear control system, see in particular (2.1) in the next section. It should

be noted that, due to the DH structure, the singular value function that needs to be

optimized in the H∞-norm computation often has many local maximizers.

For the estimation of r(R;B,C), we propose a DH structure–preserving subspace

framework that converges at a superlinear rate with respect to the subspace dimen-

sion. The idea of the new method is partly inspired by the subspace framework in

[2] applicable for the approximation of the H∞norm of an asymptotically stable

system, but we form the subspaces in a different way by combining it with DH

structure–preserving model order reduction techniques, see [11,15,16,24]. Both the

Adv Comput Math (2020) 46:6

Page 4 of 28

framework in [2] and the one proposed here seem to converge to local maximizers

that are not necessarily global maximizers. However, our numerical experience on

synthetic examples indicates that the DH structure–preserving framework converges

to the global maximizer more frequently compared with the unstructured framework

in [2]. There are other approaches [5–7,10,17,23] for approximating the H∞norm

of a large-scale asymptotically stable system, but none of these employ subspace

projections.

The Hermitian restricted stability radius rHerm(R;B) poses more challenges.

Unlike the characterization for r(R;B,C), the eigenvalue optimization characteri-

zation for rHerm(R;B) (given in Theorem 3) does not appear to be related to any

transfer function. We derive a new subspace projection framework in which Her-

mite interpolation properties between the original and projected objective eigenvalue

functions hold.

Outline The paper is organized as follows. Section 2proposes subspace frameworks

for approximating r(R;B,C), while Section 3concerns rHerm(R;B). For the latter,

even the solution of the small-scale problems is challenging. Hence, we first discuss

how small-scale problems can be solved in Section 3.1. The new structured subspace

framework is then presented in Section 3.2. In Sections 2and 3, we illustrate the

new frameworks by means of numerical experiments on the disk brake example and

several synthetic examples.

A longer version of this article discussing also perturbations in coefficients other

than R, and featuring additional numerical results is available in [4].

2 The non-Hermitian stability radius for large-scale problems

In this section, we study the estimation of the non-Hermitian restricted stability radius

r(R;B,C) for large-scale asymptotically stable DH systems. Our approach operates

on the characterization [21, Theorem 3.3]

r(R;B,C) =1/GH∞,

GH∞:= sup

s∈C+

σmax(G(s)) =max

ω∈Rσmax(G(iω)). (2.1)

for an asymptotically stable DH system of the form (1.1) and restriction matrices

B∈Cn×m,C∈Cp×n. Above, we use the notation

G(s) := CQ(sI −(J −R)Q)−1B, (2.2)

the notation σmax(G(s)) for the largest singular value of G(s),andC+:= {s∈

C|Re(s) > 0}for the open right-half of the complex plane. Note that the second

equality in the second line in (2.1) follows from the maximum modulus principle, as

the matrix (J −R)Q does not have any eigenvalue in C+.

The matrix-valued function G(s) in (2.2) corresponds to the transfer function of

the control system

˙x=

Ax +

Bu, y =

Cx, (2.3)

Adv Comput Math (2020) 46:6 Page 5 of 28 6

where we set 

A:= (J −R)Q,

B:= Band 

C:= CQ. Moreover, the quantity

GH∞is commonly referred as the H∞norm of the transfer function G(s).

2.1 The subspace framework of [2]

Recently, in [2], a subspace framework for the approximation of the H∞norm of a

large-scale asymptotically stable system has been proposed. At iteration k, it requires

two subspaces Vkand Wkof equal dimension, as well as matrices Vkand Wk, respec-

tively, whose columns form orthonormal bases for these subspaces. The state xof

system (2.3) is restricted to Vk, i.e., in (2.3)thestatexis replaced by Vkxk, and it is

imposed that the residual of the differential part after this restriction is orthogonal to

Wk.

This gives rise to the reduced order system



Ek˙xk=

Akxk+

Bku, yk=

Ckxk,(2.4)

with 

Ek:= WH

kVk,

Ak:= WH

k

AVk,

Bk:= WH

k

B,and

Ck:= 

CVk. Then letting

Gk(s) := 

Ck(s 

Ek−

Ak)−1

Bkbe the reduced order transfer function, and assuming

that 

Ekis invertible, a maximizer ωk+1∈arg maxω∈Rσmax(Gk(iω)) can be com-

puted efficiently and reliably if the dimensions of Vk,Wkare small by employing the

globally convergent method in [8,9]. Once such an ωk+1has been determined, then

the subspaces Vkand Wkare expanded into larger subspaces Vk+1and Wk+1in such

a way that the corresponding reduced transfer function Gk+1(s) satisfies the Hermite

interpolation conditions

σmax(G(iωk+1)) =σmax(Gk+1(iωk+1)) ,

dω σmax(G(iω))ω=ωk+1

dω σmax(Gk+1(iω))ω=ωk+1

. (2.5)

Denoting the range space of a matrix by Aby Ran(A),itisshownin[2] that the

inclusions

Ran((iωk+1I−

A)−1

B) ⊆Vk+1,Ran((

C(iωk+1I−

A)−1)H)⊆Wk+1,

ensure that the conditions in (2.5) are satisfied. For instance when m=p,

in practice, we set Vk+1:= orth Vk(iωk+1I−

A)−1

B,andWk+1:=

orth Wk(iωk+1I−

A)−H

CH, where orth(M) denotes a matrix whose columns

form an orthonormal basis for Ran(M). The definitions of the matrices Vk+1,W

k+1

can be modified in a simple way when m= pso that they have equal num-

ber of columns; we refer to [2] for the details. The procedure is then repeated

with the expanded subspaces Vk+1,Wk+1spanned by the columns of Vk+1,W

k+1.

The sequence {ωk}always seems to converge to a local maximizer, but a formal

proof is open at the moment. In [2], assuming that the sequence {ωk}converges

to a local maximizer, it is proven that this sequence converges at a superlinear

rate due to the satisfaction of (2.5)forallk. The solution of the reduced problem

maxω∈Rσmax(Gk(iω)) by the method in [8,9] requires extraction of all eigenval-

ues of 2d×2dpencils, where dis the order of the system, i.e., the dimension of

the subspaces Vk,Wk. Assuming that dis small, this is much cheaper than solving

Adv Comput Math (2020) 46:6

Page 6 of 28

the linear systems (iωk+1I−

A)−1

B,(iωk+1I−

A)−H

CHinvolving the full order

system, which is needed to form Vk+1,W

k+1.

Even though 

A=(J −R)Q has the DH structure, this is in general not true for

the reduced system 

Ak. In the next subsection, we modify the procedure of [2]to

preserve the DH structure.

2.2 A structure-preserving subspace framework

In this subsection, we introduce an interpolating and DH structure–preserving ver-

sion of the subspace projection framework. Structure-preserving subspace projection

methods in the context of model order reduction of large-scale PH and DH systems

have been proposed in [15,16,24]. Our approach is inspired by the following inter-

polation result, see [16, Remark 3]. For this result and the subsequent arguments, we

introduce the matrix-valued function

T(s):= (J −R)Q −sI (2.6)

depending on s∈Cassociated with the DH system in (2.3).

Theorem 1 (Right Tangential Interpolation [16]) Let G(s) be the transfer function

for the system (2.3), and let Gk(s) be the transfer function for the reduced system

(2.4). For given

s∈C,

b∈Cmand a positive integer N,if



s)−

B

b∈Vkfor =1,...,N, (2.7)

and Wkis such that WH

kVk=I, then, denoting the -th derivatives of G(s) and

Gk(s) at the point

swith G()(

s) and G()

k(

s), we have

G()(

s)

b=G()

k(

s)

bfor =0,...,N −1 (2.8)

provided that both 

sI −

Aand 

sI −

Akare invertible.

The characterization of r(R;B,C) involves the maximization of the largest sin-

gular value of the transfer function G(s) on the imaginary axis. In the following, we

make use of Theorem 1 to obtain a reduced order system satisfying the interpolation

conditions (2.8) while retaining the structure.

For a given 

s∈Cand 

b∈Cm, choose Vkto satisfy (2.7). Then define

Wk:= QVk(V H

kQVk)−1,(2.9)

so that

kVk=Iand (WkVH

k)2=WkVH

k,(2.10)

i.e., WkVH

kis an oblique projector onto Ran(QVk). The reduced system matrices



Ak,

Bk,

Ckthen satisfy



Ak=WH

k

AVk=WH

k(J −R)QVk=WH

k(J −R)WkVH

kQVk

=(Jk−Rk)Qk,(2.11)

Adv Comput Math (2020) 46:6 Page 7 of 28 6

where we define

Jk:= WH

kJW

k=−JH

k,R

k:= WH

kRWk=RH

k≥0,

Qk:= VH

kQVk=QH

k>0,(2.12)

and



Ck=

CVk=CQVk=CWkVH

kQVk=CkQk,C

k:= CWk,



Bk=WH

kB=: Bk. (2.13)

Moreover, 

Ek=WH

kVk=I. Using this construction and Theorem 1, we deduce

the following result that indicates particular projection subspaces to achieve Hermite

interpolation between the full and reduced order systems in a structure-preserving

way.

Theorem 2 Consider a linear system as in (2.3)with the transfer function G(s) in

(2.2). Furthermore, for a given point 

s∈Cand a given tangent direction 

b∈Cm,

suppose that Vkis a matrix with orthonormal columns such that



s)−B

b∈Ran(Vk)for =1,...,N

for some positive integer N,whereT(s) is as in (2.6). Define Wkas in (2.9),

Jk,R

k,Q

kas in (2.12), and Bk,C

kas in (2.13). Then the resulting reduced order

system

˙xk=(Jk−Rk)Qkxk+Bku,

yk=CkQkxk

is a DH system with the transfer function

Gk(s) := CkQk(sI −(Jk−Rk)Qk)−1Bk(2.14)

that satisfies

G()(ˆs)ˆ

b=G()

k(ˆs)ˆ

bfor =0...,N −1,

where G()(ˆs),G()

k(ˆs) denote the -th derivatives of G(s), Gk(s) at ˆs.

According to Theorem 2, for a given

s∈C, setting

Vk:= orth T(



s)−1BT(



s)−2B,W

k:= QVk(V H

kQVk)−1,

we obtain G(

s) =Gk(

s) and G(

s) =G

k(

s), and thus the Hermite interpolation

conditions

σmax(G(

s)) =σmax(Gk(

s)), d

dsσmax(G(s))s=

dsσmax(Gk(s))s=

(2.15)

are satisfied. This suggests the greedy subspace framework outlined in Algorithm 1

for the approximation of r(R;B,C).

Adv Comput Math (2020) 46:6

Page 8 of 28

In line 6 of Algorithm 1, at every iteration, the subspace framework computes the

H∞norm of a reduced system, in particular it computes the point iω∗on the imagi-

nary axis where this H∞norm is attained. Then the current left and right subspaces

are expanded in a way so that the resulting reduced system still has the DH structure

and its transfer function Hermite interpolates the original transfer function at iω∗.

Since the Hermite interpolation conditions (2.15) are satisfied at 

s=iω1,...,iωk

at the beginning of iteration k, the analysis in [2] shows a superlinear rate of con-

vergence for the sequence {ωk}, assuming that this sequence converges to a local

maximizer. We remark that σk+1=σmax(G(iωk+1)) due to line 10 of the algorithm

and the Hermite interpolation property. Hence, if the sequence {ωk}converges to a

global maximizer, the reciprocal of the limit of the sequence {σk}yields r(R;B,C).

The most expensive part of Algorithm 1 occurs in lines 2 and 7, where several

linear systems have to be solved. If this is done with a direct solver, for each value

ω∈Rone LU factorization of the matrix T(iω) suffices. For large n, the computa-

tion time is usually dominated by these LU factorizations. In contrast, the reduced

problem in line 6 involves a small system; its solution can be obtained efficiently and

robustly by the algorithm in [8,9].

2.3 Numerical experiments

In this subsection, we illustrate the performance of MATLAB implementations of

Algorithm 1 and the subspace framework from [2] via several numerical experiments

carried out in MATLAB R2017b on a machine with Mac OS 10.12.6, an Intel®

Core™i5-4260U CPU and 4GB RAM.

Algorithm 1 and the subspace framework from [2] are terminated when at least

one of the following three conditions is fulfilled:

Adv Comput Math (2020) 46:6 Page 9 of 28 6

1. The relative distance between ωkand ωk−1is less than a prescribed tolerance,

i.e., |ωk−ωk−1|<ε·|ωk|.

2. Letting fk:= maxω∈Rσmax(Gk(iω)), two consecutive iterates fk,f

k−1are close

enough in a relative sense, i.e., |fk−fk−1|<ε·fk.

3. The number of iterations exceeds a specified integer, i.e., k>k

max.

In all of the presented numerical examples, we use ε=10−12 and kmax =80 for

both Algorithm 1 and the subspace framework from [2].

Both Algorithms seem to converge locally in our experiments. The initial interpo-

lation points affect whether convergence to a global maximizer occurs. A procedure

to choose the initial interpolation points is outlined below. It uses an interval [l,u],

ideally containing a global minimizer. A reasonable strategy is to set l,u equal to

the approximations of the imaginary parts of the eigenvalues of (J −R)Q with the

smallest and largest imaginary parts.

(i) First, we discretize [l,u]into ρequally spaced points, say ω0,1,...,ω

0,ρ,

including the end-points l,u,whereρis specified by the user.

(ii) Then we either approximate the eigenvalues z1,...,z

ρof (J −R)Q closest

to iω0,1,...,iω0,ρ (using eigs in MATLAB), respectively, or otherwise set

zj=iω0,j for j=1,...,ρ.

(iii) Next, we permute z1,...,z

ρinto zj1,...,z

jρwhere {j1,...,j

ρ}={1,...,ρ}

is such that σmax(G(izj1)) ≥···≥σmax(G(izjρ)).

(iv) The interpolation points ω1,...,ω

κemployed initially are chosen as the

imaginary parts of zj1,...,z

jκ, where again κ≤ρis specified by the user.

For the synthetic examples below, we use l=−2000, u=0, ρ=25, κ=10, and,

in part (ii), z1,...,z

ρare set equal to the eigenvalues closest to iω1,...,iωρ.Forthe

disk brake example, we use l=−2·105,u=−1.7 ·105,ρ=15, κ=15, and, in

part (ii), we set zj=iω0,j for j=1,...,ρ. The values of land uemployed in these

examples are chosen based on rough sketches of the largest singular value functions

to be maximized.

2.3.1 Results on synthetic examples

We now present results for 2000 dense linear DH systems with random coefficient

matrices J,R,Q of order 500, and the restriction matrices Band Cchosen as 500×2

and 2 ×500 random matrices; MATLAB code generating such random coefficient

matrices are openly available.1

The relative errors of the values returned by Algorithm 1 and [2, Algorithm

1] for the first 5 random examples are presented in Table 1,wherefBB and

fSF denote the results returned by the Boyd-Balakrishnan (BB) algorithm [8,9]

and the subspace framework (Algorithm 1 or [2, Algorithm 1]). Note that for the

1http://home.ku.edu.tr/∼emengi/software/DH-stabradii-files/randomDH.m

This function needs to be called as [J,R,Q,B,C] = randomDH(500,2,2) to generate such random

matrices.

Adv Comput Math (2020) 46:6

Page 10 of 28

Table 1 Relative errors of the values returned by Algorithm 1 and [2, Algorithm 1] on 5 dense random

DH examples of order 500. The number of subspace iterations, run-time (in seconds) are also listed

|fSF −fBB|/fBB # iterations run-time

Ex. Alg. 1 [2]Alg.1[2]Alg.1[2]

12.13·10−14 1.85 ·10−14 868.86.8

22.98·10−15 5.60 ·10−2446.86.7

35.88·10−15 5.51 ·10−16 637.26.6

43.26·10−15 1.63 ·10−14 9 53 7.3 33.3

51.23·10−14 2.77 ·10−15 7157.27.7

second example, [2, Algorithm 1] returns an estimate with a significant error (high-

lighted in bold face). The unstructured distance to instability [27]((J −R)Q) :=

inf 2|∃∈Cn×n, ((J −R)Q +) ∩iR=∅

for all of these 5 examples

is larger than r(R;B,C) by ten times or even more.

Considering all 2000 examples, the non-Hermitian stability radii are estimated

with relative errors less than the prescribed tolerance 10−12 for 1921 of these exam-

ples by Algorithm 1 and for 1821 by [2, Algorithm 1]. These numbers correspond

to 96.45% and 91.45% accuracy for Algorithms 1 and [2, Algorithm 1], respectively.

Algorithms 1 performs 10.92 subspace iterations on average, smaller than the average

number of iterations 14.95 by [2, Algorithm 1]. The average run-times on this data

set are quite similar, specifically 10.48 and 9.65 s for Algorithms 1 and [2, Algorithm

1], respectively.

Table 2gives more detailed insight into the performance of the algorithms. It

reports the accuracy of the results returned by Algorithm 1 and [2, Algorithm 1], as

well as the average number of iterations and the average run-times, when only those

examples in the data set for which the rank of Ris greater than 14,20,30,40,50,

respectively, are taken into account. (Note that for all examples we have Rank(R) >

14.) We call a result returned by one of the subspace frameworks accurate if the rela-

tive error |fSF −fBB|/fBB is less than the prescribed tolerance 10−12. According to

Table 2 The accuracy of Algorithm 1 and [2, Algorithm 1] with respect to Rank(R) on the DH systems of

order 500. The second column lists the number of examples in the data set that satisfy the rank constraint,

whereas the third column lists the number of examples among these for which the returned result has error

less than the prescribed tolerance. The average number of iterations and the average run-times (in seconds)

are listed in the fifth and sixth columns

Rank # accurate Accuracy % # iter. Run-time

of R> # Ex. Alg. 1 [2]Alg.1[2]Alg.1[2]Alg.1[2]

14 2000 1929 1829 96.45% 91.45% 10.92 14.95 10.48 9.65

20 1853 1807 1744 97.52% 94.12% 10.39 14.13 9.55 8.98

30 1619 1595 1562 98.52% 96.48% 9.75 13.08 8.96 8.43

40 1390 1382 1363 99.42% 98.06% 9.20 12.09 8.61 8.08

50 1162 1157 1141 99.57% 98.19% 8.85 11.39 8.47 7.89

Adv Comput Math (2020) 46:6 Page 11 of 28 6

Related document tools

Academic integrity tools for documents

Plag helps compare academic text against other sources. Identific can complement similarity checking with verification support. They can help teams review documents with more confidence.

plag.ai

the figures in the table, not only the accuracy of the algorithms improves as the rank

of Rincreases, but also the average number of iterations and the average run-time

decrease.

2.3.2 The FE model of a disk brake

The only large-scale computation required by Algorithm 1 (after the preprocessing

step that yield initial interpolation points) is the solution of the linear systems

T(iω) X =Band T(iω)2Y=B(2.16)

at a given ω∈Rin lines 2 and 7, where T(iω) is as in (2.6). For the DH system of

the disk brake in (1.4)and(1.5), the mass matrix Mand the stiffness matrix K()

and thus the matrix Q−1are sparse, but Qturns out to be dense. Trying to invert Q−1

and/or solve a linear system with the coefficient matrix Qis computationally very

expensive and would require full matrix storage.

This difficulty can be avoided by exploiting that

((J −R)Q −iωI)−1B=Q−1((J −R) −iωQ−1)−1B.

Hence, to compute X, Y as in (2.16), we proceed as follows.

1. We first solve ((J −R) −iωQ−1)

X=Bfor 

X,andsetX=Q−1

2. Then we solve ((J −R) −iωQ−1)

Y=Xfor 

Y,andsetY=Q−1

A second observation that further speeds up the computations is the particular struc-

ture of the coefficient matrix ((J −R) −iωQ−1)with N=0. Setting 

M(iω;) :=

iωM +D() +G(),wehave

(J −R) −iωQ−1=−

M(iω;) −K()

K() −iωK() .

Hence, to solve ((J −R) −iωQ−1)Z =Wfor a given W=WT

1WT

2Tand the

unknown Z=ZT

1ZT

2Twith W1,W

2,Z

1,Z

2having all equal number of rows,

we perform a column block permutation and then eliminate the lower left block to

obtain −K() −

M(iω;)

0K() +iω

M(iω;) Z2

Z1=W1

W2−iωW1.

At every subspace iteration, the highest costs arise from the computation of the LU

factorizations of the sparse matrices K() and K() +iω

M(iω;).

We have applied Algorithm 1 to estimate the non-Hermitian stability radius

r(R;B,BT)for a DH system of the form (1.4)and(1.5) resulting from a FE brake

model with N=0. In this example, G(), K(), D(), M ∈R4669×4669 yielding

J,R,Q ∈R9338×9338, whereas B∈R9338×3.

The plot of the computed estimate values for r(R;B,BT)with respect to the rota-

tion speed ∈[2.5,1800]is presented on the left in Fig. 1. It seems that r(R;B,BT)

as a function of exhibits nonsmoothness at several points, but such points in the

figure are usually local or global maximizers. Two notable maximizers where non-

smoothness is present are near =1120 and =1590. Denoting the former of

Adv Comput Math (2020) 46:6

Page 12 of 28

0 200 400 600 800 1000 1200 1400 1600 1800

12 10-3

1100 1105 1110 1115 1120 1125 1130

12 10-3

Fig. 1 (Left) Plot of the approximation to the stability radius r(R;B,BT)by Algorithm for the DH system

specified by (1.4)and(1.5) with N=0 as it varies with the rotation speed ∈[2.5,1800]. (Right) Plot

is zoomed for ∈[1100,1130]

these maximizers with ∗, the plot of r(R;B,BT)for ∈[1100,1130]containing

∗is zoomed on the right-hand plot in Fig. 1. The nonsmoothness here is due to the

fact that σmax(BTQ(iωI −(J −R)Q)−1B) as a function of ωhas multiple global

maximizers for the particular parameter value ∗. A similar remark applies to several

other points where nonsmoothess is present.

The estimated values for r(R;B,BT)are listed in Table 3for some values of

. For comparison purposes, we also estimate r(R;B,BT)by employing [2, Algo-

rithm 1]. (This requires the solution of T(iω)HY=CHinstead of the solution

of T(iω)2Y=Bin (2.16). It is straightforward to extend the approach discussed

above for the efficient solution of T(iω)X =Bto this setting.) The two algorithms

return the same values of r(R;B,BT)up to prescribed tolerances for all values in

the table. In all cases, 2 subspace iterations are sufficient to achieve the prescribed

accuracy for Algorithm 1. Both algorithms in the end replace the full problem of

order 9338 with considerably smaller reduced systems, in particular of order 102 for

Algorithm 1.

Table 3 Estimated values for the stability radii r(R;B,BT)in the second column by Algorithm 1 and

[2, Algorithm 1] for a few values for the system originating from the FE brake model. The other

columns display ω∗corresponding to arg maxωσmax(BTQ(iωI −(J −R)Q)−1B), the number of subspace

iterations, total run-time (in seconds) and subspace dimension at termination

r(R;B,BT)# iter. Run-time Dimension

Alg. 1 & [2]ω∗Alg. 1 [2]Alg.1[2]Alg.1[2]

5 0.00893 −1.741 ·1052 10 30.9 46.8 102 75

50 0.00904 −1.741 ·1052 13 31.4 55.1 102 84

300 0.00752 −1.742 ·1052 3 31.5 31.2 102 54

1100 0.00834 −1.789 ·1052 7 30.8 39.7 102 66

1200 0.00407 −1.742 ·1052 8 30.9 41.9 102 69

Adv Comput Math (2020) 46:6 Page 13 of 28 6

3 Estimation of the hermitian restricted stability radius

In this section, we focus on Hermitian structured perturbations in the dissipation

matrix, in particular approximation of rHerm(R;B) as in (1.6). The algorithms

we propose exploit the following characterization of rHerm(R;B) established in

[21, Theorem 4.9], which is originally presented in terms of an orthonormal basis

U(λ)for the kernel of (I −BB+)T (λ),whereT(λ)is as in (2.6)andB+denotes the

Moore-Penrose pseudoinverse of B. However, it turns out that U(λ) does not have to

be orthonormal, rather the theorem can be stated in terms of any basis for the kernel

of (I −BB+)T (λ). In our version, given in Theorem 3 below, we employ a particular

basis that simplifies the formulas and facilitates the computation.

Theorem 3 For an asymptotically stable DH system of the form (1.1), and a

restriction matrix B∈Cn×mof full column rank, let

1. T(λ)in (2.6)at a given λ∈Cbe such that T(λ)is invertible,

2. L(λ) be a lower triangular Cholesky factor of



H0(λ) := (BHQT (λ)−1B)H(BHQT (λ)−1B), (3.1)

i.e., L(λ) is a lower triangular matrix satisfying 

H0(λ) =L(λ)L(λ)H,

3. H0(λ) := L(λ)−1L(λ)−H,

4. H1(λ) := i(

H1(λ) −

H1(λ)H), with 

H1(λ) :=

L(λ)−1(BHQT (λ)−1B)HL(λ)−H.

Then rHerm(R;B) is finite, and given by

rHerm(R;B) =inf

ω∈Rsup

t∈R

λmin(H0(iω) +tH1(iω))1/2

where λmin(·)denotes the smallest eigenvalue of its Hermitian matrix argument, and

the inner supremum is finite if and only if H1(iω) is indefinite.

We first describe a numerical technique for small-scale problems in Section 3.1

and then develop a subspace framework in Section 3.2, both based on the eigenvalue

optimization characterization of rHerm(R;B) in Theorem 3. Note that in this section,

Bis always assumed to be of full column rank.

3.1 An algorithm for small-scale problems

The eigenvalue optimization characterization of rHerm(R;B) is a min-max prob-

lem, where the inner maximization problem is concave; indeed, it can alternatively

be expressed as a semidefinite program (SDP). Formally, for a given ω∈R,and

H0(iω), H1(iω) representing the Hermitian matrices defined in Theorem 3, let us

consider

ηHerm(R;B,iω) := sup

t∈R

λmin(H0(iω) +tH1(iω))

=sup{z|z, t ∈Rs.t. H0(iω) +tH1(iω) −zI ≥0}(3.2)

Adv Comput Math (2020) 46:6

Page 14 of 28

so that

rHerm(R;B) =inf

ω∈RηHerm(R;B,iω). (3.3)

In the following subsections, we solve the minimization problem inside the square

root in (3.3).

3.1.1 Inner maximization problems

The characterization in the second line of (3.2) is a linear convex SDP. Some of

the most widely used techniques to solve a linear convex SDP are different forms

of interior-point methods. Implementations of some of these by the interior-point

methods available through the package cvx [13,14].

An alternative is to employ the software package eigopt [22] for the eigenvalue

optimization problem in the first line of (3.2). This software package is based on

an approach that approximates the eigenvalue function with a piecewise quadratic

function that lies globally above the eigenvalue function. It requires a global upper

bound for the second derivative of the eigenvalue function, which can be chosen as

any slightly positive real number (e.g., 10−6) for the concave eigenvalue function in

the first line of (3.2).

In our experience, eigopt performs the computation of ηHerm(R;B,iω) signifi-

cantly faster than cvx. The only downside is that an interval containing the optimal

tfor (3.2) must be supplied to eigopt, whereas such an interval is not needed by

cvx. In practice, we employ eigopt due to its efficiency.

3.1.2 Outer minimization problems

The minimum of ηHerm(R;B,iω) with respect to ω∈Ryields the square of the

radius rHerm(R;B), and it turns out that the minimizing ω∈Rcorresponds to the

point iωthat first becomes an eigenvalue on the imaginary axis under the smallest

perturbation possible [21, Theorem 4.9]. This is a nonconvex optimization problem.

Indeed, the objective ηHerm(R;B,iω) mayevenblowupatsomeω.

We again resort to eigopt for the minimization of ηHerm(R;B,iω).Forthesake

of completeness, a formal description is provided in Algorithm 2 below, where we

use the abbreviations

ηj:= ηHerm(R;B,iωj)and η

j:= d

dω ηHerm(R;B,iω)ω=ωj

Every convergent subsequence of the sequence {ωk}in Algorithm 2 is guaranteed

to converge to a global minimizer of ηHerm(R;B,iω) provided γis chosen suffi-

ciently small. Moreover, by continuity, it also follows that ηHerm(R;B,iωk)in the

limit approaches rHerm(R;B) for all γvalues sufficiently small. In our experiments,

modestly small values of γ,suchasγ=−20, seem to lead to convergence.

Adv Comput Math (2020) 46:6 Page 15 of 28 6

At step kof the algorithm, ηHerm(R;B,iω) and its derivative need to be computed

at ωk. For instance, one can rely on one of the two approaches (cvx or eigopt)in

Section 3.1.1 for the computation of ηHerm(R;B,iωk), and employ a finite difference

formula to approximate its derivative, which would require an additional computation

of ηHerm(R;B,iω) at an ωclose to ωk.

3.2 An interpolation-based subspace framework for large-scale problems

In this section, we consider the Hermitian restricted stability radius for large DH

systems. The characterization in Theorem 3 is in terms of the matrix-valued functions

H0(iω) and H1(iω), which are of small size provided that Bhas few columns. The

large-scale nature of this characterization is hidden in the matrix-valued function

T(iω) =(J −R)Q −iωI, because the computation of H0(iω) and H1(iω) requires

the solution of the linear system T(iωk)Z =B.

3.2.1 Derivation of the reduced problems

To cope with the large-scale setting, we benefit from structure-preserving two-sided

projections similar to those described in Section 2.2. In particular, for a given matrix

Vkwith orthonormal columns, we define Wkas in (2.9), Jk,R

k,Q

kas in (2.12), and

Bkas in (2.13), respectively. Again, Vk,Wkrefer to the right, left subspaces spanned

by the columns of Vk,W

However, we no longer have a tool such as Theorem 1 to establish interpolation

results, because there is no apparent transfer function, as there is indeed no apparent

linear PH system that can be tied to the eigenvalue optimization characterization. The

following simple observation turns out to be very useful. The subsequent arguments

make use of the matrix-valued function

Tk(λ) := (Jk−Rk)Qk−λI (3.4)

for λ∈Cassociated with the reduced problems.

Adv Comput Math (2020) 46:6

Page 16 of 28

Lemma 1 Consider a DH model (1.1), and a reduced model ˙xk=(Jk−Rk)Qkxk

with coefficients as in (2.12)defined in terms of a given Vkand Wkas in (2.9).

Letting T(λ)and Tk(λ) be as in (2.6)and (3.4), respectively, we then have Tk(λ) =

kT(λ)V

kfor all λ∈C.

Proof From the definitions of Jk,R

k,Q

kwe obtain

Tk(λ) =(WH

k(J −R)Wk)V H

kQVk−λI

=WH

k(J −R)QVk−λWH

kVk

=WH

k((J −R)Q −λI)Vk=WH

kT(λ)V

where we have employed (2.10) in the second equality.

In the following, we develop a subspace framework in which the Hermite inter-

polation properties between the original problem defined in terms of T(λ) and the

reduced problem defined in terms of Tk(λ) hold. For this, we first show an auxiliary

interpolation result between BHQT (λ)−1Band its reduced counterpart.

Lemma 2 Consider a DH model (1.1), and a reduced model ˙xk=(Jk−Rk)Qkxk

with coefficients as in (2.12)defined in terms of a given Vkand Wkas in (2.9).

Furthermore, let T(λ)and Tk(λ) be as in (2.6)and (3.4), respectively. For a given



λ∈Csuch that T(



λ) and Tk(

λ) are invertible, the following assertions hold:

(i) If Ran(T (

λ)−1B) ⊆Vk,thenBHQT (

λ)−1B=BH

kQkTk(

λ)−1Bk.

(ii) Additionally, if Ran(T (

λ)−2B) ⊆Vkand the orthonormal basis Vkfor Vkis of

the form Vk=

Vk

Vk, where the columns of 

Vkform an orthonormal basis

for Ran(T (

λ)−1B),then BHQT (

λ)−2B=BH

kQkTk(

λ)−2Bk.

Proof (i) If Ran(T (

λ)−1B) ⊆Vk,then

BHQT (

λ)−1B=BHQVkVH

kT(



λ)−1B=BHWkVH

kQVkVH

kT(



λ)−1B

=BH

kQkVH

kT(



λ)−1B,

(3.5)

where the first equality is due to the fact that VkVH

kis the orthogonal projector

onto Vk, and the second follows, since WkVH

kis a projector onto Ran(QVk).

We complete the proof by showing that VH

kT(



λ)−1B=Tk(

λ)−1Bk.Tothis

end, let Z:= T(



λ)−1B,andZkbe such that VkZk=Z. (There exists a unique

Zkwith this property, because Ran(Z) ⊆Vk.) Then T(



λ)Z =Bimplies that



λ)VkZk=B, and thus WH

kT(



λ)VkZk=WH

kB=Bk. Hence, by Lemma 1,

we see that Zk=Tk(

λ)−1Bk, which in turn implies

kT(



λ)−1B=VH

kZ=VH

k(VkZk)=Tk(

λ)−1Bk. (3.6)

(ii) Following the steps at the beginning of the proof of part (i), in particular from

a line of reasoning similar to that in (3.5), we have

BHQT (

λ)−2B=BH

kQkVH

kT(



λ)−2B.

Adv Comput Math (2020) 46:6 Page 17 of 28 6

It remains to show that VH

kT(



λ)−2B=Tk(

λ)−2Bk. To this end, we exploit

kT(



λ)−2B=(V H

kT(



λ)−1

Vk)(

kT(



λ)−1B). (3.7)

Now define Z:= T(



λ)−1

Vkand Zksuch that VkZk=Z(once again such a

Zkexists uniquely, because Ran(Z) ⊆Vk)sothatT(



λ)VkZk=

Vk. Then, due

to the property WH

kVk=I

k,wehaveWH

kT(



λ)VkZk=WH

k

Vk=I

k,m,where

I

k,m is the matrix consisting of the first mcolumns of the 

k×

kidentity matrix

I

kand 

k:= dim Vk≥m. By Lemma 1, this implies that Zk=Tk(

λ)−1I

k,m,so,

for the term inside the first parenthesis on the right-hand side of (3.7), we have

kT(



λ)−1

Vk=VH

kZ=VH

k(VkZk)=Tk(

λ)−1I

k,m. (3.8)

For the term inside the second parenthesis on the right-hand side of (3.7), we

make use of the following observation:

kT(



λ)−1B=



kT(



λ)−1B=

kT(



λ)−1B

0,(3.9)

where the last equality follows, since the columns of 

Vkform an orthonormal

basis for Ran(T (

λ)−1B). Inserting these in (3.7), we obtain

kTk(

λ)−2Bk=(V H

kT(



λ)−1

Vk)(

kT(



λ)−1B)

=Tk(

λ)−1I

k,m(

kT(



λ)−1B)

=Tk(

λ)−1I

k(V H

kT(



λ)−1B)

=Tk(

λ)−1I

k(Tk(

λ)−1Bk)=Tk(

λ)−2Bk,

where in the second and third equality we exploit (3.8)and(3.9), respectively,

and in the fourth equality we employ the equivalence in (3.6).

Our reduced problems are expressed in terms of the reduced versions of H0(λ),

L(λ),andH1(λ) defined via

Hk,0(λ) := Lk(λ)−1Lk(λ)−H,(3.10)

with Lk(λ) denoting a lower triangular Cholesky factor of



Hk,0(λ) := (BH

kQkTk(λ)−1Bk)H(BH

kQkTk(λ)−1Bk), (3.11)

and Hk,1(λ) := i(

Hk,1(λ) −

Hk,1(λ)H)with



Hk,1(λ) := Lk(λ)−1(BH

kQkTk(λ)−1Bk)HLk(λ)−H. (3.12)

To ensure the uniqueness of L(λ) and Lk(λ), we define them as the Cholesky factors

of 

H0(λ) and 

Hk,0(λ) with real and positive entries along the diagonal. We emphasize

here that the matrix functions H0(λ),L(λ),H1(λ) and their reduced counterparts are

of the same size. However, the reduced matrix-valued functions are defined in terms

of Tk(λ), which is small if the dimensions of Vk,Wkare small, in contrast to T(λ)in

the definitions of H0(λ),L(λ),H1(λ).

Adv Comput Math (2020) 46:6

Page 18 of 28

Our goal is to come up with a reduced counterpart of ηHerm(R;B,iω),that

Hermite-interpolates the full function at prescribed points. For a given Vk,andWkas

in (2.9), we introduce

ηHerm

k(R;B,iω) := sup

t∈R

λmin(Hk,0(iω) +tHk,1(iω)).

Next, we establish that the quantity ηHerm

k(R;B,iω) is independent of the choice

of the basis for the subspace Vk. For this result, we introduce the notation

ηHerm

Vk,Wk(R;B,iω)) to emphasize the particular choices of the bases Vk,W

kused in the

definition of ηHerm

k(R;B,iω). Similarly, we indicate the choices of the bases used in

the definitions of Jk,R

k,Q

k,andBkwith the help of the notations JWk,RWk,QVk,

and BWk.

Lemma 3 Let the columns of Vkand 

Vkform orthonormal bases for the subspace

Vk, and let Wk:= QVk(V H

kQVk)−1,

Wk:= Q

Vk(

kQ

Vk)−1.Then

ηHerm

Vk,Wk(R;B,iω) =ηHerm



Vk,

Wk(R;B,iω)

for all ω∈R.

Proof The proof makes use of the characterization

ηHerm(R;B,iω) =

inf{2|=H,iω∈(J −(R +BBH))Q}2(3.13)

shown in [21, Theorem 4.9].

Now since the columns of Vkand 

Vkform bases for the same space, there exists a

unitary matrix Psuch that 

Vk=VkP. Furthermore,



Wk=Q(VkP )(P HVH

kQVkP)

−1=QVkPPH(V H

kQVk)−1P=WkP.

The assertion then follows from (3.13) and the following set of equivalences:

iω∈(JWk−RWk)QVk−(BWkBH

Wk)QVk⇐⇒

det(WH

k(J −R)Q −BBHQ−iωIVk)=0⇐⇒

det(

k(J −R)Q −BBHQ−iωI

Vk)=0⇐⇒

iω∈(J

Wk−R

Wk)Q

Vk−(B

WkBH



Wk)Q

Vk.

Above, the first and third equivalences are due to the identities in (2.10), whereas the

second equivalence is due to 

Vk=VkPand 

Wk=WkP.

We now prove our main interpolation result.

Theorem 4 (Hermite Interpolation of Hermitian Backward Errors) Consider a DH

model (1.1), and a reduced model ˙xk=(Jk−Rk)Qkxkwith coefficients as in (2.12)

defined in terms of a given Vkand Wkas in (2.9). Furthermore, let T(λ)and Tk(λ)

be as in (2.6)and (3.4), respectively. For a given ω∈R, suppose that the subspace

Vk:= Ran(Vk)is such that

Ran(T (iω)−1B) ⊆Vk,Ran(T (iω)−2B) ⊆Vk. (3.14)

Adv Comput Math (2020) 46:6 Page 19 of 28 6

(i) The quantity ηHerm(R;B,iω) is finite if and only if ηHerm

k(R;B,iω) is finite. If

ηHerm(R;B,iω) is finite, then

ηHerm(R;B,iω) =ηHerm

k(R;B,iω). (3.15)

(ii) Moreover, if ηHerm(R;B,iω) and ηHerm

k(R;B,iω) are differentiable at ω,then

we have

dω ηHerm(R;B,iω)ω=ω

dω ηHerm

k(R;B,iω)ω=ω

. (3.16)

Proof Without loss of generality, we may assume that the matrix Vkis such

that Vk=

Vk

Vk,with the columns of 

Vkforming an orthonormal basis for

Ran(T (iω)−1B). By Lemma 3, it suffices to prove the claims for such a particular

choice of orthonormal basis.

(i) By the definitions of 

H0(λ), 

Hk,0(λ) in (3.1), (3.11), respectively, and by part

(i) of Lemma 2, we have



H0(iω) =(BHQT (iω)−1B)H(BHQT (iω)−1B)

=(BH

kQkTk(iω)−1Bk)H(BH

kQkTk(iω)−1Bk)=

Hk,0(iω).

This also implies that L(iω) =Lk(iω) due to the uniqueness of the

Cholesky factors of 

H0(iω),

Hk,0(iω),soH0(iω) =L(iω)−1L(iω)−H=

Lk(iω)−1Lk(iω)−H=Hk,0(iω). Furthermore, recalling the definitions of



H1(λ), H1(λ) in part 4 of Theorem 3, the definitions of 

Hk,1(λ), Hk,1(λ) in

(3.12) and the line above (3.12), we have



H1(iω) =L(iω)−1(BHQT (iω)−1B)HL(iω)−H

=Lk(iω)−1(BH

kQkTk(iω)−1Bk)HLk(iω)−H=

Hk,1(iω)

and H1(iω) =Hk,1(iω). From the latter equality and Theorem 3, we deduce

that ηHerm(R;B,iω) is finite if and only if ηHerm

k(R;B,iω) is finite. Addition-

ally, if ηHerm(R;B,iω) is finite, then

ηHerm(R;B,iω) =maxt∈Rλmin(H0(iω) +tH1(iω))

=maxt∈Rλmin(Hk,0(iω) +tHk,1(iω)) =ηHerm

k(R;B,iω),

completing the proof of (3.15).

(ii) Now let us suppose that ηHerm(R;B,iω) and ηHerm

k(R;B,iω) are differentiable

at ω. To prove the interpolation property in the derivatives, we benefit from ana-

lytical expressions. It follows from the standard perturbation theory of simple

eigenvalues (e.g., see [20]) that

dω ηHerm(R;B,iω)ω=ω

=vHd

dω H0(iω) +

tH1(iω)ω=ωv,

dω ηHerm

k(R;B,iω)ω=ω

=vHd

dω Hk,0(iω) +

tHk,1(iω)ω=ωv,

(3.17)

Adv Comput Math (2020) 46:6

Page 20 of 28

where

tis such that



t∈arg max

tλmin(H0(iω) +tH1(iω)) =arg max

tλmin(Hk,0(iω) +tHk,1(iω))

and vis a unit eigenvector corresponding to λmin(H0(iω) +

tH1(iω)) =λmin(

Hk,0(iω) +

tHk,1(iω)). Thus, it suffices to prove

dωH0(iω)ω=ω

dωHk,0(iω)ω=ω

dωH1(iω)ω=ω

dωHk,1(iω)ω=ω

We first establish the interpolation properties between the derivatives of H0(iω)

and Hk,0(iω) at ω=ω. To this end, observe that parts (i) and (ii) of Lemma 2

imply that

dω 

H0(iω)ω=ω

=i(BHQT (iω)−1B)H(BHQT (iω)−2B)

−i(BHQT (iω)−2B)H(BHQT (iω)−1B)

=i(BH

kQkTk(iω)−1Bk)H(BH

kQkTk(iω)−2Bk)

−i(BH

kQkTk(iω)−2Bk)H(BH

kQkTk(iω)−1Bk)=d

dω 

Hk,0(iω)ω=ω

(3.18)

Next, for a given ω, consider the Cholesky factorizations 

H0(iω) =

L(iω)L(iω)Hand 

Hk,0(iω) =Lk(iω)Lk(iω)H. By differentiating these two

equations and exploiting (3.18), we have ]selectfont

L(iω) d

dωL(iω)H+d

dωL(iω)L(iω)Hω=ω

Lk(iω) d

dωLk(iω)H+d

dωLk(iω)Lk(iω)Hω=ω

L(iω) d

dωLk(iω)H+d

dωLk(iω)L(iω)Hω=ω

wherewehaveusedL(iω) =Lk(iω), established in part (i), in the sec-

ond equality. Thus, both d

dωL(iω)|ω=ωand d

dωLk(iω)|ω=ωare lower triangular

solutions of the matrix equation

dω 

H0(iω)ω=ω

dω 

Hk,0(iω)ω=ω

=L(iω)XH+XL(iω)H,

which has a unique lower triangular solution with real entries along the

diagonal, so d

dωL(iω)|ω=ω=d

dωLk(iω)|ω=ω. Furthermore, it follows from

the definitions of H0(iω) and Hk,0(iω) in part 3 of Theorem 3 and (3.10)

that L(iω)H0(iω)L(iω)H=I=Lk(iω)Hk,0(iω)Lk(iω)H, which we dif-

ferentiate at ω=ω, and exploit L(iω) =Lk(iω),d

dωL(iω)|ω=ω=

dωLk(iω)|ω=ω,H0(iω) =Hk,0(iω) to deduce that the equality d

dωH0(iω)|ω=ω

dωHk,0(iω)|ω=ωholds.

Finally, by exploiting L(iω) 

H1(iω)L(iω)H=(BHQT (iω)−1B)H,which

follows from the definition of H1(iω), Hk,1(iω) in part 4 of Theorem 3, we

Adv Comput Math (2020) 46:6 Page 21 of 28 6

obtain

dω L(iω) 

H1(iω)L(iω)Hω=ω

=−i(BHQT (iω)−2B)H

=−i(BH

kQkTk(iω)−2Bk)H

dω Lk(iω) 

Hk,1(iω)Lk(iω)Hω=ω

where we exploit L(iω) =Lk(iω),d

dωL(iω)|ω=ω=d

dωLk(iω)|ω=ω,

H1(iω) =



Hk,1(iω) to deduce d

dω 

H1(iω)|ω=ω=d

dω 

Hk,1(iω)|ω=ω. It immediately follows

that d

dωH1(iω)|ω=ω=d

dωHk,1(iω)|ω=ω, and the proof of (3.16) is complete.

Remark 1 The function ηHerm(R;B,iω) is differentiable at ωwhenever

1. ηHerm(R;B,iω) is finite, equivalently H1(iω) is indefinite,

2. the global minimum of λmin(H0(iω) +tH1(iω)) is attained at a unique t,and

3. λmin(H0(iω) +

tH1(iω)) is simple, where 

t:= arg maxt∈Rλmin(H0(iω) +

tH1(iω)).

These conditions guarantee also the differentiability of ηHerm

k(R;B,iω) at ωpro-

vided that the subspace inclusion in (3.14) holds. This latter differentiability property

is due to H0(iω) =Hk,0(iω) and H1(iω) =Hk,1(iω) from part (i) of Theorem 4.

Additionally, when the concave function g(t) := λmin(H0(iω) +tH1(iω)) attains

its maximum, the maximizer is nearly always unique. The function g(t) is the min-

imum of mreal analytic functions [19,25], each corresponding to an eigenvalue of

H0(iω)+tH1(iω).Ifg(t) does not have a unique maximizer, then at least one of these

real analytic functions must be constant and equal to ηHerm(R;B,iω) =maxtg(t).

Thus, a simple sufficient condition that ensures the uniqueness of the maximizer is

that H1(iω) has full rank, in which case all eigenvalues of H0(iω) +tH1(iω) blow

up, either to ∞or −∞,ast→∞implying that each of the real analytic functions

is non-constant.

3.2.2 The subspace framework for

Herm(

;

)

The Hermite interpolation result of Theorem 4 immediately suggests the subspace

framework in Algorithm 3 for the approximation of rHerm(R;B). This resembles

the structure-preserving subspace framework to estimate the non-Hermitian stability

radius r(R;B,C), especially in the way the subspaces Vk,Wkare built. At every iter-

ation, a reduced problem is solved in line 5 as discussed in Section 3.1, specifically

by employing Algorithm 2. Letting ωbe a global minimizer of the reduced problem,

the subspaces are expanded so that the original function ηHerm(R;B,iω) is Hermite

interpolated by its reduced counterpart at ω=ω.

Assuming that the sequence {ωk}converges to a minimizer ω∗of the function

ηHerm(R;B,iω) such that

Adv Comput Math (2020) 46:6

Page 22 of 28

1. H1(iω∗)is indefinite with full rank, and

2. λmin(H0(iω∗)+t∗H1(iω∗)) is simple, with t∗:= arg maxt∈Rλmin(H0(iω∗)+

tH1(iω∗)),

the analysis in [3] shows that the sequence {ωk}converges at a superlinear rate.

As discussed in Remark 1, the two conditions above ensure the differentiability of

ηHerm(R;B,iω) and ηHerm

k(R;B,iω) for all large kin a neighborhood of ω∗.This

differentiability property is essential for the convergence analysis in [3].

3.3 Numerical experiments

In this section, we present numerical results for the estimation of the Hermitian sta-

bility radius. Most of the examples are large-scale, and, hence, involve the application

of the proposed subspace framework (Algorithm 3). However, recall that Algorithm 3

uses Algorithm 2 for solving reduced subproblems. For Algorithm 3, we use the same

stopping criteria as in Section 2.3, but now fk:= minω∈RηHerm

k(R;B,iω),andwe

set kmax =80 and ε=10−6. The larger value for εis due to the fact that we compute

the derivatives of ηHerm

k(R;B,iω) with respect to ωby means of forward differences,

which yield approximate values accurate up to about half of the double precision.

We form the initial subspaces nearly as described in Section 2.3 for the non-

Hermitian stability radius. The only difference is that, in part (iii), the points

z1,...,z

ρare permuted into zj1,...,z

jρso that ηHerm(R;B,izj1)≤ ··· ≤

ηHerm(R;B,izjρ). For the synthetic examples in Section 3.3.1, the parameter val-

ues l=−200, u=0, ρ=κ=5 are used, and z1,...,z

ρare set equal to the

eigenvalues of (J −R)Q closest to iω1,...,iωρ. As for the disk brake example in

Section 3.3.2,weusel=−2·105,u=−1.7 ·105,ρ=κ=15, as well as zj=iωj

for j=1,...,ρ.

Adv Comput Math (2020) 46:6 Page 23 of 28 6

Fig. 2 A DH system of order 20 with random system matrices. Eigenvalues of (J −R)Q are marked with

red asterisks, whereas the black regions represent the eigenvalues of all matrices of the form (J −(R +

BBH))Q for 100000 randomly chosen real symmetric matrices with 2=rHerm(R;B). The red

circle marks 1.9794i, where 1.9794 is the computed global minimizer of ηHerm(R;B,iω) over ω∈R.

3.3.1 Synthetic examples

We first present a numerical result for a small dense random example, where

J,R,Q ∈R20×20 and B∈R20×2. Application of Algorithm 2 to this example yields

rHerm(R;B) =0.0501, and the point that is first reached on the imaginary axis under

the smallest perturbation is 1.9794i. Figure 2illustrates that there exist real symmet-

ric matrices with 2=0.0501 such that 1.9794i is contained (nearly) in the

spectrum of (J −(R +BBH))Q.

Next we experiment with larger dense random matrices, specifically with random

A∈Rn×n,B∈Rn×2. The results of Algorithm 3 are reported in Tables 4and 5.For

n=1000, as shown in Table 4, the direct application of Algorithm 2 and the subspace

framework (Algorithm 3) return exactly the same values for rHerm(R;B) up to the

prescribed tolerances, yet the subspace framework already needs less computing time

than Algorithm 2. Listed in this table is also the unstructured distance to instability

((J −R)Q) defined in Section 2.3.1. It appears that ((J −R)Q) overestimates

rHerm(R;B) considerably.

Table 4 Performance of Algorithm 3 to estimate rHerm(R;B) on dense random DH systems of order

1000. The results from Algorithm 2, the non-Hermitian stability radius r(R;B,BH)and the unstructured

distance to instability ((J −R)Q) are also included. The fifth column contains the number of subspace

iterations, the sixth the run-times (in seconds)

rHerm(R;B) r(R;B,BH) ((J −R)Q) # iter. Run-time

#Alg.2&3 Alg.3Alg.2 Alg.3

1 0.012922 0.011263 2.04 7 2090.0 463.5

2 0.010110 0.010104 0.36 1 1544.1 91.8

3 0.014067 0.012826 3.63 5 2840.2 335.6

4 0.009605 0.007189 0.94 11 2926.5 739.6

Adv Comput Math (2020) 46:6

Page 24 of 28

Table 5 Performance of Algorithm 3 to estimate rHerm(R;B) on dense random DH systems of order

n=2000 and n=4000. The fifth column contains the number of subspace iterations, and the sixth the

run-times (in seconds)

rHerm(R;B) r(R;B,BH)# iter. Run-time

n# Alg. 3 Alg. 3 Alg. 3

2000 1 0.010217 0.005231 6 408.6

2000 2 0.003835 0.003754 13 811.8

2000 3 0.012456 0.010876 5 369.2

2000 4 0.010096 0.010031 3 221.8

4000 1 0.008982 0.008705 1 213.1

4000 2 0.008376 0.006803 3 343.2

4000 3 0.010400 0.008646 2 267.8

4000 4 0.002068 0.002055 11 828.9

For larger systems, Algorithm 2 becomes computationally too expensive, so we

do not report the results of Algorithm 2. For the same reason, we also do not report

((J −R)Q) for larger systems. Rather, we report only the results of Algorithm 3

for n=2000 and n=4000 in Table 5, as well as the non-Hermitian stability radii.

We remark that in these experiments the number of subspace iterations to reach the

prescribed accuracy is usually small and seems independent of n.

All of these examples involve optimization of highly nonconvex functions.

Figure 3depicts ηHerm(R;B,iω) as a function of ωwith the solid curve for the

first example in Table 4with n=1000. The same figure also depicts the reduced

function ηHerm

k(R;B,iω) with the dashed curve for the same example at termination

after 7 subspace iterations. Even though the reduced function ηHerm

k(R;B,iω) uses

projected matrices onto 48 dimensional subspaces, it captures the original function

ηHerm(R;B,iω) remarkably well near the global minimizer ω∗=−70.9581.

Fig. 3 Application of

Algorithm 3 to estimate

rHerm(R;B) on a dense random

DH system of order 1000. The

full and reduced functions

ηHerm(R;B,iω) and

ηHerm

k(R;B,iω) at termination

after 7 subspace iterations are

plotted with the solid and

dashed curves, respectively. The

point (ω∗,η

Herm(R;B,iω∗))

with ω∗denoting the global

minimizer of ηHerm(R;B,iω) is

marked with a circle

-200 -150 -100 -50 0

0.2

0.4

0.6

0.8

1.2

1.4 10-3

Adv Comput Math (2020) 46:6 Page 25 of 28 6

Table 6 Estimated values for the Hermitian stability radii rHerm(R;B) by Algorithm 3 in the second

column for the FE model of a disk brake of order 9338 for a few values of .Thecolumnω∗displays

the computed global minimizer of ηHerm(R;B,iω) by the algorithm, whereas the last columns depicts the

total run-time (in seconds) for Algorithm 3

r

Herm(R;B) r(R;B,BT)ω

∗Run-time

5 0.01038 0.00893 −1.938 ×105227.4

50 0.01001 0.00904 −1.938 ×105222.3

100 0.00988 0.00914 −1.938 ×105223.9

1100 0.00837 0.00834 −1.789 ×105228.5

1200 0.00408 0.00407 −1.742 ×105224.1

3.3.2 FE model of a disk brake

We also applied our implementation of Algorithm 3 to the FE model of the disk

brake described in Section 2.3.2. The estimated values for the Hermitian structured

radius rHerm(R;B) along with the associated minimizer ω∗of ηHerm(R;B,iω),as

well as the run-time (in seconds) are listed in Table 6for a few values of .Only

one subspace iteration is performed in these examples and the subspace dimension at

termination is 96. It is worth comparing the values of rHerm(R;B) in this table with

those for the non-Hermitian stability radius r(R;B,BT), also included in the table.

As expected, the listed values for the Hermitian structured stability radii are larger.

The estimates for the two stability radii differ by significant amounts for smaller

values of , but only slightly at larger .

The results show that the FE model of the disk brake is close to instability for

all . They indicate that further damping via shims would be helpful to avoid the

instabilities due to uncertainties.

4 Concluding remarks

We have proposed subspace frameworks to estimate the stability radii for large-

scale dissipative Hamiltonian systems. At every iteration, we apply DH structure-

preserving projections to small subspaces, then compute the corresponding stability

radii for the resulting reduced system by exploiting its eigenvalue optimization

characterization. We expand the subspaces used in the projections so that Hermite

interpolation properties between the objective eigenvalue function of the full and the

reduced problems are attained at the optimizer of the reduced problem.

The proposed frameworks seem to converge to local optimizers, but a formal proof

of this observation is open at the moment. They do converge at a superlinear rate in

theory under the local convergence assumption. With the purpose of improving the

likelihood of convergence to a global maximizer, we have initiated the subspaces to

attain Hermite interpolation at several points on the imaginary axis between the full

and initial reduced problems.

Adv Comput Math (2020) 46:6

Page 26 of 28

MATLAB implementations of the proposed algorithms and subspace frameworks,

as well as some of the data are made publicly available.2

A related research direction that is currently investigated is the maximization of

the stability radii when J,R,Q depend on parameters, e.g., the FE model of the disk

brake depends on the rotation speed .

Acknowledgments The authors thank two anonymous referees for helpful comments on initial versions

of this manuscript.

Funding Information Open access funding provided by Projekt DEAL.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License,

which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long

as you give appropriate credit to the original author(s) and the source, provide a link to the Creative

Commons licence, and indicate if changes were made. The images or other third party material in this

article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line

to the material. If material is not included in the article’s Creative Commons licence and your intended use

is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission

directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/

by/4.0/.

References

1. Akay, A.: Acoustics of friction. J. Acoust. Soc. Am. 111(4), 1525–1548 (2002)

2. Aliyev, A., Benner, P., Mengi, E., Schwerdtner, P., Voigt, M.: Large-scale computation of L∞-norms

by a greedy subspace method. SIAM J. Matrix Anal. Appl. 38(4), 1496–1516 (2017)

3. Aliyev, A., Benner, P., Mengi, E., Voigt, M.: A subspace framework for H∞-norm minimization.

arXiv:1508.04214v2 [math.NA]. Submitted to SIAM J. Matrix Anal. Appl (2019)

4. Aliyev, N., Mehrmann, V., Mengi, E.: Computation of stability radii for large-scale dissipative hamil-

tonian systems. Preprint 2018-09, Institut f¨ur Mathematik, TU Berlin, D-10623 Berlin, FRG. http://

www.math.tu-berlin.de/menue/forschung/veroeffentlichungen/preprints 2018 (2018)

5. Benner, P., Lowe, R., Voigt, M.: L∞norm computation for large-scale descriptor systems using

structured iterative eigensolvers. Numerical Algebra Control and Optimization 8(1), 119–133 (2018)

6. Benner, P., Mitchell, T.: Faster and more accurate computation of the H∞norm via optimization.

SIAM J. Sci. Comput. 40(5), A3609–A3635 (2018)

7. Benner, P., Voigt, M.: A structured pseudospectral method for H∞-norm computation of large-scale

descriptor systems. Math. Control Signals Syst. 26(2), 303–338 (2014)

8. Boyd, S., Balakrishnan, V.: A regularity result for the singular values of a transfer matrix and a

quadratically convergent algorithm for computing its L∞-norm. Syst. Control Lett. 15(1), 1–7 (1990)

9. Bruinsma, N.A., Steinbuch, M.: A fast algorithm to compute the H∞-norm of a transfer function

matrix. Syst. Control Lett. 14(4), 287–293 (1990)

10. Freitag, M.A., Spence, A., Van Dooren, P.: Calculating the H∞-norm using the implicit determinant

method. Linear Algebra Appl. 35(2), 619–635 (2014)

11. Gallivan, K., Vandendorpe, A., Van Dooren, P.: Model reduction of mIMO systems via tangential

interpolation. SIAM J. Matrix Anal. Appl. 26(2), 328–349 (2005)

12. Gr¨

abner, N., Mehrmann, V., Quraishi, S., Schr¨

oder, C., von Wagner, U.: Numerical methods for para-

metric model reduction in the simulation of disc brake squeal. Z. Angew. Math. Mech. 96, 1388–1405

(2016). https://doi.org/10.1002/zamm.201500217

2http://home.ku.edu.tr/∼emengi/software/DH-stabradii.html

Adv Comput Math (2020) 46:6 Page 27 of 28 6

13. Grant, M., Boyd, S.: Graph implementations for nonsmooth convex programs. In: Blondel, V., Boyd,

S., Kimura, H. (eds.) Recent Advances in Learning and Control, Lecture Notes in Control and

Information Sciences, pp. 95–110. Springer-Verlag Limited (2008)

14. Grant, M., Boyd, S.: CVX: Matlab Software for disciplined convex programming, version 2.1 http://

cvxr.com/cvx (2014)

15. Gugercin, S., Polyuga, R.V., Beattie, C., van der Schaft, A.J.: Interpolation-based model reduction for

port-Hamiltonian systems. In: Proceedings of the 48th IEEE Conference on Decision and Control,

2009 held jointly with the 2009 28th Chinese Control Conference. CDC/CCC 2009, pp. 5362–5369

(2009)

16. Gugercin, S., Polyuga, R.V., Beattie, C., van der Schaft, A.J.: Structure-preserving tangential

interpolation for model reduction of port-Hamiltonian systems. Automatica 48(9), 1963–1974 (2012)

17. Guglielmi, N., G¨urb¨uzbalaban, M., Overton, M.L.: Fast approximation of the H∞-norm via optimiza-

tion over spectral value sets. SIAM J. Matrix Anal. Appl. 34(2), 709–737 (2013)

18. Jacob, B., Zwart, H.: Linear Port-Hamiltonian Systems on Infinite-Dimensional Spaces Operator

Theory: Advances and Applications, vol. 223. Birkh¨

auser/Springer Basel AG, Basel (2012)

19. Kato, T.: Perturbation Theory for Linear Operators. Springer, Berlin (1995)

20. Lancaster, P.: On eigenvalues of matrices dependent on a parameter. Numer. Math. 6, 377–387 (1964)

21. Mehl, C., Mehrmann, V., Sharma, P.: Stability radii for linear Hamiltonian systems with dissipa-

tion under structure-preserving perturbations. SIAM J. Matrix Anal. Appl. 37(4), 1625–1654 (2016).

https://doi.org/10.1137/16M1067330

22. Mengi, E., Yildirim, E.A., Kilic¸, M.: Numerical optimization of eigenvalues of Hermitian matrix

functions. SIAM J. Matrix Anal. Appl. 35(2), 699–724 (2014). https://doi.org/10.1137/130933472

23. Mitchell, T., Overton, M.L.: Hybrid expansion-contraction: a robust scaleable method for approximat-

ing the H∞norm. IMA J. Numer. Anal. 36(3), 985–1014 (2016)

24. Polyuga, R.V., van der Schaft, A.J.: Structure preserving moment matching for port-Hamiltonian

systems: Arnoldi and Lanczos. IEEE Trans. Autom. Cont. 56(6), 1458–1462 (2011)

25. Rellich, F.: Perturbation Theory of Eigenvalue Problems. Notes on Mathematics and its Applications.

Gordon and Breach, New York (1969)

26. van der Schaft, A.J., Jeltsema, D.: Port-Hamiltonian systems theory: an introductory overview. Found

Trends Syst Control 1(2-3), 173–378 (2014)

27. Van Loan, C.F.: How near is a stable matrix to an unstable matrix? Contemporary Math. 47, 465–478

(1985)

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps

and institutional affiliations.

Aﬃliations

Nicat Aliyev1·Volker Mehrmann2·Emre Mengi3

Nicat Aliyev

naliye[email protected]

Volker Mehrmann

[email protected]berlin.de

Emre Mengi

[email protected]

1Institute of Mathematics and Mechanics, Azerbaijan National Academy of Sciences, B.Vahabzade 9,

AZ1141, Baku, Azerbaijan

2Institut fur Mathematik, Technische Universit¨

at Berlin,

MA 4-5. Str. des 17 Juni 136, 0623, Berlin, Germany

3Department of Mathematics, Koc¸ University, Rumeli Feneri Yolu 34450, Sarıyer, Istanbul, Turkey

Adv Comput Math (2020) 46:6

Page 28 of 28