On adaptive finite element methods for the simulation of two-dimensional photonic crystals [original]

Technische Universität Berlin

Institut für Mathematik

On adaptive finite element methods for the simulation

of two-dimensional photonic crystals

Marine Froidevaux

Preprint 06-2018

Preprint-Reihe des Instituts für Mathematik

Technische Universität Berlin

http://www.math.tu-berlin.de/preprints

Preprint 06-2018 July 2018

On adaptive finite element methods for the simulation of two-dimensional

photonic crystals∗

MARINE FROIDEVAUX‡

Abstract. The first part of this paper is devoted to the modeling of wave propagation

inside a perfect two-dimensional photonic crystal and the spectral analysis of the resulting

eigenvalue problem. In the second part of the paper, we focus on a special case, where

the eigenvalue problem is linear and Hermitian. We introduce a residual-based estimator

approximating the algebraic error, i.e., the error in the computed eigenvector induced

by iterative methods. The estimator for the algebraic error approximates the error in

the same norm as the one used in already existing estimators for the discretization error,

therefore enabling error balancing between both types of errors, and thus improving the

efficiency of the adaptive finite element procedure.

Key words. Photonic crystals, error estimators, adaptive finite element methods

AMS subject classifications. 35P15,78M10,65F15

1. Introduction

Photonic crystals are composite materials, made of periodically arranged components,

whose periodic structure affects the propagation of electromagnetic waves. These waves

may propagate through or be reflected by the crystal depending on their frequencies ωand

on their wave-vector k, but also on parameters such as the electrical permittivity of the

constituent materials, or the geometry of the spatial arrangement of the crystal.

Photonic crystals creating so-called band gaps are of special interest for applications

in engineering. A band gap denotes a range of frequencies at which no wave can propa-

gate in any direction through the crystal. The design of photonic crystals for industrial

applications requires the optimization of many geometrical and material parameters, in

order to obtain large bad-gaps around certain frequencies specified by the application. For

this reason, we are interested in developing accurate mathematical models and numerical

methods which can enable the time-efficient simulation of these physical systems.

The mathematical modeling of a photonic crystal is based on the Maxwell equations. In

the case of a perfect infinite crystal, the Maxwell equation lead to a sequence of parameter-

dependant PDE eigenvalue problems (EVPs), which can be discretized via a conforming

finite element method. One can show that eigenpairs of the discretized problem converge to

the eigenpairs of the PDE EVP as one increases the number of basis functions in the finite

element space, see e.g. [Hac92]. However, having time-efficient computations in mind, the

number of basis functions in the finite element space, as well as the spatial localization,

should be chosen in a way that minimizes the size of the discretized system, while warranty-

ing a certain accuracy. This kind of trade-off problems is typically addressed via adaptive

‡Institut für Mathematik, MA 4-5, Technische Universität Berlin, Straße des 17. Juni

136, 10623 Berlin, Germany

E-mail address:[email protected].

∗Research supported by the Einstein Center ECMath via project OT10 Model Reduction for Nonlinear

Parameter-Dependent Eigenvalue Problems in Photonic Crystals.

2 M. FROIDEVAUX

finite element methods, and error estimators are the central tool enabling this adaptiv-

ity. In [GG12], a reliable and efficient error estimator was developed in order to estimate

the error produced by the finite element method. However, for large-scaled problems it

is computationally too expensive to solve the discrete EVPs exactly via direct methods.

Instead, iterative methods are used, such as e.g. Lanczos’ method, in order to solve the

large-scale discrete problems up to a prescribed tolerance. In this setting, the error arising

from the solution of discrete EVPs via iterative methods, also called the algebraic error,

cannot be ignored as a contribution to the total numerical error. Moreover, as part of the

adaptive process, it may be sufficient to solve the coarsely discrete EVPs with an accuracy

of the magnitude of the discretization error. For these reasons, error estimators are not

only needed to approximate the discretization error, but also to approximate the algebraic

error. Moreover, both types of error estimators should be part of a unified framework,

where all errors are measured in the same norm, so that meaningful comparisons can be

made as part of error balancing procedures. For the Laplace eigenvalue problem, error

balancing between the discretization and the algebraic errors was addressed in [CDM+17]

and a fully-adpative algorithm called AFEMLA algorithm was introduced in [MM11], and

extensively discussed in [Mię10].

The mathematical description and simulation of photonic crystals has already been the

subject of many research papers. Discretization via Nédélec finite elements for three-

dimensional photonic crystals has been considered in [BCG06], while the same prob-

lem was solved in [HLM16] via a finite difference scheme followed by a Newton-type

numerical method. For two-dimensional photonic crystals, a spectral analysis was per-

formed in [GG12] for the special case where the electric permittivity is real and frequency-

independent. The extensive spectral analysis of the non-Hermitian non-linear case can be

found in [Eng10] and [ELT17]. Moreover, a linearization technique for the Drude-Lorentz

model of the electrical permittivity was introduced in [EKE12].

In the three first sections of this paper, we review the modeling of a perfect photonic

crystals based on the Maxwell equations which leads to the operator NLEVP. In the fourth

section, we consider the case where the electric permittivity is a real-valued and frequency-

independent function. We introduce the weak formulation of the resulting Hermitian linear

EVP and analyse its properties. In particular we show that its spectrum consists in at

most countably many real and positive eigenvalues with finite geometric multiplicities.

In the fifth section, we develop a residual-based error estimator which approximate the

algebraic error on the computed eigenvector in the norm defined by the sesquilinear forms

of the weak formulation. Finally, the last section of this paper is devoted to a numerical

experiment, where we simulate a two-dimensional photonic crystal and test the reliability,

as well as the efficiency, of the algebraic error estimator.

2. Preliminaries on the Maxwell equations

2.1. General Form. The Maxwell equations in their general form and in SI units are

stated as follows:







−∂tD(x, t) + ∇×H(x, t) = J(x, t),(Ampère’s circuital law)

∇·D(x, t) = ρ(x, t),(Gauss’ law)

∂tB(x, t) + ∇×E(x, t)=0,(Faraday’s law of induction)

∇·B(x, t)=0,(Gauss’ law for magnetism)

where E, H, D, B and Jare three-dimensional vector fields and ρis a real scalar field,

see e.g. [Jac99]. These fundamental equations come along with the following empirical

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 3

constitutive equations:

D(x, t) = 0E(x, t) + 0χE(x, t)∗E(x, t),

B(x, t) = µ0H(x, t) + µ0χM(x, t)∗H(x, t),

where 0, µ0are strictly positive constants, χE, χMare causal transfer functions, and ∗

denotes a convolution in time, i.e.

χE(x, t)∗E(x, t) =

∞

−∞

χE(x, t −τ)E(x, τ)dτcausality

−∞

χE(x, t −τ)E(x, τ)dτ.

Remark 2.1.In the remainder, we omit writing the explicit dependence of vector field on

space, time and frequency when the dependence is clear from the context.

In the remainder, we will assume that no free current nor free charges are present in

the system, i.e. J≡0and ρ≡0, and that the materials are non magnetizable, i.e.

χM≡0. Under these assumptions, the Maxwell equations simplify to the following system

of equations:

(2.1) −∂t(0E+0χE∗E) + ∇×H= 0, ∂t(µ0H) + ∇×E= 0,

∇·(0E+0χE∗E) = 0,∇·(µ0H)=0.

2.2. The eigenvalue problem. We recall briefly the definition of the Fourier transform.

Given a function f(t)∈L1(R), i.e., f(t)is absolutely integrable on R, its Fourier transform

is defined by

F{f}(ω):=ZR

f(t)e−iωt dt.

Note that when the variable trepresents time with units [s], then ωrepresents the angular

frequency and has units [rad/s]. To shorten the notation, we write formally ˆ

f(ω):=

F{f}(ω). On a subspace of L1(R), namely the space S ⊂ L1(R)of so-called Schwartz

functions, the Fourier transformation satisfies a number of important properties. A rigorous

definition of Scan be found e.g. in [Jan71], but it is sufficient for our purpose to think of

it as the space of infinitely differentiable functions decaying rapidly at infinity. For any

function f∈ S it holds that ˆ

f∈ S and the inverse Fourier transform can be defined by

F−1{ˆ

f}(t) = 1

2πZR

f(ω)e+iωtdω =f(t).

Moreover, given two functions f, g ∈ S, the following properties of the Fourier transform

follow from its definition:

(2.2) F{f∗g}(ω) = ˆ

f(ω)ˆg(ω)and F{f0}(ω) = +iω ˆ

f(ω),

where f0denotes the first-order derivative of f. The Fourier transform is not restricted

to functions in L1(R)and can be extended to the space of tempered distributions, i.e.,

to the dual space of S. For a rigorous definition of these concepts and for the proof of

properties analogous to (2.2) satisfied by the Fourier transform of tempered distributions,

see e.g. [Jan71].

It is physically meaningful to expect that Eand Hstay bounded at all times since the

system contains no energy source. Therefore, we can assume Eand Hto be tempered

4 M. FROIDEVAUX

distributions and we can apply a Fourier transformation in time to the Maxwell equations

(2.1). This yields to the following constrained eigenvalue problem (EVP):

(2.3) 01

(x,ω)∇×

−1

µ0∇× 0!ˆ

E(x, ω)

H(x, ω)= +iω ˆ

E(x, ω)

H(x, ω)

∇·((x, ω)ˆ

E(x, ω)) = 0,

∇·(µ0ˆ

H(x, ω)) = 0,

where (x, ω):=0(1 + ˆχE(x, ω)). The first-order (order of derivation in space) EVP

of dimension 6 in (2.3) can be turned into a second-order EVP of dimension 3 simply

by substituing ˆ

E= + 1

iω

∇ × ˆ

Hin the second line or, similarly, by substituing ˆ

−1

iω

µ0∇× ˆ

Ein the first line. This yields two equivalent formulations of the EVP:

(2.4) ∇×1

(x, ω)∇× ˆ

H(x, ω)=µ0ω2ˆ

H(x, ω)

and

(2.5) ∇×∇× ˆ

E(x, ω) = µ0(x, ω)ω2ˆ

E(x, ω).

Remark that in these two formulations, the constraints ∇ · (ˆ

E)=0and ∇· (µ0ˆ

H)=0

are automatically satisfied since

∇·(∇×f)≡0

for any function ftwice continuously differentiable.

In order to simplify the equations, we restrict our solution space to separable electro-

magnetic fields, i.e. we take the ansatz

(2.6) H(x, t) = H(x)f(t)and E(x, t) = E(x)g(t),

where fand gare two nonzero tempered distributions.

Incorporating the first ansatz into Equation (2.4) and simplifying the terms ˆ

f(ω)ap-

pearing on both sides of the equation yields

(2.7) ∇×1

(x, ω)∇×H(x)=µ0ω2H(x).

Similarly, Equation (2.5) can be rewritten as

(2.8) ∇×∇×E(x) = µ0(x, ω)ω2E(x).

2.2.1. The two-dimensional case. In this subsection, we assume all properties of the system

to be periodic in a two-dimensional (2D) plane and to be constant along the direction

perpendicular to this plane. Without loss of generality, we assume that the plane with

periodic properties is spanned by x1and x2, and that the the direction with constant

properties is along x3. Using the short notation ∂j=∂

∂xjfor j= 1,2,3, the assumption of

constant material properties along x3writes ∂3= 0, and implies ∂3E=∂3H= 0 since no

parameter in Equations (2.7) and (2.8) depends on x3.

Let us introduce some 2D equivalents of the standard three-dimensional (3D) differential

operators, as well as some notation. We consider a 3D vector u= (u1, u2, u3)T, a 2D vector

v= (v1, v2)T, and a scalar function w, and we define

u:=u1

u2, v⊥:=−v2

v1,

grad2Dw:=∂1w

∂2w,div2Dv:=∂1v1+∂2v2,

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 5

curl2Dw:=−(grad2Dw)⊥=∂2w

−∂1w,curl2Dv:=−div2D(v⊥) = ∂1v2−∂2v1,

laplace2Dw:=∂1∂1w+∂2∂2w=−curl2Dcurl2Dw=div2Dgrad2Dw.

It follows that

∇×u=curl2Du3

curl2Du+∂3u⊥

0.

With these tools we can easily show that, in the 2D case, (2.7) becomes



curl2D1

curl2DH1

H2

curl2D1

curl2D(H3)

=µ0ω2H

and (2.8) becomes 

curl2Dcurl2DE1

E2

laplace2D(E3)

=µ0ω2E.

Therefore the equations including H3and E3are decoupled from the rest of the equations.

This decoupling can be best expressed in terms of the transverse electric (TE) and trans-

verse magnetic (TM) modes, which we introduce now. Let us start from (2.3) again and

add the effects of the continuous translational symmetry along x3. We obtain







curl2D(ˆ

H3)

curl2Dˆ

H2

−µ0curl2D(ˆ

E3)

−µ0curl2Dˆ

E2







=iω







ˆ

E2

(ˆ

E3)

ˆ

H2

(ˆ

H3)







where the first and fourth block lines, which define the TE mode (ˆ

1,ˆ

2,0,0,0,ˆ

3)T, are

decoupled from the second and third block lines, which define the TM mode (0,ˆ

1,ˆ

3,0,0)T. Combining the equations describing the TE mode yields

(2.9)

(2.10)







−1

µ0

curl2D

iω

curl2Dˆ

H3=iω ˆ

H3,

curl2Dˆ

H3=iω ˆ

E2,

and combining the equations describing the TM mode yields

(2.11)

(2.12)







curl2D

iω

−µ0

curl2Dˆ

E3=iω ˆ

E3,

−µ0

curl2Dˆ

E3=iω ˆ

H2.

It follows that the components ˆ

E1,ˆ

E2,ˆ

H1,ˆ

H2can be easily computed from the solutions

H3,ˆ

E3of (2.9) and (2.11).

6 M. FROIDEVAUX

Moreover, the constraints in (2.3) are automatically satisfied in the formulation of Equa-

tions (2.9)–(2.12) since

∇·(ˆ

E) = ∇· 1

iω curl2Dˆ

µ0ω2curl2Dcurl2Dˆ

E3!=∇·∇× 1

µ0ω2curl2Dˆ

iω ˆ

H3!= 0

and

∇·(µ0ˆ

H) = ∇· 1

−iω curl2Dˆ

curl2D1

ω2curl2Dˆ

H3!=∇·∇×1

ω2curl2Dˆ

−1

iω ˆ

E3= 0.

Finally, note that with the ansatz (2.6), Equations (2.9) and (2.11) can be reformulated

as follows

−div2D(1

grad2DH3) = µ0ω2H3

−laplace2DE3=µ0ω2E3,

or equivalently (for ∂3=∂3H3=∂3E3≡0)

(2.13)

(2.14)

−∇·(1

∇H3) = µ0ω2H3

−∆E3=µ0ω2E3.

The TE mode can be determined by the solution of (2.13) while the TM mode can be

determined by the solution of (2.14).

2.3. Models for the electric permittivity. It has been mentioned in Section 2.1 that we

only consider nonmagnetic materials, i.e. the magnetic permeability in the whole system is

constant and is equal to that of vacuum, namely µ0. The electric permittivity however is

assumed to differ from one material to the other but to stay constant in space within a given

material. Mathematically this means that, if the crystal is composed of Nmat materials

and we denote by Ωjthe domain filled with the j-th material, the electric permittivity of

the crystal can be defined as follows

(x, ω) =

Nmat

j=1

j(ω)χΩj(x)

where j(ω)are constant functions in space and χΩjis the indicator function of Ωj.

The functions j(ω)need to be approximated by empirical models. In the remainder

of this subsection, we introduced two of the most commonly used models. Note that we

only consider the dependency of the electric permittivity on the frequency and neglect the

influence of other factors, such as e.g. the temperature.

2.3.1. The frequency-independent model. The simplest model for i(ω)is to assume it to

be constant in frequency, i.e.

i(ω) = (i)

r0

where (i)

rdenotes the relative permittivity of the i-th material and 0the electric permit-

tivity of vacuum.

With this model, the EVPs (2.7), (2.8), (2.13), and (2.14) become linear in the eigenvalue

λ:=ω2.

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 7

2.3.2. The Drude-Lorentz model. We follow here the derivation of the Drude-Lorentz model

presented in [Jac99, p.309-312]. We consider an electron bound to a kernel in a crystal

and submitted to an electric field. We model the interaction between the electron and

the kernel with a spring of stiffness kand of damping constant γ. We make the following

approximations:

•The difference between the local and applied electric fields is neglectable, i.e. the

material has low density;

•Magnetic forces are neglectable;

•The amplitudes of electron oscillations are small enough to evaluate the field at a

constant average position ¯x;

•The applied field Eis assumed to vary harmonically in time.

We denote by q=−ethe charge of the electron and by xits position (with reference

point at the equilibrium point of the spring). Newton’s equation of motion writes in this

case

m¨x=−kx −γ˙x+qE

After defining ω0:=pk/m we obtain

(2.15) m(¨x+γ

m˙x+ω2

0x) = −eE.

Under the assumption that Eis time-harmonic, i.e. with the ansatz E(¯x, t) = E(¯x)eiωt

for a certain ωin C, the Fourier transformation of (2.15) yields to

F{x(t)}(ν) = 2πeE(¯x)

ω2

0−ν2+iνγ δ(ω−ν) = C(ω)δ(ω−ν)

where C(ω):= 2πeE(¯x)(m(ω2

0−ω2+iωγ))−1is constant in ν. Therefore x(t)is time

harmonic with (angular) frequency ωas well and satifies

x(t) = −e

(ω2

0−ω2) + iωγ E(¯x, t).

It follows that the dipole moment p:=qx equals

p=e2

ω2

0−ω2+iωγ .

Let us now consider the dipole moment created by multiple electrons having possibly

different binding frequencies. We denote by Nthe number of molecules per unit volume

and by Zthe number of electrons per molecule. We suppose that there are flelectrons per

molecule with binding frequency ωland damping constant γl, and therefore that Plfl=Z.

We call plthe dipole moment created by the flelectrons corresponding to the binding

frequency ωl. Given the relationships



0

= 1 + ˆχE

and

flˆpl=0ˆχEˆ

8 M. FROIDEVAUX

we deduce that



0

= 1 + e2N

m0X

ω2

l−ω2+iωγl

(2.16)

= 1 + e2N

m0X

fl(ω2

l−ω2)

(ω2

l−ω2)2+ω2γ2

l−iωγl

(ω2

l−ω2)2+ω2γ2

l.(2.17)

The damping constant γlare generally small compared to the binding frequencies ωl,

therefore is approximately real for almost all frequencies ω. We notice that the j-th term

in the sum in (2.16) becomes purely imaginary when ω=ωl. Moreover, at this point, the

imaginary part becomes significantly larger in magnitude than at points away from the

resonance.

2.3.3. Physical properties of the electric permittivity. In the previous section, we have seen

that taking into account the damping from the crystal leads to a complex permittivity.

Since the damping is a non-conservative force dissipating kinetic energy into other types

of energy, e.g. heat or sound, a non-real permittivity corresponds to a dissipative system.

Actually, it will be discussed in subection 4.2 that the eigenvalue problems 2.8 and

2.7 are self-adjoint and have their spectrum (i.e. the possible values of ωin the ansatz)

contained in Rif and only if is a real function.

Under the assumption that Evaries harmonically in time, i.e.

E(x, t) = <E(x)eiωt=<E(x)ei(<(ω)+i=(ω))t

=E(x)e−=(ω)t<ei<(ω)t,

the complex part of the frequency describes the decay of the eigenmode over time. In this

setting, in particular with this sign convention in the ansatz, ωcan only be located in the

upper half of the complex plane, since no energy source is present in the crystal.

3. The Floquet transformation & Bloch theorem

3.1. Some fundamentals concepts in crystallography. In this section we introduce

some important concepts in the theory of periodic structures. The reader can refer to the

standard textbook [Kit05] for a more extensive description of these concepts in a physical

context. A detailed presentation of these same concepts in a mathematical context can be

found in [Sim13].

An ideal crystal is formed of two or more components that are arranged in a periodic

manner, i.e. as an infinite repetition of an identical pattern. In this paper we consider

an ideal n-dimensional crystal composed of two different materials, with n=2 or n=3.

Mathematically, one uses the concept of lattice to represent such a periodic structure, and

multiple definitions of this concept can be found in the literature. We choose to use the

following one for its mathematical precision:

Definition 3.1. Let a1, . . . , an∈Rnbe linearly independent. Then the set

G:={g∈Rn|g=

i=1

νiaifor some νi∈Z}.

is called a (Bravais) lattice and the vectors a1, . . . , anare called the primitive lattice vectors.

With this definition, it follows that any periodic structure can be expressed as a lattice

of repeating objects [Sim13]. This statement is illustrated informally in Figure 3.1.

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 9

Figure 3.1. Decomposition of a periodic structure into a tiling of repeti-

tive units located on a lattice.

Any lattice Gis the result of the tiling of Rnby infinitely many identical tiles called unit

cells, and a unit cell is called primitive if it contains exactly one lattice point. Equivalently,

a primitive unit cell can be thought of as a domain in Rnof smallest volume that can tile

Gusing only translations. In the remainder, we use a particular type of primitive unit cell,

namely the Wigner-Seitz cell defined as follows:

Definition 3.2. Let Gbe a lattice in Rnand g∈G. The Wigner-Seitz cell of Gat the

point gis

Ωg:={r∈Rn| kr−gk<kr−˜gkfor all ˜g∈G, ˜g6=g},

where k·kis the standard Euclidean norm in Rn. One may refer simply to the Wigner-

Seitz Ωof Gwithout mentioning any point gin order to denote tacitly the Wigner-Seitz

cell centered around the origin (0∈G).

To any Bravais lattice Gin Rncorresponds a reciprocal lattice G0defined by

G0:={g0∈Rn|for all g∈G, there exists m∈Zsatisfying g0·g= 2πm}.

It can be shown (see e.g. [Kit05]) that the primitive lattice vectors b1, . . . , bnof G0can

always be chosen in a such way that they satisfy the following constraints: For all j, l =

1, . . . , n,

(3.1) bj·al= 2πδjl,

where δjl denotes the Kronecker delta, and a1, . . . , andenote the primitive lattice vectors

of G.

The Wigner-Seitz cell of G0is called the Brillouin zone of G. In addition to the discrete

translational symmetries represented by the lattice vectors, a lattice Gcan be invariant

under other sorts of symmetries, such as e.g. rotational or axial symmetries. In such a

case, the Brillouin zone reduced by all the symmetries of the lattice is called the irreducible

Brillouin zone Kof G(see Figure 3.2).

3.2. Learning from the electron in a crystal. In this susbsection we recall physical

results historically arising from the analysis of an electron in a perfect crystal.

3.2.1. Translational operators. For any ∆x∈Rnwe denote by T∆x:L2(Rn)→L2(Rn)

the translational operator shifting a function to the right by ∆x, i.e.

T∆xv(x) = v(x−∆x)

for all v(x)∈L2(Rn). Its dual operator with respect to the inner product

(v, w)L2(Rn):=Z

v(x)w(x)dxfor all v, w ∈L2(Rn)

10 M. FROIDEVAUX

Irreducible

Brillouin

zone

of G.

Square lattice G with

lattice vectors a1, a2 and

Wigner-Seitz cell Ω.

2π/a1

2π/a2

Reciprocal lattice G’

of G with Brillouin

zone Ω’ of G.

Ω’

Figure 3.2. Illustrations of some fundamental concepts in crystallography.

T∗

∆x:=T−∆x,

since for all v, w, ∈L2(Rn)it holds

(T∗

∆xv, w)=(v, T∆xw) = Z

v(x)w(x−∆x)dx=Z

v(˜x+ ∆x)w(˜x)d˜x= (T−∆xv, w).

Thus

T∆xT∗

∆xv(x) = T∆xv(x+ ∆x) = v(x)

for all v∈L2(Rn)and therefore the translational operators is unitary. It follows in

particular that its spectrum is contained on the unit circle in the complex plane.

3.2.2. Bloch theorem.

Lemma 3.3. Given a n-dimensional lattice Gwith lattice vectors {aj}n

j=1, if a function

ψ∈L2(Rd)is an eigenfunction of all translational operators {Taj}n

j=1, then ψis a Bloch

wave, i.e., it is of the form

ψ(x) = eik·xuk(x)

for some periodic function uk(x)with periodicity vectors {aj}n

j=1 and for some vector kin

the Brillouin zone Ω0.

Proof. If ψ(x)is an eigenfunction of all translational operators {Taj}n

j=1, then for all j=

1, . . . , n there exists a real constant θj∈[−1

2,1

2)such that

ψ(x+aj) = e2πiθjψ(x),

since Tajis unitary. Let us denote by {bj}n

j=1 the lattice vectors of the reciprocal lattice

G0satisfying Equation (3.1), and define

k:=

θjbj.

Note that k∈Ω0per construction. Then the function

uk(x):=e−ik·xψ(x).

Related document tools

Review similarity, sources and trust signals

Plag supports a clearer review of reused or insufficiently cited text. Identific is designed for document-focused trust and verification tasks. They make it easier to notice issues early.

plag.ai

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 11

satisfies

uk(x+aj) = e−ik·xe−ik·ajψ(x+aj) = e−ik·xe−i2πθjei2πθjψ(x) = e−ik·xψ(x) = uk(x)

for all j= 1, . . . , n, which terminates the proof. 

Theorem 3.4 (Bloch theorem).The wavefunction describing the state of electron in a

crystal can be expressed in a basis formed of Bloch waves. Moreover, these Bloch waves are

energy eigenstates.

Proof. Due to the discrete symmetry of the problem, the Hamiltonian and the translation

operators commute with each others. Therefore they can be diagonalized in a common

basis. This basis is thus formed of functions that are energy eigenstates and have the

form of Bloch waves. The expansion of the wavefunction in this basis yields to a linear

combination of Bloch waves, which terminates the proof. 

3.3. Floquet transformation. The Bloch theorem is very well-known in solid state

physics. In the mathematics community, this result is more commonly known as a part

of the Floquet theory. Eventhough the Bloch and the Floquet theories are analogous,

their derivations differs a bit. In this subsection we recall the definition of the Floquet

transformation, as well as some of the main results related to it.

Definition 3.5. For any function u∈L2(Rn), we define the Floquet transform of uwith

respect to the lattice Gas the function

F{u}(k, x):=X

g∈G

u(x−g)e−ik·(x−g).

For the sake of brevity, we use the notation uk(x):=F{u}(x, k). Some important

properties follow directly from this definition:

•ukis G−periodic in xand quasi-G0-periodic in k, i.e.

uk(x+g) = uk(x)for all g∈G,

uk+g0(x) = e−ig0·xuk(x)for all g0∈G0,

•given a G-periodic function (x)such that u ∈L2(Rn), it holds

F{u}(k, x) = (x)uk(x),

•for j= 1, . . . , n,

(3.2) F{∂ju}(k, x)=[∂j+ikj]uk(x),

where ∂jdenotes the partial derivative with respect to the j-th component of x

and kjis the j-th component of k.

The following theorem ensures that the Floquet transformation is invertible when the

codomain is properly defined in terms of the irreducible Brillouin zone Kof the lattice G.

Theorem 3.6 ([Kuc01]).Let us denote by Tnthe n-dimensional torus corresponding to the

Wigner-Seitz cell Ωof Gwith periodic boundary conditions. The Floquet transformation

with respect to G, considered as a mapping

F:L2(Rn)→L2(K, L2(Tn))

u→(k→uk(·)),

is isometric and its inverse is given by

u(x) = F−1{uk}(x) = 1

|K| Z

uk(x)eik·xdk.

12 M. FROIDEVAUX

3.4. Link between Bloch waves and Floquet transformations. Similarly to the

state of an electron in a crystal, the state of a photon in a crystal, represented by the

electromagnetic field, can be expressed as a linear combination of Bloch waves. Indeed, let

udenote either the electric field Eor the magnetic field H. Then

u(x) = F−1{F{u}}(x) = Z

|K|eik·xuk(x)dk =Z

|K|ψk(x)dk,

where ψk(x):=eik·xukxis a Bloch wave.

3.5. The Floquet transformation applied to the Maxwell equations. In the re-

mainder, G⊂Rnrepresents the lattice underlying the spatial distribution of , i.e. (x, ω)

is G-periodic in x. In this subsection, we apply a Floquet transformation with respect to

Gto the Maxwell EVPs for 3D and 2D crystals. In view of the property (3.2) we introduce

the notation ∇k:=∇+ik for any k∈R3and ∇k:= [∂1, ∂2]T+ik for any k∈R2.

3.5.1. 2D photonic crystals. Applying the Floquet transformation to (2.13) and (2.14) for

some kin the irreducible Brillouin zone K ⊂ R2gives

(3.3)

(3.4)

−∇k·(1

∇kH3,k) = µ0ω2H3,k

−∇k·∇kE3,k =µ0ω2E3,k,

respectively, where (3.3) and (3.4) are now two equations on the Wigner-Seitz cell Ω, with

periodic boundary conditions on the functions H3,k(x),E3,k(x)and (x).

3.5.2. 3D photonic crystals. Similarly as in the 2D case, applying the Floquet transforma-

tion to (2.7) and (2.5) for some kin the irreducible Brillouin zone K ⊂ R3gives

∇k×1

(x, ω)∇k×Hk(x)=µ0ω2Hk(x)

∇k×∇k×Ek(x) = µ0(x, ω)ω2Ek(x).

respectively, which are now two equations on the Wigner-Seitz cell Ω, with periodic bound-

ary conditions on the functions Hk(x),Ek(x)and (x).

4. Finite Element Formulation of the EVP

In the remainder we focus our analysis on 2D photonic crystals. Equations (3.3) and (3.4)

are very similar and so are the methods to solve them as well. Therefore, we concentrate

the detailed analysis on (3.4) and keep in my mind that the analysis of (3.3) is analogous.

Moreover, from now on, we consider the easiest empirical model for the electric permittivity,

i.e. we assume that =(x) = Pj=1,2jIΩjis independent of the (angular) frequency ω.

In this case, (3.4) depends only quadratically on ω, so we define the new variable λ:=ω2.

4.1. Weak and discrete formulations. We aim at solving the EVP (3.4) by means of

the finite element method. To do so, we first define the weak formulation of the EVP,

namely: For all k∈ K, find (u, λ)∈V×Csuch that

(4.1) ak(u, v) = λb(u, v)for all v∈V,

where V:=H1

per(Ω) = H1(T2)for T2the 2-dimensional torus corresponding to the Wigner-

Seitz cell Ω, and where a:V×V→Cand b:V×V→Care defined by

ak(u, v):=Z

Ω

∇ku·∇kvdx

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 13

and, respectively,

b(u, v):=Z

Ω

µ0(x)uv dx.

Remark 4.1.We have used the following integration by parts formula for functions in V:

ZΩ−∇·∇kuv dx=ZΩ∇ku·∇vdx−Z∂Ω

(∇ku·ˆn)vds=ZΩ∇ku·∇vdx

where ˆndenotes the the outward unit surface normal to ∂Ωand where R∂Ω(∇ku·ˆn)vds= 0

is to be understood in the sense of traces. This yields

ZΩ−∇k·∇kuv dx=ZΩ∇ku·∇v−ik ·∇kuv dx=ZΩ∇ku·∇kvdx.

With the aim of solving Problem (4.1) numerically, we define a subspace Vh⊂Vof

dimension N < ∞and Nbasis functions {φj}j=1,...,N such that

(4.2) Vh=span{φ1,...φN}.

It follows that any vh∈Vhcan be expressed as a linear combination of the basis functions,

i.e. there exist scalars v1, . . . , vNsuch that vh=

j=1

vjφj. Thus we can define a discretized

problem corresponding to (4.1) as follows: For all k∈ K, find (λh, uh)∈C×Vhsuch that

(4.3) Aku=λhµ0Bu.

where u:= [u1, . . . , uN]T, and Ak, Bare defined by (Ak)j,i :=ak(φi, φj)and (B)j,i :=

b(φi, φj), for i, j = 1, . . . , N.

Before discussing about methods to solve Problem (4.1) and (4.3), let us analyze the

properties of the linear forms and the corresponding matrices defining the EVPs.

4.2. Properties of the EVPs. Let us start this subsection with some comments and

results concerning the considered vector spaces.

4.2.1. Vector spaces considerations. In the Hilbert spaces H:=L2(Ω) and V=H1(Ω) ⊂

Hwe consider the standard norms

kvkH:=ZΩ|v(x)|2dx

for all v∈Hand

k·kV:=k·kH1(Ω) =k∇·k2

H+k·k2

H1/2.

Remark 4.2.For any u∈V, it holds ∇u∈L2(Ω)2. Therefore the notation k∇ukL2(Ω)

is a bit abusive. In fact, for any w, w0∈L2(Ω)2we use the conventional notation

(w, w0)L2(Ω) :=ZΩ

w(x)·w0(x)dx

where ·denotes the standard dot product, namely w(x)·w0(x):=Pi=1,2wi(x)w0

i(x)for

all w(x), w0(x)∈C2. Additionally,

kwk2

L2(Ω) := (w, w)L2(Ω) =ZΩ|w|2dx

where |w|:=Pi=1,2wiwi1/2denotes the standard norm in C2.

14 M. FROIDEVAUX

When (x)is a real-valued function, we can define another inner product in Hthat will

be more handy for the spectral analysis of the problem.

Lemma 4.3. Let 0, ∞be some positive contants and let (x)be a real-valued function

satisfying 0< 0≤(x)≤∞for all x∈Ω. Then the sesqui-linear form bdefines an

inner product in H. Moreover, the norm k·kbderived from this inner product is equivalent

to k·kHin H.

Proof. The form bpossesses the following properties:

•Positivity: b(u, u) = RΩ(x)|u|2dx≥0kuk2

H≥0and b(u, u)=0if and only if

u≡0,

•Conjugate symmetry: b(u, v) = RΩ(x)uv dx=RΩ(x)vu dx=b(u, v)since

(x)∈R,

•Linearity in the first argument: b(αu, v) = αb(u, v)for all α∈C.

Moreover, it holds that ∞kuk2≥b(u, u)≥0kuk2, which shows the equivalence of the

norms. 

In the remainder, we assume (x)to be a real-valued function satisfying 0< 0≤(x)≤

∞for all x∈Ω. Hence, we can define the inclusion map I:V→V∗by means of the

Gelfand triple

V ,→H∼

=H∗,→V∗,

i.e., I=ιH∗→V∗jHιV→Hwhere jH:H→H∗is the Riesz isomorphism with respect to

the inner product b,ιV→His the embedding of Vinto H, and ιH∗→V∗= (ιV→H)∗. An

important property of this particular Gelfand triple is that the imbedding of Vinto His

compact, see e.g. [Eng10] and references therein. It follows that the embedding of H∗into

V∗is also compact [Hac92, Ch. 6].

4.2.2. Definition of the operator equation. In this subsection, we show that Problem (4.1)

can be reformulated as an equivalent problem in the dual space V∗. e start with a lemma

proving Young’s inequality in the specific case of complex vectors.

Lemma 4.4 (Young’s inequality).Consider a,b∈C2. Then, for every δ > 0we have an

estimate of the form

|a·b+a·b| ≤ 1

δ|a|2+δ|b|2.

Proof. For any two vectors c,d∈C2, the following estimates hold:

0≤ |c+d|2= (c+d)·(c+d) = |c|2+|d|2+c·d+c·d,

0≤ |c−d|2= (c−d)·(c−d) = |c|2+|d|2−c·d−c·d.

As a result, we have |c·d+c·d|≤|c|2+|d|2. The claim then follows by setting c=a/√δ,

and d=b√δ.

By means of the Young inequality, we can now show the boundedness of ak:

Lemma 4.5. For all k∈ K, the sesquilinear form ak(·,·)in (4.1) is bounded (or, equiva-

lently, continuous) in V, with constant CM= 2 max(1,|k|2). Moreover, akis Hermitian.

Proof. By means of the Cauchy-Schwarz inequality we obtain for all u, v ∈Vthat

|ak(u, v)| ≤ ZΩ|∇ku(x)||∇kv(x)|dx

≤ZΩ|∇ku(x)|2dx1/2ZΩ|∇kv(x)|2dx1/2

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 15

Lemma 4.4 with δ= 1 yields

ZΩ|∇ku(x)|2dx≤ZΩ|∇u(x)|2+|ku(x)|2+iku∇u(x) + iku∇u(x)

2dx

≤ k∇uk2

H+|k|2kuk2

H+kkuk2

H+k∇uk2

H.

It follows that

|ak(u, v)| ≤ CMkukVkvkV

where CM= 2 max(1,|k|2). Moreover, it is straightforward to see that ak(u, v) = ak(v, u),

which implies the Hermiticity of ak.

Hence, we can define an operator Akmapping from the space Vto its dual V∗by the

equality

hAku, ·iV∗,V :=ak(u, ·)

for all u∈V. The EVP (4.1) can then be reformulated as an equation in V∗:

(4.4) Akuk=λIuk.

4.2.3. Properties of the resolvent. Instead of the eigenvalue problem (4.4) it is more con-

venient for the analysis to look at the equivalent shifted problem:

(Ak+σI)uk= (λ+σ)Iuk

where σ > 0is a fixed parameter. In this section we investigate the properties of the

resolvant (Ak+σI)−1of Ak. To this end, we demonstrate that the form ak,σ :=ak+σb

corresponding to the operator Ak,σ :=Ak+σIis an elliptic sesquilinear form.

Lemma 4.6. For any σ > 0, the sesquilinear form ak,σ is elliptic with constant

Cm(σ) = (min{1, σ0},if k= 0,

min nσε0

2|k|2+σε0,σ

2ε0o,if k6= 0.

Proof. If k= 0, then a0,σ(u, u) = RΩ|∇u|2+σ(x)|u|2dx≥min{1, σ0}kuk2

V. If k6= 0, we

observe that ak,σ(u, u) = RΩ|(∇+ik)u|2+σ(x)|u|2dx∈R+. Moreover,

ak,σ(u, u)≥k∇uk2

H+ (σ0+|k|2)kuk2

H−ZΩiku ·∇u+iku ·∇udx

and thanks to Lemma 4.4 we obtain :

ZΩiku ·∇u+iku ·∇udx≤|k|2kuk2

δ+δk∇uk2

for any δ > 0. Therefore,

ak,σ(u, u)≥(1 −δ)k∇uk2

H+σε0+|k|2−|k|2

δkuk2

and choosing δ=|k|2

|k|2+σ

2ε0

<1yields

ak,σ(u, u) = (1 −|k|2

|k|2+σ

2ε0

)

| {z }

k∇uk2

H+σ

2ε0

|{z}

kuk2

H≥Cm(σ)kuk2

with Cm(σ) = min nσε0

2|k|2+σε0,σ

2ε0o.

16 M. FROIDEVAUX

Corollary 4.7. For any σ > 0, the operator Ak,σ = (Ak+σI)is invertible and the

inverse is bounded with constant Cm(σ)−1, i.e. Ak,σ−1∈ L(V∗, V )satisfies

kAk,σ−1kV∗→V:= sup

w∈V∗

kwkV∗=1

kAk,σ−1wkV≤1

Cm(σ).

Proof. This follows directly from the Lax-Milgram lemma, see e.g. [Hac92, Lem. 6.98]. 

Corollary 4.8. For any σ > 0, the sesquilinear form ak,σ defines an inner product in V

and the corresponding norm kvkak+σb:=pak(v, v) + σb(v, v)is equivalent to the standard

norm kvkV.

Proof. The positivity of ak,σ follows from Lemma 4.6, and the sesquilinearity as well as

the conjugate symmetry are easily verified. Moreover Lemma 4.5 yields

kvkak+σb≤qCMkvk2

V+σb(v, v)≤qCMkvk2

V+σ∞kvk2

H≤pCM+σ∞kvkV

and by Lemma 4.6

kvkak+σb≥pCm(σ)kvkV

for all v∈V.

Many important properties of the spectrum of Akcan now be proven by applying the

Riesz-Schauder therory on the resolvant Rσ:V→V,R=A−1

k,σI. These are gathered in

the following lemma.

Lemma 4.9. The spectrum of Akconsists in at most countably many real eigenvalues

with finite geometrical multiplicities, which can only accumulate at infinity. Moreover, all

eigenvalues are nonnegative.

Proof. From the Hermiticity of akfollows directly that Akis Hemitian. Therefore all

eigenvalues λof Akmust be real since

λ=hAku, uiV∗,V

hIu, uiV∗,V

=hAku, uiV∗,V

hIu, uiV∗,V

=λ,

where udenotes the eigenvector corresponding to λ. The rest of the proof follows from

[Hac92, Satz 6.108]. In particular,it is shown there that for all λ∈Cone of the following

alternatives holds:

(1) (A−λI)−1∈ L(V∗, V ),

(2) λis an eigenvalue.

In Corollary 4.7 it was shown that (A−λI)−1exists and is bounded for all λ=−σ, where

σ > 0. Hence, all eigenvalues must be nonnegative. 

Corollary 4.10. Let Akand Bbe the matrices in the discretized problem (4.3). Then,

Akis Hermitian positive semi-definite and B, as well as (Ak+B), are Hermitian positive

definite.

Proof. It is straightforward to see that (Ak)j,i =ak(φi, φj) = ak(φj, φi) = (Ak)i,j and

similarly for B. Moreover, by Lemma 4.6 it holds for all σ > 0and for all v= [v1, . . . , vN]

with vh=PN

ivi

v∗Akv=a(vh, vh)≥Cm(σ)kvhkV−σb(vh, vh)→0as σ→0,

where the superscript ∗denotes the conjugate transpose of a vector or a matrix Since this

inequality holds for σ > 0arbitrarily small, to follows that v∗Akv≥0, and thus Akis

positive semi-definite. The rest of the proof follows easily. 

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 17

5. Error estimators

We wish to approximate the solution of the discretize problem (4.3) by using an iterative

method. In the remainder, udenotes the exact eigenfunction of (4.1), uhdenotes the

exact eigenfunction of the discretized problem (4.3), and euh=PN

i=1 euiφidenotes the

approximated eigenfunction computed by msteps of an iterative method (e.g. Lanczos

method or LOBPCG [GVL96, Kny01]).

In order to be able to evaluate the quality of the approximate solution euhwe need

to construct an estimator for the error ku−euhkV. We start by applying the triangular

inequality

ku−euhkV=ku−uh+uh−euhkV≤ ku−uhkV+kuh−euhkV

and observe that the error can be separated into the discretization error ku−uhkVand

the algebraic error kuh−euhkV.

An estimator for the error introduced by the discretization of the linear EVP (3.4) to

obtain (4.3) has been developed in [GG12]. In that paper it is shown that an error estimator

ηdis, which can be computed a posteriori, gives a reliable and efficient approxmation of the

error ku−euhkak+σb where σ= 1. Since the norms k·kVand k·kak+σb are equivalent

(see Lemma 4.8), ηdis is also a reliable and efficient estimator of the error measured in the

standard norm in V. However, the equivalence constants Cm(σ)and CMmay be much

bigger or much smaller than 1, depending on the values of σand k.

In order to enable the error balancing between the discretization and the algebraic errors,

we need an error estimator which can be compared to ηdis in a meaningful manner, i.e., an

estimator which measures the algebraic error in the norm defined by the sesquilinear form

ak,σ on V. This is the subject of the next subsection.

5.1. Algebraic error. We aim at finding a bound for the algebraic errors kuh−˜uhkV

and kuh−˜uhkak+σb, which depend on the convergence of an iterative numerical method.

The convergence analysis of such methods is typically based on the concepts of angle

between vectors. While the definition of the angle between real vectors with respect to

a scalar product is clear, there exist multiple nonequivalent concepts of angle between

complex vectors. Each one of these angles has different interesting properties. We start

by introducing the Euclidean angle between complex vectors, which satisfies the law of

cosines:

Definition 5.1. Let vand wbe two vectors in a complex Hilbert space Vand let (·,·)I

as well as k·kIdenote an inner product in Vand its corresponding norm, respectively.

The (real-valued) Euclidean angle between vand wwith respect to the I-inner-product is

defined as the quantity ∠(E)

I(v, w)∈[0, π]satisfying

∠(E)

I(v, w) = acos <(v, w)I

kvkIkwkI

The Euclidean angle indeed satisfies the law of cosines since for all v, w ∈Vit holds

kv−wk2

I= (v−w, v −w)I=kvk2

I+kwk2

I−(v, w)I−(w, v)I

=kvk2

I+kwk2

I−2<(v, w)I=kvk2

I+kwk2

I−2 cos ∠(E)

I(v, w)kvkIkwkI.

The Euclidean angle inherits other properties from the real case, e.g. the angle between

vectors pointing in opposite directions is πsince ∠(E)

I(v, −v) = acos(−1) = π. However,

the Euclidean angle is not invariant under scaling since, for α∈C,∠(E)

I(αv, w)is generally

not equal to ∠(E)

I(v, w).

18 M. FROIDEVAUX

On the other hand, one can define another angle between complex vectors which is

invariant under scaling:

Definition 5.2. Let v, w ∈V,(·,·)Iand k·kIbe as in Definition 5.1. The Hermitian

angle between vand wwith respect to the I-inner-product is defined as the quantity

∠(H)

I(v, w)∈[0,π

2]satisfying:

∠(H)

I(v, w) = acos |(v, w)I|

kvkIkwkI

The link between both angles can be expressed as follows:

(5.1)

inf

vθ∈span{v}

wθ∈span{w}

∠(E)

I(vθ, wθ) = inf

θ1∈[0,2π)

θ2∈[0,2π)

∠(E)

I(eiθ1v, eiθ2w) = acos sup

θ∈[0,2π)

<eiθ(v, w)I

kvkIkwkI

= acos 1

kvkIkwkI<(w, v)I

|(v, w)I|(v, w)I

= acos |(v, w)I|

kvkIkwkI

=∠(H)

I(v, w).

Therefore, the Hermitian angle between vand wcan be thought of as the Euclidean angle

between the subspaces spanned by vand w. Moreover, the Hermitian and Euclidean angles

between two vectors v, w are equal if the vectors have the same orientation in V, i.e. if

they satisfy the condition

(v, w)I∈R+.

In the remainder, we consider the algebraic error kuh−˜uhkIwhere the inner prod-

uct (·,·)Irepresents either the standard inner product (·,·)Vin Vor the inner product

(·,·)ak+σblinked to the weak form (4.1). Moreover, both vectors uhand ˜uhare supposed

to be normalized and have the same orientation in V(with respect to the considered inner

product), i.e.

kuhkI=k˜uhkI= 1 and (uh,˜uh)I∈R+.

Under these conditions, the law of cosines combined with the following trigonometric iden-

tity

cos2(φ

2) = eiφ

2+e−iφ

2!2

=eiφ +e−iφ + 2

4=1

2(cos(φ) + 1),for all φ∈[0,2π),

yields

kuh−˜uhk2

I= 2 −2 cos ∠(E)

I(uh,˜uh)=41−1

2cos ∠(E)

I(v, w)+1

= 41−cos2∠(E)

I(v, w)

2= 4 sin2∠(E)

I(v, w)

2= 4 sin2∠(H)

I(v, w)

Therefore, we can bound the algebraic error by a term proportionnal to the sinus of the

Hermitian angle:

(5.2) kuh−˜uhkI= 2 sin ∠(H)

I(uh,˜uh)

2≤√2 sin ∠(H)

I(uh,˜uh).

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 19

For small angles, the following expression based on Taylor expansions gives a more accurate

estimation of the error:

kuh−˜uhkI= 2 sin ∠(H)

I(uh,˜uh)

2= 2

∞

j=0

(−1)j

(2j+ 1)!∠(H)

I(uh,˜uh)

22j+1

∞

j=0

(−1)j

(2j+ 1)!∠(H)

I(uh,˜uh)2j+1 +

∞

j=1

(−1)j

(2j+ 1)!(1 −1

22j)∠(H)

I(uh,˜uh)2j+1

= sin ∠(H)

I(uh,˜uh) + O(∠(H)

I(uh,˜uh)3)≈sin ∠(H)

I(uh,˜uh).

5.1.1. Equivalence of continuous and discrete norms. In this subsection we establish equiv-

alences between the norms of functions uh, vhin the finite dimensional space Vhdefined

in (4.2) and the corresponding vectors u,v∈Cnexpressed in the basis formed by the

functions {φi}N

i=1. In the remainder of this paper, we assume σ > 0.

For any uh, vh∈Vhit holds

(uh, vh)V=

i=1

j=1

vjZΩ∇φi·∇φj+φiφjdx ui=v∗ΦVu

and

kuhk2

V=uhΦVu=:kuk2

ΦV,

where (ΦV)i,j := (φj, φi)V. Moreover,

(uh, vh)b=

i=1

j=1

vjZΩ

(x)φiφjdx ui=v∗Bu,

kuhk2

b=u∗Bu=kuk2

B

and

(uh, vh)2

ak+σb=

i=1

j=1

vjZΩ∇kφi·∇kφj+σφiφjdx ui=v∗(Ak+σB)u,

kuhk2

ak+σb=u∗(Ak+σB)u=kuk2

(Ak+σB).

Similarly, a discrete norm that is equivalent to a norm in V∗can be found.

Lemma 5.3. For any r∈V∗such that r=Irhfor some rh=

j=1

rjφj∈Vh, it holds

krkV∗=kBrk(Ak+σB)−1=krkB(Ak+σB)−1B,

where

kfkV∗:= sup

v∈V,

kvkak+σb=1

|f(v)|

for any f∈V∗, and where r= [r1, . . . , rN].

20 M. FROIDEVAUX

Proof. For any v∈Vwe denote by πh(v)the orthogonal projection of vonto Vhwith

orthogonality in the inner product (·,·)b. Then it holds

krkV∗= sup

v∈V,

kvk(ak+σb)=1

|(rh, v)b|= sup

v∈V,

kvk(ak+σb)=1

|(rh, v −πh(v))b+ (rh, πh(v))b|

= sup

v∈V,

kvk(ak+σb)=1

|(rh, πh(v))b|= sup

vh∈Vh,

kvhk(ak+σb)=1

|(rh, vh)b|

= sup

v∈CN,

kvk(Ak+σB)=1

|r∗Bv|= sup

v∈CN,

kvk(Ak+σB)=1

|r∗B(Ak+σB)−1(Ak+σB)v|.

For the sake of brevity, we introduce the notation Ak,σ :=Ak+σBand b:=A−1

k,σBr.

Using the Cauchy-Schwarz inequality [Bre11, Ch. 5.1] yields

krkV∗= sup

v∈CN,

kvkAk,σ =1

|b∗Ak,σv| ≤ sup

v∈CN,

kvkAk,σ =1

kbkAk,σ kvkAk,σ = sup

v∈CN,

kvkAk,σ =1

kbkAk,σ =kbkAk,σ .

The upper bound is attained by setting v=b

kbkAk,σ

which satisfies the constraint kvkAk,σ =

1. Therefore

krkV∗=kbkAk,σ =kA−1

k,σBrkAk,σ =kBrkA−1

k,σ =krkB(Ak+σB)−1B.

We show that the trigonometric functions of angles between vectors in Vhcan also be

expressed in terms of the corresponding discrete vectors in CN.

Theorem 5.4. Let vh, wh∈Vh⊂Vwith vh=

i=1

viφi,wh=

i=1

wiφiand v=

[v1, . . . , vN]T,w= [w1, . . . , wN]T. Then

cos ∠(H)

V(vh, wh) = cos ∠(H)

ΦV(v,w)

and

cos ∠(H)

ak+σb(vh, wh) = cos ∠(H)

Ak+σB(v,w)

where

∠(H)

M(v,w):= acos |w∗Mv|

√v∗Mv√w∗Mw∈h0,π

for any matrix M.

Proof.

cos ∠(H)

V(vh, wh) = |(vh, wh)V|

kvhkVkwhkV

=|P

i,j

wi(φj, φi)Vvj|

i,j

vi(φj, φi)VvjrP

i,j

wi(φj, φi)Vwj

=|w∗ΦVv|

√v∗ΦVv√w∗ΦVw

and similarly for the inner product ak+σbon V.

Corollary 5.5. Let vh, wh,v,was in Theorem 5.4. Then

sin ∠(H)

V(vh, wh) = sin ∠(H)

ΦV(v,w)

and

sin ∠(H)

Ak+σB(vh, wh) = sin ∠(H)

ak+σb(v,w).

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 21

5.1.2. Sine theorems. In order to obtain a useful bound on the algebraic error, we need a

computable bound on sin ∠(H)

ΦV(v,w)or sin ∠(H)

Ak+σB(v,w). First, we recall a result for the

(non-generalized) linear eigenvalue problem.

Theorem 5.6 (The sine theorem).Let M∈CN×Nbe a Hermitian positive definite matrix

defining an inner product (v1,v2)Msuch that

(v1,v2)M:=v1∗Mv2

for all v1,v1∈CN. Let A∈CN×Nbe a Hermitian matrix with respect to this inner

product. We denote by {λi}N

i=1 the eigenvalues of Aand by {u(i)}N

i=1 the corresponding

normalized eigenvectors of Ain the norm k·kMinduced by (·,·)M. Given any vector

v∈CNand any scalar e

λ∈C, we order the eigenpairs (λi,u(i))such that |λ1−e

λ| ≤

|λ2−e

λ| ≤ . . . ≤ |λN−e

λ|and we define the residual r:=Ae

v−e

λe

vwhere e

v:=αvis the

scaled vector which is normalized and lies in the same directions as u(1), i.e.

vkM= 1,(e

v,u(1))M∈R+.

Then it holds

sin ∠(H)

M(e

v,u(1))≤krkM

δ.

where δ:=|λ2−e

λ|.

Proof. We follow the proof of Theorem 11.7.1 in [Par98] which proves the result for real

symmetric matrices and we extend it to the case of complex Hermitian matrices.

First recall that the eigenvectors of Aform an orthogonal basis of Cnsince Ais Hermitian

(see the spectral theorem in [Par98, Ch. 1.4]). Therefore we can expand

j=1

v,u(j))Mu(j),

which implies

r= (A−e

λ)(e

v,u(1))Mu(1) + (A−e

λ)

j=2

v,u(j))Mu(j)

= (λ1−e

λ)(e

v,u(1))Mu(1) +

j=2

(λj−e

λ)(e

v,u(j))Mu(j).

Since the eigenvectors are orthogonal to each other in the inner product (·,·)M, we can

apply the Pythagoreus theorem, which yields:

krk2

M=|λ1−e

λ|2|(e

v,u(1))M|2+

j=2 |λj−e

λ|2|(e

v,u(j))M|2

≥ |λ1−e

λ|2cos2∠(H)

M(e

v,u(1)) + |λ2−e

λ|2

j=2 |(e

v,u(j))M|2

=|λ1−e

λ|2cos2∠(H)

M(e

v,u(1)) + |λ2−e

λ|2(ke

vk2−|(e

v,u(1))M|2)

=|λ1−e

λ|2cos2∠(H)

M(e

v,u(1)) + |λ2−e

λ|2sin2∠(H)

M(e

v,u(1))

≥δ2sin2∠(H)

M(e

v,u(1)).

22 M. FROIDEVAUX

In order to apply Theorem 5.6 to Problem (4.3) we need to reformulate the latter as a

standard (non-generalized) EVP. Since the matrix Bin (4.3) is Hermitian positive definite,

it is also invertible and the EVP can be equivalently written as

B−1

Aku=λu.

Corollary 5.7. Let Ak, Bbe the Hermitian matrices defining the EVP (4.3). We de-

note by {λi}N

i=1 the eigenvalues of the pencil (Ak, B)and by {u(i)}N

i=1 the correspond-

ing normalized eigenvectors of (Ak, B)in the norm k·kAk+σBwhere σ > 0. Given

any vector v∈CNand any scalar e

λ∈C, we order the eigenpairs (λi,u(i))such that

|λ1−e

λ|≤|λ2−e

λ| ≤ . . . ≤ |λN−e

λ|and we define the residual r:=Ake

v−e

λBe

vwhere

v:=αvis the scaled vector which is normalized and lies in the same directions as u(1),

i.e.

vkAk+σB= 1,(e

v,u(1))Ak+σB∈R+.

Then it holds

(5.3) sin ∠(H)

Ak+σB(e

v,u(1))≤kB−1

rkAk+σB

δ=krkB−1

(Ak+σB)B−1



δ.

where δ:=|λ2−e

λ|.

Proof. Observe that the matrix B−1

Akis Hermitian in the inner product defined by Ak+

σBsince for any v1,v2∈CN

(B−1

Akv1,v2)Ak+σB=v∗

1AkB−1

(Ak+σB)v2=v∗

1AkB−1

Akv2+σv∗

1Akv2

=v∗

1(Ak+σB)B−1

Akv2= (v1, B−1

Akv2)(Ak+σB).

Therefore Theorem 5.6 can be applied with A:=B−1

Ak,M:= (Ak+σB).

Alternatively, one can follow [HL06, Section 2] and solve the EVP (4.3) with a shift-

invert method. Recall that for any real positive shift σ > 0, the matrix (Ak+σB)is

invertible, see Corollary 4.7. Therefore (4.3) is equivalent to

(5.4) (Ak+σB)−1Bu=νu,

where the eigenvalues of both problems are related via the equation

ν=1

λ+σ.

The matrix (Ak+σB)−1Bis Hermitian in the (Ak+σB)-inner product since for all

v1,v2∈CNit holds

(Ak+σB)−1Bv1,v2Ak+σB=v∗

1B(Ak+σB)−1(Ak+σB)v2=v∗

1Bv2

=v1,(Ak+σB)−1Bv2Ak+σB

We can now apply Theorem 5.6 to (5.4).

Corollary 5.8. Let Ak, Bbe the Hermitian matrices defining the EVP (4.3) and σ > 0

be a real and positive scalar shift. We denote by {λi}N

i=1 the eigenvalues of the Hermitian

pencil (Ak, B)and by {u(i)}N

i=1 the corresponding normalized eigenvectors of (Ak, B).

Given any vector v∈CNand any scalar e

λ∈C, we order the eigenpairs (λi,u(i))such that



λ1+σ−1

λ+σ≤

λ2+σ−1

λ+σ≤. . . ≤

λN+σ−1

λ+σ,

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 23

and we define the residual r:=Ake

v−e

λBe

v, where e

v:=αvis normalized and lies in the

same directions as u(1), i.e.

vkAk+σB= 1,(e

v,u(1))Ak+σB∈R+.

Then it holds

(5.5) sin ∠(H)

Ak+σB(u(1),e

v)≤|λ2+σ|

|λ2−e

λ|krk(Ak+σB)−1.

Proof. Applying Theorem 5.6 with A:= (Ak+σB)−1B,M=Ak+σBand eν:= (e

λ+σ)−1

yields

sin ∠(H)

Ak+σB(u(1),e

v)≤k(Ak+σB)−1Be

v−eνe

vk(Ak+σB)

|eν−ν2|

=|eν|

|eν−ν2|keν−1Be

v−(Ak+σB)e

vk(Ak+σB)−1

=|λ2+σ|

|λ2−e

λ|ke

λBe

v−Ake

vk(Ak+σB)−1.

Remark 5.9.Observe that the norm of the residual rin the upper bound (5.5) can be

rewritten as

krk(Ak+σB)−1=ke

λBe

u−Ake

uk(Ak+σB)−1=ke

λe

u−B−1

Ake

ukB(Ak+σB)−1B.

Therefore, according to Lemma 5.3, krk(Ak+σB)−1can be seen as the norm of

(5.6) res(e

λ, e

u):=I N

i=1 B−1

riφi!

in the dual space V∗, i.e.

krk(Ak+σB)−1=kres(e

λ, e

u)kV∗

Note that (5.6) indeed defines a residual, since for all vh=PN

i=1 viφi∈Vhit holds that

hres(e

λ, e

u), vhiV∗,V =ZΩ

(x)

i=1 B−1

riφivhdx=

i,j=1 B−1

rivjZΩ

(x)φiφjdx

i,j=1 B−1

rivj(B)j,i =v∗BB−1

r=v∗r=v∗(Ake

u−e

λBe

=ak(euh, vh)−e

λb(euh, vh).

where euh:=PN

j=1 e

ujφj

Remark 5.10.Since (Ak+σB)is Hermitian positive definite we can factorize it with help

of the Choleski decomposition (Ak+σB) = LL∗where Lis invertible. Therefore the term

appearing on the right side of (5.3) can be rewritten as

kB−1

Ake

v−e

λe

vkAk+σB=kL∗B−1

AkL−∗ e

w−e

λe

with L∗e

v=e

w. Observe that the matrix L∗B−1

AkL−∗ is Hermitian since

(L∗B−1

AkL−∗)∗=L−1AkB−1

L=L−1AkB−1

(Ak+σB)L−∗L−1L

=L−1(Ak+σB)B−1

AkL−∗ =L∗B−1

AkL−∗.

24 M. FROIDEVAUX

Therefore, if e

wis the approximation of the eigenvector corresponding to the smallest

eigenvalue of L∗B−1

AkL−∗ returned after msteps of an inverse Lanczos or LOBPCG

method, then kL∗B−1

AkL−∗ e

w−e

λe

wkis directly available as a by-product of each iteration,

see e.g. [GVL96, Thm. 9.1.2] and [Kny01].

Similarly the term on the right hand side of (5.5) can be rewritten as

λBe

v−Ake

vk(Ak+σB)−1=kL−1AkL−∗ e

w−e

λL−1BL−∗ e

and can be obtained as a by-product of a LOBPCG iteration on the Hermitian pencil

(L−1AkL−∗, L−1BL−∗).

Remark 5.11.In Lanczos’ method and LOBPCG, the eigenvalue converges faster than the

eigenvector, therefore it is reasonable to approximate λ≈e

λand λ2≈e

λ2to compute the

error bound.

All results of this section can be summarized in the following theorem:

Theorem 5.12. Let (Ak, B)be the matrix pencil defining the discretized problem (4.3)

and let ube the eigenvector corresponding to the smallest eigenvalue of (Ak, B). We write

Lthe triangular matrix satisfying the Choleski decompostion LL∗=Ak+σBfor some

shift σ > 0. We denote by e

w=:L∗e

uthe approximation of L∗uobtained after miterations

of the LOBPCG method applied to the pencil (L−1AkL−∗, L−1BL−∗), and assume uand

uto be scaled in such a way that

kukAk+σB=ke

ukAk+σB= 1,(e

u,u)Ak+σB∈R+.

Let e

λand e

λ2be the approximations of the eigenvalue λclosest to σand, respectively, of the

eigenvalue λ2second closest t σ, of Problem (4.3) obtained from the LOBPCG iterations.

Then the distance between the functions uh=PN

j=1 ujφj∈Vhand euh=PN

j=1 e

ujφj∈Vh

can be approximately bounded by

kuh−˜uhkV≤pCm(σ)−1kuh−˜uhkak+σb=pCm(σ)−1sin ∠(H)

ak+σb(uh,euh) + O(∠(H)

ak+σb(uh,euh)3)

≈pCm(σ)−1sin ∠(H)

ak+σb(uh,euh)≤pCm(σ)−1|λ2+σ|

|λ2−e

λ|kAke

u−e

λBe

uk(Ak+σB)−1

≈pCm(σ)−1|e

λ2+σ|

λ2−e

λ|kL−1AkL−∗ e

w−e

λL−1BL−∗ e

when ∠(H)

ak+σb(uh,euh)is a small angle.

5.2. Numerical experiments. In this section, we compare the error estimator

(5.7) ηalg :=|e

λ2+σ|

λ2−e

λ|kL−1AkL−∗ e

w−e

λL−1BL−∗ e

to the error kweig −e

wk ≈ kuh−˜uhkak+σbwhere weig is the approximated eigenvector

returned by the MATLAB function eig and is therefore assumed here to be a very good

approximation of the exact eigenvector w=L∗u. We observe that ηalg is composed of a

factor ρ=|e

λ2+σ|/|e

λ2−e

λ|depending only on the approximations of the eigenvalues, and

of a factor corresponding to the norm or the residual. In this section, we denote with a

superscript (m)a quantity computed with the approximations e

λ, e

λ2,e

wobtained after m

iterations.

We have tested the error estimator (5.7) on a problem of the form (4.3) with (x, ω)as in

subsection 2.3.1, Ωbeing a square cell of length lΩ= 500·10−9m, Ω1⊂Ωbeing a centered

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 25

disc of radius 0.2lΩ,1(ω)≡0= 8.854187817·10−6µF/m, 2≡8.90. We have set σ= 1,

and Figures 5.1 and 5.2 correspond to k= [0.6888,6.2832]T·106m−1. The problem was

discretized via the C++ library Concepts [FL02] with quadratic Lagrange basis functions.

The dimension of the resulting matrices is 896×896. We solved the Hermitian generalized

EVP with the LOBPCG algorithm.

On Figure 5.1 we see that the error estimator does not give reliable estimation of the

error during the first dozen of iterations, i.e. when the iterative algorithm is far from con-

vergence. However, after approximately 25 iterations, the error estimator starts mimicking

the decrease of the error. We call this behavior asymptotic, as opposed to the transient

behavior happening in the first steps of the iterative process. We observe that there exists

a constant shift Mbetween the graph of the error estimator and the one of the error in

the asymptotic regime, which corresponds to approximately M=13 iterations.

0 5 10 15 20 25 30 35 40 45 50 55 60 65 70

0.2

0.4

0.6

0.8

1.2

1.4

m [nb of iterations]

||weig −e

w||

sin ∠(weig,e

√2 sin ∠(weig,e

ηalg =ρ||res||

Figure 5.1. Comparison of the error with the error estimator

On Figure 5.2 we see that the error, the error estimator, and the norm of the residual

indeed decrease at the same speed, and that the beginning of the asymptotic regime cor-

relates with the stabilization of the factor ρat m≈. This correlation is made clearer in

Figure 5.3, where the rates of change of the factors ρand kreskare plotted. Indeed, we see

that the biggest mat which r(m)

ρ:=|ρ(m)−ρ(m−1)|

ρ(m)is bigger than r(m)

res :=|kres(m)k−kres(m−1)k|

kres(m)k

is also m≈20. In the range m > 20,r(m)

ρbecomes significantly smaller than r(m)

res .

Note that r(m)

ρand r(m)

res can be cheaply evaluated at each iteration step as a complement

of the error estimator ηalg. Comparing both rates of change gives a way to decide whether

the error estimator is reliable or not. The efficiency of the error estimator, however, depends

on the shift M, and it is for the moment not clear how to estimate this quantity.

We have made a qualitatively similar observations for other values of kon the boundary

of the irreducible Brillouin zone. However, the shift Mobserved between the error and the

error estimator in the asymptotic domain does not stay constant with kbut varies within

the range 6−20 iterations.

26 M. FROIDEVAUX

0 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150

10−20

10−15

10−10

10−5

100

105

m [nb of iterations]

||weig −e

w||

||res||

ρ=|e

λ2+σ|/|e

λ2+e

λ1|

ηalg =ρ||res||

Figure 5.2. Evolution of the factors composing the error estimator

0 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150

10−20

10−15

10−10

10−5

100

105

m [nb of iterations]

|ρ(m)−ρ(m−1)|/|ρ(m)|

abs(||res(m)||−||res(m−1)||)/||res(m)||

Figure 5.3. Rates of change of the factors composing the error estimator

6. Conclusion

We have developed a residual-based estimator which approximates the algebraic error

resulting from the solution of the discrete generalized linear EVP by an iterative method.

The error estimator approximates the error in the norm arising from the sesquilinear forms

defining the EVP, and can therefore by combined in an error balancing procedure with

already existing estimators of the discretization error. We have shown how to heuristically

determine whether enough iterations have been computed for the error estimator to be

reliable.

ON AFEM FOR THE SIMULATION OF 2D PHOTONIC CRYSTALS 27

References

[BCG06] D. Boffi, M. Conforti, and L. Gastaldi. Modified edge finite elements for photonic crystals.

Numer. Math., 105(2):249–266, 2006.

[Bre11] H. Brezis. Functional Analysis, Sobolev Spaces and Partial Differential Equations. Springer,

New York, 2011.

[CDM+17] E. Cancès, G. Dusson, Y. Maday, B. Stamm, and M. Vohralí k. Guaranteed and robust a

posteriori bounds for Laplace eigenvalues and eigenvectors: conforming approximations. SIAM

J. Numer. Anal., 55(5):2228–2254, 2017.

[EKE12] C. Effenberger, D. Kressner, and C. Engström. Linearization techniques for band structure

calculations in absorbing photonic crystals. Int. J. Numer. Meth. Engn, 89(2):180–191, 2012.

[ELT17] Christian Engström, Heinz Langer, and Christiane Tretter. Rational eigenvalue problems and

applications to photonic crystals. J. Math. Anal. Appl., 445(1):240–279, 2017.

[Eng10] C. Engström. On the spectrum of a holomorphic operator-valued function with applications to

absorptive photonic crystals. Math. Models Methods Appl. Sci., 20(8):1319–1341, 2010.

[FL02] P. Frauenfelder and C. Lage. Concepts - An Object-Oriented Software Package for Partial

Differential Equations. ESAIM: M2AN, 36(5):937–951, 2002.

[GG12] S. Giani and I. G. Graham. Adaptive finite element methods for computing band gaps in

photonic crystals. Numer. Math., 121(1):31–64, 2012.

[GVL96] G. H. Golub and C. F. Van Loan. Matrix computations. Johns Hopkins Studies in the Mathe-

matical Sciences. Johns Hopkins University Press, Baltimore, MD, third edition, 1996.

[Hac92] W. Hackbusch. Elliptic Differential Equations: Theory and Numerical Treatment. Springer,

Berlin, 1992.

[HL06] U. L. Hetmaniuk and R. B. Lehoucq. Uniform accuracy of eigenpairs from a shift-invert Lanczos

method. SIAM J. Matrix Anal. Appl., 28(4):927–948, 2006.

[HLM16] T.-M. Huang, W.-W. Lin, and V. Mehrmann. A Newton-type method with non-equivalence

deflation for nonlinear eigenvalue problems arising in photonic crystal modeling. SIAM J. Sci.

Comput., 38(2):B191–B218, 2016.

[Jac99] J. D. Jackson. Classical electrodynamics. John Wiley & Sons, Inc., New York-London-Sydney,

third edition, 1999.

[Jan71] L. Jantscher. Distributionen. Walter de Gruyter & Co, Berlin, 1971.

[Kit05] C. Kittel. Introduction to Solid State Physics. John Wiley & Sons, Inc., Hoboken, NJ, eighth

edition, 2005.

[Kny01] A. V. Knyazev. Toward the optimal preconditioned eigensolver: locally optimal block pre-

conditioned conjugate gradient method. SIAM J. Sci. Comput., 23(2):517–541, 2001. Copper

Mountain Conference (2000).

[Kuc01] P. Kuchment. The Mathematics of Photonic Crystals, chapter 7, pages 207–272. SIAM,

Philadelphia, PA, 2001.

[Mię10] A. Międlar. Inexact Adaptive Finite Element Methods for Elliptic PDE Eigenvalue Problems.

PhD thesis, TU Berlin, 2010.

[MM11] V. Mehrmann and V. Miedlar. Adaptive computation of smallest eigenvalues of self-adjoint

elliptic partial differential equations. Numer. Linear Algebra Appl., 18(3):387–409, 2011.

[Par98] B. N. Parlett. The symmetric eigenvalue problem. SIAM, Philadelphia, PA, 1998.

[Sim13] S. H. Simon. The Oxford Solid State Basics. Oxford University Press, New York, 2013.