Document [original]

Mariano Nicolas Cruz Bournazou, Tilman Barz, D. B. Nickel, Diana

Carolina Lopez Cárdenas, Florian Glauche, Andreas Knepper, Peter

Neubauer

Online optimal experimental re-design in robotic

parallel fed-batch cultivation facilities

Open Access via institutional repository of Technische Universität Berlin

Document type

Journal article | Accepted version

(i. e. final author-created version that incorporates referee comments and is the version accepted for

publication; also known as: Author’s Accepted Manuscript (AAM), Final Draft, Postprint)

This version is available at

https://doi.org/10.14279/depositonce-15644

Citation details

This is the peer reviewed version of the following article:

Cruz Bournazou, M. N., Barz, T., Nickel, D. B., Lopez Cárdenas, D. C., Glauche, F., Knepper, A., & Neubauer,

P. (2016). Online optimal experimental re-design in robotic parallel fed-batch cultivation facilities. In

Biotechnology and Bioengineering (Vol. 114, Issue 3, pp. 610–619). Wiley. https://doi.org/10.1002/bit.26192,

which has been published in final form at https://doi.org/10.1002/bit.26192. This article may be used for

non-commercial purposes in accordance with Wiley Terms and Conditions for Use of Self-Archived Versions.

Terms of use

This work is protected by copyright and/or related rights. You are free to use this work in any way permitted by

the copyright and related rights legislation that applies to your usage. For other uses, you must obtain

permission from the rights-holder(s).

For Peer Review

1

Online Optimal Experimental Re-Design in Robotic Parallel Fed-

Batch Cultivation Facilities for Validation of Macro-Kinetic

Growth Models using E. coli as an Example

M.N. Cruz Bournazou, T.Barz, D. Nickel, D. Lopez Cárdenas, F. Glauche, A. Knepper, P.

Neubauer

corresponding author:

Dr. M. Nicolas Cruz Bournazou

Laboratory of Bioprocess Engineering

Department of Biotechnology

Technische Universität Berlin (TU Berlin)

Ackerstr. 76, ACK24

D-13355 Berlin, Germany

Tel. direct: +49 30 314 72527

Secretary: +49 30 314 72573

Fax: +49 30 314 27577

www.bioprocess.tu-berlin.de

Page 1 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

2

Abstract

We present an integrated framework for the online optimal experimental re-design of

parallel nonlinear dynamic processes that aims to precisely estimate the parameter set of

macro kinetic growth models with minimal experimental effort. This provides a systematic

solution for rapid validation of a specific model to new strains, mutants, or products. In

biosciences, this is especially important as model identification is a long and laborious

process which is continuing to limit the use of mathematical modeling in this field.

The strength of this approach is demonstrated by fitting a macro-kinetic differential equation

model for Escherichia coli fed-batch processes after six hours of cultivation. The system

includes two fully-automated liquid handling robots; one containing eight mini-bioreactors

and another used for automated at-line analyses, which allows for the immediate use of the

available data in the modeling environment. As a result, the experiment can be continually

re-designed while the cultivations are running using the information generated by periodical

parameter estimations.

The advantages of an online re-computation of the optimal experiment are proven by a

fifty-fold lower average variation coefficient on the parameter estimates compared to the

sequential method (4.83% instead of 235.86%). The success obtained in such a complex

system is a further step towards a more efficient computer aided bioprocess development.

Page 2 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

3

Introduction

Today, mathematical modeling in biotechnology is not hampered by computer capacity or

by insufficient understanding of the microbial systems, but rather by the lack of fast, cheap,

and informative experiments – especially in the case of dynamic processes such as

cultivations describing industrial conditions. When properly validated, simplified

mathematical models are capable of describing the dynamics of complex systems (Cruz

Bournazou et al. 2012; Kokkalis et al. 2014; Legmann et al. 2009; Rosen et al. 2006; Zavrel

et al. 2008). In literature, we can find models that successfully describe biological systems

at all scales (Brunk et al. 2012; Neubauer and Junne 2010), including models for

optimization of dynamic processes (Cruz Bournazou et al. 2013; Delgado San Martín et al.

2014; Hidalgo-Bastida et al. 2012; Lu et al. 2013; Oddone et al. 2007; Venkata Mohan et al.

2005) and processes with mixtures of microbial populations with low quality data (Junker

and Wang 2006; Su et al. 2005). However, contrary to other industries where processes are

typically predicted using existing equations and literature values (thermodynamics, chemical

reactions, combustion, etc.), model building in biotechnology is inevitably coupled with

iterative and laborious experimental validation for each process-product pair (Ataman and

Hatzimanikatis 2015; Medema et al. 2012). Regardless of its level of complexity, a model

needs to be constantly re-fitted against observations in order to adapt its outputs to

variations in the environment or in the microorganism itself (e.g mutations). Therefore,

creating an experimental facility that can generate the data required to fit a specific model

with high cost-time efficiency is certainly relevant for research and industry. This implies

that the experimental set-ups should: i) run fast and cheap, ii) emulate process relevant

conditions, and iii) consider the evolution of the system over time.

Page 3 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

4

These requirements have pushed the development in automation, miniaturization, and

parallelization of experimental facilities developing automatic Liquid Handling Stations

(LHS) for High Throughput Screening (HTS) and High Throughput Bioprocess Design

(HTBD), (Long et al. 2014; Puskeiler et al. 2005; Schäpper et al. 2009) as well as sensor

and Process Analytical Technologies (PAT) for a better and less invasive insight into

biological systems (Neubauer et al. 2013). The rapid growth of these technologies brings

additional challenges, such as the correct handling and analysis of very large data sets

(Kozak 2014; Shockley 2015) and the design of extremely complex experiments. The

design of multiple parallel experiments can be increased by using methods for design of

experiments (Glauche et al. 2016; Lutz et al. 1996) and neural networks (Glassey et al.

1994) among others. Nevertheless, we need to go beyond “endpoint” or “steady state”

experiments to uncover the dynamics of the process and predict its evolution over time. This

means that the design of the dynamic experiment cannot be done exclusively with statistical

tools. This time dependent behavior is better described by nonlinear differential equations.

Methods for Optimal Experimental Design (OED), also called Model Based Design of

Experiments (MBDoE), which design the experiments based on nonlinear differential

equation systems, have been developed (Franceschini and Macchietto 2007a; Körkel et al.

2004) and specific applications in biotechnology exist (Baltes et al. 1994; Takors et al.

1997). In order to maximize the precision of the parameter estimates, the experimenter

needs to find the optimal combination of input variables (e.g. feeding strategy) and sampling

setup (type and time) within the feasible region. In addition, each sampling consists of a

number of “actions” (pipetting, mixing, incubating, measuring, etc.) and requires different

“resources” (1-, 8-, or 96-chanel pipette, shaker, photometer, flow cytometer, reaction

Page 4 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

5

vessels, plates, etc.) which have to be optimally coordinated in an efficient schedule.

Unfortunately, tools to plan and perform dynamic experiments exploiting full LHS capacity

are not available. Despite various publications using OED in biotechnology (Franceschini

and Macchietto 2007b; Galvanin et al. 2011; Gernaey et al. 2002; Kreutz and Timmer 2009;

López Cárdenas et al. 2013; Skanda and Lebiedz 2010) and design of experiments applied

in HTS, to our knowledge, there are no publications dealing with the online design of

nonlinear dynamic parallel experiments as they are needed for HTBD.

Furthermore, in order to minimize the experimental effort and increase the robustness of the

experiment, the model should be re-fitted after each sample and the experiment re-designed

as data is being generated. This requires an efficient solution of a fairly large nonlinear and

possibly ill-conditioned optimization program.

Works dealing with online experimental re-design

(Galvanin et al. 2012; Stigter et al. 2006;

Zhu and Huang 2011) show that the information generated in each experiment is

significantly higher compared to experiments that are planned in sequence (offline). It may

appear that the extension to parallel experiments is fairly straight forward from the

theoretical point of view (Cruz Bournazou et al. 2014a); nevertheless, former studies are

applied on simple systems and its application to a real set of parallel cultivations presents

many challenges. This includes the complexity of the biological system, the operation of the

experimental facilities, the typically long delays and the limited information content of the

analyses, and the scheduling of all actions considering resource availability (Mayer and

Raisch 2004). Finally, a robust and cheap computation of both optimization programs,

namely the Parameter Estimation (PE) and the Experimental Design (ED) problem, is

Page 5 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

6

critical in order to handle data and compute the optimal strategy at the speed dictated by the

running experiment.

Problem statement

The goal of this work is to achieve an online computation of optimal parallel dynamic

experiments together with its automatic implementation in a LHS for fed-batch cultivations

of the bacterium E. coli (Figure 1) using a Sliding Window Optimal experimental Re-

Design (SWORD). We regularize the singularity problem by an appropriate selection of the

parameter subset to assure a well-conditioned parameter estimation even with reduced

experimental information, and use a moving horizon approach to reduce the computational

burden of the optimization. Ultimately, it is possible to re-design the running experiment

after each measurement and take full advantage of the present state of information to plan

the following steps.

Materials and Methods

The most relevant aspects of the experiment are presented here, for a more detailed

description of the experimental setting, the reader is referred to the Appendix in the

supplementary material and to (Nickel et al. submitted). The experiments were carried out

using two different LHSs. The mini-bioreactor system bioREACTOR 48 (2mag AG,

Munich, Germany) was integrated in a TECAN Freedom Evo LHS (Tecan, Crailsheim,

Germany) (Figure 2) to automate all liquid handling steps. General process data

(temperature, stirrer speed, pH, dissolved oxygen) was stored in the iLab-Bio database

(infoteam, Bubenreuth, Germany). For pH control of the vessels, a LabVIEW (National

Instruments, Munich, Germany) based pH controller was used. Worklists were generated of

titrant volumes for the Tecan Gemini software. Samples were analyzed on a Hamilton

Page 6 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

7

Microlab Star platform (Hamilton Robotics, Bonaduz, Switzerland), which is equipped with

a Biotek Synergy MX microplate reader (BioTek Instruments GmbH, Bad Friedrichshall.

Germany). The data was stored in the iLab-Bio database for further processing.

Bacterial strain and cultivation medium

The experiments were performed using E. coli W3110 (DSM No. 5911) stored at – 80 °C in

LB medium containing 20% glycerol. Cultivations were carried out in EnPresso media

(BioSilta Ltd., Cambridge, UK): the seed cultures were grown in EnPresso B medium and

the main culture was performed in EnPresso B defined medium. Unless otherwise stated,

media were prepared according to the manufacturer’s instructions.

Seed and main cultures

The seed culture was performed in a 125 mL UltraYield Flask (Thompson Instrument

Company, Oceanside, USA) containing 25 mL Enpresso B, which was inoculated with 24

µL of cryo culture. For glucose release, 1.5 U L

-1

Reagent A (BioSilta) were added. The

shake flask was covered with AirOtop enhanced shake flask seals (Thompson Instrument

Company) and incubated at 30°C and 220 rpm in a shaking incubator with 50 mm offset

until the mid-log phase was reached (4 – 6 h). Then the main culture was prepared in 100

mL of EnPresso B defined medium containing 10 g L

-1

glucose with an initial OD

620

of

0.01. For the initial batch phase, pre-calibrated baffled 500 mL sensor flasks for online pH

and DO measurements were used (PreSens GmbH, Regensburg, Germany). The flasks were

incubated overnight at 30°C and 200 rpm with 25 mm offset. At 20 h, the 11 mL of culture

were transferred into the 8 mini-bioreactors.

The cultures were incubated at 30°C with an aeration rate of 0.1 L min

-1

and a stirrer speed

of 2600 rpm until they reach glucose limitation. This was indicated by a sudden increase of

Page 7 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

8

DO. Then 6 UL

-1

of Reagent A were added to initiate the glucose limited fed-batch phase.

The pH was adjusted to 7.0 before starting the SWORD experiment by the addition of

6.25 % (v/v) ammonia solution via the Freedom Evo LHS.

SWORD experiment

Deck layout of Freedom Evo LHS

For addition of Reagent A, a 3000 U L

–1

Reagent A solution was placed on a cooled plate

carrier at 4 °C. The EnPresso B defined medium, the 40 gL

-1

acetate additive solution and

the glucose additive solution were pipetted individually into 24 deep well plates (Ritter,

Schwabmünchen, Germany). Samples for analysis were collected into a V-bottom plate. The

pH was adjusted by adding 6.25 % (v/v) ammonia solution prepared in a sterile container.

Sterilization of the metallic pipetting tips was done in a washing station using 70 % (v/v)

ethanol (EtOH).

Sequential arrangement of additions during SWORD

The SWORD experiment was divided into cycles of 20 minutes with a total experimental

time of 6 h. One cycle comprised 5 subroutines each of which lasted 4 minutes (Figure 3).

During each 4-minute subroutine, all steel needles were firstly sterilized. Then, either a

MATLAB work list for the addition of a specific pulse was loaded into the Gemini

environment and executed or the LHS took 300 µL samples from each mini reactor into the

V-bottomed plate.

Analytics

Sampling was done every 20 minutes. The samples were pipetted into plates containing

anhydrous NaOH and mixed thoroughly to a final NaOH concentration of 0.1 M.

Page 8 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

9

Afterwards, 20 µL of sample were used for optical density measurements. The remaining

sample volumes were centrifuged at 3000 rpm and 4 °C for 10 min to separate the cells.

Duplicates of 75 µL supernatant were pipetted for glucose and acetate analyses. Samples

from four successive time points were cooled at 4 °C and analyzed together.

The OD

620

samples were diluted 15 fold in flat bottom plates and were measured manually

with a PHOmo microplate reader (Anthos, Krefeld, Germany). Samples were shaken inside

the microplate reader before the measurement. The OD

620

values were corrected against the

blank, correlated to OD

600

in a 1 cm cuvette by multiplication with the factor 2.62. The cell

dry weight (CDW) was approximated by dividing the OD values by 3.3 as obtained from

dry weight calibrations. OD-measurements were carried out every 20 min.

The extracellular glucose concentration was measured with the enzymatic Glucose

Hexokinase FS kit (DiaSys Diagnostic Systems, Holzheim, Germany) on the Hamilton LHS

as described previously (Knepper et al. 2014). The extracellular acetic acid concentration

was measured with the enzymatic Enzytec fluid

TM

kit (R-Biopharm, Darmstadt, Germany).

Both enzymatic assays were performed on the Microlab Star LHS.

Software

All numeric computations were performed using MATLAB Release 2013b. Model and

parameter sensitivity equations were integrated using CVODES solver from SundialsTB

Toolbox (Hindmarsh et al. 2005).

The computed parameter sensitivities, normalized with respect to initial parameter guesses

and measurements, were used to accurately calculate the gradients of the PE problem and

the Fisher Information Matrix (FIM). The PE problems were solved by single shooting and

using TOMLABs clsSolve solver with the Wang, Li, Qi Structured MBFGS method and

Page 9 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

10

user defined gradients (Holmström et al. 2003). The experimental design (ED) problems

were solved using the TOMLAB implementation of SNOPT (Gill et al. 2002). The objective

was computed from the FIM. The gradients were computed with finite differences with four

threads in parallel.

E. coli kinetic model

The macro-kinetic model used in this work describes E. coli substrate limited growth and a

Matlab® version is available online (see Appendix). The validity of the model is

constrained to a narrow operability region, namely i) glucose concentrations lower than 0.2

g/l, ii) at least 20% of dissolved oxygen tension, and iii) overflow phases must be shorter

than 60 seconds. In addition to substrate feeding, the model considers glucose release by the

EnBase system (Panula-Perälä et al. 2008). The model consists of 6 state variables, namely

biomass concentration () in [g/l], substrate concentration () in [g/l], dissolved oxygen

tension () in [% of saturation], acetic acid () in [g/l], liquid volume () in [l], and

glucose release enzyme concentration () in [Units/l]. These are represented by an Ordinary

Differential Equation (ODE) system as given in the appendix in equations A1 to A6. The

structure of the model is based on the work of (Lin et al. 2001). The model was modified to

add recent discoveries in the acetate production mechanism (Valgepea et al. 2010). As a

result, acetate production and consumption are always active and at equilibrium with a net

acetate production of 0.

SWORD

We present a brief introduction to the basics of OED. The reader is referred to (Barz et al.

2012; Cruz Bournazou et al. 2014b; Franceschini and Macchietto 2007a) for further details.

Page 10 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

11

Optimal experimental design

Let us consider a model consisting of a nonlinear implicit differential equation system that

describes the dynamic behavior of a process (e.g. an E. coli fed-batch cultivation). This

model will contain a set of unknown parameters that can be varied to fit the outputs of the

model against observations of the real process. OED aims at finding experimental setups

such that the statistical uncertainty of estimates of the unknown model parameters is

minimized (Korkel 2002). Generally, the main factors that affect the identifiability of a

model (i.e. the size of its confidence intervals) are: i) the structure of the model, ii) the

quality of the measurements (type, frequency, accuracy), and iii) the design of the

experiment used to generate the data (inputs, conditions, measurmenents, etc.). In other

words, the computed optimal conditions aim to maximize the information content of the

measurements so that the parameters are determined most accurately.

Roughly speaking, the inputs should excite the system so as to let the experimenter observe

its dynamics by sampling in the most sensitive points of the experiment. To find this optimal

setup, mixed second order sensitivities of the least square functional with regard to

parameters and states have to be computed.

Mathematical formulation

A simplified description of the problem is presented, the reader is referred to Appendix for

further details. We consider a twice continuously differentiable nonlinear Ordinary

Differential Equation (ODE) system:











=













,



,











;











=

ℎ













;













=





1

where  ∈



,



⊆ℝ is the time, ∈ℝ



#

 are dependent state variables, ∈ℝ



$

are the time-varying input or experimental design variables and  ∈ℝ



%

the unknown

Page 11 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

12

parameter vector and initial conditions are given by 



. The vector ∈ℝ



&

contains the

predictions of the observed responses. In Eq. (1) we consider a set of ℛ ≔ {*

+

,⋯,*



-

}

parallel experiments, with /

0

being the number of reactors. The robot simultaneously feeds

all mini-reactors at discrete feeding time instances 1

2

≔3

+

,⋯,



4

5, with /

2

being the

total number of injections into one reactor. Each reactor has its corresponding input signals

or experimental design variables 

0

∈ℝ



$

, with  ∈1

2

and *∈ℛ. For all reactors, the

experimental design variables are collected in the vector

6 ≔7

08+



28+



9

,⋯,

08+

:



4

;

9

<

9

,…,7



-



28+



9

,⋯,



-

:



4

;

9

<

9

∈ℝ



$

⋅

4

⋅

-

with the number of possible individual species to be injected /

?

, the total number of

injections /

2

and the number of reactors /

0

. The robot simultaneously takes samples from

all mini-reactors at discrete measurement time instances 1

@

≔At

+

,⋯,t

C

D

E, with n

@

being

the number of samples taken by the robot from one reactor. The measurements are defined

by G

H

≔I

08+

H



J8+



9

,⋯,

08+

H





K



9

L

9

,…,I



-

H



J8+



9

,⋯,



-

H





K



9

L

9

∈ℝ



&

⋅

K

⋅

-

with the number of predicted responses /

M

, the number of samplings taken by the robot /

J

and the number of reactors /

0

. In the same way, corresponding predicted response variables

are defined as 

0

;

0

,∈ℝ



&

, with  ∈1

J

and * ∈ℛ and collected in the vector

G



6

,





∈

ℝ



&

⋅



K

⋅



-

2

Note that the predicted responses are obtained from the solution of Eq. (1) for each reactor *

and therefore depend on 

0

and .

Model parameters  are estimated by maximum likelihood estimation minimizing the

quadratic residual (Bard 1974):

Page 12 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

13



N

:

=

arg

min

U

1

2



G



6

,





−

G

H



9





Y

H



Z

+





G



6

,





−

G

H



3

with the weighting matrix Y

H

∈ℝ



&

⋅

K

⋅

-

×

&

⋅

K

⋅

-

being the variance-covariance matrix of

the measurement errors in the data. Y

H

is assumed to be a diagonal matrix with the variance

\

]^

of each measurement _ in its diagonal entries. The precision of a maximum-likelihood

estimate 

N can be analysed by evaluation of the so called Fisher Information Matrix (FIM)

F6,

N or its inverse (lower bound on the covariance matrix Y

U

6,

N) (Bard 1974). The

design of an optimal experiment for improving parameter precision therefore minimizes

some metric of Y

U

6,

N by optimally choosing the inputs or experiment design variables 6

subject to equality and inequality constraints. In this contribution we apply the so called A-

optimal criterion a

b

to formulate the optimal experiment design objective function:

a

b





,



N



≔

1

/

c



tr

:

d

Z

+



6

,



N



;

4

Online optimal experimental re-design

The real information content of an ED depends on the accuracy of the assumed parameter

values (initial parameter guess). Unfortunately, uncertainties in the initial parameter guess

and initial conditions cause a mismatch between computed and experimental outputs leading

to suboptimal ED. This mismatch can be reduced by an adaptive online ED, where the

experiment is iteratively re-designed as information is generated. This idea has been

discussed in previous publications and is described in detail by (Barz et al. 2012).

In sequential OED, the identifiability of the parameter estimation problem can be studied

and solved offline. In contrast, when performing a recursive fit of the model and a design of

each sampling setup, an efficient solution of nonlinear and possibly ill-conditioned problems

is required. (Barz et al. 2012) propose the addition of a local parameter identifiability

Page 13 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

14

analysis method to select the parameters such that well-conditioned optimization problems

are guaranteed. Therefore, it is possible to redesign the running experiment in order to take

full advantage of the present state of information and plan the following step (optimal input

, sampling time 

]

, sampling setup) after each measurement. This is, to the best of our

knowledge, the first attempt to carry out an online parallel design of dynamic experiments

using real automated cultivations systems.

The F matrix needs to be well conditioned as it is used to approximate the covariance matrix

of the parameters C

θ

by inversion. If this is not the case, e.g. due to insufficient

measurement data or correlations in the parameters, a regularization method is applied to

approximate F by a well-conditioned matrix. The regularization is based on the Subset

Selection (SsS) method which is very effective when applied to ED problems with ill-

conditioned matrices (López Cárdenas et al. 2015). Moreover, the SsS method also proposes

a reduction of the parameter space (and the PE and ED problem) by finding the parameters

which cannot be identified for the existing measurement data (identifiability analysis).

Experimental implementation of SWORD

The duration of the SWORD experiment is 6 h and is divided in 18 cycles of 20 min. A

detailed description of one cycle is given in Figure 3. The initial experimental design is

carried out up to minute 140 ( first re-design is available). The first at-line data is available

after 100 minutes. Once new at-line data is available, i) the identifiability of the parameters

is checked, ii) a PE is solved for max. 20 min, and iii) the parameter estimates obtained are

passed to re-compute the optimal experiment (max. time 20 min). This procedure is repeated

three more times until 300 min where the last re-design is calculated. The experiment run is

depicted in Figure 4.

Page 14 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

15

Column C reads the real time. Column A shows the tasks performed by the first computer,

namely the PE (green; cell [4,Q]) and SsS and identifiability analysis (dark green; cell

[5,Q])). Column B depicts the performance of the second computer which was responsible

for the ED-SH optimizations of the short horizon (dark red; cell [6,Q])) and DE-LH random

seed global optimizations computing the long horizon (red; cell [7,Q])), and the data that is

available is depicted by bars (black; cell [10,Q]). The time points of the samples and designs

can be seen in row 1 (columns D to V). For example: at 100 min (cell [7,C]) the data from

sampling at 0 and 20 min is stored as depicted in the bar in cells [7,D- E] and the comp1

initiates the parameter estimation (cell [7,A]). We can also see the three optimization

horizons (orange; cell [8,Q]), which were implemented at 140 min, 240 min, and 340 and

finally the results of the long term horizon (blue; cell [9,Q]), which were available at 0 min,

220 min and 320 min. The run of the experiment is depicted at the example of biomass for

reactor 1 in Figure 5.

Results and discussion

SWORD was able to fit the model despite the fact that the enzymatic assay for acetate

quantification was inaccurate, proving the advantages of an online re-design approach

Figure 6. The program could obtain reliable parameter estimates by re-designing the

experiment taking the unexpectedly high variance in acetate into account. Further

screenshots of the monitoring station are presented in the Appendix to show the

development of the experiment and the inputs selected over time. The reactor replicates

proved to be statistically equal according to the hypothesis test and confidence intervals

(Montgomery and Runger 2010) carried out on the cell dry weight data over the complete

experiment, see Figure 7.

Page 15 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

16

In Figure 8, the normal probability density function of the whole parameter vector centered

at the normalized parameter estimate 

N with variance \

^

are displayed. The size of the

variability of the parameters with large uncertainty can be observed (i.e., Yam and Yofm).

‘qSmax’ has the lowest relative standard deviation \

0

=0.56% whereas ‘Yam’ and

‘Yofm’ have the highest ones with 341% and 822 % respectively.

The identifiability list for this problem can be found in Table 1. It should be stressed that

due to parameter correlations, the ranking between variance, sensitivity and SsS are not

equal. Still parameter with the largest eﬀect on the outputs is also 'qSmax' and the parameter

with the smallest effect on the measurable variables after all orthogonal projections is

'Yofm' showing consistency with the precision results above mentioned. In addition, the

sensitivity matrix is not full-rank but 23, i.e. only 23 parameters are identifiable. The

unidentifiable parameters being 'Yam', and 'Yofm'. Additionally, the sensitivity ranking list

built based on the overall output-parameter sensitivity information is displayed. The metric

used is the sensitivity measure δ=

+

l

m

,n

o

,n

p

∗∑s

]t

^

with s

]t

^

=

uM

v

uU

w

which considers the norm

of the sensitivity of the whole predicted response variables to a change in the parameter 

t

.

Table 1. Parameter ranking list according to: parameter variance x

y

, relative standard deviationx

z

and

mean sensitivity {

|

, of the predicted response variables } to a change in the parameter |.

Now let us show the advantages of the re-design of the experiment. We compare the results

obtained in the real experiment against a simulation of the initial design with the final

parameter values. The A-criterion, which represents the average coefficient of variation

(normed standard deviation) of the parameter estimates is almost 50 times lower (235.86%

vs. 4.83%) with SWORD considering all 23 identifiable parameters.

Furthermore, the relative variance σ

r

of the parameters at the estimated vector θ

f

is depicted

in Figure 9 in ascending order (light green bars). A comparison between the parameter

Page 16 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

17

precision at the initial guess θ

0

and the estimated vector θ

f

shows the superiority of the

estimated parameter vector θ

f

over θ

0

. The identifiability increased from only 13 parameters

with a variance lower than 10% with the original design (dark brown bars) to 23 due to the

continues re-design of the inputs. This proves that the final experiment cannot be computed

beforehand since, due to wrong initial estimates, the predicted outputs are in reality

suboptimal.

Conclusions

The online re-design of optimal experiments for 4x2 parallel fed-batch cultivations is

possible using a moving horizon approach which reduces the experimental effort compared

to sequential designs. In this case study, one experimental run generated sufficient

information to fit a macro-kinetic model of E. coli W3110, significantly reducing time and

costs of model validation. This is achieved by re-designing the optimal strategy of the

parallel cultivations using the existing data as it is generated during the experiment.

The results show that the SWORD program can overcome complications related with

ill-posed optimization problems and highly nonlinear designs in parallel bioreactors. This is

a first step towards new model based tools that can fully exploit the advantages of LHS and

speed up the development from screening to production in bioprocesses. Faster, cheaper,

and more efficient experiments will encourage the use of mechanistic models, allowing for

the application of computer-aided tools in design, monitoring, control, and optimization of

bioprocesses.

Page 17 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

18

Acknowledgments

The authors acknowledge financial support by the German Federal Ministry of Education

and Research (BMBF) within the Framework Concept ”Research for Tomorrow’s

Production”

(project no. 02PJ1150, AUTOBIO project) managed by the Project Management

Agency Karlsruhe (PTKA).

We would like to thank

Sven Olof Enfors for fruitful discussions during the development of

the model, Tiffany Wong, and students Michael Heisser, Lorenz Thielert, Sebastian Hans,

and Terrance Wilms.

Page 18 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

19

References

Ataman M, Hatzimanikatis V. 2015. Heading in the right direction: thermodynamics-

based network analysis and pathway engineering. Current opinion in

biotechnology 36:176-182.

Baltes M, Schneider R, Sturm C, Reuss M. 1994. Optimal experimental design for

parameter estimation in unstructured growth models. Biotechnology Progress

10(5):480-488.

Bard Y. 1974. Nonlinear parameter estimation. New York: Academic Press New York.

Barz T, López Cárdenas DC, Arellano-Garcia H, Wozny G. 2012. Experimental

evaluation of an approach to online redesign of experiments for parameter

determination. AIChE Journal 59(6):1981-1995.

Brunk E, Neri M, Tavernelli I, Hatzimanikatis V, Rothlisberger U. 2012. Integrating

computational methods to retrofit enzymes to synthetic pathways.

Biotechnology and Bioengineering 109(2):572-582.

Cruz Bournazou MN, Arellano-Garcia H, Wozny G, Lyberatos G, Kravaris C. 2012. ASM3

extended for two-step nitrification–denitrification: a model reduction for

sequencing batch reactors. Journal of Chemical Technology and Biotechnology

87(7):887-896.

Cruz Bournazou MN, Barz T, Nickel D, Neubauer P. 2014a. Sliding-Window Optimal

Experimental Re-Design in Parallel Microbioreactors. Chemie Ingenieur

Technik 86(9):1379-1380.

Cruz Bournazou MN, Hooshiar K, Arellano-Garcia H, Wozny G, Lyberatos G. 2013.

Model Based Optimization of the Intermittent Aeration Profile for SBRs under

Partial Nitrification. Water Research 47(10):3399-3410.

Cruz Bournazou MN, Junne S, Neubauer P, Barz T, Arellano-Garcia H, Kravaris C. 2014b.

An approach to mechanistic event recognition applied on monitoring organic

matter depletion in SBRs. AIChE Journal 60(10):3460-3472.

Delgado San Martín JA, Cruz Bournazou MN, Neubauer P, Barz T. 2014. Mixed integer

optimal control of an intermittently aerated sequencing batch reactor for

wastewater treatment. Computers & Chemical Engineering 71:298-306.

Franceschini G, Macchietto S. 2007a. Model-based design of experiments for parameter

precision: State of the art. Chem Eng Sci 63(19):4846–4872.

Franceschini G, Macchietto S. 2007b. Validation of a model for biodiesel production

through model-based experiment design. Ind Eng Chem Res 46(1):220-232.

Galvanin F, Barolo M, Macchietto S, Bezzo F. 2011. Optimal design of clinical tests for

the identification of physiological models of type 1 diabetes in the presence of

model mismatch. Medical and Biological Engineering and Computing

49(3):263-277.

Galvanin F, Barolo M, Pannocchia G, Bezzo F. 2012. Online model-based redesign of

experiments with erratic models: a disturbance estimation approach.

Computers & Chemical Engineering 42:138-151.

Gernaey K, Petersen B, Dochain D, Vanrolleghem PA. 2002. Modeling aerobic carbon

source degradation processes using titrimetric data and combined

respirometric–titrimetric data: Structural and practical identifiability.

Biotechnology and Bioengineering 79(7):754-767.

Page 19 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

20

Gill PE, Murray W, Saunders MA. 2002. SNOPT: An SQP algorithm for large-scale

constrained optimization. SIAM Journal on Optimization 12(4):979-1006.

Glassey J, Montague G, Ward A, Kara B. 1994. Artificial neural network based

experimental design procedure for enhancing fermentation development.

Biotechnology and Bioengineering 44(4):397-405.

Glauche F, Pilarek M, Cruz Bournazou MN, Grunzel P, Neubauer P. 2016. Design of

Experiments based high-throughput strategy for development and optimization

of efficient cell disruption protocols. Engineering in Life Sciences:accepted.

Hidalgo-Bastida LA, Thirunavukkarasu S, Griffiths S, Cartmell SH, Naire S. 2012.

Modeling and design of optimal flow perfusion bioreactors for tissue

engineering applications. Biotechnology and Bioengineering 109(4):1095-1099.

Hindmarsh AC, Brown PN, Grant KE, Lee SL, Serban R, Shumaker DE, Woodward CS.

2005. SUNDIALS: Suite of nonlinear and differential/algebraic equation solvers.

ACM Transactions on Mathematical Software (TOMS) 31(3):363-396.

Holmström K, Edvall MM, Göran AO. TOMLAB for large-scale robust optimization;

2003. p 218-222.

Junker BH, Wang HY. 2006. Bioprocess monitoring and computer control: Key roots of

the current PAT initiative. Biotechnology and Bioengineering 95(2):226-261.

Knepper A, Heiser M, Glauche F, Neubauer P. 2014. Robotic Platform for Parallelized

Cultivation and Monitoring of Microbial Growth Parameters in Microwell Plates.

Journal of laboratory automation 19(6):593-601.

Kokkalis A, Thygesen UH, Nielsen A, Andersen KH. 2014. Limits to the reliability of

size-based fishing status estimation for data-poor stocks. Fisheries Research

171:4-11.

Korkel DMS. 2002. Numerische Methoden fuer Optimale Versuchsplanungsprobleme

bei nichtlinearen DAE-Modellen: Ruprecht-Karls-Universitaet Heidelberg.

Körkel S, Kostina E, Bock HG, Schlöder JP. 2004. Numerical methods for optimal control

problems in design of robust optimal experiments for nonlinear dynamic

processes. Optim Method Softw 19(3-4):327-338.

Kozak K. 2014. Large scale data handling in biology: Bookboon.

Kreutz C, Timmer J. 2009. Systems biology: experimental design. FEBS Journal

276(4):923-942.

Legmann R, Schreyer HB, Combs RG, McCormick EL, Russo AP, Rodgers ST. 2009. A

predictive high-throughput scale-down model of monoclonal antibody

production in CHO cells. Biotechnology and Bioengineering 104(6):1107-1120.

Lin HY, Mathiszik B, Xu B, Enfors SO, Neubauer P. 2001. Determination of the

maximum specific uptake capacities for glucose and oxygen in glucose-limited

fed-batch cultivations of Escherichia coli. Biotechnol Bioeng 73(5):347-57.

Long Q, Liu X, Yang Y, Li L, Harvey L, McNeil B, Bai Z. 2014. The development and

application of high throughput cultivation technology in bioprocess

development. Journal of Biotechnology 192:323-338.

López Cárdenas DC, Barz T, Körkel S, Wozny G. 2015. Nonlinear ill-posed problem

analysis in model-based parameter estimation and experimental design.

Computers & Chemical Engineering 77:24-42.

López Cárdenas DC, Barz T, Peñuela M, Villegas A, Ochoa S, Wozny G. 2013. Model-

based identifiable parameter determination applied to a simultaneous

Page 20 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

21

saccharification and fermentation process model for bio-ethanol production.

Biotechnology Progress 29(4):1064-1082.

Lu F, Toh PC, Burnett I, Li F, Hudson T, Amanullah A, Li J. 2013. Automated dynamic

fed-batch process and media optimization for high productivity cell culture

process development. Biotechnology and Bioengineering 110(1):191-205.

Lutz MW, Menius JA, Choi TD, Laskody RG, Domanico PL, Goetz AS, Saussy DL. 1996.

Experimental design for high-throughput screening. Drug discovery today

1(7):277-286.

Mayer E, Raisch J. 2004. Time-optimal scheduling for high throughput screening

processes using cyclic discrete event models. Mathematics and Computers in

Simulation 66(2):181-191.

Medema MH, van Raaphorst R, Takano E, Breitling R. 2012. Computational tools for the

synthetic design of biochemical pathways. Nature Reviews Microbiology

10(3):191-202.

Montgomery DC, Runger GC. 2010. Applied statistics and probability for engineers:

John Wiley & Sons.

Neubauer P, Junne S. 2010. Scale-down simulators for metabolic analysis of large-scale

bioprocesses. Current opinion in biotechnology 21(1):114-21.

Nickel DB, Cruz Bournazou MN, Wilms T, Neubauer P, Knepper A. submitted. Online

bioprocess data generation, analysis and optimization for parallel fed-batch

fermentations at mL scale.

Oddone GM, Lan CQ, Rawsthorne H, Mills DA, Block DE. 2007. Optimization of fed-

batch production of the model recombinant protein GFP in Lactococcus lactis.

Biotechnology and Bioengineering 96(6):1127-1138.

Panula-Perälä J, Siurkus J, Vasala A, Wilmanowski R, Casteleijn MG, Neubauer P. 2008.

Enzyme controlled glucose auto-delivery for high cell density cultivations in

microplates and shake flasks. Microb Cell Fact 7(3):1.

Puskeiler R, Kaufmann K, Weuster-Botz D. 2005. Development, parallelization, and

automation of a gas-inducing milliliter-scale bioreactor for high-throughput

bioprocess design (HTBD). Biotechnology and Bioengineering 89(5):512-523.

Rosen C, Vrecko D, Gernaey K, Pons M-N, Jeppsson U. 2006. Implementing ADM1 for

plant-wide benchmark simulations in Matlab/Simulink. Water Science and

Technology 54(4):11-19.

Schäpper D, Alam MNHZ, Szita N, Lantz AE, Gernaey KV. 2009. Application of

microbioreactors in fermentation process development: a review. Analytical

and bioanalytical chemistry 395(3):679-695.

Shockley KR. 2015. Quantitative high-throughput screening data analysis: challenges

and recent advances. Drug discovery today 20(3):296-300.

Skanda D, Lebiedz D. 2010. An optimal experimental design approach to model

discrimination in dynamic biochemical systems. Bioinformatics 26(7):939-945.

Stigter J, Vries D, Keesman K. 2006. On adaptive optimal input design: a bioreactor case

study. AIChE Journal 52(9):3290-3296.

Su WW, Liu B, Lu W-B, Xu N-S, Du G-C, Tan J-L. 2005. Observer-based online

compensation of inner filter effect in monitoring fluorescence of GFP-

expressing plant cell cultures. Biotechnology and Bioengineering 91(2):213-

226.

Page 21 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

22

Takors R, Wiechert W, Weuster-Botz D. 1997. Experimental design for the

identification of macrokinetic models and model discrimination. Biotechnology

and Bioengineering 56(5):564-576.

Valgepea K, Adamberg K, Nahku R, Lahtvee P-J, Arike L, Vilu R. 2010. Systems biology

approach reveals that overflow metabolism of acetate in Escherichia coli is

triggered by carbon catabolite repression of acetyl-CoA synthetase. BMC

Systems Biology 4(1):166.

Venkata Mohan S, Chandrasekhara Rao N, Krishna Prasad K, Murali Krishna P,

Sreenivas Rao R, Sarma PN. 2005. Anaerobic treatment of complex chemical

wastewater in a sequencing batch biofilm reactor: Process optimization and

evaluation of factor interactions using the Taguchi dynamic DOE methodology.

Biotechnology and Bioengineering 90(6):732-745.

Zavrel M, Schmidt T, Michalik C, Ansorge-Schumacher M, Marquardt W, Büchs J, Spiess

AC. 2008. Mechanistic kinetic model for symmetric carboligations using

benzaldehyde lyase. Biotechnology and Bioengineering 101(1):27-38.

Zhu Y, Huang B. 2011. Constrained receding-horizon experiment design and parameter

estimation in the presence of poor initial conditions. AIChE Journal

57(10):2808-2820.

Page 22 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

Figure 1: simplified flow diagram of the online re-design of experiments procedure

172x39mm (150 x 150 DPI)

Page 23 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

Figure 2 (a) bioREACTOR 48 integrated into the Freedom Evo LHS. (b) Data flow concept of SWORD. On-

line signals from the mini-bioreator system (MBR, I) and measured data from samples (II) is stored in a

database, which can be accessed for the sliding window re-

design (III). The LHS was controlled via MATLAB.

(c): Deck layout of the Freedom Evo LHS: bioREACTOR 48 (1), Additives: Reagent A (2), EnPresso B defined

(8), 40 g L-1 sodium acetate (9), glucose (10); sample plate (11).The steel needles were washed in

deionised water (3), twice in EtOH (4, 6) and in sterile deionised water (7) pH adjustment was done with

6,25 % ammonia (5).

152x101mm (150 x 150 DPI)

Page 24 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

Figure 3: Scheduling of the SWORD experiment. Cycles of 20 min were repeated for 6h. During these

intervals, in cycles of 4 min EnPresso B defined medium, Reagent A, acetate and glucose were added to the

cultivations. Afterwards, 300 µL samples were taken from each reactor. Each addition routine consisted of

several subroutines (right).

133x51mm (150 x 150 DPI)

Page 25 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

Figure 4: Time schedule of the experiment. Following time column C, the development of the experimental

setup is to be seen (starting at the top). The activity of the computers (computer1 PE in column A, and

computer2 ED in column B). Availability of at line measurement (biomass, glucose, and acetate) are

represented by black bars in relation to measurement number and sampling time, row 1.

127x81mm (150 x 150 DPI)

Page 26 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

Figure 5: Evolution of SWORD experiment at the example of cell dry weight of the firs reactor (from plot A

to D). Red dashed and blue lines represent the simulation results before and after parameter estimation

respectively. The bars represent the horizon of available online and at-

line data (black), the time required to

compute the parameter estimation (blue), the time for ED optimization (red), and horizon length of the

experimental re-design (green).

165x107mm (150 x 150 DPI)

Page 27 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

Figure 6: Screenshot of the monitoring station. Data collected over the complete experiment. Continuous

lines represent the simulations. The parameters are shown in the left side of the Monitor. Active parameters

(considered in the PE) are

marked with “#” on the left side. Measurements vs. simulations of all four reactor

pairs (first reactor blue second reactor red) are depicted in a 5x4 format. Column-wise beginning from the

upper graph, cell dry weight (X) in grams per liter is first plotted followed by glucose concentration (Gl) in

grams per liter, dissolved oxygen tension (DOT) in % of saturation, and acetate (Ac) in grams per liter in

the fourth graph. In the semi-log graph at the bottom of each column, the volumes in micro liters added

every 4 min (or extracted for sampling) are depicted with bars. Following the order of addition in each cycle,

medium additions are represented by green, enzyme by red, acetate by magenta, glucose by blue and

samples (fixed at 300 µl) by cyan bars. While this screenshot was taken, the additions between time 140

and 240 min were re-designed with the new parameter values.

160x85mm (150 x 150 DPI)

Page 28 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

Figure 7: Graphical representation of the 95% confidence intervals (shaded area) of the difference in the

means µ_1-

µ_2 (solid line) for 4 experimental conditions (2 reactor replicates) over the whole experimental

run. µ_1 and µ_2 represent the means of the dry biomass of each reactor in a replicate. A confidence

interval containing the zero-value suggests no difference in the means at the evaluated experimental point.

The percentages of accepted measurements are: A=89%, B=79%, C=68%, and D=95%.

120x96mm (150 x 150 DPI)

Page 29 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

Parameter σ

σσ

σ

2

σ

σσ

σ

r

δ

δδ

δ

Ranking

Parameter

σ

σσ

σ

2

σ

σσ

σ

r

δ

δδ

δ

Ranking

Variance

Sensitivity

SsS

Variance

Sensitivity

SsS

'qSmax' 3.13E-05 1% 19.543

1 1 1 'Kaq' 1.02E-03

3% 7.272

14 19 16

'qAmax' 3.97E-04 2% 16.100

2 16 10

'Ks' 1.50E-03

4% 6.933

15 21 17

'KLa-4' 5.01E-04 2% 11.309

3 5 4 'Ke' 2.48E-03

5% 6.889

16 17 19

'KLa-3' 5.01E-04 2% 9.825

4 7 5 'Yem' 6.05E-03

8% 5.334

17 3 3

'KLa-1' 5.02E-04 2% 9.427

5 12 9 'Kap' 7.24E-03

9% 4.220

18 18 18

'KLa-2' 5.03E-04 2% 9.358

6 14 14

'Yaof' 9.73E-03

10% 3.610

19 15 15

'KLa-7' 5.04E-04 2% 9.236

7 10 7 'Ksq' 1.16E-02

11% 1.461

20 23 22

'KLa-8' 5.04E-04 2% 9.229

8 11 8 'Yosresp' 1.26E-02

11% 1.213

21 6 20

'KLa-5' 5.07E-04 2% 9.229

9 9 6 'Ko' 1.74E-02

13% 0.709

22 22 21

'KLa-6' 5.08E-04 2% 8.997

10 13 11

'qm' 7.90E-02

28% 0.626

23 20 23

'pAmax' 5.62E-04 2% 8.932

11 4 13

'Yam' 1.16E+01

341% 0.218

24 24 24

'Yaresp' 7.11E-04 3% 8.629

12 2 2 'Yofm' 6.75E+01

822% 0.111

25 25 25

'GRmax' 8.14E-04 3% 8.029

13 8 12

Page 30 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

Figure 8: Estimator analysis through normal probability density functions of the parameters.

118x62mm (150 x 150 DPI)

Page 31 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

For Peer Review

Figure 9: Estimator analysis with parameter precision ranking in terms of relative standard deviation.

Comparison between the variance obtained with the original design (brown bars) and with 3 re-designs

(green bars).

133x89mm (150 x 150 DPI)

Page 32 of 92

John Wiley & Sons

Biotechnology & Bioengineering

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60