Adaptive Sampling of Dynamic Systems for
Generation of Fast and Accurate Surrogate Models
Torben Talis
{
, Joris Weigert
{,
*, Erik Esche, and Jens-Uwe Repke
DOI: 10.1002/cite.202100109
This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any
medium, provided the original work is properly cited.
For economic nonlinear model predictive control and dynamic real-time optimization fast and accurate models are neces-
sary. Consequently, the use of dynamic surrogate models to mimic complex rigorous models is increasingly coming into
focus. For dynamic systems, the focus so far had been on identifying a system’s behavior surrounding a steady-state opera-
tion point. In this contribution, we propose a novel methodology to adaptively sample rigorous dynamic process models
to generate a dataset for building dynamic surrogate models. The goal of the developed algorithm is to cover an as large as
possible area of the feasible region of the original model. To demonstrate the performance of the presented framework it is
applied on a dynamic model of a chlor-alkali electrolysis.
Keywords: Adaptive sampling, Dynamic data-driven modeling, Recurrent neural networks, Surrogate modeling
Received: June 14, 2021; revised: September 15, 2021; accepted: October 20, 2021
1 Motivation and Introduction
The need for online reoptimization of continuously oper-
ated chemical plants becomes ever more important given
the increase in demand response activity of industry, in-
creases in feed fluctuations, or changes in demand, etc. [1].
For processes with complex dynamics and slow return to
steady-state, economic nonlinear model predictive control
or dynamic real-time optimization has long been investi-
gated [2, 3]. Apart from the necessity to have highly accu-
rate process models and reliable state estimators, fast and
robust solution of the associated optimization problems is
of the essence.
Hence, many research groups have started working on
dynamic surrogate models, which accurately mimic the
behavior of complex rigorous models of chemical processes
and allow for fast computation of both state estimation and
real-time optimization problems [4].
In these schemes, simulation problems using rigorous
models are carried out offline and their results are then
employed to train, e.g., recurrent neural networks, for
online application [5]. In these settings, the number of sim-
ulations performed offline does not need to be limited.
Rather, it is important that the simulations cover a large
swath of the original model’s feasible region in terms of
both inputs (controls and initial conditions) and outputs
(state variables) as most surrogate models have no guaran-
tees regarding extrapolation.
Sampling and surrogate modeling for steady-state sys-
tems is well established [6, 7]. For dynamic systems, the
focus so far had been on system identification, i.e., identify-
ing a system’s behavior surrounding a steady-state opera-
tion point [8]. These methods are in general not capable to
generate surrogate models capable of mimicking the behav-
ior of a chemical plant from start-up to shutdown and have
only a small range of validity.
1.1 Objective
To fill this gap, the present contribution proposes a novel
methodology to adaptively sample rigorous dynamic pro-
cess models, with the goal of covering an as large as possible
area of the feasible region of the original model.
f_
x;x;u;d;p;tðÞ¼0 (1)
The systems of interest are defined by Eq. (1), wherein fis
a set of differential-algebraic equations (DAE), xare state
variables, ucontrol variables, ddisturbances, pmodel
parameters, and ttime. The goal is to describe fby a surro-
gate model g, which predicts xof the next time point (t
k+1
)
based on the values of xand uof the current time point
(t
k
).
Chem. Ing. Tech. 2021,93, No. 12, 2097–2104 ª2021 The Authors. Chemie Ingenieur Technik published by Wiley-VCH GmbH www.cit-journal.com
–
Torben Talis, Joris Weigert, Dr. Erik Esche,
Prof. Dr. Jens-Uwe Repke
Technische Universita¨t Berlin, Process Dynamics and Operations
Group, Sekr. KWT 9, Straße des 17. Juni 135, 10623 Berlin, Ger-
many.
{
These authors contributed equally.
Research Article 2097
Chemie
Ingenieur
Technik
As a starting point for the sampling of f, we shall limit
ourselves to the realistic assumptions of only one known set
of initial values (x
0
=x(t
0
), u
0
) and that upper and lower
bound for all controls are known (u
L
£u£u
U
).
Based on this initial knowledge, we here aim to create a
dataset for building gfrom scratch. Given that this initial
information does not contain any information on the extent
of the feasible region of f, nor does it hold information of
the systems time constants beyond the initial point (x
0
,u
0
).
By consequence, the proposed method will have to both
explore the space of state variables xas well as investigate
frequencies at which the system fshows excitations, which
is subsequently relevant to determine the minimum time
step for g.
2 State of the Art
For system identification, step experiments and oscillating
input signals can be used for simple systems. These perturb
a process at steady-state and generate data, which can be
used to approximate the process by surrogate models valid
in a limited area surrounding the steady-state operation
point [9]. In case of more complex systems, the choice of
excitation signal is paramount. Multisine [10] as well as
chirp [11] and amplitude modulated pseudo random binary
signals (APRBS), which ‘‘can be understood as a sequence
of step functions’’ [12], need to be tailored depending on
the system’s characteristics, i.e., delays, nonlinearity, time
constants, etc. APRBS combines highly dynamic steps and
low dynamic constant parts and covers the whole input
space [11]. Design of experiments may be used to maximize
the information that can be achieved with every (simula-
tion) experiment of the process [12]. These methods typi-
cally focus on excitation of the system by manipulating u
and to hence generate data for x, while always starting from
the same initial point x
0
.
Naturally, this does not necessarily induce a large cover-
age of the feasible area in x. Many different methods are
available to sample in hypercubes. Distributing points
evenly in a k-dimensional hypercube can be achieved by a
uniform grid. However, it requires an exponentially growing
number of sample points with an increase in k. Nonuniform
sampling techniques, such as Latin hypercube [13], Ham-
mersley sequence [14], and Sobol [15], are more efficient,
but cannot avoid the exponential growth in terms of
required number of points. Halton and Hammersley se-
quences are used to generate well distributed, space-filling
samples even in higher dimensions. Both are deterministic
and every subsequence has the same space-filling properties
[16]. ‘‘Hammersley points are an optimal design for placing
npoints on a k-dimensional hypercube’’ [17].
Applying these to generate different initial points for x
0
,
however, is ill-advised as these will almost certainly lead to
infeasibilities. Given the complexities of sampling both in
steady-state and dynamic systems, many different sampling
methods have been developed for surrogate model creation.
‘‘One-shot approaches’’ generate all samples at once, with-
out incorporating any prior knowledge of the system. They
provide a good coverage of the input space [18].
Adaptive sampling methods for static systems have
recently become popular. They can be divided into explora-
tion- and exploitation-based methods. The former try to
obtain a wide coverage of the input space, while exploita-
tion-based methods are driven by the training progress of
the model. The latter require multiple iterations of model
training.
In [7] an exploration-based method is proposed that esti-
mates the feasible region in parameter space by using a pre-
determined number of samples. ‘‘Automated learning of
algebraic models for optimization’’ (ALAMO) can be used
to sequentially sample data and structurally improve the
surrogate model of algebraic systems.
An exploitation-based method is presented in [19]: The
input space is divided into regions, which are sampled inde-
pendently. The model is trained and evaluated on those
regions. New samples are added to the region with the high-
est model error, improving the prediction.
A different method is proposed in [20]. It combines
exploration and exploitation and reduces the number of
function evaluations. However, multiple surrogate models
on different subsets of data are trained. Another hybrid
method is described in [18]. The exploration criterion is
based on a Voronoi tessellation in the input space, and the
exploitation part uses local linear approximations of the
objective function.
All of these methods are used for steady-state models.
Adaption to dynamic models and time series forecast is not
easily possible. Olofsson et al. use design of dynamic experi-
ments for model discrimination [21]. The exploration-
based methods focus on coverage of the input space, while
the exploitation-based methods focus on minimizing the
number of samples and function evaluations. Contrarily to
that, our proposed method is based on coverage of the out-
put space and minimizes training time.
3 Proposed Algorithm for Adaptive Sampling
The proposed algorithm aims at generating a dataset for
building a surrogate model. An overview is given in Fig. 1.
Multiple simulations with a short time horizon, a fixed
timestep, and different inputs
uare used to obtain a good
coverage of the input space. (Bio-)chemical systems can
have time constants differing by orders of magnitude. To
identify these, a frequency modulated APRBS (FAPRBS) is
proposed here and added on the inputs. It can be under-
stood as a sequence of multiple APRBS with different fre-
quencies and is depicted in Fig. 2. The maximum amplitude
of the FAPRBS is small compared to the valid range of u.
The overall algorithm is based on geometric quantities,
especially the Euclidean distance of samples. The curse of
www.cit-journal.com ª2021 The Authors. Chemie Ingenieur Technik published by Wiley-VCH GmbH Chem. Ing. Tech. 2021,93, No. 12, 2097–2104
2098 Research Article
Chemie
Ingenieur
Technik
dimensionality restricts the number of output variables
which can be considered. A subset of all variables, that can
contain state and non-state variables, must be selected.
These variables form the output space Y. The dimensional-
ity of Yis currently limited to 7 by the applied implementa-
tion of the Quickhull algorithm [22, 23]. Using a different
implementation to determine the convex hull, however,
could eliminate this limitation.
The trajectory of each simulation run will oscillate around
a single point, which is called seed from here on. Based on
the seeds, poorly covered areas in the output space are identi-
fied and new inputs for the next simulation are estimated
under the assumption that the system is mostly linear be-
tween the seeds. If this assumption is fulfilled, it can be
ensured that new samples are placed in the target region of
the output space. In case the as-
sumption was invalid and the new
point is not in the target region,
new polygons will still be gener-
ated, which can be re-evaluated
and allow the algorithm to contin-
ue.
The algorithm is passed multi-
ple times. One iteration is called
an epoch. The initial conditions
x
0
are kept the same for all simu-
lations in one epoch.
An epoch is composed of four
phases (see Fig. 1). Phase 1 uses
classical sampling methods to
create the basis for the following
adaptive part. Phase 2 expands
the convex hull of the seeds in the output space, while phase
3 populates empty regions inside the hull. Phase 4 creates a
new set of initial conditions for the next epoch. In the
following, each of these phases are detailed further and the
settings and termination of the algorithm are discussed.
3.1 Phase 1 – Initial Sampling
Phase 1 creates the basis for the adaptive sampling. The input
space is a hypercube of dimension d
i
. Hammersley sequence
sampling is used to create samples for
u, which are well dis-
tributed in the input domain. For input space dimensions
d
i
> 5, the points in a Hammersley set tend to align, which
reduces the usability as a sampling method for the input space.
For this reason, we recommend using a Halton sequence sam-
pling in this case [16]. Additional samples are set directly on
the corners and the center of the faces of this hypercube (see
Fig. 3a). FAPRBS samples are generated around each sample
of
u(see Fig.2 and 3a) and dynamic simulations are carried
out for all resulting dynamic control trajectories using the ini-
tial conditions x
0
of the current epoch (see Fig. 3b).
3.2 Phase 2 – Expansion
The goal of phase 2 is to increase the coverage of the output
space Y, specifically to extend the convex hull of the seeds to
Chem. Ing. Tech. 2021,93, No. 12, 2097–2104 ª2021 The Authors. Chemie Ingenieur Technik published by Wiley-VCH GmbH www.cit-journal.com
Phase 1:
Initial Sampling
Phase 2:
Expansion
Phase 3:
Population
Phase 4:
Restart
New Initial Condition
Rigorous Model
Simulation
Specified Variables
Initial Condition
Final Dataset
Figure 1. Overview of proposed adaptive sampling method.
Figure 2. Frequency and amplitude modulated pseudo-random
binary signal with 30 samples at f
1
and 10 samples at f
2
.f
1
f
2
.
The input signal is sampled around the given mean
u.
a) b)
Figure 3. a) Projection of three-dimensional input space with mean of inputs. b) Seeds of all
simulations in output space.
Research Article 2099
Chemie
Ingenieur
Technik
cover a larger space. The seed of one simulation is calcu-
lated by taking the weighted mean of all simulation results
(Fig. 3b). To achieve this, possible candidates (a new input)
and targets (expected value in the output space) are com-
puted. The targets are designed to be close to the current
perimeter of the hull and as far away as possible from the
seeds. They are scored accordingly. The best candidate gets
selected, and the simulation is started.
A candidate consists of input and target and is created by
combining exactly two previously run experiments. Accord-
ing to the linearity assumption the input of the candidate is
u*¼
u1þ
u2
ðÞ=2 and the target value in y can be deter-
mined as t*¼
y1þ
y2
=2. All combinatorial possible can-
didates are calculated and scored.
For scoring the center point of all seeds, M, is computed,
and for every target t* the Euclidean distance to M, l*, and
to the closest seed, r*, are calculated.
l*¼t*Mkk
2(2)
r*¼min t*
y1
2;...;t*
yn
2
(3)
One example is shown in Fig. 4. All possible targets are
then scored: s*=f(l*, r*). fis chosen in such a way, that the
score improves for larger l* and larger r*. To prevent an
infinite loop, targets are declared invalid, if they are too
close to any previously used target: t*–t
used,i2
<r
used,i
.
Phase 2 is repeated until there are no more valid targets,
the maximum number of simulations in phase 2 is reached,
or a threshold for the scoring function is surpassed. The lat-
ter two are hyperparameters for this phase.
3.3 Phase 3 – Population
The goal of phase 3 is to populate empty regions inside the
convex hull of the seeds in the output space. Identifying
these empty regions is equivalent to the largest empty sphere
problem, which is known in computational geometry and
can be solved using Voronoi diagrams [5]. A Voronoi-algo-
rithm returns vertices [4], which are the center of spheres
defined by the closest seeds and can be used in higher
dimensions.
Applying the algorithm on the seeds off the previously
run simulations, every vertex defines a set of d+1 experi-
ments. The number of vertices and the computational cost
of the algorithm ðOðndimðYÞ=2
simulations in epochÞÞ is small in compari-
son to an exhaustive search ðOðndimðYÞ
simulations in epochÞÞ.
For every vertex, a candidate is computed and scored.
The criterion is based on the size of spheres surrounding
the targets and the number of simulation results – y(t)–
inside of them, favoring big spheres with few points inside
of them.
Candidates and targets are computed similarly to phase 2,
by combining d+1 experiments.
u*
i¼1
dþ1X
dþ1
j¼1
uj(4)
t*
i¼1
dþ1X
dþ1
j¼1
yj(5)
A radius is defined as the smallest distance between the
target and the defining seeds.
r*
i¼min t*
iy*
1
2;...;t*
iy*
dþ1
2
(6)
The target is scored by the function s*=f
k
(r*, n*), where-
in n* describes the number of simulation results inside the
d-ball centered at t*. The original outputs ywith fixed time-
steps are used for counting the simulation results inside a
d-ball. kis a hyperparameter, which defines the number of
d-balls that are considered (see Fig. 5). Especially in higher
dimensions the d-ball with radius 1 r* often is empty, so
multiple d-balls with radius r=1r*,K,kr*) are evaluated.
www.cit-journal.com ª2021 The Authors. Chemie Ingenieur Technik published by Wiley-VCH GmbH Chem. Ing. Tech. 2021,93, No. 12, 2097–2104
Figure 4. Two-dimensional output space with an exemplary tar-
get, the corresponding seeds and necessary values l*, r* for cal-
culation of the scoring function.
Figure 5. Phase 3 candidate selection. The vertex defines the
selection of seeds, for target calculation. Size and number of
simulation results inside the blue n-balls are used in the scoring
function.
2100 Research Article
Chemie
Ingenieur
Technik
The inner shells have a bigger influence on the scoring
function.
The score improves for big radii and small number of
results inside the d-balls. To prevent an infinite loop, targets
which are close to already used ones, are declared invalid
and are not evaluated further.
Phase 3 is repeated until there are no more valid targets
left or the maximum number of experiments is reached.
During phase 3, as the empty regions are filled, the mean of
the computed radii r* decreases. This serves as an additional
termination criterion. The maximum deviation for the
mean radius and the number of iterations below that value
are hyperparameters as well as the number of evaluated
n-balls k.
3.4 Phase 4 – Restart
If the maximum number of epochs is not reached, a new set
of initial conditions for the next epoch is determined with
the intention to expand the covered region in output space
Y. Selecting new initial conditions for a DAE-system is non-
trivial. By taking a point from a formerly traversed trajec-
tory it can be guaranteed that the selected point is a valid
initialization of the system.
The new initial condition is computed by using all simu-
lations from all epochs. To overcome the issue of the curse
of dimensionality, a subset of all state variables must be
selected that is considered further. The center of all results
is calculated and the point with the largest distance to the
center is selected as new initial condition. A minimum dis-
tance to all previously used initial conditions must be main-
tained. It is proposed to use the average distance between
two random points in a hypercube as minimum distance,
but it can be chosen freely [24].
The algorithm terminates when there are no more valid
initial conditions, or the maximum number of epochs is
reached.
3.5 Computational Complexity
The main influencing factors for each phase are stated
below: The number of simulations in phase 1 depends on
the dimensionality of the input space and the chosen num-
ber of Hammersley samples.
nPI ¼2duþ2duþnPI
HSS (7)
The number of candidates for each iteration in phase 2 is
nsim;epoch
2
¼On
2
sim;epoch
, wherein n
sim,epoch
is the
number of simulations in the current epoch, which have to
be evaluated.
In phase 3, for each iteration the most expensive opera-
tion is to calculate and evaluate the matrix of Euclidean dis-
tances between the targets and the simulation results. The
Voronoi algorithm returns OðndimðYÞ=2
sim;epoch Þvertices and there-
fore targets. With a fixed timestep and time horizon for all
simulations, there are ny¼nsim;epochnDt=sim simulation
results. So, the matrix is of size ndimðYÞ=2
sim;epoch ·nyand must be
evaluated for every considered radius for a total ktimes.
In phase 4 the distance matrix of size
nepochs ·nsim;totalnDt=sim must be computed once.
4 Case Study
To demonstrate the performance and the applicability for
dynamic data-driven modeling, the presented adaptive sam-
pling framework is applied on a dynamic model of a chlor-
alkali electrolysis (CAE) and a recurrent neural network is
trained and tested based on the generated dynamic data
sets.
4.1 Problem Description
The chlor-alkali electrolysis produces chlorine, hydrogen
and caustic soda for sodium chloride brine using electrical
power. A flowsheet of the modeled process is shown in
Fig. 6a. Here, the CAE cell is represented as a coupled sys-
tem of two continuously stirred-tank reactors. For a detailed
description of the used model, the reader is referred to [25].
The control variables uused for the case study are the cur-
rent density japplied to the CAE cell, the inlet temperature
of the catholyte feed T
in
and the volume feed flow of the
sodium chloride brine V
in
. To manipulate the two latter con-
trols, the two controllers marked in dashed lines in Fig. 6a
had to be removed from the original model. The lower and
upper bounds of uas well as the maximum possible control
changes in one time step (amplitude of the FAPRBS) used in
the sampling algorithm are listed in Tab. 1.
The variables that are supposed to be described in the
dynamic surrogate model (output space Y) are the tempera-
ture in the CAE cell T
cell
and the sodium ion mass fraction
in the anolyte wNaþ. Both variables are controlled variables
of the removed controllers (marked dashed in Fig. 6a).
Chem. Ing. Tech. 2021,93, No. 12, 2097–2104 ª2021 The Authors. Chemie Ingenieur Technik published by Wiley-VCH GmbH www.cit-journal.com
Table 1. Specification of the used FAPRBS control sampling.
Control uLower bound
of u
Upper bound
of u
Amplitude of
FAPRBS
j[A m
–2
] 5000 6000 200
T
in
[C] 59 89 6
V
in
[L s
–1
] 0.05 0.07 0.004
Research Article 2101
Chemie
Ingenieur
Technik
4.2 Adaptive Sampling
The presented framework was applied on the CAE system
described above. The algorithm finished using five epochs
(initial conditions) and performed 145, 15 and 121 dynamic
simulations in the phases 1, 2 and 3, respectively. Each
dynamic simulation used a FAPRBS signal with 30 samples
at a frequency of 1000 s
–1
and 10 samples at a frequency of
2000 s
–1
. The FAPRBS’s amplitude specifications are listed
in Tab. 1.
The resulting dynamic samples in the in- and output
spaces are shown in Fig. 7. Since both output variables used
in the algorithm are algebraic variables in the CAE model,
the initial results at t
0
are distributed over four areas, each
corresponding to an initial condition. 97.5 % of the compu-
tation time was used for the simulations, with the rest spent
on the algorithm. Here, calculation and evaluation of the
matrix in phase 3 took 81.5 % of the computing time, deter-
mination of all input signals 14.2 %, and the calculation of
targets 2.5 %. All other subroutines can be neglected with a
maximum time usage of less than 0.5 % each.
4.3 Dynamic Data-Driven Modeling
To model the dynamic behavior of the predefined output
variables y, a recurrent neural network was trained for each
output separately. The in- and output specifications of the
used recurrent neural network are shown in Fig. 6b. To pre-
dict yat time point t
k+1
the last Ocontrol variable values at
the time points t
k–O
,K,t
k
and the last N values of the mod-
eled output variable yat the time points t
k–N
,K,t
k
are fed
into the recurrent neural network as input variables.
To find a suitable parameterization of the neural net-
works a hyperparameter tuning using Bayesian optimiza-
tion is performed in addition to the standard model train-
ing. The varied hyperparameters and the results of the
tuning are listed in Tab. 2.
To test the quality of the resulting models, an additional
test set consisting of dynamic data of five simulations is
used. The testing control variables are again sampled from
an FAPRBS using the same specifications as in the adaptive
sampling (see Tab. 1) but with mean control values
uthat
were not used in the training data.
The standard model training is per-
formed using Adaptive Moment Estima-
tion (Adam) [26]. The trained models of
the cell temperature and the anolyte
composition show a mean squared error
regarding the testing data of 4.62 10
–6
and 5.36 10
–7
(in a normalized output
space between 0 and 1), respectively.
Fig. 8 shows the testing results of both
modeled variables. It can be seen that the
dynamic behavior of both variables can
be predicted with a high degree of accu-
racy over a wide value range in the out-
put space. This behavior indicates that
the data generated using the presented
adaptive sampling algorithm, provides
www.cit-journal.com ª2021 The Authors. Chemie Ingenieur Technik published by Wiley-VCH GmbH Chem. Ing. Tech. 2021,93, No. 12, 2097–2104
Data-Driven
Process Model
zi=1
Data-Driven
Process Model
zi=n
OH-
Na+
H2O
prod
water
Cl2(g), H2O(g) H2(g) , H2O(g)
in in
out tank, out
out
tank, out
Anolyte Catholyte
CAE cell
QC QC
TC
FC
LC
HE1
HE2
T1 T2
a) b)
Figure 6. Model overview: a) Flowchart of chlor-alkali process model, dashed controllers are removed from model and associated ma-
nipulated variables are used as input variables in sampling algorithm. b) Structure of used recurrent neural networks. Each output is
modeled separately. Parameters Nand Oare determined in hyperparameter tuning.
Figure 7. Results of the adaptive sampling algorithm for the CAE model. a) Two-dimen-
sional control space representation for j(u
1
) and T
in
(u
2
). b) Seeds and simulation results
of the output space.
2102 Research Article
Chemie
Ingenieur
Technik
sufficient information over the entire feasible area of the
output variables of interest. The comparison with a conven-
tional method for dynamic system identification, which
uses an APRBS sampling with an amplitude between the
lower and upper bounds of the defined controls (see Tab. 1),
could not be carried out, since the simulation did not con-
verge at such large changes.
5 Conclusion and Outlook
A novel methodology to adaptively sample rigorous dynam-
ic process models to generate a dataset for building a surro-
gate model is presented. The goal of the developed algo-
rithm is to cover an as large as possible area of the feasible
region of the original model. To do so multiple simulations
with a short time horizon, a fixed timestep, and different
inputs
uare carried out. In order to maximize the dynamic
information of the simulation results the here proposed
FAPRBS sampling is used to generate a dynamic trajectory
for the different inputs. In the course of the algorithm,
empty areas in the output space are identified and the cor-
responding values in the input space are esti-
mated in order to generate new data in the re-
quired area.
To demonstrate the performance and the ap-
plicability for dynamic data-driven modeling,
the presented framework is applied on a dynam-
ic model of a chlor-alkali electrolysis. It can be
shown that the generated data is sufficient for
training highly accurate recurrent neural net-
works for describing the dynamic behavior of
the defined output variables over the entire fea-
sible region.
In future work, we will focus on developing
techniques to estimate the uncertainty of the
trained recurrent neural networks to directly
identify areas in the input space where
additional data is required.
Open access funding enabled and organized
by Projekt DEAL.
Symbols used
d[–] number of selected output variables /
dimensionality of output space
d
u
[–] number of control variables /
dimensionality of input space
l[–] distance
n[–] number of results in d-ball
r[–] radius
s[–] score
t[–] time, target
u[–] mean of APRBS
U[–] input space, control variables
X[–] state variables
y[–] seed
Y[–] output space
k[–] number of considered d-balls
t[–] time
Chem. Ing. Tech. 2021,93, No. 12, 2097–2104 ª2021 The Authors. Chemie Ingenieur Technik published by Wiley-VCH GmbH www.cit-journal.com
Table 2. Parameters and results of hyperparameter tuning using Bayesian optimization.
Model Nodes hidden layer L2 penalty parameter NOof jOofT
in
Oof V
in
T
cell
119 0.046 15 12 21 8
wNaþ242 0.149 19 24 7 21
a)
b)
Figure 8. Results of comparison between test data and model prediction for
5 simulations over 110 h: a) cell temperature, b) mass fraction of sodium ions in
anolyte.
Research Article 2103
Chemie
Ingenieur
Technik
References
[1] E. Esche, J.-U. Repke, Chem. Ing. Tech. 2020,92 (12), 1898–1909.
DOI: https://doi.org/10.1002/cite.202000091
[2] L. T. Biegler, X. Yang, G. Fischer, J. Process Control 2015,30,
104–116. DOI: https://doi.org/10.1016/j.jprocont.2015.02.001
[3] J. V. Kadam, W. Marquardt, M. Schlegel, O. Bosgra, T. Backx,
P.-J Brouwer, G. Du
¨nnebier, S. de Wolf, Towards integrated
dynamic real-time optimization and control of industrial
processes, in Proc. of FOGAPO 2003 (Eds: I. E. Grossmann,
C. M. McDonald), 2003, 593–596.
[4] F. Fiedler, A. Cominola, S. Lucia, IFAC-
PapersOnLine 2020,53 (2), 16636–16643. DOI: https://doi.org/
10.1016/j.ifacol.2020.12.793
[5] Y. Tian, J. Zhang, J. Morris, Chem. Eng. Process. 2002,41 (6),
531–538. DOI: https://doi.org/10.1016/S0255-2701(01)00173-8
[6] Z. T. Wilson, N. V. Sahinidis, Comput. Chem. Eng. 2019,127,
88–98. DOI: https://doi.org/10.1016/j.compchemeng.2019.05.020
[7] R. Heese, M. Walczak, T. Seidel, N. Asprion, M. Bortz, Comput.
Chem. Eng. 2019,124, 326–342. DOI: https://doi.org/10.1016/
j.compchemeng.2019.01.007
[8] R. Isermann, M. Mu
¨nchhof, Identification of Dynamic Systems,
Springer, Berlin 2011.
[9] O. Nelles, Nonlinear system identification: From classical ap-
proaches to neural networks, fuzzy models and Gaussian processes,
Springer, Cham 2021.
[10] D. E. Rivera, H. Lee, H. D. Mittelmann, M. W. Braun, J. Process
Control 2009,19 (4), 623–635. DOI: https://doi.org/10.1016/
j.jprocont.2008.08.006
[11] T. O. Heinz, O. Nelles, at – Automatisierungstechnik 2018,66 (9),
714–724. DOI: https://doi.org/10.1515/auto-2018-0027
[12] M. Deflorian, S. Zaglauer, IFAC Proc. Vol. 2011,44 (1), 13179–
13184. DOI: https://doi.org/10.3182/20110828-6-it-1002.01502
[13] M. D. McKay, R. J. Beckman, W. J. Conover, Technometrics 1979,
21 (2), 239. DOI: https://doi.org/10.2307/1268522
[14] J. M. Hammersley, D. C. Handscomb, Monte Carlo Methods,
Monographs on Applied Probability and Statistics, Springer
Netherlands, Dordrecht 1964.
[15] I. Sobol’, USSR Comput. Math. Math. Phys. 1967,7 (4), 86–112.
DOI: https://doi.org/10.1016/0041-5553(67)90144-9
[16] T.-T. Wong, W.-S. Luk, P.-A. Heng, J. Graphics Tools 1997,2 (2),
9–24. DOI: https://doi.org/10.1080/10867651.1997.10487471
[17] U. M. Diwekar, J. R. Kalagnanam, AIChE J. 1997,43 (2), 440–447.
DOI: https://doi.org/10.1002/aic.690430217
[18] K. Crombecq, L. de Tommasi, D. Gorissen, T. Dhaene, in Proc. of
the 2009 Winter Simulation Conference (WSC), IEEE, Piscataway,
NJ 2009.
[19] V. K. Devabhaktuni, Q.-J. Zhang, in 30th European Microwave
Conference 2000, IEEE, Piscataway, NJ 2000.
[20] C. Nentwich, J. Winz, S. Engell, Ind. Eng. Chem. Res. 2019,58
(40), 18703–18716. DOI: https://doi.org/10.1021/acs.iecr.9b02758
[21] S. Olofsson, E. S. Schultz, A. Mhamdi, A. Mitsos, M. P. Deisen-
roth, R. Misener, Design of Dynamic Experiments for Black-Box
Model Discrimination, arXiv 2021. https://arxiv.org/abs/
2102.03782
[22] www.qhull.org/html/ (Accessed on June 10, 2021)
[23] C. B. Barber, D. P. Dobkin, H. Huhdanpaa, ACM Trans. Math.
Software 1996,22 (4), 469–483. DOI: https://doi.org/10.1145/
235815.235821
[24] R. S. Anderssen, R. P. Brent, D. J. Daley, P. A. P. Moran, SIAM J.
Appl. Math. 1976,30 (1), 22–30.
[25] J. Weigert, C. Hoffmann, E. Esche, P. Fischer, J.-U. Repke, Com-
put. Chem. Eng. 2021,149, 107287. DOI: https://doi.org/10.1016/
j.compchemeng.2021.107287
[26] D. P. Kingma, J. Ba, Adam: A Method for Stochastic Optimization,
arXiv 2014. https://arxiv.org/abs/1412.6980
www.cit-journal.com ª2021 The Authors. Chemie Ingenieur Technik published by Wiley-VCH GmbH Chem. Ing. Tech. 2021,93, No. 12, 2097–2104
2104 Research Article
Chemie
Ingenieur
Technik