scieee Science in your language
[en] (orig)
CONSENSORS: A Neural Network Framework
for Sensor Data Analysis
Burkhard Hoppenstedt, R¨udiger Pryss, Klaus Kammerer, and Manfred
Reichert
Institute of Databases and Information Systems, Ulm University, Germany
Abstract. Machine breakdowns in industrial plants cause production
delays and financial damage. In the era of cyber-physical systems, ma-
chines are equipped with a variety of sensors to monitor their status. For
example, changes to sensor values might indicate an abnormal behav-
ior and, in some cases, detected anomalies can be even used to predict
machine breakdowns. This procedure is called predictive maintenance,
which pursues the goal to increase machine productivity by reducing
down times. Thereby, anomalies can be either detected by training data
models based on historic data or by implementing a self-learning ap-
proach. In this work, the use of neural networks for detecting anomalies
is evaluated. In the considered scenarios, anomaly detection is based on
temperature data from a press of a machine manufacturer. Based on
this, a framework was developed for different types of neural networks
as well as a high-order linear regression approach. We use the proposed
neural networks for restoring missing sensor values and to improve over-
all anomaly detection. An evaluation of the used techniques revealed
that the high-order linear regression and an autoencoder constitute best
practices for data recovery. Moreover, deep neural networks, especially
convolutional neural networks, provide the best results with respect to
overall anomaly detection.
Keywords: Anomaly Detection ·Sensor Data Recovery
1 Introduction
In modern industrial plants, a rising number of sensors offers advanced oppor-
tunities for automatically detecting machine faults. In this context, machine-
learning methods have proven to be beneficial [1]. Although first models related
to neural networks were already introduced in the 1940s [2], their usage in an
industrial context only increased significantly during the last years. Thereby,
data storage capabilities constitute an important factor for the development of
neural networks as large training data sets usually enhance the overall outcome.
Furthermore, the developments in the field of deep learning enable a better read-
ability of neural networks. This improvement is achieved by abstracting infor-
mation into hierarchically ordered layers. In this work, we apply neural networks
techniques to an industrial data set, provided by an automotive press company.
2 B. Hoppenstedt et al.
More precisely, the data set contains temperature sensor data from three engines
in a press and the difference between engine temperatures is used to represent
anomalies (cf. Fig. 1, left). These data have been collected over a year of oper-
ation before a machine breakdown occurred. With the help of machine learning
techniques, we tackle two use cases: First, we aim to recognize anomalies. Sec-
ond, we try to recover lost (i.e., missing) sensor data. Regarding the first use
case, the results can be utilized in advanced information systems for the provi-
sion of alerts and condition monitoring. Following the idea that anomalies occur
more frequently before a breakdown, they can be used as input for predictive
maintenance. Concerning the second use case, data recovery is useful to improve
overall data quality and, therefore, enhance data analytics approaches.
Fig. 1. Extract from Measured Sensor Data (left) and Increasing Occurences of Anoma-
lies before a Breakdown (right)
2 Data Set & Evaluation
The data set stems from a machine manufacturer and consists of values from
a press line with six presses. It was recorded over a period of one year, using
a sampling rate of one measurement per minute. At the end of this year, a
breakdown of one press occurred (cf. Fig. 1, right part). To apply methods of
supervised learning, the data points are labeled. Thereby, a single data point was
marked as anomaly if any of the three engine temperature differences was greater
than 5 Kelvin (K). Then, a data section of 120 data points (i.e., each with three
temperature values) was used as the input for the neural networks. Note that
this corresponds to a 2-hour measurement. Furthermore, an input is labeled as
anomaly when it contains at least 105 single data points that are regarded as an
anomaly. Using this approach, we train the neural network to recognize anomaly
patterns. Hereby, we recognize exceeded temperature thresholds as well as the
time series structure. Next, the data set is split up into training and test data.
As the anomalies are underrepresented in the whole data set (cf. Tab. 1), test
data is randomly generated by 1% normal input data and 20% of the anomaly
input data.
The proposed framework hierarchy is shown in Fig 3. It is optimized for sensor
data and includes various types of neural networks. To calculate the precision for
CONSENSORS: A Neural Network Framework for Sensor Data Analysis 3
Table 1. Distribution of the Data Set
ID Data Label Data Set Size Percentage of Parent
1 All Data 524162
2 Normal Data 518095 98.8% of 1
3 Anomaly Data 6067 1.2% of 1
4 Training Data Normal 512915 99.0% of 2
5 Training Data Anomaly 4854 80.0% of 3
6 Test Data Normal 5180 1.0% of 2
7 Test Data Anomaly 1213 20.0% of 3
the use case Sensor Value Restoration, the mean derivation of the expected value
is used as evaluation criterion. For the use case Anomaly Detection, the number
of unrecognized anomalies (false negative) as well as the number of wrongly
discovered anomalies (false positives) are used to calculate the F1-score. These
two evaluation criteria are shown in Fig. 2.
Fig. 2. F1-Scores (left) and Mean Deviation (right)
In Fig 3, the most promising results of the framework are shown. The autoen-
coder (AE) performed well for recovering sensor data, but its variants denoising-
autoencoder (DAE) and compression-autoencoder (CAE) achieve a low F1-score.
In contrast, a fully connected neural network (FCNN) performed better for
anomaly detection than sensor value recovery. Note that the order of the best lin-
ear regression (LR) was four. Finally, convolutional neural networks are promis-
ing for anomaly detection.
3 Summary and Outlook
The presented framework performs well on the shown data set. It restores missing
temperature values with an average deviation of less than 1 Kelvin and recog-
nizes anomaly patterns reliably. Overall, the main advantages of the framework
4 B. Hoppenstedt et al.
are as follows: First, due to the hierarchical class structure, it is possible to
switch between performance and reliability. High-level networks (e.g., the au-
toencoder) need more computation time, but achieve better results. Second, the
framework is designed for sensor data and allows to select a time window to be
evaluated. However, the neural networks need high computation power and, for
supervised learning, a pre-labeled data set is required. In future work, recurrent
neural networks [3] may provide further improvements, as they include previous
decisions into a classification. Other approaches dealing with the application of
neural networks for sensor data exist. For example, [4] discusses an application
of neural networks in a motor system, where vibration signatures are analyzed
with the help of neural network agents. Next, [5] reached F1-scores up to 0.987,
using linear autoencoders to detect anomalies in labeled audio databases. How-
ever, to our best knowledge, the presented framework combined with data of a
press manufacture illustrate new insights.
Interface: ITransformator
LinearRegression
Perceptron
BasicNetwork
NeuralNetwork
Specialized
ConvolutionalNetwork
Autoencoder
Regenerating
Autoencoder
Model Info std(K) std>2k
(Distortion)
AE 0.00 0.95 11.5%
0.01 0.94 10.7%
0.05 0.93 10.5%
(Order)
LR 1 0.88 10.0%
2 0.87 9.8%
3 0.80 8.4%
4 0.79 7.9%
5 0.78 8.6%
FCNN 1.02 12.6%
Model
PER.
DAE
CAE
FCNN
CN(1,0)
CN(1,1)
CN(2,1)
CN(2,2)
F1
0.70
0.70
0.72
0.983
0.988
0.983
0.993
0.972
1 2 3
FCNN = Fully Connected Neural Network, AE = Autoencoder, PER = Perceptron, LR = Linear Regression
DAE = denoising-autoencoder, CAE = compression-autoencoder, CN = Convolutional Neural Network
Fig. 3. Framework (1), Recovery Results (2) and Anomaly Detection Results (3)
Altogether, neural networks are able to adapt to production changes by ad-
justing their weights. Moreover, they are robust to irrelevant noise, as they are
trained to discover only relevant sensor patterns. Therefore, the application of
neural networks is promising in the fields of sensor data recovery and anomaly
detection.
References
1. Zhao, R. et al.: Deep learning and its applications to machine health monitoring: A
survey. arXiv:1612.07640. (2016)
2. Schmidhuber, J.: Deep learning in neural networks: An overview. Neural networks,
61, 85-117 (2015)
3. Gugulothu, N. et al.: Predicting Remaining Useful Life using Time Series Embed-
dings based on Recurrent Neural Networks. arXiv:1709.01073. (2017)
4. Li, B. et al.: Neural-network-based motor rolling bearing fault diagnosis. IEEE trans-
actions on industrial electronics 47, 1060-1069 (2000)
5. Marchi, E. et al.: Deep recurrent neural networkbased autoencoders for acoustic
novelty detection. Computational intelligence and neuroscience (2017)