Conference PaperPDF Available

Condition Monitoring for Confined Industrial Process Based on Infrared Images by Using Deep Neural Network and Variants

Authors:

Abstract and Figures

Some industrial processes take place in confined settings only observable by sensors, e.g. infrared (IR) cameras. Drying processes take place while a material is transported by means of a conveyor through a "black box" equipped with internal IR cameras. While such sensors deliver data at high rates, this is beyond what human operators can analyze and calls for automation. Inspired by numerous implementations monitoring techniques that analyse IR images using deep learning, this paper shows how they can be applied to the confined microwave drying of porous foams, with bench-marking their effectiveness at condition monitoring to conduct fault detection. Convolutional neural networks, derived transfer learning, and deep residual neural network methods are already regarded as cutting-edge and are studied here, using a set of conventional approaches for comparative evaluation. Our comparison shows that state-of-the-art deep learning techniques significantly benefit condition monitoring, providing an increase in fault finding accuracy of up to 48% over conventional methods. Nevertheless, we also found that derived transfer learning and deep residual network techniques do not in our case yield increased performance over normal convolutional neural networks.
Content may be subject to copyright.
Condition Monitoring for Confined Industrial Process Based on
Infrared Images by Using Deep Neural Network and Variants
Yuchong Zhang
Chalmers University of Technology
Gothenburg, Sweden
yuchong@chalmers.se
Morten Fjeld
Chalmers University of Technology
Gothenburg, Sweden
fjeld@chalmers.se
ABSTRACT
Some industrial processes take place in conned settings only ob-
servable by sensors, e.g. infrared (IR) cameras. Drying processes
take place while a material is transported by means of a conveyor
through a “black box” equipped with internal IR cameras. While
such sensors deliver data at high rates, this is beyond what human
operators can analyze and calls for automation. Inspired by numer-
ous implementations monitoring techniques that analyse IR images
using deep learning, this paper shows how they can be applied
to the conned microwave drying of porous foams, with bench-
marking their eectiveness at condition monitoring to conduct
fault detection. Convolutional neural networks, derived transfer
learning, and deep residual neural network methods are already
regarded as cutting-edge and are studied here, using a set of con-
ventional approaches for comparative evaluation. Our comparison
shows that state-of-the-art deep learning techniques signicantly
benet condition monitoring, providing an increase in fault nding
accuracy of up to 48% over conventional methods. Nevertheless, we
also found that derived transfer learning and deep residual network
techniques do not in our case yield increased performance over
normal convolutional neural networks.
CCS CONCEPTS
Computing methodologies Activity recognition and
understanding
;
Neural networks
;
Classification and
regression trees.
KEYWORDS
Condition monitoring; neural networks; fault detection; deep learn-
ing; confined industrial process
1 INTRODUCTION
Condition monitoring for a conned industrial process is crucial to
detect undesired and defective products and improve eciency by
reducing faults with production equipment. The detection and diag-
nosis of defects and faults via improved monitoring is an active eld
of research in industrial systems due to its potential for reducing
Permission to make digital or hard copies of all or part of this work for personal or
classroom use is granted without fee provided that copies are not made or distributed
for prot or commercial advantage and that copies bear this notice and the full citation
on the rst page. Copyrights for components of this work owned by others than ACM
must be honored. Abstracting with credit is permitted. To copy otherwise, or republish,
to post on servers or to redistribute to lists, requires prior specic permission and/or a
fee. Request permissions from permissions@acm.org.
IVSP 2020, March 20–22, 2020, Singapore, Singapore
© 2020 Association for Computing Machinery.
ACM ISBN 978-1-4503-7695-2/20/03. . . $15.00
DOI: https://doi.org/10.1145/3388818.3388823
maintenance costs [
23
]. Condition monitoring enables the compar-
ison between regular and erroneous scenarios by use of external
or built-in devices. The progressively increasing sophistication of
industrial equipment brings a greater likelihood of faults due to
their complexity. Therefore, appropriate and eective condition
monitoring must become mandatory in the context of data-driven
industrial processes.
Modern industrial facilities are precisely operated and profusely
instrumented, creating large quantities of process data in various
forms. Sensors deliver monitoring data at rates beyond what human
operators can analyze, leading to a growing need for automation.
Typically, experts derive features to categorize conditions including
faults , but cannot handle the sheer amount of data that must be
sorted. For example, conned industrial processes impair an oper-
ator’s ability to inspect visual conditions, and instead produce a
copious number of images and data from sensors, supporting the
need for automatic monitoring [
15
]. Techniques for monitoring and
detecting must thus be exible and ecient enough to analyse a
large amount of a great variety of data in order to deliver accurate
results [32].
Figure 1. Schematic representation of feature engineering
Figure 2. Schematic representation of feature engineering
with multiple feature transformations
Conventional “intelligent” methods of condition monitoring are
taxonomically dene d as feature engineering (Figure 1) [
15
] also
called shallow learning models [
32
] which include Articial Neu-
ral Network [
24
], Support Vector Machine [
30
], and Logistic Regres-
sion [
4
] techniques. These well-established and veried methods
have already achieved considerable success in intelligent fault de-
tection [
27
], but are not capable of tackling complicated faults with
low amounts of prior knowledge [
32
]. These traditional models also
have diculties when processing complex errors, causing undesir-
able consequences, which motivates us to explore more functional
99
Figure 3. Schematic overview of confined microwave drying of porous foams. Array of Infrared (IR) cameras are mounted on
the ceiling inside of the confined chamber
and powerful methods for feature extraction and function compu-
tation, even in multiplex settings.
To achieve this, we propose to use feature learning (Figure 2)
[
3
,
7
,
15
] instead of feature engineering. Feature learning makes
use of algorithms that create and learn features derived from raw
data, with iterative steps to continue learning from newly obtained
data [
15
]. Deep learning [
25
] is typically representative of feature
learning: It is a class of machine learning algorithms that use mul-
tiple layers of nonlinear processing units for feature extraction and
transformation [
19
]. With deep learning, every hierarchical layer
can learn from the output from the previous layer and convey the
newly-generated output to the next layer. This allows it to extract
complicated features and resolve multiple complex functions, over-
coming some of the shortcomings of existing methods. There are
many categories of representations in deep learning, such as Deep
Neural Network (DNN) [
31
], Convolutional Neural Network (CNN)
[
18
], Recurrent Neural Network (RNN) [
21
], and others. Research
has shown that CNN is favorable in the realm of computer vision
[
13
] while RNN is superior in Natural Language Processing (NLP)
[8].
Nowadays, IR imaging is widely used in industrial processes be-
cause of its excellent performance in measuring temperature dier-
ences, important when monitoring safety-critical equipment [
15
].
In a conned industrial process where temperature is a crucial
parameter for evaluating overall performance, IR cameras and IR
imaging are especially useful for monitoring process health. In this
paper, we concentrate on an industrial process called microwave
drying for porous foams, which is operated in a conned chamber
independent from the rest of the process. As illustrated in Figure 3, a
“black-box” is used to cover the foam drying process. An array of IR
cameras is installed on the ceiling of the chamber to capture the com-
plete process. Other sensors (e.g. Microwave Tomography Sensor,
Electrical Capacitance Sensor) are employed either before or after
the drying process to measure the parameters of the foam, but only
the IR cameras can observe the process taking place inside the box.
The main research question here is whether, through analysing the
IR images,a state-of-the-art model-driven deep learning technique
can in this context more eectively determine process conditions
when compared to conventional methods. The contributions of our
work are:
Deriving a method to get access to monitor the conned
industrial process.
Deploying state-of-the-art deep learning with its derived
methods and verifying their ecacy.
Making use of IR images to broaden the industrial condition
monitoring horizon.
Comparing feature learning and conventional feature engi-
neering.
The layout of this paper is as follows. Section 1 introduces the pro-
posal and describes conventional feature engineering and advanced
feature learning. Section 2 presents related work on deep learn-
ing for condition monitoring and deep learning specications. An
overview of the approaches containing the baseline model we use
is elaborated in Section 3. The results from using IR images based
on deep learning are presented in section 4, followed by discussion
and our conclusions in section 5.
2 RELATED WORK
Deep learning is one of the derivatives of machine learning algo-
rithms which use multiple layers to progressively extract higher
level features from raw input [
10
]. For instance, when conducting
object detection using images, deep learning assigns shallow layers
to extract rough features like edges while deeper layers are capable
of analyzing more precise features like shapes. DNN and CNN are
100
prominent techniques within this domain that have proven suc-
cessful over the decades. With emerging technologies being rapidly
adopted, numerous variations of the original algorithms are being
published. Transfer learning is a typical variant of deep learning
that focuses on grafting knowledge obtained from one task to a dif-
ferent but related problem [
31
]. For example, the knowledge gained
from learning to classify dogs can also be applied to classifying
cats. A conventional conceptuation of transfer learning is shown in
Figure 4. Deep residual neural network is another mutation derived
from deep learning. This is able to carry out process in the same
way as normal DNN and CNN by skipping some mutual connec-
tions to jump over certain layers, but is easier to optimize compared
to general DNN, and can gain accuracy at considerably increased
depth [
14
]. The diagram of a basic deep residual neural network is
conceptualized in Figure 5.
Figure 4. The concept of transfer learning
Figure 5. The concept of a deep residual neural network
Deep learning for condition monitoring or fault diagnosis is ubiq-
uitously employed in industry. Fenton et al. [
11
] established a re-
search overview on fault detection in electronic systems wherein
the importance of DNN was emphasized. In addition, a retrospec-
tive review of deep learning in machine health monitoring was
conducted by Zhao et al. [
33
]. They concluded that general deep
learning methods in this context are mainly from Autoencoder
and its variants, Restricted Boltzmann Machines and its variants
including Deep Belief Network (DBN), Deep Boltzmann Machines
(DBM), CNN and RNN. Janssens et al. [
15
] conducted a compre-
hensive evaluation of feature engineering and feature learning
among several industrial cases, verifying its superiority for auto-
matically determining the condition of machine health, using IR
videos. Keerthi et al. [
19
] also fed IR videos as input into CNN to
automatically extract the relevant region of an interest’s features,
and subsequently make a prediction regarding their machine’s bear-
ing oil status. Their evaluation showed that the proposed system
achieved an accuracy of 96.67%. Ma et al. [
20
] implemented a deep
Autoencoder for diagnosing faults based on images and structured
data. A novel model which completely extracted features by DNN
and conducted analysis via a hidden Markov model (HMM) was
proposed by Qiu et al. [
22
] to handle indistinguishable faults. For an
industrial rolling element bearing (REB) fault classication process,
Amar et al [
1
] gave an example of creating then training vibration
spectrum images in DNN. Likewise, for a similar REB fault detection
task, Verma et al. [
28
] developed an autoencoder using intelligent
unsupervised learning towards vibration measurements. Chen et
al. [
6
] benchmarked CNN for condition monitoring and revealed
that their approach had extraordinary performance in comparison
to peer algorithms for fault identication and classication. To
overcome the shortcomings of traditional autoencoders in intelli-
gent fault diagnosis of machines, Jia et al. [
16
] developed a local
connection network (LCN) based on a reliable dataset constructed
by a normalized sparse autoencoder (NSAE), namely NSAE-LCN,
veried its superiority throughout experimentation, in contrast to
commonly-used methodologies.
3 DEEP LEARNING FOR CONDITION
MONITORING
We develop a method to automatically monitor process conditions
within a conned microwave oven used to dry porous foams, using
images from IR cameras placed inside the oven. To construct a
dataset large and convincing enough to validate the method, every
condition must be dened accurately. Our methodology rstly ad-
dresses the issue of determining the right set of conditions, secondly
in establishing a set of benchmark networks matched with transfer
learning, thirdly in selecting a set of baseline models.
3.1 Determining Set of Conditions
Our data comes from three distinct drying processes of dierent
foams with dierent drying durations. The three examples of IR
images derived from each drying process are shown in Figure 6,
where dierent colors represent dierent temperature values. A
crucial metric – moisture level of foam – is commonly used in the
evaluation of a drying process and is introduced in our context of
condition determination. It is worth noting that the temperature
value and moisture level of the foam are inversely-proportional;
with higher temperature corresponding to lower moisture level and
vice versa.
In our experimental setting, IR videos recorded the entire drying
process, from the foam being sent into the chamber to its reemer-
gence. We obtained nearly 10000 IR images to store in our dataset.
For further support of our observations, we divided the whole
dataset into a training set and a testing set, at a proportion of 8:2.
101
Figure 6. Three example IR images from our dataset taken from process 1, 2, and 3 (left to right). Color represents
temperature, thereby also showing moisture level. We intend to define faults based on the number of different moisture
levels in a single image
The overall eight conditions dened are presented in Table. 1. These
conditions incorporate the complete information received from the
three process, although not every single process contains all of the
conditions. Thus, the predened conditions became labels either
for ground-truth indexing in the training phase or outputting in
the testing phase. Among them, conditions 7 and 8 are recognized
as faults if detected.
3.2 Benchmark Networks
As mentioned earlier, CNN has been demonstrated to be capable of
handling images. Its properties, such as local connectivity, weight
sharing, and pooling, contribute to a quick training phase with few
training parameters [
15
]. A typical CNN consists an input and an
output layer with a number of hidden layers in between. These
layers can be convolutional, pooling, or fully connected:
The convolutional layer
: Convolutional layers imple-
ment convolution operations between the input—mostly a
matrix shued from input image—and the convolutional
lter, a square matrix. In many state-of-the-art CNN archi-
tectures, an activation function called Rectied Linear Unit
(ReLU), proposed by Krizhevsky et al. [
17
], is extensively
used, mitigating suering from gradient vanishing.
The pooling layer
: Max-Pool is the most widespread op-
eration which adopts only the maximum value from every
small region from the previous output.
The fully-connected layer
: Fully-connected layers are
equal to the normal neural networks where there will be no
convolution operations.
In Figure 7, the architecture of the CNN used in our work is dis-
played. There are nine layers in our model, performing feature
extraction, parameter training, and outputting. In the hidden con-
volutional layers, the ReLU function is chosen as the activation
function for feature transformation while the Softmax [
12
] func-
tion fulls the multi-class classication task.
Secondly, we also utilize transfer learning, on the assumption that
a model trained with a designated type of image should be able
to train from a dierent type of image. Thus, we hypothesize that
transfer learning is appropriate for handling our IR images in our
context. Two well-proven transfer learning networks – VGG-16 and
VGG-19 – are used in the trials to verify our assumption. VGG-16
net has 16 weighted layers and VGG-19 holds 19 layers. The specic
application of transfer learning is as follows:
After the removal of the last layer of the whole model, we add
three dense layers we have created ourselves. The resulting
new model consists of the pretrained model and our external
output layer such as the multiple classier with Softmax
function. The principle behind this method is to use the
features and weights learned by the pretrained models to
export the desired results by an increment of self-dened
output.
Concurrently, the deep residual network is used for investigating its
utility. We choose the ResNet-50 [
14
] as a representative of residual
nets to execute the entire implementation. The ResNet-50 is a 50-
layer residual network with several shortcuts built in, where the
formal training and testing procedures would overlook them.
3.3 Baseline Models
To triangulate our research results, another four conventional fea-
ture engineering methods were also assigned as the baseline models,
to deal with the identical IR dataset as well as our benchmark net-
work processes. A concise introduction for these methodologies is
given here:
Logistic regression (LR)
: LR [
29
] is a wide-ranging
machine learning algorithm mainly used in binary classi-
cation and recognition. Statistically, its basic formation uses
a logistic function to model a binary classication problem..
Naïve Bayes (NB)
: The NB classier, belonging to prob-
abilistic classiers, has received much attention since the
1960s [
5
]. It applies Bayesian theory into practical problems
with some pre-proposed hypotheses. Common extensions
would be Gaussian naive Bayes, Bernoulli naive Bayes.
Support Vector Machine (SVM)
: SVM [
26
], a supervised
learning algorithm which is also mainly used in binary clas-
sications. SVM builds up a hyperplane as boundary for
distinguishing samples from one type to another, making
itself a kind of non-probabilistic binary linear classier [9].
Linear Discriminant Analysis (LDA)
: LDA, as nomi-
nally called, conducts a linear analytics to characterize sam-
ples into two or more categories which is widely used in
classication and pattern recognition [
2
]. In some aspects,
LDA is eligible for performing data dimensionality reduction.
4 RESULTS
Results of the implementation of training and testing sessions for
the three conned microwave drying processes using both bench-
mark networks and baseline models are shown in Figure 8. Two
predicted examples from testing phase are exhibited, as our algo-
rithms successfully classify them into condition 2 and condition 8
102
Table 1. Summary of the 8 conditions defined in our dataset. N.A. implies that the corresponding condition does not exist in
our dataset
Foam position
Condition Foam moisture level(s) Low Low + Medium Low + Medium + High
Foam entering chamber Condition 1 Condition 2 Condition 3
Foam inside chamber N.A. Condition 4 Condition 5
Foam leaving chamber Condition 6 Condition 7 Condition 8
Figure 7. The schematics of the CNN model used in our experiment
(fault). Accuracy is chosen as the metric for evaluating condition
monitoring. This widely-used criterion species the rate between
the samples correctly classied and the total samples in the dataset,
as illustrated in Equation One. Results for feature learning-based
benchmark networks and feature engineering-based baseline mod-
els are displayed in Tables 2 and 3 as well as the statistical charts
in Figures 9 and 10, while the training accuracy (exploration) and
the testing accuracy (validation) are recorded respectively.
Accuracy (%) =
The number of samples rightly classied
The number of total samples in dataset (1)
From the statistics, we can see how the four feature learning meth-
ods completely outstrip the four conventional feature engineering
methods, based on their superior accuracy acquired both in the
training and testing phases. Deep learning is thus observed to have
satisfactory eciency in analysing the IR images from the monitor-
ing of this process, supporting our research question proposed in
Section One.
In training results, it is noteworthy that all four benchmark net-
works achieve higher accuracy (nearly 100%) when compared to
the other four baseline models. Among them, CNN, VGG-16, VGG-
19 and ResNet-50 provided 100% accuracy in training process 2.
LR, NB, SVM and LDA are only slightly inferior to the benchmark
networks within the training performance for processes 1 and 2,
however, they show large dierence in accuracy (27% to 38%) when
compared to the deep learning methods employed in process 3.
Likewise, the feature learning-based benchmark networks out- per-
form the baseline models comprehensively in the testing stage.
Deep learning approaches are capable of high testing accuracy; for
example CNN provides 99.71% accuracy when validating process
three. The disparity between baselines and benchmarks is even
more exaggerated. For example, by comparison SVM provides only
52.89% of testing accuracy.
Generally, the performance of transfer learning and deep residual
neural networks is more eective than normal DNN and CNN in the
context of computer vision tasks [
14
,
15
]. However, our research
shows that transfer learning and deep residual network do not
have greater abilities in condition monitoring and prognostic fault
detection within a conned microwave drying process. In fact,
neither VGG networks nor deep residual networks can be shown
to have similar training and testing accuracy when compared to
our general CNN model in three distinct processes.
5 DISCUSSION AND CONCLUSION
With the work presented, we show that the state-of-the-art feature
learning-based tool, CNN and its variants, are advantageous for
condition monitoring in a conned microwave drying process by
using IR images. The merit of feature learning is that it conducts
iterative feature transformation, which conventional feature engi-
neering does not have, making the implementation more reliable.
In addition, we also verify that transfer learning and deep resid-
ual neural networks do not outperform the standard CNN model.
Overall, feature learning-based methods are well-qualied to pro-
vide robust monitoring in various conditions and to detect faults
in a non-visible microwave drying process, as shown in our three
distinct demonstrations. Deep learning transcends traditional con-
dition monitoring methods by a range varying from approximately
3% to 48%, according to our research.
However, there is still room to advance our approaches. First, the
volume of the database should be enlarged to more closely ap-
proximate the massive database that would be required for truly
analogous research. In addition, the thickness, as well as other phys-
ical properties, of the foam chosen in the experiment may aect the
temperature distribution, which may inuence the desired results.
To make the research more robust, a greater variety should now be
tested.
103
Figure 8. The examples of two tested IR images by our proposed methods. The left is predicted as condition 2 (foam has low
and medium moisture levels). The right is classified as condition 8 (foam has low, medium, and high moisture levels), which
is the fault condition detected
Table 2. Overview of the training accuracy (%) conducted by four benchmark networks and four baseline models for the
three confined microwave drying processes via IR images
Process
Accuracy (%) Model used CNN VGG-16 VGG-19 ResNet-50 LR NB SVM LDA
Process 1 98.98 99.59 99.25 98.78 94.89 91.91 95.17 91.63
Process 2 100.00 100.00 100.00 100.00 93.44 89.91 93.13 97.32
Process 3 99.80 99.72 99.76 99.88 72.00 63.26 72.59 61.43
Table 3. Overview of the testing accuracy (%) conducted by four benchmark networks and four baseline models for the three
confined microwave drying processes via IR images
Process
Accuracy (%) Model used CNN VGG-16 VGG-19 ResNet-50 LR NB SVM LDA
Process 1 96.93 97.28 97.03 93.89 90.00 88.75 90.00 90.31
Process 2 98.48 98.04 98.52 98.36 90.25 88.71 85.13 88.97
Process 3 99.71 99.22 99.57 99.65 56.00 57.78 52.89 60.44
Figure 9. Training accuracy data (%) for our proposed four benchmark methods and four baseline models. The dark blue bar,
light blue bar and orange bar represent processes 1, 2, and 3 respectively
104
Figure 10. Testing accuracy data (%) for our proposed four benchmark methods and four baseline models. The dark blue bar,
light blue bar and orange bar represent processes 1, 2, and 3 respectively
In future work, we will focus on improving the current work to en-
rich the dataset by acquiring a larger amount of dierent IR images
and engaging in a variety of other industrial processes. Additionally,
dening more specic process conditions will be another core task,
where the data may be analyzed in a simultaneous spacious and
temporal form.
6 ACKNOWLEDGEMENTS
This project has received funding from the European Union’s Hori-
zon 2020 research and innovation programme under the Marie
Sklodowska-Curie grant agreement No 764902.
7 REFERENCES
[1]
Muhammad Amar, Iqbal Gondal, and Campbell Wilson. 2014. Vibration spectrum
imaging: A novel bearing fault classication approach. IEEE transactions on
Industrial Electronics 62, 1 (2014), 494–502.
[2]
Suresh Balakrishnama and Aravind Ganapathiraju. 1998. Linear discriminant
analysis-a brief tutorial. Institute for Signal and information Processing 18 (1998),
1–8.
[3]
Yoshua Bengio, Aaron Courville, and Pascal Vincent. 2013. Representation
learning: A review and new perspectives. IEEE transactions on pattern analysis
and machine intelligence 35, 8 (2013), 1798–1828.
[4]
Ana M Bianco and Elena Martínez. 2009. Robust testing in the logistic regression
model. Computational Statistics & Data Analysis 53, 12 (2009), 4095–4105.
[5]
Jingnian Chen, Houkuan Huang, Shengfeng Tian, and Youli Qu. 2009. Feature
selection for text classication with Naïve Bayes. Expert Systems with Applications
36, 3 (2009), 5432–5435.
[6]
ZhiQiang Chen, Chuan Li, and René-Vinicio Sanchez. 2015. Gearbox fault identi-
cation and classication with convolutional neural networks. Shock and Vibration
2015 (2015).
[7]
Youngjun Cho, Nadia Bianchi-Berthouze, Nicolai Marquardt, and Simon J Julier.
2018. Deep Thermal Imaging: Proximate Material Type Recognition in the Wild
through Deep Learning of Spatial Surface Temperature Patterns. In Proceedings
of the 2018 CHI Conference on Human Factors in Computing Systems. ACM, 2.
[8]
Ronan Collobert and Jason Weston. 2008. A unied architecture for natural lan-
guage processing: Deep neural networks with multitask learning. In Proceedings
of the 25th international conference on Machine learning. ACM, 160–167.
[9]
Corinna Cortes and Vladimir Vapnik. 1995. Support-vector networks. Machine
learning 20, 3 (1995), 273–297.
[10]
Li Deng, Dong Yu, et al
.
2014. Deep learning: methods and applications. Founda-
tions and Trends®in Signal Processing 7, 3–4 (2014), 197–387.
[11]
William G Fenton, T.Martin McGinnity, and Liam P Maguire. 2001. Fault diagnosis
of electronic systems using intelligent techniques: A review. IEEE Transactions
on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 31, 3 (2001),
269–281.
[12]
Steven Gold, Anand Rangarajan, et al
.
1996. Softmax to softassign: Neural network
algorithms for combinatorial optimization. Journal of Articial Neural Networks
2, 4 (1996), 381–399.
[13]
Kasthurirangan Gopalakrishnan, Siddhartha K Khaitan, Alok Choudhary, and
Ankit Agrawal. 2017. Deep Convolutional Neural Networks with transfer learning
for computer vision-based data-driven pavement distress detection. Construction
and Building Materials 157 (2017), 322–330.
[14]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual
learning for image recognition. In Proceedings of the IEEE conference on computer
vision and pattern recognition. 770–778.
[15]
Olivier Janssens, Rik Van de Walle, Mia Loccuer, and Soe Van Hoecke. 2017.
Deep learning for infrared thermal image based machine health monitoring.
IEEE/ASME Transactions on Mechatronics 23, 1 (2017), 151–159.
[16]
Feng Jia, Yaguo Lei, Liang Guo, Jing Lin, and Saibo Xing. 2018. A neural network
constructed by deep learning technique and its application to intelligent fault
diagnosis of machines. Neurocomputing 272 (2018), 619–628.
[17]
Alex Krizhevsky, Ilya Sutskever, and Georey E Hinton. 2012. Imagenet classica-
tion with deep convolutional neural networks. In Advances in neural information
processing systems. 1097–1105.
[18]
Shih-Chung B Lo, Heang-Ping Chan, Jyh-Shyan Lin, Huai Li, Matthew T Freed-
man, and Seong K Mun. 1995. Articial convolution neural network for medical
image pattern recognition. Neural networks 8, 7-8 (1995), 1201–1214.
[19]
R. Rajavignesh M. Keerthi. 2018. Machine Health Monitoring Using Infrared Ther-
mal Image by Convolution Neural Network. International Journal of Engineering
Research Technology (IJERT) 6, 07 (2018), 1–5.
[20]
Yan Ma, Zhihong Guo, Jianjun Su, Yufeng Chen, Xiuming Du, Yi Yang, Chengqi
Li, Ying Lin, and Yujie Geng. 2014. Deep learning for fault diagnosis based on
multi-sourced heterogeneous data. In 2014 International Conference on Power
System Technology. IEEE, 740–745.
[21]
Tomáš Mikolov, Martin Karaát, Lukáš Burget, Jan Černock
`
y, and Sanjeev Khu-
danpur. 2010. Recurrent neural network based language model. In Eleventh
annual conference of the international speech communication association.
[22]
Jingwei Qiu, Wei Liang, Laibin Zhang, Xuchao Yu, and Meng Zhang. 2015. The
early-warning model of equipment chain in gas pipeline based on DNN-HMM.
Journal of Natural Gas Science and Engineering 27 (2015), 1710–1722.
[23]
C Ruiz-Cárcel, VH Jaramillo, D Mba, JR Ottewill, and Y Cao. 2016. Combination
of process and vibration data for improved condition monitoring of industrial
systems working under variable operating conditions. Mechanical Systems and
Signal Processing 66 (2016), 699–714.
[24]
B Samanta. 2004. Articial neural networks and genetic algorithms for gear fault
detection. Mechanical Systems and Signal Processing 18 (2004), 1273–1282.
105
[25]
Jürgen Schmidhuber. 2015. Deep learning in neural networks: An overview.
Neural networks 61 (2015), 85–117.
[26]
Johan AK Suykens and Joos Vandewalle. 1999. Least squares support vector
machine classiers. Neural processing letters 9, 3 (1999), 293–300.
[27]
Mien Van and Hee-Jun Kang. 2015. Bearing defect classication based on individ-
ual wavelet local sher discriminant analysis with particle swarm optimization.
IEEE Transactions on Industrial Informatics 12, 1 (2015), 124–135.
[28]
Nishchal K Verma, Vishal Kumar Gupta, Mayank Sharma, and Rahul Kumar
Sevakula. 2013. Intelligent condition based monitoring of rotating machines
using sparse auto-encoders. In 2013 IEEE Conference on Prognostics and Health
Management (PHM). IEEE, 1–7.
[29] Raymond E Wright. 1995. Logistic regression. (1995).
[30]
Junyan Yang, Youyun Zhang, and Yongsheng Zhu. 2007. Intelligent fault diagnosis
of rolling element bearing based on SVMs and fractal dimension. Mechanical
Systems and Signal Processing 21, 5 (2007), 2012–2024.
[31]
Jason Yosinski, Je Clune, Yoshua Bengio, and Hod Lipson. 2014. How transfer-
able are features in deep neural networks?. In Advances in neural information
processing systems. 3320–3328.
[32]
Guangquan Zhao, Guohui Zhang, Qiangqiang Ge, and Xiaoyong Liu. 2016. Re-
search advances in fault diagnosis and prognostic based on deep learning. In 2016
Prognostics and System Health Management Conference (PHM-Chengdu). IEEE,
1–6.
[33]
Rui Zhao, Ruqiang Yan, Zhenghua Chen, Kezhi Mao, Peng Wang, and Robert X
Gao. 2019. Deep learning and its applications to machine health monitoring.
Mechanical Systems and Signal Processing 115 (2019), 213–237.
106
... IPT underpins and facilitates the extraction of qualitative and quantitative data regarding the related industrial processes, which is usually visualized in various ways for people to understand its nature, measure the critical process characteristics, and implement process control in a complete feedback network [7]. Some typical representatives of IPT, such as microwave tomography (MWT) [8][9][10][11][12][13][14][15], electrical resistance tomography (ERT) [16], and electrical capacitance tomography (ECT) [17] are widely used for industrial purposes such as moisture detection [8,9], crack detection and powder flow in pipes, and flow pattern detection of granules. In our study, we concentrate on a unique industrial microwave drying process [10] which uses precise drying and heating equipment for polymer foams with the aid of MWT, as displayed in Figure 1. ...
... Some typical representatives of IPT, such as microwave tomography (MWT) [8][9][10][11][12][13][14][15], electrical resistance tomography (ERT) [16], and electrical capacitance tomography (ECT) [17] are widely used for industrial purposes such as moisture detection [8,9], crack detection and powder flow in pipes, and flow pattern detection of granules. In our study, we concentrate on a unique industrial microwave drying process [10] which uses precise drying and heating equipment for polymer foams with the aid of MWT, as displayed in Figure 1. The targets of this drying process are firstly heating the object foam placed on the conveyor belt and then detecting the post-drying moisture distribution, by adopting the MWT technique. ...
Article
Full-text available
Industrial process tomography (IPT) based process control is an advisable approach in industrial heating processes for improving system efficiency and quality. When using it, appropriate dataflow pipelines and visualizations are key for domain users to implement precise data acquisition and analysis. In this article, we propose a complete data processing and visualizing workflow regarding a specific case—microwave tomography (MWT) controlled industrial microwave drying system. Furthermore, we present the up-to-date augmented reality (AR) technique to support the corresponding data visualization and on-site analysis. As a pioneering study of using AR to benefit IPT systems, the proposed AR module provides straightforward and comprehensible visualizations pertaining to the process data to the related users. Inside the dataflow of the case, a time reversal imaging algorithm, a post-imaging segmentation, and a volumetric visualization module are included. For the time reversal algorithm, we exhaustively introduce each step for MWT image reconstruction and then present the simulated results. For the post-imaging segmentation, an automatic tomographic segmentation algorithm is utilized to reveal the significant information contained in the reconstructed images. For volumetric visualization, the 3D generated information is displayed. Finally, the proposed AR system is integrated with the on-going process data, including reconstructed, segmented, and volumetric images, which are used for facilitating interactive on-site data analysis for domain users. The central part of the AR system is implemented by a mobile app that is currently supported on iOS/Android platforms.
... Industrial process tomography (IPT) ( Fig. 1.a) is a dedicated and non-invasive imaging technique which is pervasively used in manufacturing scenarios for process monitoring or integrated control [23,35,[42][43][44]52,56,57]. It is an effective mechanism to extract complex data, visualize it and interpret it for domain users [5,20,44,54,58]. ...
Chapter
Full-text available
Industrial process tomography (IPT) is a specialized imaging technique widely used in industrial scenarios for process supervision and control. Today, augmented/mixed reality (AR/MR) is increasingly being adopted in many industrial occasions, even though there is still an obvious gap when it comes to IPT. To bridge this gap, we propose the first systematic AR approach using optical see-through (OST) head mounted displays (HMDs) with comparative evaluation for domain users towards IPT visualization analysis. The proof-of-concept was demonstrated by a within-subject user study (n=20) with counterbalancing design. Both qualitative and quantitative measurements were investigated. The results showed that our AR approach outperformed conventional settings for IPT data visualization analysis in bringing higher understandability, reduced task completion time, lower error rates for domain tasks, increased usability with enhanced user experience, and a better recommendation level. We summarize the findings and suggest future research directions for benefiting IPT users with AR/MR.KeywordsAugmented RealityIndustrial Process TomographyOptical See-through Head Mounted DisplayUser Study
... This information, for example, a 2D graph of the spectral information or a 3D reconstruction of the voxel data, if gathered interactively in real time by a related domain expert, can help in better understanding of the process [24]. Industrial tomography is found in many manufacturing units as well as in non-destructive or monitoring purposes [33][34][35][36]. Prominent examples are oil and gas [9], chemical engineering [29], seismology [17], nuclear waste processing [21], and flow inspection [20]. ...
Conference Paper
Full-text available
Augmented Reality (AR) has grown into a well-established technique with a compelling potential for interactive visualization. In spite of its clear potential, this novel tool has not yet been widely embraced as an industrial solution. In this paper, we address AR for a specific domain: industrial tomography. Within this domain, we conducted a need-finding study featuring 14 surveyed participants, each with sufficient years of experience. A systematic survey study was designed as the main body of our approach. Using this survey, we collected answers AQ1 helping us to establish findings and formulate novel insights. The study as a whole consisted of a pilot and a formal study for better robustness. Our findings uncovered the current status of AR being used in industrial tomography, and showed that the potential of AR in this domain was positively rated by the participants. Based on our findings, we present key challenges and propose potential for interdisciplinary synergies between AR and industrial tomography.
... Industrial process tomography (IPT) (Figure 1.a) is a dedicated and non-invasive imaging technique which is pervasively used in manufacturing scenarios for process monitoring or integrated controlIsmail et al. [2005], Tapp et al. [2003], Nolet [2012], Primrose [2015], Plaskowski et al. [1995], Rao et al. [2022], Zhang et al. [2020a], Zhang and Fjeld [2020]. IPT serves as an effective mechanism to extract complex data which is visualized in various representations and interpret it for domain users to comprehend the essence of the industrial processes Hampel et al. [2022], Beck et al. [2012], Yao and Takei [2017], Zhang et al. [2020bZhang et al. [ , 2021a. ...
Preprint
Full-text available
Industrial process tomography (IPT) is a specialized imaging technique widely used in industrial scenarios for process supervision and control. Today, augmented/mixed reality (AR/MR) is increasingly being adopted in many industrial occasions, even though there is still an obvious gap when it comes to IPT. To bridge this gap, we propose the first systematic AR approach using optical see-through (OST) head mounted displays (HMDs) with comparative evaluation for domain users towards IPT visualization analysis. The proof-of-concept was demonstrated by a within-subject user study (n=20) with counterbalancing design. Both qualitative and quantitative measurements were investigated. The results showed that our AR approach outperformed conventional settings for IPT data visualization analysis in bringing higher understandability, reduced task completion time, lower error rates for domain tasks, increased usability with enhanced user experience, and a better recommendation level. We summarize the findings and suggest future research directions for benefiting IPT users with AR/MR.
... Improving the quality of the visual cues will be another focus. Our research might have an impact on industry-based setups [16,21] and configurations empowering people's capabilities and efficiency [9,19]. ...
Conference Paper
Full-text available
Figure 1: Case study: book searching task with our proposed HMD AR solution.): Our AR app gives the book title stimulus for searching.): The visual cue here is a light-green blob pointing out the correct bookshelf floor where the target book is located.) Users successfully found the book with/without the visual cues. ABSTRACT Augmented reality (AR) is today becoming more widely utilized as it allows for interacting with the virtual objects. In this study, we propose an head-mounted display (HMD) AR system supporting book searching with visual cues. The visual cue is represented as a light-green blob hinting users for task completion, which significantly strengthens the overall performance. The system is implemented by using Microsoft HoloLens 2. The proof-of-concept version of the proposed solution is demonstrated in a pilot user study (n=8) comprising an experimental group (with visual cues, n=4) and a control group (without visual cues, n=4), followed by quantitative analysis of task completion time (TCT) and NASA task load index (TLX). The results show that our proposed HMD AR solution improved the task performance and had cognitive benefits for book searching tasks.
... Debonding and delamination are some of the most common defects in manufacturing, and research is being done to detect these defects using a hybrid of spatial and temporal deep learning [4]. The need for a completely automated process is further motivated in industrial processes in confined environments, that are only observable by sensors [5]. A comprehensive review of modern defect detection models is provided in [6]. ...
... Industrial process tomography (IPT), as a widely used non-intrusion imaging technique, has effectively demonstrated its high value in industrial process monitoring and product quality control [11,12]. Electrical Capacitance Tomography (ECT) is a specific mechanism to monitor processes in confined containers, with no access to the internal content of the targeted vessel [10]. ...
... Some typical types of IPT, such as microwave tomography (MWT) [Omrani et al., 2021, Yadav et al., 2020, Zhang et al., 2019, 2020a, electromagnetic induction tomography (MIT) [Soleimani et al., 2007], electrical resistance tomography (ERT) [Daily et al., 2004], and electrical capacitance tomography (ECT) [Nowak et al., 2019] are widely used for purposes such as moisture detection [Omrani et al., 2020, Yadav et al., 2021, crack detection and powder flow in pipes, and flow pattern detection of granules. In our study, we focus on a unique industrial drying process [Zhang and Fjeld, 2020] using precise drying and heating equipment with the aid of MWT, as displayed in Figure 1. The target of this drying process is detecting the International Conferences Interfaces and Human Computer Interaction 2021; ...
Conference Paper
Full-text available
Augmented Reality (AR), as a variation of Virtual Reality (VR), has been proved useful for decades. However, it is not widely utilized in most industries. To fill the gap between this technique and industrial usage, we propose a novel AR system in the context of industrial process tomography (IPT). As the pioneering AR deployment in IPT, this system offers a new solution to underpin the onsite data analysis regarding volumetric visualization. In our work, an endeavor to provide intelligent control for an industrial drying system is pursued by using microwave tomography (MWT), a breed of IPT, as an imaging modality. Here, the AR system is integrated with the MWT for post processing of its volumetric images, containing critical information of the industrial process. The core part of the AR system is implemented by an interactive mobile app that is supported on iOS/Android platforms. The overall system is generalized by four distinctive findings: interactivity, mobility, information richness, and mutual collaboration. Our proposed system opens the horizon of leveraging AR in IPT to benefit domain-related users regarding onsite data analysis and visualization.
Thesis
Full-text available
Few recent areas of research have had a more significant impact on industrial production than Machine Learning (ML). Ranging from intelligent quality inspection to fully autonomous robots and vehicles, Machine Learning in industrial applications (industrial ML) is aligned to permanently change how production systems operate. By offering powerful tools for automation and intelligent decision-making, it has become a highly sought-after technology across most industries. Applying industrial ML to various applications in the production system has also become a popular focus of recent research efforts. Several breakthrough applications and designs have further fueled the excitement for ML-empowered systems. Thus, many companies have increased their investment in this technology to drive disruptive innovation in cost-effectiveness and performance. Frequently, new scientific findings and discoveries offer new potentials for industrial ML applications. However, these applications depend on several factors, such as the deployment environment, the amount of available data, and the ML approach chosen, which makes application development a non-trivial pursuit. As a result, while ML is applied to a growing number of systems and environments in industry, many aspects of its development remain unclear. To alleviate the issues blocking further advancement in industrial ML, various research endeavors have been undertaken to better understand how ML applications work in industrial information systems, and which requirements and challenges practitioners face during development and implementation. This thesis ties into the current research and further contributes by exploring the development process, the challenges, and the value of libraries and frameworks in industrial ML applications. To this end, this thesis contains an extensive literature review and four independent studies on real-world applications of ML in the context of production at a large automobile OEM. The systematic literature review explores current research on industrial ML applications with a comprehensive quantitative and qualitative analysis. The four studies each contain a Design Science research case on the development and design of an industrial ML application. These four ML applications are 1) An Anomaly Detection System in the brownfield on a Monorail Conveyor, 2) A multi-model Quality Inspection application in Laser Beam Welding with Supervised Learning, 3) A Deep Reinforcement Learning system for fully Autonomous Assembly on industrial robots, and 4) A self-service Quality Inspection Toolkit augmented with Machine Teaching and Interactive ML functions. In these studies, the design and development process of the application is documented in detail. Furthermore, during each study, the benefits, challenges, creative application solutions, and results are reported upon, analyzed, and critically discussed. Finally, the findings across all studies are structured in the discussion, and connections between their insights are reviewed. In our literature review it is discovered that the vast majority of research on industrial applications of ML apply Supervised Learning as their primary AI approach. Unsupervised and Reinforcement Learning are used much less frequently. In our first study an Unsupervised and Semi-Supervised Anomaly Detection system for a Monorail Conveyor system is developed during ramp up that could perform well, even in the absence of labeled data. The system was able to be implemented much quicker and with fewer requirements than a comparable approach using Supervised Learning. In the second study a multi-model Quality Inspection System for Laser Beam Welding is created. The system used multiple ML architectures and algorithms to detect four distinct quality measures purely from a single source of grayscale images. It thereby outperformed the conventional methods in speed, accuracy and range of detected faults. The third study exhibits the development of a Deep Reinforcement Learning agent on an Industrial Robot that performs fully autonomous assembly of a body part. While the prototype performs well and is able to execute successful assemblies in the real world, the effort and expertise that was required during development are disproportionate to the gained value. However, given specific circumstances that require the particular strengths of DRL, it may present a viable option. In the last study Machine Teaching and Interactive Machine Learning is used to augment a self-service computer vision application. By using readable User Interfaces, explainability, and active learning users are enabled to train and evaluate their own Image Classification and Object Detection models. While also useful for evaluating production related quality inspection or detection use cases, it was especially remarkable that the users reported great enthusiasm in experiencing and better understanding modern ML applications. In this case, Transfer Learning proved to be a major enabler for fast and accessible model training. By developing industrial ML applications, it becomes apparent to us that just as innovation requires invention and commercialization, an industrial ML application requires an ML model with context and utility. In summary, this dissertation presents exploratory insights into the rapidly growing field of industrial Machine Learning applications by researching applications in context. Subsequently, the performed studies may serve as an example of how industrial ML applications can be designed, developed, and evaluated in the context of the manufacturing industry.
Conference Paper
Full-text available
Humans show different emotions in response to variant colormaps when facing visual presentations. The affect-colormap relationship thus becomes an important factor in human-in-the-loop systems. In this paper, we explore how to effectively exploit deep learning in affective colormaps within the domain of industrial tomography. Eleven pervasively used colormaps were picked as the stimuli, followed by a user study which gathered data on the human affect of each colormap as well as benchmarking our initial dataset. The affect was encoded into an emotional model over two dimensions ; valence (positive-negative), and arousal (exciting-calm). Our proposed convolutional neural network (CNN) consisting of 10 layers reached high recognition and prediction accuracy in the colormap-affect relationship. The obtained results affirmed our exploration, which could in future assist developers to construct more intelligent and reliable human-computer interaction (HCI) systems.
Conference Paper
We introduce Deep Thermal Imaging, a new approach for close-range automatic recognition of materials to enhance the understanding of people and ubiquitous technologies of their proximal environment. Our approach uses a low-cost mobile thermal camera integrated into a smartphone to capture thermal textures. A deep neural network classifies these textures into material types. This approach works effectively without the need for ambient light sources or direct contact with materials. Furthermore, the use of a deep learning network removes the need to handcraft the set of features for different materials. We evaluated the performance of the system by training it to recognize 32 material types in both indoor and outdoor environments. Our approach produced recognition accuracies above 98% in 14,860 images of 15 indoor materials and above 89% in 26,584 images of 17 outdoor materials. We conclude by discussing its potentials for real-time use in HCI applications and future directions.
Article
We introduce Deep Thermal Imaging, a new approach for close-range automatic recognition of materials to enhance the understanding of people and ubiquitous technologies of their proximal environment. Our approach uses a low-cost mobile thermal camera integrated into a smartphone to capture thermal textures. A deep neural network classifies these textures into material types. This approach works effectively without the need for ambient light sources or direct contact with materials. Furthermore, the use of a deep learning network removes the need to handcraft the set of features for different materials. We evaluated the performance of the system by training it to recognise 32 material types in both indoor and outdoor environments. Our approach produced recognition accuracies above 98% in 14,860 images of 15 indoor materials and above 89% in 26,584 images of 17 outdoor materials. We conclude by discussing its potentials for real-time use in HCI applications and future directions.
Article
Automated pavement distress detection and classification has remained one of the high-priority research areas for transportation agencies. In this paper, we employed a Deep Convolutional Neural Network (DCNN) trained on the ‘big data’ ImageNet database, which contains millions of images, and transfer that deep earning to automatically detect cracks in Hot-Mix Asphalt (HMA) and Portland Cement Concrete (PCC) surfaced pavement images that also include a variety of non-crack anomalies and defects. Apart from the common sources of false positives encountered in vision based automated pavement crack detection, a significantly higher order of complexity was introduced in this study by trying to train a classifier on combined HMA-surfaced and PCC-surfaced images that have different surface characteristics. A single-layer neural network classifier (with ‘adam’ optimizer) trained on ImageNet pre-trained VGG-16 DCNN features yielded the best performance.
Article
The condition of a machine can automatically be identified by creating and classifying features that summarize characteristics of measured signals. Currently, experts, in their respective fields, devise these features based on their knowledge. Hence, the performance and usefulness depends on the expert's knowledge of the underlying physics, or statistics. Furthermore, if new and additional conditions should be detectable, experts have to implement new feature extraction methods. To mitigate the drawbacks of feature engineering, a method from the sub-field of feature learning, i.e. deep learning, more specifically convolutional neural networks, is researched in this article. The objective of this article is to investigate if and how deep learning can be applied to infrared thermal video to automatically determine the condition of the machine. By applying this method on infrared thermal data in two use cases, i.e. machine fault detection and oil level prediction, we show that the proposed system is able to detect many conditions in rotating machinery very accurately (i.e. 95 % and 91.67 % accuracy for the respective use cases) without requiring any detailed knowledge about the underlying physics, and thus having the potential to significantly simplify condition monitoring using complex sensor data. Furthermore, we show that by using the trained neural networks, important regions in the infrared thermal images can be identified related to specific conditions which can potentially lead to new physical insights.
Conference Paper
Aiming to condition based maintenance for complex equipment, numerous intelligent fault diagnosis and prognostic methods based on machine learning have been researched. Compared with the traditional shallow models, which have problems of lacking expression capacity and existing the curse of dimensionality, using deep learning theory can effectively mine characteristics and accurately recognize the health condition. In consequence, fault diagnosis and prognostic based on deep learning have turned into an innovative and promising research field. This paper gives a review of fault diagnosis and prognostic based on deep learning. First of all, a brief introduction to deep learning architecture and the framework of fault diagnosis based on deep learning is described. Second, tracking describes the latest progress of fault diagnosis and prognostic based on deep learning in chronological order. In this section, the deep learning methods used in fault diagnosis and prognostic are discussed, including Deep Neural Network (DNN), Deep Belief Network (DBN) and Convolutional Neural Network (CNN). Then the engineering application fields are summarized, such as mechanical equipment diagnosis, electrical equipment diagnosis, etc. Finally, this paper indicates the potential future research issues in this field.
Article
Thesupport-vector network is a new learning machine for two-group classification problems. The machine conceptually implements the following idea: input vectors are non-linearly mapped to a very high-dimension feature space. In this feature space a linear decision surface is constructed. Special properties of the decision surface ensures high generalization ability of the learning machine. The idea behind the support-vector network was previously implemented for the restricted case where the training data can be separated without errors. We here extend this result to non-separable training data.High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated. We also compare the performance of the support-vector network to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.