Prévia do material em texto
Expert Systems With Applications 175 (2021) 114820
Available online 4 March 2021
0957-4174/© 2021 Elsevier Ltd. All rights reserved.
Review
Machine Learning for industrial applications: A comprehensive
literature review
Massimo Bertolini a, Davide Mezzogori b,*, Mattia Neroni b, Francesco Zammori b
a Enzo Ferrari Engineering Department, University of Modena and Reggio Emilia Via P. Vivarelli, 10, 41125 Modena, Italy
b Department of Engineering and Architecture, University of Parma, Parco Aree delle Scienze, 181/A, 43124 Parma, Italy
A R T I C L E I N F O
Keywords:
Literature review
Industrial applications
Deep Learning
Machine Learning
Operation management
A B S T R A C T
Machine Learning (ML) is a branch of artificial intelligence that studies algorithms able to learn autonomously,
directly from the input data. Over the last decade, ML techniques have made a huge leap forward, as demon-
strated by Deep Learning (DL) algorithms implemented by autonomous driving cars, or by electronic strategy
games. Hence, researchers have started to consider ML also for applications within the industrial field, and many
works indicate ML as one the main enablers to evolve a traditional manufacturing system up to the Industry 4.0
level. Nonetheless, industrial applications are still few and limited to a small cluster of international companies.
This paper deals with these topics, intending to clarify the real potentialities, as well as potential flaws, of ML
algorithms applied to operation management. A comprehensive review is presented and organized in a way that
should facilitate the orientation of practitioners in this field. To this aim, papers from 2000 to date are cate-
gorized in terms of the applied algorithm and application domain, and a keyword analysis is also performed, to
details the most promising topics in the field. What emerges is a consistent upward trend in the number of
publications, with a spike of interest for unsupervised and especially deep learning techniques, which recorded a
very high number of publications in the last five years. Concerning trends, along with consolidated research
areas, recent topics that are growing in popularity were also discovered. Among these, the main ones are pro-
duction planning and control and defect analysis, thus suggesting that in the years to come ML will become
pervasive in many fields of operation management.
1. Introduction
In the new global economy, competition fosters complexity, which
directly affects manufacturing processes, products, companies, and
supply chain dynamics. Now that we are entering into the Industry 4.0
era (Lu, 2017), the new managerial paradigm is shifting from the need
for low variability, through products’ commonalities and processes’
repeatability, as advocated in the lean thinking theory (Liker, 2004),
toward the so-called mass-customization where, conversely, wide-
markets goods should be rapidly modified and re-manufactured, at
low cost, to satisfy a specific customer’s need (Coronado et al. 2004). In
this scenario, resilience, reconfigurability, and flexibility are key issues
of competitiveness, as clearly expressed by the ‘smart manufacturing’
concept, indicating a company that has the potential to fundamentally
change how products are designed, manufactured, supplied, used,
remanufactured, and eventually retired (Kusiak, 2018). Information
technology, sensor networks, computerized controls, production man-
agement software, and, more in general, the Industrial Internet of Things
(IIoT) are basic prerequisites for a company to be smart. Yet, these de-
vices alone are not enough, and a manufacturing system cannot be
considered smart, unless its overall functioning is regulated by intelli-
gent control technologies, for a quick, accurate, and reliable response to
internal and external events (Mittal et al., 2016). Furthermore, as noted
by Kusiak (2017), smart manufacturing must embrace big data and, to
this aim, information system and production management software
must be coupled and/or enriched with deep analytical skills (Waller and
Fawcett, 2013) and with learning ability (Monostori, 2003), to ensure
competitiveness and effectiveness.
Shreds of evidence also suggest that data are one of the most valuable
assets of a firm and, especially for innovative companies, big data
management is a key issue of competitiveness (Harding et al, 2006). Not
only a proper data management may help in differentiating from
* Corresponding author at: Department of Engineering and Architecture, University of Parma, Parco Aree delle Scienze, 181/A, 43124 Parma, Italy.
E-mail addresses: massimo.bertolini@unimore.it (M. Bertolini), davide.mezzogori@unipr.it (D. Mezzogori), mattia.neroni@unipr.it (M. Neroni), francesco.
zammori@unipr.it (F. Zammori).
Contents lists available at ScienceDirect
Expert Systems With Applications
journal homepage: www.elsevier.com/locate/eswa
https://doi.org/10.1016/j.eswa.2021.114820
Received 15 March 2020; Received in revised form 29 December 2020; Accepted 28 February 2021
mailto:massimo.bertolini@unimore.it
mailto:davide.mezzogori@unipr.it
mailto:mattia.neroni@unipr.it
mailto:francesco.zammori@unipr.it
mailto:francesco.zammori@unipr.it
www.sciencedirect.com/science/journal/09574174
https://www.elsevier.com/locate/eswa
https://doi.org/10.1016/j.eswa.2021.114820
https://doi.org/10.1016/j.eswa.2021.114820
https://doi.org/10.1016/j.eswa.2021.114820
http://crossmark.crossref.org/dialog/?doi=10.1016/j.eswa.2021.114820&domain=pdf
Expert Systems With Applications 175 (2021) 114820
2
competitors and gaining a competitive advantage, but companies that
use data-driven decision-making approaches have proven to easily
outperform their competitors, being, on average, 5% more productive
and 6% more profitable (McAfee et al., 2012). Unfortunately, while in
many cases companies perceive the utility of their data, often they do
not have the knowledge needed to exploit their data-silos and lack a
clear understanding of what is important to be measured. As a result, the
informative content of the data is missed, and real and valuable
knowledge gets lost (Harding et al, 2006). If so, the well-known mana-
gerial expression that ‘quality trumps quantity’ becomes true because, if
managers do not know how to select truly meaningful data easily and
rapidly, a large and detailed data warehouse can be as harmful as a total
lack of relevant information. Hence, optimizing data collection, usage
and sharing have become vital for many companies (Kusiak, 2017) and
Machine Learning (ML), a branch of Artificial Intelligence (AI) dealing
with algorithms that learn directly from the input data, is expected to
play a key role in the fulfillment of these needs. Not surprisingly, many
works (Lu, 2017; Xu et al., 2018), indicate ML as one of the main en-
ablers to evolve a traditional manufacturing system up to the Industry
4.0 level. It is worth noting that, a spike of academic interest followed
the report by Pham and Afify (2005), one of the first to have shown
potential applications of ML to operation management. From that
moment, researchers started to consider ML applications also within
industrial fields, especially for pattern and image recognition, natural
language processing, operations optimization, data mining, and
knowledge discovery (Wuest et al., 2016). Since then, as it will be
described in later sections, the number of papers published in this field
has ever increased, and the trend has been recently fueled by many
government initiatives, like Industry 4.0 (Germany), Smart Factory
(South Korea), and Smart Manufacturing (USA), calling for a radical
change in the manufacturing paradigm, based on processes’ augmen-
tation and enhancements due to Information Technologies (IT).
Especially in the last decade, the state of the art of ML techniques has
made a huge leap forward, as demonstrated by the algorithms used byautonomous driving cars or by electronic strategy games. Both tasks
were considered many years away from a practical solution (Martínez-
Díaz and Soriguera, 2018; Müller, 2002) yet autonomous driving cars
are already being tested in urban environments, and AlphaGo has
overwhelmed the world champion of the Go game (Silver, 2016).
Similarly, DeepMind recently reported the development of an AI that
successfully learned to play better than humans in many other strategy
games (Silver, et al., 2017). Furthermore, enabling technologies (i.e.,
sensors, open-source software, public datasets, computational power,
cloud services, etc.) are now mature and available at low cost and
government initiatives offer interest-free (or even non-refundable) loans
and/or fiscal incentives to support investments in IT projects.
Owing to these favorable issues, the time seems to be right to
implement ML in the industry, and indeed, according to the Gartner
Hype Cycle for Emerging Technologies (Burton & Barnes, 2017), Arti-
ficial Intelligence and especially Machine and Deep Learning have
reached the peak of inflated expectation. Nonetheless, industrial appli-
cations of these technologies are still rare and generally confined within
a small cluster of big international companies. Should this trend
continue, a ‘disillusion phase’ may follow soon and the ‘plateau of pro-
ductivity’ may never be reached. Presumably, a detrimental element of
acceptance can be found in the widespread concern that AI could
jeopardize many jobs, increasing the unemployment phenomenon as
noted by Korinek and Stiglitz (2017) and as clearly indicated in the
McKinsey report by Manyika et al. (2017). In our opinion this is a
misconception: if on the one hand, it is true that automation will replace
human labor, on the other one hand replacement will concern redundant
and repetitive tasks. Having the ability to learn representations auton-
omously, ML and especially DL models can extract knowledge directly
from raw data, freeing researchers from the expensive and time-
consuming step of feature extraction and feature engineering (LeCun
et al, 2015). Thus, it is not daredevil to assume that the most successful
implementations will be those augmenting, and assisting human deci-
sion making, freeing people from low value-added tasks.
Apart from that, one of the main barriers to pervasive industrial
adoption of ML is the lack of a clear understanding of these methodol-
ogies and the lack of awareness of what ML can and cannot do (LaValle
et al., 2011). As posed by the notorious ‘No Free Lunch Theorem’
formulated by Wolpert and Macready (1997), ML cannot solve all in-
dustrial problems and its practical adoption, as an alternative to more
mature technologies, must be carefully evaluated and pondered. Clearly,
it is important to make an informed decision, without being influenced
by the trend and the fashion of the moment. The ability to choose an
algorithm (or a subset of algorithms), suitable for a specific task or
problem, is a core competence for data analysts and/or practitioners
who want to apply ML in industrial settings, as this choice can make the
difference between failure and success. Yet, in absence of experience
and/or on previous studies of similar nature, envisioning a way to
deploy ML at the industrial level to improve business’ performances is
challenging, especially considering the vast number of algorithms (and
possible variations differentiating in terms of operating characteristics
and of complexity) that have been proposed in technical literature. Such
variety can be disorienting and misleading, and the problem is further
complicated by the lack of a repository of best use-cases, for each in-
dustry and organization. So, we believe that a systematic literature re-
view focused on the historical developments of ML for industrial
applications, may be extremely useful to highlight present and future
trends and, above all, to orient industrial practitioners in the selection
and in a more conscious use of ML techniques.
Specifically, to clarify the real potentialities, as well as potential
flaws, of ML algorithms applied in the field of operation management,
papers from 2000 to date will be reviewed and categorized in terms of
applied algorithm and application field. Insights, concerning trends and
evolutions in the subject matter will be provided, and possible future
developments will be investigated as well.
The remainder of the paper is organized as follows. Section 2 gives a
brief introduction and defines the technical lexicon that will be used in
the paper. Section 3 describes the searching methodology that led to the
identification of the set of papers that will be analyzed, in a general and
more detailed way, in Section 4. Lastly, conclusions and general remarks
will be drawn in Section 5.
2. A brief introduction of Machine Learning theory
A single definition of ML cannot be properly formulated, as this term
encompasses a multitude of different approaches taken from the field of
computer science and of multivariate statistics. Nonetheless, a good
definition can be found in Murphy (2012), who defines ML as the «set of
methods that can automatically detect patterns in data, and then use the
uncovered patterns to predict future data, or to perform other kinds of de-
cision making under uncertainty». Although very clear, this definition
gives too much emphasis on pattern recognition and decision-making
that, as important as they may be, do not cover the whole spectrum of
ML approaches and methodologies. So, more in general, we could define
ML as a set of methodologies and algorithms capable of extracting
knowledge from data, and continuously improve their capabilities, by
learning from experience (i.e., from data accumulating over time).
Please note that learning, as defined by Simon (1983), denotes a change
that makes a system more and more adaptive, enabling it to perform the
same task (or tasks drawn from the same population) more effectively
the next time.
It is also worth noting that, in many ways, ML overlaps with the so-
called Statistical Learning (SL), an important field of statistics aimed to
model and to understand complex datasets (Gareth et al., 2013). Both
ML and SL models are characterized by the ability to self-adapt (at least
to some extent), to changes in the data and/or in the environment, and
to readjust their output accordingly. This pivotal element explains the
recent increasing interest in these disciplines, as they perfectly match
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
3
the need to process and to analyze Big Data generated by the widespread
use of electronic devices, web searches, social media, and social media
marketing.
2.1. Machine Learning areas
ML is commonly divided into three broad areas, namely Supervised
Learning (SL), Unsupervised Learning (UL), and Reinforcement Learning
(RL) (Murphy, 2012), as detailed below.
2.1.1. Supervised Learning (SL)
Supervised Learning, also called predictive learning, includes many
algorithms, of which the most commons are: Neural Networks, Support
Vector Machines, Decision Trees (and their extensions, such as Random
Forests and XGBoost), Logistic Regression, and Naïve Bayes Classifiers.
Apart from implementation and operational differences, all SL methods
aim to learn a good approximation ̂f of the true mapping f from the input
vector x→ to the outputs vector y→, using information contained in a
dataset of training examples, generated either performing experiments
or through the direct observation of the phenomenon under analysis.
More precisely, the data set is built by registering, for each observedexample, the true value of the response variable y, together with the
known values of the input vector x→. The data set of examples is then
split into a ‘training’ and ‘test’ set; the first one is used to reconstruct ̂f by
iteratively minimizing a predefined cost (or loss) function, whereas the
second one is used to assess the prediction accuracy of the model, on
data that were ‘not seen’ during the training phase.
Output variables may be either categorical or continuous. In the first
case, the problem is known as a classification task, and a classic example
could be that to generate a model to detect process failures or to predict
the quality level (expressed on a categorical scale) of new production
batches, starting from a dataset containing the physical properties x→ and
the quality level y of completed production batches. Conversely, if vari-
ables are continuous, the problem is known as a regression task, and an
industrial example could be that to predict a certain physical property,
such as the thickness or the surface roughness of items processed by a
numerical control machine. In this case, the task could be traced back to
an image recognition problem, as different pictures of the manufactured
items, taken before and after the machining process, could be converted
into a vector of features, to generate the predictive variables x→.
2.1.2. Unsupervised Learning (UL)
Unsupervised Learning is concerned with unlabelled datasets, where
no ground truth is available (i.e., the output vector y→ is missing). Hence,
the goal is not to make a prediction, but rather to detect and to extract
patterns in the data, whose nature or even whose existence could be
partially or completely unknown. For these reasons, UL is sometimes
referred to as descriptive learning and it is associated with knowledge
discovery techniques.
Broadly speaking, UL could be divided into three sub-areas (Murphy,
2012): clustering, density estimation, and dimensionality reduction.
Clustering is the task of grouping a set of objects in such a way that
objects in the same group are more similar to each other than to those in
other groups. A common example is a marketing-driven need to find
groups of customers similar in terms of purchase behavior. If informa-
tion about class membership is not known, notorious algorithms, such as
Hierarchical Clustering or K-Means, can be effectively used to this scope.
Density estimation is a wide set of techniques that can be used to
discover useful properties (e.g. skewness or multimodality) or even to
generate an estimate of an unobservable underlying probability density
function, of a dataset of observed data. Rescaled histograms are the most
basic approach for density estimation, but more complex techniques can
be also be used such, as Parzen Windows and vector quantization.
Dimensionality reduction is frequently needed, especially in the case
of Big-Data analysis, as a way to compress data, without altering and/or
distorting their original informative content. Principal Component
Analysis is the classical way to perform this task, but many neural
network topologies (such as Autoencoders) can be employed too, to
learn the best-compressed representation of the original data. In a
broader sense, all Deep Learning (DL) models can be considered as a way
to capture both the hidden representation of the data and the most
relevant relationships among them. Accordingly, DL is also referred to as
Representational Learning (Bengio et al., 2013).
2.1.3. Reinforcement Learning (RL)
Reinforcement Learning differentiates from the other ML ap-
proaches, as it implements a computational approach to learn from in-
teractions with an environment (Sutton and Barto, 1998). Rather than
generating a mapping from the input to the output space, RL generates a
mapping from situations (environment state) to actions. Akin the
learning process of a person, RL does not require a pre-existing dataset
but, with the goal to learn autonomously how to make decisions, it ex-
ploits a set of agents that learn by doing, following a rewarded trial and
error approach. More precisely, the agent is free to interact with the
environment, by performing a predefined set of actions, according to a
predefined policy. Each action modifies the system’s state, and such
modification is quantified through a specific reward signal, which is sent
back to the agent. Since the objective of the agent is to maximize its total
reward, it will learn, by doing, the best reaction to each possible external
scenario, or system’s state. It is worth noting that Q-learning (Watkins,
1989) is one of the most popular reinforcement learning algorithms, in
which the agent learns actions’ values, which define the agent policy,
without the need to have an explicit model of the environment.
In addition to the reward signal, the learning process can also be
supported by a superset of supervised and/or unsupervised algorithms,
which should optimize the exploration and the exploitation of the action
space of the agent. When all, or at least a part, of the implemented superset
of algorithms are neural networks, the approach is known as Deep Rein-
forcement Learning (Li, 2017). In this regard, double Q-Learning is one of
the examples of the application of Deep Learning models to improve the
classic Q-learning algorithm (Van Hasselt et al., 2015).
Anyhow, regardless of the implementation details, the final goal of
an RL algorithm is to produce an artificial agent (or multiple agents
interacting with each other) capable to make good decisions, based on
the current state of the environment and its experience. For instance,
from an industrial perspective, RL agents could be used to automate
ordering strategies in multi-tier supply chain networks, or to update
production parameters to maximize yield keeping operating costs at a
minimum level.
3. Searching methodology
In line with the objectives of the present work, and owing to identify
trends, potentialities, and criticalities concerning the use of ML for
operation management, the review focuses on the following Research
questions (Rq):
- Rq. 1 – Which are the main application domains (i.e., industrial pro-
cesses) where ML has been successfully adopted?
- Rq. 2 – Is the trend stable or has it modified through time, starting from
2000?
- Rq. 3 – Which are the most popular ML methodologies for operation
management?
- Rq. 4 – Is it possible to identify interesting development patterns?
- Rq. 5 – Are there any criticalities in the use of ML algorithms for In-
dustrial Applications?
- Rq. 6 – Which are the least studied domains and algorithms, which could
benefit from renewed approaches?
To answer the above-mentioned questions, the whole publications’
domain was investigated following a specific search-protocol, based on
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
4
four main steps, as detailed below.
3.1. Initial query-based search
To collect as many publications as possible, in January 2020, a
keywords-based search was made on three trustable and comprehensive
scientific databases: Scopus, Web of Science, and Google Scholar. Aim-
ing to restrain the search to the papers dealing with Machine and
Reinforcement Learning for operation management, possibly with a
focus on Industry 4.0, data were filtered using the following query,
where the asterisk (*) is the ‘all’ operator.
KEY ({manufact*} OR {supply chain} OR {industry 4*})
AND ({machine learning} OR {reinforcement learning} OR {deep learning})
AND PUBYEAR ≥ 2000
AND DOCTYPE (Article)
AND (LIMIT-TO (LANGUAGE, English))
The query is reported here with a syntax similar to the one requiredby
Scopus; yet, with minor adjustments, it was used to collect papers from
Web of Science too. Conversely, papers retrieved from Google Scholar
were manually filtered, due to a binding restriction, set by the search
engine, that allows searching and filtering by title only.
Anyhow, the filter (either applied manually or automatically)
returned papers with at least a keyword belonging to the Set A =
{manufacturing, supply chain, industry 4.0} and a keyword to Set B =
{machine learning, reinforcement learning, deep learning}, provided
that all the following inclusion criteria were met:
- C1 – Studies must be either conference of journals peer-reviewed
publications (i.e., other kinds of scientific works, such as books,
patents, and Ph.D. thesis were not considered);
- C2 – Language must be English;
- C3 – Only recent studies, published starting from 2000, are
considered.
Such an extensive search returned a total of 678 publications, 370
from Scopus, 218 from Web of Science, and 90 from Google Scholar.
3.2. Search enlargement
Next, despite the high number of collected papers, to avoid possible
omissions of other relevant works, the search was enlarged using cross-
reference and citation graph analysis, as detailed next.
3.2.1. Cross-reference analysis
To enlarge the search, we considered the list of all citations found in
the original set of 678 papers. Such a list was automatically generated
leveraging on the Scopus APIs (https://dev.elsevier.com/sc_apis.html),
which allows us to retrieve all citations and their related metadata (i.e.,
keywords, abstract, authors, etc.).
Also, to exclude papers unrelated to the stream of the research herein
considered, the inclusion criteria C1, C2, and C3 were re-used to filter
the obtained citations’ list. However, the constraints imposed on the
keywords were partially relaxed, as we accepted papers with at least one
keyword belonging to set A or to set B.
By operating in this way, we obtained a list of 767 filtered citations.
3.2.2. Relevance assessment through citation graph analysis
The 767 citations and the original set of 678 papers were joined
together and used as input for Gephi©, a freeware software application
for the creation of citation networks. The simplified version of the
generated network, where only connections among the main nodes are
displayed, is shown in Fig. 1. In the network, nodes correspond to pa-
pers, and arcs indicate citations among them. More precisely, green
nodes are the source of a citation, whereas blue nodes are the papers that
received at least one citation from the other ones. Also, the nodes’ size is
an indication of importance, evaluated as the number of received
citations.
Using this relevance criterion, we decided to add to the original list
all the nodes having at least three incoming arcs in the citation graph. As
an example, let us consider the blue node labeled as A in Fig. 1. This
Fig. 1. The simplified citation networks (sources of citation in green, cited papers in blue).
M. Bertolini et al.
https://dev.elsevier.com/sc_apis.html
Expert Systems With Applications 175 (2021) 114820
5
node, whose size has been enlarged for display purposes, refers to Shiue
(2009), a work that did not belong to the original list of the collected
papers. However, the citation graph allowed us to include A in the list
too, as A is cited, and thus connected, with three relevant works that
were already part of the list. These works, namely Priore et al. (2010),
Shiue et al. (2011), and Shiue et al. (2012), correspond to the green
nodes (or citation sources) labeled as B, C, and D, in Fig. 1.
By operating in this way, the original list increased from 678 to 714
papers.
3.3. Abstract analysis and final screening of the selected works
Lastly, to refine the selection, all the abstracts were read and filtered
using three additional inclusion criteria:
- C4 – Only works with an informative abstract clearly stating the
papers’ contributions and industrial results are considered;
- C5 – Studies must be unique, copies (or very similar papers) are
removed;
- C6 – Purely theoretical or conceptual studies were not considered.
Specifically, to be included, studies should present industrial appli-
cations tested on experimental data or, at least, tested on accessible
datasets (used as a benchmark by the research community).
By doing so, mainly due to the application of criteria C4 and C6, 569
papers were considered of low operating value and were discarded,
leaving a final corpus of 147 papers. The full list of the selected papers
can be found in Tables 3a–3d of Section 4, where the papers are analyzed
in detail.
4. Systematic review
4.1. Preliminary classification
To answer the first three research questions, all papers were carefully
read and classified in terms of their:
- Application Domain (AD) – The industrial area or process considered
in the paper,
- ML Area (MLA) – The SL, UL, and RL clusters, as described in Section
2, to which the adopted algorithms belong to.
In line with the content of the articles that were collected during the
search, we tried to define clusters of comparable size containing papers
sufficiently detailed and homogeneous.
In light of this, a good compromise was reached considering the
following four ADs:
1. Maintenance Management (MM), which includes 23 papers dealing
with:
- Failure modes classification and prediction (6),
- Condition monitoring and fault detection (14),
- Downtime minimization and maintenance planning (3).
2. Quality Management (QM), which includes 53 papers dealing with:
- On-line quality control (10),
- Defects detection and classification (33),
- Image recognition for defect identification (9),
- Life cycle management (1).
3. Production Planning and Control (PPC), which includes 49 papers
dealing with:
- Performance prediction and maximization (18),
- Job scheduling and dispatching (15),
- Dynamic process control (16).
4. Supply Chain Management (SCM), which includes 19 papers dealing
with:
- Demand planning and forecasting (6),
- Inventory management (4),
- Supply chain modeling and coordination (9).
The above-mentioned classification is graphically displayed in Fig. 2,
where the distribution of the papers in terms of AD and of MLA is clearly
shown.
Please note that the histogram chart includes an additional category,
namely Engineering Design (ED), that was purposely introduced to
insert three relevant papers, in the field of technical design (Cholette
et al., 2017; Loyer et al., 2016; Stocker et al., 2019), that could not have
been put in any other category. Also, note that the sum of the bars of a
certain AD may be greater than the number of papers belonging to the
same AD. This is because, quite frequently, there are papers that use
and/or compare different methodologies to solve the same problem.
As can be seen, the number of ML applications to the industrial
problem is relevant and, most of all, in terms of the application domain,
(i.e., Research Question #1) applications are distributed fairly evenly
among the various fields of operations management. Only SCM is not yet
a much-explored domain, a fact that can be probably explained
considering that most of the time, SCM involves strategic optimization
models, requiring complex and less known approaches, such as Deep
Learning and/or Reinforcement Learning. A further discussion on this
Fig. 2. Number of papers for Application Domain (AD) and Machine Learning Area (MLA).
M. Bertolini et al.Expert Systems With Applications 175 (2021) 114820
6
topic is postponed to Section 4.4, where a detailed analysis of the
collected paper is given.
For some additional statistics, the interested reader is referred to
Appendix B, where a bibliometric analysis (in terms of journals with
most publications, authors with more citations, etc.) is provided.
4.2. Trend analysis
The trend in the number of publications, for each AD, is shown in
Fig. 3, which incontrovertibly responds to Research Question #2.
Indeed, after an initial phase of latency, in which only some pioneering
works have been occasionally published, scientific and industrial in-
terest in ML applications has exploded. Especially over the last five
years, the growing trend of publications is evident, with a very high
spike in 2019.
Concerning the evolution through time of the application areas (i.e.,
Research Question #3), a clear picture is given by Fig. 4, which shows
the evolution of the distribution on the published papers, in terms of
MLAs, for each of the four 5-year-periods from 2000 to 2019.
In line with the overall increase of published papers, the trend is
positive in each of the three MLAs, and it is particularly pronounced for
SL approaches. This is not surprising because, historically, SL methods
have always been the most studied and applied ones. Indeed, thanks to
the ground-truth information (recorded in the training data set), they
fully exploit available data, and they are also easier to interpret.
Due to the relevance of SL approaches, the trend analysis is deepened
in Fig. 5, which shows the trend of Neural Networks (NN)s, Support
Vector Machine (SVM), and Tree-Based (TB) techniques (i.e., Decision
Trees, Random Forests and Gradient Boosting), which have shown to be
the most used techniques belonging to this ML area. As it is clear from
the chart, SVM was the prominent technique until 2010, and although
its use has not faded away, lately it has been overtaken by NNs. Albeit
informally, the start of the Deep Learning era can be approximately
placed around 2010–2012, and indeed, in the last 7–8 years, the use of
Neural Network (especially of deep architectures), has been prominent.
Nonetheless, collecting and labeling data is expensive and time-
consuming, and this explains why, more recently, UL methods are
increasingly being used too. As shown in Fig. 4, although they were
Fig. 4. Time distributions of ML Areas (MLA).
Fig. 3. The trend of publications, for each Application Domain (AD).
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
7
almost absent till 2014, in the last five years they account for about 1/5
of the total, with a very rapid growth trend, as clearly highlighted by the
black line displayed in Fig. 5.
Conversely, after a modest peak toward the end of the first decade of
2000, RL has stabilized at a lower growth rate. Probably, notwith-
standing recent breakthrough developments in such areas and the
greater understanding of RL’s potentialities, the high complexity of
reinforcement learning algorithms is still a hurdle for its full acceptance
and industrial applicability.
The above-mentioned analyses are summarized in Table 1, which
shows the number of published papers in terms of MLA and AD. Please
note that, as for the histogram of Fig. 3, also the rows’ sum of the values
of Table 1, may be higher than the corresponding number of papers.
4.3. Keywords analysis
From the corpus of the investigated papers, we extracted around 350
different keywords. Getting rid of the obvious ones (e.g., Machine
Learning, Supervised Learning, Unsupervised Learning, etc.), and
combining the remaining ones by synonyms, a total of 61 basic key-
words remained. Of these, 32 concern the application domain, the other
29 refer to the adopted ML techniques.
The total count is graphically shown in Figs. 6a and 6b, where three
fictional macro-keywords, namely ‘Metaheuristics’, ‘Statistic Tech-
niques’, and ‘Neural Networks’, have been added to group similar and
recurrent items.
As can be seen, following the results reported in the previous sec-
tions, NNs and SVMs are very common, together with RL and Meta-
heuristic, that occur quite frequently too. Relatively to the application
domain, ‘Diagnosis & Fault Detection’, ‘Additive Manufacturing’, and
‘Manufacturing Processes’ are, by far, the most frequent keywords.
Immediately after, other interesting fields follow, such as: ‘Supply Chain
Management’, ‘Big Data’, ‘Intelligent Manufacturing’, ‘Production
Planning & Control’, ‘Quality Control’ and ‘Simulation’.
For more in-depth information, a Word Cloud representation of the
20 most relevant keywords is also provided in Fig. 6c. As it is evident,
there is a very good matching between the most occurring keywords and
the Applications Domains that were used to classify the investigated
papers. Apart from this rather predictable result, the presence of the
‘Intelligent Manufacturing’ is a strong indication of how important
machine learning techniques are considered to obtain a competitive
edge in the Industry 4.0 era. Lastly, it is also worth noting that the term
Fig. 5. Publication trend of papers dealing with NNs, SVM, and TB algorithms.
Table 1
Rq. 3 – Trend Analysis: results summary.
Unsupervised Learning Reinforcement Learning Supervised Learning
NNs SVM TB Other SL
Maintenance Management (23) 3 4 13 9 7 6
Failure Mode Analysis (6) 1 [–] 5 2 1 2
Condition Monitoring (14) 2 [–] 7 6 6 3
Downtime Minimization (3) [–] 4 1 1 [–] 1
Quality Management (53) 16 1 27 24 19 24
On-Line Quality Control (10) 3 1 7 4 [–] [–]
Defect Detection & Class. (33) 12 [–] 16 14 15 20
Image Recognition (9) 1 [–] 3 5 3 4
Life Cycle Management (1) [–] [–] 1 1 1 [–]
Prod. Planning & Control (49) 10 12 22 10 8 10
Performance Prediction (18) 6 2 7 4 3 7
Scheduling (16) 1 7 6 2 5 2
Process Control (15) 3 3 9 4 [–] 1
Logistic & Supply Chain (19) [–] 10 3 5 3 3
Demand Forecasting (6) [–] [–] 3 4 1 1
Inventory Management (4) [–] 5 [–] [–] [–] [–]
Modelling & Coordination (9) [–] 5 [–] 1 2 2
Engineering Design (3) [–] 1 [–] 2 [–] 1
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
8
‘neural network’ is the only one that explicitly refers to a particular ML
algorithm. This is a further indication of the prominence and importance
attached by researchers to this specific technique. However, the pres-
ence of the ‘feature extraction’ term suggest that the practice of data pre-
processing and data engineering is still common and dominant. This fact
is in partial contrast with the development and dissemination of Deep
Learning techniques that, as known, can exploit raw data, without
needing sophisticated feature extraction techniques. Although the
presence of the ‘feature extraction’ term is probably due to the older
works (that used standard ML techniques), it may also indicate a rather
immature approach to Deep Learning technique, which is still influ-
enced by the most popular approaches in the recent past.
4.3.1. Current trends and hot topics
To get a better idea of the current trends, and to give an answer to
Research Question # 4, we also organized keywords in the 3D bubble
chart of Fig. 7. Each keyword k (denoted using the same abbreviations
used in Fig. 6b) is identified with a triplet of data (age, trend, size), and it
is plotted as a sphere, with volume proportional to the size, and centrally
located at coordinates(x, y) corresponding to ‘age’ and ‘trend’,
Fig. 6a. Keywords relative to the adopted ML technique.
Fig. 6b. Keywords relative to the solution domain.
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
9
respectively.
Specifically, age, trend, and size are defined as follows:
- Size (Sk) – The total number of occurrences of k,
- Age (Ak) – The number of years since the first occurrence of k,
- Trend (Tk) – The percentage misalignment of the Centre Of Gravity
(COG) of k, as defined in Eqs. (1) and (2):
T k = ((t (COG, k) − (t n − 0.5A k)))/A k = ((t (COG, k) − t k))/A k
(1)
t (COG, k) = (
∑
(i = 1)n(s (i, k)∙t i))/(
∑
is (i, k))
= (
∑
(i = 1)n(s (i, k)∙t i))/S k (2)
where: tn is the current year, t
−
k is the midpoint of the life of k, si,k is the
number of occurrences of k at year i, and tCOG,k is the ‘temporal’ coor-
dinate of the COG of k.
Specifically, for a consolidated and stable keyword k, tCOG,k should
lay at the midpoint of its life (i.e., tCOG,k = t
−
k) and Tk should be close to
zero. Instead, a positive value of Tk indicates a keyword that is being
used more and more frequently, or that has come back into vogue, after a
period of latency. Conversely, a negative value of Tk denotes a keyword
that is out of fashion or no longer in use.
Using these metrics, five main clusters can be identified. These are:
1. Question Marks (Low Age and Negative Trend) – Recently introduced
topics, that have not got a follow-up, yet. Thermography (THER),
Cyber-Physical Systems (CPS), and Design For (D4) belong to this
category.
2. Hot Topics (Low Age and Negative Trend) – Very recent topics of
booming interest. At present, none of the keywords properly belong
to this category. Yet, Additive Manufacturing (ADD_MN), Prediction
& Prognostic (PR_PR), and Industry 4.0 (I4.0) are those who come
closest to this category. For this reason, they have been labeled as
‘new promises’.
3. Consolidated (Medium Age and Stable Trend) – Not recent topics,
which are still studied, but without the initial spike of interests.
Topics such as Supply Chain Management (SCMI), Flexible
Manufacturing Systems (FMS), Inventory Control (INV_CTRI), and
Tool Monitoring (TLL_MN) belong to this category.
4. Stars (High Age and Positive Trend) – Old and consolidated topics
that are still attracting increasing research interest. Topics such as
Diagnosis and Fault Detection (DG_FLT), Manufacturing Process
(MN_PR), Intelligent Manufacturing (INT_MN), and Big Data analysis
(BD_DM) certainly belong to this class. Probably, Simulation (SIM)
and the Internet of Things (IoT) are on their way to become stars.
5. Obsoletes (High Age and Negative Trend) – Old topics that have never
received much scientific interest and that have almost disappeared
from the technical literature. Due to the recent introduction of ML,
for operation management, no keywords can be classified as obso-
letes yet. However, Order Management (OM) and, probably, also
Feature Extraction (FT_EX) are moving toward this class.
We also note that, as indicated by the direction arrows shown in
Fig. 7, according to a standard evolutionary trajectory, question marks
should become consolidated topics, moving diagonally from the bottom
left corner to the center of the graph. However, in case of rapid success,
question marks can move vertically to reach the hot topics area and, if
Fig. 7. Topics’ evolution map measured in terms of age and trend.
Fig. 6c. Word cloud of the 20 most relevant keywords.
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
10
the growing trend continues, they can proceed straightly toward the
stars’ area. In this regard, three additional clusters, namely New
Promises, Emerging Trends, and Young Stars can be identified. The first
one contains recent topics that have already overcome the initial phase
of uncertainty and that are likely to remain of interest in the years to
come. As already noted, Additive Manufacturing (ADD_MN), Prevision
& Prognostic (PR_PR), and Industry 4.0 (I4.0) belong to this cluster. The
second one contains rather recent topics growing in popularity, which
can be expected to become consolidated or even star topics in the next
few years. Production Planning & Control (PPC) and Defect Detection
(Q_DF) and Signal Processing (SIGN_PR) are the main topics in this area.
The last one contains young and consolidates topics that are still in a
booming phase. Automation (AUT) and Process Control (Q_PR) are the
main topics in this area.
4.3.2. Gaps’ investigation
To partially explain the difference between Stars and Question
Marks, a gap analysis is provided in Table 2. Specifically, the occurrence
of each ML algorithm is reported, both for the items classified as Stars
and Question Marks.
Some interesting differences, between the two groups, are clear.
Indeed:
- Applications of Reinforcement Learning algorithms are completely
missing in the Question Marks group.
- As far as the Supervised Learning algorithms are concerned, the
number of algorithms applied by the Question Marks group is lower
and limited to the most classic and widespread techniques. Several
gaps are noted, even in case of some very common techniques, such
as NN, RF and SVM, that are little used, if not completely ignored.
- A similar gap can also be found in terms of Unsupervised Learning
techniques. The gap is particularly marked in the Quality
Management area, where Unsupervised learning is widely investi-
gated by Stars, but it is totally neglected in the Question Marks group
Is therefore evident that, in case of Stars the whole spectrum of
possible ML solutions has been tested and, to emerge in this group,
where research is almost mature, researchers have to resort to innova-
tive and frontier techniques. Conversely, concerning Question Marks,
ML applications are still a niche and only standard and consolidate ML
techniques have been tested. There is therefore room for further in-
vestigations, which could certainly lead to a positive development in all
the involved Application Domains.
4.4. Detailed analysis of selected papers
To answer to Research Questions #5 and #6, all selected papers were
analyzed in detail. For each of the four ADs defined in Section 4.1, re-
sults are summarized by providing a brief description of the papers
deemed more significant and innovative and a summary table that
highlights the main features of all the analyzed papers.
Specifically, for each paper, the following fields are quantified:
- Article – The reference to the described paper.
- Sub Area -The sub-area to which the described paper belongs to.
- # Citations – The number of obtained citations.
- Alg_Class – The class (i.e., Supervised, Unsupervised, and Reinforced
Learning) to which the algorithms used in the paper belong to.
Algorithm
- The full list of the adopted algorithms. Please note that, for reason
of space, algorithms are indicated with an acronym; the full list is re-
ported in Table A1 in the appendix section.
- Sim_Based – A Boolean field that is equal to one for the papers based
on discrete event simulation. If none of the articles belonging to a
specific AD is based on simulation, this field is not considered.
- CPS – A Boolean field that is equal to one for the papers dealing with
a Cyber-Physical System. Also in this case, if this field is missing,
none of the papers deals with a CPS.
- Goals & Approaches – A small summaryof the papers’ methodologies
and objectives.
4.4.1. Maintenance management
Maintenance management concerns administrative, financial, and
technical approaches for assessing and planning maintenance opera-
tions, on a scheduled basis. The objective is to keep assets and machines
at a full operating state, so that production proceeds effectively, and no
money is wasted due to inefficiencies.
Papers belonging to this area are listed in Table 3a, from which it is
easy to see that ML perfectly fits this area, especially within the SL
framework, for condition monitoring and failure analysis (i.e., faults
detection and classification). Indeed, the problem can be easily inter-
preted as a prediction task, where historical data are collected on the
production floor, and faulty and non-faulty events are used as ‘ground-
truth data’ against which a prediction model can be trained. NNs and
SVM are commonly used, with a total of thirteen and nine applications,
respectively. Although SVM was generally considered as the best per-
forming techniques (see for example the review by Widodo and Yang,
2007), thanks to the introduction of new sophisticated algorithms
(generally taken from the Deep Learning area), in the last decade their
popularity has started to decrease, in favor of more promising NN ap-
proaches. Most of the papers dealing with ‘Failure Mode Analysis’ employ
NNs to efficiently determine the cause of failures of both equipment and
machines. For instance, Prieto et al. (2013) proposed a novel approach
for on-line fault detection of electrical machines, which considers both
local and distributed defects. The model integrates a curvilinear
Table 2
Gap analysis of consolidated and new emerging clusters.
Question Marks Stars
ED MM PPC QM MM PPC QM
Reinforcement Learning
Deep Q Learning 1
Proximal Policy Optimization 1
Trust Region Policy
Optimization
1
Supervised Learning
Boosting 1 1 1
Decision Tree 1 3 6
Linear Discriminant Analysis 1 1
Logistic Regression OGIT 1
Linear Regression 1 1
Neighbor Based Clustering 2 2
Neural Network 1 2 10 3 13
Quadratic Discriminant
Analysis
1
Random Forests 2 2
Rough Set Algorithm 2 3
Support Vector Data
Description
2
Super Vector Machines 2 2 6 12
Unsupervised Learning
Gaussian Density Estimation 1
Gaussian Mixture Modelling 1 3
Hierarchical Clustering 1
K-Means/K-Median 1
K-Means clustering 1 3
K Nearest Neighbors 1 1 3
Local Outlier Factor 2
Non-negative Matrix Factoriz. 1
Principal Comp. Analysis 4
Parzen Windows 3
Self-Organizing Maps 1 1
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
11
Table 3a
Maintenance Management Papers.
Sub-Area Article Received
Citations
Algorithm
Classification
Main
Algorithm
Simul.
Based
Goals & Approach
Condition
Monitoring
Cho et al., (2005) 62 Supervised Learning SVM 0 Multiple sensors are used to record cutting forces and power
consumption of milling machines. Using a Super Vector
Regression, the tool breakage detection rate is increased, with a
huge impact on manufacturing performance.
Condition
Monitoring
Saxena and Saad,
(2007)
116 Supervised Learning NN, GA 0 A Genetic algorithm is coupled to a NN for feature selection and
topology search. The NN is used for fault detection of roller
bearing health monitoring. Data were collected on-site, from
three accelerometers and one acoustic sensor.
Condition
Monitoring
Kankar et al.,
(2011)
100 Supervised Learning NN, SVM 0 NN and SVM are compared on a dataset of ball bearings’ faults,
that have been pre-processed, for dimensionality reduction.
Results show that an automated diagnosis system is feasible.
Condition
Monitoring
Azadeh et al.,
(2013)
38 Supervised Learning SVM, NN, GA 0 A flexible algorithm based on an ensemble of SVM, NN, and
metaheuristics is used for condition monitoring and fault
detection. The ensemble is tested against noisy and corrupted
data of centrifugal pumps.
Condition
Monitoring
Zhang et al.,
(2015)
31 Supervised Learning SVM, ACO 0 An Ant Colony Optimization metaheuristic is applied for features
selection and hyperparameters optimization of an SVM for
intelligent fault diagnosis. The method is evaluated on a rotor
system and locomotive roller bearings.
Condition
Monitoring
Li et al., (2017) 0 Supervised Learning CNN 0 A novel fault diagnosis algorithm, leveraging on an ensemble of
Deep Convolutional NN, is presented. The algorithm is tested on
a public database of bearings’ failure data.
Condition
Monitoring
Syafrudin et al.,
(2018)
3 Supervised &
Unsupervised Learning
RF 0 A two-steps approach for fault detection is presented. First, the
DBSCAN algorithm is used to detect possible outliers, next a
random forest is used to predict possible faults.
Condition
Monitoring
Liu et al. (2018b) 2 Supervised Learning LDA,
Clustering
0 Acoustic emissions signals, collected from additive
manufacturing machines, are used to recognize different
operating states. To this aim, data are pre-processed through
LDA (both in time and frequency domains) and clustered with
unsupervised methods.
Condition
Monitoring
Hesser and
Markert (2019)
0 Supervised Learning NN 0 A programmable prototype platform, equipped with onboard
sensors, is coupled with a NN to make existing milling machines
compliant to the Industry 4.0 standards.
Condition
Monitoring
Wang et al.,
(2019)
3 Supervised Learning NN 0 A newly developed deep heterogeneous GRU model is used with
local feature extraction for long-term prediction of equipment
deterioration.
Condition
Monitoring
Li et al., (2019) 0 Supervised &
Unsupervised Learning
PCA, DT, RF,
KNN, SVM
0 A tool wearing detection framework is proposed, based on audio
signal processing. A compression stage based on PCA is followed
by a classification stage that makes use of standard ML
techniques
Condition
Monitoring
Bukkapatnam
et al., (2019)
1 Supervised Learning Balanced -RF 0 The paper introduces a non-parametric random forest
(Manufacturing system-wide Balanced RF), that takes into
account complex dynamic dependencies among parts and
failures. The approach allows a long-term prognosis of machine
breakdowns and greatly reduces prediction error.
Condition
Monitoring
Kammerer et al.,
(2019)
0 Supervised Learning DT, RF, NN 0 The work considers two data sets (taken from Industry 4.0
scenarios) and has the goal to detect sensor data anomalies. The
focus is on the collection and processing steps, whereas analysis
is performed using standard machine learning techniques.
Condition
Monitoring
Alegeh et al.,
(2019)
0 Supervised Learning SVM, DT,
KNN
0 The paper focus on the “product-service system” (PSS).
Specifically, a case study is discussed where the manufacturer of
a 5 axes gantry machine monitors the degradation of the
equipment (using sensor data) and use the analysis to offer
maintenance services.
Downtime
Minimization
Susto et al.,
(2015)
20 Supervised Learning SVM, KNN 1 A multi-classifier is proposed to optimize a cost-based
maintenance decision system. Each classifier can deal with high-
dimensional censored data and is trained with different
prediction horizons.
Downtime
Minimization
Wan et al., (2017) 2 Supervised Learning NN 0 A NN is proposed to predict the remaining lifetime of mechanical
components, subjected to specific processing conditions. Using
the NN in a big-data system, an active preventive maintenance is
developed.
Downtime
Minimization
Kuhnle et al.
(2018)
0 ReinforcementLearning
DQN, VPF,
TRPO, PPO
1 Downtime reduction and lower maintenance costs are achieved
using a Reinforcement Learning approach, based on the
Proximal Policy Optimization algorithm.
Failure analysis Prieto et al.,
(2013)
95 Supervised Learning NN 0 The paper considers 6 bearing scenarios, in 25 operating
conditions. After feature selection and dimensionality reduction
(for physical interpretation), a NN is used for the
multiclassification task.
Failure analysis Perzyk et al.,
(2014)
5 Supervised Learning DT, RST,
NBC, NN,
SVM
0 The paper shows how simple statistical methods, such as
contingency tables, may perform similarly or better, than ML
techniques in detecting the main parameters for fault diagnosis.
Failure analysis 0 Supervised Learning CNN 1
(continued on next page)
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
12
component analysis (used for dimensionality reduction) with a final
classifier based on a two-level hierarchical NN. Li et al. (2017) used an
ensemble of Convolutional Neural Networks (CNN) for bearings’ fault
diagnosis and classification. The same problem was also tackled by Sobie
et al. (2018), who used Dynamic Time Warping along with CNN’s. In
doing so, they demonstrated that a training dataset generated through
high-resolution simulations may be effectively used to integrate or even
to replace missing and/or insufficient data. This is an important
achievement because all precedent works (concerning fault detection
and classification) were highly dependent on historical data collected on
the field; a clear detrimental fact for their adoption in the industry.
Instead, unsupervised techniques are less frequent, and they are gener-
ally limited to defects’ classification, as in the work by Wu et al. (2019),
where a Self-Organizing Map, based on acoustic data, is used to cluster
filaments in terms of different failure modes.
Even in the field of ‘Condition Monitoring’ NNs are, by far, the most
applied techniques and, in this case, the most common applications
concern condition monitoring of rotating mechanical systems (Saxena
and Saad, 2007; Zhang et al., 2015) and rolling bearings (Kankar et al.,
2011). In both cases, the problem is solved using vibrations and/or
acoustic signals as classifiers inputs, for faulty and non-faulty prediction.
It is interesting to note that, to exploit the information content of the
acoustic signal, the oldest works paid close attention to feature selec-
tions, hyper-parameters optimization, and dimensionality reduction.
More recent ML techniques, instead, have eliminated part of these lim-
itations and especially Deep Learning allows an ‘As Is’ use of the original
data set, without requiring careful data pre-processing. In this regard,
Azadeh et al. (2013), proposed an ensemble of Deep NNs and SVM for
condition monitoring of centrifugal pumps and effective maintenance
management. The ensemble, optimized with a novel metaheuristic, has
been proved to be particularly resilient concerning corrupted or noisy
data. On the other side, Deep Learning requires a very massive dataset,
that is not always available. For this reason, historical data are often
enriched with additional data generated through simulation, as in Sobie
et al. (2018) and in Kuhnle et al. (2018) where an innovative approach
for downtime reduction and lower maintenance costs, is proposed based
on four different Reinforcement Learning algorithms.
Similar approaches can also be found in the field of ‘Downtime
Minimization’, as in Susto et al. (2015), who used an ensemble of SVM
and k-Nearest Neighbors to plan predictive maintenance tasks in a way
that minimizes all the costs generated by unexpected breakdowns and/
or by machine unexploited lifetime. Their interesting approach was
successfully tested on a well-known semiconductor manufacturing
maintenance problem.
4.4.2. Quality management
Quality Management, a major area within the field of operation
management, can be defined as the process of achieving and maintain-
ing a certain level of business excellence so that products and/or services
are consistent with what customers want and are willing to pay for.
From this perspective, quality management is not limited to product
and/or service compliance, but it also encompasses all the processes that
are needed to achieve the desired quality level, such as quality planning,
quality assurance, quality control, and quality improvement.
As shown by Table 3b, in the context of ML the focus is mainly on
quality assurance and quality control and, overall, the main aim is to
understand what customers want and, more in general, which are the
true drivers for better quality.
A typical example is that of quality monitoring and ‘Defects’ Detection
and Classification’, a topic that counts several applications in the elec-
tronic industry. Typically, to discriminate between defective and non-
defective items, manufacturing data are collected from sensors, PLCs,
and Manufacturing Executions Systems, and they are used as decision
variables of an ensemble of classifiers. Lenz et al. (2013) used an
ensemble of Decision Trees, NNs, and SVM to tackle a virtual metrology
problem, that is to predict the thickness of dielectric layers deposited
during the manufacturing of semiconductor wafers. Saucedo-Espinosa
et al. (2014) used sound analysis to detect defective bearings in home
appliances and showed that Random Forests are the most effective
classification techniques. Liu et al. (2017) implemented a Deep Belief
Network (a composition of Restricted Boltzmann Machines) for fault
detection and isolation and demonstrated that this peculiar network
topology can capture highly discriminative semantic features; indeed,
impressive accuracy levels, up to 100%, was obtained.
It is interesting to note that, when the aim is to detect defective items,
the so-called imbalance problem is frequently found. Indeed, this issue is
rather common when the objective is to discriminate positive events
from negative and rare ones, such as defects. A detailed discussion of this
problem can be found in Lee et al. (2016) and in Kim et al. (2018), who
compared a comprehensive set of ML classification techniques showing
that, in case of heavily unbalanced data sets, Random Forests offer the
best results. Other relevant works are those by Ye et al. (2013) and by Ko
et al. (2017). The first one proposed an ensemble of NNs and SVMs
(based on a weighted majority vote), for functional diagnosis of printed-
circuit boards. The ensemble was successfully applied to a highly un-
balanced manufacturing dataset that was artificially augmented with
synthetic data. The second work presented a framework to detect
anomalies of heavy machinery engines, based on manufacturing, in-
spection, and after-sales data. Specifically, it was shown that in the case
of unbalanced data, Gaussian Mixture Models and Parzen Window
Density Estimation are very effective, compared to other techniques
such as Principal Component Analysis or K-Means Clustering.
Besides the assessment of product compliance, ML has also been used
to implement ‘On-Line Quality Control’ systems, thus enabling more
Table 3a (continued )
Sub-Area Article Received
Citations
Algorithm
Classification
Main
Algorithm
Simul.
Based
Goals & Approach
Sobie et al.,
(2018)
Statistical methods are compared with Convolutional NN for
bearing fault classification. Data are generated from high-
resolution simulations and a novel application of Dynamic Time
Warping is also presented.
Failure analysis Liu et al. (2018a) 0 Supervised Learning,
Unsupervised Learning
NN, SVM, AE 0 A Denoising Auto-Encoder is usedto extract meaningful
representations of failure modes, and newly generated data is
compared to historical ones, using KL-divergence. The approach
emphasizes new fault modes while maintaining a dynamic and
compensatory behavior.
Failure analysis Ren et al., (2018) 4 Unsupervised Learning AE DNN 0 To predict the remaining useful life of a rolling bearing, a Deep
Auto-Encoder and a Deep Neural Network are used. Specifically,
they are coupled with a novel eigenvector-based method and can
accurately reproduce the bearings’ degradation process.
Failure analysis Wu et al., (2019) 5 Unsupervised Learning SOM 0 The paper proposes a data-driven monitoring method, based on
acoustic emissions, for online process failure diagnosis of fused
filament fabrication. Specifically, the diagnosis of different
failure modes is formalized using a self-organizing map.
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
13
Table 3b
Quality Management Papers.
Sub-Area Article Received
Citations
Algorithm
Classification
Main Algorithm Simul. Based Goals & Approach
Defect
Detection
Kusiak and
Kurasek, (2001)
47 Supervised Learning RST, DT 0 Data mining techniques are used to identify the cause
of solder-ball defects, in circuit board manufacturing.
The Rough Set algorithm is used because it can provide
explicit rules, in contrast with NN or Linear
Regression.
Defect
Detection
Kim et al., (2012) 16 Supervised &
Unsupervised
Learning
GDE, GMM, PW,
KMC, SVM, PCA
0 Aiming to detect faulty wafers, 7 different ML
algorithms, and 3 dimensionality reduction methods
are used.
Defect
Detection
Çaydaş and Ekici
(2010)
48 Supervised Learning SVM, NN 0 SVM and NN are compared to estimate the surface
roughness of stainless steel. SVM shows the best
performances, but the NN is very shallow, and only
three input variables are used.
Defect
Detection
Ye et al., (2013) 36 Supervised Learning SVM, NN 0 An ensemble of NN and SVM, based on majority
voting, is applied both to defects detection (of three
complex boards) and to propose repair suggestions.
Defect
Detection
Lenz et al., (2013) 1 Supervised Learning DT, NN, SVM 0 Using 27 features from process data, Decision Trees,
NN, and SVM are compared for predicting the
thickness of dielectric layers in a semiconductor
manufacturing scenario.
Defect
Detection
Tan et al., (2015) 13 Supervised Learning Evolutionary NN 0 An evolutionary Neural Network is applied to an
imbalanced data set (of semiconductor
manufacturing) for defect detection. Based on the
adaptive resonance theory, it combines a fuzzy set and
stability-plasticity characteristic. It is benchmarked
against other cost sensitive NN and non-cost sensitive
ML algorithms.
Defect
Detection
Adly et al., (2015) 5 Supervised Learning SVM, NN 0 A novel regression algorithm is introduced and
compared to state-of-the-art ML methods for the
identification of defects in wafer manufacturing.
Results show comparable performance, with the
benefit of a reduced computational footprint.
Defect
Detection
Gao et al., (2016) 4 Unsupervised
Learning
NMF 0 A sparsity-adaptive sparse non-negative matrix
factorization is proposed to detect defects in an
unsupervised way, without requiring manual selection
of specific frequencies. Experimental tests are made on
metal manufacturing data.
Defect
Detection
Lee et al., (2016) 0 Supervised Learning SVM, DT, Bagging,
Boosting, RF, KNN
0 The performance of three sampling-based algorithms,
four ensemble algorithms, four instance-based
algorithms, and two support vector machine
algorithms are compared to effectively tackle the
imbalance problem for the development of high-
performance fault detection systems.
Defect
Detection
Mohammadi and
Wang, (2016)
0 Supervised Learning SVM 0 Based on data collected throughout an abrasion-
resistant material manufacturing process, product
quality prediction of burned balls is achieved using
Support Vector Machine.
Defect
Detection
Saucedo-Espinosa
et al., (2017)
1 Supervised Learning SVM, NN, NBC,
KNN, DT
0 Home appliances with defective embedded bearings
are detected using ML algorithms for sound signals
analysis. Results show that intuitive and simple
methods yield high performance.
Defect
Detection
Ko et al., (2017) 0 Supervised &
Unsupervised
Learning
GMM, PW, LOF, K-
MEANS, PCA, k-
PCA, SVDD
0 A novel method for feature extraction is proposed for
the manipulation of multidimensional time-series
data. Specifically, the method is tested on after-sales
data of heavy machine engines.
Defect
Detection
Tušar et al., (2017) 0 Supervised Learning DT, RF 0 A quality prediction framework based on machine
vision, Decision Tree-based algorithms, and
evolutionary optimization algorithms are studied in
terms of overfitting problems, and authors show that,
in some cases, over-optimization leads to overfitting.
Defect
Detection
Liu et al., (2017) 0 Unsupervised
Learning
RBM 0 A Deep Belief Network is employed to capture
different semantic representations of the voltage signal
for fault detection and isolation system. The method
proved to be superior to traditional feature extraction
methods.
Defect
Detection
Kim et al., (2018) 4 Supervised Learning DT 0 The paper deals with defect detection and focuses on
the imbalance problem. Using a die-cast data set, it is
shown that the AdaC2 algorithm, a cost-sensitive
Decision Tree algorithm, outperforms other classifiers
in case of unbalanced data.
Defect
Detection
Khanzadeh et al.,
(2017)
7 Unsupervised
Learning
SOM 0 A Self-Organizing Map is employed for measuring
geometric accuracy, with fewer data and avoiding the
need to define custom landmark features. Identified
(continued on next page)
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
14
Table 3b (continued )
Sub-Area Article Received
Citations
Algorithm
Classification
Main Algorithm Simul. Based Goals & Approach
clusters correspond to specific types of deviation from
the ideal shape.
Defect
Detection
Manohar et al.,
(2018)
1 Unsupervised
Learning
SS, PCA 0 ML is employed to learn, from past data, the
distribution of shim gaps in aircraft assembly. ML is
coupled with optimized sparse sensing to gather new
data. Also, Robust Principal Component Analysis is
used for dimensionality reduction.
Defect
Detection
Khanzadeh et al.,
(2018)
12 Supervised Learning DT, KNN, SVM, LDA,
QDA
0 ML algorithms are used to regress defect occurrence
from melt pool characteristics, in additive
manufacturing. DT shows the lowest type II error,
while KNN achieves the highest accuracy. The
combination of a morphological model with
supervised learning techniques outperforms state-of-
the-art results.
Defect
Detection
Zhu et al., (2018) 3 Supervised Learning Gauss. PR 0 A multi-task Gaussian Process is employed to analyze
in-plane geometric deviations from an additive
manufacturing process to estimate geometric
deviation.
Defect
Detection
Carvajal Soto et al.
(2019)
3 Supervised Learning GrB NN 1 A Multi-layer Perceptron, a Random Forest, and a
Gradient Boosting algorithm are applied to build a
real-time online failure identification solution.
Decision Tree-based methods outperform the NN,
mainly due to data unbalance.
Defect
Detection
Peres et al., (2019) 2 Supervised Learning NBC, KNN, XGB, RF,
SVM
0 Different methods are compared to recognize productdimensional variability, for defect detection in a real
automotive multistage assembly line.
Defect
Detection
Stoyanov et al.,
(2019)
0 Supervised Learning SVM 0 SVM is employed for failure testing in electronics
manufacturing. The objective is to develop an
intelligent optimization of the tests’ sequence and a
reduced number of tests.
Defect
Detection
Chen et al., (2019),
Chen et al. (2019b)
0 Supervised Learning NN, SVM 0 NN and SVM are compared for automatic detection
and classification of welding defects. Applied to a
dataset of galvanized steel sheets, NN outperformed
SVM.
Defect
Detection
Kim and Kang,
(2019)
0 Supervised Learning NN, DT, KNN 0 NN, DT, and KNN are compared for defect detection,
using data set containing irrelevant variables. KNN
shows the maximum degradation, while DT is more
resilient.
Defect
Detection
Ruiz et al., (2019) 0 Supervised Learning KNN, RF, NN 0 Three methodologies are compared to detect breakage
during the drawing of steel. The imbalance problem is
tackled using different techniques (under-sampling,
oversampling, SMOTE).
Defect
Detection
Caggiano et al.,
(2019)
3 Supervised Learning CNN 0 A Deep Convolutional NN is used for online defects
detection. Specifically, the NN is trained to analyze in-
process images of Selective Laser Melting
manufacturing process.
Defect
Detection
Tsutsui and
Matsuzawa, (2019)
1 Supervised Learning DNN 0 Deep Learning models are applied to Optical Emission
Spectroscopy for predicting measurements of ongoing
semiconductor process. The proposed network
topology outperforms standard models for image
analysis.
Defect
Detection
Imoto et al., (2019) 0 Supervised Learning CNN, TL 0 A Convolutional NN, trained on a real semiconductor
fabrication dataset, is used for defect classification,
based on the analysis of electron microscope images.
To reduce the amount of data of the training step a
Transfer Learning algorithm is also used.
Defect
Detection
Oh et al. (2019b) 0 Supervised Learning ASVM 0 The paper presents a framework for on-line-quality
control of a sunroof assembly line. Thanks to an
iterating loop between a data pre-processing module
and an SVM learning module, the defect classifier
continuously learns from past experiences.
Defect
Detection
Yacob et al., (2019) 2 Supervised &
Unsupervised
Learning
SVM, DT, KNN 0 The aim is to detect parts’ anomalies, based on surface
characteristics, and categorize them as systematic and
random variations. To reduce the number of physical
parts needed to train the models, also digital twins
(Skin Model Shapes) are used. This has the additional
benefit of avoiding biases and unbalancing problems.
Defect
Detection
Papananias et al.,
(2019)
2 Supervised Learning Bayesian R., ANOVA 0 The paper develops a probabilistic model, based on
Bayesian linear regression, for flatness tolerance
evaluation. Two case studies demonstrate the
effectiveness of the probabilistic model.
Defect
Detection
Saqlain et al.,
(2019)
5 Supervised Learning NN, LogR, GrB, RF 0 The paper proposes a soft voting ensemble classifier
with multi-types features, to identify wafer map defect
patterns in semiconductor manufacturing. Four
classifiers are used, and results are combined assigning
(continued on next page)
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
15
Table 3b (continued )
Sub-Area Article Received
Citations
Algorithm
Classification
Main Algorithm Simul. Based Goals & Approach
higher weights to the classifiers with higher prediction
accuracy.
Defect
Detection
Iqbal et al., (2019) 3 Supervised Learning AE, Clustering 1 The paper presents a novel approach for automated
Fault Detection and Isolation. Deep Auto Encoders
coupled with other ML tools (hierarchical clustering
and Markov Chains) model the spatial/temporal
patterns found in the data and successfully diagnose
and locate multiple classes of faults.
Image
Recognition
Ravikumar et al.,
(2011)
28 Supervised Learning NBC, DT 0 A Decision Tree and a Naïve Bayes classifier are
compared relative to an image classification task, for
automated visual inspection. Feature pre-processing is
performed to generate images’ histogram features
used as input for the classifiers.
Image
Recognition
El-Bendary et al.,
(2015)
10 Supervised &
Unsupervised
Learning
LDA, PCA, SVM 0 The aim is to classify tomato ripeness based on their
color. SVM, Linear Discriminant Analysis, and
Principal Components Analysis are combined and
tested on a sample of 250 images. Results are validated
with 10-fold cross-validation.
Image
Recognition
Chen et al., (2016) 2 Supervised Learning SVM 0 Aiming to enhance yield and to reduce defect rate, an
automatic optical inspection system is proposed. The
system makes use of an SVM classifier, which is
strengthened by a similarity approach capable to
reduce the number of false alarms.
Image
Recognition
Yang et al., (2018) 2 Supervised Learning CNN 0 A Convolutional Neural Network, coupled with a
three-point circle fitting method, is used for automatic
aperture detection of LED cups.
Image
Recognition
Gobert et al.,
(2018)
5 Supervised Learning SVM 0 To enable in-process re-melting and defects correction
(of an additive manufacturing process), an in-situ
defect detection protocol is proposed. Using SVM,
digital single-lens images are pre-processed and
classified, with an accuracy rate of around 80%.
Image
Recognition
Yuan et al., (2018) 2 Supervised Learning CNN 0 A Convolutional NN is trained (in a supervised
fashion) to analyze 10 ms video clips of laser powder
additive manufacturing. The CNN can predict LPBF
track widths and track continuity, from in situ video
data.
Image
Recognition
Scime and Beuth,
(2018)
1 Supervised Learning CNN 0 The input layer of a Convolutional NN is modified to
allow the NN to learn the appearance of the powder
bed anomalies and key contextual information with
the scale-invariance property. This alteration improves
accuracy and mitigates human biases.
Image
Recognition
Penumuru et al.
(2019)
0 Supervised Learning SVM, DT, RF, LogR,
KNN
0 Alternative methodologies are compared in the
recognition of metallic materials from surface images.
The robustness of the classifiers is checked for various
camera orientations, illuminations angle, and focal
length.
Image
Recognition
Scime and Beuth,
(2019)
12 Supervised &
Unsupervised
Learning
SIFTS, SVM 0 The goal is to detect keyholing porosity and balling
instabilities in laser powder bed fusion additive
manufacturing. A scale-invariant description of the
melting pool morphology is constructed applying the
“Bag-of-Words” technique to features extracted using
Scale Invariant Feature Transforms. SVM is then
applied to classify the observed melt pools.
Life cycle
Management
Jennings et al.,
(2016)
2 Supervised Learning RF, NN, SVM 0 The aim is to predict the obsolescence risk level at a
certain stage of the lifecycle of a device. Specifically,
NN, Random Forests, and SVM are compared, to
partition “active” and “obsolete” smartphones.
Online Quality
Control
Ribeiro, (2005) 47 Supervised Learning SVM, NN 0 The work compares NN and SVM to predict product
quality using process’ data. Using the real-data of a
molding injection process, the paper shows that both
methods can quickly react to unexpected disturbances.
Online Quality
Control
Lin et al., (2011) 16 Supervised Learning SVM, NN 0 Support Vector Machine and Neural Networks are
compared to effectively classify seven different control
charts patterns for specific causes. SVM results less
prone to overfitting and more robustto background
noise.
Online Quality
Control
Wuest et al. (2013) 16 Supervised &
Unsupervised
Learning
HC SVM n.a.
(theoretical)
Hierarchical Clustering and SVM are used to analyze
multidimensional data (of the product’s state along the
whole manufacturing process) and to trigger
corrective actions if needed.
Online Quality
Control
Yang and Zhou,
(2015)
2 Supervised Learning NN, LVQ 0 This study proposes a NN, ensemble-enabled,
autoregressive, and coefficient-invariant control chart
patterns recognition model. Each NN is trained to
recognize CCP with a specific autoregressive
(continued on next page)
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
16
flexible processes with the ability to automatically take corrective ac-
tions as soon as possible. For instance, Wuest et al. (2013), proposed a
hybrid approach, namely Semi-Supervised Learning, to estimate the
product’s state along its manufacturing process. At the first stage, an
unsupervised hierarchical clustering is used to label the records of the
training set. Next, labeled data are processed by a supervised layer,
based on SVM, which performs the final classification. In this way, the
need for a manually labeled data set is avoided, with great benefit in
terms of development time and classification accuracy. Similarly,
Nakata et al. (2017) proposed a Big-Data based, long-term automated
monitoring system of micro-conductors manufacturing, which allowed
production engineers to obtain significant yield enhancements.
More recently, visual quality inspection, supported by automated
‘Image Recognition’, is emerging as a promising field for defects identi-
fication and classification. For instance, Chen et al. (2016) used an
automatic optical inspection system, based on an SVM classifier with a
similarity approach, to reduce the false alarm rate (of defect classifica-
tion) in the production of CMOS image sensors. El-Bendary et al. (2015)
proposed the application of machine learning techniques to assess to-
mato ripeness. Posed as a multi-class classification task, the problem was
solved with a hybrid classifier (based on SVM and Linear Discriminant
Analysis), supported by Principal Component Analysis for feature
extraction. Other interesting works concern the use, increasingly com-
mon, of CNNs for image recognition and visual control, as in the work by
Yuan et al. (2018) and Scime and Beuth (2018), who used this neural
network topology to detect anomalies in laser powder additive
manufacturing.
Other interesting applications propose integrating ML and statistical
control charts, to understand if drift in operating parameters is taking
place. Notable examples can be found in Lin et al. (2011) and in Yang
and Zhou (2015), who used an ensemble of NNs to handle
autocorrelated data in control chart patterns. The model can detect up to
seven types of unnatural patterns and drifts and can be used by quality
managers to promptly identify the root causes of processes’ anomalies.
4.4.3. Production Planning & Control (PPC)
As noted in Section 4.3, in terms of ML applications, Process Planning
and Control is an emerging trend that is attracting much academic and
industrial interest in the last decade. Mainly, it includes all the activities
that are needed to manage a manufacturing process and to improve its
operating performance; as shown in Table 3c, ‘Performance Prediction
and Optimization’ is the most studied problem.
For instance, Arredondo and Martinez (2010) proposed an RL
approach based on Local Weighted Regression, to implement an order
acceptance policy, similar to Workload Control. In particular, jobs can
either be put in a rejection or in an acceptance set and, in this way, the
average revenues can be maximized relative to the installed capacity.
Doltsinis et al. (2012) used RL for production ramp-up optimization. To
this aim, they formulated the problem as a sequence of technical de-
cisions needed to progress the system toward the desired steady state, in
the shortest amount of time. Other interesting contributions can be
found in Li et al. (2016), who combined Q-Learning and SVM to reduce
the electricity consumption of an automated manufacturing system, and
in Agarwal et al. (2019), who used an autoencoder to find the best set of
process parameters for optimizing process productivity and profit.
‘Scheduling’ is the second most studied problem. This topic has al-
ways attracted a lot of industrial and academic interest, not only for its
immediate practical implications but also because it is extremely chal-
lenging from a research perspective. Indeed, scheduling problems are
known to be NP-hard (almost ever) and, for this reason, they create a
fertile ground for the application of novel ML algorithms. One of the first
works is that by Priore et al. (2001), who studied the implementation of
Table 3b (continued )
Sub-Area Article Received
Citations
Algorithm
Classification
Main Algorithm Simul. Based Goals & Approach
coefficient. The outputs are combined through
Learning Vector Quantization.
Online Quality
Control
Nakata et al.,
(2017)
0 Supervised Learning,
Unsupervised
Learning
CNN, K-means 0 A Convolutional NN is applied to classify wafers’
failure map patterns. It is integrated into a three-stage
automated monitoring system fed with real-time
massive manufacturing data.
Online Quality
Control
Zhang et al.,
(2019a), Zhang
et al. (2019b)
0 Supervised Learning LSTM 0 A Long-Short Term Memory NN takes as input
temperature and vibration data of an additive
manufacturing process and predicts the tensile
strength of the manufactured item. Layer-wise
Relevance Propagation is used to assess parameters’
influence.
Online Quality
Control
Oh et al. (2019a) 1 Supervised Learning SVM 0 A cost-effective SVM is used for online QC of a
manufacturing process. The SVM incorporates
inspection-related expenses and error types and is
tested against an automotive door-trim manufacturing
process. Design of Experiment is carried out to perform
sensitivity analysis.
Online Quality
Control
Zhu et al., (2019) 2 Reinforcement
Learning
QLrn, TS 0 Acoustic emissions sensing, through fiber brag grating,
is coupled with Q-Learning and Taboo Search for
quality monitoring of an additive manufacturing
process.
Online Quality
Control
Yu (2019) 0 Supervised Learning SDAE 0 An enhanced stacked denoising autoencoder (ESDAE),
with manifold regularization, is used for wafer map
pattern recognition (WMPR). The approach, which can
be used for on-line detection of map defects, has been
successfully validated using a real-world wafer map
dataset.
Online Quality
Control
Yu et al., (2019b) 4 Supervised &
Unsupervised
Learning
SDAE 1 A Stacked Denoising Autoencoder is used for pattern
recognition. SDAE denoises the input signal and
extracts the important features used as input of a final
classification layer. SDAE layers are trained in an
unsupervised way, whereas the regression is fine-
tuned with a supervised approach. By doing so SDAE
greatly improves its generalization performance and
can learn more robust and compact features.
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
17
Table 3c
Production Planning and Control Papers.
Sub-Area Article Rec.
Cit.
Algorithm Class. Main Alg. Sim.
Based
Cyber
Ph. Sys.
Goals & Approach
Performance
Prediction
Arredondo and
Martinez, (2010)
20 Reinforc.
LearningLWR 1 0 A locally weighted regression is used to learn the value of
accepting or rejecting a production order. The approach
maximizes the average revenue obtained per unit cost of the
installed capacity.
Performance
Prediction
Meidan et al., (2011) 19 Superv.
Learning
NBay 1 0 Using Conditional Mutual Information Maximization, a
selective Naïve Bayesian Classifier is used to select the most
discriminative features for cycle time prediction.
Performance
Prediction
Doltsinis et al.,
(2012)
2 Reinforc.
Learning
QLrn 0 0 A Q-Learning algorithm supports decision-making during
production ramp-up. It significantly reduces the time needed to
reach a stable state.
Performance
Prediction
Duan et al., (2015) 0 Superv.
Learning
SVM, DT,
Bay.R
0 0 Based on production capacity and orders’ properties and
requirements, a tree-based classifier is used to accept or reject
incoming orders, to maximize profit.
Performance
Prediction
Heger et al., (2016) 5 Superv.
Learning
Gauss. PR 1 0 Gaussian Process Regression predicts the best parameters’
settings, conditioned on current system status. Results showed a
significant mean tardiness reduction.
Performance
Prediction
Delgoshaei and
Gomes, (2016)
1 Superv.
Learning
SA, NN 1 0 A hybrid model based on NN and Simulated Annealing is used
to optimize the prediction mix. The focus is on a cellular
shopfloor with parallel machines and bottlenecks
Performance
Prediction
Li, (2016) 0 Superv. &
Reinfor.
Learning
SVM, QLrn 1 0 To reduce the electricity consumption of a multi-route
transportation system, SVM and Q-learning algorithms are
proposed. The approach is validated through simulation.
Performance
Prediction
Diaz-Rozo et al.,
(2017)
0 Unsuper.
Learning
K-Mean, HC,
GMM
0 1 A Cyber-Physical system is described, and 3 clustering
algorithms are compared, to group high throughput machining
cycle conditions.
Performance
Prediction
Rude et al. (2015) 0 Unsuper.
Learning
HMM 0 0 An unsupervised Hidden Markov Model, used for recognition of
worker activity in manufacturing processes, shows comparable
results with supervised techniques, thus reducing the need for
labeled data.
Performance
Prediction
Chan et al., (2018) 0 Superv.
Learning
LASSO,
Cluster.
1 0 The aim is to estimate the costs of new jobs, based on historical
data and technical features. A model based on dynamic
clustering for model selection, coupled with Lasso and/or
Elastic Regression is proposed.
Performance
Prediction
Ghadai et al., (2018) 0 Superv.
Learning
CNN 0 0 Difficult-to-manufacture geometries are predicted with a 3D
Convolutional NN. A second method is proposed to explain the
causes of non-manufacturability.
Performance
Prediction
Tulsyan et al., (2018) 3 Superv.
Learning
Gauss. PR 1 0 The paper addresses the “Low-N” problem, relatively to a batch
manufacturing process for which scarce historical data are
available. The problem is tackled using a multi-dimensional
approach based on Gaussian Processes.
Performance
Prediction
Gyulai et al., (2018) 2 Superv.
Learning
RF, SVM 0 1 Analytical and ML techniques are applied, within a Digital Data
Twin, to predict Lead Time. The focus is on flow-shops with
frequent changes in customer demand. Frequent retraining and
on-line learning are adopted.
Performance
Prediction
Silbernagel et al.,
(2019)
0 Superv. &
Unsuperv.
Learning
AE, PCA, K-
mean
0 0 A Convolutional Autoencoder is used to cluster images of the
processing of pure copper in a laser powder bed fusion printer.
The quality of each cluster is mapped manually, to the original
set of the process parameter.
Performance
Prediction
Stathatos and
Vosniakos, (2019)
0 Superv.
Learning
NN 0 0 Three NNs are used to predict, given a laser trajectory, the
evolution of temperature and density. The trajectory is
decomposed using a custom method that provides a local
description relative to the surroundings.
Performance
Prediction
Agarwal et al. (2019) 0 Superv. &,
Unsup. Learning
AE, NN, SVM 1 0 The paper presents 2 approaches to find the ranges of process
inputs optimizing process productivity and profit. Supervised
and unsupervised deep learning techniques are investigated,
and the layer-wise relevance propagation algorithm is used to
prune the inputs of the NNs.
Performance
Prediction
Jang et al., (2019) 0 Superv.
Learning
NN 0 0 The paper presents a model to predict the yield of new wafer
maps. The approach is based on a deep NN and exploits spatial
relationships relative to the positions of dies (on a wafer) and
die-level yield variations.
Performance
Prediction
Gurgenc et al.,
(2019)
0 Superv.
Learning
NN 0 0 A deep NN is used to estimate the machining times of a CNC
milling machine. Design and manufacturing parameters are
used as input and the network is trained with an extreme
learning machine (ELM), with optimal results.
Process Control Chinnam, (2002) 49 Superv.
Learning
SVM, NN 0 0 NNs and SVM are applied to recognize quality drifts in related
and unrelated manufacturing processes. It is shown that even
simple linear kernels perform better than Statistical Process
Control techniques.
Process Control Sun et al., (2004) 70 Superv.
Learning
NN, SVM 0 0 A tool condition monitoring system, based on acoustic emission
sensing, is presented. NN and SVM are used to classify the tool
state; the performance evaluation is based on manufacturing
loss, due to misclassification.
(continued on next page)
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
18
Table 3c (continued )
Sub-Area Article Rec.
Cit.
Algorithm Class. Main Alg. Sim.
Based
Cyber
Ph. Sys.
Goals & Approach
Process Control Shin et al., (2012) 13 Reinfor.
Learning
AHC 1 0 A fuzzy Reinforcement Learning system is applied for
manufacturing control. The agent has the ability of self-
regulating in response to the system’s changes. It can also
dynamically re-set its goal.
Process Control García Nieto et al.
(2012)
20 Superv.
Learning
SVM, NN 0 0 SVM and NN are used to control the manufacturing process of a
paper mill. NN and SVM are chosen, given their capabilities to
reproduce non-linear relationships among explanatory
variables.
Process Control Wang et al., (2018) 1 Superv.
Learning
CNN 0 0 A Convolutional NN is applied for continuous human motion
analysis. The aim is to infer human actions and future intentions
for human-robot collaboration. Experiments showed a 96%
accuracy.
Process Control Maggipinto et al.,
(2018)
0 Supervised
Learning
CNN 0 0 A Convolutional NNN is employed to avoid the feature
extraction phase that is generally needed for image processing
in virtual metrology. It is applied to optical emission
spectroscopy data.
Process Control Mezzogori et al.
(2020)
0 Superv.
Learning
NN 1 0 Deep NN and Linear Regression are used to predict throughput
time, given the current system’s state. The system is regulated
by Workload Control and the aim is to define reliable due dates
reducing the % of tardy jobs.
Process Control Zan et al., (2019) 5 Superv.
Learning
CNN, NN 1 0 A 1-D Convolutional NN is applied to a dataset with 6 control
chart patterns. The CNN performance is compared to manual
feature extraction methods and a simple NN, showing
advantages.
Process Control Ma et al., (2019) 4 Reinfor.
Learning
DDPG 1 0 Deep Deterministic Policy Gradient is applied to chemical
process control. Many operating features, such as action
boundaries and reward definitions are discussed.
Process Control Joswiak et al.,
(2019)
0 Superv. &
Unsuperv.
Learning
PCA, t-SNE,
UMAP
0 0 16 dimensionality reduction techniquesare compared using
data sets of chemical plants. UMAP (Uniform Manifold
Approximation and Projection) outperforms other methods
Process Control Zhang et al.,
(2019a), Zhang et al.
(2019b)
1 Unsuperv.
Learning
K-mean 0 0 A K-Means with Davies-Bouldin Criterion is used to decompose
the surface of additive manufactured parts, to optimize the
build orientation dynamically.
Process Control Gardner et al.,
(2019)
2 Superv.
Learning
NN, GrB. 0 0 The work proposes a combined approach (based on NN and
Gradient Boosting) for optimal parameters’ selection depending
on the location of a 3D printing process.
Process Control Chen et al., (2019),
Chen et al. (2019b)
1 Superv. &
Unsuperv.
Learning
NN 0 0 A Deep Neural Network is applied for energy consumption
modeling, which usually relies on abundant labeled data. The
NN is trained with a semi-supervised approach, to better exploit
non-labeled data. An experimental study on furnace energy
consumption data is described.
Process Control Dornheim et al.
(2019)
0 Reinforc.
Learning
QLrn 1 0 The paper proposes a self-learning optimal control algorithm
(based on Q Learning), for manufacturing processes subject to
nonlinear dynamics and stochastic influences. It accounts for
stochastic variations of the process conditions and can cope
with partial observability.
Process Control Denkena et al.,
(2019)
0 Superv.
Learning
SVM, SA 1 0 The aim is to optimize the operating parameters of a grinding
process of helical flutes. The model integrates simulation, SVM,
and an optimizer (based on simulated annealing) to fine-tune
both the cutting feed and the speed of the grinder.
Scheduling Aydin and Oztemel,
(2000)
96 Reinforc.
Learning
QLrn 1 1 An RL agent learns how to select the most appropriate
dispatching rule and performs dynamic scheduling based on
available information. An extension of the Q-learning
algorithm, called Q-III, is also presented.
Scheduling Priore et al., (2001) 7 Superv.
Learning
DT 1 1 Decision Trees are used to identify, the best dispatching rule for
flow and job shop systems. Results are good, but many
simulation runs are needed to generate training examples.
Scheduling Mönch et al., (2006) 40 Superv.
Learning
DT, NN 1 0 Decision Trees and NN are used to fine-tune a simple heuristic
for dispatching rule selection. Data is generated via simulation.
Scheduling Priore et al., (2006) 58 Superv.
Learning
DT, NN, CBR 1 0 Inductive Learning, NNs, and Case-Based Reasoning are
compared to find the best dispatching rule. Testes performed
via simulation
Scheduling Csáji et al., (2006) 19 Reinforc.
Learning
SA, TD, NN 1 0 Simulated Annealing, Temporal Difference Learning, and NNs
are used to solve a dynamic job-shop scheduling problem in a
distributed and iterative way. Each machine and job is
associated with an agent, which has the role of selecting the best
schedule.
Scheduling Shiue, (2009) 14 Superv.
Learning
DT 1 1 A Decision Tree is applied to a two-stage real-time scheduling
scenario with a nonstationary product mix: first, a knowledge
base class is selected, then a scheduling rule is chosen.
Scheduling Gaham and
Bouzouia, (2009)
3 Superv.
Learning
NN, GA 1 0 A Genetic Algorithm is used to solve a flexible job shop
scheduling, while 2 NNs are used for machine allocation for
priority assignment.
Scheduling Priore et al., (2010) 5 Superv.
Learning
SVM 1 1 SVM is used to find the best dispatching rule. Tests are made
with simulated data.
(continued on next page)
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
19
a Decision Tree algorithm, (namely C.4.5) to select the best dispatching
rule, depending on the current system state. The work was subsequently
extended and refined with the addition of other ML techniques, and it
was shown that SVM is the best approach when the objective is the
minimization of the mean tardiness and of the mean flow time (Priore
et al., 2006, 2010). Similar work was made by Heger et al. (2016), who
investigated the application of Gaussian Processes to dynamically
readjust the parameters of dispatching rules, based on current shop-floor
conditions. As a result, their approach drastically reduced jobs’ mean
tardiness.
More recently, scientific interest has moved from the selection of the
best dispatching rule to the definition of fully automated adaptive and
real-time schedulers. Multiple and diverse approaches have been pro-
posed, ranging from Self-Organizing Maps (SOM), that autonomously
extrapolate the suitable classes which best explain the data (as in Shiue
et al., 2011), to hybrid models based on simulation, population-based
metaheuristics, and NNs (as in Gaham & Bouzouia, 2009) that make it
possible to dynamically regenerate optimal scheduling sequences,
anytime certain manufacturing events take place. In some cases, even
hybrid models, based on metaheuristics and RL, are used. For instance,
Csáji et al. (2006) proposed a combination of a Simulated Annealing, RL,
and NN to implement an adaptive iterative distributed scheduling al-
gorithm for market-based production, in which each machine and job is
seen as an agent and can participate in a bid system with the global aim
of minimizing the total production time. Palombarini and Martínez
(2012), implemented a Q-Learning algorithm to reschedule jobs anytime
an unforeseen event (e.g. arrival of a rush order or breakdown of a
working machine) takes place on the shop floor.
Many works have also dealt with the problem of ‘Process Control’, a
field closely related to ‘Online Quality Control’, in which the objective is
that to govern a manufacturing process (regulating and fine-tuning its
main operating parameters) so that it behaves as planned with little or
none non-conforming situations. Shin et al. (2012) proposed an RL-
based approach to build a self-adapting manufacturing system. The RL
model, based on two collaborating neural networks, makes the
manufacturing system completely autonomous, as it becomes able to set
its goals and reconfigures itself to changing environmental conditions.
Next, similar work was proposed by Dornheim et al. (2019), who used a
Q-Learning method for optimal and automatic control of manufacturing
processes characterized by non-linear dynamics. To conclude, we cite
the interesting and very recent work by Joswiak et al. (2019) who
tackled process control following a more practitioner-oriented
approach. In particular, the authors compared different dimensionality
reduction algorithms to create a dashboard of meaningful process data
aimed to support and to enhance human decision-making.
4.4.4. Supply chain management
SCM is the process of planning, controlling, and executing all logistic
flows, from the acquisition of raw materials to the delivery of end
products, in the most streamlined and cost-effective way. In this sense,
SCM encompasses a diversified set of activities that broadly includes:
demand planning, sourcing, inventory management, and trans-
portations. As shown by the rather limited number of papers included in
Table 3d, in terms of ML applications, SCM is not yet a much-explored
domain, as already confirmed by the keywords’ analysis of Section
4.3, which revealed that, although SCM is a consolidated field, with a
recently increasing trend, it is not a star topic yet.
‘Modelling and Coordination’ is the most studied topic of the SCM
area. Generally, a two-stage supply chain with non-stationary demand is
considered and a variegated set of performance indicators is optimized
using a multi-agent-based simulated scenario. One of the first works of
this kind is that of Kim et al. (2008), who used an Action Value-basedRL
algorithm to optimize and to compare a centralized and a decentralized
supply chain, whose state is described by customer-demand patterns.
Chaharsooghi et al. (2008) tested the Q-Learning algorithm using, as a
simulative setting, that of the famous Beer Game (Coppini et al. 2010),
showing that purchasing orders generated by the RL based decision
system greatly reduce the bullwhip effect of the supply chain. More
recently, a similar approach was used by Mortazavi et al. (2015) who
used Q-Learning to coordinate a four-echelons supply chain with non-
stationary demand. Other than using RL based models, some recent
papers applied SL approaches to coordinate the supply chain. For
instance, Cavalcante et al. (2019) used K-Nearest Neighbours and Lo-
gistic Regression for suppliers’ selection, while Priore et al. (2019) used
a Decision Tree to dynamically select the best replenishment model for
each tier of the supply chain.
‘Demand Forecasts’, a cornerstone of SMC, is the second most
considered topic, with applications in different settings and demand
Table 3c (continued )
Sub-Area Article Rec.
Cit.
Algorithm Class. Main Alg. Sim.
Based
Cyber
Ph. Sys.
Goals & Approach
Scheduling Shiue et al., (2011) 2 Unsuper.
Learning
SOM 1 1 A Self-Organizing Map is used to select multiple scheduling
rules. The SOM outperforms, in the long run, traditional
approaches based on a single scheduling rule.
Scheduling Palombarini and
Martínez, (2012)
9 Reinforc.
Learning
QLrn 1 1 A Q-Learning system is proposed for adaptive rescheduling, to
respond to non-planned events such as new order or equipment
failures.
Scheduling Shiue et al., (2012) 5 Superv.
Learning
NN, DT,
Bagging SVM,
GA
0 1 A real-time scheduling system is proposed. NNs, SVM, and
Decision Tree (based on Bagging) are integrated, and a Genetic
algorithm is used for feature selection. The approach is
evaluated using 10-fold cross-validation.
Scheduling Drakaki and Tzionas,
(2017)
0 Reinforc.
Learning
QLrn 1 0 An order-picking scheduling problem is tackled through a Q-
learning algorithm (without a Neural Network function
approximator) coupled with hierarchical Colored Petri Nets.
Scheduling Priore et al., (2018) 2 Superv.
Learning
Bagging,
Boosting
1 1 Bagging, boosting, and stacking methods are tested for
dispatching rule selection. Mean tardiness and mean flow time
are improved.
Scheduling Tan et al., (2019) 1 Reinforc.
Learning
QLrn 1 1 A Multi-agent reinforcement learning approach for dynamic
planning and scheduling is proposed. The focus is on robot
assembly lines, to minimize the makespan.
Scheduling De Jong et al.,
(2019)
1 Superv.
Learning
CNN 1 0 A CNN is proposed for quick and accurate makespan forecast,
both for job and shop floor systems. A visual representation of
the layout and the system’s state is also provided as an
additional input.
Scheduling Lin et al., (2019) 2 Reinforc.
Learning
DQN 1 1 The paper integrates Deep Q-Learning with edge computing to
solve complex scheduling problems requiring different
dispatching rules
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
20
Table 3d
Logistic and Supply Chain Management Papers.
Sub-Area Article Received
Citations
Algorithm
Classification
Main
Algorithm
Simul.
Based
Goals & Approach
Modeling and
Coordination
Chi et al., (2007) 19 Supervised
Learning
SVM, GA 1 SVM for regression is compared to a DOE approach to predict 7
performance measures of Vendor Managed Inventory. The optimal
input settings are generated using a genetic algorithm. The main
benefit of SVM is the possibility to avoid system disruption, as
opposed to the DOE approach.
Modeling and
Coordination
Kim et al., (2008) 12 Reinforcement
Learning
AVL 1 An asynchronous RL agent is used for inventory control in a serial
supply chain. Time-varying rewards are used, and the approach is
tested either for centralized and decentralized supply chains.
Modeling and
Coordination
Chaharsooghi et al.,
(2008)
37 Reinforcement
Learning
QLrn 1 Q-Learning is proposed to coordinate a multi-agent supply chain
(with 4 tiers) and to minimize the bullwhip effect. The
environment state is described by inventory position, ordering size
to the upstream level, and distribution amount at each level and
the objective is to minimize the total inventory costs.
Modeling and
Coordination
Zarandi et al.
(2012)
2 Reinforcement
Learning
TD 1 A fuzzy-inference system is used to approximate the value
function returned by a Reinforcement Learning approach for
inventory control. Specifically, the agent models a supplier and
determines the number of orders for each retailer, with supply
capacity constraints.
Modeling and
Coordination
Mortazavi et al.,
(2015)
7 Reinforcement
Learning
QLrn 1 Q-Learning algorithm is used, in an agent-based simulation of a 4-
echelon chain, with non-stationary demand. The objective is to
coordinate the ordering processes. The Value-at-Risk methodology
is also applied both for risk evaluation and sensitivity analysis.
Modeling and
Coordination
Cavalcante et al.,
(2019)
0 Supervised
Learning
LogRT, KNN 1 Simulation and ML are combined for supplier selection in resilient
chains. K-nearest neighbors and Logistic Regression are compared
for the classification task.
Modeling and
Coordination
Priore et al., (2019) 3 Supervised
Learning
DT 1 A dynamic framework for automated inventory management is
proposed. Specifically, a Decision Tree periodically selects the best
inventory model for a node of the supply chain according to its
state and the network state.
Modeling and
Coordination
Du and Jiang,
(2019)
0 Reinforcement
Learning
QLrn 1 A multi-agent reinforcement learning approach is proposed. The
aim is to optimize the manufacturer’s strategies, in a dynamic
supply chain, mitigating the risk of the supplier. The approach is
successfully validated in a simulated environment with a single
manufacturer and a single supplier.
Modelling and
Coordination
González Rodríguez
et al. (2019)
0 Supervised Fuzzy Inf
System + Tree
0 The paper proposes a decision support system to coordinate a
Closed-Loop Supply Chain in presence of uncertainties. The
support system makes use of a Fuzzy Inference Systems, whose
rules are automatically generated with a regression tree. One of
the main contributions is the ability to limit the impact, on
inventories, of imbalances in the rest of the chain.
Demand
Forecasting
Carbonneau et al.,
(2007)
6 Supervised
Learning
SVM, NN,
RNN
0 ML algorithms are compared to statistical methods for demand
forecasting in supply chains. Tests showed that ML techniques are
generally outperformed when applied to single feature time-
series. The performance of ML rapidly increases using multi-
dimensional time-series.
Demand
Forecasting
Villegas et al.,
(2018)
0 Supervised
Learning
SVM 0 SVM is used to select the best forecasting model, based on the
prediction output of each model. Also, a comprehensive feature
selection analysis was carried out.
Demand
Forecasting
Mezzogori and
Zammori (2019)
0 Supervised
Learning
AE RNN 0 An Entity Embedding based neural network is used to learn vector
representation of past and current product. The vectorial
representations are exploited to trace similarities of the current
product to past products, so to build pseudo-time-series, analyzed
by an RNN based network to predict the quantity sold for each
product at the end of a sales campaign
Demand
Forecasting
Fu and Chien
(2019)
0 Supervised
Learning
KNN, SVM,
NN
0 Machine Learning and temporal aggregation mechanism areintegrated to forecast the demand for intermittent products. The
proposed framework is tested using the data of a semiconductor
distributor.
Demand
Forecasting
Ji et al., (2019) 0 Supervised
Learning
XGB 0 A novel XGBoost algorithm is proposed and tested against classical
ARIMA models, to forecast sales of an e-commerce platform.
Demand
Forecasting
Wu, (2010) 45 Supervised
Learning
SVM, PSO 0 A hybrid approach based on Particle Swarm Optimization and
SVM is tested on a dataset of car sales. The aim is to optimize the
reorder points of each tier of the supply chain.
Inventory
Management
Kim et al., (2005) 41 Reinforcement
Learning
AVL 1 Centralized and non-centralized inventory models are proposed to
manage a supply chain with one supplier and multiple retailers.
Specifically, an action-value based algorithm is proposed to
constantly react to the changes in customers’ demand.
Inventory
Management
Kwon et al., (2008) 7 Reinforcement
Learning
CBR 1 A case-based RL approach is presented to control inventory (at
supply chain scale) in case of non-stationary customer e.
Specifically, the case-based reasoning discretizes the state space,
thus reducing the number of possible configurations to be learned.
Inventory
Management
Jiang and Sheng,
(2009)
37 Reinforcement
Learning
CBR 1
(continued on next page)
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
21
patterns. For instance, Carbonneau et al. (2007), compared classical
statistical methods against ML and concluded that for multidimensional
time-series, the accurateness of the newest methods is evident. Wu
(2010) showed that a combination of Particle Swarm Optimization and
SVM can be used to optimize reorder points at each level of a supply
chain. Villegas et al. (2018) used SVM to select the best forecasting
model among a diverse pool of predictive models for non-stationary
and/or lumpy demand. Ji et al. (2019) presented a hybrid forecasting
model based on XGBoost and Arima and successfully tested it against an
e-commerce dataset. Mezzogori and Zammori (2019) integrated an
Entity Embedding and a Recurrent Neural Network for demand forecast
in the fashion market.
Finally, a limited number of works applied RL for ‘Inventory Control’.
The first paper in this area is that by Kim et al. (2005), who used an
Action Value RL algorithm to optimize centralized and non-centralized
inventory models. Kwon et al. (2008) applied a case-based myopic
approach to a Vendor Managed Inventory model, to minimize the
probabilities of infringements of the contracted service level. Next, Jiang
and Sheng (2009) expanded this model to study a simulated multi-agent
supply-chain with 80 retailers and 10 customers, in which each tier of
the supply chain is modeled as an independent reinforcement learning
agent. Lastly, Kara and Dogan (2018) focused on perishable products,
with random demand and deterministic lead-time. The aim was to
minimize the retailer’s total cost of a retailer, using two alternative in-
ventory policies optimized either with Q-Learning and SARSA algo-
rithms. Specifically, the latter ensured better results if applied to items
with a short lifetime and lumpy demand.
4.4.5. Models’ complexity, Input-Output variables
To give a better idea of the complexity and variety of the considered
problems, we deemed it useful to include Table 4, which provides some
indications concerning the variables that are commonly used per each
application domain and sub-area. Specifically, this information is
exemplified by the list of the input and output variables of the dataset
used by most representative papers, belonging to each investigated sub-
area.
4.4.6. Concluding remarks
As already mentioned, given the sudden rise of ML and DL applica-
tions, the vastness of scientific literature that is being produced can be
confounding if not even misleading, both for researchers and practi-
tioners aiming to apply these methodologies to specific industrial tasks.
Apart from this general criticality, some operational issues, that could
hamper the diffusion of ML in the industry, have also emerged from the
literature review (i.e., Research Question #5). Generally, problems are
related to the data set needed to train the ML models. Indeed, if data are
collected directly on the field, issues related to missing, dirty, or even
insufficient data are frequently encountered. Apart from the standard
ways used for data pre-processing (e.g., imputation using most frequent,
zero/constant and k-NN), many papers (see for example Sobie et al.,
2018) demonstrated that this problem can be reduced using a training
dataset generated using high-resolution simulations or using generative
methods, such as Generative Adversarial Networks (Douzas and Bacao,
2018). Also, and perhaps more important, Deep Learning methodologies
allow working on almost raw data with little or no need for data pre-
Table 4
Overview of dataset characteristics.
Area Sub-Area Article Input Variables Output Variable(s) # of
samples
Maintenance
Management
Condition
Monitoring
Saxena and Saad,
(2007)
38 statistical features of accelerometer and microphone
data
Type of fault predicted 1152
Downtime
minimization
Susto et al., (2015) 125 statistical moments calculated of 31 time series (i.e.
current, deceleration, position, pressure).
Predicted faultiness class 3671
Failure Analysis Prieto et al., (2013) 25 statistical-time features calculated from vibration signal Prediction of 6 bearing
status
120
Quality Management Defect Detection Kusiak and Kurasek,
(2001)
14: stencil composition, stencil thickness, …, paste
application, position
Presence of solder defect 2052
Image Recognition Ravikumar et al.,
(2011)
20 histogram features of image data 3 different component
status
300
Life Cycle
Management
Jennings et al., (2016) 18 product characteristics (weight, screen resolution, etc.) Prediction of product
discontinuity
7000
Online Quality
Control
Ribeiro, (2005) 26 sensors readings (temperature, pressures, etc.) 6 kind of plastic part fault 200
Production Planning &
Control
Performance
Prediction
Arredondo and
Martinez, (2010)
4 order type attributes (size, composition, due date, arrival
date)
Order value N.A.
Process Control Sun et al., (2004) 9 cutting conditions (speed, depth, feed rate), and statistics
of band power
3 tool states N.A.
Scheduling Mönch et al., (2006) 4 batch machine factor, due dates tightness, due date
variance, ready time tightness
Scheduling look-ahead
parameter
N.A.
Supply Chain
Management
Modeling and
Coordination
Priore et al., (2019) 7 firm and supply state variables Replenishment model 2000
Demand Forecasting Mezzogori and
Zammori (2019)
26 product attributes Product demand
prediction
1020
Inventory
Management
Kara and Dogan,
(2018)
4 state variables measuring product remaining life and
inventory position
Action value N.A.
Table 3d (continued )
Sub-Area Article Received
Citations
Algorithm
Classification
Main
Algorithm
Simul.
Based
Goals & Approach
Under a nonstationary demand simulated scenario, a case-based
RL approach is tested, both in a periodical review order-up-to
system and in an order-quantity reorder-point system.
Inventory
Management
Kara and Dogan,
(2018)
0 Reinforcement
Learning
QLrn, SARSA 1 The Q-learning algorithm and the Sarsa method are compared for
solving an inventory management problem with perishable
products. RL shows better results with high variance demand of
short lifetime products.
M. Bertolini et al.Expert Systems With Applications 175 (2021) 114820
22
processing (Azadeh et al., 2013). Even in the case of very noisy data
(especially for signal and/or image processing), data can be optimally
denoised using stacked autoencoders, as in Yu et al. (2019). Certainly,
on the other one side, Deep Learning techniques and, more, in general,
all the NN based approaches, are difficult to be interpreted and could be
negatively seen as a black box, by most of the practitioners. However,
new and effective techniques, such as the ‘layer-wise relevance propa-
gation’ and ‘Grad-cam’, can be effectively used either to interpret a
concept learned by a NN or for producing visual explanation for de-
cisions made by CNN’s (see, for example, Ayodele and Yussof, 2019;
Montavan et al., 2018; Selvaraju et al., 2017). Thus, also considering the
extreme flexibility of Deep-learning techniques, and the outstanding
results that have been obtained in seemingly unrelated applications,
such as Natural Language Processing (Vaswani et al., 2017), their use in
operation management is expected to further increase. It is not a wild
guess to speculate that, in the next future, deep learning could find its
way in many industrial fields where these techniques are still shallowly
explored (i.e., Research Question #6). The first evidence comes from
SCM, a domain area that, although still little explored, is rapidly
growing, thanks to the adoption of Deep and Reinforcement Learning
techniques that make it possible to model and optimize complex prob-
lems of strategic nature. It is not difficult to predict that a similar
approach could be helpful to obtain concrete improvements over state-
of-the-art results in traditional industrial problems, such as scheduling
and inventory management.
Finally, the so-called data unbalancing problem is worth mentioning.
This issue is typical for quality and/or defect classification tasks when
the objective is to discriminate positive events from negative and rare
ones. Also, in this case, standard methods exist, ranging from classical
under-sampling (e.g. Near Miss algorithm) and oversampling ap-
proaches (e.g. Synthetic Minority Oversampling techniques) to more
elaborated techniques based on Competitive NNs (Nugroho et al., 2002).
However, as noted by Ko et al. (2017) and by Kim et al. (2018), even the
use of ensemble methods (of Random Forest in the simplest case) is
frequently enough to overcome this criticality.
5. Conclusions and directions for future works
The hype surrounding Machine Learning and Deep Learning algo-
rithms is ever-growing and, given recent breakthrough developments,
their use has been experiencing a steep increase in many fields. This
trend is very marked in the industry, especially in the operation man-
agement area, as revealed by the literature analysis herein described.
The number of published papers is very large and covers the whole
spectrum of operation management. Moreover, all the application do-
mains considered in this study show a steady and significant increase in
the number of publications (especially in the last two years), thus further
demonstrating an ever-growing interest in such applications. Histori-
cally, in terms of application domains, studies concerning Maintenance
and Quality appeared first, followed by applications in Production
Planning and Control and, lastly, in Supply Chain Management. Quality
management, as of today, is the most studied topic, probably given its
relevance on total sales and, consequently, its quicker return on in-
vestment. Recently, the investigated domain has been extended with the
introduction of new research fields such as Cyber-physical systems,
Additive Manufacturing and, more generally, Industry 4.0. These fields
seem to be promising for ML applications, and preliminary results are
encouraging. However, this enthusiasm is not certain to be followed up
and the initial interest could rapidly fade off, as already happened in
other fields. A typical example is ‘Order Management’ that, after an
initial boom, is now displaying a rapid decrease in interest. Probably,
this is due to the use of boundary algorithms that, although appealing for
the academic community, are of scarce interest for industrial practi-
tioners. This fact highlights the need to find a trade-off between novelty
and industrial applicability; a trade-off that is particular critical espe-
cially for ‘young domains’ (or question marks) where, to foster
acceptance it may be preferable to leverage on simple and more
consolidated techniques, rather than on novel and complex one that,
conversely, might even have a detrimental effect.
Concerning the adopted techniques, the most explored ones are
based on Supervised Learning, closely followed by Unsupervised
Learning algorithms, fast-rising especially in the last decade. Rein-
forcement Learning methods, given their higher complexities, are still
few, but they are also increasing (with a spike in 2018–2019), mainly in
the SCM area.
Anyhow, this positive trend and the even distribution of ML appli-
cations in many different industrial areas confirm the flexibility of ML
methodologies and their high potentialities for operation management
tasks. It is also important to note that enabling technologies are now
mature and that only a few operational problems must still be solved, for
the definitive dissemination of these methods. As discussed, most of the
problems concern either the generation of meaningful benchmark
datasets or the low interpretability of the obtained results. However, as
clearly discussed in the paper, thanks to recently emerging techniques,
both problems can already be satisfactory solved. Perhaps, the only real
problem that still needs to be solved, is to provide practitioners with a
proper key to interpret and to choose appropriate ML methods, without
getting lost in the vastity of scientific works published in the subject
matter. Hence, we hope that this systematic literature review, which
classified the existing corpus of works in a structured and operative way,
could be of help to solve this problem. For the same reason, a topics’
trend analysis has also been made, aiming to give precious indications
on the research areas on which academic researchers should focus,
depending on the tasks at hand and the scope of their study. Surely, it
can be presumed that more effort should be placed on topics classified as
‘Question Marks’ and ‘Hot Topics’ that, being the youngest and least
explored, are the ones where the bigger innovations can be made. On the
other hand, if the problem falls within the ‘Consolidated’ or ‘Stars’
category, then the innovation rate will be lower, but a solid corpus of
works can be found with precise indications of the implementation
strategy. In this regard, we note that the creation and sharing of open
datasets (of real industrial data) could be very helpful to further accel-
erate the diffusion and acceptance process of ML methods. Indeed, this
would allow practitioners to develop, test, and compare new algorithms,
leveraging common datasets.
Our belief is that many opportunities and potentials are yet to be
discovered in the application and/or integration of ML methods to
existing operational management techniques. In a certain sense, the
adoption and hybridization of standard operation management ap-
proaches with ML algorithms could further strengthen the smart
manufacturing concept. Just to name a few examples, embedding ML
models within discrete event simulations (or in Digital Twins), could
exploit the concept of cyber-physical system, boosting operating per-
formance and bringing to light new and interesting results. Similarly,
Reinforcement Learning techniques should be studied not only for
classic ‘hard’ applications in the field of robotics and automation, but
also for more ‘soft’ tasks, such as expert systems and/or decision support
systems. Other fields worthinvestigating could be the applicability of
ML methods in a real-world environment, in terms of computing power
and excessive latency. Also, economical assessments of the impact of ML
techniques could be helpful to further show the utility of such methods.
All these could be interesting topics for future streams of research.
To conclude we note that, due to the vastness of the considered
domain, our analysis was mainly of explanatory nature. The aim, in fact,
was to assess the current diffusion of ML and the potential it offers and/
or it may offer to solve problems typical of the operation management
field. We have simply tracked which algorithms are used in which field,
without any ambition of establishing which are the best ones. This was
not the aim of our works and, frankly speaking, we do not think it is
possible to do so as the problems analyzed are so varied and specific that
it would be difficult to make a fair comparison. Certainly, by narrowing
the field (for instance to one of the sub areas identified in the paper) such
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
23
a detailed analysis and comparison could be made and would be
extremely useful for a further advancement of ML in the industry. Hence,
this could be another interesting field for future research.
CRediT authorship contribution statement
Massimo Bertolini: Conceptualization, Supervision. Davide Mez-
zogori: Data curation, Writing - original draft. Mattia Neroni: Visual-
ization, Resources. Francesco Zammori: Methodology, Writing -
review & editing.
Declaration of Competing Interest
The authors declare that they have no known competing financial
interests or personal relationships that could have appeared to influence
the work reported in this paper.
Appendix A Acronyms of the algorithms cited in the literature
review
Table A1
Glossary of the Acronyms of the cited algorithms.
Acronym Full Name Explanation
AHC Adaptive Heuristic Critic Reinforcement Learning Algorithm
ANOVA Analysis Of Variance Mean Test among groups
AVL Action Value Reinforcement Learning Algorithm
ASVM Adaptive Super Vector Machine Adaptive Classification
ACO Ant Colony Optimization Metaheuristic for optimization
AuNN Autoencoder Neural Network Network for Dimensionality Reduction
Bay.R Bayesian Regression Non-Parametric Regression model
Boosting Boosting Ensemble learning technique
Bagging Bootstrap Aggregating Resampling Technique for Variance Reduction
CBR Case Base Reasoning Using past knowledge to solve new problems
CNN Convolutional Neural Network Feed Forward Network
DT Decision Tree Classification and Regression
DDPG Deep Deterministic Policy Gradient Reinforcement Learning Algorithm
DQN Deep Q Learning Reinforcement Learning Algorithm
GMM Gaussian Mixture Modelling Clustering
Gauss. PR Gaussian Process Regression Non-Parametric Regression model
GA Genetic Algorithm Metaheuristic Optimization
GDE Gaussian Density Estimation Gaussian Distribution estimation method
GrB Gradient boosting Ensemble Technique for decision trees
HMM Hidden Markov Model Prediction and Prognostic Model
HC Hierarchical Clustering Clustering
KNN K nearest neighbor Clustering
K-PCA Kernel Principal Component Analysis Dimensionality Reduction
K-Means K-Means Clustering
KMC K-Means/K-Median Clustering
LASSO Lasso Regression Regression model
LVQ Learning Vector Quantization Classification (labeled data)
LDA Linear Discriminant Analysis Classification and Patter recognition
LOF Local Outlier Factor Anomaly Detection
LWR Locally Weighted Regression Regression
LogR Logistic Regression Parametric Regression model (for probabilities)
LSTM Long-Short Term Memory Recurrent Neural Network
Nbay Naive Bayes Classification Technique
NBC Neighbor Based Clustering Clustering
NN Neural Network (Multilayers perc.) Standard Feed Forward Network
NMF Non-Negative Matrix Factorization Matrix Factorization
PSO Particle Swarm Optimization Optimization Metaheuristic
PCA Principal Component Analysis Dimensionality Reduction
PPO Proximal Policy Optimization Reinforcement Learning Algorithm
PW Parzen Windows Unsupervised Density Estimation
QLrn Q-Learning Reinforcement Learning Algorithm
QDA Quadratic Discriminant Analysis Classification and Patter recognition
RnF Random Forest Ensemble of Decision Tree
RNN Recurrent Neural Network Neural Networks for time series analysis
RBM Restricted Boltzmann Machine Network for Probability Distribution Learning
RST Rough Set Algorithm Rule Mining Algorithm
SIFT Scale Invariant Feature Transform Computer Vision Feature Detection technique
SOM Self-Organizing Maps Network for Dimensionality Reduction
SA Simulated Annealing Optimization Heuristic
SS Sparse Sensing Signal Processing Technique
SDA Stacked Denoising Auto Encoder Network for Dimen. Reduction and data denoising
SARSA State–action–reward–state–action Reinforcement Learning Algorithm
SVM Super Vector Machine Classification and Regression
SVDD Support Vector Data Description Classification technique for unbalanced datasets
TS Taboo Search Optimization Heuristic
t-SNE t-distributed stochastic neighbor embedding Dimensionality Reduction
TD Temporal Difference Learning Reinforcement Learning Algorithm
TL Transfer Learning Storing past learning to solve new problems
TRPO Trust Region Policy Optimization Reinforcement Learning Algorithm
UMAP Uniform Manifold Approx. and Projection Dimensionality Reduction
VPF Variable Picket Fence Harmonic Analysis
XGB XGBoost Parallel Tree Gradient Boosting technique
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
24
Appendix B Bibliometric analysis
Some additional statistic, of bibliosmetric nature, are reported
herein. Specifically, the following figures and Table B1a–B1d show:
- the top journals, evaluated both in terms of number of publications
and number of obtained citations,
- the most cited authors, for each application domain.
Figs. B1 and B2
Fig. B2. Top 25 Journals measured in number of received citations.
Fig. B1. Top ten Journals measured in terms of number of published papers.
M. Bertolini et al.
Expert Systems With Applications 175 (2021) 114820
25
References
Adly, F., Alhussein, O., Yoo, P. D., Al-Hammadi, Y., Taha, K., Muhaidat, S., … Ismail, M.
(2015). Simplified subspaced regression network for identification of defect patterns
in semiconductor wafer maps. IEEE Transactions on Industrial Informatics, 11(6),
1267–1276. https://doi.org/10.1109/TII.2015.2481719
Agarwal, P., Tamer, M., Sahraei, M. H., & Budman, H. (2019). Deep learning for
classification of profit-based operating regions in industrial processes. Industrial &
Engineering Chemistry Research, 59(6), 2378–2395. https://doi.org/10.1021/acs.
iecr.9b04737
Alegeh, N., Shagluf, A., Longstaff, A. P., & Fletcher, S. (2019). Accuracy in detecting
failure in Ballscrew assessment towards machine tool servitization. International
Journal of Mechanical Engineering and Robotics Research, 8(5), 667–673. https://doi.
org/10.18178/ijmerr.8.5.667-673
Arredondo, F., & Martinez, E. (2010). Learning and adaptation of a policy for dynamic
order acceptance in make-to-order manufacturing. Computers & Industrial
Engineering, 58(1), 70–83. https://doi.org/10.1016/j.cie.2009.08.005Aydin, M. E., & Oztemel, E. (2000). Dynamic job-shop scheduling using reinforcement
learning agents. Robotics and Autonomous Systems, 33(2–3), 169–178. https://doi.
org/10.1016/s0921-8890(00)00087-7
Ayodele, O. O., & Yussof, N. (2019). Explainable deep learning: Methods and challenges.
Journal of Advanced Research in Dynamical and Control Systems, 11(8), 1186–1205.
Azadeh, A., Saberi, M., Kazem, A., Ebrahimipour, V., Nourmohammadzadeh, A., &
Saberi, Z. (2013). A flexible algorithm for fault diagnosis in a centrifugal pump with
corrupted data and noise based on ANN and support vector machine with hyper-
parameters optimization. Applied Soft Computing, 13(3), 1478–1485. https://doi.org/
10.1016/j.asoc.2012.06.020
Bengio, Y., Courville, A. & Vincent, P. (2013) Representation Learning: a review and new
persectives, IEEE Transaction on Patter Analysis and Machine Intelligence, 35(8),
1798–828. https://doi.org/10.1109/TPAMI.2013.50.
Table B1a
Most cited authors in Maintenance Management domain.
Authors # Papers # Citations
Abhinav Saxena 1 116
Ashraf Saad 1 116
Satish C. Sharma 1 100
Antonio Garcia Espinosa 1 95
Giansalvo Cirrincione 1 95
Humberto Henao 1 95
Juan Antonio Ortega 1 95
Miguel Delgado Prieto 1 95
Arzu Onar 1 62
Nandita Kaundinya 1 62
Table B1b
Most cited authors in Production Planning and Control domain.
Authors # Papers # Citations
Ercan Oztemel 1 96
M. Emin Aydin 1 96
Javier Puente 4 72
Paolo Priore 4 72
David De La Fuente 2 65
José Parreño 2 63
Ratna Babu Chinnam 1 49
Jens Zimmermann 1 40
Lars Moench 1 40
Peter Otto 1 40
Table B1c
Most cited authors in Quality Management domain.
Authors # Papers # Citations
Sami Ekici 1 48
Ulaş Çaydaş 1 48
Bernardete Ribeiro 1 47
Kusiak & Kurasek 1 47
Fangming Ye 1 36
Krishnendu Chakrabarty 2 36
Xinli Gu 1 36
Zhaobo Zhang 1 36
Linkan Bian 2 19
Mark A. Tschopp 2 19
Table B1d
Most cited authors in Supply Chain Management domain.
Authors # Papers # Citations
Qi Wu 1 45
Chengzhi Jiang 1 37
Jafar Heydari 1 37
S. Hessameddin Zegordi 1 37
S. Kamal Chaharsooghi 1 37
Zhaohan Sheng 1 37
Chang Ouk Kim 3 19
Herbert Moskowitz 1 19
Hoi-Ming Chi 1 19
Ick-Hyun Kwon 2 19
M. Bertolini et al.
https://doi.org/10.1109/TII.2015.2481719
https://doi.org/10.1021/acs.iecr.9b04737
https://doi.org/10.1021/acs.iecr.9b04737
https://doi.org/10.18178/ijmerr.8.5.667-673
https://doi.org/10.18178/ijmerr.8.5.667-673
https://doi.org/10.1016/j.cie.2009.08.005
https://doi.org/10.1016/s0921-8890(00)00087-7
https://doi.org/10.1016/s0921-8890(00)00087-7
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0030
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0030
https://doi.org/10.1016/j.asoc.2012.06.020
https://doi.org/10.1016/j.asoc.2012.06.020
Expert Systems With Applications 175 (2021) 114820
26
Bukkapatnam, S. T. S., Afrin, K., Dave, D., & Kumara, S. R. T. (2019). Machine learning
and AI for long-term fault prognosis in complex manufacturing systems. CIRP Annals,
68(1), 459–462. https://doi.org/10.1016/j.cirp.2019.04.104
Burton, B., & Barnes, H. (2017). Hype Cycles Highlight Enterprise and Ecosystem Digital
Disruptions: A Gartner Trend Insight.
Caggiano, A., Zhang, J., Alfieri, V., Caiazzo, F., Gao, R., & Teti, R. (2019). Machine
learning-based image processing for on-line defect recognition in additive
manufacturing. CIRP Annals, 68(1), 451–454. https://doi.org/10.1016/j.
cirp.2019.03.021
Carbonneau, R., Vahidov, R., & Laframboise, K. (2007). Machine learning-based demand
forecasting in supply chains. International Journal of Intelligent Information
Technologies, 3(4), 40–57. https://doi.org/10.4018/jiit.2007100103.
Carvajal Soto, J. A., Tavakolizadeh, F., & Gyulai, D. (2019). An online machine learning
framework for early detection of product failures in an Industry 4.0 context.
International Journal of Computer Integrated Manufacturing, 32(4–5), 452–465.
https://doi.org/10.1080/0951192x.2019.1571238
Cavalcante, I. M., Frazzon, E. M., Forcellini, F. A., & Ivanov, D. (2019). A supervised
machine learning approach to data-driven simulation of resilient supplier selection
in digital manufacturing. International Journal of Information Management, 49, 86–97.
https://doi.org/10.1016/j.ijinfomgt.2019.03.004
Çaydaş, U., & Ekici, S. (2010). Support vector machines models for surface roughness
prediction in CNC turning of AISI 304 austenitic stainless steel. Journal of Intelligent
Manufacturing, 23(3), 639–650. https://doi.org/10.1007/s10845-010-0415-2
Chaharsooghi, S. K., Heydari, J., & Zegordi, S. H. (2008). A reinforcement learning model
for supply chain ordering management: An application to the beer game. Decision
Support Systems, 45(4), 949–959. https://doi.org/10.1016/j.dss.2008.03.007
Chan, S. L., Lu, Y., & Wang, Y. (2018). Data-driven cost estimation for additive
manufacturing in cybermanufacturing. Journal of Manufacturing Systems, 46,
115–126. https://doi.org/10.1016/j.jmsy.2017.12.001
Chen, C., Liu, Y., Kumar, M., Qin, J., & Ren, Y. (2019a). Energy consumption modelling
using deep learning embedded semi-supervised learning. Computers & Industrial
Engineering, 135, 757–765. https://doi.org/10.1016/j.cie.2019.06.052
Chen, Y.-J., Fan, C.-Y., & Chang, K.-H. (2016). Manufacturing intelligence for reducing
false alarm of defect classification by integrating similarity matching approach in
CMOS image sensor manufacturing. Computers & Industrial Engineering, 99, 465–473.
https://doi.org/10.1016/j.cie.2016.05.009
Chen, Y., Chen, B., Yao, Y., Tan, C., & Feng, J. (2019b). A spectroscopic method based on
support vector machine and artificial neural network for fiber laser welding defects
detection and classification. NDT & E International, 108, 102176. https://doi.org/
10.1016/j.ndteint.2019.102176
Chi, H.-M., Ersoy, O. K., Moskowitz, H., & Ward, J. (2007). Modeling and optimizing a
vendor managed replenishment system using machine learning and genetic
algorithms. European Journal of Operational Research, 180(1), 174–193. https://doi.
org/10.1016/j.ejor.2006.03.040
Chinnam, R. B. (2002). Support vector machines for recognizing shifts in correlated and
other manufacturing processes. International Journal of Production Research, 40(17),
4449–4466. https://doi.org/10.1080/00207540210152920
Cho, S., Asfour, S., Onar, A., & Kaundinya, N. (2005). Tool breakage detection using
support vector machine learning in a milling process. International Journal of Machine
Tools and Manufacture, 45(3), 241–249. https://doi.org/10.1016/j.
ijmachtools.2004.08.016
Cholette, M. E., Borghesani, P., Gialleonardo, E. D., & Braghin, F. (2017). Using support
vector machines for the computationally efficient identification of acceptable design
parameters in computer-aided engineering applications. Expert Systems with
Applications, 81, 39–52. https://doi.org/10.1016/j.eswa.2017.03.050
Coppini, M., Rossignoli, C., Rossi, T., & Strozzi, F. (2010). Bullwhip effect and inventory
oscillations analysis using the beer game model. International journal of production
Research, 48(13), 3943–3956. https://doi.org/10.1080/00207540902896204
Coronado, A.E., Lyons, A.C., Kehoe, D.F. and Coleman, J. (2007) Enabling mass
customization: extending build-to-order concepts to supply chains. Production
Planning and Control, 15(4) Special Issue Mass Customisation, 398–411. https://doi.
org/10.1080/0953728042000238809.
Csáji, B. C., Monostori, L., & Kádár, B. (2006). Reinforcement learning in a distributed
market-based production control system. Advanced Engineering Informatics, 20(3),
279–288. https://doi.org/10.1016/j.aei.2006.01.001
De Jong, A. W., Rubrico, J. I. U., Adachi, M., Nakamura, T., & Ota, J. (2019).
A generalised makespan estimationfor shop scheduling problems, using visual data
and a convolutional neural network. International Journal of Computer Integrated
Manufacturing, 32(6), 559–568. https://doi.org/10.1080/0951192x.2019.1599430
Delgoshaei, A., & Gomes, C. (2016). A multi-layer perceptron for scheduling cellular
manufacturing systems in the presence of unreliable machines and uncertain cost.
Applied Soft Computing, 49, 27–55. https://doi.org/10.1016/j.asoc.2016.06.025
Denkena, B., Dittrich, M.-A., Böß, V., Wichmann, M., & Friebe, S. (2019). Self-optimizing
process planning for helical flute grinding. Production Engineering, 13(5), 599–606.
https://doi.org/10.1007/s11740-019-00908-0
Diaz-Rozo, J., Bielza, C., & Larrañaga, P. (2017). Machine learning-based CPS for
clustering high throughput machining cycle conditions. Procedia Manufacturing, 10,
997–1008. https://doi.org/10.1016/j.promfg.2017.07.091
Doltsinis, S., Ferreira, P., & Lohse, N. (2012). Reinforcement Learning for production
ramp-up: A Q-batch learning approach. In 2012 11th international conference on
machine learning and applications. https://doi.org/10.1109/icmla.2012.113
Dornheim, J., Link, N., & Gumbsch, P. (2019). Model-free adaptive optimal control of
episodic fixed-horizon manufacturing processes using reinforcement learning.
International Journal of Control, Automation and Systems., 18(6), 1593–1604. https://
doi.org/10.1007/s12555-019-0120-7
Douzas, G., & Bacao, F. (2018). Effective data generation for imbalanced learning using
conditional generative adversarial networks. Expert Systems with applications, 91,
464–471. https://doi.org/10.1016/j.eswa.2017.09.030
Drakaki, M., & Tzionas, P. (2017). Manufacturing scheduling using colored petri nets and
reinforcement learning. Applied Sciences, 7(2), 136. https://doi.org/10.3390/
app7020136
Du, H., & Jiang, Y.e. (2019). Backup or reliability improvement strategy for a
manufacturer facing heterogeneous consumers in a dynamic supply chain. IEEE
Access, 7, 50419–50430. https://doi.org/10.1109/Access.628763910.1109/
ACCESS.2019.2911620
Duan, Q., Zeng, J., Chakrabarty, K., & Dispoto, G. (2015). Data-driven optimization of
order admission policies in a digital print factory. ACM Transactions on Design
Automation of Electronic Systems, 20(2), 1–25. https://doi.org/10.1145/2699836
El-Bendary, N., El Hariri, E., Hassanien, A. E., & Badr, A. (2015). Using machine learning
techniques for evaluating tomato ripeness. Expert Systems with Applications, 42(4),
1892–1905. https://doi.org/10.1016/j.eswa.2014.09.057
Ye, F., Zhang, Z., Chakrabarty, K., & Xinli, G.u. (2013). Board-level functional fault
diagnosis using artificial neural networks, support-vector machines, and weighted-
majority voting. IEEE Transactions on Computer-Aided Design of Integrated Circuits and
Systems, 32(5), 723–736. https://doi.org/10.1109/tcad.2012.2234827
Fu, W., & Chien, C.-F. (2019). UNISON data-driven intermittent demand forecast
framework to empower supply chain resilience and an empirical study in electronics
distribution. Computers & Industrial Engineering, 135, 940–949. https://doi.org/
10.1016/j.cie.2019.07.002
Gaham, M., & Bouzouia, B. (2009). Intelligent product-driven manufacturing control: A
mixed genetic algorithms and machine learning approach to product intelligence
synthesis. In 2009 XXII International Symposium on Information, Communication and
Automation Technologies. https://doi.org/10.1109/icat.2009.5348452
Gao, B., Woo, W. L., Tian, G. Y., & Zhang, H. (2016). Unsupervised diagnostic and
monitoring of defects using waveguide imaging with adaptive sparse representation.
IEEE Transactions on Industrial Informatics, 12(1), 405–416. https://doi.org/10.1109/
tii.2015.2492924
García Nieto, P. J., Martínez Torres, J., Araújo Fernández, M., & Ordóñez Galán, C.
(2012). Support vector machines and neural networks used to evaluate paper
manufactured using Eucalyptus globulus. Applied Mathematical Modelling, 36(12),
6137–6145. https://doi.org/10.1016/j.apm.2012.02.016
Gardner, J. M., Hunt, K. A., Ebel, A. B., Rose, E. S., Zylich, S. C., Jensen, B. D., … Sauti, G.
(2019). Machines as craftsmen: Localized parameter setting optimization for fused
filament fabrication 3D printing. Advanced Materials Technologies, 4(3), 1800653.
https://doi.org/10.1002/admt.v4.310.1002/admt.201800653
Gareth, J., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical
learning. New York: Springer.
Ghadai, S., Balu, A., Sarkar, S., & Krishnamurthy, A. (2018). Learning localized features
in 3D CAD models for manufacturability analysis of drilled holes. Computer Aided
Geometric Design, 62, 263–275. https://doi.org/10.1016/j.cagd.2018.03.024
Gobert, C., Reutzel, E. W., Petrich, J., Nassar, A. R., & Phoha, S. (2018). Application of
supervised machine learning for defect detection during metallic powder bed fusion
additive manufacturing using high resolution imaging. Additive Manufacturing, 21,
517–528. https://doi.org/10.1016/j.addma.2018.04.005
González Rodríguez, G., Gonzalez-Cava, J. M., & Méndez Pérez, J. A. (2019). An
intelligent decision support system for production planning based on machine
learning. Journal of Intelligent Manufacturing, 31(5), 1257–1273. https://doi.org/
10.1007/s10845-019-01510-y
Gurgenc, T., Ucar, F., Korkmaz, D., Ozel, C., & Ortac, Y. (2019). A study on the extreme
learning machine based prediction of machining times of the cycloidal gears in CNC
milling machines. Production Engineering, 13(6), 635–647. https://doi.org/10.1007/
s11740-019-00923-1
Gyulai, D., Pfeiffer, A., Nick, G., Gallina, V., Sihn, W., & Monostori, L. (2018). Lead time
prediction in a flow-shop environment with analytical and machine learning
approaches. IFAC-PapersOnLine, 51(11), 1029–1034. https://doi.org/10.1016/j.
ifacol.2018.08.472
Li, H. (2016). An approach to improve flexible manufacturing systems with machine
learning algorithms. In IECON 2016–42nd annual conference of the IEEE industrial
electronics society. https://doi.org/10.1109/iecon.2016.7793838
Harding, J., Shahbaz, M., & Kusiak, A. (2006). Data mining in manufacturing: A review.
Journal of Manufacturing Science and Engineering, 128(4), 969–976. https://doi.org/
10.1115/1.2194554
Heger, J., Branke, J., Hildebrandt, T., & Scholz-Reiter, B. (2016). Dynamic adjustment of
dispatching rule parameters in flow shops with sequence-dependent set-up times.
International Journal of Production Research, 54(22), 6812–6824. https://doi.org/
10.1080/00207543.2016.1178406
Hesser, D. F., & Markert, B. (2019). Tool wear monitoring of a retrofitted CNC milling
machine using artificial neural networks. Manufacturing Letters, 19, 1–4. https://doi.
org/10.1016/j.mfglet.2018.11.001
Imoto, K., Nakai, T., Ike, T., Haruki, K., & Sato, Y. (2019). A CNN-based transfer learning
method for defect classification in semiconductor manufacturing. IEEE Transactions
on Semiconductor Manufacturing, 32(4), 455–459. https://doi.org/10.1109/
TSM.2019.2941752
Iqbal, R., Maniak, T., Doctor, F., & Karyotis, C. (2019). Fault detection and isolation in
industrial processes using deep learning approaches. IEEE Transactions on Industrial
Informatics, 15(5), 3077–3084. https://doi.org/10.1109/TII.2019.2902274
Jang, S.-J., Kim, J.-S., Kim, T.-W., Lee, H.-J., & Ko, S. (2019). A wafer map yield
prediction based on machine learning for productivity enhancement. IEEE
Transactions on Semiconductor Manufacturing, 32(4), 400–407. https://doi.org/
10.1109/TSM.2019.2945482
M. Bertolini et al.
https://doi.org/10.1016/j.cirp.2019.04.104
https://doi.org/10.1016/j.cirp.2019.03.021
https://doi.org/10.1016/j.cirp.2019.03.021
https://doi.org/10.1080/0951192x.2019.1571238
https://doi.org/10.1016/j.ijinfomgt.2019.03.004https://doi.org/10.1007/s10845-010-0415-2
https://doi.org/10.1016/j.dss.2008.03.007
https://doi.org/10.1016/j.jmsy.2017.12.001
https://doi.org/10.1016/j.cie.2019.06.052
https://doi.org/10.1016/j.cie.2016.05.009
https://doi.org/10.1016/j.ndteint.2019.102176
https://doi.org/10.1016/j.ndteint.2019.102176
https://doi.org/10.1016/j.ejor.2006.03.040
https://doi.org/10.1016/j.ejor.2006.03.040
https://doi.org/10.1080/00207540210152920
https://doi.org/10.1016/j.ijmachtools.2004.08.016
https://doi.org/10.1016/j.ijmachtools.2004.08.016
https://doi.org/10.1016/j.eswa.2017.03.050
https://doi.org/10.1080/00207540902896204
https://doi.org/10.1016/j.aei.2006.01.001
https://doi.org/10.1080/0951192x.2019.1599430
https://doi.org/10.1016/j.asoc.2016.06.025
https://doi.org/10.1007/s11740-019-00908-0
https://doi.org/10.1016/j.promfg.2017.07.091
https://doi.org/10.1109/icmla.2012.113
https://doi.org/10.1007/s12555-019-0120-7
https://doi.org/10.1007/s12555-019-0120-7
https://doi.org/10.1016/j.eswa.2017.09.030
https://doi.org/10.3390/app7020136
https://doi.org/10.3390/app7020136
https://doi.org/10.1109/Access.628763910.1109/ACCESS.2019.2911620
https://doi.org/10.1109/Access.628763910.1109/ACCESS.2019.2911620
https://doi.org/10.1145/2699836
https://doi.org/10.1016/j.eswa.2014.09.057
https://doi.org/10.1109/tcad.2012.2234827
https://doi.org/10.1016/j.cie.2019.07.002
https://doi.org/10.1016/j.cie.2019.07.002
https://doi.org/10.1109/icat.2009.5348452
https://doi.org/10.1109/tii.2015.2492924
https://doi.org/10.1109/tii.2015.2492924
https://doi.org/10.1016/j.apm.2012.02.016
https://doi.org/10.1002/admt.v4.310.1002/admt.201800653
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0225
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0225
https://doi.org/10.1016/j.cagd.2018.03.024
https://doi.org/10.1016/j.addma.2018.04.005
https://doi.org/10.1007/s10845-019-01510-y
https://doi.org/10.1007/s10845-019-01510-y
https://doi.org/10.1007/s11740-019-00923-1
https://doi.org/10.1007/s11740-019-00923-1
https://doi.org/10.1016/j.ifacol.2018.08.472
https://doi.org/10.1016/j.ifacol.2018.08.472
https://doi.org/10.1109/iecon.2016.7793838
https://doi.org/10.1115/1.2194554
https://doi.org/10.1115/1.2194554
https://doi.org/10.1080/00207543.2016.1178406
https://doi.org/10.1080/00207543.2016.1178406
https://doi.org/10.1016/j.mfglet.2018.11.001
https://doi.org/10.1016/j.mfglet.2018.11.001
https://doi.org/10.1109/TSM.2019.2941752
https://doi.org/10.1109/TSM.2019.2941752
https://doi.org/10.1109/TII.2019.2902274
https://doi.org/10.1109/TSM.2019.2945482
https://doi.org/10.1109/TSM.2019.2945482
Expert Systems With Applications 175 (2021) 114820
27
Jennings, C., Wu, D., & Terpenny, J. (2016). Forecasting obsolescence risk and product
life cycle with machine learning. IEEE Transactions on Components, Packaging and
Manufacturing Technology, 6(9), 1428–1439. https://doi.org/10.1109/
tcpmt.2016.2589206
Ji, S., Wang, X., Zhao, W., & Guo, D. (2019). An application of a three-stage XGBoost-
based model to sales forecasting of a cross-border e-commerce enterprise.
Mathematical Problems in Engineering, 2019, 1–15. https://doi.org/10.1155/2019/
8503252
Jiang, C., & Sheng, Z. (2009). Case-based reinforcement learning for dynamic inventory
control in a multi-agent supply-chain system. Expert Systems with Applications, 36(3),
6520–6526. https://doi.org/10.1016/j.eswa.2008.07.036
Joswiak, M., Peng, Y., Castillo, I., & Chiang, L. H. (2019). Dimensionality reduction for
visualizing industrial chemical process data. Control Engineering Practice, 93, 104189.
https://doi.org/10.1016/j.conengprac.2019.104189
Kammerer, K., Hoppenstedt, B., Pryss, R., Stökler, S., Allgaier, J., & Reichert, M. (2019).
Anomaly detections for manufacturing systems based on sensor data-insights into
two challenging real-world production settings. Sensors, 19(24), 5370. https://doi.
org/10.3390/s19245370
Kankar, P. K., Sharma, S. C., & Harsha, S. P. (2011). Fault diagnosis of ball bearings using
machine learning methods. Expert Systems with Applications, 38(3), 1876–1886.
https://doi.org/10.1016/j.eswa.2010.07.119
Kara, A., & Dogan, I. (2018). Reinforcement learning approaches for specifying ordering
policies of perishable inventory systems. Expert Systems with Applications, 91,
150–158. https://doi.org/10.1016/j.eswa.2017.08.046
Khanzadeh, M., Rao, P., Jafari-Marandi, R., Smith, B. K., Tschopp, M. A., & Bian, L.
(2017). Quantifying geometric accuracy with unsupervised machine learning: using
self-organizing map on fused filament fabrication additive manufacturing parts.
Journal of Manufacturing Science and Engineering, 140(3). https://doi.org/10.1115/
1.4038598
Khanzadeh, M., Chowdhury, S., Marufuzzaman, M., Tschopp, M. A., & Bian, L. (2018).
Porosity prediction: Supervised-learning of thermal history for direct laser
deposition. Journal of Manufacturing Systems, 47, 69–82. https://doi.org/10.1016/j.
jmsy.2018.04.001
Kim, A., Oh, K., Jung, J.-Y., & Kim, B. (2018). Imbalanced classification of manufacturing
quality conditions using cost-sensitive decision tree ensembles. International Journal
of Computer Integrated Manufacturing, 31(8), 701–717. https://doi.org/10.1080/
0951192x.2017.1407447
Kim, C. O., Jun, J., Baek, J. K., Smith, R. L., & Kim, Y. D. (2005). Adaptive inventory
control models for supply chain management. The International Journal of Advanced
Manufacturing Technology, 26(9–10), 1184–1192. https://doi.org/10.1007/s00170-
004-2069-8
Kim, C. O., Kwon, I.-H., & Baek, J.-G. (2008). Asynchronous action-reward learning for
nonstationary serial supply chain inventory control. Applied Intelligence, 28(1), 1–16.
https://doi.org/10.1007/s10489-007-0038-2
Kim, D., Kang, P., Cho, S., Lee, H., & Doh, S. (2012). Machine learning-based novelty
detection for faulty wafer detection in semiconductor manufacturing. Expert Systems
with Applications, 39(4), 4075–4083. https://doi.org/10.1016/j.eswa.2011.09.088
Kim, D., & Kang, S. (2019). Effect of irrelevant variables on faulty wafer detection in
semiconductor manufacturing. Energies, 12(13), 2530. https://doi.org/10.3390/
en12132530
Ko, T., Lee, J. H., Cho, H., Cho, S., Lee, W., & Lee, M. (2017). Machine learning-based
anomaly detection via integration of manufacturing, inspection and after-sales
service data. Industrial Management & Data Systems, 117(5), 927–945. https://doi.
org/10.1108/imds-06-2016-0195
Korinek, A., & Stiglitz, J. E. (2017). Artificial intelligence and its implications for income
distribution and unemployment (No. w24174). National Bureau of Economic
Research. https://doi.org/ 10.3386/w24174.
Kuhnle, A., Jakubik, J., & Lanza, G. (2018). Reinforcement learning for opportunistic
maintenance optimization. Production Engineering, 13(1), 33–41. https://doi.org/
10.1007/s11740-018-0855-7
Kusiak, A., & Kurasek, C. (2001). Data mining of printed-circuit board defects. IEEE
Transactions on Robotics and Automation, 17(2), 191–196. https://doi.org/10.1109/
70.928564
Kusiak, A. (2017). Smart manufacturing must embrace big data. Nature, 544(7648),
23–25. https://doi.org/10.1038/544023a
Kusiak, A. (2018). Smart manufacturing. International Journal of Production Research, 56
(1–2), 508–517. https://doi.org/10.1080/00207543.2017.1351644
Kwon, I., Kim, C., Jun, J., & Lee, J. (2008). Case-based myopic reinforcement learning for
satisfying target service level in supply chain. Expert Systems with Applications, 35
(1–2), 389–397. https://doi.org/10.1016/j.eswa.2007.07.002
LaValle, S., Lesser, E., Shockley, R., Hopkins, M., & Kruschwitz, N. (2011). Big data,
analytics and the path from insights to value. MIT Sloan Management Review, 52(2),
21–31.
LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436–444.
https://doi.org/10.1038/nature14539
Lee, T., Lee, K. B., & Kim, C. O. (2016). Performance of machine learning algorithms for
class-imbalanced process fault detection problems. IEEE Transactions on
Semiconductor Manufacturing,29(4), 436–445. https://doi.org/10.1109/
tsm.2016.2602226
Lenz, B., Barak, B., Muhrwald, J., Leicht, C., & Lenz, B. (2013). Virtual metrology in
semiconductor manufacturing by means of predictive machine learning models. In
2013 12th international conference on machine learning and applications. https://doi.
org/10.1109/icmla.2013.186
Li, S., Liu, G., Tang, X., Lu, J., & Hu, J. (2017). An ensemble deep convolutional neural
network model with improved D-S evidence fusion for bearing fault diagnosis.
Sensors, 17(8), 1729. https://doi.org/10.3390/s17081729
Li, Y. (2017). Deep reinforcement learning: An overview. arXiv preprint arXiv:
1701.07274.
Li, Z., Liu, R., & Wu, D. (2019). Data-driven smart manufacturing: Tool wear monitoring
with audio signals and machine learning. Journal of Manufacturing Processes, 48,
66–76. https://doi.org/10.1016/j.jmapro.2019.10.020
Liker, J. K. (2004). The Toyota Way: 14 management principles from the world’s greatest
manufacturer. New York: McGraw-Hill.
Lin, C.-C., Deng, D.-J., Chih, Y.-L., & Chiu, H.-T. (2019). Smart manufacturing scheduling
with edge computing using multiclass deep Q network. IEEE Transactions on
Industrial Informatics, 15(7), 4276–4284. https://doi.org/10.1109/TII.942410.1109/
TII.2019.2908210
Lin, S.-Y., Guh, R.-S., & Shiue, Y.-R. (2011). Effective recognition of control chart
patterns in autocorrelated data using a support vector machine based approach.
Computers & Industrial Engineering, 61(4), 1123–1134. https://doi.org/10.1016/j.
cie.2011.06.025
Liu, J., An, Y., Dou, R., & Ji, H. (2018a). Dynamic deep learning algorithm based on
incremental compensation for fault diagnosis model. International Journal of
Computational Intelligence Systems, 11(1), 846. https://doi.org/10.2991/ijcis.11.1.64
Liu, J., Hu, Y., Wu, B., & Wang, Y. (2018b). An improved fault diagnosis approach for
FDM process with acoustic emission. Journal of Manufacturing Processes, 35,
570–579. https://doi.org/10.1016/j.jmapro.2018.08.038
Liu, Z., Jia, Z., Vong, C.-M., Bu, S., Han, J., & Tang, X. (2017). Capturing high-
discriminative fault features for electronics-rich analog system via deep learning.
IEEE Transactions on Industrial Informatics, 13(3), 1213–1226. https://doi.org/
10.1109/tii.2017.2690940
Loyer, J.-L., Henriques, E., Fontul, M., & Wiseall, S. (2016). Comparison of Machine
Learning methods applied to the estimation of manufacturing cost of jet engine
components. International Journal of Production Economics, 178, 109–119. https://
doi.org/10.1016/j.ijpe.2016.05.006
Lu, Y. (2017). Industry 4.0: A survey on technologies, applications and open research
issues. Journal of Industrial Integration Information, 6, 1–10. https://doi.org/10.1016/
j.jii.2017.04.005
Ma, Y., Zhu, W., Benton, M. G., & Romagnoli, J. (2019). Continuous control of a
polymerization system with deep reinforcement learning. Journal of Process Control,
75, 40–47. https://doi.org/10.1016/j.jprocont.2018.11.004
Maggipinto, M., Terzi, M., Masiero, C., Beghi, A., & Susto, G. A. (2018). A computer
vision-inspired deep learning architecture for virtual metrology modeling with 2-
dimensional data. IEEE Transactions on Semiconductor Manufacturing, 31(3),
376–384. https://doi.org/10.1109/tsm.2018.2849206
Manyika, J., Lund, S., Chui, M., Bughin, J., Woetzel, J., Batra, P., Ko, R., & Sanghvi, S.
(2017). Job lost job gained: what the future of work will mean for jobs, skills and
wages. McKinsey report, access on line at: https://www.mckinsey.com/featured-
insights/future-of-work/jobs-lost-jobs-gained-what-the-future-of-work-will-mean-
for-jobs-skills-and-wages#.
Manohar, K., Hogan, T., Buttrick, J., Banerjee, A. G., Kutz, J. N., & Brunton, S. L. (2018).
Predicting shim gaps in aircraft assembly with machine learning and sparse sensing.
Journal of Manufacturing Systems, 48, 87–95. https://doi.org/10.1016/j.
jmsy.2018.01.011
Martínez-Díaz, M., & Soriguera, F. (2018). Autonomous vehicles: Theoretical and
practical challenges. Transportation Research Procedia, 33, 275–282. https://doi.org/
10.1016/j.trpro.2018.10.103
McAfee, A., Brynjolfsson, E., & Davenport, T. (2012). Big data: The management
revolution. Harvard Business Review, 90(10), 60–68.
Meidan, Y., Lerner, B., Rabinowitz, G., & Hassoun, M. (2011). Cycle-time key factor
identification and prediction in semiconductor manufacturing using machine
learning and data mining. IEEE Transactions on Semiconductor Manufacturing, 24(2),
237–248. https://doi.org/10.1109/tsm.2011.2118775
Mezzogori, D., & Zammori, F. (2019). An entity embeddings deep learning approach for
demand forecast of highly differentiated products. Procedia Manufacturing, 39,
1793–1800. https://doi.org/10.1016/j.promfg.2020.01.260
Mezzogori, D., Romagnoli, G., & Zammori, F. (2020). Defining accurate delivery dates in
make to order job-shops managed by workload control. Flexible Services and
Manufacturing Journal, 1–36. https://doi.org/10.1007/s10696-020-09396-2
Mittal, S., Khan, M., Romero, D., & Wuest, T. (2016). Smart manufacturing:
Characteristics and technologies. In International conference on product lifecycle
management (pp. 539–548). Cham: Springer, 10.1007/978-3-319-54660-5_48.
Mohammadi, P., & Wang, Z. J. (2016). Machine learning for quality prediction in
abrasion-resistant material manufacturing process. In 2016 IEEE Canadian conference
on electrical and computer engineering (CCECE). https://doi.org/10.1109/
ccece.2016.7726783
Mönch, L., Zimmermann, J., & Otto, P. (2006). Machine learning techniques for
scheduling jobs with incompatible families and unequal ready times on parallel
batch machines. Engineering Applications of Artificial Intelligence, 19(3), 235–245.
https://doi.org/10.1016/j.engappai.2005.10.001
Monostori, L. (2003). AI and machine learning techniques for managing complexity,
changes and uncertainties in manufacturing. Engineering Applications of Artificial
Intelligence, 16(4), 277–291. https://doi.org/10.1016/S0952-1976(03)00078-2
Montavon, G., Samek, W., & Müller, K.-R. (2018). Methods for interpreting and
understanding deep neural networks. Digital Signal Processing: a review journal, 73,
1–15. https://doi.org/10.1016/j.dsp.2017.10.011
Mortazavi, A., Arshadi Khamseh, A., & Azimi, P. (2015). Designing of an intelligent self-
adaptive model for supply chain ordering management system. Engineering
Applications of Artificial Intelligence, 37, 207–220. https://doi.org/10.1016/j.
engappai.2014.09.004
Müller, M. (2002). Computer go. Artificial Intelligence, 134(1–2), 145–179. https://doi.
org/10.1016/S0004-3702(01)00121-7
M. Bertolini et al.
https://doi.org/10.1109/tcpmt.2016.2589206
https://doi.org/10.1109/tcpmt.2016.2589206
https://doi.org/10.1155/2019/8503252
https://doi.org/10.1155/2019/8503252
https://doi.org/10.1016/j.eswa.2008.07.036
https://doi.org/10.1016/j.conengprac.2019.104189
https://doi.org/10.3390/s19245370
https://doi.org/10.3390/s19245370
https://doi.org/10.1016/j.eswa.2010.07.119
https://doi.org/10.1016/j.eswa.2017.08.046
https://doi.org/10.1115/1.4038598
https://doi.org/10.1115/1.4038598
https://doi.org/10.1016/j.jmsy.2018.04.001
https://doi.org/10.1016/j.jmsy.2018.04.001
https://doi.org/10.1080/0951192x.2017.1407447
https://doi.org/10.1080/0951192x.2017.1407447
https://doi.org/10.1007/s00170-004-2069-8
https://doi.org/10.1007/s00170-004-2069-8
https://doi.org/10.1007/s10489-007-0038-2
https://doi.org/10.1016/j.eswa.2011.09.088
https://doi.org/10.3390/en12132530
https://doi.org/10.3390/en12132530
https://doi.org/10.1108/imds-06-2016-0195
https://doi.org/10.1108/imds-06-2016-0195
https://doi.org/10.1007/s11740-018-0855-7
https://doi.org/10.1007/s11740-018-0855-7
https://doi.org/10.1109/70.928564
https://doi.org/10.1109/70.928564https://doi.org/10.1038/544023a
https://doi.org/10.1080/00207543.2017.1351644
https://doi.org/10.1016/j.eswa.2007.07.002
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0395
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0395
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0395
https://doi.org/10.1038/nature14539
https://doi.org/10.1109/tsm.2016.2602226
https://doi.org/10.1109/tsm.2016.2602226
https://doi.org/10.1109/icmla.2013.186
https://doi.org/10.1109/icmla.2013.186
https://doi.org/10.3390/s17081729
https://doi.org/10.1016/j.jmapro.2019.10.020
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0430
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0430
https://doi.org/10.1109/TII.942410.1109/TII.2019.2908210
https://doi.org/10.1109/TII.942410.1109/TII.2019.2908210
https://doi.org/10.1016/j.cie.2011.06.025
https://doi.org/10.1016/j.cie.2011.06.025
https://doi.org/10.2991/ijcis.11.1.64
https://doi.org/10.1016/j.jmapro.2018.08.038
https://doi.org/10.1109/tii.2017.2690940
https://doi.org/10.1109/tii.2017.2690940
https://doi.org/10.1016/j.ijpe.2016.05.006
https://doi.org/10.1016/j.ijpe.2016.05.006
https://doi.org/10.1016/j.jii.2017.04.005
https://doi.org/10.1016/j.jii.2017.04.005
https://doi.org/10.1016/j.jprocont.2018.11.004
https://doi.org/10.1109/tsm.2018.2849206
https://doi.org/10.1016/j.jmsy.2018.01.011
https://doi.org/10.1016/j.jmsy.2018.01.011
https://doi.org/10.1016/j.trpro.2018.10.103
https://doi.org/10.1016/j.trpro.2018.10.103
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0495
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0495
https://doi.org/10.1109/tsm.2011.2118775
https://doi.org/10.1016/j.promfg.2020.01.260
https://doi.org/10.1007/s10696-020-09396-2
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0515
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0515
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0515
https://doi.org/10.1109/ccece.2016.7726783
https://doi.org/10.1109/ccece.2016.7726783
https://doi.org/10.1016/j.engappai.2005.10.001
https://doi.org/10.1016/S0952-1976(03)00078-2
https://doi.org/10.1016/j.dsp.2017.10.011
https://doi.org/10.1016/j.engappai.2014.09.004
https://doi.org/10.1016/j.engappai.2014.09.004
https://doi.org/10.1016/S0004-3702(01)00121-7
https://doi.org/10.1016/S0004-3702(01)00121-7
Expert Systems With Applications 175 (2021) 114820
28
Murphy, K. (2012). Machine learning: A probabilistic perspective. Cambridge: The MIT
Press.
Nakata, K., Orihara, R., Mizuoka, Y., & Takagi, K. (2017). A comprehensive big-data-
based monitoring system for yield enhancement in semiconductor manufacturing.
IEEE Transactions on Semiconductor Manufacturing, 30(4), 339–344. https://doi.org/
10.1109/tsm.2017.2753251
Nugroho, A. S., Kuroyanagi, S., & Iwata, A. (2002). A solution for imbalanced training
sets problem by CombNET-II and its application to fog forecast. IEICE transactions in
Information and Systems, 85(7), 1165–1174.
Oh, Y., Busogi, M., Ransikarbum, K., Shin, D., Kwon, D., & Kim, N. (2019a). Real-time
quality monitoring and control system using an integrated cost effective support
vector machine. Journal of Mechanical Science and Technology, 33(12), 6009–6020.
https://doi.org/10.1007/s12206-019-1145-9
Oh, Y., Ransikarbum, K., Busogi, M., Kwon, D., & Kim, N. (2019b). Adaptive SVM-based
real-time quality assessment for primer-sealer dispensing process of sunroof
assembly line. Reliability Engineering & System Safety, 184, 202–212. https://doi.org/
10.1016/j.ress.2018.03.020
Palombarini, J., & Martínez, E. (2012). SmartGantt – An intelligent system for real time
rescheduling based on relational reinforcement learning. Expert Systems with
Applications, 39(11), 10251–10268. https://doi.org/10.1016/j.eswa.2012.02.176
Papananias, M., McLeay, T. E., Mahfouf, M., & Kadirkamanathan, V. (2019). A Bayesian
framework to estimate part quality and associated uncertainties in multistage
manufacturing. Computers in Industry, 105, 35–47. https://doi.org/10.1016/j.
compind.2018.10.008
Penumuru, D. P., Muthuswamy, S., & Karumbu, P. (2019). Identification and
classification of materials using machine vision and machine learning in the context
of industry 4.0. Journal of Intelligent Manufacturing., 31(5), 1229–1241. https://doi.
org/10.1007/s10845-019-01508-6
Peres, R. S., Barata, J., Leitao, P., & Garcia, G. (2019). Multistage quality control using
machine learning in the automotive industry. IEEE Access, 7, 79908–79916. https://
doi.org/10.1109/ACCESS.2019.2923405
Perzyk, M., Kochanski, A., Kozlowski, J., Soroczynski, A., & Biernacki, R. (2014).
Comparison of data mining tools for significance analysis of process parameters in
applications to process fault diagnosis. Information Sciences, 259, 380–392. https://
doi.org/10.1016/j.ins.2013.10.019
Pham, D. T., & Afify, A. A. (2005). Machine-learning techniques and their applications in
manufacturing. Proceedings of the Institution of Mechanical Engineers, Part B: Journal of
Engineering Manufacture, 219(5), 395–412. https://doi.org/10.1243/
095440505X32274
Prieto, M. D., Cirrincione, G., Espinosa, A. G., Ortega, J. A., & Henao, H. (2013). Bearing
fault detection by a novel condition-monitoring scheme based on statistical-time
features and neural networks. IEEE Transactions on Industrial Electronics, 60(8),
3398–3407. https://doi.org/10.1109/tie.2012.2219838
Priore, P., De La Fuente, D., Gomez, A., & Puente, J. (2001). Dynamic scheduling of
manufacturing systems with machine learning. International Journal of Foundations of
Computer Science, 12(06), 751–762. https://doi.org/10.1142/s0129054101000849
Priore, P., de la Fuente, D., Puente, J., & Parreño, J. (2006). A comparison of machine-
learning algorithms for dynamic scheduling of flexible manufacturing systems.
Engineering Applications of Artificial Intelligence, 19(3), 247–255. https://doi.org/
10.1016/j.engappai.2005.09.009
Priore, P., Priore, J., Pino, R., Gomez, A., & Puente, J. (2010). Learning-based scheduling
of flexible manufacturing systems using support vector machines. Applied Artificial
Intelligence, 24(3), 194–209. https://doi.org/10.1080/08839510903549606
Priore, P., Ponte, B., Puente, J., & Gomez, A. (2018). Learning-based scheduling of
flexible manufacturing systems using ensemble methods. Computers & Industrial
Engineering, 126, 282–291. https://doi.org/10.1016/j.cie.2018.09.034
Priore, P., Ponte, B., Rosillo, R., & de la Fuente, D. (2019). Applying machine learning to
the dynamic selection of replenishment policies in fast-changing supply chain
environments. International Journal of Production Research, 57(11), 3663–3677.
https://doi.org/10.1080/00207543.2018.1552369
Ravikumar, S., Ramachandran, K. I., & Sugumaran, V. (2011). Machine learning
approach for automated visual inspection of machine components. Expert Systems
with Applications, 38(4), 3260–3266. https://doi.org/10.1016/j.eswa.2010.09.012
Ren, L., Sun, Y., Cui, J., & Zhang, L. (2018). Bearing remaining useful life prediction
based on deep autoencoder and deep neural networks. Journal of Manufacturing
Systems, 48, 71–77. https://doi.org/10.1016/j.jmsy.2018.04.008
Ribeiro, B. (2005). Support vector machines for quality monitoring in a plastic injection
molding process. IEEE Transactions on Systems, Man and Cybernetics, Part C
(Applications and Reviews), 35(3), 401–410. https://doi.org/10.1109/
tsmcc.2004.843228
Rude, D. J., Adams, S., & Beling, P. A. (2015). Task recognition from joint tracking data
in an operational manufacturing cell. Journal of Intelligent Manufacturing, 29(6),
1203–1217. https://doi.org/10.1007/s10845-015-1168-8
Ruiz, E., Cuartas, M., Ferreno, D., Romero, L., Arroyo, V., & Gutierrez-Solana, F. (2019).
Optimization of the fabrication of cold drawn steel wire through classification and
clustering. Machine Learning Algorithms IEEE Access, 7, 141689–141700. https://doi.
org/10.1109/ACCESS.2019.2942957
Saqlain, M., Jargalsaikhan, B., & Lee, J. Y. (2019). A voting ensemble classifierfor wafer
map defect patterns identification in semiconductor manufacturing. IEEE
Transactions on Semiconductor Manufacturing, 32(2), 171–182. https://doi.org/
10.1109/TSM.6610.1109/TSM.2019.2904306
Saucedo-Espinosa, M. A., Escalante, H. J., & Berrones, A. (2014). Detection of defective
embedded bearings by sound analysis: A machine learning approach. Journal of
Intelligent Manufacturing, 28(2), 489–500. https://doi.org/10.1007/s10845-014-
1000-x
Saxena, A., & Saad, A. (2007). Evolving an artificial neural network classifier for
condition monitoring of rotating mechanical systems. Applied Soft Computing, 7(1),
441–454. https://doi.org/10.1016/j.asoc.2005.10.001
Scime, L., & Beuth, J. (2018). A multi-scale convolutional neural network for
autonomous anomaly detection and classification in a laser powder bed fusion
additive manufacturing process. Additive Manufacturing, 24, 273–286. https://doi.
org/10.1016/j.addma.2018.09.034
Scime, L., & Beuth, J. (2019). Using machine learning to identify in-situ melt pool
signatures indicative of flaw formation in a laser powder bed fusion additive
manufacturing process. Additive Manufacturing, 25, 151–165. https://doi.org/
10.1016/j.addma.2018.11.010
Selvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., & Batra, D. (2017).
Grad-cam: Visual explanations from deep networks via gradient-based localization.
In Proceedings of the IEEE international conference on computer vision (pp. 618–626).
Shin, M., Ryu, K., & Jung, M. (2012). Reinforcement learning approach to goal-
regulation in a self-evolutionary manufacturing system. Expert Systems with
Applications, 39(10), 8736–8743. https://doi.org/10.1016/j.eswa.2012.01.207
Tan, S. C., Watada, J., Ibrahim, Z., & Khalid, M. (2015). Evolutionary fuzzy ARTMAP
neural networks for classification of semiconductor defects. IEEE Transactions on
Neural Networks and Learning Systems, 26(5), 933–950. https://doi.org/10.1109/
tnnls.2014.2329097
Shiue, Y.-R. (2009). Development of two-level decision tree-based real-time scheduling
system under product mix variety environment. Robotics and Computer-Integrated
Manufacturing, 25(4–5), 709–720. https://doi.org/10.1016/j.rcim.2008.06.002
Shiue, Y.-R., Guh, R.-S., & Lee, K.-C. (2011). Study of SOM-based intelligent multi-
controller for real-time scheduling. Applied Soft Computing, 11(8), 4569–4580.
https://doi.org/10.1016/j.asoc.2011.07.022
Shiue, Y.-R., Guh, R., & Lee, K. (2012). Development of machine learning-based real time
scheduling systems: Using ensemble based on wrapper feature selection approach.
International Journal of Production Research, 50(20), 5887–5905. https://doi.org/
10.1080/00207543.2011.636389
Silbernagel, C., Aremu, A., & Ashcroft, I. (2019). Using machine learning to aid in the
parameter optimisation process for metal-based additive manufacturing. Rapid
Prototyping Journal, 26(4), 625–637. https://doi.org/10.1108/RPJ-08-2019-0213
Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., van den Driessche, G., …
Hassabis, D. (2016). Mastering the game of Go with deep neural networks and tree
search. Nature, 529(7587), 484–489. https://doi.org/10.1038/nature16961
Silver, D., Hubert, T., Schrittwieser, J., Antonoglou, I., Lai, M., Guez, A., Hassabis, D.
(2017). Mastering Chess and Shogi by Self-Play with a General Reinforcement
Learning Algorithm. arXiv preprint arXiv:1712.01815.
Simon, H. (1983). Machine learning: An artificial intelligence approach. Tioga Press.
Sobie, C., Freitas, C., & Nicolai, M. (2018). Simulation-driven machine learning: Bearing
fault classification. Mechanical Systems and Signal Processing, 99, 403–419. https://
doi.org/10.1016/j.ymssp.2017.06.025
Stathatos, E., & Vosniakos, G.-C. (2019). Real-time simulation for long paths in laser-
based additive manufacturing: A machine learning approach. The International
Journal of Advanced Manufacturing Technology, 104(5–8), 1967–1984. https://doi.
org/10.1007/s00170-019-04004-6
Stocker, C., Schmid, M., & Reinhart, G. (2019). Reinforcement learning-based design of
orienting devices for vibratory bowl feeders. The International Journal of Advanced
Manufacturing Technology, 105(9), 3631–3642. https://doi.org/10.1007/s00170-
019-03798-9
Stoyanov, S., Ahsan, M., Bailey, C., Wotherspoon, T., & Hunt, C. (2019). Predictive
analytics methodology for smart qualification testing of electronic components.
Journal of Intelligent Manufacturing, 30(3), 1497–1514. https://doi.org/10.1007/
s10845-018-01462-9
Sun, J., Rahman, M., Wong, Y., & Hong, G. (2004). Multiclassification of tool wear with
support vector machine by manufacturing loss consideration. International Journal of
Machine Tools and Manufacture, 44(11), 1179–1187. https://doi.org/10.1016/j.
ijmachtools.2004.04.003
Susto, G. A., Schirru, A., Pampuri, S., McLoone, S., & Beghi, A. (2015). Machine learning
for predictive maintenance: A multiple classifier approach. IEEE Transactions on
Industrial Informatics, 11(3), 812–820. https://doi.org/10.1109/tii.2014.2349359
Sutton, R., & Barto, A. (1998). Reinforcement learning: An introduction. Cambridge: MIT
press.
Syafrudin, M., Alfian, G., Fitriyani, N., & Rhee, J. (2018). Performance Analysis of IoT-
based sensor, big data processing, and machine learning model for real-time
monitoring system in automotive manufacturing. Sensors, 18(9), 2946. https://doi.
org/10.3390/s18092946
Tan, Q., Tong, Y., Wu, S., & Li, D. (2019). Modeling, planning, and scheduling of shop-
floor assembly process with dynamic cyber-physical interactions: A case study for
CPS-based smart industrial robot production. The International Journal of Advanced
Manufacturing Technology, 105(9), 3979–3989. https://doi.org/10.1007/s00170-
019-03940-7
Tsutsui, T., & Matsuzawa, T. (2019). Virtual metrology model robustness against
chamber condition variation using deep learning. IEEE Transactions on Semiconductor
Manufacturing, 32(4), 428–433. https://doi.org/10.1109/TSM.6610.1109/
TSM.2019.2931328
Tulsyan, A., Garvin, C., & Ündey, C. (2018). Advances in industrial biopharmaceutical
batch process monitoring: Machine-learning methods for small data problems.
Biotechnology and Bioengineering, 115(8), 1915–1924. https://doi.org/10.1002/bit.
v115.810.1002/bit.26605
Tušar, T., Gantar, K., Koblar, V., Ženko, B., & Filipič, B. (2017). A study of overfitting in
optimization of a manufacturing quality control procedure. Applied Soft Computing,
59, 77–87. https://doi.org/10.1016/j.asoc.2017.05.027
M. Bertolini et al.
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0550
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0550
https://doi.org/10.1109/tsm.2017.2753251
https://doi.org/10.1109/tsm.2017.2753251
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0560
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0560
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0560
https://doi.org/10.1007/s12206-019-1145-9
https://doi.org/10.1016/j.ress.2018.03.020
https://doi.org/10.1016/j.ress.2018.03.020
https://doi.org/10.1016/j.eswa.2012.02.176
https://doi.org/10.1016/j.compind.2018.10.008
https://doi.org/10.1016/j.compind.2018.10.008
https://doi.org/10.1007/s10845-019-01508-6
https://doi.org/10.1007/s10845-019-01508-6
https://doi.org/10.1109/ACCESS.2019.2923405
https://doi.org/10.1109/ACCESS.2019.2923405
https://doi.org/10.1016/j.ins.2013.10.019
https://doi.org/10.1016/j.ins.2013.10.019
https://doi.org/10.1243/095440505X32274
https://doi.org/10.1243/095440505X32274
https://doi.org/10.1109/tie.2012.2219838
https://doi.org/10.1142/s0129054101000849
https://doi.org/10.1016/j.engappai.2005.09.009
https://doi.org/10.1016/j.engappai.2005.09.009
https://doi.org/10.1080/08839510903549606
https://doi.org/10.1016/j.cie.2018.09.034https://doi.org/10.1080/00207543.2018.1552369
https://doi.org/10.1016/j.eswa.2010.09.012
https://doi.org/10.1016/j.jmsy.2018.04.008
https://doi.org/10.1109/tsmcc.2004.843228
https://doi.org/10.1109/tsmcc.2004.843228
https://doi.org/10.1007/s10845-015-1168-8
https://doi.org/10.1109/ACCESS.2019.2942957
https://doi.org/10.1109/ACCESS.2019.2942957
https://doi.org/10.1109/TSM.6610.1109/TSM.2019.2904306
https://doi.org/10.1109/TSM.6610.1109/TSM.2019.2904306
https://doi.org/10.1007/s10845-014-1000-x
https://doi.org/10.1007/s10845-014-1000-x
https://doi.org/10.1016/j.asoc.2005.10.001
https://doi.org/10.1016/j.addma.2018.09.034
https://doi.org/10.1016/j.addma.2018.09.034
https://doi.org/10.1016/j.addma.2018.11.010
https://doi.org/10.1016/j.addma.2018.11.010
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0685
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0685
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0685
https://doi.org/10.1016/j.eswa.2012.01.207
https://doi.org/10.1109/tnnls.2014.2329097
https://doi.org/10.1109/tnnls.2014.2329097
https://doi.org/10.1016/j.rcim.2008.06.002
https://doi.org/10.1016/j.asoc.2011.07.022
https://doi.org/10.1080/00207543.2011.636389
https://doi.org/10.1080/00207543.2011.636389
https://doi.org/10.1108/RPJ-08-2019-0213
https://doi.org/10.1038/nature16961
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0730
https://doi.org/10.1016/j.ymssp.2017.06.025
https://doi.org/10.1016/j.ymssp.2017.06.025
https://doi.org/10.1007/s00170-019-04004-6
https://doi.org/10.1007/s00170-019-04004-6
https://doi.org/10.1007/s00170-019-03798-9
https://doi.org/10.1007/s00170-019-03798-9
https://doi.org/10.1007/s10845-018-01462-9
https://doi.org/10.1007/s10845-018-01462-9
https://doi.org/10.1016/j.ijmachtools.2004.04.003
https://doi.org/10.1016/j.ijmachtools.2004.04.003
https://doi.org/10.1109/tii.2014.2349359
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0765
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0765
https://doi.org/10.3390/s18092946
https://doi.org/10.3390/s18092946
https://doi.org/10.1007/s00170-019-03940-7
https://doi.org/10.1007/s00170-019-03940-7
https://doi.org/10.1109/TSM.6610.1109/TSM.2019.2931328
https://doi.org/10.1109/TSM.6610.1109/TSM.2019.2931328
https://doi.org/10.1002/bit.v115.810.1002/bit.26605
https://doi.org/10.1002/bit.v115.810.1002/bit.26605
https://doi.org/10.1016/j.asoc.2017.05.027
Expert Systems With Applications 175 (2021) 114820
29
Van Hasselt, H., Guez, A., & Silver, D. (2015). Deep reinforcement learning with double
q-learning. arXiv preprint arXiv:1509.06461.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., …
Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information
Processing Systems, 5998–6008.
Villegas, M. A., Pedregal, D. J., & Trapero, J. R. (2018). A support vector machine for
model selection in demand forecasting applications. Computers & Industrial
Engineering, 121, 1–7. https://doi.org/10.1016/j.cie.2018.04.042
Waller, M. A., & Fawcett, S. E. (2013). Data science, predictive analytics, and big data: A
revolution that will transform supply chain design and management. Journal of
Business Logistics, 34(2), 77–84.
Wan, J., Tang, S., Li, D., Wang, S., Liu, C., Abbas, H., & Vasilakos, A. V. (2017).
A manufacturing big data solution for active preventive maintenance. IEEE
Transactions on Industrial Informatics, 13(4), 2039–2047. https://doi.org/10.1109/
tii.2017.2670505
Wang, J., Yan, J., Li, C., Gao, R. X., & Zhao, R. (2019). Deep heterogeneous GRU model
for predictive analytics in smart manufacturing: Application to tool wear prediction.
Computers in Industry, 111, 1–14. https://doi.org/10.1016/j.compind.2019.06.001
Wang, P., Liu, H., Wang, L., & Gao, R. X. (2018). Deep learning-based human motion
recognition for predictive context-aware human-robot collaboration. CIRP Annals,
67(1), 17–20. https://doi.org/10.1016/j.cirp.2018.04.066
Watkins, C. J. C. H. (1989). Learning from delayed rewards. PhD thesis. England:
University of Cambridge.
Widodo, A., & Yang, B.-S. (2007). Support vector machine in machine condition
monitoring and fault diagnosis. Mechanical Systems and Signal Processing, 21(6),
2560–2574. https://doi.org/10.1016/j.ymssp.2006.12.007
Wolpert, D. H., & Macready, W. G. (1997). No free lunch theorems for optimization. IEEE
Transactions on Evolutionary Computation, 1(1), 67–82. https://doi.org/10.1109/
4235.585893
Wu, Q. (2010). Product demand forecasts using wavelet kernel support vector machine
and particle swarm optimization in manufacture system. Journal of Computational
and Applied Mathematics, 233(10), 2481–2491. https://doi.org/10.1016/j.
cam.2009.10.030
Wu, H., Yu, Z., & Wang, Y. (2019). Experimental study of the process failure diagnosis in
additive manufacturing based on acoustic emission. Measurement, 136, 445–453.
https://doi.org/10.1016/j.measurement.2018.12.067
Wuest, T., Irgens, C., & Thoben, K.-D. (2013). An approach to monitoring quality in
manufacturing using supervised machine learning on product state data. Journal of
Intelligent Manufacturing, 25(5), 1167–1180. https://doi.org/10.1007/s10845-013-
0761-y
Wuest, T., Weimer, D., Irgens, C., & Thoben, K. D. (2016). Machine learning in
manufacturing: Advantages, challenges, and applications. Production &
Manufacturing Research, 4(1), 23–45. https://doi.org/10.1080/
21693277.2016.1192517
Yacob, F., Semere, D., & Nordgren, E. (2019). Anomaly detection in Skin Model Shapes
using machine learning classifiers. The International Journal of Advanced
Manufacturing Technology, 105(9), 3677–3689. https://doi.org/10.1007/s00170-
019-03794-z
Xu, L. D., Xu, E. L., & Li, L. (2018). Industry 4.0: State of the art and future trends.
International Journal of Production Research, 56(8), 2941–2962. https://doi.org/
10.1080/00207543.2018.1444806
Yang, W.-A., & Zhou, W. (2015). Autoregressive coefficient-invariant control chart
pattern recognition in autocorrelated manufacturing processes using neural network
ensemble. Journal of Intelligent Manufacturing, 26(6), 1161–1180. https://doi.org/
10.1007/s10845-013-0847-6
Yang, Y., Lou, Y., Gao, M., & Ma, G. (2018). An automatic aperture detection system for
LED cup based on machine vision. Multimedia Tools and Applications, 77(18),
23227–23244. https://doi.org/10.1007/s11042-018-5639-8
Yu, J. (2019). Enhanced stacked denoising autoencoder-based feature learning for
recognition of wafer map defects. IEEE Transactions on Semiconductor Manufacturing,
32(4), 613–624. https://doi.org/10.1109/TSM.6610.1109/TSM.2019.2940334
Yu, J., Zheng, X., & Wang, S. (2019). A deep autoencoder feature learning method for
process pattern recognition. Journal of Process Control, 79, 1–15. https://doi.org/
10.1016/j.jprocont.2019.05.002
Yuan, B., Guss, G. M., Wilson, A. C., Hau-Riege, S. P., DePond, P. J., McMains, S., …
Giera, B. (2018). Machine-learning-based monitoring of laser powder bed fusion.
Advanced Materials Technologies, 3(12), 1800136. https://doi.org/10.1002/admt.
v3.1210.1002/admt.201800136
Zan, T., Liu, Z., Wang, H., Wang, M., & Gao, X. (2019). Control chart pattern recognition
using the convolutional neural network. Journal of Intelligent Manufacturing, 31(3),
703–716. https://doi.org/10.1007/s10845-019-01473-0
Zarandi, M. H. F., Moosavi, S. V., & Zarinbal, M. (2012). A fuzzy reinforcement learning
algorithm for inventory control in supply chains. The International Journal of
Advanced Manufacturing Technology, 65(1–4), 557–569. https://doi.org/10.1007/
s00170-012-4195-z
Zhang, J., Wang, P., & Gao, R. X. (2019a). Deep learning-based tensile strength
prediction in fused deposition modeling. Computers in Industry, 107, 11–21. https://
doi.org/10.1016/j.compind.2019.01.011
Zhang, X., Chen, W., Wang, B., & Chen, X. (2015). Intelligent fault diagnosis of rotating
machinery using support vector machine with ant colony algorithm for synchronous
feature selection and parameter optimization. Neurocomputing,167, 260–279.
https://doi.org/10.1016/j.neucom.2015.04.069
Zhang, Y., Harik, R., Fadel, G., & Bernard, A. (2019b). A statistical method for build
orientation determination in additive manufacturing. Rapid Prototyping Journal, 25
(1), 187–207. https://doi.org/10.1108/rpj-04-2018-0102
Zhu, Z., Anwer, N., Huang, Q., & Mathieu, L. (2018). Machine learning in tolerancing for
additive manufacturing. CIRP Annals, 67(1), 157–160. https://doi.org/10.1016/j.
cirp.2018.04.119
M. Bertolini et al.
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0800
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0800
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0800
https://doi.org/10.1016/j.cie.2018.04.042
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0810
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0810
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0810
https://doi.org/10.1109/tii.2017.2670505
https://doi.org/10.1109/tii.2017.2670505
https://doi.org/10.1016/j.compind.2019.06.001
https://doi.org/10.1016/j.cirp.2018.04.066
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0830
http://refhub.elsevier.com/S0957-4174(21)00261-X/h0830
https://doi.org/10.1016/j.ymssp.2006.12.007
https://doi.org/10.1109/4235.585893
https://doi.org/10.1109/4235.585893
https://doi.org/10.1016/j.cam.2009.10.030
https://doi.org/10.1016/j.cam.2009.10.030
https://doi.org/10.1016/j.measurement.2018.12.067
https://doi.org/10.1007/s10845-013-0761-y
https://doi.org/10.1007/s10845-013-0761-y
https://doi.org/10.1080/21693277.2016.1192517
https://doi.org/10.1080/21693277.2016.1192517
https://doi.org/10.1007/s00170-019-03794-z
https://doi.org/10.1007/s00170-019-03794-z
https://doi.org/10.1080/00207543.2018.1444806
https://doi.org/10.1080/00207543.2018.1444806
https://doi.org/10.1007/s10845-013-0847-6
https://doi.org/10.1007/s10845-013-0847-6
https://doi.org/10.1007/s11042-018-5639-8
https://doi.org/10.1109/TSM.6610.1109/TSM.2019.2940334
https://doi.org/10.1016/j.jprocont.2019.05.002
https://doi.org/10.1016/j.jprocont.2019.05.002
https://doi.org/10.1002/admt.v3.1210.1002/admt.201800136
https://doi.org/10.1002/admt.v3.1210.1002/admt.201800136
https://doi.org/10.1007/s10845-019-01473-0
https://doi.org/10.1007/s00170-012-4195-z
https://doi.org/10.1007/s00170-012-4195-z
https://doi.org/10.1016/j.compind.2019.01.011
https://doi.org/10.1016/j.compind.2019.01.011
https://doi.org/10.1016/j.neucom.2015.04.069
https://doi.org/10.1108/rpj-04-2018-0102
https://doi.org/10.1016/j.cirp.2018.04.119
https://doi.org/10.1016/j.cirp.2018.04.119
Machine Learning for industrial applications: A comprehensive literature review
1 Introduction
2 A brief introduction of Machine Learning theory
2.1 Machine Learning areas
2.1.1 Supervised Learning (SL)
2.1.2 Unsupervised Learning (UL)
2.1.3 Reinforcement Learning (RL)
3 Searching methodology
3.1 Initial query-based search
3.2 Search enlargement
3.2.1 Cross-reference analysis
3.2.2 Relevance assessment through citation graph analysis
3.3 Abstract analysis and final screening of the selected works
4 Systematic review
4.1 Preliminary classification
4.2 Trend analysis
4.3 Keywords analysis
4.3.1 Current trends and hot topics
4.3.2 Gaps’ investigation
4.4 Detailed analysis of selected papers
4.4.1 Maintenance management
4.4.2 Quality management
4.4.3 Production Planning & Control (PPC)
4.4.4 Supply chain management
4.4.5 Models’ complexity, Input-Output variables
4.4.6 Concluding remarks
5 Conclusions and directions for future works
CRediT authorship contribution statement
Declaration of Competing Interest
Appendix A Acronyms of the algorithms cited in the literature review
Appendix B Bibliometric analysis
References