Aplicationartificialintelligence_Souz12233a_2020

UNISUL

Rodrigo Eduardo dos Santos
em 08/10/2024
Conteúdos escolhidos para você

10 pág.
Monitoramento de Rolamento de Caixa de Engrenagens de Turbina Eólica

UNIFACS
30 pág.
Estratégias de Controle de Sistemas de Armazenamento de Bateria em Micro-redes

UNIUBE
16 pág.
Datadriven performance analyses of wastewater treatment plants

UCL
281 pág.
Avanços em Soft Computing

UCAM
9 pág.
IET Renewable Power Gen - 2019 - Koukoura - Comparison of wind turbine gearbox vibration analysis algorithms based on

UNIFACS
Perguntas dessa disciplina

3. Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (IoT) e do B...

Uniasselvi
4 Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (loT) e do Bi...

UNIASSELVI
Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (IoT) e do Big ...

Uniasselvi
“O lançamento do ChatGPT no final de 2022, como a primeira ferramenta de inteligência artificial generativa (lAGen) de fácil utilização disponibili...

UniCesumar
Material
Crie sua conta grátis para liberar esse material. 🤩

Já tem uma conta?
Ao continuar, você aceita os Termos de Uso e Política de Privacidade
Crie sua conta grátis para liberar esse material. 🤩

Já tem uma conta?
Ao continuar, você aceita os Termos de Uso e Política de Privacidade
Crie sua conta grátis para liberar esse material. 🤩

Já tem uma conta?
Ao continuar, você aceita os Termos de Uso e Política de Privacidade
Crie sua conta grátis para liberar esse material. 🤩

Já tem uma conta?
Ao continuar, você aceita os Termos de Uso e Política de Privacidade
Crie sua conta grátis para liberar esse material. 🤩

Já tem uma conta?
Ao continuar, você aceita os Termos de Uso e Política de Privacidade
Crie sua conta grátis para liberar esse material. 🤩

Já tem uma conta?
Ao continuar, você aceita os Termos de Uso e Política de Privacidade
Crie sua conta grátis para liberar esse material. 🤩

Já tem uma conta?
Ao continuar, você aceita os Termos de Uso e Política de Privacidade
Crie sua conta grátis para liberar esse material. 🤩

Já tem uma conta?
Ao continuar, você aceita os Termos de Uso e Política de Privacidade
Crie sua conta grátis para liberar esse material. 🤩

Já tem uma conta?
Ao continuar, você aceita os Termos de Uso e Política de Privacidade
Crie sua conta grátis para liberar esse material. 🤩

Já tem uma conta?
Ao continuar, você aceita os Termos de Uso e Política de Privacidade
Conteúdos escolhidos para você

10 pág.
Monitoramento de Rolamento de Caixa de Engrenagens de Turbina Eólica

UNIFACS
30 pág.
Estratégias de Controle de Sistemas de Armazenamento de Bateria em Micro-redes

UNIUBE
16 pág.
Datadriven performance analyses of wastewater treatment plants

UCL
281 pág.
Avanços em Soft Computing

UCAM
9 pág.
IET Renewable Power Gen - 2019 - Koukoura - Comparison of wind turbine gearbox vibration analysis algorithms based on

UNIFACS
Perguntas dessa disciplina

3. Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (IoT) e do B...

Uniasselvi
4 Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (loT) e do Bi...

UNIASSELVI
Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (IoT) e do Big ...

Uniasselvi
“O lançamento do ChatGPT no final de 2022, como a primeira ferramenta de inteligência artificial generativa (lAGen) de fácil utilização disponibili...

UniCesumar
Prévia do material em texto
<p>UNIVERSIDADE FEDERAL DO RIO GRANDE DO NORTE</p><p>CENTRO DE TECNOLOGIA</p><p>PÓS-GRADUAÇÃO EM ENGENHARIA DE PRODUÇÃO</p><p>APLICATION OF ARTIFICIAL INTELLIGENCE FOR WIND</p><p>TURBINE OPERATION AND MAINTENANCE: PROPOSAL OF</p><p>FRAMEWORK</p><p>MATEUS GUILHERME MELO DE SOUZA</p><p>NATAL</p><p>2020</p><p>UNIVERSIDADE FEDERAL DO RIO GRANDE DO NORTE</p><p>CENTRO DE TECNOLOGIA</p><p>PÓS-GRADUAÇÃO EM ENGENHARIA DE PRODUÇÃO</p><p>APLICATION OF ARTIFICIAL INTELLIGENCE FOR WIND</p><p>TURBINE OPERATION AND MAINTENANCE: PROPOSAL OF</p><p>FRAMEWORK</p><p>This thesis is the result of the studies as a requisite for</p><p>acquiring the title of Master of research in Industrial</p><p>Engineering of the Federal University of Rio Grande</p><p>do Norte.</p><p>Supervisor: Prof. Dr. Mario Orestes Aguirre</p><p>González</p><p>MATEUS GUILHERME MELO DE SOUZA</p><p>NATAL</p><p>2020</p><p>Souza, Mateus Guilherme Melo de.</p><p>Aplication of artificial intelligence for wind turbine</p><p>operation and maintenance: proposal of framework / Mateus</p><p>Guilherme Melo de Souza. - 2021.</p><p>94f.: il.</p><p>Dissertação (Mestrado) - Universidade Federal do Rio Grande</p><p>do Norte, Centro de tecnologia, Programa de Pós graduação em</p><p>Engenharia de Produção, Natal, 2021.</p><p>Orientador: Dr. Mario Orestes Aguirre González.</p><p>1. Artificial Intelligence - Dissertação. 2. Operation and</p><p>Maintenance - Dissertação. 3. Systematic Literature Review -</p><p>Dissertação. 4. ANN - Dissertação. 5. LSTM - Dissertação. I.</p><p>González, Mario Orestes Aguirre. II. Título.</p><p>RN/UF/BCZM CDU 658.5</p><p>Universidade Federal do Rio Grande do Norte - UFRN</p><p>Sistema de Bibliotecas - SISBI</p><p>Catalogação de Publicação na Fonte. UFRN - Biblioteca Central Zila Mamede</p><p>Elaborado por RAIMUNDO MUNIZ DE OLIVEIRA - CRB-15/429</p><p>ACKNOWLEDGMENTS</p><p>This work is dedicated to my family as they always got my back in moments of</p><p>doubt. I also would like to thank the professor Mario González for pushing me to the limit and</p><p>for believing that I could explore the research theme further and further, you helped me finding</p><p>my purpose. Moreover, I also thank my friends of the Creation research group since I would</p><p>not have accomplished this research without them. You guys are amazing!</p><p>This study received support from the Brazilian National Council for Scientific and</p><p>Technological Development (CNPq) a federal organization under the administration of the</p><p>Ministry of Science and Technology, which is dedicated to promoting scientific research and</p><p>formation of human resources in Brazil.</p><p>RESUMO</p><p>O aquecimento global alertou à comunidade internacional da necessidade de uso de fontes</p><p>renováveis e limpas na geração de energia elétrica. Nesse cenário, a energia eólica se encontra</p><p>em expansão. Como essa fonte de geração depende de um equipamento central, o aerogerador,</p><p>a sua operação e manutenção é fundamental para a viabilidade do negócio, e por isso, a</p><p>aplicação de novas tecnologias, como a Inteligência Artificial, pode trazer maior</p><p>competitividade do setor. O objetivo do estudo é apresentar um framework para a aplicação de</p><p>Inteligência Artificial na operação e manutenção de parques eólicos. Para tanto, foi realizada</p><p>uma pesquisa teórica e de campo. A pesquisa teórica considerou uma revisão tradicional da</p><p>literatura e uma revisão bibliográfica sistemática. O estado da arte foi identificado pela análise</p><p>detalhada de 51 artigos obtidos na Plataforma Periódicos da Capes. Foram identificados os</p><p>equipamentos estudados, métodos e métricas adotadas e etapas da aplicação da IA. A pesquisa</p><p>de campo foi realizada pela aplicação do framework em um parque eólico, mediante a</p><p>simulação de uma aplicação de monitoramento da condição de rolamentos através da</p><p>modelagem da temperatura, mediante dados SCADA. Três modelos de redes neurais foram</p><p>testados: Rede Neural Feedforward, Rede Neural Autorregressiva e Long Short-Term Memory</p><p>O modelo de Long Short-Term Memory apresentou a melhor performance dentre os algoritmos</p><p>testados, mesmo quando comparado com outros estudos, o que mostra que ele pode ser usado</p><p>para esse tipo de aplicação. O framework proposto é dividido em quatro macro-etapas: Seleção</p><p>da aplicação, preparação de dados, desenvolvimento do modelo e avaliação de resultados.</p><p>Palavras-chave: Inteligência Artificial, Aerogerador, Operação e Manutenção, Revisão</p><p>bibliográfica sistemática, ANN, LSTM.</p><p>ABSTRACT</p><p>Global warming has alerted the international community to the need to of renewable and clean</p><p>energy sources in the generation of electricity. In this scenario, wind energy is expanding. As</p><p>this source depends on a central equipment, the wind turbine, its operation and maintenance are</p><p>fundamental for the viability of the business and, therefore, the application of technologies,</p><p>such as Artificial Intelligence, can improve the competitiveness of the sector. The objective of</p><p>this study is to present a framework for the application of Artificial Intelligence in the operation</p><p>and maintenance of wind farms. For this, theoretical and field research were carried out. The</p><p>theoretical research complemented a traditional literature review and a systematic literature</p><p>review. State-of-the-art was identified through the analysis of 51 articles obtained from the</p><p>Periodicos Capes Platform. The research identified the equipment studied, data, methods and</p><p>metrics adopted in the application of AI. The field research was carried out by applying the</p><p>framework to a wind farm, simulating an application of condition monitoring for bearings</p><p>through the modelling of its temperature using SCADA data. Three neural networks models</p><p>were tested: Feedforward Neural Network, Autoregressive Neural Network and Long Short-</p><p>Term Memory (LSTM). The LSTM model presented the best performance among the tested</p><p>algorithms, even when compared to other studies, which shows that it can be used for this type</p><p>of application. The proposed framework is composed by four macro-steps: Selecting</p><p>application, data preparation, model development and evaluation of results.</p><p>Keywords: Artificial Intelligence, Wind Turbine, Operation and Maintenance, Systematic</p><p>Literature Review, ANN, LSTM.</p><p>FIGURE INDEX</p><p>Figure 1.1: Global Energy matrix evolution. ............................................................................ 15</p><p>Figure 1.2: Countries with wind power as the most competitive energy source. ..................... 15</p><p>Figure 1.3: 17 objectives of the Agenda 2030. ......................................................................... 18</p><p>Figure 1.4: Average capacity factor by country. ...................................................................... 18</p><p>Figure 1.5: Number of publications about artificial intelligence applications for wind turbine</p><p>maintenance according to a survey carried out in the Periodicos Capes Platform. .................. 19</p><p>Figure 2.1: Research procedure. ............................................................................................... 22</p><p>Figure 2.2: Systematic Literature Review procedure. .............................................................. 23</p><p>Figure 3.1: Evolution of offshore wind turbines since 1991. ................................................... 26</p><p>Figure 3.2: Main components of a typical large-scale wind turbine. ....................................... 26</p><p>Figure 3.3: Relationship between AI, ML and deep learning. ................................................. 31</p><p>Figure 3.4: Typical neuron model. ........................................................................................... 33</p><p>Figure 3.5: A plane separating data using SVM ....................................................................... 36</p><p>Figure 4.1: Trend of the publications about the theme per year. .............................................. 40</p><p>in order to act in the most important alarm</p><p>when more than one trigger at the same period. Stetco et al. (2019) reviewed machine learning</p><p>applications for wind turbine condition monitoring. They analysed 144 articles, concluding that</p><p>most models use SCADA or simulation data; Machine Learning models are applied mainly for</p><p>classification, with less studies using regression; And that Neural networks, SVM and Decision</p><p>Trees are the most common algorithms used.</p><p>Hence, in this research, it was found studies that developed applications of neural</p><p>networks and other AI methods in a variety of applications for wind turbine maintenance. Here</p><p>are the main applications found in the studies according to its main goal:</p><p> Condition monitoring systems: It is the most common application that</p><p>appeared in the studies, they are systems designed to monitor the equipment</p><p>health, identifying faults before they happen and, in some cases, diagnose</p><p>it. According to Helbing and Ritter (2018), condition monitoring systems</p><p>can be divided into model-based approaches (numerical model of the wind</p><p>turbine or subcomponents), signal processing approach (vibration-based</p><p>signal processing and analysis) and data-driven approaches. Normal</p><p>behaviour models are a highly studied data-driven approach and it is similar</p><p>to Statistical Process Control. It consists in creating a model capable of</p><p>mimic the normal behaviour of parameters based on other variables, the</p><p>39</p><p>predicted values are than compared with the actual values measured in the</p><p>sensors, generating a prediction error, or residual. The residuals are than</p><p>analysed in residual control charts as large values or trends may be an</p><p>indicator of incipient faults. Another approach to be mentioned is the</p><p>Remaining useful life prediction that estimates how much time is left for the</p><p>machine or the component until a system malfunctioning occurs, which does</p><p>not require an in-depth understanding of the physics of the system as it is</p><p>uses historical failure data (LI et al., 2019).</p><p> Maintenance optimization: Applications that result in improving the</p><p>maintenance strategies of single wind turbines or the whole wind farm.</p><p>Usually, they use some sort of condition monitoring method with addition</p><p>of further analysis, such as clustering (HAMEED; WANG, 2012),</p><p>scheduling (YANG; HUANG; HUANG, 2016) or cost (LU et al., 2018).</p><p> Study of indicators: The use of data-mining techniques to study parameters</p><p>and indicators of wind turbines in order to evaluate its performance (SU;</p><p>HU, 2018) or propose new indicators to help maintenance decision makers</p><p>(ASTOLFI et al., 2015).</p><p> Inspection systems: Systems created with the goal of providing non-</p><p>continuous inspections, detecting faults or defects through a test</p><p>(JIMENEZ; MUÑOZ; MARQUES, 2018). There are also Applications that</p><p>allow the use of machines with this end such as drones (SHIHAVUDDIN</p><p>et al., 2019) or robots (WANG et al., 2013) for inspection of components.</p><p>40</p><p>CHAPTER 4: STATE OF THE ART</p><p>This chapter presents the state of the art of the topic as a result of the systematic</p><p>literature review. It is organized in a bibliometric analysis of the selected articles and their</p><p>classification according to their characteristics.</p><p>4.1 Descriptive analysis</p><p>After analyzing the articles that underwent the filtering established in the SLR</p><p>method, the graph presented in Figure 4.1 was developed. It shows the number of publications</p><p>per year. As can be seen in the figure, the first paper was published in 1999 and after that there</p><p>is a 11-year window without publication of articles on the topic until 2011, that came up with</p><p>2 articles published.</p><p>Figure 4.1: Trend of the publications about the theme per year.</p><p>Source: Author, 2020.</p><p>For this moment on, there are publications in every year, varying from 1 in 2015 to</p><p>14 in 2018, year with the most publications. The year of 2019 is the second one with 11</p><p>publications, however, it is worth noticing that the search for articles was held in June 2019, so</p><p>this year has potential to surpass 2018 as the one with most publications about this topic in</p><p>Periodicos Capes Platform.</p><p>1</p><p>0 0 0 0 0 0 0 0 0 0 0</p><p>2</p><p>3</p><p>2</p><p>4</p><p>1</p><p>6</p><p>7</p><p>14</p><p>11</p><p>Number of publications</p><p>41</p><p>4.1.1 Geographic distribution</p><p>This session addresses the geographical distribution of the institutions which the</p><p>authors of the selected articles represent, considering countries. The Figure 4.2 presents the</p><p>number publications by each country. It is important to highlight that some articles had</p><p>contributions from authors from more than one continent, this explains the fact that the sum of</p><p>the percentages of contributions between continents is greater than the total number of articles.</p><p>Figure 4.2: Geographic distribution of the institutions that the authors represent.</p><p>Source: Author, 2020.</p><p>From the figure, one can see that China has been pushing the research about the</p><p>theme with 18 publications in total, representing 35% of participation in the studies. It can be</p><p>explained by the fact that China is the biggest wind energy market in the world. USA is the</p><p>second in the number of publications, since it came up with 5, representing around 10% of the</p><p>studies. UK and Spain are both in third place with 4 publications and 8% of the studies each.</p><p>4.1.2 Publications by journal</p><p>When it comes to journals, there is 36 different scientific journals that had</p><p>publications about the research theme and came up after filtering. The journal named Energies</p><p>is the one with most publications, having 9 in total. It is followed by the Renewable Energy</p><p>journal with 6 in total and Wind Energy with 3 publications. The remaining journals ended up</p><p>with 1 or 2 publications each.</p><p>42</p><p>4.2 Content analysis</p><p>In the case of this research these applications were analysed and then classified</p><p>according to their main goal. Figure 4.3 shows a pie chart of the distribution of papers by</p><p>applications. As said previously in the Session 3.4, there are 4 literature review studies. The</p><p>articles of Vesely (2017) and Kusiak (2016) are opinion articles published in Power and Nature</p><p>scientific journals respectively, these are not considered in the pie chart of the Figure 4.3 since</p><p>it considers only the articles that are applied research.</p><p>Figure 4.3: Distribution of articles according to application.</p><p>Source: Author, 2020.</p><p>4.2.1 Studies developed by equipment</p><p>The applied research papers were classified by equipment, having different</p><p>objectives, data and artificial intelligence methods used by them. Most of the studies were</p><p>oriented for building applications for the drivetrain/gearbox with 15 papers in total, followed</p><p>by the generator with 8 studies, it can be explained by the Figure 4.4, which presents a</p><p>relationship between equipment, failure frequency and downtime per failure. It can be seen that</p><p>generator, gearbox and drivetrain have the biggest downtime per failure. There are also 7 studies</p><p>for blades, 2 for the pitch system, 3 for the sensors and 12 for the whole wind turbine, with</p><p>some studies focusing in more than one equipment at the same time.</p><p>33</p><p>4</p><p>2</p><p>6</p><p>Condition Monitoring</p><p>Systems</p><p>Maintenance</p><p>Optimization</p><p>Study of indicators</p><p>Inspection Applications</p><p>43</p><p>Figure 4.4: Failure annual frequency and downtime per failu re for the wind turbines.</p><p>Source: IGBA et al. (2015).</p><p>4.2.1.1 Blades</p><p>Due to its operational characteristics, the papers that proposed applications for the</p><p>blades focused on the development of systems to improve inspections using signals (sound,</p><p>guided waves), machines (robots, drones) and other approaches.</p><p>Wang et al. (2013) used neural networks in order to optimize the path of a two-leg</p><p>robot around the whole blade using the blade 3d design as model. Gantasala, Luneno and</p><p>Aidanpää (2017) studied a system capable of detecting imbalance in a non-rotating beam using</p><p>its natural frequencies on neural networks, and finite elements analysis. They suggest that the</p><p>system could be applied to wind turbine blades.</p><p>Jiménez, Muñoz and Marques (2018) used guided waves to build an inspection</p><p>system for identification of delamination faults through a series of machine learning algorithms,</p><p>having the best result with ANN, they applied it in a wind turbine blades in laboratory</p><p>conditions. Jiménez et al. (2019) propose an inspection system that uses guided waves in the</p><p>task of identifying ice imbalance in wind turbine blades, having the best results with SVM and</p><p>KNN, they also applied it in a wind turbine blade in laboratory conditions.</p><p>Chen et al. (2019) present a condition monitoring method for ice accretion</p><p>detection that reduces SCADA data imbalance (difference between normal operating data and</p><p>44</p><p>abnormal operating data) using triplet loss as data pre-processing and Deep neural networks for</p><p>modelling.</p><p>Shihavuddin et al. (2019) proposed a suggestion-based system that facilitates the</p><p>identification of four wind turbine blade damage using drone images, they applied image</p><p>augmentation in order to extent the dataset.</p><p>Liu et al. (2019) developed a wind turbine blade strain prediction inspection system</p><p>method using a combination of genetic algorithm and neural networks, having load, load</p><p>positions and distributions as data applied to a prototype.</p><p>4.2.1.2 Drivetrain and gearbox</p><p>The studies that aimed at wind turbine drivetrain and gearbox proposed applications</p><p>related to condition monitoring systems, however, they used different approaches and goals,</p><p>like monitoring the whole gearbox or just specific components. Some studies aimed at just</p><p>monitoring the equipment for fault detection, not focusing on fault diagnosis. Elhor et al.</p><p>(2009), proposed a vibration-based normal behaviour model for the gearbox using auto-</p><p>associative neural networks.</p><p>Schleichtingen and Santos (2011) made a comparative analysis between linear</p><p>regression, feedforward neural networks and auto-associative neural networks in the task of</p><p>building a normal behaviour model for monitoring the gearbox bearing temperature, generator</p><p>bearing temperature and generator stator temperature using SCADA data of offshore wind</p><p>turbines, in order to detect faults in these components through residual error between the</p><p>modelled temperature and the real sensor temperature.</p><p>Strdczkiewicz and Barszcz (2016) proposed a method for detecting gear faults in</p><p>the gearbox using data related to vibration, load and rotation speed applied in a combination of</p><p>backpropagation neural networks and linear regression.</p><p>Bangalore et al. (2017) built a normal behaviour model for the gearbox bearing</p><p>temperature and gearbox lubrication oil temperature using neural networks and SCADA data.</p><p>They focus mainly on data pre-processing and post-processing, presenting methods for dealing</p><p>with data-filtering, data discontinuity. They also use Malahanobis distance of residuals as the</p><p>mean to detect incipient faults.</p><p>Guo, Fu, and Yang (2018) used SCADA data in convolutional neural networks in</p><p>order to model bearing temperature of the gearbox and then compare it statistically with other</p><p>45</p><p>turbines of the same wind farm in order to identify over-temperature bearing faults. In the same</p><p>line of thought, Fu et al. (2019) present a normal behaviour model for the gearbox bearing</p><p>temperature using SCADA data in an approach combining convolutional neural networks and</p><p>long short-term memory (LSTM), also aimed at detecting over-temperature faults in bearings.</p><p>Li et al. (2019) proposes a combination of regression and ANN models to predict</p><p>the remaining useful life of gearbox bearings, simulating it using vibration data from a test rig.</p><p>On the other hand, some other studies go a step further and, instead of just detecting</p><p>faults earlier, they also aimed at diagnosing it. Ren and Qu (2014) propose a method for shaft</p><p>centreline orbit condition monitoring using simulated annealing algorithm and Hopfield neural</p><p>networks.</p><p>Tang et al. (2014) used manifold learning for pre-processing and SVM for</p><p>classification in order to build a fault diagnosis system for the wind turbine drivetrain using</p><p>vibration data from a real wind turbine.</p><p>Yang, Wang and Zhong (2016) presented a simulation experiment that uses</p><p>Extreme Learning Machine (ELM) to perform multi-fault classification of wind turbine</p><p>gearboxes using vibration data.</p><p>Wu et al. (2017) used wavelet transform and principal component analysis (PCA)</p><p>for pre-processing vibration data of a test rig and Improved Extreme Learning Machine (IELM)</p><p>for model a system capable of diagnose faults in wind turbine gearboxes. Yu, Huang and Xiao</p><p>(2018) present a wind turbine gearbox fault diagnosis method that uses convolutional neural</p><p>networks with batch regularization for training and testing, and compare it with other machine</p><p>learning algorithms, they used the same group of data presented in Wu et al. (2017), which</p><p>might indicate collaboration between studies.</p><p>Zhong et al. (2018) propose a data-driven multi-fault diagnosis method for the wind</p><p>turbine gearbox that uses Hilber-Huang transforms for vibration data pre-processing and</p><p>Extreme Learning Machine for modelling, they verify its effectiveness through data taken from</p><p>a wind turbine simulator.</p><p>As mentioned before, the study of Bach-Andersen, Romer-Odgaard and Winther</p><p>(2018) also have fault diagnosis as the final goal of their research.</p><p>Zhao et al. (2018) established a method for fault detection and analysis in the</p><p>generator and the gearbox using SCADA data as input of a model composed by deep auto-</p><p>encoder (DAE) neural networks that uses the relationship between signals, its reconstruction</p><p>error and an adaptive threshold to identify faults and where they are located.</p><p>46</p><p>Wang et al. (2019) propose a Dual-Extreme Learning Machine fault diagnosis</p><p>framework for single and multi-fault diagnosis in the drivetrain, the results were than</p><p>demonstrated using vibration from a test rig.</p><p>4.2.1.3 Generator</p><p>The papers that aimed at studying wind turbine generators or its specific</p><p>components focused specially in building condition monitoring systems. As said previously in</p><p>the gearbox section, Shleichtingen and Santos (2011) and Zhao et al. (2018) also focused on</p><p>specific components of the wind turbine generator.</p><p>Verma and Kusiak (2012) used four data-mining algorithms in order to predict wind</p><p>turbine generator brushes faults using SCADA data, having the best result with Boosting-tree</p><p>algorithm. Kusiak and Verma (2012) proposed a neural network normal behaviour model</p><p>approach that uses SCADA data to identify generator bearing faults in wind turbines.</p><p>In order to focus on the specific problem of detecting islanding faults in doubly-fed</p><p>induction generators (DFIG) of wind turbines, Abd-Elkader, Allam and Tageldim (2014) used</p><p>simulation to propose a method that first uses Fourier transform to process voltage and current</p><p>measurements and then fit it into a neural network that is responsible for identifying quality</p><p>disturbances or actual islanding operation.</p><p>Adouni et al. (2016) present a fault detection and identification method for low-</p><p>voltage-ride-through faults in wind turbine DFIG generators, they made an experiment for</p><p>simulating voltage amplitudes and phases with a test rig, and applied it in an artificial neural</p><p>network capable of detecting and diagnose this specific fault.</p><p>In order to detect faults in generator bearings, Ali et al. (2018) presents an online</p><p>vibration-based fault diagnosis method using vibration history data from real wind turbines</p><p>generators, they did time-domain and frequency domain feature extraction and applied it to an</p><p>adaptive resonance theory</p><p>two neural network in order to classify it into healthy state, degraded</p><p>state and failure state, in an unsupervised learning approach.</p><p>Moreover, the study of Amina, Tayeb and Mouloud (2016) proposed fast fuzzy-</p><p>logic based open switch fault detection for the rotor side converter of a doubly fed induction</p><p>generator (DFIG) wind turbine system using simulated three phase rotor currents data.</p><p>47</p><p>4.2.1.4 Pitch System</p><p>Two studies focused on the development of condition monitoring applications for</p><p>the pitch system. Chen, Matthews and Tavner (2013) present a supervised learning fault</p><p>prognosis procedure for wind turbine pitch system using SCADA data of six known wind</p><p>turbine pitch faults to train an Adaptive Neuro-Fuzzy Inference System, after that, they tested</p><p>it in other wind farm and compared it to the general SCADA alarm approach. Talebi, Sadrnia</p><p>and Darabi (2014) utilize a dynamic model for simulating mechanical and electric components</p><p>of wind turbines and suggest a fault detection system that detect faults on the generator angular</p><p>velocity sensor, pitch angle sensor and pitch angle actuators, using Recurrent Neural Networks</p><p>(RNN).</p><p>4.2.1.5 Sensors</p><p>Three studies focused on the development of applications of fault detection in wind</p><p>turbine sensors. Talebi, Sadrnia and Darabi (2014) was already mentioned in the last session.</p><p>Kavaz and Barutcu (2018) developed a technique for detection of calibration drifts in</p><p>temperature sensors of wind turbines using SCADA data, they had best results with auto-</p><p>associative backpropagation neural networks for distinguishing faulty situations and multi-</p><p>input single output backpropagation neural network to isolate the faulty sensor. Bakri, Koumir</p><p>and Boumhidi (2019) propose a fault detection and isolation scheme for rotation speed sensors</p><p>and generator output using extreme learning machine, they used simulation in order to test the</p><p>effectiveness of the proposed approach.</p><p>4.2.1.6 Whole Wind Turbine</p><p>Twelve papers focused on the study of artificial intelligence applications for the</p><p>whole wind turbine, having different approaches. Six studies aimed on the development of</p><p>condition monitoring applications for the wind turbine. Kusiak and Verma (2011) applied</p><p>association rule mining algorithm to identify the most frequent status patterns in wind turbines</p><p>using SCADA data from 100 wind turbines, then they used PCA for dimensionality reduction</p><p>and data-mining algorithms in order to predict the status patterns of the wind turbines, having</p><p>the best result with Random Forest Algorithm.</p><p>Choi and Kim (2017) used SCADA data to investigate the operating behaviour of</p><p>wind turbines by clustering data through Gustafsson-Kessel Fuzzy Clustering Method,</p><p>48</p><p>identifying the actual operating point of the wind turbine according to the distance in relation</p><p>to the cluster centres.</p><p>Lind et al. (2017) compared a stochastic approach with a neural network model in</p><p>the task of reconstructing the offshore wind turbine tower top acceleration signal, showing that</p><p>the stochastic approach outperformed the neural network approach. They also provided the</p><p>open-source code for the application.</p><p>Yang et al. (2018) proposes a method of early fault detection in wind turbines that</p><p>applies data-mining techniques in SCADA data to select the most important variables and then</p><p>build a model-based EWMA control chart and Multivariate EWMA in order evaluate their fault</p><p>detection ability.</p><p>Huang et al. (2018) presents a global condition monitoring method based on copula</p><p>function and autoregressive neural networks, they first used the copula function to build a joint-</p><p>probability density function of both power and wind speed with SCADA data, then established</p><p>a fault-free condition monitoring model with the autoregressive neural network, the fault is</p><p>identified with a statistical analysis of the model’s predictions.</p><p>Sun and Sun (2018) introduce a hybrid method which combines the analysis of</p><p>variance of SCADA parameters using RNN to evaluate the health status of wind turbines, they</p><p>separate the data in several different categories according to its variance ranges and then LSTM</p><p>model is trained in each category. The models are then compared with the actual parameter</p><p>values and after that, they adopt a weighted assessment method to define the health status of</p><p>the wind turbine.</p><p>As can be seem, most of these studies used model-based approaches in the task of</p><p>predicting the health condition of wind turbines, only Choi and Kim (2017) used a clustering</p><p>approach in the task of identifying the operational point of wind turbines.</p><p>Four studies focused on building applications that aimed at improving the</p><p>maintenance strategies of wind farms in a more holistic way, not only in condition monitoring</p><p>but also giving an overview of where and when the maintenance activity should be carried out.</p><p>Hammed and Wang (2012) proposed a method to optimize the maintenance</p><p>strategies of offshore wind farms using SOM neural networks to separate wind turbines into</p><p>clusters and then predict the expected power output per cluster using Backpropagation neural</p><p>networks, the maintenance activities can then be designed for the whole cluster instead for a</p><p>single wind turbine.</p><p>Yang, Huang and Huan (2016) used wind turbine SCADA data to present a three-</p><p>phase method to detect emerging faults in wind turbines that integrates power-curve residual</p><p>49</p><p>control charts with auto-associative neural networks to detect anomalous components by the</p><p>comparison of mean squared error of the components, after that a schedule chart is obtained</p><p>consisting in anomalous schedule, preventive schedule and ideal schedule.</p><p>The only study that brought results associated with maintenance cost reduction was</p><p>the one proposed by Lu et al. (2018). They studied a condition-based maintenance method that</p><p>uses neural networks to evaluate the major components life percentage and a conditional failure</p><p>probability of the component that represents the deterioration of the Wind Turbine, the</p><p>maintenance method is defined as an optimized two-level threshold failure probability to find</p><p>when the optimal condition based maintenance actions will be carried out, they use failure</p><p>information and maintenance costs of offshore wind turbines to illustrate the effectiveness of</p><p>the approach and then compare it to time-based maintenance information from other studies.</p><p>In order to improve management of power companies, Yeh et al. (2019) proposes a</p><p>machine learning-based method that predicts long cycle maintenance time of wind turbines.</p><p>Operation, maintenance and event codes were collected from wind turbines and then a hybrid</p><p>network model is built composed by convolutional neural networks (CNN) and support vector</p><p>machines (SVM).</p><p>Two studies attained at studying maintenance indicators of wind farms. Astolfi et</p><p>al. (2015) applied post-processing data mining methods on SCADA data of onshore wind farms</p><p>in order to formulate innovative indicators of goodness of performance, then they create a</p><p>malfunctioning index, detail malfunctioning index, stationary index and misalignment index</p><p>for the wind turbines. Su and Hu (2017) applied data mining techniques in a wind farm Scada</p><p>data to analyse its reliability characteristics. They study reliability indexes to determine the</p><p>components that influence the most on the wind turbines reliability, comparing the wind farm</p><p>indicators with the ones in EU and analyse the seasonality of the failure rate and correlation</p><p>with temperature.</p><p>Figure 4.5 shows the main applications for each equipment that were found in the</p><p>studies.</p><p>50</p><p>Figure 4.5: Main applications found in the studies by equipment.</p><p>Source: Adapted from Turbosquid (2020).</p><p>4.2.2 Scada vs vibration data for condition monitoring</p><p>Most of the applications</p><p>found for condition monitoring in this research has used</p><p>SCADA or vibration data. SCADA is low frequency data, generally recorded every 10 seconds</p><p>and averaged over 10 minutes, which means poor performance since it brings difficulties related</p><p>to the loss of noise characteristics and, on top of that, sometimes there are missing values and</p><p>imperfections in the data collected by the system. On the other hand, wind turbines are mostly</p><p>equipped with the SCADA system, being a built-in part of every large-scale wind turbine, which</p><p>means no additional cost for data acquisition (Kusiak, 2016; Kavaz, Barutcu, 2018).</p><p>SCADA data often appears in building normal behaviour models for early fault</p><p>detection; however, its disadvantages make it necessary to spend big effort in the pre-processing</p><p>phase of the study (BANGALORE, 2017). Some SCADA pre-processing procedure that</p><p>appears in the studies are outline removal, data scaling (SCHLEICHTINGEN, SANTOS;</p><p>2011), Wrapper Method for parameter selection (KUSIAK, VERMA, 2012; YANG et al.,</p><p>2018) and Principal Component Analysis (KUSIAK, VERMA, 2011). Nevertheless, it is the</p><p>most common data used when it comes to evaluate studies on real operating wind turbine data.</p><p>Even though SCADA data is more accessible, vibration data is still widely used,</p><p>especially when aiming at going a step further than just fault detection, targeting fault diagnosis.</p><p>However, there is difficulties associated with this kind of studies. Fault diagnosis generally</p><p>51</p><p>needs ground truth labelled data, containing the operational parameters of the machine to serve</p><p>as training data for supervised learning approaches, which is not easy to obtain from operating</p><p>large-scale wind turbines, hence, it is usually taken from test rigs or simulations where</p><p>researchers can induce faulty components without jeopardizing the operation. It is a problem</p><p>for testing the availability of these studies in operating wind turbines. In order to deal with this</p><p>issue, one possible approach can be the one presented in Bach-Andersen, Romer-Odgaard and</p><p>Winther (2018) who used vibration data from gearbox of onshore and off-shore wind turbines</p><p>and created a supervised dataset through the opinion of specialists. With this dataset they built</p><p>a model for fault diagnosis using convolutional neural networks. Furthermore, some vibration</p><p>pre-processing methods are cited in the studies such as Fourier Transform (YU, HUANG,</p><p>XIAO, 2018), Hilber-Huang Transform (ZHONG et al, 2018) and Wavelet Transform (WU et</p><p>al., 2017; JIMÉNEZ, MUÑOZ, MÁRQUEZ, 2018; JIMÉNEZ et al., 2019). The Figures 4.6</p><p>and 4.7 show the distribution of articles that used SCADA and vibration for condition</p><p>monitoring, as well as their final objective.</p><p>Figure 4.6: Distribution of articles that used SCADA or vibration data for condition monitoring.</p><p>Source: Author, 2020.</p><p>Figure 4.7: Distribution of articles that used Vibration or SCADA data for condition monitoring according to</p><p>their main goal.</p><p>Source: Author, 2020.</p><p>16</p><p>4</p><p>6</p><p>SCADA Vibration</p><p>Real WT Other sources</p><p>52</p><p>4.2.3 Evaluation metrics</p><p>There is a variety of evaluation metrics to present the results of the studies, what</p><p>can make it difficult for comparing their results and seek the best performance between them.</p><p>The metrics were divided into three groups and they are going to be presented in the following</p><p>sections.</p><p>4.2.3.1 Algorithm performance metrics</p><p>Algorithm performance metrics are used in order to evaluate the performance of the</p><p>method in classification or regression tasks, especially in the model selection phase of the study.</p><p>Some authors use these metrics as the final result of the research. Usually, classification models</p><p>use metrics that evaluate the performance of the algorithm based on the number of True</p><p>Positives (TP - Number of samples classified as positive that are actually positive), True</p><p>Negatives (TN - Number of samples classified as negative that are actually negative), False</p><p>Positives (FP - Number of samples classified as positives that are actually negative) and False</p><p>Negatives (FN - Number of samples classified as negatives that were actually positives). On</p><p>the other hand, regression models use metrics related to the error between the real and modelled</p><p>parameter. Some of these metrics are presented in the Table 4.1 (STETCO et al., 2019):</p><p>Table 4.1: Some classification and regression metrics found in the studies.</p><p>Classification Regression</p><p>Accuracy</p><p>𝑇𝑃 + 𝑇𝑁</p><p>𝑇𝑃 + 𝑇𝑁 + 𝐹𝑃 + 𝐹𝑁</p><p>Mean squared error (MSE)</p><p>1</p><p>𝑁</p><p>∗ (𝑌 (𝑖) − 𝑌 (𝑖))²</p><p>Precision</p><p>𝑇𝑃</p><p>𝑇𝑃 + 𝐹𝑃</p><p>Root mean squared error</p><p>(RMSE)</p><p>1</p><p>𝑁</p><p>∗ (𝑌 (𝑖) − 𝑌 (𝑖))²</p><p>Recall</p><p>𝑇𝑃</p><p>𝑇𝑃 + 𝐹𝑁</p><p>Mean absolute percentage error</p><p>1</p><p>𝑁</p><p>∗</p><p>𝑌 (𝑖) − 𝑌 (𝑖)</p><p>𝑌 (𝑖)</p><p>F1 score</p><p>2 ∗ 𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 ∗ 𝑅𝑒𝑐𝑎𝑙𝑙</p><p>𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 + 𝑅𝑒𝑐𝑎𝑙𝑙</p><p>Mean absolute error</p><p>1</p><p>𝑁</p><p>∗ |𝑌 (𝑖) − 𝑌 (𝑖)|</p><p>Source: Author, 2020.</p><p>53</p><p>4.2.3.2 Time and cost-based metrics</p><p>Some studies go a step further than evaluating the algorithm and, after some post-</p><p>processing and analysis, present the results in terms of time and cost. Time-based metrics</p><p>appears in this research as the ones that present the results in terms of early warning before</p><p>abnormal operation happens or even the time for fault detection. They normally appear in</p><p>studies that use the normal behaviour model approaches allied with the residual control charts</p><p>that aims at detecting abnormal behaviour or faults, sometimes comparing it to the actual</p><p>SCADA alarm system. It seems that this metric demonstrates how the implementation of the</p><p>proposed approach can impact in the maintenance planning activity directly.</p><p>Moreover, only Lu et al. (2018) present a cost-based metric, bringing the idea of</p><p>monetary savings in the application of their approach. Since only one study brings this idea, it</p><p>can be seen as a gap in the research. Future research should focus case studies of the application</p><p>of artificial intelligence in real wind turbine operations and present results in terms of savings.</p><p>It might enhance the appreciation of the wind turbine industry to this area, making it more open</p><p>to share data resources because it brings direct attention for improvements in the</p><p>competitiveness of the maintenance activities.</p><p>54</p><p>CHAPTER 5: CONCEPTUAL FRAMEWORK FOR APPLICATION OF</p><p>ARTIFICIAL INTELLIGENCE IN WIND TURBINE OPERATION AND</p><p>MAINTENANCE</p><p>When studying these applications, there are some factors that can influence in the</p><p>research success, hence, this paper suggests a conceptual framework with a pathway for</p><p>researchers and developers for implementing artificial intelligence applications for wind turbine</p><p>maintenance by analysing the steps taken in the papers. The framework is divided in 10 steps,</p><p>and the implementation of the studies can follow different pathways depending on the tasks</p><p>addressed to the AI methods. Figure 5.1 shows the proposed framework while the Table 5.1</p><p>shows the relation of papers that mention each step.</p><p>Figure 5.1: Conceptual framework with the main factors to consider when building Artificial Intelligence</p><p>Applications for wind turbine operations and maintenance.</p><p>Source: Author, 2020.</p><p>55</p><p>Table 5.1: Steps mentioned by the articles selected in for literature reviewing.</p><p>Authors GD DP PS/FE MSV FD RMP PP C</p><p>Elhor et al. (1999) X X X X X</p><p>Schlechtingen and Santos (2011) X X X X X X X</p><p>Kusiak and Verma (2011) X X X X X</p><p>Hameed and Wang (2012) X X X X</p><p>Verma and Kusiak (2012) X X X X X</p><p>Kusiak and Verma (2012) X X X X</p><p>Chen et al. (2013) X X X X X X X</p><p>Wang et al. (2013) X X X</p><p>Tang et al. (2014) X X X X</p><p>Abd-Elkader et al. (2014) X X</p><p>Talebi et al. (2014) X X X X</p><p>Ren and Qu (2014)</p><p>Astolfi et al. (2015) X X X</p><p>Yang et al. (2016a) X X X X X</p><p>X</p><p>Strdczkiewicz and Barszcz (2016) X X X X X X</p><p>Amina et al. (2016) X X X</p><p>Adouni et al. (2016) X X X X</p><p>Yang et al. (2016) X X X X X</p><p>Choi and Kim (2017) X X X X X</p><p>Bangalore et al. (2017) X X X X X X X</p><p>Lind et al. (2017) X X X X</p><p>Gantasala et al. (2017) X X X X</p><p>Wu et al. (2017) X X X X X</p><p>Zhao et al. (2018) X X X X X</p><p>Lu et al. (2018) X X X X</p><p>Su and Hu (2017) X X</p><p>Bach-Andersen et al. (2018) X X X X</p><p>Yang et al. (2018) X X X X X</p><p>Ali et al. (2018) X X X X X</p><p>Jiménez et al. (2018) X X X X X</p><p>Guo et al. (2018) X X X X X</p><p>Huang et al. (2018) X X X X</p><p>Kavaz and Barutcu (2018) X X X X X</p><p>Sun and Sun (2018) X X X X X X</p><p>Yu et al. (2018) X X X X X</p><p>Zhong et al. (2018) X X X X X X</p><p>Fu et al. (2019) X X X X</p><p>Jiménez et al. (2019) X X X X X</p><p>Chen et al. (2019) X X X X X</p><p>Shihavuddin et al. (2019) X X X X</p><p>Yeh et al. (2019) X X X</p><p>Bakri et al. (2019) X X X X</p><p>Yang et al. (2019) X X X X X</p><p>Li et al. (2019) X X X X X</p><p>Liu et al. (2019) X</p><p>Source: Author, 2020.</p><p>The acronyms stand for GT: Getting data, DP: Data pre-processing, PS/FE:</p><p>Parameter selection and feature extraction, MSV: Model selection and evaluation, FD: Fit best</p><p>56</p><p>model with new data, RMP: Retrain model periodically, PP: Post-processing, C: Comparison</p><p>with other studies.</p><p>5.1 Getting data</p><p>Artificial intelligence algorithms need great amount of data to work properly,</p><p>sometimes it is necessary to train models with long term parameters in order to extract natural</p><p>operating characteristics of the turbine (STRDCZKIEWICZ, BARSZCZ, 2016). Therefore,</p><p>when building applications for wind turbine O&M, one must check data availability before</p><p>defining the study scope.</p><p>According to Kusiak (2016), wind industry is still very closed when it comes to data</p><p>availability since companies usually hold most of the data acquired in the wind turbines</p><p>operation. SCADA data is a good option because it does not require additional cost for data</p><p>acquisition since almost all wind turbines are equipped with this system. Vibration data is more</p><p>difficult to be acquired from operating large-scale wind turbines.</p><p>According to Strdczkiewicz and Barszcz (2016), it is difficult to access raw</p><p>vibration data, therefore, researchers sometimes might go for test rigs or simulations to have</p><p>enough data for their studies. Other kinds of data that appears in this research are drone pictures,</p><p>sound and guided waves for blade applications and current for the electric system.</p><p>5.2 Define how AI can help in the approach</p><p>Artificial intelligence methods can be useful in wind farm O&M approaches</p><p>through 4 main tasks: regression, for building normal behaviour models and remaining useful</p><p>life prediction; classification for faults and abnormal behaviour detection; clustering for</p><p>grouping up wind turbine behaviour patterns; and data-mining for extract useful information</p><p>and create indicators from data.</p><p>These tasks can be combined within the same approach, being used strategically for</p><p>achieving the goal of the study. For example, Hammed and Wang (2012) used clustering for</p><p>separating wind turbines into clusters of performance, and regression for model the power</p><p>output of the wind turbines, comparing the power output of the wind turbines in the same cluster</p><p>in order to optimize the wind farm maintenance strategy.</p><p>57</p><p>5.3 Data pre-processing</p><p>Data pre-processing is an essential step in this workflow. Machine learning models</p><p>learn based on the data that they are exposed to in the training stage, so it is important that the</p><p>data are error free which is difficult to find in the real world. For example, SCADA data usually</p><p>are discontinuous and containing inconsistencies, which lead to inaccuracies in Neural</p><p>Networks models (BANGALORE et al., 2017). According to Schlechtingen and Santos (2011),</p><p>data needs to be pre-processed in order to apply neural networks because even though they are</p><p>able to handle fuzzy or incomplete data, they are sensitive to invalid data, which could bring its</p><p>generalisation performance down.</p><p>The most common data pre-processing steps are data scaling, and outlier removal.</p><p>Classification tasks often require ways to deal with unbalanced data classes because is difficult</p><p>to find operational data of unhealth operating wind turbines. In order to do so, they adopt steps</p><p>such as oversampling to increase the amount of data labelled as faulty.</p><p>Machine learning models can also be used in this step. For example, Bangalore et</p><p>al. (2017) used the cluster filter method proposed in Kusiak and Verma (2013) to remove data</p><p>that do not belong to the overall normal behaviour of wind turbines.</p><p>5.4 Parameter selection and Feature extraction</p><p>Parameter selection (or feature selection) is important to build AI applications.</p><p>Scada data usually gets more than 100 parameters of a wind turbine and, for accurate</p><p>predictions, the dimensionality of the dataset needs to be reduced because not all parameters of</p><p>the dataset are significant for the desired task, hence, selecting the most important among them</p><p>improves the performance of the model as well as reduces the computational cost (KUSIAK</p><p>VERMA, 2011). Data-mining algorithms such as boosting tree, PCA and wrapper approach are</p><p>widely used for parameter selection.</p><p>However, according to Bangalore et al. (2017) using data mining approaches for</p><p>this task may lead to selection of a large number of input parameters, thus, physical and domain</p><p>knowledge can be applied in order to keep this number in a reasonable amount.</p><p>When it comes to feature extraction, Mörchen (2003) states that it should be applied</p><p>in time series in order to compress it, keeping only important information. Therefore, cleaning</p><p>noise and removing correlations, usually improves the performance of data-mining algorithms.</p><p>In this research, feature extraction techniques appear often in studies that use vibration data.</p><p>According to Stetco (2019), an interesting approach is the auto-encoders, which are deep NN</p><p>that converts input data in a low dimension representation and a decoder that transform it to</p><p>58</p><p>proper outputs. An example is the study of Yang et al. (2014) that compared some feature</p><p>extraction approaches in vibration signals in order to achieve better accuracy for single and</p><p>multi-fault detection in the gearbox.</p><p>5.3 Model selection and validation</p><p>Machine learning techniques encompasses various methods that can be set in</p><p>different ways. In order to find the best configuration, one has to fit the models with test and</p><p>validation datasets, evaluate the best performance in the validation dataset, and then select the</p><p>configuration that reduces the error function the most to generalize better to unseen data.</p><p>Neural networks can be set in different types, architectures and hyperparameters. In</p><p>order to find the best Neural Network configuration, it is recommended to perform at least 10</p><p>runs changing the network architecture, choosing the one that gives the best network</p><p>generalization. It reduces the risk of finding solutions that do not generalize properly</p><p>(SCHLECHTINGEN, SANTOS, 2011; TARASSENKO, 1998). Therefore, setting the best</p><p>network configuration is a trial-and-error process that is essential to find the one that delivers</p><p>the best performance.</p><p>It is also important to test different methods. Jiménez et al. (2018) compared</p><p>different classifiers and compare the best performance among them. It is also adopted in Wu et</p><p>al. (2017), Bach-Andersen et al. (2018), Yu et al. (2018), Zhong et al. (2018), Jimenez et al.</p><p>(2019), Chen et al. (2019), Shihavuddin et al. (2019) and Bakri et al. (2019). The K-fold cross</p><p>validation process is used in some studies to iteratively evaluate the algorithm</p><p>with K different</p><p>partition of training and validation datasets (JIMÉNEZ et al., 2019; KUSIAK, VERMA, 2012;</p><p>BERMEJO et al., 2019).</p><p>5.4 Fit best model to unseen data</p><p>Once the best model is selected it is time to fit it to unseen data, and evaluate the</p><p>results. Shihavuddin et al. (2019) states that it is important to save a reasonable number of</p><p>unseen samples to evaluate the performance of the model in the test phase. In normal behaviour</p><p>models, the predictions of the unseen data are confronted with the real outputs in the figure of</p><p>the residual error, which is going to be useful in building residual control charts. The residual</p><p>error is given by the Equation 1, where 𝑌 represents measured data and 𝑌 represents modelled</p><p>data of the i-th sample.</p><p>59</p><p>𝑅𝑒𝑠𝑖𝑑𝑢𝑎𝑙 𝑒𝑟𝑟𝑜𝑟 (𝑖) = 𝑌 (𝑖) − 𝑌 (𝑖) (5.1)</p><p>5.5 Retrain Model periodically</p><p>As the wind turbines are exposed to harsh conditions and endure for long periods,</p><p>it is expected that it degrades with time, and its normal operating conditions as well. Thus, in</p><p>order to maintain the continuous monitoring of the wind turbine components, models might</p><p>have to be retrained as the time goes by. Stetco et al. (2019) states that the performance of the</p><p>models can degrade due to failing hardware responsible for feed data to the models, thus,</p><p>retraining models may be an option. They also say that classification models might not need</p><p>such an effort since they can be set to learn online with new data.</p><p>On the other hand, retraining regression models built with normal behaviour wind</p><p>turbine data may be troublesome because new data may pursue different behaviour from the</p><p>dataset which the model was once trained with. Schleichtingen and Santos (2011) used data</p><p>from 3 months after bearings replacement as training data for modelling the bearing</p><p>temperature. In generalizing this approach for every normal behaviour model, one can conclude</p><p>that they could be retrained every time the component that is being monitored is replaced so the</p><p>model would acquire the normal behaviour data for the new component.</p><p>When it comes to clustering, Choi and Kim (2017), state that operating behaviour</p><p>of wind turbines change along the period of operation, in this sense clusters centres can also</p><p>change, so they should be updated regularly (monthly or weekly) by training the model with</p><p>new data. The change in clusters centres can also be a measure of how the wind turbine degrades</p><p>with time.</p><p>5.6 Post Processing</p><p>In order to present a complete approach, further analysis can be implemented as</p><p>post-processing methods to provide decision-making potential. Astolfi et al. (2015) states that,</p><p>the more detailed the scale of the phenomenon to analyse, the more complex and smart the post</p><p>processing data should be. One example of applications that usually requires post-processing</p><p>are the normal behaviour models. According to Lind et al. (2017) and Bangalore et al. (2017)</p><p>the normal behaviour models for condition monitoring of wind turbines can be upgraded by</p><p>both pre-processing and post-processing, improving their training and reducing false alarms in</p><p>order to provide a good tool for decision making. Some example of post-processing adopted by</p><p>the papers are anomaly detection charts (SCHLECHTINGEN, SANTOS, 2011; TALEBI et al.,</p><p>60</p><p>2014; BANGALORE et al., 2017; ZHAO et al. 2018; YANG et al., 2018; HUANG et al., 2018),</p><p>study of indicators (ASTOLFI et al., 2015), maintenance schedule charts (YANG et al., 2016a)</p><p>and cost analysis (LU et al., 2018).</p><p>5.7 Comparison with other studies</p><p>Some papers show a comparison between methods in their results, analysing the</p><p>performance of different approaches, AI algorithms, and even other studies with the same goal.</p><p>It seems like a good practice to prove the performance of the proposed approach against other</p><p>studies, establishing benchmarking. This step is taken by Chen et al. (2013), Bangalore et al.</p><p>(2017), Lu et al. (2018), Ali et al. (2018) and Bakri et al. (2019).</p><p>61</p><p>CHAPTER 6: CONCEPTUAL FRAMEWORK APPLICATION AND LEARNING</p><p>LESSONS</p><p>This chapter describes the field research stage. It shows the process of selecting the</p><p>application to be simulated; the structuring of the dataset for modeling; description of the</p><p>method and simulation. The field research step-by-step follows the flowchart in Figure 2.1.</p><p>6.1 Selection of application</p><p>The application was selected through an interview with a specialist in the field of</p><p>operation and maintenance of wind farms who works in Brazil. Topics related to the current</p><p>status of condition monitoring systems in wind farms, data availability, most critical equipment</p><p>and bottlenecks were addressed, always taking into account how new condition monitoring</p><p>tools that may contribute to operations and maintenance activities. The questions discussed in</p><p>the interview are available in Appendix 1.</p><p>Regarding the current status of condition monitoring systems in wind farms, this</p><p>step showed that there are systems that monitor the performance of the wind turbine as a whole,</p><p>identifying which ones may be faulty according to their current and historical performance. In</p><p>addition, the SCADA system monitors sensitive parameters of the equipment, stopping the</p><p>operation if any of them exceeds limits defined by the manufacturer through an alarm system.</p><p>The SCADA system creates a database with the measurement of parameters and the moments</p><p>of alarms. The creation of a database with the history of this equipment may allow the</p><p>development of AI applications for monitoring purposes, if the data is available.</p><p>As for the criticality of the equipment, it was considered by the specialist that the</p><p>transmission system is the most critical among all, both for its monetary value and for the</p><p>frequency in which interventions are needed. The SCADA system does the job of general</p><p>monitoring, stopping the operation in case of occurrence of parameters outside the limits</p><p>specified through the alarm system. However, more specific components are still subject to</p><p>unforeseen failures by SCADA, especially the bearings, which have frequent failures. In other</p><p>words, artificial intelligence applications developed for the monitoring of bearing failure can</p><p>be of great value when planning the maintenance of this wind farm in Brazil, and for this reason,</p><p>it is simulated by this research.</p><p>Thus, considering that condition monitoring applications for wind turbine bearings</p><p>would be welcomed for the Brazilian wind farm under study and that SCADA data would be</p><p>62</p><p>available for simulation, we crossed this information with the studies analyzed in the systematic</p><p>literature review, coming up with 4 studies that had the same characteristics. The results of</p><p>these studies are presented in the Table 6.1.</p><p>Table 6.1: Some quantitative results of studies that used SCADA data to build applications of condition</p><p>monitoring of wind turbine bearings.</p><p>Source: Author, 2020.</p><p>6.2 Application of the conceptual framework</p><p>In this section the application of the framework is presented with s simulation on a</p><p>wind turbine from the wind farm in order to test a condition monitoring application for wind</p><p>turbine bearing maintenance. Four turbines presented failure in the front generator bearing, as</p><p>presented in the Table 6.2.</p><p>Table 6.2:2Front generator bearing faults that happened in the wind farm.</p><p>Turbine Operation start Day of bearing replacement</p><p>Period</p><p>T16 14/03/2013 26/05/2017</p><p>1534</p><p>T46 14/03/2013 01/06/2017</p><p>1540</p><p>T23 24/01/2012 12/07/2018</p><p>2361</p><p>T59 16/07/2013 11/05/2019</p><p>2125</p><p>Source: Author, 2020.</p><p>The wind turbine that presented the earliest bearing fault in terms of time of</p><p>operation is selected is the T16, and due to that, it is going to be used as object of simulation</p><p>for this research.</p><p>Author Goal Metric Results</p><p>Schlechtingen</p><p>and Santos (2011)</p><p>Normal behaviour models</p><p>for failure detections in</p><p>generator and gearbox</p><p>bearings and generator</p><p>Early fault detection</p><p>Gearbox bearing:186 days,</p><p>Generator stator: 70 days,</p><p>Generator bearing: 25 days.</p><p>Kusiak and Verma (2012)</p><p>Fault prediction in wind</p><p>turbine generator bearings</p><p>Early fault detection 1.5 hours</p><p>Early fault detection Gearbox bearing: 3 months</p><p>Root mean squared error Generator bearing: 0.77</p><p>Fu et al (2019)</p><p>Condition monitoring of</p><p>wind turbine generator</p><p>bearing</p><p>Root mean squared error 0.78-2.02</p><p>Fault detection in gearboxes</p><p>through monitoring its</p><p>bearing and oil temperature</p><p>Bangalore et al. (2017)</p><p>63</p><p>6.2.1 Getting data</p><p>In the development of the artificial intelligence application, it is first necessary to</p><p>check the availability of the dataset. The SCADA data of the wind turbines was available for</p><p>this study. It is an industrial control system that monitor assets geographically dispersed, and</p><p>have centralized data acquisition and storage. In the case of wind farms, they allow operators</p><p>to control and store parameters for wind turbines, electrical substations and anemometric towers</p><p>from remote sensing, storing data related to environmental variables, state variables and</p><p>equipment parameters (PEDROSA, 2016). These systems became common in modern wind</p><p>turbine technology due to their adequate cost, simplicity of implementation and interpretation,</p><p>and versatility (MANA; PICCIONI; TERZI, 2017). Figure 6.1 shows a schematic of the</p><p>SCADA systems architecture in wind farms.</p><p>Figure 6.1: SCADA system architecture in wind farms.</p><p>Source: Pedrosa (2016).</p><p>Therefore, as SCADA data of the wind farm is available for the study, it is going to</p><p>be used for simulation of artificial intelligence methods in the task of building normal behaviour</p><p>models for bearings.</p><p>6.2.2 Define how AI can help in the approach</p><p>For the purpose of building normal behaviour models for wind turbine bearings</p><p>condition monitoring using SCADA data, artificial intelligence methods are useful in the task</p><p>of regression or classification. As there is not ground truth labelled data specifying weather the</p><p>bearing is in normal or abnormal conditions, it is imperative to build normal behaviour models</p><p>based on regression. The 4 approaches mentioned in the Table 6.1 use artificial intelligence</p><p>64</p><p>methods to create normal behaviour models for monitoring the temperature of the bearings and</p><p>this research follows the same line.</p><p>6.2.3 Parameter selection</p><p>In order to build a normal behaviour model for bearing monitoring, the parameters</p><p>selected in Schleichtingen and Santos (2011) were used. In their article, they used physical</p><p>domain knowledge to select them, which means no additional use of statistical or data-driven</p><p>methods in this stage. They also had the best results in terms of early fault detection among the</p><p>studies that were analysed.</p><p>A simple network configuration is required with parameters in the input, hidden</p><p>layers and an output layer, which facilitates networks constructions, analysis and training. Chart</p><p>6.1 shows the parameters used as input signals for the gearbox and generator bearing</p><p>temperatures modelling while Figure 6.2 shows the location of the monitored parameters.</p><p>Chart 6.1: Input data for building the normal behaviour model proposed by Schlechtingen and Santos (2011).</p><p>Modelled parameter Input Signals</p><p>Generator Bearing Temperature</p><p>Nacelle Temperature (1)</p><p>High-speed shaft velocity (4)</p><p>Winding temperature (6)</p><p>Power output (7)</p><p>Gearbox Bearing Temperature</p><p>Nacelle temperature (1)</p><p>Oil sump temperature (2)</p><p>High-speed shaft velocity (4)</p><p>Power output (7)</p><p>Ambient temperature (8)</p><p>Source: Adapted from Schlechtingen and Santos (2011).</p><p>Figure 6.2: Sensor location of the SCADA monitoring parameters selected by Schleichtingen and Santos (2011)</p><p>for building the normal behaviour models.</p><p>Source: (SCHLECHTINGEN; SANTOS, 2011).</p><p>65</p><p>Where:</p><p>1. Nacelle Temperature in °C;</p><p>2. Gearbox oil chunk temperature in °C;</p><p>3. Gearbox bearing temperature in °C;</p><p>4. High velocity shaft rotation in RPM;</p><p>5. Generator bearing temperature in °C;</p><p>6. Generator stator temperature in °C;</p><p>7. Power output in KW;</p><p>8. Ambient temperature in °C.</p><p>As only the generator bearing had faults in this period, the simulation phase of this</p><p>study will leave the gearbox bearing model aside, so the fault prediction ability of the method</p><p>can be evaluated.</p><p>The heatmap of the correlation between parameters the generator dataset is</p><p>presented in the Figure 6.3. From the figure, the bearing temperature is highly correlated with</p><p>active power and winding temperature, and less correlated with shaft rotation and nacelle</p><p>temperature.</p><p>Figure 6.3: Generator bearing data correlation for the wind turbine.</p><p>Source: Author, 2020.</p><p>66</p><p>According to Schleichtingen and Santos (2011) the parameters with less correlation</p><p>are used for stabilization of the network. Both configurations with and without the less</p><p>correlated parameters were tested in this research, with the inclusion of them showing better</p><p>results.</p><p>6.2.4 Data Pre-processing</p><p>In the signal pre-processing phase, the input data is treated so that the database is</p><p>adequate to serve for neural networks. Schleichtingen and Santos (2011) presents the following</p><p>sequence for data treatment and standardization:</p><p> Data validity check: the data used in the model must be checked, filtering</p><p>outliers and unexpected variations.</p><p> Data normalization: Equation 1 is used for data normalization, where S is</p><p>the normalized variable and V is the actual variable.</p><p>𝑆 =</p><p>𝑉 − 𝑉𝑚𝑖𝑛</p><p>𝑉𝑚𝑎𝑥 − 𝑉𝑚𝑖𝑛</p><p>(6.1)</p><p> Missing data processing: rows with missing data are deleted from the</p><p>dataset.</p><p> Lag removal: the cross-correction function is used to identify delays in the</p><p>linear dependency between functions in order to build a database consistent</p><p>in terms of input-output.</p><p>At the beginning the dataset contains 321866 data points and, after pre-processing,</p><p>it became a 259365 data-points dataset. Therefore, the pre-processing phase took around 20%</p><p>of the dataset which is quite extensive.</p><p>6.2.4 Model Selection</p><p>In selecting the method that best performs in modelling the generator bearing</p><p>temperature it is first necessary to determine the separation for training, validation and testing</p><p>datasets in order to guarantee that the training phase encompasses the normal operation bearing</p><p>data and have plenty of datapoints. Second, the network types, structures and hyperparameters</p><p>are chosen. At the end, the models are trained and evaluated with the validation dataset.</p><p>67</p><p>6.2.4.1 Separation in training, validation and testing datasets</p><p>In separating the datasets for training, Schleichtingen and Santos (2011) suggest</p><p>that 3 months of operational data should be used because, as the time goes by, the normal</p><p>operation of the machine deteriorates and then the parameters taken from the SCADA data may</p><p>not be useful to build a normal behaviour model.</p><p>In order to identify the most suitable period to serve as training, seasonality analysis</p><p>with the most correlated parameters with the generator bearing temperature were performed.</p><p>The figures 6.4 and 6.5 show the relation between the means of active power and winding</p><p>temperature with the generation bearing temperature mean grouped by month, respectively. The</p><p>figures show how the means of the parameters varied with months, showing seasonality effect.</p><p>Figure 6.4: Seasonality analysis between active power and front generator bearing temperature.</p><p>Source: Author, 2020.</p><p>68</p><p>Figure 6.5: Seasonality analysis between winding temperature and front generation bearing temperature.</p><p>Source: Author, 2020.</p><p>This seasonality effect may affect the training period, as it should have the whole</p><p>operational regime in order to generalize well for all operational conditions. Hence, in this</p><p>research, the first year of operation is taken as training data while the following year is taken as</p><p>validation data. The Figure 6.6 and 6.7 show the periods of training and validation data before</p><p>and after pre-processing, it also shows when the failure occurred with the hashed line in black.</p><p>Figure 6.6: Training dataset (pink), validation dataset (green) and failure (dashed black line) before pre-</p><p>processing.</p><p>Source: Author, 2020.</p><p>69</p><p>Figure 6.7: Training dataset (pink), validation dataset (green) and failure (dashed black line) after pre-processing.</p><p>Source: Author, 2020.</p><p>6.2.4.2 Models tested</p><p>In order to select the model that best performs when submitted to unseen data, the</p><p>mean squared error (RMSE) is selected as evaluation metric. Three kids of networks were tested</p><p>as they appear in the studies presented in the Table 6.1:</p><p> Feed-Forward Neural Network, presented in Schleichtingen and Santos</p><p>(2011) and Kusiak and Verma (2012).</p><p> Autoregressive Neural Networks, shown in Schleichtingen and Santos</p><p>(2011);</p><p> LSTM Neural Network presented in Fu et al. (2019).</p><p>6.2.4.2.1 Feed forward neural network (Full Signal Reconstruction)</p><p>The neural networks of the feed-forward type consist of neurons that are organized</p><p>in layers, the first being the input layer, the last, the output layer and those that are in between</p><p>are the hidden layers (SVOZIL; KVASNIEKA; POSPICHAL, 1997). The work of</p><p>Schleichtingen and Santos (2011) and Kusiak and Verma (2012) use feed-forward neural</p><p>networks to reconstruct the bearing temperature signal, however, they use different network</p><p>architectures and hyperparameters. The feedforward NN presented in Schleichtingen and</p><p>Santos (2011), is put into use. It consists of two layers, the hidden layer and the output layer, as</p><p>shown in Figure 6.8. It is recommended that the optimal number of neurons in the hidden layer</p><p>should be defined by experiment, executing the method at least 10 times, changing only the</p><p>number of neurons, and thus the number of neurons is selected for the best result, with the aim</p><p>70</p><p>of decreasing the risk of finding solutions that do not generalize well (RAFIQ; BUGMANN;</p><p>EASTERBROOK, 2001; SCHLECHTINGEN; SANTOS, 2011).</p><p>Figure 6.8: Feed-forward neural network architecture.</p><p>Source: Author, 2020.</p><p>The neural network training method applied is the gradient descent with</p><p>momentum. Gradient descent is an iterative method that changes the weights of the neural</p><p>network in order to reduce the generalization. The weights are initialized randomly (usually a</p><p>number between -0.01 and +0.01), and then, at each iteration, they are approximated in the</p><p>direction that most reduces the generalization error, until it arrives in an acceptable error value</p><p>(RUSSEL; NORVING, 2013). The moment is a term added to the gradient descent to accelerate</p><p>and stabilize learning, which can have a satisfactory effect on the process (HAYKIN, 2008).</p><p>6.2.4.2.2 Autoregressive Neural Networks</p><p>Some signals are strongly autocorrelated, as future time values can be predicted by</p><p>learning signal autocorrelations. The autoregressive approach used in Schleichtingen and</p><p>Santos (2011) consists in the same inputs used in the neural network presented in the section</p><p>before, but adding the previous bearing temperature also as input signal, as presented in the</p><p>Figure 6.9. The same training method, parameter initialization and activation function used in</p><p>the Full Signal Reconstruction neural network is applied for this one as well.</p><p>71</p><p>Figure 6.9: Autoregressive Feed-forward neural network architecture.</p><p>Source: Author, 2020.</p><p>6.2.4.2.3 Long Short-Term Memory Neural Network</p><p>Recurrent neural networks are able to process arbitrary length sequences by</p><p>memorizing dynamic information of input parameters (SUN, SUN; 2019). Long short-term</p><p>memory neural network is an improved model of the recurrent neural network (RNN). In this</p><p>model, the neurons in the hidden layer of the neural network have a feedback mechanism that</p><p>is able to transmit context, so the model can process sequence data. It uses a length of</p><p>sequenced data as training set and predicts the output of the following moment after completing</p><p>the training (FU et al., 2019).</p><p>As a variant of recurrent neural network, LSTM exploits the internal memory from</p><p>input data to analyse the intrinsic dependencies. Generally speaking, the dependencies in the</p><p>time series can be classified into two types: short-term dependency and long-term dependency.</p><p>The difference from RNN lies in the submodules. The standard LSTM cell is composed by</p><p>three gates, including the forget gate, input gate and output gate (SUN, SUN; 2018). The LSTM</p><p>design allows it leave invalid information aside in the long period. These models are built as</p><p>following the steps:</p><p>The LSTM cell would be regulated to update with three gates:</p><p> The input gate receives information of the current sample;</p><p> The forget gate determines how much of which information would be</p><p>relevant from historical data;</p><p> The output gate controls the context output from the cell;</p><p>In the context of this research, the model built to predict the bearing temperatures</p><p>is set according to the following:</p><p>72</p><p> The LSTM neural network contains 100 units in the first hidden layer and</p><p>50 units in the second hidden layer, both with the Relu activation function;</p><p> The input sequence is set to be 12, or two ours of data;</p><p> The training method is the stochastic gradient descent with momentum.</p><p>6.2.4.4 Results on the validation dataset</p><p>The models presented in the subsection above had the following results in terms of</p><p>Mean Squared Error (MSE) and Root Mean Absolute Error (MAPE):</p><p>Table 6.3:3 Performance results of the networks in the validation dataset.</p><p>Model MAE</p><p>RMSE</p><p>Feed-forward Neural Network 1.879</p><p>2.320</p><p>Autoregressive Neural Network 0.605</p><p>1.004</p><p>LSMT Neural Network 0.353</p><p>0.411</p><p>Source: Author, 2020.</p><p>From this table, the LSTM model performed better than both Autoregressive and</p><p>Full Signal Reconstruction NN’s, being superior in the two metrics observed. Therefore, the</p><p>LSTM model is selected to go further, being applied to the rest of the data. The Figure 6.10</p><p>shows the comparison between the modelled temperature and the real bearing temperature taken</p><p>by the SCADA system. Most of the time, the model follows the real bearing temperature</p><p>closely, which indicate that deviations of normal behaviour of the bearing temperature should</p><p>be identified by the method.</p><p>Figure 6.10: Modelled bearing temperature x temperature measured by the SCADA.</p><p>Source: Author, 2020.</p><p>73</p><p>6.2.5 Post processing</p><p>After selecting the model with best results in the validation dataset, two post</p><p>processing methods were applied to identify incipient faults in the bearing. Both methods use</p><p>residual control charts to identify faults, varying only the threshold definition.</p><p>According to Montgomery (2008), the control charts present a characteristic of a</p><p>sample versus its number or time that it was measured. It usually contains a centre line, which</p><p>corresponds to the mean value of the characteristic; Upper and Lower control limits, which are</p><p>thresholds that, if the process is under control, limit the measured characteristic inside their</p><p>limits so nearly all points are located between them.</p><p>In the case of this research, the characteristic is the residual error between the</p><p>modelled and the measured SCADA data, while the control limits are defined as constant</p><p>threshold, as presented in Schleichtingen, Santos and Achiche (2013), and an adaptive</p><p>threshold, proposed by Talebi Sadrnia and Darabi (2014) and Patan (2008), with some</p><p>modifications.</p><p>6.2.5.1 Constant threshold</p><p>The choice of the limit value is a</p><p>tradeoff between the sensitivity of the model in</p><p>the identification of faults and the number of false alarms (SCHLECHTINGEN; SANTOS,</p><p>2011). For this research, the constant threshold limit selection method discussed in</p><p>Schleichtingen, Santos and Achiche (2013) is implemented, which define the limit in a</p><p>probabilistic way. The error is considered to be normally distributed and should fall between</p><p>upper and lower control limits.</p><p>The threshold is set as a constant multiple of the standard deviation. The choice of</p><p>constant influences the sensitivity of the application and can be adjusted if necessary. Choosing</p><p>generously can increase the number of false alarms, generating an unwanted enhance in</p><p>unscheduled maintenance activities. It can lead to loss of system credibility among</p><p>professionals. In addition, diagnostic failures can cause the unnecessary exchange of</p><p>components, causing additional maintenance to correct errors (SCHLECHTINGEN, SANTOS,</p><p>ACHICHE, 2013).</p><p>Figure 6.11 below shows the example of Schleichtingen, Santos and Achiche</p><p>(2013), presenting a graph of the probability distribution and the evolution of the forecast error</p><p>during a day. It shows the region considered to be in normal operation (center) and the regions</p><p>considered to be in abnormal operation (extremes).</p><p>74</p><p>Figure 6.11: Probability density function graph of the residual error and daily residual error mean.</p><p>Source: (SCHLECHTINGEN; SANTOS; ACHICHE, 2013).</p><p>Hence, for this simulation, the constant threshold is set to be 3 times the standard</p><p>deviation of the residuals taken in the training dataset, and may be set to other values if changes</p><p>the sensibility of the model are needed. The control graph is tested with the values of the daily</p><p>average of the forecast errors. To avoid false alarms, the alarm is only triggered when the daily</p><p>average of the forecast error breaks the limit at least 3 times within a week, recommended by</p><p>the article.</p><p>6.2.5.2 Adaptive Threshold</p><p>Even after all the pre-processing phase, training and testing, modelling uncertainty,</p><p>variations in data and noise still exist. Therefore, in order to avoid too many false alarms, the</p><p>threshold might be set to larger values, which may diminish the fault detection sensitivity since</p><p>bigger thresholds can leave real problems undetected. Hence, an adaptive threshold might be a</p><p>good solution since it is supposed to vary in time along with disturbances or uncontrolled effects</p><p>(PATAN, 2008).</p><p>Taking that into account, this research also implements the approach presented in</p><p>Patan (2008) and Talebi, Sadrnia and Darabi (2014) which developed an adaptive threshold</p><p>based on the estimation of statistical parameters on past observations of residuals, however,</p><p>with one modification, the addition of a constant (𝑎) times the standard deviation (𝜎). The</p><p>75</p><p>proposed method is described in the following equations, assuming that the residual is an</p><p>approximation of the normal distribution:</p><p>𝑇(𝑘) = 𝑡 ∗ �̅�(𝑘) ± 𝑚(𝑘) + 𝑎 ∗ 𝜎</p><p>(6.2)</p><p>�̅�(𝑘) = ζ ∗ 𝑣(𝑘) + (1 − ζ) ∗ 𝑣(𝑘 − 1)</p><p>(6.3)</p><p>𝑚(𝑘) = ζ ∗ 𝑚(𝑘) + (1 − ζ) ∗ 𝑚(𝑘 − 1)</p><p>(6.4)</p><p>Where 𝑇 is the threshold value at the moment 𝑘; 𝑣(𝑘) and 𝑚(𝑘) are the variance</p><p>and the mean of the last 𝑛 samples, respectively, ζ is the momentum parameter, which is</p><p>considered close to 1. 𝛽 is the significance level, that a residual exceeds the random value 𝑡 :</p><p>𝛽 = 𝑃</p><p>𝑟(𝑘) − 𝑚</p><p>𝑣</p><p>> 𝑡</p><p>(6.5)</p><p>The value of 𝑡 is tabulated, therefore, assuming a significance level 𝛽 = 0.003,</p><p>the value of 𝑡 obtained is 3.</p><p>6.2.5.3 Post processing results</p><p>At first, that visualization on the validation dataset is performed in order to better</p><p>see the behaviour of the residual error. Figure 6.12 shows the residual error on a 10 min</p><p>frequency as is taken from the SCADA data. The error varies too much, which may be difficult</p><p>to analyse since it brings lots of high values at once, therefore, as said in the section 6.2.5.1, the</p><p>residual will be analysed on a daily mean basis. Figure 6.13 and 6.14 presents the error averaged</p><p>hourly and daily, respectively.</p><p>76</p><p>Figure 6.12: 10 min residual error on the validation dataset.</p><p>Source: Author, 2020.</p><p>Figure 6.13: 1-hour residual error mean on the validation dataset.</p><p>Source: Author, 2020.</p><p>77</p><p>Figure 6.14: 1-day residual error mean on the validation dataset with constant and adaptive thresholding.</p><p>Source: Author, 2020.</p><p>The Figure 6.14 shows that the model behaves well in the validation dataset. Even</p><p>though the error surpasses the adaptive limit in August, it is not considered a false alarm since</p><p>it does not surpass it 3 times within a week, as can seem in the Figure 6.15. The figure shows</p><p>the table taken from the simulation results, with the data when the limit violation occurred in</p><p>the first column, the measured bearing temperature in the second column, the prediction made</p><p>by the model in the third column and the error with the difference between the measured and</p><p>the prediction made by the model at this timestamp. The error in this sample is large (-5.288)</p><p>compared to the behaviour of the dataset and it might be an indicator of invalid data.</p><p>The adaptive threshold is violated in 2 occasions because of this sample as can be</p><p>seen in the Figure 6.16. Besides the data presented in the Figure 6.15, the Figure also shows the</p><p>upper and lower limits of the adaptive threshold in the fifth and sixth positions. The lower limit</p><p>of the adaptive threshold is clearly affected by this, which is a downside of the adaptive</p><p>thresholding approach. And, as said before the alarm is still not triggered as it does not happen</p><p>3 times within a week.</p><p>Figure 6.15: Error surpasses the constant threshold in August.</p><p>Source: Author, 2020.</p><p>78</p><p>Figure 6.16: Error surpasses the adaptive threshold in August and May.</p><p>Source: Author, 2020.</p><p>The Figure 6.17 shows the method applied in the rest of the dataset, including the</p><p>period of fault. The line in black represents the time when the bearing was changed due to fault.</p><p>The figure shows that the model performed well but was not able to alarm before the fault.</p><p>Figure 6.17: 1-day residual error mean data applied to the test dataset.</p><p>Source: Author, 2020.</p><p>Figure 6.18: Error surpasses the limit in October-2015.</p><p>Source: Author, 2020.</p><p>No alarm is triggered at first, hence, the constant threshold with 3 standard</p><p>deviations may be too generous for this level of model performance, and a more restricted one</p><p>is also tested, with 2 standard deviations for both constant and adaptive thresholding. Figure</p><p>6.19 shows the application of two standard deviations for both constant and adaptive thresholds.</p><p>Figure 6.19: 1-day residual error mean data applied to the test dataset with 2 standard deviations as limit.</p><p>79</p><p>Source: Author, 2020</p><p>Figure 6.20: Error surpasses the constant limit in August, October and November-2015; January, July, August,</p><p>September and October-2016.</p><p>Source: Author, 2020.</p><p>Figure 6.21: Error surpasses the adaptive limit in March-2017 and October-2015.</p><p>Source: Author, 2020.</p><p>It also does not present any alarm since no value surpasses the threshold limit 3</p><p>times within a week. Therefore, as no alarm was triggered in these thresholding approaches,</p><p>some questions arise. The method is designed to model the bearing temperature and, in order</p><p>80</p><p>to perceive its faults, the fault that happened in this point should present variations in the</p><p>behaviour of its temperature as failure mode.</p><p>6.2.6 Retraining</p><p>In order to maintain the application useful, the model should be retrained in order</p><p>to measure the operating conditions of the new bearing. Therefore, the model was retrained</p><p>with 1-year operation data after the failure, as presented in</p><p>the Figure 6.22. The result in this</p><p>validation dataset is similar to the ones presented above, with 0.371 of MAE and 0.431 for</p><p>RMSE. Hence, the new bearing temperature model has similar performance to the old one, and</p><p>might be able to follow the bearing temperature as well.</p><p>Figure 6.22: Training dataset (pink), validation dataset (green) and failure (dashed black line) for the retraining</p><p>phase.</p><p>Source: Author, 2020.</p><p>6.2.6 Comparison with other studies</p><p>In the very last part of the framework, the comparison with other studies is</p><p>undertaken. Three studies were considered in this phase, Kusiak and Verma (2012), Bangalore</p><p>et al. (2017) and Lu et al. (2019) because they have the same goal of predicting bearing</p><p>temperature and present their results explicitly in terms of performance metrics. Table 6.4 is an</p><p>adaptation brought from Bangalore et al. (2017), it brings the results of studies that aimed at</p><p>modelling wind turbine bearings temperature and compare the studies in terms of the</p><p>performance of the model in the validation phase.</p><p>81</p><p>Table 6.4:4 Comparison with other studies in terms of mean absolute error and mean squared error</p><p>Author MAE RMSE</p><p>Kusiak and Verma (2012) 0.693</p><p>Bangalore et. al. (2017) 0.44 0.77</p><p>Lu et al. (2019) (Gearbox bearing model) 0.781</p><p>This research 0.353 0.411</p><p>Source: Author, 2020.</p><p>From the table, the LSTM model presented in this research has better results when</p><p>compared to previous studies in terms of both mean absolute error and root mean squared error</p><p>and, even though the fault could not be identified, it is supposed to follow the bearing</p><p>temperature better than the mentioned models as it performed better when exposed to unseen</p><p>data.</p><p>6.3 Framework description</p><p>The application of the framework step-by-step with real wind turbine data provided</p><p>insights to improve it. First, the data pre-processing phase does note has to happen before the</p><p>parameter selection task. It usually happens in vibration-based studies since it is necessary to</p><p>separate features of vibration from data and it using time and frequency-based transformations.</p><p>However, in studies that use SCADA data, data pre-processing usually comes after parameter</p><p>selection, in both statistical and domain-knowledge selections, as happened in the application</p><p>of the framework of this chapter.</p><p>Second, the comparison with other studies does not have to come after the post</p><p>processing phase. In some studies, the comparison between studies take into account the results</p><p>in the model selection and validation phase, comparing the performance of the algorithms in</p><p>classification or the error metrics in regression tasks. Therefore, the final framework for wind</p><p>turbine application development is the one presented in the Figure 6.23. It is divided in 4</p><p>dimensions: Selecting application, data preparation, model development and evaluation of</p><p>results.</p><p>82</p><p>Figure 6.23: Framework for application of Artificial Intelligence in Wind Turbine Operation and Maintenance.</p><p>Source: Author, 2020.</p><p>The selecting application phase encompasses the process of defining the application</p><p>to be simulated by getting access to data, determining the main goal and how artificial</p><p>intelligence applications can help. The data preparation phase encompasses the processes</p><p>responsible for getting the data ready for AI models: data pre-processing, parameter selection</p><p>83</p><p>and feature extraction. These steps do not have a clear sequence since SCADA and vibration-</p><p>based applications have different paths for data preparation.</p><p>The model development phase is the stage of selecting and improving the AI model</p><p>and fitting it to up-to-date data, having the model selection and validation, fitting best model</p><p>with new data and model retraining as a clear cycle of continuous improvement of the</p><p>application performance when exposed to new data.</p><p>The last phase, the evaluation of results, takes into account the outcomes of the</p><p>simulation encompassing information extracted from the simulation, the decision-making</p><p>criteria and the comparison with other studies. Different from the previous framework, the</p><p>comparison with other studies can be held also after the model selection phase, as happened in</p><p>the simulation of this chapter, since it is important to compare studies according to their model’s</p><p>performance when exposed to new data.</p><p>84</p><p>CHAPTER 7: CONCLUSIONS AND RECOMMENDATIONS</p><p>In order to contain climate change, nations around the world must take measures</p><p>towards sustainable energy development. In this scenario, wind energy poses as the main green</p><p>energy resource in the next years (see Figure 1.1), being the most competitive in most of the</p><p>countries. In order to maintain this pattern, wind turbine O&M activities should be improved</p><p>continuously since the wind energy depends on this equipment to transform kinetic energy from</p><p>the wind into electricity, and also because O&M corresponds to a big slice of the investments</p><p>in wind farm projects.</p><p>In this context, artificial Intelligence plays an important role in improving</p><p>competitiveness of wind turbine O&M activities as improvements in processing, data</p><p>acquisition and storage made it possible to build AI applications for this end. Therefore, taking</p><p>this into account, the main objective of this research was to answer the research questions: How</p><p>to apply artificial intelligence tools in wind turbines operations and maintenance? How</p><p>would be an AI application that is relevant for wind turbine maintenance of a Brazilian</p><p>wind farm?</p><p>In order to answer these research questions, seven specific objectives were defined</p><p>and accomplished. The first one was the identification of state-of-the art knowledge about the</p><p>theme held in the chapter 4, in the exploratory research phase, that consisted in a systematic</p><p>literature review with the analysis of 51 articles. that came up with the following insights:</p><p> China has been on top of research about this theme with 18 publications in</p><p>total, representing 35% of participation in the studies. USA is the second in</p><p>the number of publications with 5, followed by UK and Spain with 4</p><p>publications each.</p><p> Artificial neural networks are the most used methods and have been used</p><p>for different purposes like regression, classification and clustering tasks.</p><p> The studies were classified according to their main goal in four different</p><p>types of applications, condition monitoring, maintenance optimization,</p><p>study of indicators, and inspections systems. Condition monitoring</p><p>approaches had the largest number of applications with different approaches</p><p>such as normal behaviour models and remaining useful life prediction,</p><p>aiming at fault detection or fault diagnosis.</p><p>85</p><p> SCADA data is widely used because it is more accessible than other sorts</p><p>of data since every wind turbine is equipped with this system, however, it</p><p>has problems related to poor performance of acquisition due to its low</p><p>frequency data, which brings issues for modelling. Moreover, vibration data</p><p>is widely used for fault diagnosis tasks, even though it is difficult to be</p><p>obtained. Nevertheless, researchers sometimes go to test rigs or simulations</p><p>to test the effectiveness of the proposed approach.</p><p> The studies aimed at building applications for the whole wind turbine and</p><p>for its subcomponents, with the gearbox being the equipment with most</p><p>studies followed by the generator and the blades.</p><p> As only one study brought results in terms of monetary savings in adopting</p><p>AI in wind turbines operations and maintenance, it can be perceived as a</p><p>gap in the research and should be explored by further studies in a way of</p><p>bringing even more attention from the industry to the theme.</p><p>The second specific objective was to develop a conceptual</p><p>Figure 4.2: Geographic distribution of the institutions that the authors represent. .................. 41</p><p>Figure 4.3: Distribution of articles according to application. ................................................... 42</p><p>Figure 4.4: Failure annual frequency and downtime per failure for the wind turbines. ........... 43</p><p>Figure 4.5: Main applications found in the studies by equipment. .......................................... 50</p><p>Figure 4.6: Distribution of articles that used SCADA or vibration data for condition</p><p>monitoring. ............................................................................................................................... 51</p><p>Figure 4.7: Distribution of articles that used Vibration or SCADA data for condition</p><p>monitoring according to their main goal. ................................................................................. 51</p><p>Figure 5.1: Conceptual framework with the main factors to consider when building Artificial</p><p>Intelligence Applications for wind turbine operations and maintenance. ................................ 54</p><p>Figure 6.1: SCADA system architecture in wind farms. .......................................................... 63</p><p>Figure 6.2: Sensor location of the SCADA monitoring parameters selected by Schleichtingen</p><p>and Santos (2011) for building the normal behaviour models. ................................................ 64</p><p>Figure 6.3: Generator bearing data correlation for the wind turbine. ....................................... 65</p><p>Figure 6.4: Seasonality analysis between active power and front generator bearing</p><p>temperature. .............................................................................................................................. 67</p><p>Figure 6.5: Seasonality analysis between winding temperature and front generation bearing</p><p>temperature. .............................................................................................................................. 68</p><p>Figure 6.6: Training dataset (pink), validation dataset (green) and failure (dashed black line)</p><p>before pre-processing. .............................................................................................................. 68</p><p>Figure 6.7: Training dataset (pink), validation dataset (green) and failure (dashed black line)</p><p>after pre-processing. ................................................................................................................. 69</p><p>Figure 6.8: Feed-forward neural network architecture. ............................................................ 70</p><p>Figure 6.9: Autoregressive Feed-forward neural network architecture. ................................... 71</p><p>Figure 6.10: Modelled bearing temperature x temperature measured by the SCADA. ........... 72</p><p>Figure 6.11: Probability density function graph of the residual error and daily residual error</p><p>mean. ........................................................................................................................................ 74</p><p>Figure 6.12: 10 min residual error on the validation dataset. ................................................... 76</p><p>Figure 6.13: 1-hour residual error mean on the validation dataset. .......................................... 76</p><p>Figure 6.14: 1-day residual error mean on the validation dataset with constant and adaptive</p><p>thresholding. ............................................................................................................................. 77</p><p>Figure 6.15: Error surpasses the constant threshold in August. ............................................... 77</p><p>Figure 6.16: Error surpasses the adaptive threshold in August and May. ................................ 78</p><p>Figure 6.17: 1-day residual error mean data applied to the test dataset. .................................. 78</p><p>Figure 6.18: Error surpasses the limit in October-2015. .......................................................... 78</p><p>Figure 6.19: 1-day residual error mean data applied to the test dataset with 2 standard</p><p>deviations as limit. .................................................................................................................... 78</p><p>Figure 6.20: Error surpasses the constant limit in August, October and November-2015;</p><p>January, July, August, September and October-2016. ............................................................. 79</p><p>Figure 6.21: Error surpasses the adaptive limit in March-2017 and October-2015. ................ 79</p><p>Figure 6.22: Training dataset (pink), validation dataset (green) and failure (dashed black line)</p><p>for the retraining phase. ............................................................................................................ 80</p><p>Figure 6.23: Framework for application of Artificial Intelligence in Wind Turbine Operation</p><p>and Maintenance. ...................................................................................................................... 82</p><p>TABLE INDEX</p><p>Table 4.1: Some classification and regression metrics found in the studies. ........................... 52</p><p>Table 5.1: Steps mentioned by the articles selected in for literature reviewing. ...................... 55</p><p>Table 6.1: Some quantitative results of studies that used SCADA data to build applications of</p><p>condition monitoring of wind turbine bearings. ....................................................................... 62</p><p>Table 6.2:1Front generator bearing faults that happened in the wind farm. ............................. 62</p><p>Table 6.3:1Performance results of the networks in the validation dataset. ............................... 72</p><p>Table 6.4:1Comparison with other studies in terms of mean absolute error and mean squared</p><p>error .......................................................................................................................................... 81</p><p>CHART INDEX</p><p>Chart 2.1: Research classification. ........................................................................................... 21</p><p>Chart 2.2: Key-words combination for searching for papers in the Periodicos Capes platform.</p><p>.................................................................................................................................................. 23</p><p>Chart 3.1: Artificial intelligence definitions separated by categories. ..................................... 28</p><p>Chart 6.1: Input data for building the normal behaviour model proposed by Schlechtingen and</p><p>Santos (2011). ........................................................................................................................... 64</p><p>ACRONYMS</p><p>O&M Operations and Maintenance</p><p>AI Artificial Intelligence</p><p>SCADA Supervisory control and data acquisition</p><p>SLR Systematic Literature Review</p><p>ML Machine Learning</p><p>ANN Artificial Neural Networks</p><p>NN Neural Network</p><p>ANFIS Adaptive Neural Fuzzy Inference System</p><p>SVM Support Vector Machines</p><p>KNN K-Nearest Neighbours</p><p>LSTM Long Short-Term Memory</p><p>ELM Extreme Learning Machine</p><p>PCA Principal Component Analysis</p><p>DAE Deep Auto-Encoder</p><p>DFIG Doubly-fed Induction Generator</p><p>RNN Recurrent Neural Networks</p><p>EWMA Exponentially Weighted Moving Average</p><p>SOM self-Organizing Maps</p><p>CNN Convolutional Neural Networks</p><p>MSE Mean Squared Error</p><p>RMSE Root Mean Squared Error</p><p>MAPE Mean Absolute Percentage Error</p><p>MAE Mean Absolute Error</p><p>FFNN Feed-Forward Neural Network</p><p>ARNN Autoregressive Neural Network</p><p>SUMMARY</p><p>ACKNOWLEDGMENTS .......................................................................................................... 3</p><p>RESUMO ................................................................................................................................... 4</p><p>ABSTRACT ............................................................................................................................... 5</p><p>FIGURE INDEX ........................................................................................................................</p><p>framework for the</p><p>application of AI in wind turbine operations and maintenance, held in Chapter 5. After</p><p>analysing the articles and mapping the steps taken by them, a framework that summarizes the</p><p>procedure of developing applications of AI in wind turbine operations and maintenance was</p><p>proposed. The framework is composed by 10 steps: Getting data; define how AI can help; data</p><p>pre-processing; feature selection and extraction; model selection and validation; fit best model</p><p>with new data; post-processing; model retraining; comparison with other studies; and extract</p><p>information from data.</p><p>The third specific objective was to disclosure relevant applications for a Brazilian</p><p>wind farm within the studies and compare methods, considering main goal, data needed to</p><p>compose the study, complexity and results. It was performed in the Section 6.1 in an interview</p><p>with a Brazilian wind farm O&M specialist. In this interview, the application for wind turbine</p><p>bearing condition monitoring was selected to be simulated since the necessary data was</p><p>available for use, the methods are not too complex and the fact that this kind of application is</p><p>relevant for the wind farm in question.</p><p>After that, the fourth and fifth specify objectives were achieved by validating the</p><p>framework procedure with the application of it in a Brazilian wind farm. A condition</p><p>monitoring application using normal behaviour model for the generator bearing temperature</p><p>was conducted, using SCADA data from the turbine that had the earliest generator bearing</p><p>failure. The steps were followed according to practices taken from the articles:</p><p>86</p><p> The generator bearing temperature was modelled using SCADA data of</p><p>Nacelle temperature, high speed shaft rotation, power output and winding</p><p>temperature as in Schleichtingen and Santos (2011);</p><p> General data pre-processing steps were performed and separation in</p><p>training, validation and testing dataset was discussed;</p><p> FFNN, ARNN and LSTM NN were used to model the parameter, with the</p><p>LSTM having superior performance when exposed to unseen data, even</p><p>when compared with other studies;</p><p> Constant and adaptive thresholds were implemented as pos-processing</p><p>methods; however, the model could not identify the fault before it happens</p><p>in both approaches.</p><p> The adaptive threshold approach is sensitive to high gradients in the</p><p>residual.</p><p>Even though the approach could not identify the faulty bearing properly, it</p><p>presented better results in terms of performance when exposed to unseen data, which indicates</p><p>that it mimics the behaviour of the bearing temperature accordingly.</p><p>The last specific objective was achieved by re-modelling the conceptual framework</p><p>with considerations learned after the simulation, and it was achieved in the Section 6.3. In this</p><p>phase, four dimensions were created: selecting application, model preparation, model</p><p>development and evaluation of results. The data pre-processing, and parameter selection and</p><p>feature extraction phases were rearranged as they do not have to follow a clear sequence since</p><p>sometimes the data pre-processing comes before the parameter selection and feature extraction</p><p>phase (case of vibration studies) and the opposite also happens. The comparison with other</p><p>studies phase was also linked to the model selection and validation one since studies can also</p><p>be compared in this step, in terms of algorithm performance.</p><p>The proposed model is supposed to mimics the bearing temperature and can only</p><p>be able to identify faults if the bearing temperature disturbances happen as failure mode.</p><p>Therefore, further studies with turbines that have a clear history of this kind of fault are</p><p>important to test the effectiveness of the proposed framework and algorithms.</p><p>87</p><p>REFERENCES</p><p>ABD-ELKADER, A. G.; ALLAM, D. F.; TAGELDIN, E. Islanding detection method for DFIG wind turbines</p><p>using artificial neural networks. Electrical Power and Energy Systems, v. 62, p. 335-343, 2014.</p><p>ABEEÓLICA. Boletim Anual de Geração Eólica 2017. 2018.</p><p>ABEEÓLICA. Energia eólica chega a 14,71 GW de capacidade instalada. Disponível em:</p><p>. Acesso em: 28 fev.</p><p>2019.</p><p>ADOUNI, A. et al. FDI based on Artificial Neural Network for Low-Voltage-Ride-Through in DFIG-based Wind</p><p>Turbine. ISA Transactions, v. 64, p. 353–364, 2016.</p><p>ALI, J. B. et al. Online automatic diagnosis of wind turbine bearings progressive degradations under real</p><p>experimental conditions based on unsupervised machine learning. Applied Acoustics, v. 132, p. 167-181. 2018.</p><p>AMINA, B.; TAYEB, A.; MOULOUD, D. Intelligent Open Switch Fault Detection for Power Converter in Wind</p><p>Energy System. Applied Artificial Intelligence, v. 30, n. 9, p. 886–898, 2016.</p><p>ANEEL. Ranking das Tarifas. Disponível em: .Acesso em:15 nov.</p><p>2018.</p><p>ASTOLFI, D. et al. Data mining techniques for performance analysis of onshore wind farms. Applied Energy, v.</p><p>148, p. 220–233, 2015.</p><p>AZADEGAN, A. et al. Fuzzy logic in manufacturing: A review of literature and a specialized application.</p><p>International Journal of Production Economics, v. 132, n. 2, p. 258–270, 2011.</p><p>BACH-ANDERSEN, M.; RØMER-ODGAARD, B.; WINTHER, O. Deep learning for automated drivetrain fault</p><p>detection. Wind Energy, v. 21, n. 1, p. 29–41, 2018.</p><p>BANGALORE, P. et al. An artificial neural network-based condition monitoring method for wind turbines, with</p><p>application to the monitoring of the gearbox. Wind Energy, v. 20, n. 8, p. 1421–1438, ago. 2017.</p><p>BANGALORE, P. Load and Risk Based Maintenance Management of Wind Turbines. 2014. Phs Thesis -</p><p>Department of Energy and Environment, Chalmers University of technology, 2014.</p><p>BAKRI, A. E.; KOUMIR, M.; BOUMHIDI, I. Extreme learning machine-based non-linear observer for fault</p><p>detection and isolation of wind turbine. Australian Journal of Electrical and Electronics Engineering, v. 16. P.</p><p>12-20, 2019.</p><p>BERMEJO, J. et al. A Review of the Use of Artificial Neural Network Models for Energy and Reliability</p><p>Prediction. A Study of the Solar PV, Hydraulic and Wind Energy Sources. Applied Sciences, v. 9, 2019.</p><p>BERNARDES, F. et al. Winds of Change: How Up-To-Date Forecasting Methods Could Help Change Brazilian</p><p>Wind Energy Policy and Save Billions of US$. Energies, v. 11, n. 11, p. 2952, 2018.</p><p>BLOOMBERG NEF. World’s Best Breezes Lead to Cheapest Wind Power in Brazil. Disponível em:</p><p>. , 2016.</p><p>CABRAL, M. A. L. CLASSIFICAÇÃO AUTOMATIZADA DE FALHAS TRIBOLÓGICAS DE</p><p>SISTEMAS ALTERNATIVOS COM O USO DE REDES NEURAIS ARTIFICIAIS NÃO</p><p>SUPERVISIONADAS. Phd Thesis - Departamento de Engenharia Mecânica, Universidade Federal do Rio</p><p>Grande do Norte, 2017.</p><p>88</p><p>CHEN, B.; MATTHEWS, P. C.; TAVNER, P. J. Wind turbine pitch faults prognosis using a-priori knowledge-</p><p>based ANFIS. Expert Systems with Applications journal, v. 40, p 6863-6876, 2013.</p><p>CHEN, L. et al. Learning deep representation of imbalanced SCADA data for fault detection of wind turbines.</p><p>Measurement, v. 139, p. 370-379. 2019.</p><p>CHOI, S.; KIM, S. An investigation of operating behavior characteristics of a wind power system using a fuzzy</p><p>clustering method. Expert Systems with Applications, v. 81, p. 244-250, 2017.</p><p>CRESWELL, J. W. Research Design. 3. ed. Los Angeles: SAGE, 2009.</p><p>CUNNINGHAM, P.; DELANY, S. J. k-Nearest Neighbour Classifiers. Technical Report. 2007.</p><p>DEMO, P. Introdução à metologia da ciência. São Paulo. Atlas. 1985.</p><p>ELHOR, N. et al. Neural networks for wind turbine supervision. p. 366–369, 1999.</p><p>ESBJERG. PORT ESBJERG – OFSHORE WIND HUB PRESENTATION. 2020.</p><p>FU, J. et al. Condition Monitoring of Wind Turbine Gearbox Bearing Based on Deep Learning Model. IEEE</p><p>Access, v. 7. 2019.</p><p>GANTASALA, S.; LUNENO, J. C.; AIDANPÄÄ, J. O. Investigating how an artificial</p><p>neural network model can</p><p>be used to detect added mass on a non-rotating beam using its natural frequencies: A possible application for wind</p><p>turbine blade ice detection. Energies, v. 10, n. 2, p. 1–21, 2017.</p><p>GONZÁLEZ, M. O. A. Processo para Gerenciar a Integração de Clientes no Processo de Desenvolvimento</p><p>do Produto. 2010. Phd Thesis - Departamento de Engenharia de Produção, Universidade Federal de São Carlos,</p><p>São Carlos, 2010.</p><p>GONZÁLEZ, M. O. A.; GONÇALVES, J. S.; VASCONCELOS, R. M. Sustainable development: Case study in</p><p>the implementation of renewable energy in Brazil. Journal of Cleaner Production, v. 142, p. 461–475, 2017.</p><p>GUO, P.; FU, J.; YANG, X. Condition monitoring and fault diagnosis of wind turbines gearbox bearing</p><p>temperature based on kolmogorov-smirnov test and convolutional neural network model. Energies, v. 11, n. 9,</p><p>2018.</p><p>GUPTA, A. K. Efficient Wind Energy Conversion: Evolution to Modern Design. Journal of Energy Resources</p><p>Technology, v. 137, n. 5, p. 051201, 2015.</p><p>GWEC. GLOBAL WIND REPORT 2018. 2019.</p><p>HAMEED, Z.; WANG, K. Clustering Analysis to Improve the Reliability and Maintainability of Wind Turbines</p><p>with Self-Organizing Map Neural Network. International Journal of Performability Engineering, v. 9, n. 3, p.</p><p>245–260, 2013.</p><p>HAMEED, Z.; WANG, K. Development of Optimal Maintenance Strategies for Offshore Wind Turbine by Using</p><p>Artificial Neural Network. Wind Engineering, v. 36, n. 3, p. 353–364, 2012.</p><p>HAYKIN, S. Neural Networks and Learning Machines. 3. ed. Upper Saddle River, New Jersey: Pearson, 2008.</p><p>HELBING, G.; RITTER, M. Deep Learning for fault detection in wind turbines. Renewable and Sustainable</p><p>Energy Reviews, v. 98, p. 189-198, 2018.</p><p>HOSSAIN, M. L.; ABU-SIADA, A.; MUYEEN, S. M. Methods for advanced wind turbine condition monitoring</p><p>and early diagnosis: A literature review. Energies, v. 11, n. 5, 2018.</p><p>HUANG, Z. et al. Condition Monitoring of Wind Turbine Based on Copula Function and Autoregressive Neural</p><p>Network. MATEC Web of Conferences. v. 04008, p. 1–5, 2018.</p><p>89</p><p>HUANG, G. B.; DING, X.; ZHOU, H. Optimization method based extreme learning machine for classificatio.</p><p>Neurocomputing, v. 74, p. 155-163. 2010.</p><p>IGBA, J. et al. Performance assessment of wind turbine gearboxes using in-service data: Current approaches and</p><p>future trends. Renewable and Sustainable Energy Reviews, v. 50, p. 144–159, 2015.</p><p>IRENA. Renewable capacity statistics 2018, 2018.</p><p>IRENA. Global Energy Transformation: A Roadmap to 2050, 2018a.</p><p>JEFFCOCK. What´s the Difference Between AI, Machine learning, and Deep Learning. Available in:</p><p>. Aceess in 30 august 2020.</p><p>JHA, S. K. et al. Renewable energy: Present research and future scope of Artificial Intelligence. Renewable and</p><p>Sustainable Energy Reviews, v. 77, n. May 2016, p. 297–317, 2017.</p><p>JIMÉNEZ, A. A. et al. Linear and nonlinear features and machine learning for wind turbine blade ice detection</p><p>and diagnosis. Renewable Energy, v. 132, p. 1034–1048, 2019.</p><p>JIMÉNEZ, A. A.; MUÑOZ, C. Q.; MÁRQUEZ, F. P. Machine learning for wind turbine blades maintenance</p><p>management. Energies, v. 11, n. 1, p. 1–16, 2018.</p><p>KALOGIROU, S. A. Artificial Intelligence in Renewable Energy Systems Modelling and Prediction, 2002.</p><p>KAVAZ, A. G.; BARUTCU, B. Fault detection of wind turbine sensors using neural networks. Journal of</p><p>Sensors, v. 2018, p. 103–108, 2018.</p><p>KOHONEN, T.; SOMERVUO, P. How to make large self-organizing maps for nonvectorial data. Neural</p><p>Networks, v. 15, p. 945-953. 2002.</p><p>KREMER, M. A. AUTOASSOCIATIVE NEURAL NETWORKS. Computers & Chemical Engineering, v. 16,</p><p>p. 313-328. 1992.</p><p>KUSIAK, A. Renewables: Share data on wind energy. Nature, v. 529, p. 19-21. 2016.</p><p>KUSIAK, A.; VERMA, A.. A data-mining approach to monitoring wind turbines. IEEE Transactions on</p><p>sustainable energy, v. 3, n. 1, p. 150–157, 2012a.</p><p>KUSIAK, A.; VERMA, A. Analyzing bearing faults in wind turbines: A data-mining approach. Renewable</p><p>Energy, v. 48, p. 110–116, 2012b.</p><p>KUSIAK, A.; VERMA, A. Prediction of Status Patterns of Wind Turbines: A Data-Mining Approach. Journal of</p><p>Solar Energy Engineering, v. 133, n. 1, p. 011008, 2011.</p><p>Kusiak A., Verma A. Monitoring wind farms with performance curves. IEEE Transactions on Sustainable</p><p>Energy. 4, 192–199, 2013</p><p>LAKATOS, E; MARCONI, M. Fundamentos de metodologia científica. São Paulo. Atlas S. A., 2003.</p><p>LETCHER, T. M. et al. Wind Energy Engineering: A handbook for onshore and offshore wind turbines. Londres,</p><p>Elsevier, 2017.</p><p>LI, X. et al. Remaining Useful Life Prediction of Rolling Element Bearings Using Supervised Machine Learning.</p><p>Energies, v. 12, 2019.</p><p>LIMA, F.; FERREIRA, P.; VIEIRA, F. Strategic impact management of wind power projects Fátima. Renewable</p><p>and Sustainable Energy Reviews, v. 25, p. 277–290, 2013.</p><p>90</p><p>LIND, P. et al. Normal Behaviour Models for Wind Turbine Vibrations: Comparison of Neural Networks and a</p><p>Stochastic Approach. Energies, v. 10, n. 12, p. 1944, 2017.</p><p>LIU, Z. et al. GA-BP Neural Network-Based Strain Prediction in Full-Scale Static Testing of Wind Turbine Blades.</p><p>Energies, v. 12, p. 1-15. 2019.</p><p>LOH, W. Y.; SHIH, Y. S. SPLIT SELECTION METHODS FOR CLASSIFICATION TREES. Statistica Sinica,</p><p>v. 7, n. 4, p. 815–840, 1997.</p><p>LU, Y. et al. Condition based maintenance optimization for offshore wind turbine considering opportunities based</p><p>on neural network approach. Applied Ocean Research, v. 74, p. 69–79, 2018.</p><p>MANA, M.; PICCIONI, E.; TERZI, L. Wind turbine fault diagnosis through temperature analysis: An Artificial</p><p>Neural Network approach. Diagnostyka, v. 18, n. 1, p. 9–16, 2017.</p><p>MARUGÁN, A. P. et al. A survey of artificial neural network in wind energy systems. Applied Energy, v. 228,</p><p>n. April, p. 1822–1836, 2018.</p><p>MAZIDI, P.; TJERNBERG, L. B.; BOBI, M. A. S. Wind turbine prognostics and maintenance management based</p><p>on a hybrid approach of neural networks and a proportional hazards model. Journal of Risk and Reliability, v.</p><p>231, n. 2, p. 121–129, 2017.</p><p>MCCULLOCH, W. S.; PITTS, W. H. A Logical Calculus of the Ideas Immanent in Nervous Activity. Bulletin of</p><p>Mathematical Biophysics, v. 5, p. 115–133, 1943.</p><p>MÖRCHEN, F. Time series feature extraction for data mining using dwt and dft. 2003.</p><p>MHI VESTAS. MHI Vestas Unveils Pioneering MVOW SMART Turbine Product Portfolio. Available in:</p><p>. Acesso em: 5 dez.</p><p>2018.</p><p>NILSSON, N. J. The quest for artificial Intelligence. Cambridge University Press, 2009.</p><p>UN. Plataforma Agenda 2030. Available in: . Access: 7 mar. 2020.</p><p>PAI, M. et al. Systematic reviews and meta-analyses: an illustrated, step-by- step guide. Natl Med J India, v. 17,</p><p>n. 2, p. 86-95, 2004.</p><p>PAL, M.; MATHER, P. M. An assessment of the effectiveness of decision tree methods for land cover</p><p>classification. Remote Sensing of Environment, v. 86, n. 4, p. 554–565, 2003.</p><p>PATAN, K. Artificial Neural Networks for the Modelling and Fault Diagnosis of Technical Processes. v 377.</p><p>Berlin, Springer-Verlag. 2008.</p><p>PEDROSA, G. Detecção e Diagnóstico de Falhas na Performance de Aerogeradores. Master thesis -</p><p>Departamento de Engenharia Mecânica, Universedade Federal de Pernambuco, 2016.</p><p>PEREIRA, E. B. et al. The impacts of global climate changes on the wind power density in Brazil. Renewable</p><p>Energy, v. 49, p. 107–110, 2013.</p><p>RAFIQ, M. Y.; BUGMANN, G.; EASTERBROOK, D. J. Neural network design for engineering applications.</p><p>Computers and Structures, v. 79, n. 17, p. 1541–1552, 2001.</p><p>REGAN, T.; BEALE, C.; INALPOLAT, M. Wind Turbine Blade Damage Detection Using Supervised Machine</p><p>Learning Algorithms. Journal of Vibration and Acoustics, v. 139, n. 6, p. 061010, 2017.</p><p>REN, K.; QU, J. Identification of shaft centerline orbit for wind power units based on hopfield neural network</p><p>improved by simulated annealing.</p><p>Mathematical Problems in Engineering, v. 2014, 2014.</p><p>91</p><p>ROSA, J. L. G. Fundamentos da Inteligência Artificial. Rio de Janeiro: LTC Ltda., 2011.</p><p>ROSENBLATT, F. Perceptrons and the Theory of Brain Mechanics. 1962.</p><p>ROWE, N. C. Artificial Intelligence through Prolog. Upper Saddle River, NJ: Prentice-Hall, 1988.</p><p>RUSSEL, S. J.; NORVING, P. Artificial Intelligence: A Modern Approach. 3. ed. New Jersey: Pearson, 2013.</p><p>SAMPAIO, P. G. V.; GONZÁLEZ, M. O. A. Photovoltaic solar energy: Conceptual framework. Renewable and</p><p>Sustainable Energy Reviews, v. 74, p. 590 - 601, 2017.</p><p>SANTOS, M.; GONZÁLEZ, M. Factors that influence the performance of wind farms. Renewable Energy. 2019.</p><p>SCHLECHTINGEN, M.; SANTOS, I. F.; ACHICHE, S. Wind turbine condition monitoring based on SCADA</p><p>data using normal behavior models. Part 1: System description. Applied Soft Computing, v. 13, p. 259–270,</p><p>2013.</p><p>SCHLECHTINGEN, M.; SANTOS, I. Comparative analysis of neural network and regression based condition</p><p>monitoring approaches for wind turbine fault detection. Mechanical Systems and Signal Processing, v. 25, n. 5,</p><p>p. 1849–1875, 2011.</p><p>SHIHAVUDDIN, ASM. et al. Wind Turbine Surface Damage Detection by Deep Learning Aided Drone</p><p>Inspection Analysis. Energies, v. 12, p. 1-15. 2019.</p><p>SONG, C. SUPPORT VECTOR MACHINES (SVM). Available in: . Acessed in: 29 jan. 2019.</p><p>STETCO, A. et al. Machine learning methods for wind turbine condition monitoring: A review. Renewable</p><p>Energy journal, v. 133, p. 620-635, 2019.</p><p>STRDCZKIEWICZ, M.; BARSZCZ, T. Application of Artificial Neural Network for Damage Detection in</p><p>Planetary Gearbox of Wind Turbine. Shock and Vibration, v. 2016, p. 1–13, 2016.</p><p>SU, C.; HU, Z. Reliability assessment for Chinese domestic wind turbines based on data mining techniques. Wind</p><p>Energy, v. 21, n. 3, p. 198–209, 2018.</p><p>SUN, Z.; SUN, H. Health Status Assessment for Wind Turbine. Mathematical Problems in Engineering, v.</p><p>2018, n. 2, p. 16, 2018.</p><p>SVOZIL, D.; KVASNIEKA, V.; POSPICHAL, J. Introduction to multi-layer feed-forward neural networks.</p><p>Chemometrics and intelligent laboratory systems, v. 39, p. 43–62, 1997.</p><p>TARASSENKO, L. A guide to neural computing applications. 1 ed., John Wiley and Sons, USA, 1998.</p><p>TALEBI, N.; SADRNIA, M. A.; DARABI, A. Robust fault detection of wind energy conversion systems using</p><p>dynamic neural networks. International Journal of Sustainable Energy, v. 2014, p. 1–19, 2014.</p><p>TANG, et al. Fault diagnosis for a wind turbine transmission system based on manifold learning and Shannon</p><p>wavelet support vector machine. Renewable Energy, v. 62, p. 1-9. 2014.</p><p>TURBOSQUID. Gerador de Turbina Eólica Modelo 3D. 2020. Avaliable in: . Acessed in: 20 dec. 2020.</p><p>TURING, A. M. Computing Machinery and Intelligence Author. Mind, New Series, v. 59, n. 236, p. 433–460,</p><p>1950.</p><p>VAPNIK, V. N. An overview of statistical learning theory. IEEE Transactions on Neural Networks, v. 10, n. 5,</p><p>p. 988–999, 1999.</p><p>92</p><p>VERMA, A.; KUSIAK, A. Fault Monitoring of Wind Turbine Generator Brushes: A Data-Mining Approach.</p><p>Journal of Solar Energy Engineering, v. 134, n. 2, p. 021001, 2012.</p><p>VESELY, E. Consider automated machine learning for wind turbine asset maintenance. Power, v. 161, n. 9, p. 1–</p><p>15, 2017.</p><p>VESTAS. Annual Report 2019. 2020.</p><p>WANG, B. et al. Path planning for detection robot climbing on rotor blade surfaces of wind turbine based on</p><p>neural network. Advances in Mechanical Engineering, v. 2013, 2013.</p><p>WANG, X. B. et al. Novel paralleled extreme learning machine networks for fault diagnosis of wind turbine</p><p>drivetrain. Memetic Computing, v. 11, p. 127-142. 2019.</p><p>WANG, Q. et al. A Sensor Network Modeling and Fault Detection Method for Large Wind Farms by Using Neural</p><p>Networks. IEEE International Conference on Control & Automation. 2014.</p><p>WANG, X. Fault Diagnosis on Transmission System of Wind Turbines Based on Wavelet Packet Transform and</p><p>RBF Neural Networks. Applied Mechanics and Materials. v. 375, p. 1102–1105, 2013.</p><p>WEBSTER, J.; WATSON, R. T. Analyzing the Past to Prepare de Future: Wrinting a Literature Review. MIS Q,</p><p>v. 26, n. 2, p. 13-23. 2002.</p><p>WINOGRAD, T. Natural language understanding. Frontiers in Artificial Intelligence and Applications, v. 3, n.</p><p>1, p. 1–191, 1972.</p><p>WU, B. et al. Fault diagnosis for wind turbine based on improved extreme learning machine. Journal of Shanghai</p><p>Jiaotong University (Science), v. 22, n. 4, p. 466–473, 2017.</p><p>WU, C. M.; SONG, Q. H. Fault diagnosis of wind turbine vibration based on wavelet transform and neural</p><p>network. Journal of Information Hiding and Multimedia Signal Processing, v. 7, n. 4, p. 898–905, 2016.</p><p>YANG, H. H.; HUANG, M. L.; HUANG, P. C.. Detection of Wind Turbine Faults Using a Data Mining Approach.</p><p>Journal of Energy Engineering, v. 142, n. 3, p. 04015021, 2016.</p><p>YANG, H. H. et al. An approach combining data mining and control charts-based model for fault detection in</p><p>wind turbines. Renewable Energy, v. 115, p. 808–816, 2018.</p><p>YANG, Z. X.; WANG, X. Bo; ZHONG, Jian Hua. Representational learning for fault diagnosis of wind turbine</p><p>equipment: A multi-layered extreme learning machines approach. Energies, v. 9, n. 6, p. 1–17, 2016.</p><p>YEH, C. H. et al. Machine Learning for Long Cycle Maintenance Prediction of Wind Turbine. Sensors, v. 19,</p><p>2019.</p><p>YIN, R. K. Estudo De Caso. 2. ed. Porto Alegre: Bookman, 2001.</p><p>YU, W.; HUANG, S.; XIAO, W. Fault Diagnosis Based on an Approach Combining a Spectrogram and a</p><p>Convolutional Neural Network with Application to aWind Turbine System. Energies, v. 11, n. 10, 2018.</p><p>ZHAO, H. et al. Anomaly detection and fault analysis of wind turbine components based on deep learning network.</p><p>Renewable Energy, v. 127, p. 825-834, 2018.</p><p>ZADEH, L. A. The role of fuzzy logic in the management of uncertainty in expert systems. v. 11, p. 199–227,</p><p>1983.</p><p>ZHONG, J. H. et al. Multi-Fault Rapid Diagnosis for Wind Turbine Gearbox Using Sparse Bayesian Extreme</p><p>Learning Machine. IEEE Access, v. 7, p. 773-781. 2019.</p><p>93</p><p>APPENDIX 1 – QUESTIONS OF THE INTERVIEW WITH THE WIND FARM O&M</p><p>SPECIALIST</p><p>Q - How are the applications used in the day-to-day operation and maintenance of wind</p><p>turbines that assist in activities such as decision making and equipment monitoring?</p><p>Q - What are their characteristics?</p><p>Q - Do these tools allow the creation of a database?</p><p>Q - Which system is most concerned with the time of operation and maintenance, that is,</p><p>which system is the most critical?</p><p>Q - Are the existing tools efficient in monitoring this system today?</p><p>Q - Are there any bottlenecks that can be improved by other applications?</p><p>Q - Among the reviewed tools in the systematic bibliographic review, are there any that can</p><p>contribute to monitoring the condition of this system? Which one?</p><p>Q – The database is available for the development of this application?</p><p>6</p><p>TABLE INDEX .......................................................................................................................... 8</p><p>CHART INDEX ......................................................................................................................... 9</p><p>ACRONYMS ........................................................................................................................... 10</p><p>SUMMARY ............................................................................................................................. 11</p><p>CHAPTER 1: INTRODUCTION ............................................................................................. 14</p><p>1.1 Background .................................................................................................................... 14</p><p>1.2 Objectives ....................................................................................................................... 17</p><p>1.2.1 Main objective .......................................................................................................... 17</p><p>1.2.2 Specific objectives .................................................................................................... 17</p><p>1.3 Justification of research .................................................................................................. 17</p><p>1.4 Thesis structure ............................................................................................................... 19</p><p>CHAPTER 2: RESEARCH METHOD .................................................................................... 21</p><p>2.1 Research procedure ........................................................................................................ 21</p><p>2.2.1 Exploratory research ................................................................................................ 22</p><p>2.2.2 Development of framework for the application of ai in wind turbine operations and</p><p>maintenance .............................................................................................................. 23</p><p>2.2.3 Identification of relevant ai application for a brazilian wind farm maintenance ..... 23</p><p>2.2.4 Data acquisition ........................................................................................................ 24</p><p>2.2.5 Implementation of the framework for the application suggested by the o&m</p><p>specialist ................................................................................................................... 24</p><p>2.2.6 Framework restructuring after practical experience ................................................. 24</p><p>CHAPTER 3: LITERATURE REVIEW .................................................................................. 25</p><p>3.1 Wind energy .................................................................................................................... 25</p><p>3.2 Artificial intelligence ...................................................................................................... 28</p><p>3.3.1 ARTIFICIAL INTELLIGENCE METHODS AND CONCEPTS .............................. 32</p><p>3.3.1 Artificial neural networks (ann) ............................................................................... 32</p><p>3.3.2 Support vector machines (svm) ................................................................................ 35</p><p>3.3.3 Decision trees ........................................................................................................... 36</p><p>3.3.4 Ensemble learning .................................................................................................... 37</p><p>3.3.5 K-nearest neighbours ................................................................................................ 37</p><p>3.3.5 Fuzzy logic ............................................................................................................... 37</p><p>3.4 Ai applications for wind turbine maintenance ............................................................... 37</p><p>CHAPTER 4: STATE OF THE ART ...................................................................................... 40</p><p>4.1 Descriptive analysis ........................................................................................................ 40</p><p>4.1.1 Geographic distribution ............................................................................................ 41</p><p>4.1.2 Publications by journal ............................................................................................. 41</p><p>4.2 Content analysis ............................................................................................................. 42</p><p>4.2.1 Studies developed by equipment .............................................................................. 42</p><p>4.2.2 Scada vs vibration data for condition monitoring .................................................... 50</p><p>4.2.3 Evaluation metrics .................................................................................................... 52</p><p>CHAPTER 5: CONCEPTUAL FRAMEWORK FOR APPLICATION OF ARTIFICIAL</p><p>INTELLIGENCE IN WIND TURBINE OPERATION AND MAINTENANCE .................. 54</p><p>5.1 Getting data .................................................................................................................... 56</p><p>5.2 Define how ai can help in the approach .......................................................................... 56</p><p>5.3 Data pre-processing ........................................................................................................ 57</p><p>5.4 Parameter selection and feature extraction ..................................................................... 57</p><p>5.3 Model selection and validation ....................................................................................... 58</p><p>5.4 Fit best model to unseen data ......................................................................................... 58</p><p>5.5 Retrain model periodically ............................................................................................. 59</p><p>5.6 Post processing ............................................................................................................... 59</p><p>5.7 Comparison with other studies ....................................................................................... 60</p><p>CHAPTER 6: CONCEPTUAL FRAMEWORK APPLICATION AND LEARNING</p><p>LESSONS ................................................................................................................................. 61</p><p>6.1 Selection of application .................................................................................................. 61</p><p>6.2 Application of the conceptual framework ...................................................................... 62</p><p>6.2.1 Getting data .............................................................................................................. 63</p><p>6.2.2 Define how ai can help in the approach ................................................................... 63</p><p>6.2.3 Parameter selection .................................................................................................. 64</p><p>6.2.4 Data pre-processing .................................................................................................. 66</p><p>6.2.4 Model selection ........................................................................................................ 66</p><p>6.2.5 Post processing ......................................................................................................... 73</p><p>6.2.6 Retraining ................................................................................................................. 80</p><p>6.2.6 Comparison with other studies ................................................................................. 80</p><p>6.3 Framework description ................................................................................................... 81</p><p>CHAPTER 7: CONCLUSIONS AND RECOMMENDATIONS</p><p>........................................... 84</p><p>REFERENCES ......................................................................................................................... 87</p><p>APPENDIX 1 – QUESTIONS OF THE INTERVIEW WITH THE WIND FARM O&M</p><p>SPECIALIST ............................................................................................................................ 93</p><p>14</p><p>CHAPTER 1: INTRODUCTION</p><p>1.1 Background</p><p>Electricity generation is an essential topic when it comes to socioeconomic</p><p>development, being the central role in some nation’s economy (GONZÁLEZ; GONÇALVES;</p><p>VASCONCELOS, 2017; LIMA; FERREIRA; VIEIRA, 2013). Recently, this topic generates</p><p>big concerns in nations worldwide related to the global power matrix because the energy</p><p>production is the main source of fossil fuels, which are pointed by the scientific community as</p><p>the reason of climate changes.</p><p>The climate changes can be considered as one of the greatest challenges for human</p><p>kind as it has potential to transform all ecosystems. According to a report published by the</p><p>Intergovernmental Panel on Climate Change (united nations body for accessing science related</p><p>to the climate change) in 2018, maintaining the increase in temperature below 1.5°C more than</p><p>pre-industrial levels is fundamental for human kind to adapt to the climate changes and,</p><p>therefore, the reduction of emissions in the industrial processes is needed.</p><p>In this sense, according to the International Renewable Energy Agency (2018), the</p><p>renewable energy sources have been increasing in the global energy matrix and this trend will</p><p>continue while the fossil fuels will reduce its participation, as is presented in the Figure 1.1.</p><p>However, despite this ongoing shift to renewable energy sources, it still seems not enough. The</p><p>report published by the International Energy Agency in 2018 points that after three years of</p><p>stability, the carbon dioxide emissions related to the production of energy has increased 1.6%</p><p>in 2017. It pushes us further away from a consistent path towards the objectives set in the Paris</p><p>Agreement, which aims at limiting the increase in the average world temperature bellow 2°C.</p><p>Therefore, the development of renewable energy, such as solar and wind energy, is important</p><p>for the sustainable development of the planet.</p><p>15</p><p>Figure 1.1: Global Energy matrix evolution.</p><p>Source: IRENA, 2020.</p><p>When it comes to the renewable energy growth presented in the Figure 1.1, one can</p><p>highlight the development of the wind energy source that will represent the great portion of</p><p>energy production in the total world energy matrix in 2050 (IRENA, 2020). According to Vestas</p><p>(2020), the wind energy is already the most competitive in two thirds of the world, as can be</p><p>seem in the Figure 1.2. In this context, the Brazil wind energy market is the biggest and most</p><p>promising one in Latin America (GONZÁLEZ; GONÇALVES; VASCONCELOS, 2017;</p><p>SANTOS; GONZÁLEZ, 2019). The country is the 7° in the world in installed capacity with</p><p>more than 14 GW (ABEEÓLICA, 2019; GWEC, 2019). Moreover, only in 2017, the Brazilian</p><p>wind energy industry avoided 20.97 million of ton of 𝐶𝑂 emissions and attracted around R$</p><p>11 billion in investments, generating 30 thousand jobs. It is also strategic in the production of</p><p>electricity, mainly for the northeast region since it goes through long periods of draughts that</p><p>jeopardize the hydro energy production (ABEEOLICA, 2018). Hence, the development of wind</p><p>energy in Brazil can be strategic for environmental and economic reasons</p><p>.</p><p>Figure 1.2: Countries with wind power as the most competitive energy source.</p><p>Source: Vestas, 2020.</p><p>16</p><p>According to Amina et al. (2017), wind energy technology is considered the most</p><p>competitive among the renewable energy sources. The wind energy consists in converting the</p><p>kinetic energy from the wind into electric energy through the wind turbine. Therefore,</p><p>emphasizing the central role that this equipment plays in the wind energy, its operation and</p><p>maintenance (O&M) influences in its competitiveness. Corroborating with this statement,</p><p>Hameed and Wang (2013), Kusiak and Verma (2011) and Wang et al. (2014), say that the costs</p><p>associated with wind turbine operation and maintenance are high, and reductions in this cost</p><p>would lead to higher probability of the projects to be lucrative, enhancing wind power</p><p>competitiveness. In this same line, Vesely (2017) stands that the O&M costs are responsible for</p><p>around 30% of the Levelized Cost per KWh (indicator that measures the life cycle cost of the</p><p>project per its power production).</p><p>Artificial intelligence can be an important tool for O&M optimization. The</p><p>improvements in processing and data acquisition made artificial intelligence (AI) more famous</p><p>in society and, in striving to improve operation and maintenance competitiveness, a variety of</p><p>fields of the industry is looking towards AI as a way for building new tools to improve</p><p>performance, which could not be different in wind energy. Vesely (2017) states that companies</p><p>like GE, Siemens and SAP have been investing in research and development of machine</p><p>learning tools for fault prediction in wind turbines. Furthermore, Vestas announced a data</p><p>acquisition system 600 times faster than traditional SCADA (Supervisory, Control and Data</p><p>Acquisition System) in its MVOW Smart Turbine Portfolio (MHI VESTAS, 2018), which</p><p>enables much more data acquisition that can serve as fuel for building machine learning</p><p>applications.</p><p>After all, taking into account the importance of wind energy in achieving the goals</p><p>of reduction in carbon dioxide emissions set in the Paris Agreement; the strategic role it has</p><p>when it comes to energy security in Brazil as well as the enhance in investments in the sector;</p><p>the influence O&M has on wind power competitiveness; and the AI potential for building tools</p><p>for wind turbine maintenance; this thesis urge to answer the following research questions: How</p><p>to apply artificial intelligence tools in wind turbines operations and maintenance? How</p><p>would be an AI application that is relevant for wind turbine maintenance of a Brazilian</p><p>wind farm?</p><p>17</p><p>1.2 Objectives</p><p>After presenting the research questions, the main objective and the specific</p><p>objectives are identified as following.</p><p>1.2.1 Main Objective</p><p>This research has the main goal of proposing a framework for AI applications in</p><p>wind turbines.</p><p>1.2.2 Specific Objectives</p><p> Identify state-of-the-art knowledge about AI applications for wind turbine</p><p>operation and maintenance.</p><p> Compare methods, considering main goal, data needed to compose the study,</p><p>complexity and results.</p><p> Develop a conceptual framework for the application of AI in wind turbine</p><p>operations and maintenance.</p><p> Validate the framework procedure by the application of it in a Brazilian wind</p><p>farm, develop algorithms for simulation;</p><p> Modelling the final framework with considerations learned after the</p><p>simulation;</p><p>1.3 Justification of research</p><p>The justification is analysed considering the socioeconomics, environmental and</p><p>academic aspects.</p><p>The 2030 Agenda for the sustainable development is a plan of action firmed in 2015</p><p>by worldwide leaders in the United Nations that aims at eradicating poverty, protect the planet</p><p>and guarantee prosperity and peaceful life for people. It is composed by 17 objectives for</p><p>sustainable development with bold indicators to be achieved by the year of 2030, as it is</p><p>presented in the Figure 1.3 (ONU, 2020). This research aims at contributing to this Agenda,</p><p>especially in two specific objectives, the objective 7 related to affordable and clean energy and</p><p>the objective 9 related to industry, innovation and infrastructure.</p><p>18</p><p>Figure 1.3: 17 objectives of the Agenda 2030.</p><p>Source: (UN, 2020).</p><p>According to Bloomberg</p><p>NEF (2016), the consistent winds of Brazil, especially in</p><p>the northeast region, make the Brazilian wind farms present the best capacity factors in the</p><p>world, as it is shown in the Figure 1.4. Moreover, this characteristic will not be changing with</p><p>climate changes since it is expected that the climate transformations would be favourable for</p><p>the availability of wind power projects (PEREIRA et al., 2013).</p><p>Figure 1.4: Average capacity factor by country.</p><p>Source: BLOOMBERG NEF, 2016.</p><p>As a result of this favourable environmental conditions, it is expected that the wind</p><p>energy will continue to grow in Brazil. According to ABEEólica (2018), the wind energy is</p><p>expected to enhance its installed capacity to 17 GW until 2023, moreover, Bernardes et al.</p><p>19</p><p>(2018) mentions Bloomberg NEF (2016) and ANEEL (2018) when stating that the new</p><p>technologies already available for the last auctions reduced the cost of wind energy around</p><p>400%, which shows increasing competitiveness in the country.</p><p>In the academy, the relationship between wind turbines maintenance and AI is a</p><p>relatively new line of research that has been gaining relevance. The evaluation of articles used</p><p>in the systematic literature review phase of this research came up with 51 peer-reviewed articles</p><p>in English in the Periodicos Capes Platform (gathers the main bases like Web of Science,</p><p>Scopus, among others), with the first one published in 1999, it can be seen a growing trend in</p><p>the research about the topic, which show its actual relevance for the academic community.</p><p>Figure 1.5: Number of publications about artificial intelligence applications for wind turbine maintenance</p><p>according to a survey carried out in the Periodicos Capes Platform.</p><p>Source: Author, 2020.</p><p>Hence, the research performed in this thesis justifies itself environmentally and</p><p>socioeconomically because it aims at contributing to the objectives 7 and 9 of the 2030 Agenda,</p><p>creating knowledge that can be used to enhance wind energy competitiveness through</p><p>improvements in wind turbine maintenance efficiency, therefore, fomenting the development</p><p>of clean energy. It also justifies itself academically because this theme is relatively new to the</p><p>academic community, and it can contribute to generation of knowledge to the field of wind</p><p>energy, when it comes to wind turbine maintenance.</p><p>1.4 Thesis structure</p><p>This thesis is divided into 7 chapters. After this introduction, the Chapter 2 brings</p><p>the research method, classifying it according to the research genre, objective, argumentation,</p><p>approach and scientific procedure, it also will present the step-by-step procedure carried out for</p><p>building the systematic literature review and the field research. Next, the Chapter 3 presents the</p><p>traditional literature review about wind energy, artificial intelligence and wind turbine</p><p>maintenance. The fourth chapter presents state-of-the knowledge about the research theme,</p><p>11</p><p>14</p><p>7</p><p>6</p><p>1</p><p>4</p><p>2</p><p>3</p><p>2</p><p>00000000000</p><p>1</p><p>20</p><p>19</p><p>20</p><p>18</p><p>20</p><p>17</p><p>20</p><p>16</p><p>20</p><p>15</p><p>20</p><p>14</p><p>20</p><p>13</p><p>20</p><p>12</p><p>20</p><p>11</p><p>20</p><p>10</p><p>20</p><p>09</p><p>20</p><p>08</p><p>20</p><p>07</p><p>20</p><p>06</p><p>20</p><p>05</p><p>20</p><p>04</p><p>20</p><p>03</p><p>20</p><p>02</p><p>20</p><p>01</p><p>20</p><p>00</p><p>19</p><p>99</p><p>N</p><p>um</p><p>be</p><p>r o</p><p>f</p><p>pu</p><p>bl</p><p>ic</p><p>at</p><p>io</p><p>ns</p><p>20</p><p>showing the classification of articles according to region and journal that they are published, as</p><p>well as discussions about the content found in the articles related to data used, sources, wind</p><p>turbine equipment that were focused, main goal of applications, results and metrics for</p><p>evaluation. The Chapter 5 brings proposed framework, showing the steps mapped for building</p><p>AI applications for Wind Turbine maintenance. Chapter 6 presents the field research, the</p><p>process for selecting the application and the step-by-step of the simulation. Chapter 7 shows</p><p>the final considerations and future studies.</p><p>21</p><p>CHAPTER 2: RESEARCH METHOD</p><p>This chapter presents the method used in the development of this research. It</p><p>describes the research procedure, showing the steps taken in order to develop the literature</p><p>review and the field research.</p><p>According to Demo (1985), the methodology is the instrumental part of the science,</p><p>accounting for the proceedings, tools and path of science activity. There are criteria to</p><p>characterize and classify the research methods (CRESWELL, 2009; DEMO, 1985; LAKATOS;</p><p>MARCONI, 2003; YIN, 2001). The Chart 2.1 presents a structure adapted from González</p><p>(2010), where it is possible to classify the research method directly.</p><p>Chart 2.1: Research classification.</p><p>Source: Adapted from González (2010).</p><p>Therefore, this study can be classified as a practical research, with explainable</p><p>objective, inductive argumentation, quantitative approach and uses the simulation method in</p><p>order to achieve its main goal.</p><p>2.1 Research procedure</p><p>The research procedure of this thesis encompasses 6 stages as it is presented in the</p><p>Figure 2.1: 1) Exploratory research (traditional and systematic literature review); 2)</p><p>Development of conceptual framework for the application of AI in wind turbine maintenance;</p><p>Criteria Classification Choice</p><p>Theoretical</p><p>Methodological</p><p>Empirica</p><p>Practical X</p><p>Exploratory</p><p>Descriptive</p><p>Explanatory X</p><p>Inductive X</p><p>Deductive</p><p>Hypotetical-Deductive</p><p>Dialetic</p><p>Quantitative X</p><p>Qualitative</p><p>Quantitative-qualitative</p><p>Survey</p><p>Simulation X</p><p>Research-action</p><p>Case study</p><p>Study of cases</p><p>According to research genre (DEMO, 1985)</p><p>According to research objective (YIN, 2001)</p><p>According to argumentation (MARCONI; LAKATOS, 2003)</p><p>According to approach (CRESWELL, 2009)</p><p>According to research proceedure (YIN, 2001)</p><p>22</p><p>3) Identification of relevant AI application for a Brazilian Wind Farm maintenance; 4) Data</p><p>acquisition; 5) Application of the framework for the AI application proposed and; 6) Structuring</p><p>the framework with practical experience obtained in the 5 stage.</p><p>Figure 2.1: Research procedure.</p><p>Source: Author, 2020.</p><p>2.2.1 Exploratory research</p><p>This step was undertaken in building the theoretical background and the state-of-</p><p>the-art knowledge of this research, and it was carried out through 2 methods, the traditional</p><p>literature review and the systematic literature review. The traditional literature review was</p><p>performed through the reading of books, thesis, articles and websites about wind energy, AI</p><p>and wind turbine O&M, in order to present the history of wind power and artificial intelligence</p><p>and the main concepts behind this study. The systematic literature review was performed by</p><p>selecting articles about the theme, with the goal of identifying state-of-the-art knowledge.</p><p>The systematic literature review is a reliable and replicable method that describes</p><p>the means for obtaining the results, allowing structured and unbiased generation of knowledge</p><p>about the theme (PAI, et al; 2004; WEBSTER; WATSON, 2002; SAMPAIO; GONZÁLEZ,</p><p>2017). In this thesis, it was performed in 5 stages, as it is presented in the Figure 2.2.</p><p>In the first stage, after reading masters and PhD thesis about the theme, it was found</p><p>key-words associated to it and the research question was defined as: How to apply artificial</p><p>intelligence tools in wind turbines operations and maintenance? How would be an AI</p><p>application that is relevant for wind turbine maintenance of a Brazilian wind farm? After</p><p>that, the second stage was initiated, it consists in searching for papers in the Periodicos Capes</p><p>platform. The Periodicos Capes Platform contains 130 scientific bases, including Scopus, Web</p><p>of Science and Science Direct, and other content dedicated to the study of patents, books,</p><p>encyclopedias and norms. The search for papers was performed with a combination of key-</p><p>23</p><p>words related to AI and wind energy, as it is presented in the Chart 2.2. The combinations were</p><p>set to be found only on the article’s titles. The search for articles was filtered in order to find</p><p>only peer-reviewed articles written in English.</p><p>Figure 2.2: Systematic</p><p>Literature Review procedure.</p><p>Source: Author, 2020.</p><p>Chart 2.2: Key-words combination for searching for papers in the Periodicos Capes platform.</p><p>Source: Author, 2020.</p><p>In Article selection phase, the abstracts of the articles found in the search were read,</p><p>discarding the ones that did not have relationship with the theme and the repeated ones, leading</p><p>to 51 articles in total. The selected articles were then fully read, having their relevant</p><p>information collected in Excel Spreadsheets in order to facilitate classification and analysis.</p><p>2.2.2 Development of framework for the application of AI in wind turbine operations</p><p>and maintenance</p><p>After the systematic literature review stage, a framework for applying AI in wind</p><p>turbine maintenance was developed. In order to do so, the main steps taken in the articles were</p><p>mapped, showing an overview of the path for building an artificial intelligence application for</p><p>wind turbine maintenance which can serve as a guide for researchers and developers.</p><p>2.2.3 Identification of relevant AI application for a Brazilian wind farm maintenance</p><p>In this step, an interview with a wind farm O&M specialist was undertaken in order</p><p>to define applications in the systematic literature review that would be relevant for the</p><p>AI WIND ENERGY</p><p>Artificial Intelligence Wind Energy</p><p>Neural Networks Wind Power</p><p>Deep Learning Wind Turbines</p><p>Machine Learning Wind Farms</p><p>Data Mining Wind Park</p><p>IOT</p><p>AND</p><p>Key Words</p><p>24</p><p>maintenance routine of the wind farm. The main applications found in the SLR were presented,</p><p>showing their functionalities, data used to build them and the main results.</p><p>Furthermore, the interview also included topics related to the tools used for wind</p><p>turbine operation and maintenance used in wind farms, data availability, the most critical</p><p>equipment and bottlenecks of operation. The questions asked in the interview are presented in</p><p>the Appendix 1.</p><p>After this discussion, the application for temperature condition monitoring of wind</p><p>turbine bearings was selected due to its relevance for the Brazilian wind farm and viability to</p><p>be simulated.</p><p>2.2.4 Data acquisition</p><p>In the same interview, after defining the applications in the SLR that would be</p><p>relevant for the wind farm, the availability of data was also discussed in order to verify which</p><p>wind turbine data would be available for simulating the application. Issues related to data</p><p>acquisition and data pre-processing were also taken into account. It was concluded that SCADA</p><p>data would be available for the wind farm under study since it does not bring difficulties related</p><p>to data gathering.</p><p>2.2.5 Implementation of the framework for the application suggested by the O&M</p><p>specialist</p><p>Once the availability of the data is checked, the framework proposed in 2.2.2 was</p><p>performed. The step-by-step procedure was followed accordingly, using the methods mapped</p><p>in the systematic literature review. The simulations were performed using Python 3.7 in Jupyter</p><p>Notebook environment. After simulations, the analysis of the application of the framework is</p><p>performed addressing the results and the viability of implementation.</p><p>2.2.6 Framework restructuring after practical experience</p><p>In this phase, learning lessons from the simulation of the conceptual framework are</p><p>discussed and the steps considered in it were rearranged in order to better attend the</p><p>observations and come up with a more concise step-by-step procedure, proposing a final</p><p>framework.</p><p>25</p><p>CHAPTER 3: LITERATURE REVIEW</p><p>This chapter comes with the main content related to the research theme, presenting</p><p>definitions about wind energy, history, and the main wind turbine components; it will also</p><p>present knowledge about Artificial intelligence and the main methods found in the SLR.</p><p>3.1 Wind energy</p><p>The wind energy can be understood as the conversion of the wind kinetic energy</p><p>into useful energy. It is a process that has being used for centuries since it is estimated that 2000</p><p>years ago the Chinese used windmills for pumping water up and corn processing. Following</p><p>this timeline, in the XIV century the Dutch adopted windmills for draining soil, and in the</p><p>century XIX, millions of windmills were installed in the US in order to pump water (LETCHER,</p><p>2017).</p><p>It was also in the XIX century that it was noticed the first wind machines in the US,</p><p>they were used in the operation of equipment, however, the electric generation to the grid started</p><p>only in the 70’s, launched by the necessity of replacement of fossil fuels for renewable energy</p><p>sources (LETCHER, 2017). The interest for this “new” type of energy generation boosted its</p><p>development.</p><p>According to ESBJERG (2020), the wind turbines have been increasing both in</p><p>power and height since the 90’s, as can be seen in the Figure 3.1. It shows the evolution of</p><p>offshore wind turbines that, since the 90s, the wind turbines increased in 7 times their height</p><p>and in 33 times their power output.</p><p>According to Hossain, Abu-Siada and Muyeen (2018), modern wind turbines are</p><p>complex electromechanics systems composed by some components and subcomponents. The</p><p>Figure 3.2 shows an illustration of a typical wind turbine, its equipment and systems.</p><p>26</p><p>Figure 3.1: Evolution of offshore wind turbines since 1991.</p><p>Source: Esbjerg, 2020.</p><p>Figure 3.2: Main components of a typical large-scale wind turbine.</p><p>Source: Hossain, Abu-Siada and Muyeen (2018)</p><p>The wind turbines are usually located in remote areas on and off-shore, which turns</p><p>in severe operation conditions due to the harsh environment that they are exposed to, which</p><p>lead to failure propensity. The wind turbine main components and the failures that they are</p><p>exposed to are discussed as follows (HOSSAIN; ABU-SIADA; MUYEEN, 2018):</p><p> Rotor: it is composed by blades and hub and is propense to failures such as</p><p>asymmetry, fatigue, cracks, superficial roughness increases, stiffness</p><p>reduction and blade deformation.</p><p> Gearbox: it is responsible for the transmission of the velocity of the rotor to</p><p>the generator. It is one of the most critical components of the wind turbines,</p><p>27</p><p>being responsible for a lot of wind turbine downtime. The most usual</p><p>failures in the gearbox include broken gear teeth and bearing failure. These</p><p>failures lead to increase in the bearing temperature and lubricant oil</p><p>temperature, which shows the importance of these parameters to the gearbox</p><p>condition monitoring.</p><p> Main shaft: it connects the drivetrain with the gearbox, having failures</p><p>caused by corrosion, misalignment, cracks and coupling problems, which</p><p>affect the natural rotation of the shaft and leads to a ripple effect in other</p><p>subcomponents.</p><p> Hydraulic system: it is used for mechanical break control, pitch and yaw</p><p>system adjustments, even though there are also some cases of electrical</p><p>hydraulic systems. It is subject of failures such as oil leak and valve</p><p>blocking.</p><p> Mechanical break: it is normally coupled with the main shaft and is used in</p><p>cases of extreme rotation velocity of the rotor or in cases of other component</p><p>failure. The main issue about the mechanical break is the failures in the disk</p><p>break due to stress or overheating, which can be diagnosed by temperature</p><p>and vibration monitoring.</p><p> Tower: the tower that maintains the nacelle in the proper height for energy</p><p>production, it is subject of failures related to structure damages such as</p><p>cracks and corrosion.</p><p> Electric Machine (generator): is the component responsible for the</p><p>conversion of mechanic energy into electricity, therefore, the failures in this</p><p>component can be mechanic or electric related. The electric failures are</p><p>related to open circuits, isolation damage and electric imbalance, while the</p><p>main mechanical failures are shaft damage and imbalance and bearing</p><p>damages.</p><p> Electronic Power Converter: the reliability</p><p>of this component gets more</p><p>complex with the increase in the power of the wind turbines. They are</p><p>subject of failures in the capacitors, circuit board and transistors.</p><p> Sensors: the wind turbine have lots of sensors for continuous parameter</p><p>monitoring such as temperature, voltage and current. Failures related to</p><p>28</p><p>sensors unbalance, physical failures and processing failures can harm the</p><p>performance of the wind turbine.</p><p> Control system: it regulates the wind turbine operation and has failures</p><p>associated with hardware and software. Hardware failures include failures</p><p>in sensors, actuators, controlling boards and communication links, they can</p><p>be detected through model-based methods. The software failures include</p><p>buffer overflow, lack of memory, resource lost and can be detected through</p><p>software diagnostics.</p><p>3.2 Artificial intelligence</p><p>The artificial intelligence is very broad in the field of computer science which</p><p>makes it tricky to be defined since it is composed by different lines of thought. In order to define</p><p>Artificial Intelligence broadly, Russel and Norving (2013) present its definition in 4</p><p>dimensions, as it is presented in the Chart 3.1. The superior part of the chart is dedicated to</p><p>thinking, the inferior part is dedicated to action, the left side is dedicated to mimic the human</p><p>behaviour and the right side is related to rationality or do something right with the available</p><p>information. The approaches centred in the human kind empirical sciences that involve</p><p>hypotheses and experimental confirmation while the rationalist approaches are related to</p><p>combinations between math and engineering.</p><p>Chart 3.1: Artificial intelligence definitions separated by categories.</p><p>Source: Russel and Norving (2013).</p><p>29</p><p>Therefore, the definition of artificial intelligence goes through: (i) Thinking</p><p>humanly: the ability of a machine of thinking as humans and execute activities such as decision</p><p>making, problem resolution and learning. (ii) Acting humanly: the ability of a machine of acting</p><p>like human in a way that humans cannot differ between the machine and a person (Turing test).</p><p>(iii) Thinking rationally: machines with the ability to solve complex problems through logical</p><p>reasoning. (iv) Acting rationally: intelligent action of agents with the capacity of operating</p><p>autonomously, perceive the environment, adapt, create and follow objectives (RUSSEL;</p><p>NORVING, 2013).</p><p>According to Russel and Norving (2013), the first paper acknowledged as being</p><p>part of the artificial intelligence field is the one published by McCulloch and Pittts (1943) that,</p><p>based on the knowledge of the brain neuron behaviour, the Turing computing theory and</p><p>positional logics, presented a model of artificial neurons that, if combined in proper networks,</p><p>could be able to learn.</p><p>The paper of Turing (1950) is considered as the most influent in the field of artificial</p><p>intelligence. It proposes the Turing Test, that evaluate the ability of a machine in expressing</p><p>indistinguishable human behaviour. It is still relevant in artificial intelligence research in the</p><p>XXI century. Moreover, this paper has presented concepts of machine learning, genetic</p><p>algorithms, reinforcement learning and child programming.</p><p>Artificial intelligence was defined as a separate field from other sciences (such as</p><p>mathematics, control, operational research and decision theory) in 1956, at Dartmounth College</p><p>in the US. On that occasion, John McCarthy, Nathaniel Rochester, Claude Shannon and Marvin</p><p>Minsky organized a two-month Artificial Intelligence study seminar called the “Artificial</p><p>Intelligence Summer Research Project”. They gathered a total of 10 scholars on the topic in this</p><p>seminar who came up with the term artificial intelligence and introduced important characters</p><p>from the field (NILSSON, 2009; RUSSEL; NORVING, 2013).</p><p>From then on, there was a period of great enthusiasm in artificial intelligence research</p><p>(1952-1969). McCulloch and Pitts (1943) work has continued in other studies related to neural</p><p>networks, such as the research of Rosenblatt (1962), who created artificial neural networks</p><p>using perceptron. The evolution of general-purpose methods of logical reasoning enabled the</p><p>development of applications such as the Shakey robotics project, in 1966, developed at the</p><p>Stanford Reserch Institute (SRI), being the first project to integrate logical reasoning and</p><p>movement (RUSSEL; NORVING, 2013). In the field of natural language processing, the mini-</p><p>block world created through the SHRDLU program was able to identify commands by phrases</p><p>and execute them by manipulating blocks (RUSSEL; NORVING, 2013; WINOGRAD, 1972).</p><p>30</p><p>After this initial enthusiasm and encouraging prognosis with simple problem-solving</p><p>systems, artificial intelligence research ran into difficulties when they were applied to more</p><p>robust problems, leading governments to reduce incentives for research in the area in the United</p><p>States and the United Kingdom. (RUSSEL; NORVING, 2013).</p><p>This reality changed with the expert systems, consisting of programming aimed at</p><p>solving problems combined with several rules with a specific purpose. As an example, the</p><p>system known as MYCIN that used more than 400 rules in identifying infections. The</p><p>popularization and commercialization of these systems in large corporations made investments</p><p>in expert systems expand in the 1980s, consolidating Artificial Intelligence as an industry.</p><p>Shortly after that, the artificial intelligence market slowed down, in a period known as the AI</p><p>Winter (NILSSON, 2009; RUSSEL; NORVING, 2013).</p><p>In the 80s the neural networks have returned to the academy, as well as the adoption</p><p>of scientific method in studies related to artificial intelligence. There was a separation between</p><p>AI and cognitive science, leading the modern research in neural networks to be separated into</p><p>two fields, one aimed at creating effective network models and understanding their</p><p>mathematical properties, and the other concerned with modeling the properties of real neurons.</p><p>The AI emerged as a science when researchers started to adopt the scientific method in their</p><p>work, studying hypotheses through empirical experiments and analyzing the results</p><p>statistically, allowing replicable experiments. It has also become more common to adopt</p><p>existing theories in different applications of relevance instead of proposing new theories. In</p><p>addition, this period was also marked by the creation of Bayesian networks and the emergence</p><p>of the data-mining industry (RUSSEL; NORVING, 2013).</p><p>In the 1990s, the effort was directed towards the development of intelligent agents,</p><p>they became popular mainly on the internet with the “bots”, search engines, recommendation</p><p>systems and websites. Improvements in sensors, control and planning allowed agents to have</p><p>contact with different fields, such as autonomous cars (RUSSEL; NORVING, 2013)</p><p>In the 21st century, the AI-related studies are more concerned with the abundance</p><p>and quality of data than with the algorithms themselves. This is due to the growth in data</p><p>acquisition and availability, and the discovery that poor algorithms that are executed by a large</p><p>amount of data perform better than robust algorithms with little data (RUSSEL; NORVING,</p><p>2013).</p><p>The growth and popularization of terms such as artificial intelligence, machine</p><p>learning and deep learning makes it difficult for differentiate its concepts. In the Figure 3.3,</p><p>31</p><p>Jeffcock (2018) states that, as discussed earlier, artificial intelligence started around the 50’s,</p><p>and it is defined as any technique that allows computers to mimic human behavior, the figure</p><p>also shows machine learning as a subpart if artificial intelligence and deep learning as a subpart</p><p>of machine learning.</p><p>Figure 3.3: Relationship between AI, ML and deep learning.</p><p>Source: Jeffcock (2018).</p><p>Other</p><p>authors have also tried to map the areas of AI, as in Rosa (2011) and Rowe</p><p>(1988). For them, artificial intelligence includes some areas that fall into:</p><p>• Natural language processing: machines that communicate in human languages,</p><p>understanding what is communicated and generating responses;</p><p>• Artificial vision: machines capable of looking at the environment through</p><p>cameras and recognizing what it is that appears in the image;</p><p>• Logical inference: machines that recognize complicated interrelated facts and</p><p>obtain logical conclusions from them;</p><p>• Planning: machines that plan sequences of actions aiming to achieve goals;</p><p>• Expert systems: machines that, based on complicated rules for various</p><p>situations, are able to help human beings with different tasks.</p><p>• Robotics: machines that move and handle real-world objects.</p><p>According to Kalogirou (2002), generally the term artificial intelligence can be</p><p>defined as the ability of a machine or artifact to perform functions that characterize human</p><p>thought, therefore, in computing, this term has been applied to programs and systems that can</p><p>perform more complex tasks than direct programming, even if it is far from human thinking.</p><p>32</p><p>He further divides AI into two branches with regard to the applications of artificial intelligence</p><p>in renewable energies: neural networks and expert systems.</p><p>Artificial intelligence, as an area of knowledge, seeks to understand human thinking</p><p>in order to develop intelligent and efficient systems for solving complicated problems, reducing</p><p>issues with manual computation. However, while imitating the human brain is a complicated</p><p>task, some areas of AI perform better than that due to the computer's ability to perform</p><p>mathematical calculations millions of times faster than our capacity. Since the beginning of the</p><p>21st century, AI has been applied in various fields such as product design, financial accounting,</p><p>economics, medicine, and is based on several theories of machine learning such as statistical</p><p>learning, neural learning and evolutionary learning, being neural learning the most used in</p><p>several applications (JHA et al., 2017).</p><p>Previously aimed at academic studies, machine learning applications have been</p><p>growing in the industry with the use of artificial intelligence combined with the internet of</p><p>things (IoT), which are driving significant advances in the sector (VESELY, 2017). Thus, when</p><p>it comes to wind energy, the principles of artificial intelligence have been used in several</p><p>applications. Jha et al. (2017), consider that the research of AI applications in wind energy uses</p><p>three main approaches: (i) neural networks, (ii) statistical analysis, (iii) evolutionary learning</p><p>and their combinations as hybrid applications of AI. They also point out that most research</p><p>relating to AI and wind energy focuses on forecasting wind speed and power generation,</p><p>however, areas such as power generation system design, risk optimization, and failure diagnosis</p><p>are also part of the research.</p><p>3.3.1 Artificial intelligence methods and concepts</p><p>In the scope of data science, there is a diversity of methods that are used according</p><p>to the final objective of the study. This section presents the main families of AI methods found</p><p>in the articles selected in the systematic literature review.</p><p>3.3.1 Artificial Neural Networks (ANN)</p><p>In general, Artificial Neural Networks are machines designed for modeling</p><p>particular tasks performed by the human brain. More specifically, they are processors made up</p><p>of simple processing units with a propensity of learning after going through a training process,</p><p>where the connections between neurons, known as synaptic weights, store the knowledge</p><p>acquired from the environment (HAYKIN, 2008).</p><p>33</p><p>These models mimic the structure of neurons for modelling real problems having,</p><p>as similarity with the brain, the ability to acquire knowledge through the learning process</p><p>(BANGALORE, 2014). According to Cabral (2017), artificial neural networks are artificial</p><p>intelligence systems robust to failure, which can learn and present results in real time, and</p><p>therefore have applicability in several branches of research that require data analysis,</p><p>combining statistics with computation. in the optimization of dynamic processes with high</p><p>reliability.</p><p>The main component of artificial neural networks (ANN) is the neuron, which has</p><p>the function of generating an output based on a series of inputs. These inputs are associated</p><p>with weights and the scalar product of this weights and inputs plus the bias is passed through</p><p>an activation function, which determines the output of the neuron, as shown in Figure 3.4</p><p>(HAYKIN, 2008).</p><p>Figure 3.4: Typical neuron model.</p><p>Source: Adapted from Haykin (2008).</p><p>Normally, neural networks are composed of input layer, hidden layer and output</p><p>layer, and the gradient descent method is used to adjust the weights according to training to</p><p>simulate a relationship between inputs and output (WU; SONG, 2016). After that, the neuron</p><p>output is compared with the real truth values in order to calculate the error between them, the</p><p>process can be repeated several times in order to minimize it as much as possible. These</p><p>iterations are called epochs (STRDCZKIEWICZ; BARSZCZ, 2016). There are some types of</p><p>neural networks that vary their input/output ratio according to the network configuration, which</p><p>consists of the number of neurons in their layers and their connections (BANGALORE, 2014).</p><p>When it comes to the process of learning neural network methods, the relationship</p><p>between inputs and outputs is stored in the weights (synapitc weights) and bias (bias), which</p><p>determine the output of each neuron. These weights are determined after the algorithm is</p><p>34</p><p>submitted to a training and test datasets, characterizing the learning process, which can be</p><p>supervised or unsupervised. The supervised learning process requires that the training dataset</p><p>has ground truth data, having an output associated with the inputs, thus establishing a</p><p>relationship between input and output at the end of the learning process. Therefore, supervised</p><p>learning is a process applied for modeling input/output relationship. On the other hand, the</p><p>unsupervised learning process can be carried out without labelled data, being frequently applied</p><p>in data clustering (BANGALORE, 2014).</p><p>According to Haykin (2008), the properties of neural networks have some</p><p>characteristics and advantages:</p><p>• Non-linearity: Neural networks can be linear and non-linear.</p><p>• Input-output mapping: Neural networks are able to map classified datasets</p><p>through supervised learning.</p><p>• Adaptability: Neural networks have the ability to adapt their synaptic weights</p><p>to changes in the environment in which they are exposed to, characterizing</p><p>learning.</p><p>• Evident answer: In the pattern classification, a neural network can be built to</p><p>select between patterns with reliability;</p><p>• Contextual information: knowledge is represented by the entire structure of the</p><p>neural network, with the participation of all neurons globally, offering</p><p>information in the context of the environment that is inserted.</p><p>• Fault tolerance: Neural networks present a moderate degradation in</p><p>performance instead of catastrophic failures;</p><p>• Uniform analysis and design: The same notation is used in all applications</p><p>involving neural networks because neurons are the common component in all</p><p>neural networks. This allows the sharing of theories and learning algorithms</p><p>about neural networks of different configurations. In addition, modular neural</p><p>networks can be built.</p><p>• Analogy with neurobiology: The design of a neural network is inspired by the</p><p>brain behavior.</p><p>Regarding the maintenance of wind turbines, an advantage of applying neural</p><p>networks is that it is required less specific technical information about them. In other</p><p>words,</p><p>neural networks-based methods do not require deep technical knowledge about the component</p><p>to be monitored (BANGALORE et al., 2017; ELHOR et al., 1999). In addition, the methods</p><p>35</p><p>can be applied to a variety of equipment, since there is a large amount of data available, and</p><p>can be applied to several sets of wind turbines (BANGALORE, et al., 2017).</p><p>In this research, neural networks appear in lots of different types such as auto-</p><p>associative neural networks (KREMER, 1992), self-organizing maps neural networks</p><p>(KOHONEN; SOMERVUO, 2002), ANFIS neural networks (CHEN; MATHEWS; TAVNER,</p><p>2013), Recurrent neural networks (SUN; SUN, 2018), Hopfield neural networks (REN; QU,</p><p>2014), Backpropagation neural networks (HAYKIN, 1999), Extreme learning machine neural</p><p>networks (HUANG; DING; ZHOU, 2010), convolutional neural networks (GUO; FU, YANG,</p><p>2018), Auto-Encoder Neural Networks (ZHAO et al., 2018).</p><p>3.3.2 Support Vector Machines (SVM)</p><p>According to Vapnik (1999), Support vector machines are methods that map</p><p>databases in a space of various dimensions and then separate them through optimized plans.</p><p>Thus, they allow an optimal classification of the data, as shown in Figure 3.5. In other words,</p><p>they are machine learning algorithms that use supervised learning to separate data into classes.</p><p>These algorithms are popular in data classification because they are multi-variables robust and</p><p>work well with small datasets, which is a benefit for small amounts of supervised (labelled)</p><p>data. As the complexity of the model can be controlled, these methods generalize well to simple</p><p>and complex problems (REGAN; BEALE; INALPOLAT, 2017).</p><p>Support vector machines work by considering input data as points in a space. After</p><p>that, an optimal plan is defined for separating the data into classes, and a function known</p><p>through the Kernel method is used to transform the input data into a space of many dimensions</p><p>where a decision boundary can be stipulated. Thus, the training process is performed in order</p><p>to find the weights that maximize the margin between the classes and the boundary (JIMÉNEZ</p><p>et al., 2019). Therefore, the objective of SVM is to build a plan or series of plans in a space of</p><p>various dimensions that can classify a dataset, separating it into labels, obtaining better results</p><p>as the distance of the plane to the nearest point the dataset increases (KUSIAK; VERMA,</p><p>2012a).</p><p>36</p><p>Figure 3.5: A plane separating data using SVM</p><p>. Source: (SONG, [S.d.]).</p><p>3.3.3 Decision Trees</p><p>In the scope of Artificial Intelligence, decision trees are supervised learning</p><p>techniques applied in predicting an output based on a series of inputs. In this method, the</p><p>decision tree is built by creating decisions divided into nodes in order to minimize the forecast</p><p>error as much as possible. Thus, at each node, the input variable that most minimizes the error</p><p>is the correct candidate to pass to the subsequent node. Therefore, the forecast happens when</p><p>walking from the root node to the leaf node that delivers the forecast value (MAZIDI;</p><p>TJERNBERG; BOBI, 2017).</p><p>According to Loh and Shih (1997), decision trees are used in classification because</p><p>they facilitate complex decision processes by segregating them into simple decision processes.</p><p>In this method, the classification tree is represented by nodes and branches, where each node</p><p>represents a decision between characteristics, leading to two or more branches. The model is</p><p>trained through input functions and each branch corresponds to a result of the process, forming</p><p>a path from the roots to the leaves of the decision tree, and thus performing the classification</p><p>(JIMÉNEZ et al., 2019).</p><p>An advantage of decision tree methods is the possibility of applying it in non-linear</p><p>relationships between characteristics and classes, thus being a very flexible method (JIMÉNEZ</p><p>et al., 2019; PAL; MATHER, 2003). In the articles analyzed in this work, some authors used</p><p>variations of these algorithms, mainly for modeling and parameter selection.</p><p>37</p><p>3.3.4 Ensemble Learning</p><p>Ensemble learning methods consist in using a series of algorithms together in order</p><p>to build one good single classifier (OPITZ; MACLIN, 1999). It works by training the next</p><p>classifier of the sequence with a resampled training dataset, containing more prediction errors</p><p>made by the previous classifier using bagging or boosting strategies. Some ensemble learning</p><p>algorithms appear in this research such as the Random Forrest Algorithm (KUSIAK, VERMA,</p><p>2011) and Boosting Algorithms (VERMA, KUSIAK, 2012).</p><p>3.3.5 K-nearest neighbours</p><p>The Nearest Neighbour Classifier is one of the simplest classifying techniques in</p><p>the context of machine learning, being a supervised learning approach. It performs the</p><p>classification task by identifying the nearest neighbours to the point example in the space, using</p><p>those neighbours to determine the class of the point. One needs to set the number of neighbours</p><p>that are going to be used for classification. For example, if the number of neighbours is set to</p><p>7, the 7th nearest neighbours are used for classifying the point, which is going to be classified</p><p>as the majority of points with them 7 nearest neighbours Cunningham and Delany (2007).</p><p>3.3.5 Fuzzy Logic</p><p>In artificial intelligence, fuzzy logic tries to imitate human thinking by exploring</p><p>approximate rather than exact models of reasoning, working with variables ranging from 0</p><p>(totally false) to 1 (totally true) instead of binary variables (AZADEGAN et al., 2011). This</p><p>logic is often applied in building Expert Systems. As these types of systems are based on rules</p><p>of human experts, it can contain vague expressions that are often modeled better by fuzzy logic</p><p>than by Boolean logic (ZADEH, 1983).</p><p>3.4 AI Applications for wind turbine maintenance</p><p>According to Hossain, Abu-Siada and Muyeen (2018), regarding the operation and</p><p>maintenance of wind turbines, artificial intelligence can be used to monitor the condition of</p><p>equipment, including methods such as Neural Networks (ANN), expert systems, space vector</p><p>modulations and fuzzy logic systems. They also say that ANN are used to diagnose failure in</p><p>different components of the turbine such as generator, gearbox, bearings and electronics,</p><p>however, they are time-consuming methods and require a large database to cover all possible</p><p>conditions, making them impractical.</p><p>38</p><p>However, the studies found in the SLR of the present work have different</p><p>conclusions. Four literature review articles were selected within the studies and were analysed</p><p>taking the into consideration the insights about the research theme. Marugán et al. (2018)</p><p>presented a literature review about the use of Artificial Neural Networks in the wind energy, in</p><p>order to demonstrate that the method can be a useful alternative for a variety of cases. They</p><p>analysed 190 articles published over a 5-year period and separate them into four main</p><p>categories: Forecasting and prediction, design optimization, fault detection and diagnosis and</p><p>optimal control. Helbin and Ritter (2018) focused on the applications of deep learning for fault</p><p>detection in wind turbines. They separate articles between supervised and unsupervised learning</p><p>approaches, concluding that normal behaviour models using SCADA data can detect faults in</p><p>initial stages in a variety of components. They also infer that deep learning with SCADA data</p><p>has been applied successfully in tests, however, it is still not much used. Bermejo et al. (2019)</p><p>highlighted articles that used Artificial Neural Networks for energy and reliability prediction in</p><p>solar, hydraulic and wind energy. They state that recent research about wind turbine condition</p><p>monitoring aim more at individual components than the whole wind turbine, they also explain</p><p>the urge to determine the criticality of components</p>
Aplicationartificialintelligence_Souz12233a_2020

UNISUL

Ferramentas de estudo

Conteúdos escolhidos para você

Monitoramento de Rolamento de Caixa de Engrenagens de Turbina Eólica

Estratégias de Controle de Sistemas de Armazenamento de Bateria em Micro-redes

Datadriven performance analyses of wastewater treatment plants

Avanços em Soft Computing

IET Renewable Power Gen - 2019 - Koukoura - Comparison of wind turbine gearbox vibration analysis algorithms based on

Perguntas dessa disciplina

3. Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (IoT) e do B...

4 Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (loT) e do Bi...

Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (IoT) e do Big ...

Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (IoT) e do Big ...

“O lançamento do ChatGPT no final de 2022, como a primeira ferramenta de inteligência artificial generativa (lAGen) de fácil utilização disponibili...

Crie sua conta grátis para liberar esse material. 🤩

Crie sua conta grátis para liberar esse material. 🤩

Crie sua conta grátis para liberar esse material. 🤩

Crie sua conta grátis para liberar esse material. 🤩

Crie sua conta grátis para liberar esse material. 🤩

Crie sua conta grátis para liberar esse material. 🤩

Crie sua conta grátis para liberar esse material. 🤩

Crie sua conta grátis para liberar esse material. 🤩

Crie sua conta grátis para liberar esse material. 🤩

Crie sua conta grátis para liberar esse material. 🤩

Conteúdos escolhidos para você

Monitoramento de Rolamento de Caixa de Engrenagens de Turbina Eólica

Estratégias de Controle de Sistemas de Armazenamento de Bateria em Micro-redes

Datadriven performance analyses of wastewater treatment plants

Avanços em Soft Computing

IET Renewable Power Gen - 2019 - Koukoura - Comparison of wind turbine gearbox vibration analysis algorithms based on

Perguntas dessa disciplina

3. Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (IoT) e do B...

4 Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (loT) e do Bi...

Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (IoT) e do Big ...

Em uma universidade de ponta no Brasil, um grupo de pesquisadores desenvolve um projeto voltado à utilização da Internet das Coisas (IoT) e do Big ...

“O lançamento do ChatGPT no final de 2022, como a primeira ferramenta de inteligência artificial generativa (lAGen) de fácil utilização disponibili...

Mais conteúdos dessa disciplina