Accepted manuscript for Mechanical Systems and Signal Processing (https://www.journals.elsevier.com/ mechanical-systems-and-signal-processing/)
c2019. Licensed under the Creative Commons CC-BY-NC-ND (https://creativecommons.org/licenses/ by-nc-nd/4.0/)
Highlights
2.1. Context
The performance and reliability of machines are important for several industries, as in certain circumstances failures or malfunctions may lead to high production losses, severe injuries or even loss of lives. Condition monitoring allows for the early detection of machine failures, which helps to avoid further damage and/or to reduce downtime. Many condition monitoring approaches are limited to analyzing a single machine [1]. However, some real-world industrial applications could benefit from monitoring multiple similar machines or entities (i.e., a fleet) simultaneously [2, 3]. This need arises in wind turbines [4], production lines, and aerospace engines (which often operate in pairs) [5]. Moreover, in fleet settings like monitoring the components of a train or a conveyor belt (e.g. the traction motors, bearings and bogies [6, 7]), the machines in the fleet are operating under similar conditions. A fleet-based approach leverages the fact that multiple entities are analyzed. In each of the previous examples, the different entities have comparable signature characteristics which deviate when a fault appears. A fault indicator can use this deviation.
2.2. Need for AI-based fleet monitoring approaches
The traditional condition monitoring approach involves handcrafting an indicator of the phenomena of interest, which requires significant expertise in machine signature analysis and machine usage. In deployment, the current value of the indicator is compared to a pre-selected threshold to decide on the health condition of a machine. In practice, this approach has several drawbacks. First, operational conditions can influence machine behavior, which requires either setting different thresholds [4, 8], identifying dedicated operational conditions for fault detection (e.g. ISO 10816-3:2009 [9]) or normalizing the fault indicator [10]. This approach is not scalable to a large number of operational conditions. Second, a machine can be sensitive to multiple fault types. Thus, a robust condition monitoring approach requires a manual tuning procedure for each expected machine fault and their combinations. For complex systems, it is often not possible to know all these upfront. For this reason, an alternative approach compares machine data with a simulation model of healthy behavior [11]. However, this assumes that every considered input parameter and the machine behavior does not vary over time.
Recent condition monitoring studies often suggest employing techniques from artificial intelligence as an alternative to these handcrafted solutions. In particular, the two most commonly mentioned subfields of AI are machine learning and data mining [4, 12, 13, 14]. Typically, researchers consider supervised, semi-supervised and unsupervised techniques from these two subfields.
Supervised approaches to fault detection such as Artificial Neural Networks (ANN) [15, 16, 17], Deep Learning (DL) [14, 18, 19, 20] and Support Vector Machine (SVM) [21, 22, 23] require large historical labeled data sets. That is, the data must be annotated with the true machine health condition. Moreover, they often assume the data contains examples of every possible fault type [24]. However, such data sets are not always available in industrial environments [25, 26]. In those applications, even a limited historical data set such as the [27] proposes can be challenging to obtain. As an alternative, a physical model can simulate faults [23, 28], but this is often very challenging in practice as not every possible fault can be accurately modeled.
To alleviate the need to collect data about faults, another approach is to consider semi-supervised techniques, like novelty detection [29, 30]. These techniques simply use a historical data set containing only healthy machines build a model of healthy behavior. When deployed, they detect behaviors that deviate compared to the model of normal behavior [31, 32, 33]. Consequently, it is very important that data from a faulty machine is not present in the training data. Unfortunately, this is hard to guarantee in reality as machines degrade over time. Moreover, this assumes that normal behavior does not change over time, which may not be the case. An alternative approach is to cluster historical data and assign a label to each cluster. This allows classifying new machine measurements by assigning it the label of its nearest cluster. However, this approach requires expert knowledge to provide the labels [34].
Finally, unsupervised anomaly detection can address the drawbacks of the supervised and semi-supervised approaches [35, 36, 37, 38]. These techniques require neither labeled data nor solely healthy behavior. Instead, unsupervised techniques assume that the majority of the data set belongs to the normal (healthy) class and identifies rare or infrequent behaviors as abnormal (faulty). Still, using these techniques poses two challenges in practice. First, they require selecting a threshold to determine the boundary between normal and anomalous behavior, which is challenging as these behaviors may overlap due to noise and uncertainties. Setting this parameter requires reasoning about the cost of misclassifying a faulty machine as healthy (and vice versa) and should thus be done by a domain expert. Second, in a condition monitoring context, it is often necessary to collect an anomaly detection data set of machines in similar operational conditions, especially when a historical data set is not available.
A drawback shared by many of the existing approaches is that they employ black-box models, and hence it is difficult to gain insight into why a model made a certain prediction. Consequently, domain experts may not trust the predictions that the model is making. Moreover, it is non-trivial to incorporate expert knowledge into models such as ANNs.
2.3. Our Contribution
This paper’s contribution is an artificial intelligence based general condition monitoring approach. The framework simultaneously analyzes a fleet of machines under the assumption that each machine is in a comparable operational condition. To perform the analysis, it uses three inter-related generic blocks: comparing pairs of machines, clustering the entire fleet of machines, and identifying anomalous machines. Each block should have a meaningful outcome that can be visualized to give a domain expert an overview related to the observed machine behavior. First, we discuss several different ways to compare pairs of machines. Most interestingly, we consider a distance measure based on the amount of warping that occurs in a dynamic time warping. While measures based on Dynamic Time Warping are common in the AI literature, these receive less attention in the context of mechanics and condition monitoring. Second, we employ clustering which is a core artificial intelligence technique [39]. While the clustering can be done with a variety of algorithms, we recommend using a hierarchical method because of its simplicity and interpretability. We then propose an approach to partition a hierarchical clustering into a flat one that is novel within the context of machinery diagnostics. Finally, to identify potentially anomalous machines, we exploit the reasonable assumption that the majority of the machines in the fleet are healthy. Consequently, a good indication of a fault is when a machine’s current behavior deviates from the behavior currently exhibited by the majority of the machines. Like all condition monitoring and anomaly detection applications, this assumes that a fault alters the data generated by a machine [40]. Empirically, we consider machinery diagnostics tasks for a fleet of electrical drive trains. We examine two use-cases (electrical and vibration signatures), each with several variants that use differing amounts of domain knowledge. Applying fleet-based condition monitoring to electrical signature analysis is novel, particularly in terms of the fact that we analyze the raw time-series data. We find that our AI-based framework outperforms the more tradition condition monitoring approach the vast majority of the time.
The proposed methodology offers four advantages over existing approaches. First, unlike other approaches, the framework does not require any historical data. Instead, it makes use of online comparisons among machines. However, if they are available, recent machine measurements can improve performance by extending these comparisons. Second, the framework enables interpretable visualizations of the inter-related generic blocks. These allow a domain expert to gain insight into the framework’s predictions. Hence, this work considers relatively simple but intuitive techniques. Third, exploiting relevant domain knowledge allows tailoring the framework towards the specific use-case. For example, a measure to compare a machine pair’s behavior can be based on mechanical knowledge. This is not possible with techniques such as neural networks and deep learning. Finally, the framework can be applied directly on time-series data (i.e. raw measurements) and does not require a feature-based representation. However, the framework can use these features if available.
The rest of the paper is organized as follows. Section 3 proposes the fleet-based condition monitoring framework. Multiple possible implementations are discussed for each of the building blocks. Next, the framework is applied and evaluated on two use-cases considering a fleet of electrical drive trains. Section 5 processes the electrical signatures in three different signal domains, while section 6 analyzes vibration data. Both experimental evaluations are compared to handcrafted signal processing techniques for condition monitoring as a benchmark. Finally, section 7 discusses some general insights and conclusions.
An anomaly detection framework for fleet condition monitoring of similar machines is proposed, which detects deviating, thus probably faulty, machine behavior. This approach can detect various faults, even those not considered a priori, and does not require a training data set. The fault detection process relies on three key assumptions. First, the majority of the machines are healthy, machines whose behavior deviates from the majority are considered faulty. Second, all machines are operating in a similar operational condition, having an identical machine signature (e.g., speed). Third, the machines are subject to similar relevant environmental parameters influencing the machines’ behavior. If this were not the case, deviating behavior could arise due to a different environment instead of a machine’s health status. Practical examples where all of these assumptions hold are applications such as monitoring a conveyor belt or railway components. In both of these cases, a deviating machine is an indication of a machine fault.
Thoughtful preprocessing could allow utilizing the proposed framework in many more applications by considering external or contextual variables as well as domain knowledge. First, a heterogeneous fleet can cause machine diversity. However, expert knowledge such as an ontology could help to select comparable subsets of machines [41, 42, 43]. The framework could then assess each of these subsets individually. Second, the machine fleet can operate in dissimilar operational conditions. In such a case, two strategies could allow machine comparisons: normalization and a short-term historical database. The former exploits domain knowledge. For example, if the physics of a machine is available, then it is possible to normalize the measurements with respect to the working conditions such as by removing the effect of speed using angular resampling [10]. The latter contains recent machine measurements in various operational conditions. Clustering operational parameters could provide relevant data samples of machines in comparable operational conditions, which the framework would then use to assess their current health status’. Third, the machine fleet might be subject to different environmental parameters. Clustering machines based on these parameters could result in identifying subsets of comparable machines.
Four building blocks interact in the proposed framework: machine comparison, fleet clustering, anomaly detection, and visualization (fig. 1). The design and implementation choices of each block can be adapted to the application’s need, based on expert knowledge. The first block measures the similarity between two machine behaviors, where a variety of measures are possible. While this work discusses several concrete possibilities, a user can select an appropriate one for the application at hand based on domain knowledge. The second block uses a clustering algorithm with the selected measure to group machines that are behaving similarly. The third block uses the discovered clusters to assign an anomaly score to each machine. In a fleet where the majority of the machines are healthy, a faulty machine could occur in any cluster that consists of less than half the machines. The fourth block provides a deeper understanding of the framework by visualizing the result of the other blocks to help guide a domain expert to analyze a specific deviating machine.
The framework provides an online fleet analysis by considering small analysis windows. The required window size depends on the implementation of the first block, for example when evaluating in the frequency domain. The robustness of the prediction can be improved by considering a machine faulty based on n consecutive analysis windows.
3.1. Building block 1: Comparing two machines
Clustering uses the similarity of entities to infer a number of groups (i.e., clusters). Deciding how to compare a pair of machines (X, Y ) is thus an important step in the proposed framework. Optionally, it allows exploiting engineering knowledge to guide the clustering. This block consists of two actions. First, the machine data is transformed into the desired signal domain. Second, the machines are compared in this new domain. Sequential data is represented as () where
is one data point and n is the length of the sequence. Each machine in the fleet is assumed to have the same total number of data points n.
3.1.1. Domain specific preprocessing
Data about machines is generated by sensors that capture measurements in the time domain. Transforming the raw data to another signal domain can emphasize certain physical phenomena. The framework
Figure 1: Interaction of the four blocks. Visualization can aid in interpreting the predictions. A user can select the machine comparison and anomaly score blocks based on domain knowledge, if available.
can support any type of transformation. This work focuses on four output domains: time, frequency, time/frequency, and a feature space.
For all domains, it is important to normalize the data before comparing machines to help remove differ-ences that simply arise due to the unique characteristics of each machine and cope with the fact that the measurement values can originate from different mechanical units. Without normalization, a comparison between machines would be dominated by the unit with the largest range.
Time domain. Waveforms can be used to analyze time domain measurements, as they contain information about amplitude, shape, and existing symmetries. Time domain analysis is particularly interesting to detect trends and patterns over time.
Time series normalization is used to scale all machine measurements to the same range. Methods such as min-max scaling, ()), and z-score scaling, (
¯
, are popular, but sensitive to outliers and should be used with care. If outliers are expected, more robust estimators like percentile scaling, (
)), are preferred.
Frequency domain. A Fast Fourier Transform (FFT) can be used to convert time domain data to the frequency domain, which can yield insights that are hidden in the time domain. However, it is not possible to track trends and patterns over time. Techniques such as cyclostationary analysis [44] can help to analyze rotating machinery.
The previously mentioned normalization techniques are applicable in this domain as well. To remove differences due to the unique machine characteristics, mechanical insights could be of use. For example, frequency spectra can be scaled by a specific frequency corresponding with a mechanical phenomenon or use signal processing techniques to obtain equal Parseval energy.
Time/frequency domain. The time/frequency domain detects frequency trends and patterns over time and can be shown in a spectrogram. Different normalization procedures can be applied, depending on the application’s need. One can normalize each of the obtained spectra using the techniques described in the frequency domain. Another option is to scale the complete spectrogram using the techniques from the time domain.
Feature space. Various feature extraction methodologies, based on statistics, signal processing or physics, can extract indicators for fault diagnostics from any signal domain. Examples are amplitude in the time domain, order extraction or more complex signal processing techniques such as envelope analysis. The framework can use any of these indicators as a representation of machine behavior. In this case, normalization is required to equally weigh all features.
3.1.2. Dissimilarity measure
A measure) is needed to compare the distance or dissimilarity of a machine pair (X, Y ), represented by sequences (
) and (
) with length n. A data point
or
can be 1-dimensional, such as for the time domain, or m-dimensional, for example when representing a spectrogram or a feature space with m features.
Various functions to measure dissimilarity between sequences can be considered. An existing one can be selected using engineering knowledge or one can be designed specifically for the problem at hand. Two popular measures are:
1. Euclidean distance. The Euclidean distance is the straight-line distance between two sequences of length n, given by:
with ) being the Euclidean norm between two m-dimensional data points:
A disadvantage of the Euclidean distance is its sensitivity to subtle shifts in the time series. This is solved by Dynamic Time Warping.
2. Dynamic Time Warping. Dynamic Time Warping (DTW) is a successful algorithm in time series analysis which aligns in a nonlinear fashion a sequence representing time series [45, 46]. The goal is to find an alignment of sequences X and Y that has the minimal total distance. The dissimilarity between X and Y is then expressed as:
Where p = () expresses an optimal alignment with length L, also referred to as the warping path. Each
represents a tuple where the first element refers to the element in the x series on position
and the second element refers to the element in the y series on the
position. It satisfies three conditions:
The dissimilarity of individual data points and
is defined by a cost function
), the Euclidean norm (eq. (2)) by default. The sequences should be normalized before applying DTW, to avoid suboptimal alignment due to differences in amplitude. Figure 2 shows an example in which two arbitrary time series are aligned. Note that the Euclidean distance is considerably larger compared to the DTW dissimilarity even though the series are quite similar.
A variant to DTW used in this work is Ψ-DTW. Periodic signals can be misaligned due to a phase offset, which can result in DTW introducing undesired warping at the beginning and end (fig. 3a). This additional warping can lead to an unwanted increase in the DTW measure. Ψ-DTW [47] addresses this issue and avoids the undesired warping (fig. 3b). It does this by relaxing the DTW boundary condition by = ([1, Ψ], 1) or
= (1, [1, Ψ]), and
= ([
) or
= (
]), where Ψ
is a user-defined parameter. Apart from this, it operates in a similar way as the classic DTW. The example in fig. 3 shows that for periodic signals, Ψ-DTW leads to a better match.
Figure 2: DTW finds an optimal alignment between two time series (blue) by means of a warping path (orange). Euclidean distance in (a) is 7.0, the non-linear DTW in (b) has a dissimilarity of 1.0.
Figure 3: DTW on sine waves with a phase shift. Warping paths are shown in orange. While identical, the sine waves are considered dissimilar by classic DTW due to additional warping in the beginning and end. Ψ-DTW removes this effect and correctly identifies the sine waves as identical.
Figure 4: A dendrogram representing a cluster hierarchy. Agglomerative (bottom-up) clustering begins by assigning each item (blue dot) to its own cluster and iteratively merges the two most similar clusters until only one cluster is left. In this example, the horizontal distance between the items is considered being their distance.
DTW warping amount (Ψ-)DTW is known to work well when it is important to compare time series based on their shape. However, in some cases, DTW removes the difference between healthy and faulty signals. One example is in electrical current waveforms which are considered in one of the use-cases. To overcome this challenge, a measure that considers the amount of warping that occurs in Ψ-DTW is used [48]:
It is advised to use a normalized form, s(X, Y )/L, when using Ψ-DTW, as the length of the paths can vary.
3.2. Building block 2: Clustering the machines in the fleet
This block identifies similarly behaving machines by employing a clustering algorithm that uses the selected measure s(X, Y ) to compare pairs of machines. There are two important points to consider when selecting a clustering algorithm. First, the selected clustering algorithm should work for an arbitrary measure s(X, Y ). Some algorithms only work with certain measures (e.g., K-means requires a space where the mean is defined). Therefore, it is preferable to use an algorithm that represents clusters by actual data points and is hence more generic, such as K-medoids [49, 50], spectral clustering [51], hierarchical clustering [52] and DB-scan [53]. Second, the number of clusters is unknown in anomaly detection. If all the machines are healthy, a single cluster is expected whereas multiple clusters are needed if there are faulty machines. Some methods, like K-medoids, require the user to specify the number of clusters k in advance. In contrast, approaches like hierarchical clustering and DB-scan derive the number of clusters from the structure of the data, often by using a parameter that describes how tight a cluster should be.
Hierarchical clustering is proposed as the default implementation for this block. It offers the benefits of being relatively simple, producing results that can be easily visualized, and not having to set k upfront.
Hierarchical clustering. Hierarchical clustering builds a hierarchy of clusters. This can be done in either an agglomerative (bottom-up) or divisive (top-down) manner. However, agglomerative clustering is more common. This approach begins by assigning each item to its own subcluster. Then, the algorithm iteratively merges the two nearest (or most similar) subclusters until there is only one cluster left. The function to measure the distance between two subclusters is a parameter of hierarchical clustering. Two common choices are: single linkage, which considers the minimum distance between the elements of two subclusters and complete linkage, which considers the maximum. Other alternatives are average linkage and Ward’s method [54].
The result of the clustering can be represented in a tree-like structure called a dendrogram (fig. 4). It is relatively easy to interpret this structure, which is an additional benefit over other clustering techniques. At the lowest level, each cluster contains a single item. Going from the bottom-up, each subcluster is merged with its most similar subcluster until a single cluster is obtained. The vertical axis shows the distance between each of these subclusters. The higher this merge occurs, the less similar the subclusters are.
The proposed anomaly detection block requires that each machine is assigned to only one cluster. The hierarchical structure of the clustering needs to be partitioned to satisfy this constraint. A naive approach is to cut the hierarchy by using a threshold on the distance between subclusters. Selecting an appropriate threshold depends on s(X, Y ) (e.g., its range) and does not take the underlying cluster structure into
Figure 5: Upper: 10 data points form one cluster. Single linkage hierarchical cluster generates the corresponding dendrogram. There is a clear difference between pairwise and dendrogrammic distances, shown in triangular matrices with a green (low) to yellow (high) color scale. This difference leads to a low correlation.
Lower: a second cluster (blue) of 3 additional data points. In this situation, the difference in pairwise and dendrogrammic distances is relatively small. This leads to a higher correlation, indicating the presence of multiple clusters.
account. To overcome this challenge, a threshold on the cophenetic correlation is proposed. This offers the advantages of being independent of the range of s(X, Y ) and considering the structure of the data.
The cophenetic correlation measures how well a clustering preserves the original pairwise distances [55], that is, how well the clustering reflects the original structure of the data. A high cophenetic correlation indicates that two or more cluster partitions are present in the data, while a low cophenetic correlation suggests that only a single cluster partition is present in the data. Formally, the cophenetic correlation is defined as eq. (5):
where P is the set of all machine pairs (X, Y ), s(X, Y ) is the measure used in the clustering, and t(X, Y ) is the dendrogrammic distance, which is the height at which machines X and Y are first joined in the dendrogram.Finally, ¯s and ¯t are the average pairwise dissimilarities and dendogrammic distances. A pseudocode for splitting a dendrogram is given in algorithm 1.
The intuition behind the cophenetic correlation procedure is illustrated with a simple example (fig. 5). First, the upper part considers a single cluster (shown in red). There is a difference between the pairwise and dendrogrammic distances. The correlation between them is rather low. Second, the lower part of the figure shows a situation where a second cluster is present (shown in blue). Now, the distances within a cluster are relatively small. In this situation, the larger distances between clusters dominate the correlation, which is now close to 1.
The cophenetic correlation threshold is the most important parameter in the default implementation, as it controls the cluster partitioning. It is set depending on the cost of not identifying a faulty machine (i.e., a false negative, missed detection) or incorrectly labeling a healthy machine as faulty (i.e., a false positive, false alarm). Moreover, it determines when an alarm occurs when monitoring slowly degrading machines. If
is set too high, a faulty machine might be partitioned together with machines considered to be healthy. The anomaly detection procedure would then report faulty machines as being healthy. Multiple small partitions containing machines considered to be healthy would be formed if
is set too low. Anomaly detection will then report these machines as being faulty. The value of this threshold is set depending on the application.
3.3. Building block 3: Anomaly detection
Block 3 uses the discovered clustering to assign an anomaly score to each machine. Machines with similar behavior are expected to be in the same cluster. The larger a cluster, the more likely it is to indicate healthy behavior. Each machine’s anomaly score is simply the fraction of total machines that are not assigned to its cluster, hence an anomaly score close to one indicates a machine that is likely to be faulty. Thresholding this score to classify each machine as either healthy or faulty requires reasoning about the size of the fleet and the expected proportion of faulty machines.
More sophisticated unsupervised anomaly and outlier detection techniques can be used. These can be based on clustering, neighborhood, or statistics [29, 35, 56]. Cluster-based techniques are preferred for fleet-condition monitoring applications. This class of techniques identifies anomalies as small clusters or unclustered data points. Neighborhood-based techniques like k-th Nearest Neighbor (kNN) or Local Outlier Factor (LOF) detect anomalies based on respectively the distance to the K-th nearest neighbor or the density of its neighborhood. However, the accuracy of these methods depends on the number of analyzed machines, which can be limited. Statistical-based techniques assume that anomalies arise in low-probability regions. However, selecting an appropriate hypothesis test for high-dimensional data is non-trivial. Moreover, these tests require making assumptions about the underlying data distribution, which might not hold in practice.
3.4. Building block 4: Visualization
Visualizing the results of each block can yield insights into the machine health conditions and the framework’s predictions. A simple visualization shows results of each block (e.g. fig. 6). The use-cases show more examples in figs. 12, 13 and 16.
First, both parts of the machine comparison block can give detailed insights. The template has space for preprocessed machine data, to give the user a more detailed view (fig. 6a.). A dissimilarity matrix (fig. 6b.) shows the result of pairwise machine comparisons. A color map helps to see relative differences, as absolute values depend on the selected comparison method. Second, clustering gives a more global overview of the machine fleet. A dendrogram can, for example, show a cluster hierarchy, to detect divergent machines (fig. 6c.). Moreover, colors can indicate cluster partitions (fig. 6d.). This helps to tune the threshold . Finally, the anomaly scores correspond to the predicted health conditions (fig. 6e.).
The proposed framework is validated in two different use-cases. Both involve the detection of a voltage unbalance in a fleet of electrical drivetrains. This results in early machine failure and is often caused by a grid fault or a bad wire connection. Early detection will extend the lifetime of a machine and will minimize wear due to increased temperature and additional vibrations.
Each use-case includes signal processing techniques for condition monitoring. These are used as a benchmark for the framework. Its broad applicability is shown by different variants, using different degrees of domain knowledge. Each variant makes use of different implementations for machine comparison but uses the default fleet clustering, anomaly detection, and visualization blocks. It analyzes and predicts machine status for 0.5s non-overlapping analysis windows. A machine is considered to be faulty after being identified as anomalous for five consecutive windows. This reduces misclassifications due to signal noise.
Figure 6: An example of the visualizations provided by the framework, which are analyzed in detail in section 5.3. In this example, drivetrains D1 2 and D2 10 are faulty. Signal data (a.) gives the user a detailed view. In this example, min-max normalization has been applied to the waveforms. A global overview of the fleet is given by pairwise comparisons (b.) and a dendrogram (c.). Finally, the cluster partitions (d.), obtained with 9, and their corresponding anomaly scores (e.) indicate the prediction for a machine’s health condition. In this example, the framework correctly identifies D1 2 and D2 10 as being anomalous.
Three performance metrics consider the two types of misclassifications: false negatives (fn) and false positive (fp) (table 1). Precision (eq. (6a)) and Recall (eq. (6b)) help the user to take the cost into account of labeling a healthy machine as being faulty (precision, false alarm) versus missing a faulty machine (recall, missed detection). The F1-score (eq. (6c)) considers both faults equally important and is used as a single performance measure.
The evaluation procedure considers both stationary as well as dynamic operational conditions. The former allows analyzing the consistency of the framework, the latter its generality. Within these operational conditions, the health status of machines D1 2 and D2 10 is varied.
Table 1: Possible prediction mistakes
Predicted condition Faulty True positive (tp) False positive (fp) Precision (eq. (6a))Healthy False negative (fn) True negative (tn) Recall (eq. (6b)) F1-score (eq. (6c))
This use-case involves electrical measurements of an electrical machine fleet. The task is to detect a voltage unbalance fault based on current signals. Three variants show the general applicability of the framework in different signal domains. The first variant uses time domain data and demonstrates a generic approach for waveform analysis. The second variant is an example on how to include domain knowledge. Finally, the third variant is a hybrid method and analyzes time/frequency data.
5.1. Experimental fleet test rig and data set
A single phase current of ten electrical drivetrains (fig. 7) is measured at 25600 Hz. Each drivetrain consists of an electrical motor pair, connected by a flexible jaw coupling. A 3-phase Squirrel cage induction motor (SCIM)drives the shaft. Ten ABB
drive controllers manage the drive sides with internal closed-loop direct-torque control. All drivetrains are set up to always have similar speeds. The other motor acts as load and is either a 3 kW Direct Current (DC) motor (drivetrains D1 1 – D1 5) or a 3 kW Wound Rotor Synchronous Motor (WRSM) (drivetrains D2 6 – D2 10). The specifications of each machine are shown in table 2. Resistors generate load while keeping the current excitation constant. The resulting load torque is proportional to speed and corresponds to the rated load at rated speed.
A 3 Ω external resistor emulates a voltage unbalance in drivetrains D1 2 and D2 10. This resistor is inserted between the drive controller and the motor (fig. 7b). A higher resistance would trigger the internal safeties of the ABB drive controller.
The data set includes both stationary and dynamic system behavior under loaded and unloaded operating conditions. For the stationary case, each drivetrain is running at a fixed speed (820 RPM or 1500 RPM). On the other hand, during the dynamic operational condition, all machines perform a run-up from 0 RPM to 1200 RPM.
Figure 7: The measurement setup consists of ten drivetrains, having a 3kW SCIM (drive, inner part of the fleet) coupled to a 3kW WRSM (left) or DC (right) motor (load). Drive and load side motors are connected with a rubber coupling.
Table 2: Rated parameters of motors
Figure 8: Voltage unbalance detection based on amplitude is not possible in a fleet context. Machine D2 10 is indicated in orange. The variance within the healthy fleet is larger than its amplitude increase due to a voltage unbalance. All machines are running stationary at 820 RPM (loaded).
5.2. Classic voltage unbalance diagnosis techniques
Motor Current Signature Analysis (MCSA) is a popular class of techniques for detecting motor faults that apply signal processing methods to current signals. It can be used to detect a voltage unbalance [57, 58]. Some of these methods require measuring multiple motor current signals. For example, the phase shift between the three current signals of a healthy system is 120 degrees. A voltage unbalance results in changes to this angle [59]. Another example measures an estimator for based on two
current signals. A fluctuating component at twice the fundamental frequency is observed due to the presence of inverse sequence current in the stator [58].
A single current sensor is sensitive to a secondary effect of the voltage unbalance [58]. The oscillations of the shaft lead to an increase in the third harmonic. This work compares the framework with three approaches that make use of this signal. First, careful time domain analysis reveals an increase in the current amplitude. However, a fleet-based approach cannot use this current amplitude as the difference due to a fault is lower than the variability within the fleet (fig. 8). Moreover, the current amplitude highly depends on the operational conditions (i.e. rotational speed and load). Second, the fault affects the current waveform. However, the shape of the waveform depends on the operational speed (fig. 9). Any type of deviation should thus trigger a machine fault prediction. Finally, the frequency domain shows an increase of the third harmonic amplitude (fig. 10). The harmonic amplitude is not sensitive to changes in speed but does depend on the load. Different load conditions could thus require different thresholds.
In each of these approaches, the machine is affected up to a certain speed (1400RPM). This is demonstrated in fig. 10, where the amplitude of the third harmonic decreases at that point. The hypothesis is that at higher speeds, a flux weakening region is entered. For example at the rated load above 1385 RPM, the stator frequency exceeds the rated 50 Hz. Therefore, the flux is decreased in the machine to keep the stator voltage under its limit. As a consequence, the magnetic saturation level decreases and the third harmonic in the current becomes less visible.
The remainder of this use-case utilizes a simple condition monitoring method as a benchmark to validate the proposed framework. This method considers a machine as healthy when its third harmonic amplitude is within standard deviations of the fleet’s mean. Note that this is a simplified fleet approach, which avoids setting thresholds manually. Table 3 shows the fault detection performance for this methodology with different values of
, Figure 10 illustrates the specific case for
= 2.
5.3. Variant 1: Time domain
This variant evaluates the different operational conditions in the time domain representation. The goal is to detect any deviation on the current waveform with a generic comparison measure.
Figure 9: Current waveform analysis can detect a voltage unbalance. The fault affects the current waveform of a faulty drivetrain (red). The exact shape is however dependent on the operational speed. Above 1385 RPM, the fault effect disappears due to an increase of the flux. A current signal of a healthy drivetrain (blue) is shown for reference.
Figure 10: The third harmonic can be used to detect a voltage unbalance. Faulty machines D1 2 (red) and D2 10 (orange) show a clear increase compared to the healthy (blue) machines between 300 and 1420 RPM. The orange band indicates the range in which a machine is considered healthy, being 2 standard deviations around the mean. This correctly detects the machine health condition in most of the cases (table 3).
Table 3: Performance of classic voltage unbalance detection using the third current harmonic. Scores corresponding to a scenario’s highest F1-score are indicated in bold.
Figure 11: DTW of a healthy (blue) and faulty (red) machine. While the measured current signals have a clear difference in shape (a), DTW can non-linearly align them so the signals become similar (b).
5.3.1. Block 1: Machine comparison
Due to the fact that the differences appearing in current amplitudes cannot be used in a fleet-based approach (fig. 8), min/max normalization is performed. Moreover, the current signals are downsampled to 50 samples per period. This removes the effect of signal noise and allows the analysis to focus on the waveforms.
The waveform of a machine affected by a voltage unbalance depends on the operational speed. A robust technique is thus required that detects any difference in this shape. DTW is a popular technique for pattern recognition, but the classic implementation considers waveforms of healthy and faulty machines as similar (fig. 11). However, aligning healthy and faulty signals involves significant amounts of warping. In contrast, little warping occurs when comparing two healthy machines. Therefore, the amount of warping (eq. (4)) can be used as a powerful indicator for waveform deviations, regardless of the exact shape. The Ψ-DTW extension avoids unnecessary warping in the boundary points. Unaligned start- and endpoints would otherwise dominate the DTW warping amount measure. The used value of Ψ is corresponding to half a period of the current signal, allowing to align any phase offset.
5.3.2. Block 2: Fleet clustering
Hierarchical clustering, combined with a cophenetic correlation partition procedure, generates machine clusters. First, hierarchical clustering uses pairwise machine comparisons and single linkage to construct a cluster hierarchy. Single linkage considers the minimum distance as the similarity of two subclusters. The cophenetic correlation procedure partitions this hierarchy with recursive top-down evaluations, if the cophenetic correlation is larger than (algorithm 1).
The value of affects the effectiveness and the performance of the method but its optimal value is application dependent. Therefore, the framework is evaluated for various values of
to show the parameter’s impact on predictive performance. A lower value results in more clusters, which leads to more machines incorrectly labeled as faulty (low precision, higher false alarm rate). A higher value, on the other hand, leads to more misses of faulty machines (low recall, higher missed detection rate).
5.3.3. Block 3: Anomaly detection
The anomaly score is defined as the fraction of machines outside each cluster. This is a simple scoring mechanism to demonstrate the concept of anomaly detection. At least of the machines are assumed to be healthy. The anomaly threshold
, above which a machine is predicted as faulty, is thus set equal to
. If required, more advanced anomaly detection techniques can be used.
5.3.4. Block 4: Visualization
Visualizations allow a domain expert to interpret the framework’s predictions. By showing the results of each block, an expert can confirm the predictions and avoid taking unneeded actions. All use-cases use the default implementation of section 3.4, being a (normalized) signal domain representation, machine dissimilarity, clustering, and anomaly scoring.
5.3.5. Results
The example of fig. 6 shows the evaluation of a single analysis window. A domain expert can use this visualization to validate the predictions or to tune the general framework. The time series show deviating behavior of machines D1 2 and D2 10 in each of the subfigures corresponding to the different building blocks.
The evaluation procedure uses performance metrics to evaluate the framework’s ability to detect a deviating waveform (table 4). Both the benchmark (table 3) and this implementation are able to detect an unbalance, except at 1500 RPM in a loaded condition due to the decrease of the flux. In other stationary scenarios, this implementation outperforms the benchmark method. Similar results are obtained in dynamic run-up scenarios. However, this implementation of the framework offers the advantage that it detects any deviations in the current waveforms, not only caused by an increase of the third harmonic.
Table 4: Performance of the proposed framework for fault detection using electrical signature analysis via waveform comparison. Scores corresponding to a scenario’s highest F1-score are indicated in bold. For reference, the last column shows the performance of the classic diagnostics approach (table 3) with the highest F1-score. In five out of the six cases, the proposed approach obtains the best F1 score.
5.4. Variant 2: Frequency domain
This variant demonstrates the case in which detailed domain knowledge is included. A voltage unbalance affects the third harmonic of the fundamental frequency, which is used in this variant to compare the machines. In general, this is very similar to the benchmark method, as both depend on domain knowledge.
5.4.1. Block 1: Machine comparison
A FFT converts time series within an analysis window to their frequency data, which is converted to a log scale and normalized using min/max scaling. No outliers are present in the considered data set, other cases might need more robust techniques. Machines are represented in the feature space, by the amplitude or their current’s third harmonic. This increases if one of the machines is faulty, as shown in section 5.2. The periodic nature of current signals allow for estimating the fundamental frequency. This corresponds to the highest peak in the frequency spectrum. The third harmonic is found at three times this frequency. The maximum value in a small window of 5Hz around the estimated frequency is used as its amplitude. This variant compares a machine pair by the difference in their third harmonic amplitudes (eq. (7)).
5.4.2. Blocks 2 – 4: Fleet clustering, Anomaly detection & Visualization
Clustering, anomaly detection, and visualization make use of the default implementations. Hierarchical clustering combined with cophenetic correlation clusters the fleet. Therefore, the framework is evaluated for various values of to show the parameter’s impact on predictive performance. Anomaly scores use these clusters and are defined as the fraction of machines outside each cluster. The implementation considers a score above
as anomalous behavior. The visualization shows the frequency spectra enriched with the harmonic amplitudes windows.
Figure 12: Visualization of frequency domain signals (a.), machine similarities (b.), clustering (c. & d.) and anomaly scores (e.). The red vertical line in (a.) indicate the small windows of 10Hz around the fundamental and third harmonic frequencies. Its amplitudes are determined by each window’s maximum amplitude. Clusters are partitioned with 9. Each machine is running stationary at 820 RPM. The framework correctly identifies D1 2 and D2 10 as anomalous.
5.4.3. Results
Figure 12 shows the evaluation of a single analysis window. The anomalous behavior of D1 2 and D2 10 is successfully detected.
The evaluation procedure uses performance metrics to evaluate the framework’s ability to detect deviating third harmonic amplitudes (table 5). This framework implementation has similar performance compared to the previous variant (table 4). It outperforms the benchmark method (table 3) in almost all stationary operational conditions. At 1500 RPM in loaded condition, both methods show poor behavior. In the dynamic run-up scenarios, on the other hand, the benchmark method is very competitive. It shows a higher recall (less missed detections), but its precision is slightly lower (more false alarms). This results in F1-scores in favor of the benchmark method.
Table 5: Performance of the proposed framework for fault detection using electrical signature analysis via harmonic amplitudes. Scores corresponding to a scenario’s highest F1-score are indicated in bold. For reference, the last column shows the performance of the classic diagnostics approach (table 3) with the highest F1-score. In five out of the six cases, the proposed approach obtains the best F1 score.
5.5. Variant 3: Time/Frequency domain
This variant demonstrates a hybrid approach, in which a domain expert expects a fault to manifest itself in the frequency content, without specifying the exact frequency. It enables fault detection in a large range of working conditions. In this case, the framework is implemented to detect deviations in the spectrogram, a time/frequency domain representation.
5.5.1. Block 1: Machine comparison
Each machine is represented by its spectrogram, a sequence of FFTs obtained from consecutive windows. This implementation converts the data on a log scale and normalizes using min/max scaling FFT to have a similar value ranges for each machine. A low-pass filter at 200 Hz removes unrelated high frequency content.
DTW (eq. (3)) is preferred as dissimilarity measure over Euclidean distance. If the drivetrain’s speed fluc-tuates, DTW will optimally align the spectrograms. In that case, these are considered as n-dimensional time series, with n the number of frequency bins. Each of these bins is equally weighted in this implementation. If desired, a user could opt to have different weighting for frequency bins of interest.
A high frequency resolution can result in false predictions due to the curse of dimensionality [60]. In that case, noise over unaffected frequencies would be larger than the amplitude increase in a single frequency bin. The task is however only to perform fault detection. Since the exact frequency at which a deviation occurs is not of interest, it is safe to have a low frequency resolution. In this variant, the selected FFT window size is 0.05s.
5.5.2. Blocks 2 – 4: Fleet clustering, Anomaly detection & Visualization
This variant reused the default clustering, anomaly detection, and visualization implementations. The first block uses hierarchical clustering and cophenetic correlation. Therefore, the framework is evaluated for various values of to show the parameter’s impact on predictive performance. The second considers the fraction of machines outside each cluster as anomaly scores. A score above
is considered as anomalous behavior. Finally, the visualization presents the signal data as a spectrogram.
5.5.3. Results
Figure 13 shows the evaluation of a single analysis window. A low-resolution spectrogram shows different machine behavior for faulty machines D1 2 and D2 10. Moreover, the dendrogram suggests using a lower value for , as the healthy machine D2 7 appears in its own cluster.
The evaluation procedure uses performance metrics to evaluate the framework’s ability to detect deviating spectrograms (table 6). It outperforms the benchmark method (table 3) in most stationary scenarios. None of the methods detect an unbalance at 1500 RPM in a loaded condition. In the dynamic run-up scenarios, on the other hand, both methods have similar results. In general, this variant offers the advantage that it does not require domain knowledge or handcrafted rules.
Figure 13: Visualization of time/frequency domain signals (a.), machine similarities (b.), clustering (c. & d.) and anomaly scores (e.). The frequency resolution in (a.) is course, to avoid the curse of dimensionality while clustering. Clusters are partitioned with 9. Each machine is running stationary at 820 RPM. The framework correctly identifies D1 2 and D2 10 as anomalous and incorrectly partitions D2 7 by itself. This suggests that the choice of
might not be optimal.
Table 6: Performance of the proposed framework for fault detection using electrical signature analysis via spectrogram analysis. Scores corresponding to a scenario’s highest F1-score are indicated in bold. For reference, the last column shows the performance for the classic diagnostics approach (table 3) with the highest F1-score. In five out of the six cases, the proposed approach has the highest F1 score.
Figure 14: A 3D accelerometer (indicated in red) is mounted on the drive side motor and is sampled at 12800Hz.
This use-case involves vibration measurements of an electrical machine fleet, similar to the previous use-case. The task is to detect a voltage unbalance fault based on accelerometer signals. Accelerometers are often already present for condition monitoring of other (mechanical) components, hence it is valuable if they can also be used for voltage unbalance detection. This section illustrates that it is indeed feasible and moreover shows how the framework can use multiple sensor channels and indicators of interest.
6.1. Experimental fleet test rig and data set
This use-case considers a setup of ten electrical drivetrains. A voltage unbalance is again emulated with an external resistor. PCB ICP AC accelerometers measure vibrations. One sensor is mounted on the side of the drive side motor (fig. 14) and is sampled at 12800 Hz. Operational speed is measured with a single tachometer mounted on D1 2, triggered once every rotation. This is representative for the full fleet, as all drivetrains are set up to always have the same speed, controlled by an ABB drive controller. One exception is machine D2 8, whose run-up parameters are set incorrectly. For this reason, this drivetrain is not used in the evaluation procedures when considering a run-up.
6.2. Classic voltage unbalance diagnosis techniques
For a healthy, symmetrical, motor, electrodynamic forces have no alternating components. In the balanced case, the forces are rotating with the flux. In the case of unbalance they have a component rotating in the opposite direction. This results in a pulsation of the amplitude of the forces at twice the stator frequency. This causes radial vibrations in the induction motor [58, 61, 62]. The forces have a tangential component (creating the torque) but also a radial component (the stator flux attracts the rotor like an electromagnet) generating the vibrations.
Detecting voltage unbalance based on vibration signatures is an example of a condition monitoring case that requires different thresholds, as operational conditions affect vibration amplitudes. For example, additional load or operating at a resonance frequency leads to an increase, while lower operating speeds result in lower vibration levels (fig. 15). Moreover, many frequencies and sensor directions can be considered. Detailed machine knowledge is required to know these frequencies a priori.
The machine response varies for D1 2 and D2 10, the faulty drive trains (fig. 15). Their resonance frequencies differ and in loaded conditions, D2 10 is significantly more affected by the fault compared to D1 2. The optimal indicator depends thus not only on operational conditions but also on the evaluated machine.
The remainder of this use-case utilizes a simple condition monitoring method as a benchmark to validate the proposed framework. This method considers a machine as healthy when the amplitude of all its harmonics 3 – 6 is within standard deviations of the fleet’s mean. If one harmonic is outside this range, the machine is predicted as being faulty. Note that this is a simplified fleet approach, which avoids setting threshold manually. Table 7 shows the fault detection performance for this methodology with different values of
, Figure 15 illustrates the specific case for
= 2.
Figure 15: Harmonics 1 (top) – 6 (bottom) of the X-axis accelerometer in a run-up. The faulty drivetrains are shown in blue (D1 2) and red (D2 10). The optimal indicator to detect the voltage unbalance depends on the machine of interest and the operational speed. The orange band indicates the range in which a machine is considered healthy, being 2 standard deviations around the mean.
Table 7: Performance of classic voltage unbalance detection using accelerometer harmonics 3 – 6. Scores corresponding to a scenario’s highest F1-score are indicated in bold.
6.3. Variant 1: Frequency domain
This variant demonstrates the use of multiple sensors. The implementation combines each of the accelerometer directions (X, Y, Z) in the comparison measure. Moreover, it considers domain knowledge by multiple harmonic frequencies for each of these directions. A data-driven approach automatically selects the best indicator value.
6.3.1. Block 1: Machine comparison
Machines are represented by harmonic frequencies 3 – 6 of each sensor direction. The implementation uses the tachometer to define the fundamental frequency. A FFT converts the time domain measurements to frequency spectra, in which the maximum value within a window of 5Hz around the estimated frequencies is considered as their amplitude. Together, these form a sequence with length 1 and 12 dimensions (3 sensor directions, 4 harmonics).
Accelerometer signals can be noisy and require a robust normalization procedure. This variant uses percentile scaling, as data can be affected by outliers. In this case, normalization also makes sure to equally weight every harmonic. Otherwise, a harmonic with a larger range could dominate the clustering.
The Euclidean distance measures the differences between a machine pair (eq. (1)).
6.3.2. Blocks 2 – 4: Fleet clustering, anomaly detection & visualization
This variant reuses the default clustering, anomaly detection, and visualization blocks. The first combines hierarchical clustering with a cophenetic correlation procedure. Therefore, the framework is evaluated for various values of to show the parameter’s impact on predictive performance. The second uses the fraction of machines in each cluster as the anomaly score. A score above
is considered as anomalous behavior. Finally, the frequency spectra of each drivetrain are shown combined with the considered harmonic windows.
6.3.3. Results
Figure 12 shows the evaluation of a single analysis window. The anomalous behavior of D1 2 and D2 10 is successfully detected in each block.
The evaluation procedure uses performance metrics to evaluate the framework’s ability to detect faults based on multiple sensor directions and harmonics (table 8). In loaded scenarios, it outperforms the benchmark method (table 7). Both approaches show poor results in unloaded scenarios, which can be explained by the reduced vibrations in the absence of a load. At 1500 RPM, none of the methods detect the fault due to a decrease of the flux. Lower general performance (compared to electrical signature analysis, section 5) in dynamic run-up scenarios is caused by the fault not manifesting itself in the vibration signature at every operational speed. In general, this implementation offers the advantage that it can consider multiple harmonics, without needing individual threshold values. This is an advantage especially when the optimal indicator varies with the operational conditions.
Figure 16: Visualization showing the three accelerometer frequency domain signals (a.), machine similarities (b.), clustering (c. & d.) and anomaly scores (e.). The red vertical lines in (a.) indicate the used harmonics 3 – 6. Clusters are partitioned with 9. Each machine is running stationary at 820 RPM, with D1 2 and D2 10 having a voltage unbalance. D1 2 is less affected by the voltage unbalance compared to D2 10, which is visible in the dendrogram.
Table 8: Performance of the proposed framework for fault detection using vibration signature analysis via harmonic amplitudes. For reference, the last column shows the performance for the classic diagnostics approach (table 7) with the highest F1-score. In five out of the six cases, the proposed approach has the highest F1 score.
Machine fleet condition monitoring offers several advantages over both traditional condition monitoring approaches and supervised machine learning techniques. First, it removes the need to know all potential machine faults a priori, as the fleet-based approach can detect any deviation in machine behavior. However, the framework can incorporate domain knowledge if available. Second, fleet monitoring does not require a high-quality historical (labeled) data set for training. Both traditional signal processing approaches as supervised machine learning need this data to learn faulty machine behavior. Moreover, these require ground-truth knowledge about the machine’s actual health status. This is not required for fleet monitoring, as it assumes the majority of the machines to be healthy. Third, fleet monitoring allows analysis in dynamic operational conditions. Online comparisons allow detecting deviating machine behavior even in unconsidered operational conditions. Finally, the framework offers a high level of interpretability. Visualizations allow a domain expert to get insights into the predictions and gain confidence in the methodology. This is especially in contrast with black box machine learning techniques, whose models are very hard to interpret.
The authors acknowledge the financial support of VLAIO (Flemish Innovation & Entrepreneurship) through the Baekeland PhD mandate [nr. HBC.2017.0226]; the O&O project REFLEXION [nr. IWT. 150334]. Jesse Davis is partially supported by the KU Leuven research funds [C14/17/070] and Research Foundation - Flanders [EOS No. 30992574]. Jesse Davis, Wannes Meert and Konstantinos Gryllias receive funding from the Flemish Government under the ”Onderzoeksprogramma Artificile Intelligentie (AI) Vlaanderen” programme.
[1] M.L.D. Wong, L.B. Jack, and A.K. Nandi. Modified self-organising map for automated novelty detection applied to vibration signal monitoring. Mechanical Systems and Signal Processing, 20(3):593–610, apr 2006.
[2] Jay Lee, Hung-An Kao, and Shanhu Yang. Service Innovation and Smart Analytics for Industry 4.0 and Big Data Environment. In 6th Conference on Industrial Product-Service Systems, volume 16, pages 3–8, Windsor, Ontario, Canada, 2014. Elsevier B.V.
[3] Bryan L Matthews, Santanu Das, Kanishka Bhaduri, Kamalika Das, Rodney Martin, and Nikunj Oza. Discovering Anomalous Aviation Safety Events Using Scalable Data Mining Algorithms. Journal of Aerospace Information Systems, 10(10):467–475, 2013.
[4] David Siegel. Prognostics and Health Assessment of a Multi-Regime System using a Residual Clustering Health Monitoring Approach. PhD thesis, University of Cincinnati, 2013.
[5] Will Jacobs, Huw Edwards, Visakan Kadirkamanathan, and Andrew R Mills. Inter-engine variation analysis for health monitoring of aerospace gas turbine engines. European PHM, 2018.
[6] Victoria J. Hodge, Simon O’Keefe, Michael Weeks, and Anthony Moulds. Wireless Sensor Networks for Condition Mon- itoring in the Railway Industry: A Survey. IEEE Transactions on Intelligent Transportation Systems, 16(3):1088–1106, jun 2015.
[7] Emanuele Fumeo, Luca Oneto, and Davide Anguita. Condition Based Maintenance in Railway Transportation Systems Based on Big Data Streaming Analysis. Procedia Computer Science, 53(1):437–446, 2015.
[8] Fabien Meinguet, Eric Semail, Xavier Kestelyn, Yves Mollet, and Johan Gyselinck. Change-detection algorithm for short-circuit fault detection in closed-loop AC drives. IET Electric Power Applications, 8(5):165–177, may 2014.
[9] International Organization for Standardization. ISO 10816-3:2009 – Mechanical vibration – Evaluation of machine vibra- tion by measurements on non-rotating parts – Part 3: Industrial machines with nominal power above 15 kW and nominal speeds between 120 r/min and 15 000 r/min when measured in situ, 2009.
[10] Charles R C.R. Farrar, Hoon Sohn, and Keith Worden. Data Normalization : A Key For Structural Health Monitoring, 2001.
[11] Xiaodong Jia, Chao Jin, Matt Buzza, Yuan Di, David Siegel, and Jay Lee. A deviation based assessment methodology for multiple machine health patterns classification and fault detection. Mechanical Systems and Signal Processing, 99(June):244–261, jan 2018.
[12] Jay Lee, Fangji Wu, Wenyu Zhao, Masoud Ghaffari, Linxia Liao, and David Siegel. Prognostics and health management design for rotary machinery systemsReviews, methodology and applications. Mechanical Systems and Signal Processing, 42(1-2):314–334, jan 2014.
[13] Ruonan Liu, Boyuan Yang, Enrico Zio, and Xuefeng Chen. Artificial intelligence for fault diagnosis of rotating machinery: A review. Mechanical Systems and Signal Processing, 108:33–47, aug 2018.
[14] Rui Zhao, Ruqiang Yan, Zhenghua Chen, Kezhi Mao, Peng Wang, and Robert X. Gao. Deep learning and its applications to machine health monitoring. Mechanical Systems and Signal Processing, 115:213–237, jan 2019.
[15] B.S. Yang, T. Han, and J.L. An. ARTKOHONEN neural network for fault diagnosis of rotating machinery. Mechanical Systems and Signal Processing, 18(3):645–657, may 2004.
[16] Muhammet Unal, Mustafa Onat, Mustafa Demetgul, and Haluk Kucuk. Fault diagnosis of rolling bearings using a genetic algorithm optimized neural network. Measurement, 58:187–196, dec 2014.
[17] Meik Schlechtingen and Ilmar Ferreira Santos. Comparative analysis of neural network and regression based condition monitoring approaches for wind turbine fault detection. Mechanical Systems and Signal Processing, 25(5):1849–1875, jul 2011.
[18] Feng Jia, Yaguo Lei, Jing Lin, Xin Zhou, and Na Lu. Deep neural networks: A promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data. Mechanical Systems and Signal Processing, 72-73:303–315, 2016.
[19] Samir Khan and Takehisa Yairi. A review on the application of deep learning in system health management. Mechanical Systems and Signal Processing, 107:241–265, 2018.
[20] Zhuyun Chen, Konstantinos Gryllias, and Weihua Li. Intelligent Fault Diagnosis for Rotary Machinery Using Transferable Convolutional Neural Network. IEEE Transactions on Industrial Informatics, pages 1–1, 2019.
[21] Ming Ge, R. Du, Guicai Zhang, and Yangsheng Xu. Fault diagnosis using support vector machine with an application in sheet metal stamping operations. Mechanical Systems and Signal Processing, 18(1):143–159, jan 2004.
[22] Achmad Widodo and Bo-Suk Yang. Support vector machine in machine condition monitoring and fault diagnosis. Mechanical Systems and Signal Processing, 21(6):2560–2574, aug 2007.
[23] K.C. Gryllias and I.A. Antoniadis. A Support Vector Machine approach based on physical model training for rolling element bearing fault detection in industrial environments. Engineering Applications of Artificial Intelligence, 25(2):326–344, mar 2012.
[24] Alexandre Mauricio, Carina Freitas, Jaques Cuenca, Konstantinos Gryllias, Bram Cornelis, Karl Janssens, and Kilian Hendrickx. Condition monitoring of gears under medium rotational speed. In 24th Internation Congress on Sound and Vibration, page 8, London, 2017.
[25] L. Bull, K. Worden, G. Manson, and N. Dervilis. Active learning for semi-supervised structural health monitoring. Journal of Sound and Vibration, 437:373–388, dec 2018.
[26] T.J. Rogers, K. Worden, R. Fuentes, N. Dervilis, U.T. Tygesen, and E.J. Cross. A Bayesian non-parametric clustering approach for semi-supervised Structural Health Monitoring. Mechanical Systems and Signal Processing, 119:100–119, mar 2019.
[27] Lei Zhang, Qixin Cao, and Jay Lee. Performance Assessment for a Fleet of Machines Using a Combined Method of Ant-Based Clustering and CMAC. Advances in Mechanical Engineering, 5:603071, jan 2013.
[28] Cameron Sobie, Carina Freitas, and Mike Nicolai. Simulation-driven machine learning: Bearing fault classification. Mechanical Systems and Signal Processing, 99:403–419, 2018.
[29] Marco A.F. Pimentel, David A. Clifton, Lei Clifton, and Lionel Tarassenko. A review of novelty detection. Signal Processing, 99:215–249, jun 2014.
[30] Konstantinos Gryllias, Ioannis Antoniadis, and Christos Yiakopoulos. A novel semi-supervised mathematical morphology- based fault detection & classification method for rolling element bearings. In The 22nd International Congress on Sound and Vibration, page 8, Florence, 2015.
[31] Ziad Hasan Abu-El-Zeet and Vijaykumar Chhaganlal Patel. METHOD OF CONDITION MONITORING (Patent US 7275018 B2), 2006.
[32] G.M. West, S.D.J. McArthur, and D. Towle. Industrial implementation of intelligent system techniques for nuclear power plant condition monitoring. Expert Systems with Applications, 39(8):7432–7440, jun 2012.
[33] Peng Li, Jens Eickmeyer, and Oliver Niggemann. Data Driven Condition Monitoring of Wind Power Plants Using Cluster Analysis. In 2015 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, pages 131–136. IEEE, sep 2015.
[34] Sameer Al-Dahidi, Francesco Di Maio, Piero Baraldi, Enrico Zio, and Redouane Seraoui. A framework for reconciliating data clusters from a fleet of nuclear power plants turbines for fault diagnosis. Applied Soft Computing, 69:213–231, aug 2018.
[35] Varun Chandola, Arindam Banerjee, and Vipin Kumar. Anomaly detection: A survey. ACM Computing Surveys, 41(3):1– 58, jul 2009.
[36] Niels Henrik Pontoppidan, Sigurdur Sigurdsson, and Jan Larsen. Condition monitoring with mean field independent components analysis. Mechanical Systems and Signal Processing, 19(6):1337–1347, nov 2005.
[37] Afrooz Purarjomandlangrudi, Amir Hossein Ghapanchi, and Mohammad Esmalifalak. A data mining approach for fault diagnosis: An application of anomaly detection algorithm. Measurement, 55:343–352, sep 2014.
[38] Vincent Vercruyssen, Wannes Meert, Gust Verbruggen, Koen Maes, Ruben Baumer, and Jesse Davis. Semi-Supervised Anomaly Detection with an Application to Water Analytics. In 2018 IEEE International Conference on Data Mining (ICDM), volume 2018-Novem, pages 527–536. IEEE, nov 2018.
[39] Stuart Russell and Peter Norvig. Artificial Intelligence: A Modern Approach. Prentice Hall, mar 2009.
[40] Charu C. Aggarwal and Saket Sathe. Outlier Ensembles. Springer International Publishing, Cham, 2017.
[41] Gabriela Medina-Oliva, Alexandre Voisin, Maxime Monnin, Flavien Peysson, and Jean-Baptiste L´eger. Prognostics as- sessment using fleet-wide ontology. In Annual Conference of the Prognostics and Health Management Society 2012, PHM Conference 2012, Minneapolis, Minnesota, 2012.
[42] Gabriela Medina-Oliva, Alexandre Voisin, Maxime Monnin, and Jean Baptiste L´eger. Predictive diagnosis based on a
fleet-wide ontology approach. Knowledge-Based Systems, 68:40–57, 2014.
[43] Alexandre Voisin, Gabriela Medina-Oliva, Maxime Monnin, Jean-Baptiste L´eger, and Benoˆıt Iung. Fleet-wide Diagnostic and Prognostic Assessment. In Annual Conference of the Prognostics and Health Management Society, volume 3, 2011.
[44] Robert B. Randall and J´erˆome Antoni. Rolling element bearing diagnosticsA tutorial. Mechanical Systems and Signal Processing, 25(2):485–520, feb 2011.
[45] Meinard M¨uller. Dynamic Time Warping. In Information Retrieval for Music and Motion, pages 69–84. Springer-Verlag Berlin Heidelberg, 1 edition, 2007.
[46] Donald J Bemdt and James Clifford. Using dynamic time warping to find patterns in time series. In Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, pages 359–370, Seattle, 1994. AAAI Press.
[47] Diego Furtado Silva, Gustavo E. A. P. A. Batista, and Eamonn Keogh. On the Effect of Endpoints on Dynamic Time Warping. In , page 10, San Francisco, California, 2016. ACM.
[48] Kilian Hendrickx, Wannes Meert, Bram Cornelis, Karl Janssens, Konstantinos Gryllias, and Jesse Davis. A Fleet-Wide Approach for Condition Monitoring of Similar Machines Using Time-Series Clustering. In Springer, editor, Proceedings of pages 101–110. Springer, Cham, Santander, Spain, jun 2019.
[49] Leonard Kaufman and Peter J. Rousseeuw. Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, 1990.
[50] Hae Sang Park and Chi Hyuck Jun. A simple and fast algorithm for K-medoids clustering. Expert Systems with Applications, 36(2 PART 2):3336–3341, 2009.
[51] Ulrike von Luxburg. A tutorial on spectral clustering. Statistics and Computing, 17(4):395–416, dec 2007.
[52] F. Murtagh. A Survey of Recent Advances in Hierarchical Clustering Algorithms. The Computer Journal, 26(4):354–359, nov 1983.
[53] Martin Ester, Hans-Peter Kriegel, Sander Jorg, and Xiaowei Xu. A Density-Based Clustering Algorithms for Discovering Clusters. In pages 226–231, Portland, Oregon, 1996.
[54] Joe H. Ward. Hierarchical Grouping to Optimize an Objective Function. Journal of the American Statistical Association, 58(301):236–244, mar 1963.
[55] V. Parker Lessig. Comparing Cluster Analyses with Cophenetic Correlation. Journal of Marketing Research, 9(1):82, feb 1972.
[56] Shikha Agrawal and Jitendra Agrawal. Survey on Anomaly Detection using Data Mining Techniques. Procedia Computer Science, 60(1):708–713, 2015.
[57] Rasool Sharifi and Mohammad Ebrahimi. Detection of stator winding faults in induction motors using three-phase current monitoring. ISA Transactions, 50(1):14–20, 2011.
[58] Yves Mollet, Matteo Pergolesi, Mathieu Sarrazin, Karl Janssens, Herman Van der Auweraer, Paolo Chiariotti, Paolo Castellini, and Johan Gyselinck. Multi-Physical Signature Analysis of Induction Machines under Unbalanced Supply Voltage. In 2018 XIII International Conference on Electrical Machines (ICEM), volume 32, pages 2378–2384, Alexandroupoli, Greece, sep 2018. IEEE.
[59] Sang-Joon Lee, Jun-Koo Kang, and Seung-Ki Sul. A new phase detecting method for power conversion systems considering distorted conditions in power system. In Conference Record of the 1999 IEEE Industry Applications Conference. ThirtyForth IAS Annual Meeting (Cat. No.99CH36370), volume 4, pages 2167–2172, Phoenix, AZ, USA, 1999. IEEE.
[60] Jerome H. Friedman. On bias, variance, 0/1-loss, and the curse-of-dimensionality. Data Mining and Knowledge Discovery, 1(1):55–77, 1997.
[61] G.S Maruthi and K. Panduranga Vittal. Electrical Fault Detection in Three Phase Squirrel Cage Induction Motor by Vibration Analysis using MEMS Accelerometer. In 2005 International Conference on Power Electronics and Drives Systems, volume 2, pages 838–843. IEEE, 2006.
[62] P.J. Tavner. Review of condition monitoring of rotating electrical machines. IET Electric Power Applications, 2(4):215, 2008.