An intelligent fault diagnosis method of rotating machinery based on deep neural networks and time-frequency analysis

Xin, Yu; Li, Shunming; Cheng, Chun; Wang, Jinrui

doi:10.21595/jve.2018.19040

Journal of Vibroengineering

Browse Journal

Submit article

Published: 30 September 2018

Check for updates

An intelligent fault diagnosis method of rotating machinery based on deep neural networks and time-frequency analysis

Yu Xin¹

Shunming Li²

Chun Cheng³

Jinrui Wang⁴

^{1, 2, 3, 4}College of Energy and Power Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, 210016, China

Corresponding Author:

Yu Xin

Cite the article Download PDF

Downloads 2300

WoS Core Citations 16

CrossRef Citations 16

Abstract

As the crucial part of the health management and condition monitoring of mechanical equipment, the fault diagnosis and pattern recognition using vibration signal are essential researching contents. The time-frequency representation method cannot identify the fault patterns from time-frequency representation effectively because of the complex work conditions of rotating machinery parts and the interference of strong background noise. Considering these disadvantages, a new reliable and effective method based on the time-frequency representation and deep convolutional neural networks is presented. In this method, the time-frequency features are calculated by the short time Fourier transform (STFT), and the pseudo-color map as the new identification objects. A novel feature learning method based on the sparse autoencode with linear decode is used to extract these time-frequency features, which is an unsupervised feature learning method with the goal of minimizing the loss function. The convoluting and pooling are applied to establish the hierarchical deep convolutional neural networks and filter the useful features layer by layer from the output of sparse autoencode. And a softmax classifier is used to obtain the faults classification. The experimental datasets from roller bearing and gearbox have been taken to verify the reliability and effectiveness of the proposed method for fault diagnosis and pattern recognition. The results show that the proposed method have excellent performance of the recognized objects.

1. Introduction

Fault diagnosis and pattern identification are crucial to the health monitoring of rotating machinery, especially for the high speed machinery and its components, such as bearing, gear and rotor in the aircraft engine. Vibration signals of rotating components always exhibit the non-linear and non-stationary characteristics due to the degradation and deterioration of working conditions [1]. Therefore, the efficient fault diagnosis method plays a significant role in the health management and condition monitoring of mechanical equipment.

Basically, the vibration signal can be collected in-time and carries a large number of useful information which can accurately reflect the working status of mechanical equipment. As result, the collected vibration signal is wildly used in the fault diagnosis and condition detecting. Traditionally, the statistical features of the time and frequency domain are chosen as the identification object with low efficiency. Besides, it does not work for amounts of vibration signal measured. To extract the most efficient information from the signal, a various of methods were proposed, such as Fourier analysis [2], wavelet transform [3, 4], EMD [5], Hilbert spectrum analysis [6, 7] and so on. The SVM [8], PCA {Hu, 2014 #6177}[9], Markov model [10] and neural networks [11] were introduced for the fault classification and pattern recognition. All of these methods are used to extract the sensitive parameters of fault features and to identify the fault pattern from the time series of signals. However, they can only obtain excellent results in time or frequency domain for certain situations. Besides, they are not capable of analyzing amounts of data containing complex signals and are limited by the dimension of models. Hence, these methods are not suitable for the complex signals. And now, to monitor the working conditions of the key components in rotating machinery, the useful fault information must be identified from a large number of vibration signals which are detected by different sensors. Therefore, how to process the collected lots of data and extract the fault characteristics immediately and efficiently, it is a huge problem. The traditional methods do not work for this big data environment.

However, considering the limitation of time and frequency domain methods, the time-frequency representation methods were presented to extract the sensitive fault features efficiently. Still now, the short time Fourier transform (STFT) [12], wavelet transform [13], synchrosqueezing transform [14], Wigner-Ville distribution [15], Cohen methods [16] and other derived methods based on the traditional methods was proposed. Feng reviewed these methods in detailed [17]. As the basic analysis method, the STFT is still the commonly practical method. The time-frequency representation exhibits the fault information of mechanical working status. But the corresponding fault features are always recognized from the time-frequency representation which are already identified as some fault types. On the contrary, the particular fault pattern of time-frequency representation of the detected signals is always unknown. In addition, the time-frequency characteristics cannot be classified and distinguished from amount of time-frequency images one-by-one. Therefore, the efficient and reliable method should be studied to complete this task.

To identify the fault features from lots of time-frequency images, the prevalent deep learning method exhibit extra-ordinary serviceability [18]. It can obtain the excellent results for big data analysis and vibration signal processing. Based on the sparse filter, an un-supervising feature learning intelligence method was proposed to learn features from the amount raw signals [19]. An intelligent deep neural networks diagnosis method was proposed to mine the useful information from raw data, which overcome the limitation of prior knowledge and non-linear issues [20]. As a supervising learning method, convolution neural networks which proposed by LeCun [21] was used to diagnose the fault feature of gearbox [22] and analyzed the vibration spectrogram [23]. The time-frequency representation of vibration signals was directly put into the convolutional neural networks to learn and distinguish the different fault features of the rotating machinery [24, 25]. Considering the demand of machinery expertise and prior knowledge, the hierarchical learning adaptive convolutional neural networks were constructed to diagnose the bearing faults and its severity [26]. Anyway, many intelligent methods were used to diagnose the fault feature. But lots of them do not really consider the specific characteristics of mechanical vibration signal, and analyze the influence of model parameters on the diagnostic results.

Hence, coupling the advantage of the short time Fourier transform and deep learning model, in this paper, a new optimal deep convolutional neural networks model with sparse characteristics is constructed to distinguish the fault features from the time-frequency representation. The collected signals are divided into several segments and the time-frequency images of each segments are obtained through the STFT time-frequency representation methods. It is very impractical to recognize the large number of time frequency images by manual method. As the input of constructed deep convolutional neural networks, these images are preprocessed by the sparse autoencode algorithm with linear decode to improve the sparsity, which is an unsupervised feature learning method with the goal of minimizing the loss function to extract the time-frequency features. The convoluting and pooling is applied to establish the hierarchical deep convolutional neural networks and extract the useful features layer by layer from the output of sparse autoencode. A softmax classifier is used to distinguish the time-frequency images to obtain the different kinds of fault feature. The vibration signal dataset of bearing and gear are taken to verify the performance of proposed method. The bearing datasets contain of different fault locations and diameters under various working loads. And the gearbox datasets include the different fault types under the operating conditions.

The rest of this paper is organized as follows. Section 2 briefly introduce the theoretical background of STFT method and convolutional neural networks. The proposed intelligent fault feature identification method is described detailly in Section 3. In Section 4, the effectiveness of the presented method is verified by the rolling bearing datasets and planetary gearbox dataset. The conclusions are summarized in Section 5.

2. Theoretical background

2.1. Time-frequency representation method

As the basic time frequency analysis method, the STFT method just add the time variable to the traditional Fourier transform. To investigate the time-varying signal efficiently with the moving of short window, it assume that the every segment signal is stationary. Through the Fourier transform, the local Fourier spectrum character of each segment around the time center of the short time window can be acquired, and according to the local spectrum, the time variation features of signals should be revealed effectively [27].

Given an arbitrary signal $s (t)$ , the window function is $ω (τ - t)$ and centered at time $t$ , (where $τ$ is the time variable). Then the observed signal through this window is $x (τ) ω (τ - t)$ . Moving the window and applying the Fourier transform to each segment leads to the short time Fourier transform:

1

$S T F T_{x} (t, f) = \int_{- \infty}^{\infty} x (τ) ω (τ - t) e^{- j 2 π f τ} d τ .$

2.2. Convolutional neural networks theory

The convolutional neural networks generally contain input layer, convoluting layer, pooling layer and output layer, as illustrated in Fig. 1. As an end-to-end learning method, it can autonomously learn the representations of the data by their layer structure [28]. According to the convoluting theory, the feature maps of previous layer convolve with learnable kernels and put through the non-linear activation function, such as sigmoid, tanh, ReLU functions [29], to form the output feature mapping. Each output mapping integrates the convolutions with multiple input maps.

Fig. 1The schematic of convolutional neural networks

As a hierarchical architecture, inputting the arbitrary signal $x$ , each subsequent layer $x_{j}$ is derived as:

2

$x_{j} = ρ W_{j} x_{j - 1},$

where, the $W$ and $ρ$ are the linear operator and non-linear activation function, respectively. Typically, the $W_{j}$ is the convolution and $ρ$ is sigmoid $1 / 1 + e x p (- x)$ or rectifier $m a x (x, 0)$ in convolutional neural networks. The operator $W_{j}$ is a stack of convolutional filters maps and each layer can be written as a sum of convolutions of the previous layer [30]:

3

$x_{j} (u, k_{j}) = ρ (\sum_{k} (x_{j - 1} (\cdot, k) * W_{j, k_{j}} (\cdot, k)) (u)),$

where, the * is the convolution operator:

4

$(f * g) (x) = \sum_{u = - \infty}^{\infty} f (u) g (x - u) .$

In convolutional neural networks, the optimization problem is highly non-convex. Typically, the weights $W_{j}$ are computed by stochastic gradient descent, by the backpropagation algorithm to compute gradients.

After the convolution layer, a pooling layer is followed and used to obtain the nonlinear down-sampling features [31]. This operation divides the input data into non-overlapping regions, and make the same operation ‘pooling’ for each region. The max-pooling function make the features to be a form of translation invariance and improve the computational efficiency of networks.

3. Methodology

3.1. Time-frequency analysis

The time and frequency domain analysis cannot efficiently and completely represent the information of fault vibration signals. Therefore, the joint time-frequency representation method is used to display the fault information of the components. The collected vibration signals must be divided into several segments with 1024 points in every segment. Each segment is analyzed by the STFT method with Eq. (1). Then, the time-frequency representation of the segmented signals are obtain. Here, the pseudo-color map is used to visually display the time-frequency characteristics. Those images as the input of the proposed deep convolutional neural networks to training the models and recognizing the fault features.

3.2. Constructing the deep convolution neural networks (DCNN)

As one type of unsupervised neural networks, the sparse autoencode include the encoder and decoder, the former one transforms the input data from high-dimensional space into codes in a low dimensional space and the later one reconstructs the input from the corresponding codes. The autoencode with a hidden layer and linear output layer, which is the linear decode algorithm, is forced to learn a sparsity representation and reconstruct the original input.

The principle of CNN is very similar to the human’s visual processing and own the powerful performance in complex image identification. In the convolutional neural networks, the convoluting is the special filter method of feature extracting. The great innovation of convoluting and pooling layer is not full connect, so that the networks can extract the features, rather than fitting the input data.

As the core of convolutional neural networks, the different deep learning model is constructed by the group of convoluting and pooling. The constructed DCNN model in this paper include one sparse aturoencode with linear decode layer, two convoluting layers, two pooling layers and a softmax layer. The schematic of DCNN is shown in Fig. 2. Because of the limitation of computational resource and efficient, the different time-frequency images must be shrunk the size by the nearest neighbor interpolation algorithm, and the pictures are normalized and centralized. Although many features would be eliminated by this method, the key features must be reserved and hardly effect the identification results. As the input data of DCNN model, all of these time-frequency images are 2-D feature and the size is 64×64. The advantage of this preprocess can survive the key features and release the computational performance.

Fig. 2The schematic of deep neural networks

Here, the neural networks architecture is detailly described as follows:

(1) In order to reducing the raw redundant of the input data, the original inputs are preprocessed by PCA whiten method, and the mean of inputs are zero. Besides, the sparse autoencode with linear decode algorithm is used to improve the sparsity of the input data as the first layer.

(2) Then, two convolutional layers with 8 feature maps are followed continuously. The kernels are selected and each kernel in feature maps is connected to a $p \times p$ neighborhood of input (here, the p is the patch dimension). The same kernel and connecting weights will be shared by the all neurons in one feature map.

In this process, the number of hidden neuron is initially estimated by the empirical formula Eq. (5):

5

$H_{n u m} = \frac{n \times l + 0.5 \times (n^{2} + n) \times l - 1}{n + l},$

where, $n$ is the dimension of features, $l$ is the number of fault classification.

(3) Following the each convoluting layer, the pooling layer with 4 feature maps are used with max-pooling operation. This operation can introduce the local translation invariance to the model and reduce the size of input data to the 1/4 compared with the previous convolution layer.

Fig. 3The flow chart of proposed method

(4) As a generalization of logistic function, the softmax can squashes a $K$ -dimensional input vector of arbitrary real value to a $K$ -dimensional output vector of real value in the range of (0, 1). It can efficiently solve multi-class problems. Therefore, a softmax layer with full connection is following the last pooling layer, and the output of the last pooling layer as the input of the softmax networks to obtain the identification and classification of the fault types from the time-frequency images efficiently.

Based on the time-frequency representation and the constructed DCNN model, the flow chart of the fault feature extracting and classifying method is shown in Fig. 3. Although the STFT have limited time-frequency concentration and poor self-adaptability and affect the time-frequency representation, it does not affect the proposed DCNN model to identify the fault features obviously. Based on the DCNN model, the proposed method pretreats the input data by pre-whitening to eliminate the influence of relation and redundancy of input data, and the proposed model are constructed by the sparse decode and two convoluting layer and two pooling layer to strengthen the identifiability of fault information.

4. Experimental setup

As the key components of rotating machinery, the failure level and performance of rolling bearing and gear will greatly affect the reliability, service life and economic benefit of the machinery. Different kinds of faults will occur on the components under different working conditions. In this section, the proposed method is used to distinguish the fault features with the real testing vibration signals of fault bearing and gear. Simultaneously, some other existing methods such as CNN and LSSVM, which are already wildly used in fault diagnosis, are also used to compare their performance.

4.1. Case 1: Fault diagnosis of the rolling bearing

4.1.1. Data description

Here, the bearing fault data from the case west server university (CWSU) is used [32]. These data were collected from a bearing testing bed shown in Fig. 4. The single point faults of bearing components were produced to the testing bearing by electro-discharge with diameters 0.1778 mm, 0.3556 mm, 0.5334 mm, respectively. The loads of experiments were 0 hp, 1 hp, 2 hp and 3 hp, respectively. The vibration signals were collected by the accelerometers which were attached to the housing with 12000 samples per second. Here, the Normal condition (N), roller fault (RF), inner race fault (IF), outer race fault (OF) were selected under different fault diameters and loads. In this study, there are 200 signal samples under each fault conditions and each sample contain 4096 data points. The detailed of this strategy is shown in Table 1.

Fig. 4The schematic bearing testing bed

According to the proposed method, The STFT is performed to every segmented signal in all samples and the total of 800 different time-frequency images are obtained. There are divided into training sets and testing sets with the labels. The datasets as the input of the proposed DCNN model to perform the fault identification. The segmental vibration signals and the time-frequency pseudo-color maps of roller bearing are shown in Fig. 5. Here, the load is 3 hp and the diameters of faults are 0.1778 mm, 0.3556 mm, 0.5334 mm, respectively. In each picture, the upper waveform is the collected vibration signal and the left one is the power spectrum of the signal. The time-frequency representation through the STFT method is in the middle and the frequency is normalized. The sampling frequency is 1024, and the visualization threshold of time-frequency energy distribution on the images is 2 %, which can excellently display the detailed energy distribution of time-frequency representation.

Table 1The detailed datasets of bearing faults

Load (hp)	Samples number	Defect Location	Defect diameter (mm)	Classification label
1-2-3	200-200-200	N	0	1
	200-200-200	RD	0.178	2
	200-200-200	RD	0.356	3
	200-200-200	RD	0.533	4
	200-200-200	ID	0.178	5
	200-200-200	ID	0.356	6
	200-200-200	ID	0.533	7
	200-200-200	OD	0.178	8
	200-200-200	OD	0.356	9
	200-200-200	OD	0.533	10

Fig. 5The vibration signal and the time-frequency pseudo-color map of roller bearing

a) Normal bearing

b) Ball fault in 0.1778 mm

c) Ball fault in 0.3556 mm

d) Ball fault in 0.5334 mm

e) Inner fault in 0.1778 mm

f) Inner fault in 0.3556 mm

g) Inner fault in inch 0.5334 mm

h) Outer fault in 0.1778 mm

i) Outer fault in 0.3556 mm

j) Outer fault in 0.5334 mm

4.1.2. Diagnosis results

The initialized parameters of the bearing diagnostic model are listed in Table 2. In order to study the influence of the different key parameters on the proposed model, the influence of the key parameters on the classification accuracy of bearing faults are investigated. The number of hidden neurons, pooling dimensions, sparse parameter (desired average activation of the hidden units), batch size, the number of maximum iteration, different ratio of training samples and testing samples are selected to study. The number of hidden neuron has been determined by the Eq. (5). In this case, the $n$ is 64 and the $l$ is 10, and the hidden neuron is 290. The patch dimension and pooling dimension related to the input dimension, the former was the square root of 64, and the later one is the half of former. The number of maximum iteration and batch size are determined by the experience. And the other parameters, whiten parameter, weight decay, sparsity parameters, sparsity penalty, are determined as the constant according to the Ref. [19] and Ref. [20]. Here, the computer configuration is Inter i5-2430 and 16GB RAM in window 7 and Matlab 2016b.

Table 2The initialized parameters of bearing diagnosis model

Name	Value	Name	Value	Name	Value
Patch dimension	8	Pooling dimension	4	Whiten parameter	0.1
Hidden size	290	Weight decay	0.0003	Maximum iteration	200
Sparsity parameters	0.035	Sparsity penalty	5	Batch size	20

The computational results are shown in Fig. 6. Those figures show that the impact of selected key parameters on the computing accuracy and the exhausting time. From the Fig. 6(a), with the increasing of hidden neuron, the accuracy and time are both enlarged. When the number is greater than 220, the exhausting time is drastically increasing, but the accuracy is not significantly changing. It realizes that the optimal number of hidden neuron is 200 and the too large or too small number of hidden nodes will acquire the lower accuracy or too longer computational time. In the Fig. 6(b), it shows that the smaller pooling dimension can obtain the higher accuracy and less average time. Similarly, from the Fig. 6(c), (d), it can find that the minor sparsity parameter and the maximum iteration of sparse autoencoder have a lower accuracy and computing time. When the value is 0.02 and 60, respectively, it can achieve the satisfactory accuracy with suitable computing time. The bigger value will not improve the accuracy, instead of consuming more computational time. The Fig. 6(e), (f) show that the different batch and sample ratio have not significant effect to the classification accuracy. Oppositely, the bigger value consumes the more computational source. So, the appropriate values are 20 and 0.5.

Fig. 6The influence tendency of different parameters

a) Diagnosis result by various the number of hidden neuron

b) Diagnosis result by various pooling dimension

c) Diagnosis result by various sparse parameter

d) Diagnosis result by various maximum iteration number

e) Diagnosis result by various batch size

f) Diagnosis result by various sample ratio

According to the analyzing, the optimal parameters are selected to perform the DCNN effectively, which are pooling dimension, sparse parameter, the number of hidden neuron, the number of iteration, batch size and sample ratio. When the DCNN model is training the bearing fault datasets, the classification accuracy of the training set and testing set are 100 % and 99.75 %, respectively, and the error is very small. The diagnosis results and the error are listed in Table 3 and the comparing results are shown in Fig. 7. From the Table 3 and Fig. 7, it shows that the proposed DCNN method have the smaller error and the more excellent performance. In our proposed model, the input data are pre-whitened and eliminated the correlation and redundancy. The sparse linear decode is used as the first layer of deep neural networks to decouple automatically and improve the sparsity of time-frequency images, these improved the identifiability of convoluting and pooling operating to extract the deep features. When the deep features of time-frequency images are obtained, the multi-class features are classified by the softmax layer accurately. And all of the parameters in the proposed DCNN model are optimized for fault diagnosis problem. So, the result of proposed method must have more excellent performance than CNN and LSSVM model.

Table 3The comparing of proposed method with other methods

	Proposed method	CNN	LSSVM
Training set accuracy	100 %	94.75 %	86.25 %
Testing set accuracy	99.75 %	89.50 %	80.50 %
Mean time	12276.3491	15310.3437	22013.821
Training set error	0	0.0328	0.124
Testing set error	0.0025	0.0471	0.1128

Fig. 7The histogram of comparative results for bearing

4.2. Case 2: Fault diagnosis of gears

4.2.1. Experiments and data description

The proposed method is used to diagnose the gear fault in this section. Six types of gear faults are investigated in this experimental case, which are normal condition, a single broken tooth of wheel, a single pit of wheel, a single worn of pinion, coupled fault of wheel pit and pinion worn, coupled fault of wheel broken and pinion worn. The vibration signals were collected on a specially designed bench which is driven by a motor. The detailed experiment illustration is shown in Fig. 8. The nominal power and speed are 0.75 KW and 1500 rpm, respectively. The pinion and wheel gear are located in the gearbox and their parameters are listed in Table 4.

Fig. 8The testing bed of gearbox

Table 4The parameters of gears

Gear	Teeth	Module (mm)	Pressure angle (deg)	Materials
Pinion	55	2	20	S45C
Wheel	75	2	20	S45C

The time-frequency images of six types from the gear fault are displayed in Fig. 9, which load and rotational speed are 1 hp and 880 rpm, respectively.

Fig. 9The vibration signal and the time-frequency pseudo-color map of gear faults

a) Normal gear

b) Wear fault

c) Pit fault

d) Break fault

e) Pit and Wear compound fault

f) Break and Wear compound fault

4.2.2. Diagnosis results

In this case, the $l$ is 6 and the hidden neuron is 180. The determination method of other parameters are similar to the case 1. The initialized parameters of the DCNN model of gear fault diagnosis are listed in Table 5.

Table 5The initialized parameters of gear diagnosis model

Name	Value	Name	Value	Name	Value
Patch dimension	8	Pooling dimension	4	Whiten parameter	0.1
Hidden size	180	Weight decay	0.0003	Maximum iteration	200
Sparsity parameters	0.005	Sparsity penalty	5	Batch size	20

Fig. 10The influence of different key parameters

a) Diagnosis result by various the number of hidden neuron

b) Diagnosis result by various pooling dimension

c) Diagnosis result by various sparse parameter

d) Diagnosis result by various maximum iteration number

e) Diagnosis result by various batch size

f) Diagnosis result by various sample ratio

Here, the impacting of several parameters on the recognizing accuracy and computing time is studied in the gear fault diagnosis model. Similar to the bearing analysis, the number of hidden neuron, pooling dimension, sparse parameter, maximum iteration, batch size and sample ratio are selected. The detailed analyzing results are shown in Fig. 10. It shows that the number of hidden neurons and the ratio of samples have great influence on the diagnostic model. But the other parameters have stable effect on the model. When the different influence tendency of the parameters are determined, the optimal parameters are selected to diagnose the gear faults and the results are shown in Table 6 and Fig. 11. From the comparing results, it can significantly find that the proposed method has more excellent performance than other methods, not only for the computation accuracy, also for the efficiency. For the same reason in the case 1, it shows that the proposed DCNN model has more excellent performance than CNN and LSSVM model in the case 2.

Table 6The comparing of gear classification result

	Proposed DCNN	CNN	LSSVM
Training set accuracy	97 %	92.56 %	87 %
Testing set accuracy	96.78 %	89.72 %	78.33 %
Average time	11825.25	9032.3486	12730.4878
Training set error	0.0274	0.087	0.1334
Testing set error	0.0752	0.1035	0.2089

Fig. 11The histogram of gear fault classification result

5. Conclusions

In this work, a new intelligent fault diagnosis method of rotating machinery based on the time-frequency analysis and DCNN is proposed. The applicability and efficiency of the proposed method are verified by the collected vibration datasets of rolling bearing and gearbox with different fault characteristics. The conclusion of this work is summarized as follows:

1) Based on the convolutional neural networks, a new deep neural network called DCNN model is efficiently constructed with sparse autoencode, convoluting, pooling and softmax. This method integrates the sparse autoencode with linear decode method which can improve the sparsity of the input data and benefit to extract the fault features. The number of neurons and pooling dimension have great influence on the proposed DCNN model.

2) Because of the time-frequency representation contains more information of the fault feature than the time waveform or frequency spectrum, it is used as the new recognized object of proposed DCNN model. This method can extract the fault features without too many other transformations conveniently and precisely. The result show that the presented method can identify the different fault features from the time-frequency images and classify the fault conditions of the rotating mechanical effectively.

3) In the proposed method, the new application is advised, it not only works for the fault diagnosis of rotating mechanical whose measured signals are periodic, can also use to the other non-periodic vibration signals. In practice, the priori knowledge and fault data were hard to obtain, but many fault information still could collect to train the proposed DCNN model to verify the real fault features. In addition, the normal and fault condition of rotating components must be distinguished by the proposed model. After it was trained, the proposed model would be used to detect the fault types.

References

Jiang X., Li S., Wang Y. Study on nature of crossover phenomena with application to gearbox fault diagnosis. Mechanical Systems and Signal Processing, Vol. 83, 2017, p. 272-295.

Publisher
Hlawatsch F., Boudreaux Bartels F. Linear and quadratic time frequency signal representations. IEEE Signal Processing Magazine, Vol. 9, Issue 2, 1992, p. 21-67.

Publisher
Chen J., Li Z., Pan J. Wavelet transform based on inner product in fault diagnosis of rotating machinery: a review. Mechanical Systems and Signal Processing, Vol. 70, Issue 71, 2016, p. 1-35.

Publisher
Yan R., Gao R. X., Chen X. Wavelets for fault diagnosis of rotary machines: a review with applications. Signal Processing, Vol. 96, 2014, p. 1-15.

Publisher
Lei Y., Lin J., He Z. A review on empirical mode decomposition in fault diagnosis of rotating machinery. Mechanical Systems and Signal Processing, Vol. 35, Issues 1-2, 2013, p. 108-126.

Publisher
Huang N. E., Shen Z., Long S. R. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, Vol. 454, Issue 1971, 1998, p. 903-999.

Publisher
Peng Z., Tse P., Chu F. An improved Hilbert-Huang transform and its application in vibration signal analysis. Journal of Sound and Vibration, Vol. 286, Issues 1-2, 2005, p. 187-205.

Publisher
Yin Z., Hou J. Recent advances on SVM based fault diagnosis and process monitoring in complicated industrial processes. Neurocomputing, Vol. 174, 2016, p. 643-650.

Publisher
Hu Z., Chen Z., Gui W. Adaptive PCA based fault diagnosis scheme in imperial smelting process. ISA Transactions, Vol. 53, Issue 5, 2014, p. 1446-1455.

Publisher
Geramifard O., Xu J., Panda Kumar S. Fault detection and diagnosis in synchronous motors using hidden Markov model-based semi-nonparametric approach. Engineering Applications of Artificial Intelligence, Vol. 26, Issue 8, 2013, p. 1919-1929.

Publisher
Azadeh A., Saberi M., Kazem A. A flexible algorithm for fault diagnosis in a centrifugal pump with corrupted data and noise based on ANN and support vector machine with hyper-parameters optimization. Applied Soft Computing Journal, Vol. 13, Issue 3, 2013, p. 1478-1485.

Publisher
Xie H., Lin J., Lei Y. Fast-varying AM-FM components extraction based on an adaptive STFT. Digital Signal Processing: A Review Journal, Vol. 22, Issue 4, 2012, p. 664-670.

Publisher
Bayram I. An analytic wavelet transform with a flexible time-frequency covering. IEEE Transactions on Signal Processing, Vol. 61, Issue 5, 2013, p. 1131-1142.

Publisher
Camarena Martinez D., Perez Ramirez C.-A., Valtierra Rodriguez M. Synchrosqueezing transform-based methodology for broken rotor bars detection in induction motors. Measurement, Vol. 90, 2016, p. 519-525.

Publisher
Staszewski W. J., Worden K., Tomlinson G. R. Time-frequency analysis gearbox fault detection using the Wigner-Ville distribution and pattern recognition. Mechanical Systems and Signal Processing, Vol. 11, Issue 5, 1997, p. 673-692.

Publisher
Loughlin P., Bernard G. Cohen Posch (Positive) time-frequency distributions and their application to machine vibration analysis. Mechanical Systems and Signal Processing, Vol. 11, Issue 4, 1997, p. 561-576.

Publisher
Feng Z., Liang M., Chu F. Recent advances in time-frequency analysis methods for machinery fault diagnosis: a review with application examples. Mechanical Systems and Signal Processing, Vol. 38, Issue 1, 2013, p. 165-205.

Publisher
Lecun Y., Bengio Y., Hinton G. Deep learning. Nature, Vol. 521, 2015, p. 436-444.

Publisher
Lei Y., Jia F., Lin J. An intelligent fault diagnosis method using unsupervised feature learning towards mechanical big data. IEEE Transactions on Industrial Electronics, Vol. 63, Issue 5, 2016, p. 3137-3147.

Publisher
Jia F., Lei Y., Lin J. Deep neural networks: a promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data. Mechanical Systems and Signal Processing, Vol. 72, Issue 73, 2016, p. 303-315.

Publisher
Lecun Y., Bottou L., Bengio Y. Gradient-based learning applied to document recognition. Proceedings of the IEEE, Vol. 86, Issue 11, 1998, p. 2278-2324.

Publisher
Chen Z., Li C., Sanchez R. Gearbox fault identification and classification with convolutional neural networks. Shock and Vibration, Vol. 2015, 2015, p. 1-10.

Publisher
Acquarelli J., Laarhoven T., Gerretzen J. Convolutional neural networks for vibrational spectroscopic data analysis. Analytica Chimica Acta, Vol. 954, 2017, p. 22-31.

Publisher
Zeng X., Liao Y., Li W. Gearbox fault classification using S-transform and convolutional neural network. 10th International Conference on Sensing Technology, 2016.

Publisher
Janssens O., Slavkovikj V., Vervisch B. Convolutional neural network based fault detection for rotating machinery. Journal of Sound and Vibration, Vol. 377, 2016, p. 331-345.

Publisher
Guo X., Chen L., Shen C. Hierarchical adaptive deep convolution neural network and its application to bearing fault diagnosis. Measurement, Vol. 93, 2016, p. 490-502.

Publisher
Smith S. Digital Signal Processing. Second Edition, California Technical Publishing, California, 1999.

Search CrossRef
Koushik J. Understanding convolutional neural networks. 29th Conference on Neural Information Processing Systems, Barcelona, 2016.

Search CrossRef
Jin X., Xu C., Feng J. Deep learning with s-shaped rectified linear activation units. Computer Vision and Pattern Recognition, 2016, p. 1737-1743.

Publisher
Sainath T., Kingsbury B., Saon G. Deep convolutional neural networks for large-scale speech tasks. Neural Networks, Vol. 64, 2015, p. 39-48.

Publisher
Sun M., Song Z., Jiang X. Learning pooling for convolutional neural network. Neurocomputing, Vol. 224, 2017, p. 96-104.

Publisher
Loparo K. A. Bearings vibration data set. http://www.eecs.cwru.edu/laboratory/bearing/download.htm.

Search CrossRef

Cited by

Data-driven machinery fault diagnosis: A comprehensive review

Dhiraj Neupane | Mohamed Reda Bouadjenek | Richard Dazeley | Sunil Aryal

(2025)

Review on deep learning classifiers for faults diagnosis of rotating industrial machinery

Ameer Ali Shaalan | Wafa Mefteh | Ali Mohsen Frihida

(2024)

An Intelligent Method Based on Time–Frequency Analysis and Deep Learning Semantic Segmentation for Investigating the Electromagnetic Pulse Features of Engine Digital Controllers

Minxiang Wei | Kai Chen | Shunming Li | Jie Cao | Amjad Ali

(2023)

Detection of tyre defects using weighted quality-based convolutional neural network

(2022)

Application of Local Histogram Clipping Equalization Image Enhancement in Bearing Fault Diagnosis

Zhiqiang Zeng | Rui Zhang | Wenan Cai | Yanfeng Li

(2022)

Bearing Fault Identification Method under Small Samples and Multiple Working Conditions

Yuhui Wu | Licai Liu | Shuqu Qian | Jianyong Tian | Praveen Kumar Reddy Maddikunta

(2022)

Multi-Stream Convolutional Neural Networks for Rotating Machinery Fault Diagnosis under Noise and Trend Items

Han Dong | Jiping Lu | Yafeng Han

(2022)

An Adaptive Fusion Convolutional Denoising Network and Its Application to the Fault Diagnosis of Shore Bridge Lift Gearbox

Rongqiang Zhao | Xiong Hu

(2022)

An Integrated Condition Monitoring Method for Rotating Machinery Based on Optimum Healthy State

Shiwei Yan | Haining Liu | Fajia Li | Fuhang Huang | Huanyong Cui

(2022)

A review on deep learning based condition monitoring and fault diagnosis of rotating machinery

Purushottam Gangsar | Aditya Raj Bajpei | Rajkumar Porwal

(2022)

Speech fault recognition method of music intelligent player based on communication feature analysis

Dongmei Li

(2021)

2021 Global Reliability and Prognostics and Health Management (PHM-Nanjing)

(2021)

Intelligent Fault Diagnosis of Rotary Machinery by Convolutional Neural Network with Automatic Hyper-Parameters Tuning Using Bayesian Optimization

Davor Kolar | Dragutin Lisjak | Michał Pająk | Mihael Gudlin

(2021)

A comprehensive review on convolutional neural network in machine fault diagnosis

Jinyang Jiao | Ming Zhao | Jing Lin | Kaixuan Liang

(2020)

Applications of machine learning to machine fault diagnosis: A review and roadmap

(2020)

Data-Driven Fault Diagnosis Method Based on Second-Order Time-Reassigned Multisynchrosqueezing Transform and Evenly Mini-Batch Training

Guodong Sun | Yuan Gao | Yun Xu | Wei Feng

(2020)

About this article

Received

04 September 2017

Accepted

12 January 2018

Published

30 September 2018

SUBJECTS

Fault diagnosis based on vibration signal analysis

DOI

https://doi.org/10.21595/jve.2018.19040

Keywords

time-frequency representation

STFT

deep convolution neural networks

sparse decode

fault classification

Acknowledgements

The research was supported by National Natural Science Foundation of China (51675262) and also supported by the Advance research field fund project of China (6140210020102) and the Project of National Key Research and Development Plan of China “New Energy-Saving Environmental Protection Agricultural Engine Development” (2016YFD0700800).

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Previous article in issue Previous Next article in issue Next

Research article

2024 02 18

Convolutional neural network intelligent fault diagnosis method for rotating machinery based on discriminant correlation analysis multi-domain feature fusion strategy

Guisheng Lan, Haibo Shi

Review article

2023 11 04

A comprehensive review of mechanical fault diagnosis methods based on convolutional neural network

Junjian Hou, Xikang Lu, Yudong Zhong, Wenbin He, Dengfeng Zhao, Fang Zhou

Research article

2023 08 01

A fault diagnosis method based on dilated convolution and attention for rolling bearing under multiple working conditions and noisy environments

Hui Zhang, Shengdong Liu, Ziwei Lv, Zhenlong Sang, Fangning Li

Research article

2022 06 30

A convolutional neural network method based on Adam optimizer with power-exponential learning rate for bearing fault diagnosis

Youming Wang, Zhao Xiao, Gongqing Cao

Y. Xin, S. Li, C. Cheng, and J. Wang, “An intelligent fault diagnosis method of rotating machinery based on deep neural networks and time-frequency analysis,” Journal of Vibroengineering, Vol. 20, No. 6, pp. 2321–2335, Sep. 2018, https://doi.org/10.21595/jve.2018.19040

Copy Extrica

Copied to clipboard!

TY  - JOUR
DO  - 10.21595/jve.2018.19040
UR  - https://doi.org/10.21595/jve.2018.19040
TI  - An intelligent fault diagnosis method of rotating machinery based on deep neural networks and time-frequency analysis
T2  - Journal of Vibroengineering
AU  - Xin, Yu
AU  - Li, Shunming
AU  - Cheng, Chun
AU  - Wang, Jinrui
PY  - 2018
DA  - 2018/09/30
PB  - JVE International Ltd.
SP  - 2321-2335
IS  - 6
VL  - 20
SN  - 1392-8716
SN  - 2538-8460
ER  - 

Copy Ris

Copied to clipboard!

@article{Xin_2018,
	doi = {10.21595/jve.2018.19040},
	url = {https://doi.org/10.21595/jve.2018.19040},
	year = 2018,
	month = {sep},
	publisher = {{JVE} International Ltd.},
	volume = {20},
	number = {6},
	pages = {2321--2335},
	author = {Yu Xin and Shunming Li and Chun Cheng and Jinrui Wang},
	title = {An intelligent fault diagnosis method of rotating machinery based on deep neural networks and time-frequency analysis},
	journal = {Journal of Vibroengineering}
}

Copy Bibtex

Copied to clipboard!

[1]Y. Xin, S. Li, C. Cheng, and J. Wang, “An intelligent fault diagnosis method of rotating machinery based on deep neural networks and time-frequency analysis,” Journal of Vibroengineering, vol. 20, no. 6, pp. 2321–2335, Sep. 2018, doi: 10.21595/jve.2018.19040.

Copy IEEE

Copied to clipboard!

Xin, Yu, Shunming Li, Chun Cheng, and Jinrui Wang. “An Intelligent Fault Diagnosis Method of Rotating Machinery Based on Deep Neural Networks and Time-Frequency Analysis.” Journal of Vibroengineering 20, no. 6 (September 30, 2018): 2321–35. https://doi.org/10.21595/jve.2018.19040.

Copy Chicago

Copied to clipboard!

An intelligent fault diagnosis method of rotating machinery based on deep neural networks and time-frequency analysis

Abstract

1. Introduction

2. Theoretical background

2.1. Time-frequency representation method

2.2. Convolutional neural networks theory

3. Methodology

3.1. Time-frequency analysis

3.2. Constructing the deep convolution neural networks (DCNN)

4. Experimental setup

4.1. Case 1: Fault diagnosis of the rolling bearing

4.1.1. Data description

4.1.2. Diagnosis results

4.2. Case 2: Fault diagnosis of gears

4.2.1. Experiments and data description

4.2.2. Diagnosis results

5. Conclusions

References

Cited by

About this article

Related Articles