# Deep Fault Diagnosis for Rotating Machinery with Scarce Labeled Samples

**Authors:** Jing Zhang, Jing Tian, Tao Wen, Xiaohui Yang, Yong Rao and, Xiaobin Xu

arXiv: 1907.09411 · 2019-07-23

## TL;DR

This paper introduces a deep fault diagnosis method for rotating machinery that effectively leverages scarce labeled data by combining shallow models, feature selection, and deep learning, improving accuracy and efficiency.

## Contribution

The novel DFD approach integrates transfer learning from shallow models with deep CNN training, specifically designed for scenarios with limited labeled samples.

## Key findings

- DFD outperforms standalone SVM and CNN models on fault diagnosis datasets.
- The method is computationally efficient and suitable for real-time applications.
- Experimental results validate the effectiveness of combining shallow model knowledge with deep learning.

## Abstract

Early and accurately detecting faults in rotating machinery is crucial for operation safety of the modern manufacturing system. In this paper, we proposed a novel Deep fault diagnosis (DFD) method for rotating machinery with scarce labeled samples. DFD tackles the challenging problem by transferring knowledge from shallow models, which is based on the idea that shallow models trained with different hand-crafted features can reveal the latent prior knowledge and diagnostic expertise and have good generalization ability even with scarce labeled samples. DFD can be divided into three phases. First, a spectrogram of the raw vibration signal is calculated by applying a Short-time Fourier transform (STFT). From those spectrograms, discriminative time-frequency domain features can be extracted and used to form a feature pool. Then, several candidate Support vector machine (SVM) models are trained with different combinations of features in the feature pool with scarce labeled samples. By evaluating the pretrained SVM models on the validation set, the most discriminative features and best-performed SVM models can be selected, which are used to make predictions on the unlabeled samples. The predicted labels reserve the expert knowledge originally carried by the SVM model. They are combined together with the scarce fine labeled samples to form an Augmented training set (ATS). Finally, a novel 2D deep Convolutional neural network (CNN) model is trained on the ATS to learn more discriminative features and a better classifier. Experimental results on two fault diagnosis datasets demonstrate the effectiveness of the proposed DFD, which achieves better performance than SVM models and the vanilla deep CNN model trained on scarce labeled samples. Moreover, it is computationally efficient and is promising for real-time rotating machinery fault diagnosis.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.09411/full.md

---
Source: https://tomesphere.com/paper/1907.09411