Transfer Learning-Based Fault Diagnosis under Data Deficiency

Seong Hee Cho; Seokgoo Kim; Joo-Ho Choi

doi:10.3390/app10217768

Transfer Learning-Based Fault Diagnosis under Data Deficiency

Applied Sciences ◽

10.3390/app10217768 ◽

2020 ◽

Vol 10 (21) ◽

pp. 7768

Author(s):

Seong Hee Cho ◽

Seokgoo Kim ◽

Joo-Ho Choi

Keyword(s):

Fault Diagnosis ◽

Transfer Learning ◽

Study Data ◽

Linear Motion ◽

Target Domain ◽

Source Domain ◽

The Real ◽

The Neural Network ◽

Linear Motion Guide ◽

Real Target

In fault diagnosis studies, data deficiency, meaning that fault data for training are scarce, is often encountered and may greatly degrade diagnostic performance. To address this issue, a transfer learning (TL) approach is employed that exploits a neural network (NN) trained in another (source) domain, where enough fault data are available, to improve NN performance in the real (target) domain. While similar TL attempts in the literature have addressed imbalance, they concerned the sample imbalance between the source and target domains, whereas the present study considers the imbalance between normal and fault data. To illustrate this, normal and fault datasets are acquired from a linear motion guide, in which the data at high and low speeds represent real operation (target) and maintenance inspection (source), respectively. The effect of data deficiency is studied by reducing the number of fault data in the target domain and comparing the performance of TL, which exploits knowledge of the source domain, with that of the ordinary machine learning (ML) approach, which does not. By examining the accuracy of fault diagnosis as a function of the imbalance ratio, it is found that the lower bound and interquartile range (IQR) of the accuracy are greatly improved by the TL approach. It can therefore be concluded that TL is more effective than ordinary ML when there is a large imbalance between fault and normal data, such as an imbalance ratio smaller than 0.1.
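
The evaluation described in this abstract, the spread of diagnostic accuracy at a given imbalance ratio, can be sketched roughly as follows. This is a minimal illustration and not the authors' code: the function names, the definition of the imbalance ratio as fault samples divided by normal samples, and the use of the minimum over repeated runs as the "lower bound" are all assumptions, and the accuracy values are made-up placeholders.

```python
import numpy as np

def imbalance_ratio(n_fault, n_normal):
    """Assumed definition: number of fault samples per normal sample."""
    return n_fault / n_normal

def accuracy_spread(accuracies):
    """Summarize accuracies from repeated training runs at one imbalance ratio."""
    acc = np.asarray(accuracies, dtype=float)
    q1, q3 = np.percentile(acc, [25, 75])
    # "lower" is taken here as the minimum over runs (an assumption).
    return {"lower": float(acc.min()), "iqr": float(q3 - q1)}

# Placeholder accuracies (illustrative only, not results from the paper).
runs = [0.6, 0.7, 0.8, 0.9, 1.0]
summary = accuracy_spread(runs)
```

Computing this summary separately for the TL and ordinary ML runs at each imbalance ratio would give the kind of comparison the abstract reports: a higher lower bound and a narrower IQR indicate more reliable diagnosis.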

Deep Transfer Learning Method Based on 1D-CNN for Bearing Fault Diagnosis

Shock and Vibration ◽

10.1155/2021/6687331 ◽

2021 ◽

Vol 2021 ◽

pp. 1-16

Author(s):

Jun He ◽

Xiang Li ◽

Yong Chen ◽

Danfeng Chen ◽

Jing Guo ◽

...

Keyword(s):

Fault Diagnosis ◽

Transfer Learning ◽

Vibration Signal ◽

Rolling Bearing ◽

Learning Method ◽

Target Domain ◽

Source Domain ◽

Second Order Statistics ◽

Bearing Fault ◽

Bearing Fault Diagnosis

In mechanical fault diagnosis, it is impossible to collect massive labeled samples with the same distribution in real industrial settings. Transfer learning, a promising method, is commonly used to address this critical problem. However, as the number of samples increases, the interdomain distribution discrepancy measurement of existing methods incurs higher computational complexity, which may worsen their generalization ability. To solve this problem, we propose a deep transfer learning method based on 1D-CNN for rolling bearing fault diagnosis. First, a one-dimensional convolutional neural network (1D-CNN), as the basic framework, is used to extract features from the vibration signal. CORrelation ALignment (CORAL) is employed to minimize the marginal distribution discrepancy between the source domain and the target domain. Then, the cross-entropy loss function and the Adam optimizer are used to minimize the classification errors and the second-order statistics of the feature distance between the source domain and the target domain, respectively. Finally, based on the bearing datasets of Case Western Reserve University and Jiangnan University, seven transfer fault diagnosis comparison experiments are carried out. The results show that our method has better performance.
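
The CORAL term mentioned in this abstract aligns second-order statistics (feature covariances) of the two domains. A minimal NumPy sketch of the commonly used form of the loss, the squared Frobenius distance between the source and target covariance matrices scaled by 1/(4d²), is given below. It assumes features have already been extracted into arrays of shape (samples, features); the function name and the scaling are assumptions about the general technique, not details confirmed by this paper.

```python
import numpy as np

def coral_loss(source_feats, target_feats):
    """Squared Frobenius distance between feature covariances, scaled by 1/(4 d^2)."""
    d = source_feats.shape[1]
    # rowvar=False: each column is a feature, each row a sample.
    cov_s = np.atleast_2d(np.cov(source_feats, rowvar=False))
    cov_t = np.atleast_2d(np.cov(target_feats, rowvar=False))
    return float(np.sum((cov_s - cov_t) ** 2) / (4.0 * d * d))

rng = np.random.default_rng(0)
xs = rng.normal(size=(200, 4))      # stand-in for source-domain features
same = coral_loss(xs, xs)           # identical covariances
scaled = coral_loss(xs, 3.0 * xs)   # target covariance is 9x larger
shifted = coral_loss(xs, xs + 5.0)  # a mean shift leaves covariance unchanged
```

Because only covariances are compared, a pure shift in the feature mean is not penalized; in a training loop this term would be added to the cross-entropy classification loss.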

A Classification Learning Research based on Discriminative Knowledge-Leverage Transfer

International Journal of Ambient Computing and Intelligence ◽

10.4018/ijaci.2018100104 ◽

2018 ◽

Vol 9 (4) ◽

pp. 52-68 ◽

Cited By ~ 3

Author(s):

Ding Xiong ◽

Lu Yan

Keyword(s):

Transfer Learning ◽

Data Sets ◽

Empirical Risk Minimization ◽

Risk Minimization ◽

Target Domain ◽

Source Domain ◽

The Real ◽

Empirical Risk ◽

Real Scene ◽

Source Data

Current transfer learning models study the source data for future target inferences largely under the view that the whole source data should be used to explore the shared knowledge structure. However, because human resources are constrained, the source domain data are collected as a whole in real scenes, and it is not realistic to assume that all of these data are associated with the target domain. A generalized empirical risk minimization (GERM) model with discriminative knowledge-leverage (KL) is proposed in this article. The empirical risk minimization (ERM) principle is extended to the transfer learning setting, and a theoretical upper bound of GERM is given for practical discriminative transfer learning. The model can automatically select the subset of the source domain data that is associated with the target domain. It can be solved with only some knowledge of the source domain available, and thus avoids the negative transfer effect caused by using the whole source domain dataset in real scenes. Simulation results show that the proposed algorithm outperforms the traditional transfer learning algorithm on both simulated and real data sets.

Transfer Learning Method for Rolling Bearing Fault Diagnosis under Different Working Conditions Based on CycleGAN

Measurement Science and Technology ◽

10.1088/1361-6501/ac3942 ◽

2021 ◽

Author(s):

Jiantong Zhao ◽

Wentao Huang

Keyword(s):

Fault Diagnosis ◽

Transfer Learning ◽

Working Conditions ◽

Working Condition ◽

Learning Method ◽

Target Domain ◽

Simulation Data ◽

Source Domain ◽

Bearing Fault Diagnosis ◽

Cross Domain

In practical bearing fault diagnosis tasks, the available labelled data often do not come from the equipment to be diagnosed and cannot cover all working conditions. The adopted data-driven method is therefore required to have a certain degree of cross-domain and cross-working-condition transfer learning diagnosis ability. However, limited by the performance of existing transfer learning methods, the potential difference between the source domain and the target domain poses a challenge for the accuracy of transfer diagnosis. In this paper, a cross-working-condition data supplement method based on the cycle generative adversarial network (CycleGAN) and a dynamics model is proposed, which can use limited available data to approximate the missing parts of existing data and be used for diagnosis of the target domain. First, we considered the limited experimental data as the target domain and the simulation data corresponding to the working condition as the source domain, and used the working condition as the benchmark to constrain the data correspondence between the two datasets. We then used the CycleGAN model to learn the feature mapping from simulation to experiment. Second, based on the working condition of the data to be tested, the corresponding simulation data were input into the trained generator to obtain labeled data with experimental characteristics under the corresponding working conditions, and the resulting dataset was transferred as the source domain data to the data to be tested. In tests using self-made simulation and experimental datasets, combined with a transfer learning method based on probability distribution adaptation, it was shown that the proposed method could effectively improve the diagnostic effect of the single transfer learning method in cross-domain and cross-working-condition settings when the working condition span was large.

Model Parameter Transfer for Gear Fault Diagnosis under Varying Working Conditions

Chinese Journal of Mechanical Engineering ◽

10.1186/s10033-020-00520-9 ◽

2021 ◽

Vol 34 (1) ◽

Author(s):

Chao Chen ◽

Fei Shen ◽

Jiawen Xu ◽

Ruqiang Yan

Keyword(s):

Fault Diagnosis ◽

Transfer Learning ◽

Working Conditions ◽

Rapid Development ◽

Target Domain ◽

Source Domain ◽

Model Parameter ◽

Operation Conditions ◽

Gear Fault ◽

Gear Fault Diagnosis

Gear fault diagnosis technologies have developed rapidly and been effectively implemented in many engineering applications. However, varying working conditions degrade diagnostic performance and make gear fault diagnosis (GFD) increasingly challenging. In this paper, a novel model parameter transfer (NMPT) is proposed to boost the performance of GFD under varying working conditions. Based on the previous transfer strategy that controls the empirical risk of the source domain, this method further integrates the strengths of multi-task learning with the idea of transfer learning (TL) to acquire transferable knowledge by minimizing the discrepancies of the separating hyperplanes between one specific working condition (target domain) and another (source domain). It then transfers both commonality and specialty parameters across tasks so that source domain samples can assist the target GFD task when sufficient labeled samples from the target domain are unavailable. For the NMPT implementation, insufficient target domain features and abundant source domain features with supervised information are fed into the NMPT model to train a robust classifier for the target GFD task. Related experiments show that NMPT is expected to be a valuable technology for boosting practical GFD performance under various working conditions. The proposed method provides a transfer learning-based framework to handle the problem of insufficient training samples in the target task caused by variable operation conditions.

Hyperbolic markers in modeling hyperbole: a scenario-based account

Círculo de lingüística aplicada a la comunicación ◽

10.5209/clac.66249 ◽

2021 ◽

Vol 85 ◽

pp. 61-71

Author(s):

Carla Ovejas Ramírez

Keyword(s):

Cognitive Linguistics ◽

Language Use ◽

Meaning Construction ◽

The Other ◽

Target Domain ◽

Real Situation ◽

Source Domain ◽

The Real ◽

Theoretical Context ◽

Hypothetical Scenario

This article discusses hyperbolic markers in modeling hyperbole from the perspective of a scenario-based account of language use within the framework of Cognitive Linguistics. In this view, hyperbole is seen as a mapping across two conceptual domains (Peña y Ruiz de Mendoza, 2017), a source domain, here relabeled as the magnified scenario, which contains a hypothetical unrealistic situation based on exaggeration, and a target domain or observable scenario which depicts the real situation addressed by the hyperbolic expression. Since the hypothetical scenario is a magnified version of the observable scenario, the mapping contains source-target matches in varying degrees of resemblance. Within this theoretical context, the article explores resources available to speakers for the construction of magnified scenarios leading to hyperbolic interpretation. Among such resources, we find hyperbole markers and the setting up of domains of reference. Finally, the article also discusses hyperbole blockers, which cancel out the activity of the other hyperbolic meaning construction mechanisms.

Cross-domain gearbox diagnostics under variable working conditions with deep convolutional transfer learning

Journal of Vibration and Control ◽

10.1177/1077546320933793 ◽

2020 ◽

pp. 107754632093379

Author(s):

Moslem Azamfar ◽

Jaskaran Singh ◽

Xiang Li ◽

Jay Lee

Keyword(s):

Fault Diagnosis ◽

Transfer Learning ◽

High Performance ◽

Training Sample ◽

Training Dataset ◽

Target Domain ◽

Training Sample Size ◽

Vibration Data ◽

Cross Domain ◽

Testing Dataset

This study proposes a novel 1D deep convolutional transfer learning method that is able to learn the high-dimensional domain-invariant feature from the labeled training dataset and perform diagnosis tasks on the unlabeled testing dataset subjected to a domain shift. To obtain the domain-invariant features, the cross-entropy loss in the source domain classifier and the maximum mean discrepancies between the source and target domain data are minimized simultaneously. To evaluate the performance of the proposed method, an experimental study is conducted on a gearbox under significant speed variation. Because of inherent limitations of the vibration data, in this research, the effectiveness of torque measurement signals has been explored for gearbox fault diagnosis. Comprehensive studies on network parameters and the training sample size are performed to illustrate the robustness and effectiveness of the proposed method. A comparison study is performed on similar techniques to illustrate the superiority and high performance of the proposed diagnosis method. The achieved results illustrate the effectiveness of torque signal in multiclass cross-domain fault diagnosis of gearboxes.
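
The maximum mean discrepancy (MMD) that this abstract minimizes alongside the cross-entropy loss can be illustrated with a small NumPy sketch. It computes the biased empirical estimate of the squared MMD with a Gaussian (RBF) kernel; the kernel choice, the `gamma` bandwidth value, and the function names are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def rbf_kernel(a, b, gamma):
    """Gaussian kernel matrix k(a_i, b_j) = exp(-gamma * ||a_i - b_j||^2)."""
    sq_dist = (
        np.sum(a ** 2, axis=1)[:, None]
        + np.sum(b ** 2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq_dist, 0.0))

def mmd2(source, target, gamma=1.0):
    """Biased empirical estimate of the squared MMD between two samples."""
    k_ss = rbf_kernel(source, source, gamma).mean()
    k_tt = rbf_kernel(target, target, gamma).mean()
    k_st = rbf_kernel(source, target, gamma).mean()
    return float(k_ss + k_tt - 2.0 * k_st)

rng = np.random.default_rng(0)
x = rng.normal(size=(100, 2))   # stand-in for source-domain features
no_shift = mmd2(x, x)           # same sample on both sides
with_shift = mmd2(x, x + 3.0)   # target shifted away from the source
```

In a domain adaptation network, a term of this kind computed on the learned features of source and target batches would be added to the source classification loss, so that the features become harder to tell apart across domains.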

A Weighted Subdomain Adaptation Network for Partial Transfer Fault Diagnosis of Rotating Machinery

Entropy ◽

10.3390/e23040424 ◽

2021 ◽

Vol 23 (4) ◽

pp. 424

Author(s):

Sixiang Jia ◽

Jinrui Wang ◽

Xiao Zhang ◽

Baokun Han

Keyword(s):

Fault Diagnosis ◽

Domain Adaptation ◽

Local Maximum ◽

Fault Classification ◽

Target Domain ◽

Local Data ◽

Source Domain ◽

Fine Grained ◽

Realistic Situation ◽

Partial Transfer

Domain adaptation-based models for fault classification under variable working conditions have become a research focus in recent years. Previous domain adaptation approaches generally assume identical label spaces in the source and target domains. However, such an assumption may no longer hold in a more realistic situation that requires adaptation from a larger and more diverse source domain to a smaller target domain with fewer fault classes. To address these deficiencies, we propose a partial transfer fault diagnosis model based on a weighted subdomain adaptation network (WSAN). Our method pays more attention to the local data distribution while aligning the global distribution. An auxiliary classifier is introduced to obtain the class-level weights of the source samples, so the network can avoid negative transfer caused by fault classes that appear only in the source domain. Furthermore, a weighted local maximum mean discrepancy (WLMMD) is proposed to capture fine-grained transferable information and obtain sample-level weights. Finally, relevant distributions of domain-specific layer activations across different domains are aligned. Experimental results show that our method can assign appropriate weights to each source sample and realize efficient partial transfer fault diagnosis.

Transfer learning for Twitter sentiment analysis: Choosing an effective source dataset

10.5753/kdmile.2020.11972 ◽

2020 ◽

Author(s):

Eliseu Guimarães ◽

Jonnathan Carvalho ◽

Aline Paes ◽

Alexandre Plastino

Keyword(s):

Sentiment Analysis ◽

Transfer Learning ◽

Distance Metrics ◽

Learning Approaches ◽

Target Domain ◽

Social Media Data ◽

Inverse Document Frequency ◽

Source Domain ◽

Document Frequency ◽

Media Data

Sentiment analysis on social media data can be a challenging task, among other reasons, because labeled data for training is not always available. Transfer learning approaches address this problem by leveraging a labeled source domain to obtain a model for a target domain that is different but related to the source domain. However, the question that arises is how to choose proper source data for training the target classifier, which can be made considering the similarity between source and target data using distance metrics. This article investigates the relation between these distance metrics and the classifiers’ performance. For this purpose, we propose to evaluate four metrics combined with distinct dataset representations. Computational experiments, conducted in the Twitter sentiment analysis scenario, showed that the cosine similarity metric combined with bag-of-words normalized with term frequency-inverse document frequency presented the best results in terms of predictive power, outperforming even the classifiers trained with the target dataset in many cases.
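
The best-performing combination reported in this abstract, cosine similarity over a bag-of-words representation weighted by TF-IDF, can be sketched in plain Python. The sketch treats each candidate dataset as one bag of tokens and uses a smoothed IDF; both choices, as well as the function names and the toy token lists, are assumptions for illustration and may differ from the representations evaluated in the paper.

```python
import math
from collections import Counter

def tfidf_vectors(datasets):
    """Map each dataset name to a sparse TF-IDF vector (dict: token -> weight).

    datasets: dict of name -> list of tokenized texts (lists of tokens).
    """
    counts = {
        name: Counter(tok for doc in docs for tok in doc)
        for name, docs in datasets.items()
    }
    n = len(counts)
    # Document frequency: in how many datasets each token appears.
    df = Counter(tok for c in counts.values() for tok in c)
    # Smoothed IDF (an assumed variant) keeps shared tokens from vanishing.
    idf = {tok: math.log((1 + n) / (1 + df[tok])) + 1.0 for tok in df}
    return {
        name: {tok: tf * idf[tok] for tok, tf in c.items()}
        for name, c in counts.items()
    }

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(tok, 0.0) for tok, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

# Toy token lists (hypothetical, for illustration only).
vectors = tfidf_vectors({
    "target": [["good", "movie"], ["bad", "movie"]],
    "source_a": [["good", "film"], ["bad", "film"]],
    "source_b": [["stock", "market"]],
})
sim_a = cosine(vectors["target"], vectors["source_a"])
sim_b = cosine(vectors["target"], vectors["source_b"])
```

Ranking candidate source datasets by such a similarity to the target is one way to act on the abstract's question of how to choose source data; here `source_a` shares vocabulary with the target and `source_b` does not.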

TLGP: a flexible transfer learning algorithm for gene prioritization based on heterogeneous source domain

BMC Bioinformatics ◽

10.1186/s12859-021-04190-9 ◽

2021 ◽

Vol 22 (S9) ◽

Author(s):

Yan Wang ◽

Zuheng Xia ◽

Jingjing Deng ◽

Xianghua Xie ◽

Maoguo Gong ◽

...

Keyword(s):

Transfer Learning ◽

Learning Algorithm ◽

Genomic Data ◽

Gene Prioritization ◽

Affinity Matrix ◽

Target Domain ◽

Gene Ranking ◽

Underlying Assumption ◽

Source Domain ◽

Target Cancer

Background: Gene prioritization (gene ranking) aims to obtain the centrality of genes, which is critical for cancer diagnosis and therapy since key genes correspond to biomarkers or drug targets. Great efforts have been devoted to the gene ranking problem by exploring the similarity between candidate and known disease-causing genes. However, when the number of disease-causing genes is limited, these methods are largely inapplicable because of their low accuracy. In practice, the number of known disease-causing genes for cancers, particularly rare cancers, is limited. Therefore, there is a critical need to design effective and efficient algorithms for gene ranking with limited prior disease-causing genes. Results: In this study, we propose a transfer learning based algorithm for gene prioritization (called TLGP) in a cancer (target domain) without disease-causing genes by transferring knowledge from other cancers (source domain). The underlying assumption is that knowledge shared by similar cancers improves the accuracy of gene prioritization. Specifically, TLGP first quantifies the similarity between the target and source domains by calculating the affinity matrix for genes. Then, TLGP automatically learns a fusion network for the target cancer by fusing the affinity matrix, pathogenic genes and genomic data of the source cancers. Finally, genes in the target cancer are prioritized. The experimental results indicate that the learnt fusion network is more reliable than the gene co-expression network, implying that transferring knowledge from other cancers improves the accuracy of network construction. Moreover, TLGP outperforms state-of-the-art approaches in terms of accuracy, with an improvement of at least 5%. Conclusion: The proposed model and method provide an effective and efficient strategy for gene ranking by integrating genomic data from various cancers.

A domain adaptation model for early gear pitting fault diagnosis based on deep transfer learning network

Proceedings of the Institution of Mechanical Engineers Part O Journal of Risk and Reliability ◽

10.1177/1748006x19867776 ◽

2019 ◽

Vol 234 (1) ◽

pp. 168-182 ◽

Cited By ~ 4

Author(s):

Jialin Li ◽

Xueyi Li ◽

David He ◽

Yongzhi Qu

Keyword(s):

Neural Network ◽

Fault Diagnosis ◽

Transfer Learning ◽

Working Conditions ◽

Deep Neural Network ◽

Domain Adaptation ◽

Diagnostic Model ◽

Target Domain ◽

Vibration Signals ◽

Learning Network

In recent years, research on gear pitting fault diagnosis has mostly focused on the feature extraction and feature selection process, and the resulting diagnostic models are suitable for only one working condition. To diagnose early gear pitting faults under multiple working conditions, this article proposes a domain adaptation diagnostic model based on an improved deep neural network and transfer learning with raw vibration signals. A particle swarm optimization algorithm and L2 regularization are used to optimize the improved deep neural network and thereby improve the stability and accuracy of the diagnosis. When using the domain adaptation diagnostic model for fault diagnosis, it is necessary to determine whether the target domain (test data) is the same as the source domain (training data). If the target domain and the source domain are consistent, the trained improved deep neural network can be used directly for diagnosis. Otherwise, transfer learning is combined with the improved deep neural network to develop a deep transfer learning network that improves the domain adaptability of the diagnostic model. Vibration signals for seven gear types with early pitting faults under 25 working conditions, collected from a gear test rig, are used to validate the proposed method. The validation results confirm that the developed domain adaptation diagnostic model significantly improves adaptability to multiple working conditions.
