scholarly journals Transfer Learning-Based Fault Diagnosis under Data Deficiency

2020 ◽  
Vol 10 (21) ◽  
pp. 7768
Author(s):  
Seong Hee Cho ◽  
Seokgoo Kim ◽  
Joo-Ho Choi

In the fault diagnosis study, data deficiency, meaning that the fault data for the training are scarce, is often encountered, and it may deteriorate the performance of the fault diagnosis greatly. To solve this issue, the transfer learning (TL) approach is employed to exploit the neural network (NN) trained in another (source) domain where enough fault data are available in order to improve the NN performance of the real (target) domain. While there have been similar attempts of TL in the literature to solve the imbalance issue, they were about the sample imbalance between the source and target domain, whereas the present study considers the imbalance between the normal and fault data. To illustrate this, normal and fault datasets are acquired from the linear motion guide, in which the data at high and low speeds represent the real operation (target) and maintenance inspection (source), respectively. The effect of data deficiency is studied by reducing the number of fault data in the target domain, and comparing the performance of TL, which exploits the knowledge of the source domain and the ordinary machine learning (ML) approach without it. By examining the accuracy of the fault diagnosis as a function of imbalance ratio, it is found that the lower bound and interquartile range (IQR) of the accuracy are improved greatly by employing the TL approach. Therefore, it can be concluded that TL is truly more effective than the ordinary ML when there is a large imbalance between the fault and normal data, such as smaller than 0.1.

2021 ◽  
Vol 2021 ◽  
pp. 1-16
Author(s):  
Jun He ◽  
Xiang Li ◽  
Yong Chen ◽  
Danfeng Chen ◽  
Jing Guo ◽  
...  

In mechanical fault diagnosis, it is impossible to collect massive labeled samples with the same distribution in real industry. Transfer learning, a promising method, is usually used to address the critical problem. However, as the number of samples increases, the interdomain distribution discrepancy measurement of the existing method has a higher computational complexity, which may make the generalization ability of the method worse. To solve the problem, we propose a deep transfer learning method based on 1D-CNN for rolling bearing fault diagnosis. First, 1-dimension convolutional neural network (1D-CNN), as the basic framework, is used to extract features from vibration signal. The CORrelation ALignment (CORAL) is employed to minimize marginal distribution discrepancy between the source domain and target domain. Then, the cross-entropy loss function and Adam optimizer are used to minimize the classification errors and the second-order statistics of feature distance between the source domain and target domain, respectively. Finally, based on the bearing datasets of Case Western Reserve University and Jiangnan University, seven transfer fault diagnosis comparison experiments are carried out. The results show that our method has better performance.


2018 ◽  
Vol 9 (4) ◽  
pp. 52-68 ◽  
Author(s):  
Ding Xiong ◽  
Lu Yan

Current transfer learning models study the source data for future target inferences within a major view, the whole source data should be used to explore the shared knowledge structure. However, human resources are constrained, the source domain data is collected as a whole in the real scene. However, this is not realistic, this data is associated with the target domain. A generalized empirical risk minimization model (GERM) is proposed in this article with discriminative knowledge-leverage (KL). The empirical risk minimization (ERM) principle is extended to the transfer learning setting. The theoretical upper bound of generalized ERM (GERM) is given for the practical discriminative transfer learning. The subset of the source domain data can be automatically selected in the model, and the source domain data is associated with the target domain. It can solve with only some knowledge of the source domain being available, thus it can avoid the negative transfer effect which is caused by the whole source domain dataset in the real scene. Simulation results show that the proposed algorithm is better than the traditional transfer learning algorithm in simulation data sets and real data sets.


Author(s):  
Jiantong Zhao ◽  
Wentao Huang

Abstract In practical bearing fault diagnosis tasks, the available labelled data are often not from the equipment to be diagnosed and cannot cover all manner of working conditions. The adopted data-driven method is required to have a certain degree of cross-domain and cross-working condition transfer learning diagnosis ability. However, limited by the performance of existing transfer learning methods, the potential difference between the source domain and the target domain poses a challenge for the accuracy of transfer diagnosis. In this paper, a cross-working condition data supplement method based on the cycle generative adversarial network (CycleGAN) and a dynamics model is proposed, which can use limited available data to approximate the missing parts of existing data and be used for diagnosis of the target domain. First, we considered the limited experimental data as the target domain, the simulation data corresponding to the working condition as the source domain and used the working condition as the benchmark to constrain the data correspondence between the two datasets. We then used the CycleGAN model to learn the feature mapping from simulation to experiment. Second, based on the working condition of the data to be tested, the corresponding simulation data were input into the trained generator to obtain labeled data with experimental characteristics under the corresponding working conditions, and transferred the dataset as the source domain data to the data to be tested. In the test using self-made simulation and experimental datasets, combined with the transfer learning method based on the probability distribution adaptation, it was shown that the proposed method could effectively improve the diagnostic impact of the single transfer learning method in cross-domain and cross-working conditions when the working condition span was large.


2021 ◽  
Vol 34 (1) ◽  
Author(s):  
Chao Chen ◽  
Fei Shen ◽  
Jiawen Xu ◽  
Ruqiang Yan

AbstractGear fault diagnosis technologies have received rapid development and been effectively implemented in many engineering applications. However, the various working conditions would degrade the diagnostic performance and make gear fault diagnosis (GFD) more and more challenging. In this paper, a novel model parameter transfer (NMPT) is proposed to boost the performance of GFD under varying working conditions. Based on the previous transfer strategy that controls empirical risk of source domain, this method further integrates the superiorities of multi-task learning with the idea of transfer learning (TL) to acquire transferable knowledge by minimizing the discrepancies of separating hyperplanes between one specific working condition (target domain) and another (source domain), and then transferring both commonality and specialty parameters over tasks to make use of source domain samples to assist target GFD task when sufficient labeled samples from target domain are unavailable. For NMPT implementation, insufficient target domain features and abundant source domain features with supervised information are fed into NMPT model to train a robust classifier for target GFD task. Related experiments prove that NMPT is expected to be a valuable technology to boost practical GFD performance under various working conditions. The proposed methods provides a transfer learning-based framework to handle the problem of insufficient training samples in target task caused by variable operation conditions.


2021 ◽  
Vol 85 ◽  
pp. 61-71
Author(s):  
Carla Ovejas Ramírez

This article discusses hyperbolic markers in modeling hyperbole from the perspective of a scenario-based account of language use within the framework of Cognitive Linguistics. In this view, hyperbole is seen as a mapping across two conceptual domains (Peña y Ruiz de Mendoza, 2017), a source domain, here relabeled as the magnified scenario, which contains a hypothetical unrealistic situation based on exaggeration, and a target domain or observable scenario which depicts the real situation addressed by the hyperbolic expression. Since the hypothetical scenario is a magnified version of the observable scenario, the mapping contains source-target matches in varying degrees of resemblance. Within this theoretical context, the article explores resources available to speakers for the construction of magnified scenarios leading to hyperbolic interpretation. Among such resources, we find hyperbole markers and the setting up of domains of reference. Finally, the article also discusses hyperbole blockers, which cancel out the activity of the other hyperbolic meaning construction mechanisms.


2020 ◽  
pp. 107754632093379
Author(s):  
Moslem Azamfar ◽  
Jaskaran Singh ◽  
Xiang Li ◽  
Jay Lee

This study proposes a novel 1D deep convolutional transfer learning method that is able to learn the high-dimensional domain-invariant feature from the labeled training dataset and perform diagnosis tasks on the unlabeled testing dataset subjected to a domain shift. To obtain the domain-invariant features, the cross-entropy loss in the source domain classifier and the maximum mean discrepancies between the source and target domain data are minimized simultaneously. To evaluate the performance of the proposed method, an experimental study is conducted on a gearbox under significant speed variation. Because of inherent limitations of the vibration data, in this research, the effectiveness of torque measurement signals has been explored for gearbox fault diagnosis. Comprehensive studies on network parameters and the training sample size are performed to illustrate the robustness and effectiveness of the proposed method. A comparison study is performed on similar techniques to illustrate the superiority and high performance of the proposed diagnosis method. The achieved results illustrate the effectiveness of torque signal in multiclass cross-domain fault diagnosis of gearboxes.


Entropy ◽  
2021 ◽  
Vol 23 (4) ◽  
pp. 424
Author(s):  
Sixiang Jia ◽  
Jinrui Wang ◽  
Xiao Zhang ◽  
Baokun Han

Domain adaptation-based models for fault classification under variable working conditions have become a research focus in recent years. Previous domain adaptation approaches generally assume identical label spaces in the source and target domains, however, such an assumption may be no longer legitimate in a more realistic situation that requires adaptation from a larger and more diverse source domain to a smaller target domain with less number of fault classes. To address the above deficiencies, we propose a partial transfer fault diagnosis model based on a weighted subdomain adaptation network (WSAN) in this paper. Our method pays more attention to the local data distribution while aligning the global distribution. An auxiliary classifier is introduced to obtain the class-level weights of the source samples, so the network can avoid negative transfer caused by unique fault classes in the source domain. Furthermore, a weighted local maximum mean discrepancy (WLMMD) is proposed to capture the fine-grained transferable information and obtain sample-level weights. Finally, relevant distributions of domain-specific layer activations across different domains are aligned. Experimental results show that our method could assign appropriate weights to each source sample and realize efficient partial transfer fault diagnosis.


2020 ◽  
Author(s):  
Eliseu Guimarães ◽  
Jonnathan Carvalho ◽  
Aline Paes ◽  
Alexandre Plastino

Sentiment analysis on social media data can be a challenging task, among other reasons, because labeled data for training is not always available. Transfer learning approaches address this problem by leveraging a labeled source domain to obtain a model for a target domain that is different but related to the source domain. However, the question that arises is how to choose proper source data for training the target classifier, which can be made considering the similarity between source and target data using distance metrics. This article investigates the relation between these distance metrics and the classifiers’ performance. For this purpose, we propose to evaluate four metrics combined with distinct dataset representations. Computational experiments, conducted in the Twitter sentiment analysis scenario, showed that the cosine similarity metric combined with bag-of-words normalized with term frequency-inverse document frequency presented the best results in terms of predictive power, outperforming even the classifiers trained with the target dataset in many cases.


2021 ◽  
Vol 22 (S9) ◽  
Author(s):  
Yan Wang ◽  
Zuheng Xia ◽  
Jingjing Deng ◽  
Xianghua Xie ◽  
Maoguo Gong ◽  
...  

Abstract Background Gene prioritization (gene ranking) aims to obtain the centrality of genes, which is critical for cancer diagnosis and therapy since keys genes correspond to the biomarkers or targets of drugs. Great efforts have been devoted to the gene ranking problem by exploring the similarity between candidate and known disease-causing genes. However, when the number of disease-causing genes is limited, they are not applicable largely due to the low accuracy. Actually, the number of disease-causing genes for cancers, particularly for these rare cancers, are really limited. Therefore, there is a critical needed to design effective and efficient algorithms for gene ranking with limited prior disease-causing genes. Results In this study, we propose a transfer learning based algorithm for gene prioritization (called TLGP) in the cancer (target domain) without disease-causing genes by transferring knowledge from other cancers (source domain). The underlying assumption is that knowledge shared by similar cancers improves the accuracy of gene prioritization. Specifically, TLGP first quantifies the similarity between the target and source domain by calculating the affinity matrix for genes. Then, TLGP automatically learns a fusion network for the target cancer by fusing affinity matrix, pathogenic genes and genomic data of source cancers. Finally, genes in the target cancer are prioritized. The experimental results indicate that the learnt fusion network is more reliable than gene co-expression network, implying that transferring knowledge from other cancers improves the accuracy of network construction. Moreover, TLGP outperforms state-of-the-art approaches in terms of accuracy, improving at least 5%. Conclusion The proposed model and method provide an effective and efficient strategy for gene ranking by integrating genomic data from various cancers.


Author(s):  
Jialin Li ◽  
Xueyi Li ◽  
David He ◽  
Yongzhi Qu

In recent years, research on gear pitting fault diagnosis has been conducted. Most of the research has focused on feature extraction and feature selection process, and diagnostic models are only suitable for one working condition. To diagnose early gear pitting faults under multiple working conditions, this article proposes to develop a domain adaptation diagnostic model–based improved deep neural network and transfer learning with raw vibration signals. A particle swarm optimization algorithm and L2 regularization are used to optimize the improved deep neural network to improve the stability and accuracy of the diagnosis. When using the domain adaptation diagnostic model for fault diagnosis, it is necessary to discriminate whether the target domain (test data) is the same as the source domain (training data). If the target domain and the source domain are consistent, the trained improved deep neural network can be used directly for diagnosis. Otherwise, the transfer learning is combined with improved deep neural network to develop a deep transfer learning network to improve the domain adaptability of the diagnostic model. Vibration signals for seven gear types with early pitting faults under 25 working conditions collected from a gear test rig are used to validate the proposed method. It is confirmed by the validation results that the developed domain adaptation diagnostic model has a significant improvement in the adaptability of multiple working conditions.


Sign in / Sign up

Export Citation Format

Share Document