Transfer Convolutional Neural Network for Cross-Project Defect Prediction

Shaojian Qiu; Hao Xu; Jiehan Deng; Siyu Jiang; Lu Lu

doi:10.3390/app9132660

Transfer Convolutional Neural Network for Cross-Project Defect Prediction

Applied Sciences ◽

10.3390/app9132660 ◽

2019 ◽

Vol 9 (13) ◽

pp. 2660 ◽

Cited By ~ 2

Author(s):

Shaojian Qiu ◽

Hao Xu ◽

Jiehan Deng ◽

Siyu Jiang ◽

Lu Lu

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Reproducing Kernel ◽

Reproducing Kernel Hilbert Space ◽

Defect Prediction ◽

Classification Error ◽

Software Defect ◽

Novel Approach ◽

Distribution Matching ◽

Cross Project

Cross-project defect prediction (CPDP) is a practical solution that allows software defect prediction (SDP) to be used earlier in the software lifecycle. With the CPDP technique, the software defect predictor trained by labeled data of mature projects can be applied for the prediction task of a new project. Most previous CPDP approaches ignored the semantic information in the source code, and existing semantic-feature-based SDP methods do not take into account the data distribution divergence between projects. These limitations may weaken defect prediction performance. To solve these problems, we propose a novel approach, the transfer convolutional neural network (TCNN), to mine the transferable semantic (deep-learning (DL)-generated) features for CPDP tasks. Specifically, our approach first parses the source file into integer vectors as the network inputs. Next, to obtain the TCNN model, a matching layer is added into convolutional neural network where the hidden representations of the source and target project-specific data are embedded into a reproducing kernel Hilbert space for distribution matching. By simultaneously minimizing classification error and distribution divergence between projects, the constructed TCNN could extract the transferable DL-generated features. Finally, without losing the information contained in handcrafted features, we combine them with transferable DL-generated features to form the joint features for CPDP performing. Experiments based on 10 benchmark projects (with 90 pairs of CPDP tasks) showed that the proposed TCNN method is superior to the reference methods.

Download Full-text

Within‐project and cross‐project just‐in‐time defect prediction based on denoising autoencoder and convolutional neural network

IET Software ◽

10.1049/iet-sen.2019.0278 ◽

2020 ◽

Vol 14 (3) ◽

pp. 185-195

Author(s):

Kun Zhu ◽

Nana Zhang ◽

Shi Ying ◽

Dandan Zhu

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Defect Prediction ◽

Just In Time ◽

Denoising Autoencoder ◽

Cross Project

Download Full-text

A Suitable AST Node Granularity and Multi-Kernel Transfer Convolutional Neural Network for Cross-Project Defect Prediction

IEEE Access ◽

10.1109/access.2020.2985780 ◽

2020 ◽

Vol 8 ◽

pp. 66647-66661

Author(s):

Jiehan Deng ◽

Lu Lu ◽

Shaojian Qiu ◽

Yangpeng Ou

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Defect Prediction ◽

Cross Project

Download Full-text

An Adversarial Discriminative Convolutional Neural Network for Cross-Project Defect Prediction

IEEE Access ◽

10.1109/access.2020.2981869 ◽

2020 ◽

Vol 8 ◽

pp. 55241-55253

Author(s):

Lei Sheng ◽

Lu Lu ◽

Junhao Lin

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Defect Prediction ◽

Cross Project

Download Full-text

Software Defect Prediction via Convolutional Neural Network

2017 IEEE International Conference on Software Quality, Reliability and Security (QRS) ◽

10.1109/qrs.2017.42 ◽

2017 ◽

Cited By ~ 46

Author(s):

Jian Li ◽

Pinjia He ◽

Jieming Zhu ◽

Michael R. Lyu

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Defect Prediction ◽

Software Defect Prediction ◽

Software Defect

Download Full-text

Software defect prediction using hybrid model (CBIL) of convolutional neural network (CNN) and bidirectional long short-term memory (Bi-LSTM)

PeerJ Computer Science ◽

10.7717/peerj-cs.739 ◽

2021 ◽

Vol 7 ◽

pp. e739

Author(s):

Ahmed Bahaa Farid ◽

Enas Mohamed Fathy ◽

Ahmed Sharaf Eldin ◽

Laila A. Abd-Elmegid

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Short Term Memory ◽

Source Code ◽

Defect Prediction ◽

Software Defect Prediction ◽

Short Term ◽

Term Memory ◽

Software Defect ◽

Long Short Term Memory

In recent years, the software industry has invested substantial effort to improve software quality in organizations. Applying proactive software defect prediction will help developers and white box testers to find the defects earlier, and this will reduce the time and effort. Traditional software defect prediction models concentrate on traditional features of source code including code complexity, lines of code, etc. However, these features fail to extract the semantics of source code. In this research, we propose a hybrid model that is called CBIL. CBIL can predict the defective areas of source code. It extracts Abstract Syntax Tree (AST) tokens as vectors from source code. Mapping and word embedding turn integer vectors into dense vectors. Then, Convolutional Neural Network (CNN) extracts the semantics of AST tokens. After that, Bidirectional Long Short-Term Memory (Bi-LSTM) keeps key features and ignores other features in order to enhance the accuracy of software defect prediction. The proposed model CBIL is evaluated on a sample of seven open-source Java projects of the PROMISE dataset. CBIL is evaluated by applying the following evaluation metrics: F-measure and area under the curve (AUC). The results display that CBIL model improves the average of F-measure by 25% compared to CNN, as CNN accomplishes the top performance among the selected baseline models. In average of AUC, CBIL model improves AUC by 18% compared to Recurrent Neural Network (RNN), as RNN accomplishes the top performance among the selected baseline models used in the experiments.

Download Full-text

Joint feature representation learning and progressive distribution matching for cross-project defect prediction

Information and Software Technology ◽

10.1016/j.infsof.2021.106588 ◽

2021 ◽

Vol 137 ◽

pp. 106588

Author(s):

Quanyi Zou ◽

Lu Lu ◽

Zhanyu Yang ◽

Xiaowei Gu ◽

Shaojian Qiu

Keyword(s):

Representation Learning ◽

Feature Representation ◽

Defect Prediction ◽

Distribution Matching ◽

Cross Project

Download Full-text

A Novel Approach to Fault Diagnosis of High Voltage Transmission line - A Self Attentive Convolutional Neural Network Model

2020 IEEE Region 10 Symposium (TENSYMP) ◽

10.1109/tensymp50017.2020.9230660 ◽

2020 ◽

Author(s):

Shahriar Rahman Fahim ◽

Md. Rabiul Islam Sarker ◽

Md. Arifuzzaman ◽

Md. Sakhawat Hosen ◽

Subrata K Sarker ◽

...

Keyword(s):

Neural Network ◽

Fault Diagnosis ◽

Transmission Line ◽

Convolutional Neural Network ◽

Network Model ◽

High Voltage ◽

Neural Network Model ◽

High Voltage Transmission Line ◽

Novel Approach ◽

High Voltage Transmission

Download Full-text

A Novel Approach on Epileptic Seizures Detection Using Convolutional Neural Network

2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA) ◽

10.1109/iceca49313.2020.9297431 ◽

2020 ◽

Author(s):

Arpana Mahajan ◽

Sheshang Degadwala ◽

Prama Talukder ◽

Baron Meetei ◽

M. Rameshkumar

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Epileptic Seizures ◽

Novel Approach

Download Full-text

Enhanced Convolutional-Neural-Network Architecture for Crop Classification

Applied Sciences ◽

10.3390/app11094292 ◽

2021 ◽

Vol 11 (9) ◽

pp. 4292

Author(s):

Mónica Y. Moreno-Revelo ◽

Lorena Guachi-Guachi ◽

Juan Bernardo Gómez-Mendoza ◽

Javier Revelo-Fuelagán ◽

Diego H. Peluffo-Ordóñez

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Network Architecture ◽

Classification Model ◽

Classification Error ◽

Small Scale ◽

Post Processing ◽

Average Accuracy ◽

Processing Step ◽

Crop Classification

Automatic crop identification and monitoring is a key element in enhancing food production processes as well as diminishing the related environmental impact. Although several efficient deep learning techniques have emerged in the field of multispectral imagery analysis, the crop classification problem still needs more accurate solutions. This work introduces a competitive methodology for crop classification from multispectral satellite imagery mainly using an enhanced 2D convolutional neural network (2D-CNN) designed at a smaller-scale architecture, as well as a novel post-processing step. The proposed methodology contains four steps: image stacking, patch extraction, classification model design (based on a 2D-CNN architecture), and post-processing. First, the images are stacked to increase the number of features. Second, the input images are split into patches and fed into the 2D-CNN model. Then, the 2D-CNN model is constructed within a small-scale framework, and properly trained to recognize 10 different types of crops. Finally, a post-processing step is performed in order to reduce the classification error caused by lower-spatial-resolution images. Experiments were carried over the so-named Campo Verde database, which consists of a set of satellite images captured by Landsat and Sentinel satellites from the municipality of Campo Verde, Brazil. In contrast to the maximum accuracy values reached by remarkable works reported in the literature (amounting to an overall accuracy of about 81%, a f1 score of 75.89%, and average accuracy of 73.35%), the proposed methodology achieves a competitive overall accuracy of 81.20%, a f1 score of 75.89%, and an average accuracy of 88.72% when classifying 10 different crops, while ensuring an adequate trade-off between the number of multiply-accumulate operations (MACs) and accuracy. Furthermore, given its ability to effectively classify patches from two image sequences, this methodology may result appealing for other real-world applications, such as the classification of urban materials.

Download Full-text

An investigation of cross-project learning in online just-in-time software defect prediction

Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering ◽

10.1145/3377811.3380403 ◽

2020 ◽

Cited By ~ 1

Author(s):

Sadia Tabassum ◽

Leandro L. Minku ◽

Danyi Feng ◽

George G. Cabral ◽

Liyan Song

Keyword(s):

Defect Prediction ◽

Just In Time ◽

Software Defect Prediction ◽

Project Learning ◽

Software Defect ◽

Cross Project

Download Full-text