An Empirical Study of Classifier Combination for Cross-Project Defect Prediction

An empirical study of just-in-time defect prediction using cross-project models

Proceedings of the 11th Working Conference on Mining Software Repositories - MSR 2014 ◽

10.1145/2597073.2597075 ◽

2014 ◽

Cited By ~ 46

Author(s):

Takafumi Fukushima ◽

Yasutaka Kamei ◽

Shane McIntosh ◽

Kazuhiro Yamashita ◽

Naoyasu Ubayashi

Keyword(s):

Empirical Study ◽

Defect Prediction ◽

Just In Time ◽

Cross Project

Download Full-text

An Empirical Study on Software Defect Prediction Using CodeBERT Model

Applied Sciences ◽

10.3390/app11114793 ◽

2021 ◽

Vol 11 (11) ◽

pp. 4793

Author(s):

Cong Pan ◽

Minyan Lu ◽

Biao Xu

Keyword(s):

Deep Learning ◽

Software Engineering ◽

Empirical Study ◽

Empirical Studies ◽

Language Model ◽

Prediction Performance ◽

Defect Prediction ◽

Software Defect Prediction ◽

Software Defect ◽

Cross Project

Deep learning-based software defect prediction has been popular these days. Recently, the publishing of the CodeBERT model has made it possible to perform many software engineering tasks. We propose various CodeBERT models targeting software defect prediction, including CodeBERT-NT, CodeBERT-PS, CodeBERT-PK, and CodeBERT-PT. We perform empirical studies using such models in cross-version and cross-project software defect prediction to investigate if using a neural language model like CodeBERT could improve prediction performance. We also investigate the effects of different prediction patterns in software defect prediction using CodeBERT models. The empirical results are further discussed.

Download Full-text

A Ranking-Oriented Approach to Cross-Project Software Defect Prediction: An Empirical Study

Proceedings of the 27th International Conference on Software Engineering and Knowledge Engineering ◽

10.18293/seke2016-047 ◽

2016 ◽

Cited By ~ 2

Author(s):

Guoan You ◽

Yutao Ma

Keyword(s):

Empirical Study ◽

Defect Prediction ◽

Software Defect Prediction ◽

Software Defect ◽

Oriented Approach ◽

Cross Project

Download Full-text

Cross-project defect prediction using data sampling for class imbalance learning: an empirical study

International Journal of Parallel Emergent and Distributed Systems ◽

10.1080/17445760.2019.1650039 ◽

2019 ◽

pp. 1-14

Author(s):

Lipika Goel ◽

Mayank Sharma ◽

Sunil Kumar Khatri ◽

D. Damodaran

Keyword(s):

Empirical Study ◽

Class Imbalance ◽

Defect Prediction ◽

Data Sampling ◽

Using Data ◽

Imbalance Learning ◽

Class Imbalance Learning ◽

Cross Project

Download Full-text

Implementation of Data Sampling in Class Imbalance Learning for Cross Project Defect Prediction: An Empirical Study

2018 Fifth International Symposium on Innovation in Information and Communication Technology (ISIICT) ◽

10.1109/isiict.2018.8613283 ◽

2018 ◽

Cited By ~ 2

Author(s):

Lipika Goel ◽

Mayank Sharma ◽

Sunil Kumar Khatri ◽

D. Damodaran

Keyword(s):

Empirical Study ◽

Class Imbalance ◽

Defect Prediction ◽

Data Sampling ◽

Imbalance Learning ◽

Class Imbalance Learning ◽

Cross Project

Download Full-text

Too trivial to test? An inverse view on defect prediction to identify methods with low fault risk

PeerJ Computer Science ◽

10.7717/peerj-cs.187 ◽

2019 ◽

Vol 5 ◽

pp. e187 ◽

Cited By ~ 1

Author(s):

Rainer Niedermayr ◽

Tobias Röhm ◽

Stefan Wagner

Keyword(s):

Empirical Study ◽

Association Rule ◽

Association Rule Mining ◽

Defect Prediction ◽

Efficient Allocation ◽

Rule Mining ◽

Scarce Resources ◽

Development Teams ◽

Code Metrics ◽

Cross Project

BackgroundTest resources are usually limited and therefore it is often not possible to completely test an application before a release. To cope with the problem of scarce resources, development teams can apply defect prediction to identify fault-prone code regions. However, defect prediction tends to low precision in cross-project prediction scenarios.AimsWe take an inverse view on defect prediction and aim to identify methods that can be deferred when testing because they contain hardly any faults due to their code being “trivial”. We expect that characteristics of such methods might be project-independent, so that our approach could improve cross-project predictions.MethodWe compute code metrics and apply association rule mining to create rules for identifying methods with low fault risk (LFR). We conduct an empirical study to assess our approach with six Java open-source projects containing precise fault data at the method level.ResultsOur results show that inverse defect prediction can identify approx. 32–44% of the methods of a project to have a LFR; on average, they are about six times less likely to contain a fault than other methods. In cross-project predictions with larger, more diversified training sets, identified methods are even 11 times less likely to contain a fault.ConclusionsInverse defect prediction supports the efficient allocation of test resources by identifying methods that can be treated with less priority in testing activities and is well applicable in cross-project prediction scenarios.

Download Full-text

Combined classifier for cross-project defect prediction: an extended empirical study

Frontiers of Computer Science ◽

10.1007/s11704-017-6015-y ◽

2018 ◽

Vol 12 (2) ◽

pp. 280-296 ◽

Cited By ~ 13

Author(s):

Yun Zhang ◽

David Lo ◽

Xin Xia ◽

Jianling Sun

Keyword(s):

Empirical Study ◽

Defect Prediction ◽

Combined Classifier ◽

Cross Project

Download Full-text

Simplify Your Neural Networks: An Empirical Study on Cross-Project Defect Prediction

10.1007/978-981-16-3728-5_7 ◽

2021 ◽

pp. 85-98

Author(s):

Ruchika Malhotra ◽

Abuzar Ahmed Khan ◽

Amrit Khera

Keyword(s):

Neural Networks ◽

Empirical Study ◽

Defect Prediction ◽

Cross Project

Download Full-text

Too trivial to test? An inverse view on defect prediction to identify methods with low fault risk

10.7287/peerj.preprints.27304v1 ◽

2018 ◽

Author(s):

Rainer Niedermayr ◽

Tobias Röhm ◽

Stefan Wagner

Keyword(s):

Empirical Study ◽

Association Rule ◽

Association Rule Mining ◽

Defect Prediction ◽

Efficient Allocation ◽

Rule Mining ◽

Scarce Resources ◽

Development Teams ◽

Code Metrics ◽

Cross Project

Background. Test resources are usually limited and therefore it is often not possible to completely test an application before a release. To cope with the problem of scarce resources, development teams can apply defect prediction to identify fault-prone code regions. However, defect prediction tends to low precision in cross-project prediction scenarios. Aims. We take an inverse view on defect prediction and aim to identify methods that can be deferred when testing because they contain hardly any faults due to their code being "trivial". We expect that characteristics of such methods might be project-independent, so that our approach could improve cross-project predictions. Method. We compute code metrics and apply association rule mining to create rules for identifying methods with low fault risk. We conduct an empirical study to assess our approach with six Java open-source projects containing precise fault data at the method level. Results. Our results show that inverse defect prediction can identify approx. 32-44% of the methods of a project to have a low fault risk; on average, they are about six times less likely to contain a fault than other methods. In cross-project predictions with larger, more diversified training sets, identified methods are even eleven times less likely to contain a fault. Conclusions. Inverse defect prediction supports the efficient allocation of test resources by identifying methods that can be treated with less priority in testing activities and is well applicable in cross-project prediction scenarios.

Download Full-text

An Empirical Study on the Effectiveness of Feature Selection for Cross-Project Defect Prediction

IEEE Access ◽

10.1109/access.2019.2895614 ◽

2019 ◽

Vol 7 ◽

pp. 35710-35718 ◽

Cited By ~ 5

Author(s):

Qiao Yu ◽

Junyan Qian ◽

Shujuan Jiang ◽

Zhenhua Wu ◽

Gongjie Zhang

Keyword(s):

Feature Selection ◽

Empirical Study ◽

Defect Prediction ◽

Selection For ◽

Cross Project

Download Full-text