Empirical Evaluation of Mixed-Project Defect Prediction Models

Author(s):  
Burak Turhan ◽  
Ayse Tosun ◽  
Ayse Bener
Author(s):  
Faimison Porto ◽  
Adenilso Da Silva Simao

The defect prediction models can be a good tool on organizing the project´s test resources. The models can be constructed with two main goals: 1) to classify the software parts - defective or not; or 2) to rank the most defective parts in a decreasing order. However, not all companies maintain an appropriate set of historical defect data. In this case, a company can build an appropriate dataset from known external projects - called Cross-project Defect Prediction (CPDP).The CPDP models, however, present low prediction performances due to the heterogeneity of data. Recently, Instance Filtering methods were proposed in order to reduce this heterogeneity by selecting the most similar instances from the training dataset. Originally, the similarity is calculated based on all the available dataset features (or independent variables).We propose that using only the most relevant features on the similarity calculation can result in more accurate filtered datasets and better prediction performances. In this study we extend our previous work. We analyse both prediction goals - Classification and Ranking. We present an empirical evaluation of 41 different methods by associating Instance Filtering methods with Feature Selection methods. We used 36 versions of 11 open source projects on experiments.The results show similar evidences for both prediction goals. First, the defect prediction performance of CPDP models can be improved by associating Feature Selection and Instance Filtering. Second, no evaluated method presented general better performances. Indeed, the most appropriate method can vary according to the characteristics of the project being predicted.


2020 ◽  
Vol 25 (6) ◽  
pp. 5047-5083
Author(s):  
Abdul Ali Bangash ◽  
Hareem Sahar ◽  
Abram Hindle ◽  
Karim Ali

2019 ◽  
Vol 45 (7) ◽  
pp. 683-711 ◽  
Author(s):  
Chakkrit Tantithamthavorn ◽  
Shane McIntosh ◽  
Ahmed E. Hassan ◽  
Kenichi Matsumoto

Sign in / Sign up

Export Citation Format

Share Document