Software Fault Proneness Prediction with Group Lasso Regression: On Factors that Affect Classification Performance

Clinical prediction of thrombectomy eligibility: A systematic review and 4-item decision tree

International Journal of Stroke ◽

10.1177/1747493018801225 ◽

2018 ◽

Vol 14 (5) ◽

pp. 530-539 ◽

Cited By ~ 5

Author(s):

Gaia T Koster ◽

T Truc My Nguyen ◽

Erik W van Zwet ◽

Bjarty L Garcia ◽

Hannah R Rowling ◽

...

Keyword(s):

Decision Tree ◽

Prediction Model ◽

External Validation ◽

Facial Asymmetry ◽

Decision Support Tool ◽

Patient Characteristics ◽

Group Lasso ◽

Lasso Regression ◽

Vessel Occlusion ◽

Support Tool

Background A clinical large anterior vessel occlusion (LAVO)-prediction scale could reduce treatment delays by allocating intra-arterial thrombectomy (IAT)-eligible patients directly to a comprehensive stroke center. Aim To subtract, validate and compare existing LAVO-prediction scales, and develop a straightforward decision support tool to assess IAT-eligibility. Methods We performed a systematic literature search to identify LAVO-prediction scales. Performance was compared in a prospective, multicenter validation cohort of the Dutch acute Stroke study (DUST) by calculating area under the receiver operating curves (AUROC). With group lasso regression analysis, we constructed a prediction model, incorporating patient characteristics next to National Institutes of Health Stroke Scale (NIHSS) items. Finally, we developed a decision tree algorithm based on dichotomized NIHSS items. Results We identified seven LAVO-prediction scales. From DUST, 1316 patients (35.8% LAVO-rate) from 14 centers were available for validation. FAST-ED and RACE had the highest AUROC (both >0.81, p < 0.01 for comparison with other scales). Group lasso analysis revealed a LAVO-prediction model containing seven NIHSS items (AUROC 0.84). With the GACE (Gaze, facial Asymmetry, level of Consciousness, Extinction/inattention) decision tree, LAVO is predicted (AUROC 0.76) for 61% of patients with assessment of only two dichotomized NIHSS items, and for all patients with four items. Conclusion External validation of seven LAVO-prediction scales showed AUROCs between 0.75 and 0.83. Most scales, however, appear too complex for Emergency Medical Services use with prehospital validation generally lacking. GACE is the first LAVO-prediction scale using a simple decision tree as such increasing feasibility, while maintaining high accuracy. Prehospital prospective validation is planned.

Download Full-text

Evaluation of Software Fault Proneness with a Support Vector Machine and Biomedical Applications

10.1201/9781003054405-4 ◽

2021 ◽

pp. 77-103

Author(s):

Renu Dalal ◽

Manju Khari ◽

Dimple Chandra

Keyword(s):

Support Vector Machine ◽

Biomedical Applications ◽

Support Vector ◽

Software Fault ◽

Fault Proneness

Download Full-text

Novel Grey Relational Feature Extraction Algorithm for Software Fault-Proneness Using BBO (B-GRA)

Arabian Journal for Science and Engineering ◽

10.1007/s13369-020-04445-2 ◽

2020 ◽

Vol 45 (4) ◽

pp. 2645-2662

Author(s):

Aarti ◽

Geeta Sikka ◽

Renu Dhir

Keyword(s):

Feature Extraction ◽

Feature Extraction Algorithm ◽

Extraction Algorithm ◽

Software Fault ◽

Grey Relational ◽

Fault Proneness

Download Full-text

Deriving models of software fault-proneness

Proceedings of the 14th international conference on Software engineering and knowledge engineering - SEKE '02 ◽

10.1145/568760.568824 ◽

2002 ◽

Cited By ~ 11

Author(s):

Giovanni Denaro ◽

Sandro Morasca ◽

Mauro Pezzè

Keyword(s):

Software Fault ◽

Fault Proneness

Download Full-text

Grey relational classification algorithm for software fault proneness with SOM clustering

International Journal of Data Mining Modelling and Management ◽

10.1504/ijdmmm.2020.10027275 ◽

2020 ◽

Vol 12 (1) ◽

pp. 28

Author(s):

Geeta Sikka ◽

Renu Dhir ◽

N.A. Aarti

Keyword(s):

Classification Algorithm ◽

Software Fault ◽

Som Clustering ◽

Grey Relational ◽

Fault Proneness ◽

Relational Classification

Download Full-text

Efficient prediction of software fault proneness modules using support vector machines and probabilistic neural networks

2011 Malaysian Conference in Software Engineering ◽

10.1109/mysec.2011.6140679 ◽

2011 ◽

Cited By ~ 8

Author(s):

Hamdi A. Al-Jamimi ◽

Lahouari Ghouti

Keyword(s):

Neural Networks ◽

Support Vector Machines ◽

Support Vector ◽

Probabilistic Neural Networks ◽

Vector Machines ◽

Efficient Prediction ◽

Software Fault ◽

Fault Proneness

Download Full-text

Improving the classification performance with group lasso-based ranking method in high dimensional correlated data

Journal of Theoretical and Computational Chemistry ◽

10.1142/s021963362040009x ◽

2020 ◽

Vol 19 (03) ◽

pp. 2040009

Author(s):

Abhijeet R Patil ◽

Bong-Jin Choi ◽

Sangjin Kim

Keyword(s):

Classification Accuracy ◽

Geometric Mean ◽

Classification Performance ◽

Group Lasso ◽

Correlated Data ◽

Support Vector ◽

Cpg Sites ◽

Highly Correlated ◽

Selection Operator ◽

Sensitivity Specificity

The high-throughput correlated DNA methylation (DNAmeth) dataset generated from Illumina Infinium Human Methylation 27 (IIHM 27K) BeadChip assay. In the DNAmeth data, there are several CpG sites for every gene, and these grouped CpG sites are highly correlated. Most of the current filtering-based ranking (FBR) methods do not consider the group correlation structures. Obtaining the significant features with the FBR methods and applying these features to the classifiers to attain the best classification accuracy in highly correlated DNAmeth data is a challenging task. In this research, we introduce a resampling of group least absolute shrinkage and selection operator (glasso) FBR method capable of ignoring the unrelated features in the data considering the group correlation among the features. The various classifiers, such as random forests (RF), Naive Bayes (NB), and support vector machines (SVM) with the significant CpGs obtained from the proposed resampling of group lasso-based ranking (RGLR) method helped to boost the classification accuracy. Through simulated and experimental prostate DNAmeth data, we showed that higher performance of accuracy, sensitivity, specificity, and geometric mean is achieved by ignoring the unimportant CpG sites through the RGLR method.

Download Full-text

A note on coding and standardization of categorical variables in (sparse) group lasso regression

Journal of Statistical Planning and Inference ◽

10.1016/j.jspi.2019.08.003 ◽

2020 ◽

Vol 206 ◽

pp. 1-11

Author(s):

Felicitas J. Detmer ◽

Juan Cebral ◽

Martin Slawski

Keyword(s):

Group Lasso ◽

Categorical Variables ◽

Lasso Regression ◽

Sparse Group Lasso

Download Full-text

Software fault proneness prediction: a comparative study between bagging, boosting, and stacking ensemble and base learner methods

International Journal of Data Analysis Techniques and Strategies ◽

10.1504/ijdats.2017.10003991 ◽

2017 ◽

Vol 9 (1) ◽

pp. 1 ◽

Cited By ~ 7

Author(s):

Iyad Alazzam ◽

Izzat Alsmadi ◽

Mohammed Akour

Keyword(s):

Comparative Study ◽

Software Fault ◽

Base Learner ◽

Fault Proneness

Download Full-text

Applying Machine Learning to Predict Software Fault Proneness Using Change Metrics, Static Code Metrics, and a Combination of Them

SoutheastCon 2018 ◽

10.1109/secon.2018.8478911 ◽

2018 ◽

Cited By ~ 3

Author(s):

Yasser Ali Alshehri ◽

Katerina Goseva-Popstojanova ◽

Dale G. Dzielski ◽

Thomas Devine

Keyword(s):

Machine Learning ◽

Code Metrics ◽

Change Metrics ◽

Software Fault ◽

Fault Proneness

Download Full-text