An Update on Statistical Boosting in Biomedicine

Machine Learning Approach to Model Rock Strength: Prediction and Variable Selection with Aid of Log Data

Rock Mechanics and Rock Engineering ◽

10.1007/s00603-020-02184-2 ◽

2020 ◽

Vol 53 (10) ◽

pp. 4691-4715 ◽

Cited By ~ 1

Author(s):

Mohammad Islam Miah ◽

Salim Ahmed ◽

Sohrab Zendehboudi ◽

Stephen Butt

Keyword(s):

Machine Learning ◽

Variable Selection ◽

Rock Strength ◽

Strength Prediction ◽

Learning Approach ◽

Log Data ◽

Machine Learning Approach

Download Full-text

A Naive Bayes machine learning approach to risk prediction using censored, time-to-event data

Statistics in Medicine ◽

10.1002/sim.6526 ◽

2015 ◽

Vol 34 (21) ◽

pp. 2941-2957 ◽

Cited By ~ 12

Author(s):

Julian Wolfson ◽

Sunayan Bandyopadhyay ◽

Mohamed Elidrisi ◽

Gabriela Vazquez-Benitez ◽

David M. Vock ◽

...

Keyword(s):

Machine Learning ◽

Risk Prediction ◽

Naive Bayes ◽

Naïve Bayes ◽

Learning Approach ◽

Event Data ◽

Time To Event ◽

Time To Event Data ◽

Machine Learning Approach

Download Full-text

A machine learning approach to open public comments for policymaking

Information Polity ◽

10.3233/ip-200256 ◽

2020 ◽

Vol 25 (4) ◽

pp. 433-448 ◽

Cited By ~ 1

Author(s):

Alex Ingrams

Keyword(s):

Machine Learning ◽

Latent Dirichlet Allocation ◽

Public Information ◽

Statistical Modelling ◽

The United States ◽

Digital Data ◽

Airport Security ◽

Learning Approach ◽

Machine Learning Approach ◽

Proposed Regulation

In this paper, the author argues that the conflict between the copious amount of digital data processed by public organisations and the need for policy-relevant insights to aid public participation constitutes a ‘public information paradox’. Machine learning (ML) approaches may offer one solution to this paradox through algorithms that transparently collect and use statistical modelling to provide insights for policymakers. Such an approach is tested in this paper. The test involves applying an unsupervised machine learning approach with latent Dirichlet allocation (LDA) analysis of thousands of public comments submitted to the United States Transport Security Administration (TSA) on a 2013 proposed regulation for the use of new full body imaging scanners in airport security terminals. The analysis results in salient topic clusters that could be used by policymakers to understand large amounts of text such as in an open public comments process. The results are compared with the actual final proposed TSA rule, and the author reflects on new questions raised for transparency by the implementation of ML in open rule-making processes.

Download Full-text

Extending Statistical Boosting

Methods of Information in Medicine ◽

10.3414/me13-01-0123 ◽

2014 ◽

Vol 53 (06) ◽

pp. 428-435 ◽

Cited By ~ 23

Author(s):

H. Binder ◽

O. Gefeller ◽

M. Schmid ◽

A. Mayr

Keyword(s):

Variable Selection ◽

Biomedical Research ◽

Statistical Models ◽

Statistical Modelling ◽

Gradient Boosting ◽

Model Choice ◽

Unified Framework ◽

Different Types ◽

Boosting Algorithms ◽

Substantial Interest

SummaryBackground: Boosting algorithms to simultaneously estimate and select predictor effects in statistical models have gained substantial interest during the last decade.Objectives: This review highlights recent methodological developments regarding boosting algorithms for statistical modelling especially focusing on topics relevant for biomedical research.Methods: We suggest a unified framework for gradient boosting and likelihood-based boosting (statistical boosting) which have been addressed separately in the literature up to now.Results: The methodological developments on statistical boosting during the last ten years can be grouped into three different lines of research: i) efforts to ensure variable selection leading to sparser models, ii) developments regarding different types of predictor effects and how to choose them, iii) approaches to extend the statistical boosting framework to new regression settings.Conclusions: Statistical boosting algorithms have been adapted to carry out unbiased variable selection and automated model choice during the fitting process and can nowadays be applied in almost any regression setting in combination with a large amount of different types of predictor effects.

Download Full-text

Modeling the Factors Associated with Mortality in Patients with Breast Cancer: A Machine Learning Approach

10.21203/rs.3.rs-57685/v1 ◽

2020 ◽

Author(s):

Mohammad Asghari Jafarabadi ◽

Zeynab Iraji ◽

Roya Dolatkhah ◽

Tohid Jafari Koshki

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Cause Of Death ◽

Additive Model ◽

P Value ◽

Learning Approach ◽

Time To Event ◽

Linear Effect ◽

Factors Associated ◽

Machine Learning Approach

Abstract Background: Breast cancer (BC) was the fifth leading cause of death worldwide in 2015 and the second leading cause of death in Iran in 2012. This study aimed to model the factors associated with mortality in patients with BC utilizing the machine learning approach.Methods: We used data of patients with primary BC during 2007-2016 in Tabriz, Iran. The data were analyzed using decision tree (DT), boosted tree (BT), random forest (RF), k-nearest neighbors (KNN) and generalized additive model (GAM) with inverse probability of censoring weighting (IPCW) technique to assess the risk factors of mortality. The models were compared by using diagnostic accuracy measures.Results: Accuracy of the models ranged from 76.0 to 93.0%, with sensitivity of 82.5-98.8% and specificity of 72.2-99.4%. The GAM fit the data best with accuracy of 93.0% (95% CI: [90.5, 95.0]), sensitivity of 98.8% (95% CI: [96.9, 99.7]) and specificity of 84.3% (95% CI: [78.8, 88.9]) where non-linear effect of age (p-value = 0.006), grade (p-value = 0.024) and time to event (p-value < 0.001) on mortality were significant. Conclusion: The GAM seems to be an optimal model for classifying the mortality in patients with BC. Considering the time to event, age and grade, as the prognostic factors obtained by GAM, more accurate prevention planning may be designed.

Download Full-text

Modeling The Factors Associated with Mortality in Patients with Breast Cancer: A Machine Learning Approach

10.21203/rs.3.rs-88922/v1 ◽

2020 ◽

Author(s):

Mohammad Asghari Jafarabadi ◽

Zaynab Iraji ◽

Roya Dolatkhah ◽

Tohid jafari koshki

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Cause Of Death ◽

Additive Model ◽

P Value ◽

Learning Approach ◽

Time To Event ◽

Linear Effect ◽

Factors Associated ◽

Machine Learning Approach

Abstract Background: Breast cancer (BC) was the fifth leading cause of death worldwide in 2015 and the second leading cause of death in Iran in 2012. This study aimed to model the factors associated with mortality in patients with BC utilizing the machine learning approach.Methods: We used data of patients with primary BC during 2007-2016 in Tabriz, Iran. The data were analyzed using decision tree (DT), boosted tree (BT), random forest (RF), k-nearest neighbors (KNN) and generalized additive model (GAM) with inverse probability of censoring weighting (IPCW) technique to assess the risk factors of mortality. The models were compared by using diagnostic accuracy measures.Results: Accuracy of the models ranged from 76.0 to 93.0%, with sensitivity of 82.5-98.8% and specificity of 72.2-99.4%. The GAM fit the data best with accuracy of 93.0% (95% CI: [90.5, 95.0]), sensitivity of 98.8% (95% CI: [96.9, 99.7]) and specificity of 84.3% (95% CI: [78.8, 88.9]) where non-linear effect of age (p-value = 0.006), grade (p-value = 0.024) and time to event (p-value < 0.001) on mortality were significant. Conclusion: The GAM seems to be an optimal model for classifying the mortality in patients with BC. Considering the time to event, age and grade, as the prognostic factors obtained by GAM, more accurate prevention planning may be designed.

Download Full-text

A Machine Learning Approach for High-Dimensional Time-to-Event Prediction With Application to Immunogenicity of Biotherapies in the ABIRISK Cohort

Frontiers in Immunology ◽

10.3389/fimmu.2020.00608 ◽

2020 ◽

Vol 11 ◽

Author(s):

Julianne Duhazé ◽

Signe Hässler ◽

Delphine Bachelet ◽

Aude Gleizes ◽

Salima Hacein-Bey-Abina ◽

...

Keyword(s):

Machine Learning ◽

High Dimensional ◽

Learning Approach ◽

Time To Event ◽

Event Prediction ◽

Machine Learning Approach

Download Full-text

Constructing and Validating Geographically Refined HAZUS-MH4 Hurricane Wind Risk Models: A Machine Learning Approach

Advances in Hurricane Engineering ◽

10.1061/9780784412626.092 ◽

2012 ◽

Cited By ~ 2

Author(s):

D. Subramanian ◽

J. Salazar ◽

L. Duenas-Osorio ◽

R. Stein

Keyword(s):

Machine Learning ◽

Learning Approach ◽

Risk Models ◽

Hurricane Wind ◽

Machine Learning Approach

Download Full-text

The impact of economic plans on the Chinese education system: a machine learning approach

CADMO ◽

10.3280/cad2018-001005 ◽

2018 ◽

pp. 37-49

Author(s):

Wenjun Lin ◽

Xuefu Xu ◽

Francesco Dell’Anna

Keyword(s):

Machine Learning ◽

Education System ◽

Learning Approach ◽

Chinese Education ◽

System A ◽

Machine Learning Approach ◽

The Impact

Download Full-text

Improving Bandwidth Utilization and Fairness between TCP Flows based on a Machine-learning Approach

IEEJ Transactions on Electronics Information and Systems ◽

10.1541/ieejeiss.133.1259 ◽

2013 ◽

Vol 133 (6) ◽

pp. 1259-1268

Author(s):

Akihiro Shiozu ◽

Syunji Yazaki ◽

K^|^ocirc;ki Abe

Keyword(s):

Machine Learning ◽

Learning Approach ◽

Bandwidth Utilization ◽

Machine Learning Approach

Download Full-text