scholarly journals XGBoost: An Optimal Machine Learning Model with Just Structural Features to Discover MOF Adsorbents of Xe/Kr

ACS Omega ◽  
2021 ◽  
Author(s):  
Heng Liang ◽  
Kun Jiang ◽  
Tong-An Yan ◽  
Guang-Hui Chen
2021 ◽  
Vol 309 ◽  
pp. 01007
Author(s):  
B. Srınıvasa Rao

The present paperreports an optimal machine learning model for an effective prediction of cardiovascular diseases that uses the ensemble learning technique. The present research work gives an insight about the coherent way of combining Naive Bayes and Random Forest algorithm using ensemble technique. It also discusses how the present model is different from other traditional approaches. The present experimental results manifest that the present optimal machine learning model is more efficient than the other models.


2021 ◽  
Vol 12 ◽  
Author(s):  
Aron S. Talai ◽  
Jan Sedlacik ◽  
Kai Boelmans ◽  
Nils D. Forkert

Background: Patients with Parkinson's disease (PD) and progressive supranuclear palsy Richardson's syndrome (PSP-RS) often show overlapping clinical features, leading to misdiagnoses. The objective of this study was to investigate the feasibility and utility of using multi-modal MRI datasets for an automatic differentiation of PD patients, PSP-RS patients, and healthy control (HC) subjects.Material and Methods: T1-weighted, T2-weighted, and diffusion-tensor (DTI) MRI datasets from 45 PD patients, 20 PSP-RS patients, and 38 HC subjects were available for this study. Using an atlas-based approach, regional values of brain morphology (T1-weighted), brain iron metabolism (T2-weighted), and microstructural integrity (DTI) were measured and employed for feature selection and subsequent classification using combinations of various established machine learning methods.Results: The optimal machine learning model using regional morphology features only achieved a classification accuracy of 65% (67/103 correct classifications) differentiating PD patients, PSP-RS patients, and HC subjects. The optimal machine learning model using only quantitative T2 values performed slightly better and achieved an accuracy of 75.7% (78/103). The optimal classifier using DTI features alone performed considerably better with 95.1% accuracy (98/103). The optimal multi-modal classifier using all features also achieved an accuracy of 95.1% but required more features and achieved a slightly lower F1-score compared to the optimal model using DTI features alone.Conclusion: Machine learning models using multi-modal MRI perform significantly better than uni-modal machine learning models using morphological parameters based on T1-weighted MRI datasets alone or brain iron metabolism markers based on T2-weighted MRI datasets alone. However, machine learnig models using regional brain microstructural integrity metrics computed from DTI datasets perform similar to the optimal multi-modal machine learning model. Thus, given the results from this study cohort, it appears that morphology and brain iron metabolism markers may not provide additional value for classification compared to using DTI metrics alone.


2021 ◽  
Vol 9 ◽  
Author(s):  
Juan I. Di Filippo ◽  
Mariela Bollini ◽  
Claudio N. Cavasotto

The development of computational models for assessing the transfer of chemicals across the placental membrane would be of the utmost importance in drug discovery campaigns, in order to develop safe therapeutic options. We have developed a low-dimensional machine learning model capable of classifying compounds according to whether they can cross or not the placental barrier. To this aim, we compiled a database of 248 compounds with experimental information about their placental transfer, characterizing each compound with a set of ∼5.4 thousand descriptors, including physicochemical properties and structural features. We evaluated different machine learning classifiers and implemented a genetic algorithm, in a five cross validation scheme, to perform feature selection. The optimization was guided towards models displaying a low number of false positives (molecules that actually cross the placental barrier, but are predicted as not crossing it). A Linear Discriminant Analysis model trained with only four structural features resulted to be robust for this task, exhibiting only one false positive case across all testing folds. This model is expected to be useful in predicting placental drug transfer during pregnancy, and thus could be used as a filter for chemical libraries in virtual screening campaigns.


Author(s):  
Sayeed Rushd ◽  
Mohammad Tanvir Parvez ◽  
Majdi Adel Al-Faiad ◽  
Mohammed Monirul Islam

2018 ◽  
Author(s):  
Steen Lysgaard ◽  
Paul C. Jennings ◽  
Jens Strabo Hummelshøj ◽  
Thomas Bligaard ◽  
Tejs Vegge

A machine learning model is used as a surrogate fitness evaluator in a genetic algorithm (GA) optimization of the atomic distribution of Pt-Au nanoparticles. The machine learning accelerated genetic algorithm (MLaGA) yields a 50-fold reduction of required energy calculations compared to a traditional GA.


Author(s):  
Dhilsath Fathima.M ◽  
S. Justin Samuel ◽  
R. Hari Haran

Aim: This proposed work is used to develop an improved and robust machine learning model for predicting Myocardial Infarction (MI) could have substantial clinical impact. Objectives: This paper explains how to build machine learning based computer-aided analysis system for an early and accurate prediction of Myocardial Infarction (MI) which utilizes framingham heart study dataset for validation and evaluation. This proposed computer-aided analysis model will support medical professionals to predict myocardial infarction proficiently. Methods: The proposed model utilize the mean imputation to remove the missing values from the data set, then applied principal component analysis to extract the optimal features from the data set to enhance the performance of the classifiers. After PCA, the reduced features are partitioned into training dataset and testing dataset where 70% of the training dataset are given as an input to the four well-liked classifiers as support vector machine, k-nearest neighbor, logistic regression and decision tree to train the classifiers and 30% of test dataset is used to evaluate an output of machine learning model using performance metrics as confusion matrix, classifier accuracy, precision, sensitivity, F1-score, AUC-ROC curve. Results: Output of the classifiers are evaluated using performance measures and we observed that logistic regression provides high accuracy than K-NN, SVM, decision tree classifiers and PCA performs sound as a good feature extraction method to enhance the performance of proposed model. From these analyses, we conclude that logistic regression having good mean accuracy level and standard deviation accuracy compared with the other three algorithms. AUC-ROC curve of the proposed classifiers is analyzed from the output figure.4, figure.5 that logistic regression exhibits good AUC-ROC score, i.e. around 70% compared to k-NN and decision tree algorithm. Conclusion: From the result analysis, we infer that this proposed machine learning model will act as an optimal decision making system to predict the acute myocardial infarction at an early stage than an existing machine learning based prediction models and it is capable to predict the presence of an acute myocardial Infarction with human using the heart disease risk factors, in order to decide when to start lifestyle modification and medical treatment to prevent the heart disease.


Sign in / Sign up

Export Citation Format

Share Document