scholarly journals Distribution Modeling and Factor Correlation Analysis of Landslides in the Large Fault Zone of the Western Qinling Mountains: A Machine Learning Algorithm

2021 ◽  
Vol 13 (24) ◽  
pp. 4990
Author(s):  
Tianjun Qi ◽  
Yan Zhao ◽  
Xingmin Meng ◽  
Wei Shi ◽  
Feng Qing ◽  
...  

The area comprising the Langma-Baiya fault zone (LBFZ) and the Bailongjiang fault zone (BFZ) in the Western Qinling Mountains in China is characterized by intensive, frequent, multi-type landslide disasters. The spatial distribution of landslides is affected by factors, such as geological structure, landforms, climate and human activities, and the distribution of landslides in turn affects the geomorphology, ecological environment and human activities. Here, we present the results of a detailed landslide inventory of the area, which recorded a total of 2765 landslides. The landslides are divided into three categories according to relative age, area, and type of movement. Sixteen factors related to geological structure, geomorphology, materials composition and human activities were selected and four machine learning algorithms were used to model the spatial distribution of landslides. The aim was to quantitatively evaluate the relationship between the spatial distribution of landslides and the contributing factors. Based on a comparison of model accuracy and the Receiver Operating Characteristic (ROC) curve, RandomForest (RF) (accuracy of 92%, area under the ROC of 0.97) and GradientBoosting (GB) (accuracy of 96%, area under the ROC curve of 0.97) were selected to predict the spatial distribution of unclassified landslides and classified landslides, respectively. The evaluation results reveal the following. The vegetation coverage index (NDVI) (correlation of 0.2, and the same below) and distance to road (DTR) (0.13) had the highest correlations with the distribution of unclassified landslides. NDVI (0.18) and the annual precipitation index (API) (0.14) had the highest correlations with the distribution of landslides of different ages. API (0.16), average slope (AS) (0.14) and NDVI (0.1) had the highest correlations with the landslide distribution on different scales. API (0.28) had the highest correlation with the landslide distribution based on different types of landslide movement.

Minerals ◽  
2021 ◽  
Vol 11 (8) ◽  
pp. 816
Author(s):  
Mohammad Jooshaki ◽  
Alona Nad ◽  
Simon Michaux

Machine learning is a subcategory of artificial intelligence, which aims to make computers capable of solving complex problems without being explicitly programmed. Availability of large datasets, development of effective algorithms, and access to the powerful computers have resulted in the unprecedented success of machine learning in recent years. This powerful tool has been employed in a plethora of science and engineering domains including mining and minerals industry. Considering the ever-increasing global demand for raw materials, complexities of the geological structure of ore deposits, and decreasing ore grade, high-quality and extensive mineralogical information is required. Comprehensive analyses of such invaluable information call for advanced and powerful techniques including machine learning. This paper presents a systematic review of the efforts that have been dedicated to the development of machine learning-based solutions for better utilizing mineralogical data in mining and mineral studies. To that end, we investigate the main reasons behind the superiority of machine learning in the relevant literature, machine learning algorithms that have been deployed, input data, concerned outputs, as well as the general trends in the subject area.


2021 ◽  
Author(s):  
Maaruf Hussain ◽  
Abduljamiu Amao ◽  
Khalid Al-Ramadan ◽  
Sunday Olatunji ◽  
Ardiansyah Negara

Abstract The knowledge of rock mechanical properties is critical to reducing drilling risk and maximizing well and reservoir productivity. Rock chemical composition, their spatial distribution, and porosity significantly influenced these properties. However, low porosity characterized unconventional reservoirs as such, geochemical properties considerably control their mechanical behavior. In this study, we used chemostratigraphy as a correlation tool to separate strata in highly homogenous formations where other traditional stratigraphic methods failed. In addition, we integrated the chemofacies output and reduced Young's modulus to outline predictable associations between facies and mechanical properties. Thus, providing better understanding of lithofacies-controlled changes in rock strength that are useful inputs for geomechanical models and completions stimulations.


2021 ◽  
Vol 10 (1) ◽  
pp. 99
Author(s):  
Sajad Yousefi

Introduction: Heart disease is often associated with conditions such as clogged arteries due to the sediment accumulation which causes chest pain and heart attack. Many people die due to the heart disease annually. Most countries have a shortage of cardiovascular specialists and thus, a significant percentage of misdiagnosis occurs. Hence, predicting this disease is a serious issue. Using machine learning models performed on multidimensional dataset, this article aims to find the most efficient and accurate machine learning models for disease prediction.Material and Methods: Several algorithms were utilized to predict heart disease among which Decision Tree, Random Forest and KNN supervised machine learning are highly mentioned. The algorithms are applied to the dataset taken from the UCI repository including 294 samples. The dataset includes heart disease features. To enhance the algorithm performance, these features are analyzed, the feature importance scores and cross validation are considered.Results: The algorithm performance is compared with each other, so that performance based on ROC curve and some criteria such as accuracy, precision, sensitivity and F1 score were evaluated for each model. As a result of evaluation, Accuracy, AUC ROC are 83% and 99% respectively for Decision Tree algorithm. Logistic Regression algorithm with accuracy and AUC ROC are 88% and 91% respectively has better performance than other algorithms. Therefore, these techniques can be useful for physicians to predict heart disease patients and prescribe them correctly.Conclusion: Machine learning technique can be used in medicine for analyzing the related data collections to a disease and its prediction. The area under the ROC curve and evaluating criteria related to a number of classifying algorithms of machine learning to evaluate heart disease and indeed, the prediction of heart disease is compared to determine the most appropriate classification. As a result of evaluation, better performance was observed in both Decision Tree and Logistic Regression models.


Forests ◽  
2021 ◽  
Vol 12 (4) ◽  
pp. 395
Author(s):  
Milan Koreň ◽  
Rastislav Jakuš ◽  
Martin Zápotocký ◽  
Ivan Barka ◽  
Jaroslav Holuša ◽  
...  

Machine learning algorithms (MLAs) are used to solve complex non-linear and high-dimensional problems. The objective of this study was to identify the MLA that generates an accurate spatial distribution model of bark beetle (Ips typographus L.) infestation spots. We first evaluated the performance of 2 linear (logistic regression, linear discriminant analysis), 4 non-linear (quadratic discriminant analysis, k-nearest neighbors classifier, Gaussian naive Bayes, support vector classification), and 4 decision trees-based MLAs (decision tree classifier, random forest classifier, extra trees classifier, gradient boosting classifier) for the study area (the Horní Planá region, Czech Republic) for the period 2003–2012. Each MLA was trained and tested on all subsets of the 8 explanatory variables (distance to forest damage spots from previous year, distance to spruce forest edge, potential global solar radiation, normalized difference vegetation index, spruce forest age, percentage of spruce, volume of spruce wood per hectare, stocking). The mean phi coefficient of the model generated by extra trees classifier (ETC) MLA with five explanatory variables for the period was significantly greater than that of most forest damage models generated by the other MLAs. The mean true positive rate of the best ETC-based model was 80.4%, and the mean true negative rate was 80.0%. The spatio-temporal simulations of bark beetle-infested forests based on MLAs and GIS tools will facilitate the development and testing of novel forest management strategies for preventing forest damage in general and bark beetle outbreaks in particular.


Author(s):  
B. Kalantar ◽  
N. Ueda ◽  
H. A. H. Al-Najjar ◽  
M. B. A. Gibril ◽  
U. S. Lay ◽  
...  

<p><strong>Abstract.</strong> Landslide is painstaking as one of the most prevalent and devastating forms of mass movement that affects man and his environment. The specific objective of this research paper is to investigate the application and performances of some selected machine learning algorithms (MLA) in landslide susceptibility mapping, in Dodangeh watershed, Iran. A 112 sample point of the past landslide, occurrence or inventory data was generated from the existing and field observations. In addition, fourteen landslide-conditioning parameters were derived from DEM and other topographic databases for the modelling process. These conditioning parameters include total curvature, profile curvature, plan curvature, slope, aspect, altitude, topographic wetness index (TWI), topographic roughness index (TRI), stream transport index (STI), stream power index (SPI), lithology, land use, distance to stream, distance to the fault. Meanwhile, factor analysis was employed to optimize the landslide conditioning parameters and the inventory data, by assessing the multi-collinearity effects and outlier detections respectively. The inventory data is divided into 70% (78) training dataset and 30% (34) test dataset for model validation. The receiver operating characteristics (ROC) curve or area under curve (AUC) value was used for assessing the model's performance. The findings reveal that TRI has 0.89 collinearity effect based on variance-inflated factor (VIF) and based on Gini factor optimization total curvature is not significant in the model development, therefore the two parameters are excluded from the modelling. All the selected MLAs (RF, BRT, and DT) shown promising performances on landslide susceptibility mapping in Dodangeh watershed, Iran. The ROC curve for training and validation for RF are 86% success rate and 83% prediction rate implies the best model performance compared to BRT and DT, with ROC curve of 72% and 70% prediction rate, respectively. In conclusion, RF could be the best algorithm for producing landslide susceptibility map, and such results could be adopted for the decision-making process to support land use planner for improving landslide risk assessment in similar environmental settings.</p>


Author(s):  
Giuseppe Nunnari

AbstractThis paper deals with the classification of volcanic activity into three classes, referred to as Quite, Strombolian and Paroxysm. The main purpose is to give a measure of the reliability with which such a classification, typically carried out by experts, can be performed by Machine Learning algorithms, by using the volcanic tremor as a feature. Both supervised and unsupervised methods are considered. It is experimentally shown that at least the Paroxysm activity can be reliably classified. Performances are rigorously assessed, in comparison with the classification made by expert volcanologists, in terms of popular indices such as the f1-score and the Area under the ROC curve (AuC). The work is basically a case study carried out on a dataset recorded in the area of the Mt Etna volcano. However, as volcanic tremor is a geophysical signal widely available, considered methods and strategies can be easily applied to similar volcanic areas.


2021 ◽  
Vol 8 ◽  
Author(s):  
Yun Han ◽  
Bo Wang ◽  
Jinjin Zhang ◽  
Su Zhou ◽  
Jun Dai ◽  
...  

Background: Population-based data on the risk assessment of newly diagnosed cervical cancer patients' bone metastasis (CCBM) are lacking. This study aimed to develop various predictive models to assess the risk of bone metastasis via machine learning algorithms.Materials and Methods: We retrospectively reviewed the CCBM patients from the Surveillance, Epidemiology, and End Results (SEER) database of the National Cancer Institute to risk factors of the presence of bone metastasis. Clinical usefulness was assessed by Akaike information criteria (AIC) and multiple machine learning algorithms based predictive models. Concordance index (C-index) and receiver operating characteristic (ROC) curve were used to define the predictive and discriminatory capacity of predictive models.Results: A total of 16 candidate variables were included to develop predictive models for bone metastasis by machine learning. The areas under the ROC curve (AUCs) of the random forest model (RF), generalized linear model (GL), support vector machine (SVM), eXtreme Gradient Boosting (XGBoost), artificial neutral network (ANN), decision tree (DT), and naive bayesian model (NBM) ranged from 0.85 to 0.93. The RF model with 10 variables was developed as the optimal predictive model. The weight of variables indicated the top seven factors were organ-site metastasis (liver, brain, and lung), TNM stage and age.Conclusions: Multiple machine learning based predictive models were developed to identify risk of bone metastasis in cervical cancer patients. By incorporating clinical characteristics and other candidate variables showed robust risk stratification for CCBM patients, and the RF predictive model performed best among these predictive models.


Sign in / Sign up

Export Citation Format

Share Document