Distribution Modeling and Factor Correlation Analysis of Landslides in the Large Fault Zone of the Western Qinling Mountains: A Machine Learning Algorithm

Tianjun Qi; Yan Zhao; Xingmin Meng; Wei Shi; Feng Qing; Guan Chen; Yi Zhang; Dongxia Yue; Fuyun Guo

doi:10.3390/rs13244990

Remote Sensing ◽

10.3390/rs13244990 ◽

2021 ◽

Vol 13 (24) ◽

pp. 4990

Author(s):

Tianjun Qi ◽

Yan Zhao ◽

Xingmin Meng ◽

Wei Shi ◽

Feng Qing ◽

...

Keyword(s):

Machine Learning ◽

Spatial Distribution ◽

Fault Zone ◽

ROC Curve ◽

Human Activities ◽

Geological Structure ◽

Machine Learning Algorithms ◽

Qinling Mountains ◽

Western Qinling ◽

Landslide Distribution

The area comprising the Langma-Baiya fault zone (LBFZ) and the Bailongjiang fault zone (BFZ) in the Western Qinling Mountains in China is characterized by intensive, frequent, multi-type landslide disasters. The spatial distribution of landslides is affected by factors such as geological structure, landforms, climate and human activities, and the distribution of landslides in turn affects the geomorphology, ecological environment and human activities. Here, we present the results of a detailed landslide inventory of the area, which recorded a total of 2765 landslides. The landslides are divided into three categories according to relative age, area, and type of movement. Sixteen factors related to geological structure, geomorphology, material composition and human activities were selected, and four machine learning algorithms were used to model the spatial distribution of landslides. The aim was to quantitatively evaluate the relationship between the spatial distribution of landslides and the contributing factors. Based on a comparison of model accuracy and the Receiver Operating Characteristic (ROC) curve, RandomForest (RF) (accuracy of 92%, area under the ROC curve of 0.97) and GradientBoosting (GB) (accuracy of 96%, area under the ROC curve of 0.97) were selected to predict the spatial distribution of unclassified landslides and classified landslides, respectively. The evaluation results reveal the following. The vegetation coverage index (NDVI) (correlation of 0.2; the values in parentheses below are likewise correlations) and distance to road (DTR) (0.13) had the highest correlations with the distribution of unclassified landslides. NDVI (0.18) and the annual precipitation index (API) (0.14) had the highest correlations with the distribution of landslides of different ages. API (0.16), average slope (AS) (0.14) and NDVI (0.1) had the highest correlations with the landslide distribution on different scales. API (0.28) had the highest correlation with the landslide distribution based on different types of landslide movement.
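
The abstract above ranks models by accuracy and by the area under the ROC curve. As a minimal sketch of how those two figures are computed, the following pure-Python example uses the pairwise (Mann-Whitney) formulation of AUC. The labels, scores and the 0.5 threshold are made-up toy values chosen for illustration; they are not data from the paper, and the function names are hypothetical.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of samples whose predicted label equals the true label."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via the pairwise (Mann-Whitney) formulation.

    AUC equals the probability that a randomly chosen positive sample gets a
    higher score than a randomly chosen negative one; ties count as one half.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = landslide cell, 0 = non-landslide cell.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]  # made-up model probabilities
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

print("accuracy:", accuracy(y_true, y_pred))
print("ROC AUC:", roc_auc(y_true, scores))
```

Accuracy depends on the chosen threshold while AUC does not, which is one reason two models can report different accuracies with the same AUC.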

A Systematic Review on the Application of Machine Learning in Exploiting Mineralogical Data in Mining and Mineral Industry

Minerals ◽

10.3390/min11080816 ◽

2021 ◽

Vol 11 (8) ◽

pp. 816

Author(s):

Mohammad Jooshaki ◽

Alona Nad ◽

Simon Michaux

Keyword(s):

Machine Learning ◽

Systematic Review ◽

Raw Materials ◽

Relevant Literature ◽

Geological Structure ◽

Subject Area ◽

Ore Deposits ◽

Machine Learning Algorithms ◽

Global Demand ◽

Ore Grade

Machine learning is a subcategory of artificial intelligence, which aims to make computers capable of solving complex problems without being explicitly programmed. The availability of large datasets, the development of effective algorithms, and access to powerful computers have resulted in the unprecedented success of machine learning in recent years. This powerful tool has been employed in a plethora of science and engineering domains, including the mining and minerals industry. Considering the ever-increasing global demand for raw materials, the complexity of the geological structure of ore deposits, and decreasing ore grades, high-quality and extensive mineralogical information is required. Comprehensive analyses of such invaluable information call for advanced and powerful techniques, including machine learning. This paper presents a systematic review of the efforts that have been dedicated to the development of machine learning-based solutions for better utilizing mineralogical data in mining and mineral studies. To that end, we investigate the main reasons behind the superiority of machine learning in the relevant literature, the machine learning algorithms that have been deployed, the input data, the outputs of concern, as well as the general trends in the subject area.

Integration of Chemofacies and Rock Mechanical Properties Using Machine Learning Algorithms: Implications for Geomechanics and Hydraulic Fracture Stimulations in Paleozoic Formations, Saudi Arabia

10.2118/205951-ms ◽

2021 ◽

Author(s):

Maaruf Hussain ◽

Abduljamiu Amao ◽

Khalid Al-Ramadan ◽

Sunday Olatunji ◽

Ardiansyah Negara

Keyword(s):

Machine Learning ◽

Mechanical Properties ◽

Saudi Arabia ◽

Spatial Distribution ◽

Chemical Composition ◽

Rock Strength ◽

Machine Learning Algorithms ◽

Unconventional Reservoirs ◽

Geochemical Properties ◽

Rock Mechanical Properties

Abstract: Knowledge of rock mechanical properties is critical to reducing drilling risk and maximizing well and reservoir productivity. Rock chemical composition, its spatial distribution, and porosity significantly influence these properties. However, unconventional reservoirs are characterized by low porosity; as such, geochemical properties considerably control their mechanical behavior. In this study, we used chemostratigraphy as a correlation tool to separate strata in highly homogeneous formations where other traditional stratigraphic methods failed. In addition, we integrated the chemofacies output and reduced Young's modulus to outline predictable associations between facies and mechanical properties. This provides a better understanding of lithofacies-controlled changes in rock strength, which are useful inputs for geomechanical models and completion stimulations.

Comparison of Machine Learning Algorithms to Recognize Human Activities from Images and Videos Using Pose Estimation and Feature Extraction

Proceedings of the Future Technologies Conference (FTC) 2020, Volume 1 - Advances in Intelligent Systems and Computing ◽

10.1007/978-3-030-63128-4_7 ◽

2020 ◽

pp. 78-87

Author(s):

Md Hasibul Huq ◽

Mohammed Alnakli ◽

Zakiya Jafrin ◽

Tanjima Nasreen Jenia

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Pose Estimation ◽

Human Activities ◽

Learning Algorithms ◽

Machine Learning Algorithms

Comparison of the Performance of Machine Learning Algorithms in Predicting Heart Disease

Frontiers in Health Informatics ◽

10.30699/fhi.v10i1.349 ◽

2021 ◽

Vol 10 (1) ◽

pp. 99

Author(s):

Sajad Yousefi

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Heart Disease ◽

Decision Tree ◽

ROC Curve ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Learning Models ◽

Algorithm Performance ◽

Machine Learning Models

Introduction: Heart disease is often associated with conditions such as clogged arteries caused by sediment accumulation, which leads to chest pain and heart attack. Many people die of heart disease every year. Most countries have a shortage of cardiovascular specialists, and thus a significant percentage of cases are misdiagnosed. Hence, predicting this disease is a serious issue. By applying machine learning models to a multidimensional dataset, this article aims to find the most efficient and accurate machine learning models for disease prediction. Material and Methods: Several algorithms were used to predict heart disease, among which the supervised machine learning algorithms Decision Tree, Random Forest and KNN are frequently mentioned. The algorithms are applied to a dataset of 294 samples taken from the UCI repository. The dataset includes heart disease features. To enhance algorithm performance, these features are analyzed, and feature importance scores and cross-validation are considered. Results: The performance of the algorithms is compared based on the ROC curve and on criteria such as accuracy, precision, sensitivity and F1 score, which were evaluated for each model. The Decision Tree algorithm achieved an accuracy of 83% and an AUC ROC of 99%. The Logistic Regression algorithm, with an accuracy of 88% and an AUC ROC of 91%, performed better than the other algorithms. Therefore, these techniques can be useful for physicians to predict heart disease in patients and to prescribe for them correctly. Conclusion: Machine learning techniques can be used in medicine to analyze data collections related to a disease and to predict it. The area under the ROC curve and other evaluation criteria of several machine learning classification algorithms were compared for heart disease prediction to determine the most appropriate classifier. As a result of the evaluation, better performance was observed in both the Decision Tree and Logistic Regression models.
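
The evaluation criteria named in this abstract (accuracy, precision, sensitivity and F1 score) all derive from the four cells of a binary confusion matrix. The sketch below shows those definitions in plain Python. The function name `binary_metrics` and the toy labels are illustrative assumptions and do not reproduce the study's data or code.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute accuracy, precision, sensitivity (recall) and F1 for labels in {0, 1}."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + sensitivity
    f1 = 2 * precision * sensitivity / denom if denom else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "sensitivity": sensitivity,
        "f1": f1,
    }


# Hypothetical toy labels: 1 = heart disease, 0 = no heart disease.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

In a diagnostic setting, sensitivity is often watched most closely because a false negative corresponds to a missed patient.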

Geological mapping using remote sensing data: A comparison of five machine learning algorithms, their response to variations in the spatial distribution of training data and the use of explicit spatial information

Computers & Geosciences ◽

10.1016/j.cageo.2013.10.008 ◽

2014 ◽

Vol 63 ◽

pp. 22-33 ◽

Cited By ~ 203

Author(s):

Matthew J. Cracknell ◽

Anya M. Reading

Keyword(s):

Machine Learning ◽

Remote Sensing ◽

Spatial Distribution ◽

Spatial Information ◽

Learning Algorithms ◽

Remote Sensing Data ◽

Geological Mapping ◽

Machine Learning Algorithms ◽

Training Data ◽

Sensing Data

Assessment of Machine Learning Algorithms for Modeling the Spatial Distribution of Bark Beetle Infestation

Forests ◽

10.3390/f12040395 ◽

2021 ◽

Vol 12 (4) ◽

pp. 395

Author(s):

Milan Koreň ◽

Rastislav Jakuš ◽

Martin Zápotocký ◽

Ivan Barka ◽

Jaroslav Holuša ◽

...

Keyword(s):

Machine Learning ◽

Spatial Distribution ◽

Discriminant Analysis ◽

Bark Beetle ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Forest Damage ◽

Spruce Forest ◽

Explanatory Variables ◽

The Mean

Machine learning algorithms (MLAs) are used to solve complex non-linear and high-dimensional problems. The objective of this study was to identify the MLA that generates an accurate spatial distribution model of bark beetle (Ips typographus L.) infestation spots. We first evaluated the performance of 2 linear (logistic regression, linear discriminant analysis), 4 non-linear (quadratic discriminant analysis, k-nearest neighbors classifier, Gaussian naive Bayes, support vector classification), and 4 decision trees-based MLAs (decision tree classifier, random forest classifier, extra trees classifier, gradient boosting classifier) for the study area (the Horní Planá region, Czech Republic) for the period 2003–2012. Each MLA was trained and tested on all subsets of the 8 explanatory variables (distance to forest damage spots from previous year, distance to spruce forest edge, potential global solar radiation, normalized difference vegetation index, spruce forest age, percentage of spruce, volume of spruce wood per hectare, stocking). The mean phi coefficient of the model generated by extra trees classifier (ETC) MLA with five explanatory variables for the period was significantly greater than that of most forest damage models generated by the other MLAs. The mean true positive rate of the best ETC-based model was 80.4%, and the mean true negative rate was 80.0%. The spatio-temporal simulations of bark beetle-infested forests based on MLAs and GIS tools will facilitate the development and testing of novel forest management strategies for preventing forest damage in general and bark beetle outbreaks in particular.
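
Two steps in this abstract lend themselves to a short sketch: enumerating all subsets of the 8 explanatory variables, and scoring a model with the phi coefficient alongside the true positive and true negative rates. The example below assumes the empty subset is excluded, which gives 255 subsets; the abstract does not state this. The confusion-matrix counts are made-up round numbers, not the study's results, and the variable names are shortened labels for the variables listed above.

```python
from itertools import combinations
from math import sqrt
from typing import Iterator, Sequence, Tuple

# Shortened, illustrative labels for the 8 explanatory variables in the abstract.
VARIABLES = [
    "dist_prev_damage", "dist_spruce_edge", "solar_radiation", "ndvi",
    "spruce_age", "spruce_percentage", "spruce_volume_per_ha", "stocking",
]


def nonempty_subsets(items: Sequence[str]) -> Iterator[Tuple[str, ...]]:
    """Yield every non-empty subset of items, smallest subsets first."""
    for k in range(1, len(items) + 1):
        yield from combinations(items, k)


def phi_coefficient(tp: int, fp: int, fn: int, tn: int) -> float:
    """Phi (Matthews) correlation between predicted and observed binary labels."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


subsets = list(nonempty_subsets(VARIABLES))
print("number of candidate variable subsets:", len(subsets))

# Hypothetical confusion-matrix counts for one candidate model.
tp, fn, tn, fp = 80, 20, 80, 20
print("true positive rate:", tp / (tp + fn))
print("true negative rate:", tn / (tn + fp))
print("phi coefficient:", phi_coefficient(tp, fp, fn, tn))
```

Unlike accuracy, the phi coefficient stays informative when infested and non-infested cells are unbalanced, which may be why it suits a model-selection loop of this kind.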

AN EVALUATION OF LANDSLIDE SUSCEPTIBILITY MAPPING USING REMOTE SENSING DATA AND MACHINE LEARNING ALGORITHMS IN IRAN

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-2-w5-503-2019 ◽

2019 ◽

Vol IV-2/W5 ◽

pp. 503-511 ◽

Cited By ~ 2

Author(s):

B. Kalantar ◽

N. Ueda ◽

H. A. H. Al-Najjar ◽

M. B. A. Gibril ◽

U. S. Lay ◽

...

Keyword(s):

Machine Learning ◽

Land Use ◽

ROC Curve ◽

Landslide Susceptibility ◽

Total Curvature ◽

Susceptibility Mapping ◽

Machine Learning Algorithms ◽

Landslide Susceptibility Mapping ◽

Inventory Data ◽

Prediction Rate

Abstract. Landslides are regarded as one of the most prevalent and devastating forms of mass movement affecting people and their environment. The specific objective of this research paper is to investigate the application and performance of selected machine learning algorithms (MLAs) in landslide susceptibility mapping in the Dodangeh watershed, Iran. An inventory of 112 sample points of past landslide occurrences was generated from existing data and field observations. In addition, fourteen landslide-conditioning parameters were derived from a DEM and other topographic databases for the modelling process. These conditioning parameters include total curvature, profile curvature, plan curvature, slope, aspect, altitude, topographic wetness index (TWI), topographic roughness index (TRI), stream transport index (STI), stream power index (SPI), lithology, land use, distance to stream, and distance to the fault. Meanwhile, factor analysis was employed to optimize the landslide conditioning parameters and the inventory data by assessing multi-collinearity effects and detecting outliers, respectively. The inventory data were divided into a 70% (78) training dataset and a 30% (34) test dataset for model validation. The receiver operating characteristic (ROC) curve, or area under the curve (AUC) value, was used to assess model performance. The findings reveal that TRI has a collinearity effect of 0.89 based on the variance inflation factor (VIF) and that, based on Gini factor optimization, total curvature is not significant in the model development; therefore, the two parameters were excluded from the modelling. All the selected MLAs (RF, BRT, and DT) showed promising performance in landslide susceptibility mapping in the Dodangeh watershed, Iran. For RF, the ROC curves for training and validation give an 86% success rate and an 83% prediction rate, which implies the best model performance compared with BRT and DT, whose ROC curves give prediction rates of 72% and 70%, respectively. In conclusion, RF could be the best algorithm for producing landslide susceptibility maps, and such results could be adopted in the decision-making process to support land use planners in improving landslide risk assessment in similar environmental settings.
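
The 70%/30% division of the 112 inventory points into 78 training and 34 test samples can be sketched as a seeded random split. The abstract does not say how the split was drawn or how the 70% was rounded, so the shuffling, the seed and the truncation in `split_inventory` (a hypothetical helper) are assumptions; truncation happens to give the same 78/34 counts.

```python
import random
from typing import List, Sequence, Tuple


def split_inventory(
    points: Sequence[int], train_fraction: float = 0.7, seed: int = 0
) -> Tuple[List[int], List[int]]:
    """Shuffle the inventory reproducibly and cut it into training and test parts."""
    shuffled = list(points)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is repeatable
    n_train = int(train_fraction * len(shuffled))  # truncates 78.4 to 78
    return shuffled[:n_train], shuffled[n_train:]


# Stand-in identifiers for the 112 landslide inventory points.
points = list(range(112))
train, test = split_inventory(points)
print(len(train), len(test))  # prints "78 34"
```

Keeping the test points out of all model fitting and parameter screening is what lets the prediction rate be read as an estimate of performance on unseen locations.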

Machine Learning Algorithms with ROC Curve for Predicting and Diagnosing the Heart Disease

Soft Computing and Medical Bioinformatics - SpringerBriefs in Applied Sciences and Technology ◽

10.1007/978-981-13-0059-2_8 ◽

2018 ◽

pp. 63-72 ◽

Cited By ~ 10

Author(s):

R. Kannan ◽

V. Vasanthi

Keyword(s):

Machine Learning ◽

Heart Disease ◽

ROC Curve ◽

Learning Algorithms ◽

Machine Learning Algorithms

Clustering activity at Mt Etna based on volcanic tremor: A case study

Earth Science Informatics ◽

10.1007/s12145-021-00606-5 ◽

2021 ◽

Author(s):

Giuseppe Nunnari

Keyword(s):

Machine Learning ◽

Volcanic Activity ◽

ROC Curve ◽

Volcanic Tremor ◽

Machine Learning Algorithms ◽

Etna Volcano ◽

Mt Etna ◽

Geophysical Signal

Abstract: This paper deals with the classification of volcanic activity into three classes, referred to as Quiet, Strombolian and Paroxysm. The main purpose is to give a measure of the reliability with which such a classification, typically carried out by experts, can be performed by machine learning algorithms using the volcanic tremor as a feature. Both supervised and unsupervised methods are considered. It is experimentally shown that at least the Paroxysm activity can be reliably classified. Performance is rigorously assessed, in comparison with the classification made by expert volcanologists, in terms of popular indices such as the F1 score and the area under the ROC curve (AUC). The work is essentially a case study carried out on a dataset recorded in the area of the Mt Etna volcano. However, as volcanic tremor is a widely available geophysical signal, the methods and strategies considered can easily be applied to similar volcanic areas.

Risk Assessment of Bone Metastasis for Cervical Cancer Patients by Multiple Models: A Large Population Based Real-World Study

Frontiers in Medicine ◽

10.3389/fmed.2021.725298 ◽

2021 ◽

Vol 8 ◽

Author(s):

Yun Han ◽

Bo Wang ◽

Jinjin Zhang ◽

Su Zhou ◽

Jun Dai ◽

...

Keyword(s):

Machine Learning ◽

Risk Assessment ◽

Cervical Cancer ◽

Bone Metastasis ◽

Predictive Model ◽

Cancer Patients ◽

ROC Curve ◽

Predictive Models ◽

Population Based ◽

Machine Learning Algorithms

Background: Population-based data on the risk assessment of bone metastasis in newly diagnosed cervical cancer patients (CCBM) are lacking. This study aimed to develop various predictive models to assess the risk of bone metastasis via machine learning algorithms. Materials and Methods: We retrospectively reviewed the CCBM patients from the Surveillance, Epidemiology, and End Results (SEER) database of the National Cancer Institute to identify risk factors for the presence of bone metastasis. Clinical usefulness was assessed by the Akaike information criterion (AIC) and by predictive models based on multiple machine learning algorithms. The concordance index (C-index) and the receiver operating characteristic (ROC) curve were used to define the predictive and discriminatory capacity of the predictive models. Results: A total of 16 candidate variables were included to develop predictive models for bone metastasis by machine learning. The areas under the ROC curve (AUCs) of the random forest model (RF), generalized linear model (GL), support vector machine (SVM), eXtreme Gradient Boosting (XGBoost), artificial neural network (ANN), decision tree (DT), and naive Bayesian model (NBM) ranged from 0.85 to 0.93. The RF model with 10 variables was developed as the optimal predictive model. The variable weights indicated that the top seven factors were organ-site metastasis (liver, brain, and lung), TNM stage and age. Conclusions: Multiple machine learning-based predictive models were developed to identify the risk of bone metastasis in cervical cancer patients. Incorporating clinical characteristics and other candidate variables yielded robust risk stratification for CCBM patients, and the RF predictive model performed best among these predictive models.
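
The Akaike information criterion mentioned in this abstract trades goodness of fit against model size through the standard formula AIC = 2k - 2 ln(L), where k is the number of fitted parameters and L is the maximized likelihood. The sketch below evaluates it for a binary outcome with a Bernoulli likelihood. The outcome labels, predicted probabilities and parameter counts are invented for illustration, and the abstract does not describe how the study applied AIC.

```python
from math import log
from typing import Sequence


def bernoulli_log_likelihood(y_true: Sequence[int], probs: Sequence[float]) -> float:
    """Log-likelihood of binary outcomes under predicted event probabilities."""
    eps = 1e-12  # clip probabilities to avoid log(0)
    total = 0.0
    for y, p in zip(y_true, probs):
        p = min(max(p, eps), 1.0 - eps)
        total += y * log(p) + (1 - y) * log(1.0 - p)
    return total


def aic(log_likelihood: float, n_params: int) -> float:
    """Akaike information criterion: lower values indicate a better trade-off."""
    return 2 * n_params - 2 * log_likelihood


# Hypothetical outcomes (1 = bone metastasis) and two made-up models.
y_true = [1, 0, 0, 1, 0, 0]
probs_small = [0.6, 0.3, 0.4, 0.6, 0.3, 0.4]   # assumed model with 3 parameters
probs_large = [0.7, 0.3, 0.3, 0.7, 0.2, 0.4]   # assumed model with 10 parameters

aic_small = aic(bernoulli_log_likelihood(y_true, probs_small), n_params=3)
aic_large = aic(bernoulli_log_likelihood(y_true, probs_large), n_params=10)
print("AIC (3 parameters):", aic_small)
print("AIC (10 parameters):", aic_large)
```

In this toy case the larger model fits slightly better but is penalized for its extra parameters, which is the kind of comparison AIC is meant to make explicit.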
