scholarly journals Gully Erosion Susceptibility Assessment in the Kondoran Watershed Using Machine Learning Algorithms and the Boruta Feature Selection

2021 ◽  
Vol 13 (18) ◽  
pp. 10110
Author(s):  
Hamed Ahmadpour ◽  
Ommolbanin Bazrafshan ◽  
Elham Rafiei-Sardooi ◽  
Hossein Zamani ◽  
Thomas Panagopoulos

Gully erosion susceptibility mapping is an essential land management tool to reduce soil erosion damages. This study investigates gully susceptibility based on multiple diagnostic analysis, support vector machine and random forest algorithms, and also a combination of these models, namely the ensemble model. Thus, a gully susceptibility map in the Kondoran watershed of Iran was generated by applying these models on the occurrence and non-occurrence points (as the target variable) and several predictors (slope, aspect, elevation, topographic wetness index, drainage density, plan curvature, distance to streams, lithology, soil texture and land use). The Boruta algorithm was used to select the most effective variables in modeling gully erosion susceptibility. The area under the receiver operating characteristic curve (AUC), the receiver operating characteristics, and true skill statistics (TSS) were used to assess the model performance. The results indicated that the ensemble model had the best performance (AUC = 0.982, TSS = 0.93) compared to the others. The most effective factors in gully erosion susceptibility mapping of the study region were topological, anthropogenic, and geological. The methodology and variables of this study can be used in other regions to control and mitigate the gully erosion phenomenon by applying biophilic and regenerative techniques at the locations of the most influential factors.

Entropy ◽  
2018 ◽  
Vol 20 (11) ◽  
pp. 884 ◽  
Author(s):  
Tingyu Zhang ◽  
Ling Han ◽  
Wei Chen ◽  
Himan Shahabi

The main purpose of the present study is to apply three classification models, namely, the index of entropy (IOE) model, the logistic regression (LR) model, and the support vector machine (SVM) model by radial basis function (RBF), to produce landslide susceptibility maps for the Fugu County of Shaanxi Province, China. Firstly, landslide locations were extracted from field investigation and aerial photographs, and a total of 194 landslide polygons were transformed into points to produce a landslide inventory map. Secondly, the landslide points were randomly split into two groups (70/30) for training and validation purposes, respectively. Then, 10 landslide explanatory variables, such as slope aspect, slope angle, altitude, lithology, mean annual precipitation, distance to roads, distance to rivers, distance to faults, land use, and normalized difference vegetation index (NDVI), were selected and the potential multicollinearity problems between these factors were detected by the Pearson Correlation Coefficient (PCC), the variance inflation factor (VIF), and tolerance (TOL). Subsequently, the landslide susceptibility maps for the study region were obtained using the IOE model, the LR–IOE, and the SVM–IOE model. Finally, the performance of these three models was verified and compared using the receiver operating characteristics (ROC) curve. The success rate results showed that the LR–IOE model has the highest accuracy (90.11%), followed by the IOE model (87.43%) and the SVM–IOE model (86.53%). Similarly, the AUC values also showed that the prediction accuracy expresses a similar result, with the LR–IOE model having the highest accuracy (81.84%), followed by the IOE model (76.86%) and the SVM–IOE model (76.61%). Thus, the landslide susceptibility map (LSM) for the study region can provide an effective reference for the Fugu County government to properly address land planning and mitigate landslide risk.


2021 ◽  
Author(s):  
Hossein Hamedi Sorajar ◽  
Ali Asghar Alesheikh ◽  
Mahdi Panahi ◽  
Saro Lee

Abstract Landslides are one of the most destructive natural phenomena in the world, which occur mostly in mountainous areas and cause damage to the economic sectors, agricultural lands, residential areas and infrastructures of any country, and also threaten the lives and property of human beings. Therefore, landslide susceptibility mapping (LSM) can play a critical role in identifying prone areas and reducing the damage caused by landslides in each area. In the present study, deep learning algorithms including convolutional neural network (CNN) and long short-term memory (LSTM) were used to identify landslide prone areas in Ardabil province, Iran. Equql to 312 landslide locations were identified and randomly divided into train and test datasets at 70–30% ratios. Then, according to previous studies and environmental conditions in the study area, twelve factors affecting the occurrence of landslides were selected, namely altitude, slope angle, slope aspect, topographic wetness index (TWI), profile curvature, plan curvature, land-use, lithology, distance to faults, distance to rivers, distance to roads, and rainfall. The ratio of the importance of each influential factor in landslide occurrence was obtained through information gain ranking filter (IGRF) method and it was found that land-use and profile curvature had the highest and lowest impacts, respectively. Afterwards, LSMs were generated using CNN and LSTM algorithms. In the next step, the performance of the models was evaluated based on the area under curve (AUC) value of receiver operating characteristics curve and the root mean square error (RMSE) method. The AUC values for CNN and LSTM models were 0.821 and 0.832, respectively. Furthermore, the RMSE values in the CNN model for each of the training and testing dataset were 0.121 and 0.132, respectively. The RMSE values in the LSTM model for each of the training and testing dataset were 0.185 and 0.188, respectively. Therefore, it can be concluded that CNN performance is slightly better than LSTM; but in general, both models have close performance and the accuracy of both models is acceptable.


Sensors ◽  
2019 ◽  
Vol 19 (22) ◽  
pp. 4893 ◽  
Author(s):  
Hejar Shahabi ◽  
Ben Jarihani ◽  
Sepideh Tavakkoli Piralilou ◽  
David Chittleborough ◽  
Mohammadtaghi Avand ◽  
...  

Gully erosion is a dominant source of sediment and particulates to the Great Barrier Reef (GBR) World Heritage area. We selected the Bowen catchment, a tributary of the Burdekin Basin, as our area of study; the region is associated with a high density of gully networks. We aimed to use a semi-automated object-based gully networks detection process using a combination of multi-source and multi-scale remote sensing and ground-based data. An advanced approach was employed by integrating geographic object-based image analysis (GEOBIA) with current machine learning (ML) models. These included artificial neural networks (ANN), support vector machines (SVM), and random forests (RF), and an ensemble ML model of stacking to deal with the spatial scaling problem in gully networks detection. Spectral indices such as the normalized difference vegetation index (NDVI) and topographic conditioning factors, such as elevation, slope, aspect, topographic wetness index (TWI), slope length (SL), and curvature, were generated from Sentinel 2A images and the ALOS 12-m digital elevation model (DEM), respectively. For image segmentation, the ESP2 tool was used to obtain three optimal scale factors. On using object pureness index (OPI), object matching index (OMI), and object fitness index (OFI), the accuracy of each scale in image segmentation was evaluated. The scale parameter of 45 with OFI of 0.94, which is a combination of OPI and OMI indices, proved to be the optimal scale parameter for image segmentation. Furthermore, segmented objects based on scale 45 were overlaid with 70% and 30% of a prepared gully inventory map to select the ML models’ training and testing objects, respectively. The quantitative accuracy assessment methods of Precision, Recall, and an F1 measure were used to evaluate the model’s performance. Integration of GEOBIA with the stacking model using a scale of 45 resulted in the highest accuracy in detection of gully networks with an F1 measure value of 0.89. Here, we conclude that the adoption of optimal scale object definition in the GEOBIA and application of the ensemble stacking of ML models resulted in higher accuracy in the detection of gully networks.


2018 ◽  
Vol 8 (8) ◽  
pp. 1369 ◽  
Author(s):  
Alireza Arabameri ◽  
Biswajeet Pradhan ◽  
Hamid Reza Pourghasemi ◽  
Khalil Rezaei ◽  
Norman Kerle

Gully erosion triggers land degradation and restricts the use of land. This study assesses the spatial relationship between gully erosion (GE) and geo-environmental variables (GEVs) using Weights-of-Evidence (WoE) Bayes theory, and then applies three data mining methods—Random Forest (RF), boosted regression tree (BRT), and multivariate adaptive regression spline (MARS)—for gully erosion susceptibility mapping (GESM) in the Shahroud watershed, Iran. Gully locations were identified by extensive field surveys, and a total of 172 GE locations were mapped. Twelve gully-related GEVs: Elevation, slope degree, slope aspect, plan curvature, convergence index, topographic wetness index (TWI), lithology, land use/land cover (LU/LC), distance from rivers, distance from roads, drainage density, and NDVI were selected to model GE. The results of variables importance by RF and BRT models indicated that distance from road, elevation, and lithology had the highest effect on GE occurrence. The area under the curve (AUC) and seed cell area index (SCAI) methods were used to validate the three GE maps. The results showed that AUC for the three models varies from 0.911 to 0.927, whereas the RF model had a prediction accuracy of 0.927 as per SCAI values, when compared to the other models. The findings will be of help for planning and developing the studied region.


Sensors ◽  
2019 ◽  
Vol 19 (16) ◽  
pp. 3451 ◽  
Author(s):  
Usman Salihu Lay ◽  
Biswajeet Pradhan ◽  
Zainuddin Bin Md Yusoff ◽  
Ahmad Fikri Bin Abdallah ◽  
Jagannath Aryal ◽  
...  

Cameron Highland is a popular tourist hub in the mountainous area of Peninsular Malaysia. Most communities in this area suffer frequent incidence of debris flow, especially during monsoon seasons. Despite the loss of lives and properties recorded annually from debris flow, most studies in the region concentrate on landslides and flood susceptibilities. In this study, debris-flow susceptibility prediction was carried out using two data mining techniques; Multivariate Adaptive Regression Splines (MARS) and Support Vector Regression (SVR) models. The existing inventory of debris-flow events (640 points) were selected for training 70% (448) and validation 30% (192). Twelve conditioning factors namely; elevation, plan-curvature, slope angle, total curvature, slope aspect, Stream Transport Index (STI), profile curvature, roughness index, Stream Catchment Area (SCA), Stream Power Index (SPI), Topographic Wetness Index (TWI) and Topographic Position Index (TPI) were selected from Light Detection and Ranging (LiDAR)-derived Digital Elevation Model (DEM) data. Multi-collinearity was checked using Information Factor, Cramer’s V, and Gini Index to identify the relative importance of conditioning factors. The susceptibility models were produced and categorized into five classes; not-susceptible, low, moderate, high and very-high classes. Models performances were evaluated using success and prediction rates where the area under the curve (AUC) showed a higher performance of MARS (93% and 83%) over SVR (76% and 72%). The result of this study will be important in contingency hazards and risks management plans to reduce the loss of lives and properties in the area.


2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Pawan Gautam ◽  
Tetsuya Kubota ◽  
Aril Aditian

AbstractThe main objective of this study is to understand the overall impact of earthquake in upper Indrawati Watershed, located in the high mountainous region of Nepal. Hence, we have assessed the relationship between the co-seismic landslide and underlying causative factors as well as performed landslide susceptibility mapping (LSM) to identify the landslide susceptible zone in the study area. We assessed the landslides distribution in terms of density, number, and area within 85 classes of 13 causal factors including slope, aspect, elevation, formation, land cover, distance to road and river, soil type, total curvature, seismic intensity, topographic wetness index, distance to fault, and flow accumulation. The earthquake-induced landslide is clustered in Northern region of the study area, which is dominated by steep rocky slope, forested land, and low human density. Among the causal factors, 'slope' showed positive correlation for landslide occurrence. Increase in slope in the study area also escalates the landslide distribution, with highest density at 43%, landslide number at 4.34/km2, and landslide area abundance at 2.97% in a slope class (> 50°). We used logistic regression (LR) for LSM integrating with geographic information system. LR analysis depicts that land cover is the best predictor followed by slope and distance to fault with higher positive coefficient values. LSM was validated by assessing the correctly classified landslides under susceptibility categories using area under curve (AUC) and seed cell area index (SCAI). The LSM approach showed good accuracy with respective AUC values for success rate and prediction rate of 0.843 and 0.832. Similarly, the decreasing SCAI value from very low to very high susceptibility categories advise satisfactory accuracy of the LSM approach.


2021 ◽  
Author(s):  
Tingyu Zhang ◽  
Huanyuan Wang ◽  
Tianqing Chen ◽  
Zenghui Sun ◽  
Tao Wang ◽  
...  

Abstract The losses and damage caused by landslides are countless in the world every year. However, the existing approaches of landslide susceptibility mapping cannot fully meet the requirement of landslide prevention, and further excavation and innovation are also needed. Therefore, the main aim of this study is to develop a novel deep learning model namely landslide net (LSNet) to assess the landslide susceptibility in Hanyin County, China, meanwhile, support vector machine model (SVM) and kernel logistic regression model (KLR) were employed as reference model. The inventory map was generated based on 259 landslides, the training dataset and validation dataset were respectively prepared using 70% landslides and the remaining 30% landslides. The variance inflation factor (VIF) was applied to optimize each landslide predisposing factor. Three benchmark indices were used to evaluate the result of susceptibility mapping and area under receiver operating characteristics curve (AUROC) was used to compare the models. Result demonstrated that although the processing speed of LSNet model is the slowest, it still significantly outperformed its corresponding benchmark models with validation dataset, and has the highest accuracy (0.950), precision (0.951), F1 (0.951) and AUROC (0.941), which reflected excellent predictive ability in some degree. The achievements obtained in this study can improve the rapid response capability of landslide prevention for Hanyin County.


Entropy ◽  
2019 ◽  
Vol 21 (4) ◽  
pp. 372 ◽  
Author(s):  
Zhongjun Ma ◽  
Shengwu Qin ◽  
Chen Cao ◽  
Jiangfeng Lv ◽  
Guangjie Li ◽  
...  

Landslides are one of the most frequent geomorphic hazards, and they often result in the loss of property and human life in the Changbai Mountain area (CMA), Northeast China. The objective of this study was to produce and compare landslide susceptibility maps for the CMA using an information content model (ICM) with three knowledge-driven methods (the artificial hierarchy process with the ICM (AHP-ICM), the entropy weight method with the ICM (EWM-ICM), and the rough set with the ICM (RS-ICM)) and to explore the influence of different knowledge-driven methods for a series of parameters on the accuracy of landslide susceptibility mapping (LSM). In this research, the landslide inventory data (145 landslides) were randomly divided into a training dataset: 70% (81 landslides) were used for training the models and 30% (35 landslides) were used for validation. In addition, 13 layers of landslide conditioning factors, namely, altitude, slope gradient, slope aspect, lithology, distance to faults, distance to roads, distance to rivers, annual precipitation, land type, normalized difference vegetation index (NDVI), topographic wetness index (TWI), plan curvature, and profile curvature, were taken as independent, causal predictors. Landslide susceptibility maps were developed using the ICM, RS-ICM, AHP-ICM, and EWM-ICM, in which weights were assigned to every conditioning factor. The resultant susceptibility was validated using the area under the ROC curve (AUC) method. The success accuracies of the landslide susceptibility maps produced by the ICM, RS-ICM, AHP-ICM, and EWM-ICM methods were 0.931, 0.939, 0.912, and 0.883, respectively, with prediction accuracy rates of 0.926, 0.927, 0.917, and 0.878 for the ICM, RS-ICM, AHP-ICM, and EWM-ICM, respectively. Hence, it can be concluded that the four models used in this study gave close results, with the RS-ICM exhibiting the best performance in landslide susceptibility mapping.


Author(s):  
Hamed Rouhani ◽  
Aboalhasan Fathabadi ◽  
Jantiene Baartman

Identifying the vulnerability level of an area to soil erosion, particularly gully erosion, is key to the development of an efficient management strategy for policymakers. While efforts into susceptibility mapping of natural disasters have grown in recent years, understanding the most relevant predictive causal factors is still a challenge. As the selection of these factors, among many potentially relevant factors, is an important part of the model selection process, we propose a hybrid intelligent approach for the optimal selection of a set of relevant factors based on logistic regression (LR) and genetic algorithms. In order to verify the effectiveness of the proposed approach, this study also identified areas that were highly susceptible to gully erosion using three different classifiers – namely, the LR, support vector machine (SVM) and k-nearest neighbours (k-NN) techniques. We tested the approach in the Yeli Bedrag watershed in north-eastern Golestan province, Iran. The results showed that the elevation, distance to fault, slope and the index of connectivity were the most important causal factors affecting the successful prediction of gully occurrence. Comparison of maximum True Skill Statistic values showed that increased model sophistication did not necessarily result in a higher level of model performance. In terms of performance, k-NN was superior to the SVM and LR methods. This method can be effectively used for gully erosion susceptibility (GES) zonation in the study area, which is very important to support spatial planning to initiate designing mitigation strategies and conservation needs over a large area, or to plan additional conservation efforts and relocate soil conservation plans. In conclusion, our findings indicate that by incorporating the proposed hybrid intelligent approach, the number of relevant factors for GES mapping was reduced, while classification accuracy was increased.


Author(s):  
M. Z. Ali ◽  
H.-J. Chu ◽  
S. Ullah ◽  
M. Shafique ◽  
A. Ali

<p><strong>Abstract.</strong> The 2005 Kashmir earthquake has triggered thousands of landslides which devastated most of the livelihood and other infrastructure in the area. Landslide inventory and subsequently landslide susceptibility mapping is one of the main prerequisite for taking mitigation measure against landslide effects. This study has focused on developing most updated and realistic landslide inventory and Susceptibility mapping. The high resolution data of Worldveiw-2 having spatial resolution of 0.4 m is used for landslide inventory. Support Vector Machine (SVM) classifier was used for landslide inventory developing. Total 51460 number of landslides were classified using semi-automatic technique with covering area of 265 Km<sup>2</sup>, smallest landslide mapped is covering area of 2.01 m<sup>2</sup> and the maximum covered area of single landslide is 3.01 Km<sup>2</sup>. Nine influential causative factors are used for landslide susceptibility mapping. Those causative factors include slope, aspect, profile curvature, elevation, distance from fault lines, distance from streams and geology. Logistic regression model was used for the Landslides susceptibility modelling. From model the highest coefficient was assigned to geology which shows that the geology has higher influence in the area. For landslide susceptibility mapping the 70 % of the data was used and 30% is used for the validation of the model. The prediction accuracy of the model in this study is 92 % using validation data. This landslide susceptibility map can be used for land use planning and also for the mitigation measure during any disaster.</p>


Sign in / Sign up

Export Citation Format

Share Document