Evaluation of Rainfall Erosivity Factor Estimation Using Machine and Deep Learning Models

Jimin Lee; Seoro Lee; Jiyeong Hong; Dongjun Lee; Joo Hyun Bae; Jae E. Yang; Jonggun Kim; Kyoung Jae Lim

doi:10.3390/w13030382

Evaluation of Rainfall Erosivity Factor Estimation Using Machine and Deep Learning Models

Water ◽

10.3390/w13030382 ◽

2021 ◽

Vol 13 (3) ◽

pp. 382

Author(s):

Jimin Lee ◽

Seoro Lee ◽

Jiyeong Hong ◽

Dongjun Lee ◽

Joo Hyun Bae ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Soil Loss ◽

Deep Neural Network ◽

Rainfall Erosivity ◽

Learning Models ◽

R Factor ◽

Target Values ◽

R Factors ◽

Machine Learning Models

Rainfall erosivity factor (R-factor) is one of the Universal Soil Loss Equation (USLE) input parameters that account for impacts of rainfall intensity in estimating soil loss. Although many studies have calculated the R-factor using various empirical methods or the USLE method, these methods are time-consuming and require specialized knowledge for the user. The purpose of this study is to develop machine learning models to predict the R-factor faster and more accurately than the previous methods. For this, this study calculated R-factor using 1-min interval rainfall data for improved accuracy of the target value. First, the monthly R-factors were calculated using the USLE calculation method to identify the characteristics of monthly rainfall-runoff induced erosion. In turn, machine learning models were developed to predict the R-factor using the monthly R-factors calculated at 50 sites in Korea as target values. The machine learning algorithms used for this study were Decision Tree, K-Nearest Neighbors, Multilayer Perceptron, Random Forest, Gradient Boosting, eXtreme Gradient Boost, and Deep Neural Network. As a result of the validation with 20% randomly selected data, the Deep Neural Network (DNN), among seven models, showed the greatest prediction accuracy results. The DNN developed in this study was tested for six sites in Korea to demonstrate trained model performance with Nash–Sutcliffe Efficiency (NSE) and the coefficient of determination (R2) of 0.87. This means that our findings show that DNN can be efficiently used to estimate monthly R-factor at the desired site with much less effort and time with total monthly precipitation, maximum daily precipitation, and maximum hourly precipitation data. It will be used not only to calculate soil erosion risk but also to establish soil conservation plans and identify areas at risk of soil disasters by calculating rainfall erosivity factors.

Download Full-text

First-Break Picking Classification Models Using Recurrent Neural Network

10.2118/204862-ms ◽

2021 ◽

Author(s):

Mohammed Ayub ◽

SanLinn Kaka

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Neural Network ◽

Contextual Information ◽

Classification Model ◽

Superior Performance ◽

Learning Models ◽

Neural Network Models ◽

Minimum Number ◽

Machine Learning Models

Abstract Manual first-break picking from a large volume of seismic data is extremely tedious and costly. Deployment of machine learning models makes the process fast and cost effective. However, these machine learning models require high representative and effective features for accurate automatic picking. Therefore, First- Break (FB) picking classification model that uses effective minimum number of features and promises performance efficiency is proposed. The variants of Recurrent Neural Networks (RNNs) such as Long ShortTerm Memory (LSTM) and Gated Recurrent Unit (GRU) can retain contextual information from long previous time steps. We deploy this advantage for FB picking as seismic traces are amplitude values of vibration along the time-axis. We use behavioral fluctuation of amplitude as input features for LSTM and GRU. The models are trained on noisy data and tested for generalization on original traces not seen during the training and validation process. In order to analyze the real-time suitability, the performance is benchmarked using accuracy, F1-measure and three other established metrics. We have trained two RNN models and two deep Neural Network models for FB classification using only amplitude values as features. Both LSTM and GRU have the accuracy and F1-measure with a score of 94.20%. With the same features, Convolutional Neural Network (CNN) has an accuracy of 93.58% and F1-score of 93.63%. Again, Deep Neural Network (DNN) model has scores of 92.83% and 92.59% as accuracy and F1-measure, respectively. From the pexperiment results, we see significant superior performance of LSTM and GRU to CNN and DNN when used the same features. For robustness of LSTM and GRU models, the performance is compared with DNN model that is trained using nine features derived from seismic traces and observed that the performance superiority of RNN models. Therefore, it is safe to conclude that RNN models (LSTM and GRU) are capable of classifying the FB events efficiently even by using a minimum number of features that are not computationally expensive. The novelty of our work is the capability of automatic FB classification with the RNN models that incorporate contextual behavioral information without the need for sophisticated feature extraction or engineering techniques that in turn can help in reducing the cost and fostering classification model robust and faster.

Download Full-text

Predicting mortality in SARS-COV-2 (COVID-19) positive patients in the inpatient setting using a Novel Deep Neural Network

10.1101/2020.12.13.20247254 ◽

2020 ◽

Author(s):

Maleeha Naseem ◽

Hajra Arshad ◽

Syeda Amrah Hashimi ◽

Furqan Irfan ◽

Fahad Shabbir Ahmed

Keyword(s):

Neural Network ◽

Machine Learning ◽

Risk Factors ◽

Deep Neural Network ◽

Multivariate Analyses ◽

High Accuracy ◽

Inpatient Setting ◽

Learning Models ◽

Rt Pcr ◽

Machine Learning Models

ABSTRACTBackgroundThe second wave of COVID-19 pandemic is anticipated to be worse than the initial one and will strain the healthcare systems even more during the winter months. Our aim was to develop a machine learning-based model to predict mortality using the deep learning Neo-V framework. We hypothesized this novel machine learning approach could be applied to COVID-19 patients to predict mortality successfully with high accuracy.MethodsThe current Deep-Neo-V model is built on our previously statistically rigorous machine learning framework [Fahad-Liaqat-Ahmad Intensive Machine (FLAIM) framework] that evaluated statistically significant risk factors, generated new combined variables and then supply these risk factors to deep neural network to predict mortality in RT-PCR positive COVID-19 patients in the inpatient setting. We analyzed adult patients (≥18 years) admitted to the Aga Khan University Hospital, Pakistan with a working diagnosis of COVID-19 infection (n=1228). We excluded patients that were negative on COVID-19 on RT-PCR, had incomplete or missing health records. The first phase selection of risk factor was done using Cox-regression univariate and multivariate analyses. In the second phase, we generated new variables and tested those statistically significant for mortality and in the third and final phase we applied deep neural networks and other traditional machine learning models like Decision Tree Model, k-nearest neighbor models and others.ResultsA total of 1228 cases were diagnosed as COVID-19 infection, we excluded 14 patients after the exclusion criteria and (n=)1214 patients were analyzed. We observed that several clinical and laboratory-based variables were statistically significant for both univariate and multivariate analyses while others were not. With most significant being septic shock (hazard ratio [HR], 4.30; 95% confidence interval [CI], 2.91-6.37), supportive treatment (HR, 3.51; 95% CI, 2.01-6.14), abnormal international normalized ratio (INR) (HR, 3.24; 95% CI, 2.28-4.63), admission to the intensive care unit (ICU) (HR, 3.24; 95% CI, 2.22-4.74), treatment with invasive ventilation (HR, 3.21; 95% CI, 2.15-4.79) and laboratory lymphocytic derangement (HR, 2.79; 95% CI, 1.6-4.86). Machine learning results showed our DNN (Neo-V) model outperformed all conventional machine learning models with test set accuracy of 99.53%, sensitivity of 89.87%, and specificity of 95.63%; positive predictive value, 50.00%; negative predictive value, 91.05%; and area under the curve of the receiver-operator curve of 88.5.ConclusionOur novel Deep-Neo-V model outperformed all other machine learning models. The model is easy to implement, user friendly and with high accuracy.

Download Full-text

Deep neural network affinity model for BACE inhibitors in D3R Grand Challenge 4

10.1101/680306 ◽

2019 ◽

Author(s):

Bo Wang ◽

Ho-Leung Ng

Keyword(s):

Neural Network ◽

Machine Learning ◽

Crystal Structures ◽

Deep Neural Network ◽

Macrocyclic Ligands ◽

Support Vector ◽

Grand Challenge ◽

Scoring Functions ◽

Learning Models ◽

Machine Learning Models

AbstractDrug Design Data Resource (D3R) Grand Challenge 4 (GC4) offered a unique opportunity for designing and testing novel methodology for accurate docking and affinity prediction of ligands in an open and blinded manner. We participated in the beta-secretase 1 (BACE) Subchallenge which is comprised of cross-docking and redocking of 20 macrocyclic ligands to BACE and predicting binding affinity for 154 macrocyclic ligands. For this challenge, we developed machine learning models trained specifically on BACE. We developed a deep neural network (DNN) model that used a combination of both structure and ligand-based features that outperformed simpler machine learning models. According to the results released by D3R, we achieved a Spearman’s rank correlation coefficient of 0.43(7) for predicting the affinity of 154 ligands. We describe the formulation of our machine learning strategy in detail. We compared the performance of DNN with linear regression, random forest, and support vector machines using ligand-based, structure-based, and combining both ligand and structure-based features. We compared different structures for our DNN and found that performance was highly dependent on fine optimization of the L2 regularization hyperparameter, alpha. We also developed a novel metric of ligand three-dimensional similarity inspired by crystallographic difference density maps to match ligands without crystal structures to similar ligands with known crystal structures. This report demonstrates that detailed parameterization, careful data training and implementation, and extensive feature analysis are necessary to obtain strong performance with more complex machine learning methods. Post hoc analysis shows that scoring functions based only on ligand features are competitive with those also using structural features. Our DNN approach tied for fifth in predicting BACE-ligand binding affinities.

Download Full-text

Predicting S&P 500 Market Price by Deep Neural Network and Enemble Model

E3S Web of Conferences ◽

10.1051/e3sconf/202021402040 ◽

2020 ◽

Vol 214 ◽

pp. 02040

Author(s):

Feiyu Wang

Keyword(s):

Neural Network ◽

Machine Learning ◽

Support Vector Machine ◽

Linear Regression ◽

Deep Neural Network ◽

Market Price ◽

Support Vector ◽

Learning Models ◽

Conventional Machine ◽

Machine Learning Models

The method to predict the movement of stock market has appealed to scientists for decades. In this article, we use three different models to tackle that problem. In particular, we propose a Deep Neural Network (DNN) to predict the intraday direction of SP500 index and compare the DNN with two conventional machine learning models, i.e. linear regression, support vector machine. We demonstrate that DNN is able to predict SP500 index with relatively highest accuracy.

Download Full-text

Estimativa da erosividade da chuva por diferentes métodos e seu impacto na equação universal de perdas de solo, no semiárido pernambucano (Estimation of erosivity by different methods and their impact on the universal soil loss equation in Pernambuco semi-arid)

Revista Brasileira de Geografia Física ◽

10.26848/rbgf.v12.3.p859-875 ◽

2019 ◽

Vol 12 (3) ◽

pp. 859

Author(s):

Joaquim Pedro de Santana Xavier ◽

Alexandre Hugo Cezar Barros ◽

Daniel Chaves Webber ◽

Luciano José de Oliveira Accioly ◽

Flávio Adriano Marques ◽

...

Keyword(s):

Soil Loss ◽

Universal Soil Loss Equation ◽

Rainfall Erosivity ◽

R Factor ◽

Semi Arid Region ◽

Erosion Loss ◽

R Factors ◽

Semi Arid ◽

Soil Loss Equation

Dentre os diversos métodos indiretos para estimar as perdas de solo por erosão, a Equação Universal de Perdas de Solo (EUPS) é a mais utilizada devido a sua robustez e por ser constituída de uma simples estrutura fatorial, que integra fatores naturais e antrópicos atuantes na perda de solos. A erosão é um dos fenômenos mais danosos ao solo e às atividades humanas e por isso seu estudo é importante. Para o cálculo das perdas de solo por meio da EUPS, a avaliação da erosividade das chuvas (fator R) é essencial, pois estima o fenômeno produzido pelas chuvas. O objetivo deste trabalho foi avaliar três metodologias disponíveis de obtenção da erosividade das chuvas para a região do semiárido pernambucano, avaliando sua influência nos resultados da EUPS. Os três modelos selecionados para estimar o Fator R foram desenvolvidos por Wischmeier e Smith (mais conhecido e utilizado), por Silva que estimou valores para diversas regiões do País e por Cantalice e outros que trabalharam especificamente para cada região climática do estado de Pernambuco. Os resultados indicam que as metodologias de Wischmeier e Smith e Silva obtiveram resultados de erosividade da chuva semelhantes, tendo Silva alcançado valores maiores. Cantalice e outros obtiveram os resultados mais baixos. Os resultados da EUPS indicam que, quantitativamente, os diferentes fatores R geram grande diferença nas perdas de solo, porém, qualitativamente chegam a resultados semelhantes na classificação de áreas de maior erosão, de acordo com a FAO. Logo, as três metodologias são viáveis na identificação de áreas prioritárias para a mitigação da erosão. A B S T R A C TAmong several indirect methods to estimate soil erosion loss, the Universal Soil Loss Equation (EUPS) is the most used due to its robustness and because it is constituted of a simple factorial structure that integrates natural and anthropic factors which act in the loss of soils. Erosion is one of the most damaging phenomena to the soil and the human activities, evidencing the importance of studying it. The evaluation of rainfall erosivity (R factor) is essential for the calculation of soil loss through the EUPS, since it is possible to estimate how significant rainfall is to the occurrence of this phenomenon. The objective of this work was to evaluate three methodologies to obtain the rainfall erosivity available for the semi - arid region of Pernambuco, evaluating its influence on the results of the EUPS. The three models used to estimate the R-factor were developed by Wischmeier and Smith, the best known and used model, Silva who estimated values for several regions of the country and Cantalice and others who worked specifically for each climatic region of the state of Pernambuco. As a result, very similar results of rainfall erosivity were obtained between Wischmeier and Smith´s and Silva´s methodology, with Silva reaching higher values of energy amplitude, while Cantalice and others obtained the lowest results. The results of EUPS indicate that, quantitatively, the different R factors generate a large difference in soil loss, but qualitatively they reach similar results in the classification of areas where erosion are greater, according to the FAO. Therefore, the three methodologies are feasible in the identification of priority areas for erosion mitigation.Keywords: soil, rainfall erosivity, USLE, GIS

Download Full-text

Ensemble and Neural Network Machine Learning Models for Short-Term Load Forecasting of Open Cast Mining Companies

Electrotechnical Systems and Complexes ◽

10.18503/2311-8318-2021-3(52)-57-65 ◽

2021 ◽

pp. 57-65

Author(s):

Dmitry Antonenkov ◽

◽

Pavel Matrenin ◽

Keyword(s):

Neural Network ◽

Machine Learning ◽

Load Forecasting ◽

Learning Models ◽

Short Term ◽

Mining Companies ◽

Open Cast Mining ◽

Short Term Load Forecasting ◽

Open Cast ◽

Machine Learning Models

Download Full-text

Building-damage detection method based on machine learning utilizing aerial photographs of the Kumamoto earthquake

Earthquake Spectra ◽

10.1177/8755293019901309 ◽

2020 ◽

Vol 36 (3) ◽

pp. 1166-1187 ◽

Cited By ~ 4

Author(s):

Shohei Naito ◽

Hiromitsu Tomozawa ◽

Yuji Mori ◽

Takeshi Nagata ◽

Naokazu Monma ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Convolutional Neural Network ◽

Training Data ◽

Aerial Photographs ◽

Learning Models ◽

Visual Interpretation ◽

Damage Classification ◽

Kumamoto Earthquake ◽

Machine Learning Models

This article presents a method for detecting damaged buildings in the event of an earthquake using machine learning models and aerial photographs. We initially created training data for machine learning models using aerial photographs captured around the town of Mashiki immediately after the main shock of the 2016 Kumamoto earthquake. All buildings are classified into one of the four damage levels by visual interpretation. Subsequently, two damage discrimination models are developed: a bag-of-visual-words model and a model based on a convolutional neural network. Results are compared and validated in terms of accuracy, revealing that the latter model is preferable. Moreover, for the convolutional neural network model, the target areas are expanded and the recalls of damage classification at the four levels range approximately from 66% to 81%.

Download Full-text

Chained Anomaly Detection Models for Federated Learning: An Intrusion Detection Case Study

Applied Sciences ◽

10.3390/app8122663 ◽

2018 ◽

Vol 8 (12) ◽

pp. 2663 ◽

Cited By ~ 11

Author(s):

Davy Preuveneers ◽

Vera Rimmer ◽

Ilias Tsingenopoulos ◽

Jan Spooren ◽

Wouter Joosen ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Intrusion Detection ◽

Anomaly Detection ◽

Training Data ◽

Learning Models ◽

Traditional System ◽

Blockchain Technology ◽

Malicious Behavior ◽

Machine Learning Models

The adoption of machine learning and deep learning is on the rise in the cybersecurity domain where these AI methods help strengthen traditional system monitoring and threat detection solutions. However, adversaries too are becoming more effective in concealing malicious behavior amongst large amounts of benign behavior data. To address the increasing time-to-detection of these stealthy attacks, interconnected and federated learning systems can improve the detection of malicious behavior by joining forces and pooling together monitoring data. The major challenge that we address in this work is that in a federated learning setup, an adversary has many more opportunities to poison one of the local machine learning models with malicious training samples, thereby influencing the outcome of the federated learning and evading detection. We present a solution where contributing parties in federated learning can be held accountable and have their model updates audited. We describe a permissioned blockchain-based federated learning method where incremental updates to an anomaly detection machine learning model are chained together on the distributed ledger. By integrating federated learning with blockchain technology, our solution supports the auditing of machine learning models without the necessity to centralize the training data. Experiments with a realistic intrusion detection use case and an autoencoder for anomaly detection illustrate that the increased complexity caused by blockchain technology has a limited performance impact on the federated learning, varying between 5 and 15%, while providing full transparency over the distributed training process of the neural network. Furthermore, our blockchain-based federated learning solution can be generalized and applied to more sophisticated neural network architectures and other use cases.

Download Full-text

Fraudulent Face Image Detection

ITM Web of Conferences ◽

10.1051/itmconf/20203203005 ◽

2020 ◽

Vol 32 ◽

pp. 03005

Author(s):

Rahul Awhad ◽

Saurabh Jayswal ◽

Adesh More ◽

Jyoti Kundale

Keyword(s):

Neural Network ◽

Machine Learning ◽

Face Image ◽

Support Vector ◽

Learning Models ◽

Image Detection ◽

Software Applications ◽

Support Vector Classifier ◽

The Face ◽

Machine Learning Models

Due to the growing advancements in technology, many software applications are being developed to modify and edit images. Such software can be used to alter images. Nowadays, an altered image is so realistic that it becomes too difficult for a person to identify whether the image is fake or real. Such software applications can be used to alter the image of a person’s face also. So, it becomes very difficult to identify whether the image of the face is real or not. Our proposed system is used to identify whether the image of a face is fake or real. The proposed system makes use of machine learning. The system makes use of a convolution neural network and support vector classifier. Both these machine learning models are trained using real as well as fake images. Both these trained models will take an image as an input and will determine whether the image is fake or real.

Download Full-text

Machine learning model for predicting the optimal depth of tracheal tube insertion in pediatric patients: A retrospective cohort study

PLoS ONE ◽

10.1371/journal.pone.0257069 ◽

2021 ◽

Vol 16 (9) ◽

pp. e0257069

Author(s):

Jae-Geum Shim ◽

Kyoung-Ho Ryu ◽

Sung Hyun Lee ◽

Eun-Ah Cho ◽

Sungho Lee ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Artificial Neural Network ◽

Support Vector Machine ◽

Random Forest ◽

Tracheal Tube ◽

Pediatric Patients ◽

Support Vector ◽

Learning Models ◽

Machine Learning Models

Objective To construct a prediction model for optimal tracheal tube depth in pediatric patients using machine learning. Methods Pediatric patients aged <7 years who received post-operative ventilation after undergoing surgery between January 2015 and December 2018 were investigated in this retrospective study. The optimal location of the tracheal tube was defined as the median of the distance between the upper margin of the first thoracic(T1) vertebral body and the lower margin of the third thoracic(T3) vertebral body. We applied four machine learning models: random forest, elastic net, support vector machine, and artificial neural network and compared their prediction accuracy to three formula-based methods, which were based on age, height, and tracheal tube internal diameter(ID). Results For each method, the percentage with optimal tracheal tube depth predictions in the test set was calculated as follows: 79.0 (95% confidence interval [CI], 73.5 to 83.6) for random forest, 77.4 (95% CI, 71.8 to 82.2; P = 0.719) for elastic net, 77.0 (95% CI, 71.4 to 81.8; P = 0.486) for support vector machine, 76.6 (95% CI, 71.0 to 81.5; P = 1.0) for artificial neural network, 66.9 (95% CI, 60.9 to 72.5; P < 0.001) for the age-based formula, 58.5 (95% CI, 52.3 to 64.4; P< 0.001) for the tube ID-based formula, and 44.4 (95% CI, 38.3 to 50.6; P < 0.001) for the height-based formula. Conclusions In this study, the machine learning models predicted the optimal tracheal tube tip location for pediatric patients more accurately than the formula-based methods. Machine learning models using biometric variables may help clinicians make decisions regarding optimal tracheal tube depth in pediatric patients.

Download Full-text