Prediction of Combined Terrestrial Evapotranspiration Index (CTEI) over Large River Basin Based on Machine Learning Approaches

Ahmed Elbeltagi; Nikul Kumari; Jaydeo K. Dharpure; Ali Mokhtar; Karam Alsafadi; Manish Kumar; Behrouz Mehdinejadiani; Hadi Ramezani Etedali; Youssef Brouziyne; Abu Reza Md. Towfiqul Islam; Alban Kuriqi

doi:10.3390/w13040547

Prediction of Combined Terrestrial Evapotranspiration Index (CTEI) over Large River Basin Based on Machine Learning Approaches

Water ◽

10.3390/w13040547 ◽

2021 ◽

Vol 13 (4) ◽

pp. 547 ◽

Cited By ~ 3

Author(s):

Ahmed Elbeltagi ◽

Nikul Kumari ◽

Jaydeo K. Dharpure ◽

Ali Mokhtar ◽

Karam Alsafadi ◽

...

Keyword(s):

Machine Learning ◽

River Basin ◽

Mean Squared Error ◽

Gaussian Model ◽

Gaussian Process Regression ◽

Absolute Error ◽

Support Vector ◽

Learning Approaches ◽

Ganga Basin ◽

Svm Algorithm

Drought is a fundamental physical feature of the climate pattern worldwide. Over the past few decades, a natural disaster has accelerated its occurrence, which has significantly impacted agricultural systems, economies, environments, water resources, and supplies. Therefore, it is essential to develop new techniques that enable comprehensive determination and observations of droughts over large areas with satisfactory spatial and temporal resolution. This study modeled a new drought index called the Combined Terrestrial Evapotranspiration Index (CTEI), developed in the Ganga river basin. For this, five Machine Learning (ML) techniques, derived from artificial intelligence theories, were applied: the Support Vector Machine (SVM) algorithm, decision trees, Matern 5/2 Gaussian process regression, boosted trees, and bagged trees. These techniques were driven by twelve different models generated from input combinations of satellite data and hydrometeorological parameters. The results indicated that the eighth model performed best and was superior among all the models, with the SVM algorithm resulting in an R2 value of 0.82 and the lowest errors in terms of the Root Mean Squared Error (RMSE) (0.33) and Mean Absolute Error (MAE) (0.20), followed by the Matern 5/2 Gaussian model with an R2 value of 0.75 and RMSE and MAE of 0.39 and 0.21 mm/day, respectively. Moreover, among all the five methods, the SVM and Matern 5/2 Gaussian methods were the best-performing ML algorithms in our study of CTEI predictions for the Ganga basin.

Download Full-text

Snowmelt-Driven Streamflow Prediction Using Machine Learning Techniques (LSTM, NARX, GPR, and SVR)

Water ◽

10.3390/w12061734 ◽

2020 ◽

Vol 12 (6) ◽

pp. 1734 ◽

Cited By ~ 2

Author(s):

Samit Thapa ◽

Zebin Zhao ◽

Bo Li ◽

Lu Lu ◽

Donglei Fu ◽

...

Keyword(s):

Machine Learning ◽

Short Term Memory ◽

Mean Squared Error ◽

Gaussian Process Regression ◽

Absolute Error ◽

Machine Learning Techniques ◽

Coefficient Of Determination ◽

Support Vector ◽

Streamflow Prediction ◽

Moderate Resolution Imaging Spectroradiometer

Although machine learning (ML) techniques are increasingly popular in water resource studies, they are not extensively utilized in modeling snowmelt. In this study, we developed a model based on a deep learning long short-term memory (LSTM) for snowmelt-driven discharge modeling in a Himalayan basin. For comparison, we developed the nonlinear autoregressive exogenous model (NARX), Gaussian process regression (GPR), and support vector regression (SVR) models. The snow area derived from moderate resolution imaging spectroradiometer (MODIS) snow images along with remotely sensed meteorological products were utilized as inputs to the models. The Gamma test was conducted to determine the appropriate input combination for the models. The shallow LSTM model with a hidden layer achieved superior results than the deeper LSTM models with multiple hidden layers. Out of seven optimizers tested, Adamax proved to be the aptest optimizer for this study. The evaluation of the ML models was done by the coefficient of determination (R2), mean absolute error (MAE), modified Kling–Gupta efficiency (KGE’), Nash–Sutcliffe efficiency (NSE), and root-mean-squared error (RMSE). The LSTM model (KGE’ = 0.99) enriched with snow cover input achieved the best results followed by NARX (KGE’ = 0.974), GPR (KGE’ = 0.95), and SVR (KGE’ = 0.949), respectively. The outcome of this study proves the applicability of the ML models, especially the LSTM model, in predicting snowmelt driven discharge in the data-scant mountainous watersheds.

Download Full-text

Machine learning approaches for improving near-real-time IMERG rainfall estimates by integrating Cloud Properties from NOAA CDR PATMOSX

Journal of Hydrometeorology ◽

10.1175/jhm-d-21-0019.1 ◽

2021 ◽

Author(s):

Zhi Zhang ◽

Dagang Wang ◽

Jianxiu Qiu ◽

Jinxin Zhu ◽

Tingli Wang

Keyword(s):

Machine Learning ◽

Real Time ◽

River Basin ◽

Central China ◽

Support Vector ◽

Learning Approaches ◽

Satellite Precipitation ◽

Cloud Properties ◽

Spatial Coverage ◽

Extreme Gradient Boosting

AbstractThe Global Precipitation Measurement (GPM) mission provides satellite precipitation products with an unprecedented spatio-temporal resolution and spatial coverage. However, its near-real-time (NRT) product still suffers from low accuracy. This study aims to improve the early run of the Integrated Multi-satellitE Retrievals for GPM (IMERG) by using four machine learning approaches, i.e., support vector machine (SVM), random forest (RF), artificial neural network (ANN), and Extreme Gradient Boosting (XGB). The cloud properties are selected as the predictors in addition to the original IMERG in these approaches. All the four approaches show similar improvement, with 53%-60% reduction of root-mean-square error (RMSE) compared with the original IMERG in a humid area, i.e., the Dongjiang River Basin (DJR) in southeastern China. The improvements are even greater in a semi-arid area, i.e., the Fenhe River Basin (FHR) in central China, the RMSE reduction ranges from 63%-66%. The products generated by the machine learning methods performs similarly to or even outperform than the final run of IMERG. Feature importance analysis, a technique to evaluate input features based on how useful they are in predicting a target variable, indicates that the cloud height and the brightness temperature are the most useful information in improving satellite precipitation products, followed by the atmospheric reflectivity and the surface temperature. This study shows that a more accurate NRT precipitation product can be produced by combining machine learning approaches and cloud information, which is of importance for hydrological applications that requires NRT precipitation information including flood monitoring.

Download Full-text

Forecasting Of Covid-19 Cases Using Machine Learning Approach

Current Respiratory Medicine Reviews ◽

10.2174/1573398x17666210129131009 ◽

2021 ◽

Vol 17 ◽

Author(s):

Sachin Kumar ◽

Karan Veer

Keyword(s):

Machine Learning ◽

Regression Model ◽

Model Performance ◽

Real Data ◽

Absolute Error ◽

Viral Disease ◽

Support Vector ◽

Family Welfare ◽

Accuracy Score ◽

Learning Approaches

Aims: The objective of this research is to predict the covid-19 cases in India based on the machine learning approaches. Background: Covid-19, a respiratory disease caused by one of the coronavirus family members, has led to a pandemic situation worldwide in 2020. This virus was detected firstly in Wuhan city of China in December 2019. This viral disease has taken less than three months to spread across the globe. Objective: In this paper, we proposed a regression model based on the Support vector machine (SVM) to forecast the number of deaths, the number of recovered cases, and total confirmed cases for the next 30 days. Method: For prediction, the data is collected from Github and the ministry of India's health and family welfare from March 14, 2020, to December 3, 2020. The model has been designed in Python 3.6 in Anaconda to forecast the forecasting value of corona trends until September 21, 2020. The proposed methodology is based on the prediction of values using SVM based regression model with polynomial, linear, rbf kernel. The dataset has been divided into train and test datasets with 40% and 60% test size and verified with real data. The model performance parameters are evaluated as a mean square error, mean absolute error, and percentage accuracy. Results and Conclusion: The results show that the polynomial model has obtained 95 % above accuracy score, linear scored above 90%, and rbf scored above 85% in predicting cumulative death, conformed cases, and recovered cases.

Download Full-text

Visualization & Prediction of COVID-19 Future Outbreak by Using Machine Learning

International Journal of Information Technology and Computer Science ◽

10.5815/ijitcs.2021.03.02 ◽

2021 ◽

Vol 13 (3) ◽

pp. 16-32

Author(s):

Ahmed Hassan Mohammed Hassan ◽

◽

Arfan Ali Mohammed Qasem ◽

Walaa Faisal Mohammed Abdalla ◽

Omer H. Elhassan

Keyword(s):

Machine Learning ◽

Polynomial Regression ◽

Mean Squared Error ◽

Absolute Error ◽

Future Perspective ◽

Support Vector ◽

Squared Error ◽

Vector Machines ◽

The World ◽

Negative Factors

Day by day, the accumulative incidence of COVID-19 is rapidly increasing. After the spread of the Corona epidemic and the death of more than a million people around the world countries, scientists and researchers have tended to conduct research and take advantage of modern technologies to learn machine to help the world to get rid of the Coronavirus (COVID-19) epidemic. To track and predict the disease Machine Learning (ML) can be deployed very effectively. ML techniques have been anticipated in areas that need to identify dangerous negative factors and define their priorities. The significance of a proposed system is to find the predict the number of people infected with COVID19 using ML. Four standard models anticipate COVID-19 prediction, which are Neural Network (NN), Support Vector Machines (SVM), Bayesian Network (BN) and Polynomial Regression (PR). The data utilized to test these models content of number of deaths, newly infected cases, and recoveries in the next 20 days. Five measures parameters were used to evaluate the performance of each model, namely root mean squared error (RMSE), mean squared error (MAE), mean absolute error (MSE), Explained Variance score and r2 score (R2). The significance and value of proposed system auspicious mechanism to anticipate these models for the current cenario of the COVID-19 epidemic. The results showed NN outperformed the other models, while in the available dataset the SVM performs poorly in all the prediction. Reference to our results showed that injuries will increase slightly in the coming days. Also, we find that the results give rise to hope due to the low death rate. For future perspective, case explanation and data amalgamation must be kept up persistently.

Download Full-text

Predicting and Analysing the Behaviour of COVID-19

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit217213 ◽

2021 ◽

pp. 40-46

Author(s):

Gaurav Singh ◽

Shivam Rai ◽

Himanshu Mishra ◽

Manoj Kumar

Keyword(s):

Machine Learning ◽

Polynomial Regression ◽

Mean Squared Error ◽

Absolute Error ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Systems Science ◽

Data Repository ◽

Support Vector ◽

Squared Error

The prime objective of this work is to predicting and analysing the Covid-19 pandemic around the world using Machine Learning algorithms like Polynomial Regression, Support Vector Machine and Ridge Regression. And furthermore, assess and compare the performance of the varied regression algorithms as far as parameters like R squared, Mean Absolute Error, Mean Squared Error and Root Mean Squared Error. In this work, we have used the dataset available on Covid-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at John Hopkins University. We have analyzed the covid19 cases from 22/1/2020 till now. We applied a supervised machine learning prediction model to forecast the possible confirmed cases for the next ten days.

Download Full-text

Estimation of Reference Evapotranspiration Using Spatial and Temporal Machine Learning Approaches

Hydrology ◽

10.3390/hydrology8010025 ◽

2021 ◽

Vol 8 (1) ◽

pp. 25

Author(s):

Ali Rashid Niaghi ◽

Oveis Hassanijalilian ◽

Jalal Shiri

Keyword(s):

Machine Learning ◽

Gene Expression Programming ◽

Absolute Error ◽

Support Vector ◽

Learning Approaches ◽

Climate Data ◽

Local Data ◽

Local Approach ◽

The Mean ◽

Statistical Indicators

Evapotranspiration (ET) is widely employed to measure amounts of total water loss between land and atmosphere due to its major contribution to water balance on both regional and global scales. Considering challenges to quantifying nonlinear ET processes, machine learning (ML) techniques have been increasingly utilized to estimate ET due to their powerful advantage of capturing complex nonlinear structures and characteristics. However, limited studies have been conducted in subhumid climates to simulate local and spatial ETo using common ML methods. The current study aims to present a methodology that exempts local data in ETo simulation. The present study, therefore, seeks to estimate and compare reference ET (ETo) using four common ML methods with local and spatial approaches based on continuous 17-year daily climate data from six weather stations across the Red River Valley with subhumid climate. The four ML models have included Gene Expression Programming (GEP), Support Vector Machine (SVM), Multiple Linear Regression (LR), and Random Forest (RF) with three input combinations of maximum and minimum air temperature-based (Tmax, Tmin), mass transfer-based (Tmax, Tmin, U: wind speed), and radiation-based (Rs: solar radiation, Tmax, Tmin) measurements. The estimates yielded by the four ML models were compared against each other by considering spatial and local approaches and four statistical indicators; namely, the root means square error (RMSE), the mean absolute error (MAE), correlation coefficient (r2), and scatter index (SI), which were used to assess the ML model’s performance. The comparison between combinations showed the lowest SI and RMSE values for the RF model with the radiation-based combination. Furthermore, the RF model showed the best performance for all combinations among the four defined models either spatially or locally. In general, the LR, GEP, and SVM models were improved when a local approach was used. The results showed the best performance for the radiation-based combination and the RF model with higher accuracy for all stations either locally or spatially, and the spatial SVM and GEP illustrated the lowest performance among the models and approaches.

Download Full-text

Automated Outlier Detection for Electrical Motors and Transformers

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2020.9304 ◽

2020 ◽

Vol 17 (9) ◽

pp. 4703-4708

Author(s):

K. Anitha Kumari ◽

Avinash Sharma ◽

S. Nivethitha ◽

V. Dharini ◽

V. Sanjith ◽

...

Keyword(s):

Machine Learning ◽

Polynomial Regression ◽

Mean Squared Error ◽

Absolute Error ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

Squared Error ◽

Electrical Motors

Electrical appliances most commonly consist of two electrical devices, namely, electrical motors and transformers. Typically, electrical motors are normally used in all sort of industrial purposes. Failures of such motors results in serious problems, such as overheat, shut down and even burnt, in their host systems. Thus, more attention have to be paid in detecting the outliers. In a similar way, to avoid the unexpected power reliability problems and system damages, the prediction of the failures in the transformers is expected to quantify the impacts. By predicting the failures, the lifetime of the transformers increases and unnecessary accidents is avoided. Therefore, this paper presents the detection of the outliers in electrical motors and failures in transformers using supervised machine learning algorithms. Machine learning techniques such as Support Vector Machine (SVM), Random Forest (RF) and regression techniques like Support Vector Regression (SVR), Polynomial Regression (PR) are used to analyze the use cases of different motor specifications. Evaluation and the efficiency of findings are proved by considering accuracy, precision, F-measure, and recall for motors. Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Square Error (RMSE) and R-squared Error (R2) are considered as metrics for transformers. The proposed approach helps to identify the anomalies like vibration loss, copper loss and overheating in the industrial motor and to determine the abnormal functioning of the transformer that in turn leads to ascertain the lifetime. The proposed system analyses the behaviour of the electrical machines using the energy meter data and reports the outliers to users. It also analyses the abnormalities occurring in the transformer using the parameters involved in the degradation of the paper-oil insulation system and the voltage of operation as a whole leads to the predict the lifetime.

Download Full-text

Using Machine Learning To Improve the Accuracy of Genomic Prediction on Reproduction Traits in Pigs

10.21203/rs.3.rs-1083849/v1 ◽

2021 ◽

Author(s):

Xue Wang ◽

Shaolei Shi ◽

Guijiang Wang ◽

Wenxue Luo ◽

Xia Wei ◽

...

Keyword(s):

Machine Learning ◽

Genomic Prediction ◽

Mean Squared Error ◽

Absolute Error ◽

Single Step ◽

Support Vector ◽

Prediction Ability ◽

Reproduction Traits ◽

Yorkshire Pigs ◽

Robust Prediction

Abstract Background Recently, machine learning (ML) is becoming attractive in genomic prediction, while its superiority in genomic prediction and the choosing of optimal ML methods are needed investigation. Results In this study, 2566 Chinese Yorkshire pigs with reproduction traits records were used, they were genotyped with GenoBaits Porcine SNP 50K and PorcineSNP50 panel. Four ML methods, including support vector regression (SVR), kernel ridge regression (KRR), random forest (RF) and Adaboost.R2 were implemented. Through 20 replicates of five-fold cross-validation, the genomic prediction abilities of ML methods were explored. Compared with genomic BLUP(GBLUP), single-step GBLUP (ssGBLUP) and Bayesian method BayesHE, our results indicated that ML methods significantly outperformed. The prediction accuracy of ML methods was improved by 19.3%, 15.0% and 20.8% on average over GBLUP, ssGBLUP and BayesHE, ranging from 8.9–24.0%, 7.6–17.5% and 11.1–24.6%, respectively. In addition, ML methods yielded smaller mean squared error (MSE) and mean absolute error (MAE) in all scenarios. ssGBLUP yielded improvement of 3.7% on average compared to GBLUP, and the performance of BayesHE was close to GBLUP. Among four ML methods, SVR and KRR had the most robust prediction abilities, which yielded higher accuracies, lower bias, lower MSE and MAE, and comparable computing efficiency as GBLUP. RF demonstrated the lowest prediction ability and computational efficiency among ML methods. Conclusion Our findings demonstrated that ML methods are more efficient than traditional genomic selection methods, and it could be new options for genomic prediction.

Download Full-text

Support vector regression algorithm modeling to predict the parturition date of small - to medium-sized dogs using maternal weight and fetal biparietal diameter

Veterinary World ◽

10.14202/vetworld.2021.829-834 ◽

2021 ◽

Vol 14 (4) ◽

pp. 829-834

Author(s):

Thanida Sananmuang ◽

Kanchanarat Mankong ◽

Suppawiwat Ponglowhapan ◽

Kaj Chokeshaiusaha

Keyword(s):

Machine Learning ◽

Support Vector Regression ◽

Mean Squared Error ◽

Absolute Error ◽

Kernel Functions ◽

Coefficient Of Determination ◽

Support Vector ◽

Maternal Weight ◽

Biparietal Diameter ◽

Parturition Date

Background and Aim: Fetal biparietal diameter (BPD) is a feasible parameter to predict canine parturition date due to its inverted correlation with days before parturition (DBP). Although such a relationship is generally described using a simple linear regression (SLR) model, the imprecision of this model in predicting the parturition date in small- to medium-sized dogs is a common problem among veterinarian practitioners. Support vector regression (SVR) is a useful machine learning model for prediction. This study aimed to compare the accuracy of SVR with that of SLR in predicting DBP. Materials and Methods: After measuring 101 BPDs in 35 small- to medium-sized pregnant bitches, we fitted the data to the routine SLR model and the SVR model using three different kernel functions, radial basis function SVR, linear SVR, and polynomial SVR. The predicted DBP acquired from each model was further utilized for calculating the coefficient of determination (R2), mean absolute error, and mean squared error scores for determining the prediction accuracy. Results: All SVR models were more accurate than the SLR model at predicting DBP. The linear and polynomial SVRs were identified as the two most accurate models (p<0.01). Conclusion: With available machine learning software, linear and polynomial SVRs can be applied to predicting DBP in small- to medium-sized pregnant bitches.

Download Full-text

Machine Learning Techniques for Identifying Fetal Risk During Pregnancy

International Journal of Image and Graphics ◽

10.1142/s0219467822500450 ◽

2021 ◽

Author(s):

S. Ravikumar ◽

E. Kannan

Keyword(s):

Machine Learning ◽

Fetal Heart ◽

Quantitative Description ◽

Absolute Error ◽

Outcome Data ◽

Machine Learning Techniques ◽

Support Vector ◽

Learning Approaches ◽

Fetal Risk ◽

Learning Techniques

Cardiotocography (CTG) is a biophysical method for assessing fetal condition that primarily relies on the recording and automated analysis of fetal heart activity. The quantitative description of the CTG signals is provided by computerized fetal monitoring systems. Even though effective conclusion generation methods for decision process support are still required to find out the fetal risk such as premature embryo, this proposed method and outcome data can confirm the assessment of the fetal state after birth. Low birth weight is quite possibly the main attribute that significantly depicts an unusual fetal result. These expectations are assessed in a constant experimental decision support system, providing valuable information that can be used to obtain additional information about the fetal state using machine learning techniques. The advancements in modern obstetric practice enabled the use of numerous reliable and robust machine learning approaches in classifying fetal heart rate signals. The Naïve Bayes (NB) classifier, support vector machine (SVM), decision trees (DT), and random forest (RF) are used in the proposed method. To assess these outcomes in the proposed method, some of the metrics such as precision, accuracy, F1 score, recall, sensitivity, logarithmic loss and mean absolute error have been taken. The above mentioned metrics will be helpful to predict the fetal risk.

Download Full-text