Analysis of Copernicus’ ERA5 Climate Reanalysis Data as a Replacement for Weather Station Temperature Measurements in Machine Learning Models for Olive Phenology Phase Prediction

Noelia Oses; Izar Azpiroz; Susanna Marchi; Diego Guidotti; Marco Quartulli; Igor G. Olaizola

doi:10.3390/s20216381

Analysis of Copernicus’ ERA5 Climate Reanalysis Data as a Replacement for Weather Station Temperature Measurements in Machine Learning Models for Olive Phenology Phase Prediction

Sensors ◽

10.3390/s20216381 ◽

2020 ◽

Vol 20 (21) ◽

pp. 6381

Author(s):

Noelia Oses ◽

Izar Azpiroz ◽

Susanna Marchi ◽

Diego Guidotti ◽

Marco Quartulli ◽

...

Keyword(s):

Machine Learning ◽

Olive Tree ◽

Reanalysis Data ◽

Temperature Data ◽

Temperature Measurements ◽

Weather Station ◽

Learning Models ◽

Phenological Model ◽

Machine Learning Models ◽

Phenological Phases

Knowledge of phenological events and their variability can help to determine final yield, plan management approach, tackle climate change, and model crop development. THe timing of phenological stages and phases is known to be highly correlated with temperature which is therefore an essential component for building phenological models. Satellite data and, particularly, Copernicus’ ERA5 climate reanalysis data are easily available. Weather stations, on the other hand, provide scattered temperature data, with fragmentary spatial coverage and accessibility, as such being scarcely efficacious as unique source of information for the implementation of predictive models. However, as ERA5 reanalysis data are not real temperature measurements but reanalysis products, it is necessary to verify whether these data can be used as a replacement for weather station temperature measurements. The aims of this study were: (i) to assess the validity of ERA5 data as a substitute for weather station temperature measurements, (ii) to test different machine learning models for the prediction of phenological phases while using different sets of features, and (iii) to optimize the base temperature of olive tree phenological model. The predictive capability of machine learning models and the performance of different feature subsets were assessed when comparing the recorded temperature data, ERA5 data, and a simple growing degree day phenological model as benchmark. Data on olive tree phenology observation, which were collected in Tuscany for three years, provided the phenological phases to be used as target variables. The results show that ERA5 climate reanalysis data can be used for modelling phenological phases and that these models provide better predictions in comparison with the models trained with weather station temperature measurements.

Download Full-text

Exploratory Analysis of Machine Learning Methods in Predicting Subsurface Temperature and Geothermal Gradient of Northeastern United States

10.21203/rs.3.rs-131433/v1 ◽

2021 ◽

Author(s):

Arya Shahdi ◽

Seho Lee ◽

Anuj Karpatne ◽

Bahareh Nojabaei

Keyword(s):

Machine Learning ◽

Oil And Gas ◽

Active Regions ◽

Geothermal Gradient ◽

Temperature Data ◽

Exploratory Analysis ◽

Measured Temperature ◽

Subsurface Temperature ◽

Learning Models ◽

Machine Learning Models

Abstract Geothermal scientists have used bottom hole temperature data from extensive oil and gas well datasets to generate heat flow and temperature-at-depth maps to locate potential geothermally active regions. Considering that there are some uncertainties and simplifying assumptions associated with the current state of physics-based models, in this study, the applicability of several machine learning models is evaluated for predicting temperature-at-depth and geothermal gradient parameters. Through our exploratory analysis, it is found that XGBoost results in the highest accuracy for subsurface temperature prediction with average mean-absolute-error and root-mean-square-error of 3.19[°C] and 4.94[°C], respectively. Furthermore, we apply our model to regions around the sites to provide 2D continuous temperature maps at three different depths using XGBoost model, which can be used to locate prospective geothermally active regions. We also validate the proposed XGBoost and DNN models using an extra dataset containing measured temperature data along the depth for fifty-eight wells in the state of West Virginia. Accuracy measures show that machine learning models are highly comparable to the physics-based model and can even outperform the thermal conductivity model. Also, a geothermal gradient map is derived for the whole region by fitting linear regression to the XGBoost predicted temperatures along the depth. Finally, thorough our analysis, the most favorable geological locations are suggested for potential future geothermal developments.

Download Full-text

Exploratory analysis of machine learning methods in predicting subsurface temperature and geothermal gradient of Northeastern United States

Geothermal Energy ◽

10.1186/s40517-021-00200-4 ◽

2021 ◽

Vol 9 (1) ◽

Author(s):

Arya Shahdi ◽

Seho Lee ◽

Anuj Karpatne ◽

Bahareh Nojabaei

Keyword(s):

Machine Learning ◽

Oil And Gas ◽

Active Regions ◽

Geothermal Gradient ◽

Temperature Data ◽

Exploratory Analysis ◽

Measured Temperature ◽

Subsurface Temperature ◽

Learning Models ◽

Machine Learning Models

AbstractGeothermal scientists have used bottom-hole temperature data from extensive oil and gas well datasets to generate heat flow and temperature-at-depth maps to locate potential geothermally active regions. Considering that there are some uncertainties and simplifying assumptions associated with the current state of physics-based models, in this study, the applicability of several machine learning models is evaluated for predicting temperature-at-depth and geothermal gradient parameters. Through our exploratory analysis, it is found that XGBoost and Random Forest result in the highest accuracy for subsurface temperature prediction. Furthermore, we apply our model to regions around the sites to provide 2D continuous temperature maps at three different depths using XGBoost model, which can be used to locate prospective geothermally active regions. We also validate the proposed XGBoost and DNN models using an extra dataset containing measured temperature data along the depth for 58 wells in the state of West Virginia. Accuracy measures show that machine learning models are highly comparable to the physics-based model and can even outperform the thermal conductivity model. Also, a geothermal gradient map is derived for the whole region by fitting linear regression to the XGBoost-predicted temperatures along the depth. Finally, through our analysis, the most favorable geological locations are suggested for potential future geothermal developments.

Download Full-text

Machine learning models for the estimation of monthly mean daily reference evapotranspiration based on cross-station and synthetic data

Hydrology Research ◽

10.2166/nh.2019.060 ◽

2019 ◽

Vol 50 (6) ◽

pp. 1730-1750 ◽

Cited By ~ 6

Author(s):

Lifeng Wu ◽

Youwen Peng ◽

Junliang Fan ◽

Yicheng Wang

Keyword(s):

Machine Learning ◽

Reference Evapotranspiration ◽

Irrigation Scheduling ◽

Temperature Data ◽

The Other ◽

Estimation Accuracy ◽

Gradient Boosting ◽

Support Vector ◽

Learning Models ◽

Machine Learning Models

Abstract The estimation of reference evapotranspiration (ET0) is important in hydrology research, irrigation scheduling design and water resources management. This study explored the capability of eight machine learning models, i.e., Artificial Neuron Network (ANN), Random Forest (RF), Gradient Boosting Decision Tree (GBDT), Extreme Gradient Boosting (XGBoost), Multivariate Adaptive Regression Spline (MARS), Support Vector Machine (SVM), Extreme Learning Machine and a novel Kernel-based Nonlinear Extension of Arps Decline (KNEA) Model, for modeling monthly mean daily ET0 using only temperature data from local or cross stations. These machine learning models were also compared with the temperature-based Hargreaves–Samani equation. The results indicated that the estimation accuracy of these machine learning models differed in various scenarios. The tree-based models (RF, GBDT and XGBoost) exhibited higher estimation accuracy than the other models in the local application. When the station has only temperature data, the MARS and SVM models were slightly superior to the other models, while the ANN and HS models performed worse than the others. When there was no temperature data at the target station and the data from adjacent stations were used instead, MARS, SVM and KNEA were the suitable models. The results can provide a solution for ET0 estimation in the absence of complete meteorological data.

Download Full-text

Improving XGBoost with Imagination Sampling

Communications of the Blyth Institute ◽

10.33014/issn.2640-5652.2.1.holloway.1 ◽

2020 ◽

Vol 2 (1) ◽

pp. 3-6

Author(s):

Eric Holloway

Keyword(s):

Machine Learning ◽

General System ◽

Learning Models ◽

Starting Point ◽

Machine Learning Models

Imagination Sampling is the usage of a person as an oracle for generating or improving machine learning models. Previous work demonstrated a general system for using Imagination Sampling for obtaining multibox models. Here, the possibility of importing such models as the starting point for further automatic enhancement is explored.

Download Full-text

Development of Machine Learning Models to Predict Student Performance in Computer Literacy Courses

International Review on Computers and Software (IRECOS) ◽

10.15866/irecos.v13i1.16863 ◽

2018 ◽

Vol 13 (1) ◽

pp. 21

Author(s):

George Anderson ◽

Oduronke T. Eyitayo

Keyword(s):

Machine Learning ◽

Student Performance ◽

Computer Literacy ◽

Learning Models ◽

Machine Learning Models

Download Full-text

Experimental Comparison of Machine Learning Models in Malware Packing Detection

2020 21st Asia-Pacific Network Operations and Management Symposium (APNOMS) ◽

10.23919/apnoms50412.2020.9237007 ◽

2020 ◽

Author(s):

Jong-Wouk Kim ◽

Juhong Namgung ◽

Yang-Sae Moon ◽

Mi-Jung Choi

Keyword(s):

Machine Learning ◽

Experimental Comparison ◽

Learning Models ◽

Machine Learning Models

Download Full-text

Epigenetic Target Prediction with Accurate Machine Learning Models

10.26434/chemrxiv.13522313 ◽

2021 ◽

Author(s):

Norberto Sánchez-Cruz ◽

Jose L. Medina-Franco

Keyword(s):

Machine Learning ◽

Small Molecules ◽

Predictive Models ◽

Large Scale ◽

Target Prediction ◽

Quantitative Measure ◽

Learning Models ◽

Discovery Research ◽

Drug Discovery Research ◽

Machine Learning Models

Epigenetic targets are a significant focus for drug discovery research, as demonstrated by the eight approved epigenetic drugs for treatment of cancer and the increasing availability of chemogenomic data related to epigenetics. This data represents a large amount of structure-activity relationships that has not been exploited thus far for the development of predictive models to support medicinal chemistry efforts. Herein, we report the first large-scale study of 26318 compounds with a quantitative measure of biological activity for 55 protein targets with epigenetic activity. Through a systematic comparison of machine learning models trained on molecular fingerprints of different design, we built predictive models with high accuracy for the epigenetic target profiling of small molecules. The models were thoroughly validated showing mean precisions up to 0.952 for the epigenetic target prediction task. Our results indicate that the herein reported models have considerable potential to identify small molecules with epigenetic activity. Therefore, our results were implemented as freely accessible and easy-to-use web application.

Download Full-text

A Comparative Study of Machine Learning Models for Stock Market Rate Prediction

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v7i6.985990 ◽

2019 ◽

Vol 7 (6) ◽

pp. 985-990

Author(s):

reeraksha M S ◽

Bhargavi M S

Keyword(s):

Machine Learning ◽

Stock Market ◽

Comparative Study ◽

Learning Models ◽

Rate Prediction ◽

Market Rate ◽

Machine Learning Models

Download Full-text

An Intelligent Approach for Prediction of Liver Disease using Machine Learning Models

International Journal of Emerging Trends in Engineering Research ◽

10.30534/ijeter/2020/568102020 ◽

2020 ◽

Vol 8 (10) ◽

pp. 6974-6983

Keyword(s):

Machine Learning ◽

Liver Disease ◽

Learning Models ◽

Intelligent Approach ◽

Machine Learning Models

Download Full-text

Utilizing Blockchain Technology in Social Media Bot Identification

10.36227/techrxiv.12049374 ◽

2020 ◽

Author(s):

Shreya Reddy ◽

Lisa Ewen ◽

Pankti Patel ◽

Prerak Patel ◽

Ankit Kundal ◽

...

Keyword(s):

Machine Learning ◽

Social Media ◽

Gold Standard ◽

The Internet ◽

Learning Models ◽

Current Time ◽

Machine Learning Methods ◽

Blockchain Technology ◽

Modern Age ◽

Machine Learning Models

As bots become more prevalent and smarter in the modern age of the internet, it becomes ever more important that they be identified and removed. Recent research has dictated that machine learning methods are accurate and the gold standard of bot identification on social media. Unfortunately, machine learning models do not come without their negative aspects such as lengthy training times, difficult feature selection, and overwhelming pre-processing tasks. To overcome these difficulties, we are proposing a blockchain framework for bot identification. At the current time, it is unknown how this method will perform, but it serves to prove the existence of an overwhelming gap of research under this area.

Download Full-text