Crisp discharge forecasts and grey uncertainty bands using data-driven models

S. Alvisi; E. Creaco; M. Franchini

doi:10.2166/nh.2012.121

Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology – Part 1: Concepts and methodology

Hydrology and Earth System Sciences Discussions ◽

10.5194/hessd-6-7055-2009 ◽

2009 ◽

Vol 6 (6) ◽

pp. 7055-7093 ◽

Cited By ~ 4

Author(s):

A. Elshorbagy ◽

G. Corzo ◽

S. Srinivasulu ◽

D. P. Solomatine

Keyword(s):

Polynomial Regression ◽

Predictive Accuracy ◽

Lower Layer ◽

Data Driven ◽

Support Vector ◽

K Nearest Neighbors ◽

Evolutionary Polynomial Regression ◽

Modeling Techniques ◽

Modeling Experiment ◽

Data Driven Modeling

Abstract. A comprehensive data driven modeling experiment is presented in two-part paper. In this first part, an extensive data-driven modeling experiment is proposed. The most important concerns regarding the way data driven modeling (DDM) techniques and data were handled, compared, and evaluated, and the basis on which findings and conclusions were drawn are discussed. A concise review of key articles that presented comparisons among various DDM techniques is presented. Six DDM techniques, namely, neural networks, genetic programming, evolutionary polynomial regression, support vector machines, M5 model trees, and K-nearest neighbors are proposed and explained. Multiple linear regression and naïve models are also suggested as baseline for comparison with the various techniques. Five datasets from Canada and Europe representing evapotranspiration, upper and lower layer soil moisture content, and rainfall-runoff process are described and proposed for the modeling experiment. Twelve different realizations (groups) from each dataset are created by a procedure involving random sampling. Each group contains three subsets; training, cross-validation, and testing. Each modeling technique is proposed to be applied to each of the 12 groups of each dataset. This way, both predictive accuracy and uncertainty of the modeling techniques can be evaluated. The implementation of the modeling techniques, results and analysis, and the findings of the modeling experiment are deferred to the second part of this paper.

Download Full-text

Comparison of three data-driven techniques in modelling the evapotranspiration process

Journal of Hydroinformatics ◽

10.2166/hydro.2010.029 ◽

2010 ◽

Vol 12 (4) ◽

pp. 365-379 ◽

Cited By ~ 21

Author(s):

I. El-Baroudy ◽

A. Elshorbagy ◽

S. K. Carey ◽

O. Giustolisi ◽

D. Savic

Keyword(s):

Case Studies ◽

Water Resource Management ◽

Polynomial Regression ◽

Hydrological Cycle ◽

Global Scale ◽

Data Driven ◽

Actual Evapotranspiration ◽

Time Lags ◽

Evolutionary Polynomial Regression ◽

Input Variables

Evapotranspiration is one of the main components of the hydrological cycle as it accounts for more than two-thirds of the precipitation losses at the global scale. Reliable estimates of actual evapotranspiration are crucial for effective watershed modelling and water resource management, yet direct measurements of the evapotranspiration losses are difficult and expensive. This research explores the utility and effectiveness of data-driven techniques in modelling actual evapotranspiration measured by an eddy covariance system. The authors compare the Evolutionary Polynomial Regression (EPR) performance to Artificial Neural Networks (ANNs) and Genetic Programming (GP). Furthermore, this research investigates the effect of previous states (time lags) of the meteorological input variables on characterizing actual evapotranspiration. The models developed using the EPR, based on the two case studies at the Mildred Lake mine, AB, Canada provided comparable performance to the models of GP and ANNs. Moreover, the EPR provided simpler models than those developed by the other data-driven techniques, particularly in one of the case studies. The inclusion of the previous states of the input variables slightly enhanced the performance of the developed model, which in turn indicates the dynamic nature of the evapotranspiration process.

Download Full-text

A symbolic data-driven technique based on evolutionary polynomial regression

Journal of Hydroinformatics ◽

10.2166/hydro.2006.020b ◽

2006 ◽

Vol 8 (3) ◽

pp. 207-222 ◽

Cited By ~ 174

Author(s):

Orazio Giustolisi ◽

Dragan A. Savic

Keyword(s):

Polynomial Regression ◽

Computing Methodology ◽

Resistance Coefficient ◽

Regression Method ◽

Data Driven ◽

Evolutionary Polynomial Regression ◽

Symbolic Data ◽

Computational Performance ◽

Regression Techniques ◽

Physical Insight

This paper describes a new hybrid regression method that combines the best features of conventional numerical regression techniques with the genetic programming symbolic regression technique. The key idea is to employ an evolutionary computing methodology to search for a model of the system/process being modelled and to employ parameter estimation to obtain constants using least squares. The new technique, termed Evolutionary Polynomial Regression (EPR) overcomes shortcomings in the GP process, such as computational performance; number of evolutionary parameters to tune and complexity of the symbolic models. Similarly, it alleviates issues arising from numerical regression, including difficulties in using physical insight and over-fitting problems. This paper demonstrates that EPR is good, both in interpolating data and in scientific knowledge discovery. As an illustration, EPR is used to identify polynomial formulæ with progressively increasing levels of noise, to interpolate the Colebrook-White formula for a pipe resistance coefficient and to discover a formula for a resistance coefficient from experimental data.

Download Full-text

Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 1: Concepts and methodology

Hydrology and Earth System Sciences ◽

10.5194/hess-14-1931-2010 ◽

2010 ◽

Vol 14 (10) ◽

pp. 1931-1941 ◽

Cited By ~ 120

Author(s):

A. Elshorbagy ◽

G. Corzo ◽

S. Srinivasulu ◽

D. P. Solomatine

Keyword(s):

Polynomial Regression ◽

Lower Layer ◽

Data Driven ◽

Support Vector ◽

K Nearest Neighbors ◽

Evolutionary Polynomial Regression ◽

Vector Machines ◽

Modeling Techniques ◽

Modeling Experiment ◽

Data Driven Modeling

Abstract. A comprehensive data driven modeling experiment is presented in a two-part paper. In this first part, an extensive data-driven modeling experiment is proposed. The most important concerns regarding the way data driven modeling (DDM) techniques and data were handled, compared, and evaluated, and the basis on which findings and conclusions were drawn are discussed. A concise review of key articles that presented comparisons among various DDM techniques is presented. Six DDM techniques, namely, neural networks, genetic programming, evolutionary polynomial regression, support vector machines, M5 model trees, and K-nearest neighbors are proposed and explained. Multiple linear regression and naïve models are also suggested as baseline for comparison with the various techniques. Five datasets from Canada and Europe representing evapotranspiration, upper and lower layer soil moisture content, and rainfall-runoff process are described and proposed, in the second paper, for the modeling experiment. Twelve different realizations (groups) from each dataset are created by a procedure involving random sampling. Each group contains three subsets; training, cross-validation, and testing. Each modeling technique is proposed to be applied to each of the 12 groups of each dataset. This way, both prediction accuracy and uncertainty of the modeling techniques can be evaluated. The description of the datasets, the implementation of the modeling techniques, results and analysis, and the findings of the modeling experiment are deferred to the second part of this paper.

Download Full-text

Comparison of data-driven methods for downscaling ensemble weather forecasts

Hydrology and Earth System Sciences Discussions ◽

10.5194/hessd-4-189-2007 ◽

2007 ◽

Vol 4 (1) ◽

pp. 189-210 ◽

Cited By ~ 3

Author(s):

X. Liu ◽

P. Coulibaly ◽

N. Evora

Keyword(s):

Polynomial Regression ◽

Daily Precipitation ◽

Temperature Series ◽

Data Driven ◽

Daily Maximum ◽

Ensemble Forecasts ◽

Weather Forecasts ◽

Evolutionary Polynomial Regression ◽

Medium Range Forecast ◽

Precipitation And Temperature

Abstract. This study investigates dynamically different data-driven methods, specifically a statistical downscaling model (SDSM), a time lagged feedforward neural network (TLFN), and an evolutionary polynomial regression (EPR) technique for downscaling numerical weather ensemble forecasts generated by a medium range forecast (MRF) model. Given the coarse resolution (about 200-km grid spacing) of the MRF model, an optimal use of the weather forecasts at the local or watershed scale, requires appropriate downscaling techniques. The selected methods are applied for downscaling ensemble daily precipitation and temperature series for the Chute-du-Diable basin located in northeastern Canada. The downscaling results show that the TLFN and EPR have similar performance in downscaling ensemble daily precipitation as well as daily maximum and minimum temperature series whatever the season. Both the TLFN and EPR are more efficient downscaling techniques than SDSM for both the ensemble daily precipitation and temperature.

Download Full-text

Comparison of data-driven methods for downscaling ensemble weather forecasts

Hydrology and Earth System Sciences ◽

10.5194/hess-12-615-2008 ◽

2008 ◽

Vol 12 (2) ◽

pp. 615-624 ◽

Cited By ~ 14

Author(s):

◽

P. Coulibaly ◽

N. Evora

Keyword(s):

Polynomial Regression ◽

Daily Precipitation ◽

Temperature Series ◽

Data Driven ◽

Daily Maximum ◽

Ensemble Forecasts ◽

Weather Forecasts ◽

Evolutionary Polynomial Regression ◽

Medium Range Forecast ◽

Precipitation And Temperature

Abstract. This study investigates dynamically different data-driven methods, specifically a statistical downscaling model (SDSM), a time lagged feedforward neural network (TLFN), and an evolutionary polynomial regression (EPR) technique for downscaling numerical weather ensemble forecasts generated by a medium range forecast (MRF) model. Given the coarse resolution (about 200-km grid spacing) of the MRF model, an optimal use of the weather forecasts at the local or watershed scale, requires appropriate downscaling techniques. The selected methods are applied for downscaling ensemble daily precipitation and temperature series for the Chute-du-Diable basin located in northeastern Canada. The downscaling results show that the TLFN and EPR have similar performance in downscaling ensemble daily precipitation as well as daily maximum and minimum temperature series whatever the season. Both the TLFN and EPR are more efficient downscaling techniques than SDSM for both the ensemble daily precipitation and temperature.

Download Full-text

Closure to “Gene-Expression Programming, Evolutionary Polynomial Regression, and Model Tree to Evaluate Local Scour Depth at Culvert Outlets” by Mohammad Najafzadeh and Ali Reza Kargar

Journal of Pipeline Systems Engineering and Practice ◽

10.1061/(asce)ps.1949-1204.0000533 ◽

2021 ◽

Vol 12 (2) ◽

pp. 07021002

Author(s):

Mohammad Najafzadeh ◽

Ali Reza Kargar

Keyword(s):

Gene Expression ◽

Polynomial Regression ◽

Gene Expression Programming ◽

Local Scour ◽

Scour Depth ◽

Evolutionary Polynomial Regression ◽

Model Tree

Download Full-text

Discussion of “Gene-Expression Programming, Evolutionary Polynomial Regression, and Model Tree to Evaluate Local Scour Depth at Culvert Outlets” by Mohammad Najafzadeh and Ali Reza Kargar

Journal of Pipeline Systems Engineering and Practice ◽

10.1061/(asce)ps.1949-1204.0000532 ◽

2021 ◽

Vol 12 (2) ◽

pp. 07021001

Author(s):

Manish Pandey ◽

H. Md Azamathulla

Keyword(s):

Gene Expression ◽

Polynomial Regression ◽

Gene Expression Programming ◽

Local Scour ◽

Scour Depth ◽

Evolutionary Polynomial Regression ◽

Model Tree

Download Full-text

An Indoor Visible Light Positioning System Using Tilted LEDs with High Accuracy

Sensors ◽

10.3390/s21030920 ◽

2021 ◽

Vol 21 (3) ◽

pp. 920

Author(s):

Neha Chaudhary ◽

Othman Isam Younus ◽

Luis Nero Alves ◽

Zabih Ghassemlooy ◽

Stanislav Zvanovec ◽

...

Keyword(s):

Visible Light ◽

Polynomial Regression ◽

Power Level ◽

Least Square ◽

Line Of Sight ◽

Received Power ◽

Strength Based ◽

Complex Linear ◽

Linear Least Square ◽

First Time

The accuracy of the received signal strength-based visible light positioning (VLP) system in indoor applications is constrained by the tilt angles of transmitters (Txs) and receivers as well as multipath reflections. In this paper, for the first time, we show that tilting the Tx can be beneficial in VLP systems considering both line of sight (LoS) and non-line of sight transmission paths. With the Txs oriented towards the center of the receiving plane (i.e., the pointing center F), the received power level is maximized due to the LoS components on F. We also show that the proposed scheme offers a significant accuracy improvement of up to ~66% compared with a typical non-tilted Tx VLP at a dedicated location within a room using a low complex linear least square algorithm with polynomial regression. The effect of tilting the Tx on the lighting uniformity is also investigated and results proved that the uniformity achieved complies with the European Standard EN 12464-1. Furthermore, we show that the accuracy of VLP can be further enhanced with a minimum positioning error of 8 mm by changing the height of F.

Download Full-text

A Spatiotemporal Convolutional Gated Recurrent Unit Network for Mean Wave Period Field Forecasting

Journal of Marine Science and Engineering ◽

10.3390/jmse9040383 ◽

2021 ◽

Vol 9 (4) ◽

pp. 383

Author(s):

Ting Yu ◽

Jichao Wang

Keyword(s):

Lead Time ◽

Wave Period ◽

Data Driven ◽

Lead Times ◽

Proposed Model ◽

Data Driven Approach ◽

Mean Wave Period ◽

Scattering Index ◽

Gated Recurrent Unit ◽

Methods Numerical

Mean wave period (MWP) is one of the key parameters affecting the design of marine facilities. Currently, there are two main methods, numerical and data-driven methods, for forecasting wave parameters, of which the latter are widely used. However, few studies have focused on MWP forecasting, and even fewer have investigated it with spatial and temporal information. In this study, correlations between ocean dynamic parameters are explored to obtain appropriate input features, significant wave height (SWH) and MWP. Subsequently, a data-driven approach, the convolution gated recurrent unit (Conv-GRU) model with spatiotemporal characteristics, is utilized to field forecast MWP with 1, 3, 6, 12, and 24-h lead times in the South China Sea. Six points at different locations and six consecutive moments at every 12-h intervals are selected to study the forecasting ability of the proposed model. The Conv-GRU model has a better performance than the single gated recurrent unit (GRU) model in terms of root mean square error (RMSE), the scattering index (SI), Bias, and the Pearson’s correlation coefficient (R). With the lead time increasing, the forecast effect shows a decreasing trend, specifically, the experiment displays a relatively smooth forecast curve and presents a great advantage in the short-term forecast of the MWP field in the Conv-GRU model, where the RMSE is 0.121 m for 1-h lead time.

Download Full-text