scholarly journals Estimating the Growing Stem Volume of Coniferous Plantations Based on Random Forest Using an Optimized Variable Selection Method

Sensors ◽  
2020 ◽  
Vol 20 (24) ◽  
pp. 7248
Author(s):  
Fugen Jiang ◽  
Mykola Kutia ◽  
Arbi J. Sarkissian ◽  
Hui Lin ◽  
Jiangping Long ◽  
...  

Forest growing stem volume (GSV) reflects the richness of forest resources as well as the quality of forest ecosystems. Remote sensing technology enables robust and efficient GSV estimation as it greatly reduces the survey time and cost while facilitating periodic monitoring. Given its red edge bands and a short revisit time period, Sentinel-2 images were selected for the GSV estimation in Wangyedian forest farm, Inner Mongolia, China. The variable combination was shown to significantly affect the accuracy of the estimation model. After extracting spectral variables, texture features, and topographic factors, a stepwise random forest (SRF) method was proposed to select variable combinations and establish random forest regressions (RFR) for GSV estimation. The linear stepwise regression (LSR), Boruta, Variable Selection Using Random Forests (VSURF), and random forest (RF) methods were then used as references for comparison with the proposed SRF for selection of predictors and GSV estimation. Combined with the observed GSV data and the Sentinel-2 images, the distributions of GSV were generated by the RFR models with the variable combinations determined by the LSR, RF, Boruta, VSURF, and SRF. The results show that the texture features of Sentinel-2’s red edge bands can significantly improve the accuracy of GSV estimation. The SRF method can effectively select the optimal variable combination, and the SRF-based model results in the highest estimation accuracy with the decreases of relative root mean square error by 16.4%, 14.4%, 16.3%, and 10.6% compared with those from the LSR-, RF-, Boruta-, and VSURF-based models, respectively. The GSV distribution generated by the SRF-based model matched that of the field observations well. The results of this study are expected to provide a reference for GSV estimation of coniferous plantations.

2021 ◽  
Vol 13 (14) ◽  
pp. 2740
Author(s):  
Xinyu Li ◽  
Hui Lin ◽  
Jiangping Long ◽  
Xiaodong Xu

Accurate measurement of forest growing stem volume (GSV) is important for forest resource management and ecosystem dynamics monitoring. Optical remote sensing imagery has great application prospects in forest GSV estimation on regional and global scales as it is easily accessible, has a wide coverage, and mature technology. However, their application is limited by cloud coverage, data stripes, atmospheric effects, and satellite sensor errors. Combining multi-sensor data can reduce such limitations as it increases the data availability, but also causes the multi-dimensional problem that increases the difficulty of feature selection. In this study, GaoFen-2 (GF-2) and Sentinel-2 images were integrated, and feature variables and data scenarios were derived by a proposed adaptive feature variable combination optimization (AFCO) program for estimating the GSV of coniferous plantations. The AFCO algorithm was compared to four traditional feature variable selection methods, namely, random forest (RF), stepwise random forest (SRF), fast iterative feature selection method for k-nearest neighbors (KNN-FIFS), and the feature variable screening and combination optimization procedure based on the distance correlation coefficient and k-nearest neighbors (DC-FSCK). The comparison indicated that the AFCO program not only considered the combination effect of feature variables, but also optimized the selection of the first feature variable, error threshold, and selection of the estimation model. Furthermore, we selected feature variables from three datasets (GF-2, Sentinel-2, and the integrated data) following the AFCO and four other feature selection methods and used the k-nearest neighbors (KNN) and random forest regression (RFR) to estimate the GSV of coniferous plantations in northern China. The results indicated that the integrated data improved the GSV estimation accuracy of coniferous plantations, with relative root mean square errors (RMSErs) of 15.0% and 19.6%, which were lower than those of GF-2 and Sentinel-2 data, respectively. In particular, the texture feature variables derived from GF-2 red band image have a significant impact on GSV estimation performance of the integrated dataset. For most data scenarios, the AFCO algorithm gained more accurate GSV estimates, as the RMSErs were 30.0%, 23.7%, 17.7%, and 17.5% lower than those of RF, SRF, KNN-FIFS, and DC-FSCK, respectively. The GSV distribution map obtained by the AFCO method and RFR model matched the field observations well. This study provides some insight into the application of optical images, optimization of the feature variable combination, and modeling algorithm selection for estimating the GSV of coniferous plantations.


2020 ◽  
Vol 12 (21) ◽  
pp. 3504
Author(s):  
Xingrong Li ◽  
Chenghai Yang ◽  
Wenjiang Huang ◽  
Jia Tang ◽  
Yanqin Tian ◽  
...  

Cotton root rot is a destructive cotton disease and significantly affects cotton quality and yield, and accurate identification of its distribution within fields is critical for cotton growers to control the disease effectively. In this study, Sentinel-2 images were used to explore the feasibility of creating classification maps and prescription maps for site-specific fungicide application. Eight cotton fields with different levels of root rot were selected and random forest (RF) was used to identify the optimal spectral indices and texture features of the Sentinel-2 images. Five optimal spectral indices (plant senescence reflectance index (PSRI), normalized difference vegetation index (NDVI), normalized difference water index (NDWI1), moisture stressed index (MSI), and renormalized difference vegetation index (RDVI)) and seven optimal texture features (Contrast 1, Dissimilarity 1, Entory 2, Mean 1, Variance 1, Homogeneity 1, and Second moment 2) were identified. Three binary logistic regression (BLR) models, including a spectral model, a texture model, and a spectral-texture model, were constructed for cotton root rot classification and prescription map creation. The results were compared with classification maps and prescription maps based on airborne imagery. Accuracy assessment showed that the accuracies of the classification maps for the spectral, texture, and spectral-texture models were 92.95%, 84.81%, and 91.87%, respectively, and the accuracies of the prescription maps for the three respective models were 90.83%, 87.14%, and 91.40%. These results confirmed that it was feasible to identify cotton root rot and create prescription maps using different features of Sentinel-2 imagery. The addition of texture features had little effect on the overall accuracy, but it could improve the ability to identify root rot areas. The producer’s accuracy (PA) for infested cotton in the classification maps for the texture model and the spectral-texture model was 2.82% and 1.07% higher, respectively, than that of the spectral model, and the PA for treatment zones in the prescription maps for the two respective models was 8.6% and 8.22% higher than that of the spectral model. Results based on the eight cotton fields showed that the spectral model was appropriate for the cotton fields with relatively severe infestation and the spectral-texture model was more appropriate for the cotton fields with low or moderate infestation.


2020 ◽  
Vol 12 (18) ◽  
pp. 2929
Author(s):  
Jinlong Gao ◽  
Jie Liu ◽  
Tiangang Liang ◽  
Mengjing Hou ◽  
Jing Ge ◽  
...  

Nondestructive and accurate estimating of the forage nitrogen–phosphorus (N:P) ratio is conducive to the real-time diagnosis of nutrient limitation and the formulation of a management scheme during the growth and development of forage. New-generation high-resolution remote sensors equipped with strategic red-edge wavebands offer opportunities and challenges for estimating and mapping forage N:P ratio in support of the sustainable utilization of alpine grassland resources. This study aims to detect the forage N:P ratio as an ecological indicator of grassland nutrient content by employing Sentinel-2 multispectral instrument (MSI) data and a random forest (RF) algorithm. The results showed that the estimation accuracy (R2) of the forage N:P ratio model established by combining the optimized spectral bands and vegetation indices (VIs) is 0.49 and 0.59 in the vigorous growth period (July) and the senescing period (November) of forage, respectively. Moreover, Sentinel-2 MSI B9 and B12 bands contributed greatly to the estimation of the forage N:P ratio, and the VIs (RECI2) constructed by B5 and B8A bands performed well in the estimation of the forage N:P ratio. Overall, it is promising to map the spatial distribution of the forage N:P ratio in alpine grassland using Sentinel-2 MSI data at regional scales. This study will be potentially beneficial in implementing precise positioning of vegetation nutrient deficiency and scientific fertilization management of grassland.


2021 ◽  
Vol 13 (11) ◽  
pp. 2027
Author(s):  
Mukunda Dev Behera ◽  
Surbhi Barnwal ◽  
Somnath Paramanik ◽  
Pulakesh Das ◽  
Bimal Kumar Bhattyacharya ◽  
...  

Although studies on species-level classification and mapping using multisource data and machine learning approaches are plenty, the use of data with ideal placement of central wavelength and bandwidth at appropriate spatial resolution, for the classification of mangrove species is underreported. The species composition of a mangrove forest has been estimated utilising the red-edge spectral bands and chlorophyll absorption information from AVIRIS-NG and Sentinel-2 data. In this study, three dominant species, Heritiera fomes, Excoecaria agallocha and Avicennia officinalis, have been classified using the random forest (RF) model for a mangrove forest in Bhitarkanika Wildlife Sanctuary, India. Various combinations of reflectance/backscatter bands and vegetation indices derived from Sentinel-2, AVIRIS-NG, and Sentinel-1 were used for species-level discrimination and mapping. The RF model showed maximum accuracy using Sentinel-2, followed by the AVIRIS-NG, in discriminating three dominant species and two mixed compositions. This study indicates the potential of Sentinel-2 data for discriminating various mangrove species owing to the appropriate placement of central wavelength and bandwidth in Sentinel-2 at ≥10 m spatial resolution. The variable importance plots proved that species-level classification could be attempted using red edge and chlorophyll absorption information. This study has wider applicability in other mangrove forests around the world.


2020 ◽  
Vol 12 (5) ◽  
pp. 871 ◽  
Author(s):  
Xinyu Li ◽  
Zhaohua Liu ◽  
Hui Lin ◽  
Guangxing Wang ◽  
Hua Sun ◽  
...  

Accurately estimating growing stem volume (GSV) is very important for forest resource management. The GSV estimation is affected by remote sensing images, variable selection methods, and estimation algorithms. Optical images have been widely used for modeling key attributes of forest stands, including GSV and aboveground biomass (AGB), because of their easy availability, large coverage and related mature data processing and analysis technologies. However, the low data saturation level and the difficulty of selecting feature variables from optical images often impede the improvement of estimation accuracy. In this research, two GaoFen-2 (GF-2) images, a Landsat 8 image, and fused images created by integrating GF-2 bands with the Landsat multispectral image using the Gram–Schmidt method were first used to derive various feature variables and obtain various datasets or data scenarios. A DC-FSCK approach that integrates feature variable screening and a combination optimization procedure based on the distance correlation coefficient and k-nearest neighbors (kNN) algorithm was proposed and compared with the stepwise regression analysis (SRA) and random forest (RF) for feature variable selection. The DC-FSCK considers the self-correlation and combination effect among feature variables so that the selected variables can improve the accuracy and saturation level of GSV estimation. To validate the proposed approach, six estimation algorithms were examined and compared, including Multiple Linear Regression (MLR), kNN, Support Vector Regression (SVR), RF, eXtreme Gradient Boosting (XGBoost) and Stacking. The results showed that compared with GF-2 and Landsat 8 images, overall, the fused image (Red_Landsat) of GF-2 red band with Landsat 8 multispectral image improved the GSV estimation accuracy of Chinese pine and larch plantations. The Red_Landsat image also performed better than other fused images (Pan_Landsat, Blue_Landsat, Green_Landsat and Nir_Landsat). For most of the combinations of the datasets and estimation models, the proposed variable selection method DC-FSCK led to more accurate GSV estimates compared with SRA and RF. In addition, in most of the combinations obtained by the datasets and variable selection methods, the Stacking algorithm performed better than other estimation models. More importantly, the combination of the fused image Red_Landsat with the DC-FSCK and Stacking algorithm led to the best performance of GSV estimation with the greatest adjusted coefficients of determination, 0.8127 and 0.6047, and the smallest relative root mean square errors of 17.1% and 20.7% for Chinese pine and larch, respectively. This study provided new insights on how to choose suitable optical images, variable selection methods and optimal modeling algorithms for the GSV estimation of Chinese pine and larch plantations.


2018 ◽  
Vol 10 (11) ◽  
pp. 1794 ◽  
Author(s):  
Magnus Persson ◽  
Eva Lindberg ◽  
Heather Reese

The Sentinel-2 program provides the opportunity to monitor terrestrial ecosystems with a high temporal and spectral resolution. In this study, a multi-temporal Sentinel-2 data set was used to classify common tree species over a mature forest in central Sweden. The tree species to be classified were Norway spruce (Picea abies), Scots pine (Pinus silvestris), Hybrid larch (Larix × marschlinsii), Birch (Betula sp.) and Pedunculate oak (Quercus robur). Four Sentinel-2 images from spring (7 April and 27 May), summer (9 July) and fall (19 October) of 2017 were used along with the Random Forest (RF) classifier. A variable selection approach was implemented to find fewer and uncorrelated bands resulting in the best model for tree species identification. The final model resulting in the highest overall accuracy (88.2%) came from using all bands from the four image dates. The single image that gave the most accurate classification result (80.5%) was the late spring image (27 May); the 27 May image was always included in subsequent image combinations that gave the highest overall accuracy. The five tree species were classified with a user’s accuracy ranging from 70.9% to 95.6%. Thirteen of the 40 bands were selected in a variable selection procedure and resulted in a model with only slightly lower accuracy (86.3%) than that using all bands. Among the highest ranked bands were the red edge bands 2 and 3 as well as the narrow NIR (near-infrared) band 8a, all from the 27 May image, and SWIR (short-wave infrared) bands from all four image dates. This study shows that the red-edge bands and SWIR bands from Sentinel-2 are of importance, and confirms that spring and/or fall images capturing phenological differences between the species are most useful to tree species classification.


2019 ◽  
Vol 139 (8) ◽  
pp. 850-857
Author(s):  
Hiromu Imaji ◽  
Takuya Kinoshita ◽  
Toru Yamamoto ◽  
Keisuke Ito ◽  
Masahiro Yoshida ◽  
...  

2021 ◽  
Vol 13 (2) ◽  
pp. 233
Author(s):  
Ilja Vuorinne ◽  
Janne Heiskanen ◽  
Petri K. E. Pellikka

Biomass is a principal variable in crop monitoring and management and in assessing carbon cycling. Remote sensing combined with field measurements can be used to estimate biomass over large areas. This study assessed leaf biomass of Agave sisalana (sisal), a perennial crop whose leaves are grown for fibre production in tropical and subtropical regions. Furthermore, the residue from fibre production can be used to produce bioenergy through anaerobic digestion. First, biomass was estimated for 58 field plots using an allometric approach. Then, Sentinel-2 multispectral satellite imagery was used to model biomass in an 8851-ha plantation in semi-arid south-eastern Kenya. Generalised Additive Models were employed to explore how well biomass was explained by various spectral vegetation indices (VIs). The highest performance (explained deviance = 76%, RMSE = 5.15 Mg ha−1) was achieved with ratio and normalised difference VIs based on the green (R560), red-edge (R740 and R783), and near-infrared (R865) spectral bands. Heterogeneity of ground vegetation and resulting background effects seemed to limit model performance. The best performing VI (R740/R783) was used to predict plantation biomass that ranged from 0 to 46.7 Mg ha−1 (mean biomass 10.6 Mg ha−1). The modelling showed that multispectral data are suitable for assessing sisal leaf biomass at the plantation level and in individual blocks. Although these results demonstrate the value of Sentinel-2 red-edge bands at 20-m resolution, the difference from the best model based on green and near-infrared bands at 10-m resolution was rather small.


2021 ◽  
Vol 13 (8) ◽  
pp. 1595
Author(s):  
Chunhua Li ◽  
Lizhi Zhou ◽  
Wenbin Xu

Wetland vegetation aboveground biomass (AGB) directly indicates wetland ecosystem health and is critical for water purification, carbon cycle, and biodiversity conservation. Accurate AGB estimation is essential for the monitoring and supervision of ecosystems, especially in seasonal floodplain wetlands. This paper explored the capability of spectral and texture features from the Sentinel-2 Multispectral Instrument (MSI) for modeling grassland AGB using random forest (RF) and extreme gradient boosting (XGBoost) algorithms in Shengjin Lake wetland (a Ramsar site). We use five-fold cross-validation to verify the model effectiveness. The results indicated that the RF and XGBoost models had a robust and efficient performance (with root mean square error (RMSE) of 126.571 g·m−2 and R2 of 0.844 for RF, RMSE of 112.425 g·m−2 and R2 of 0.869 for XGBoost), and the XGBoost models, by contrast, performed better. Both traditional and red-edge vegetation indices (VIs) obtained satisfactory results of AGB estimation (RMSE = 127.936 g·m−2, RMSE = 125.879 g·m−2 in XGBoost models, respectively), with the red-edge VIs contributed more to the AGB models. Moreover, we selected eight gray-level co-occurrence matrix (GLCM) textures calculated by four processing window sizes using the mean value of four offsets, and further analyzed the results of three analysis sets. Textures derived from traditional and red-edge bands using a 7 × 7 window size performed better in biomass estimation. This finding suggested that textures derived from the traditional bands were as important as the red-edge bands. The introduction of textures moderately improved the accuracy of modeling AGB, whereas the use of textures alo ne was not satisfactory. This research demonstrated that using the Sentinel-2 MSI and the two ensemble algorithms is an effective method for long-term dynamic monitoring and assessment of grass AGB in seasonal floodplain wetlands, which can support sustainable management and carbon accounting of wetland ecosystems.


2020 ◽  
Vol 12 (17) ◽  
pp. 2760
Author(s):  
Gourav Misra ◽  
Fiona Cawkwell ◽  
Astrid Wingler

Remote sensing of plant phenology as an indicator of climate change and for mapping land cover has received significant scientific interest in the past two decades. The advancing of spring events, the lengthening of the growing season, the shifting of tree lines, the decreasing sensitivity to warming and the uniformity of spring across elevations are a few of the important indicators of trends in phenology. The Sentinel-2 satellite sensors launched in June 2015 (A) and March 2017 (B), with their high temporal frequency and spatial resolution for improved land mapping missions, have contributed significantly to knowledge on vegetation over the last three years. However, despite the additional red-edge and short wave infra-red (SWIR) bands available on the Sentinel-2 multispectral instruments, with improved vegetation species detection capabilities, there has been very little research on their efficacy to track vegetation cover and its phenology. For example, out of approximately every four papers that analyse normalised difference vegetation index (NDVI) or enhanced vegetation index (EVI) derived from Sentinel-2 imagery, only one mentions either SWIR or the red-edge bands. Despite the short duration that the Sentinel-2 platforms have been operational, they have proved their potential in a wide range of phenological studies of crops, forests, natural grasslands, and other vegetated areas, and in particular through fusion of the data with those from other sensors, e.g., Sentinel-1, Landsat and MODIS. This review paper discusses the current state of vegetation phenology studies based on the first five years of Sentinel-2, their advantages, limitations, and the scope for future developments.


Sign in / Sign up

Export Citation Format

Share Document