Forest Type Classification Based on Integrated Spectral-Spatial-Temporal Features and Random Forest Algorithm—A Case Study in the Qinling Mountains

Kai Cheng; Juanle Wang

doi:10.3390/f10070559

Forest Type Classification Based on Integrated Spectral-Spatial-Temporal Features and Random Forest Algorithm—A Case Study in the Qinling Mountains

Forests ◽

10.3390/f10070559 ◽

2019 ◽

Vol 10 (7) ◽

pp. 559 ◽

Cited By ~ 5

Author(s):

Kai Cheng ◽

Juanle Wang

Keyword(s):

Time Series ◽

Random Forest ◽

Forest Type ◽

Evergreen Forest ◽

Recursive Feature Elimination ◽

Validation Dataset ◽

Qinling Mountains ◽

Temporal Features ◽

Time Series Images ◽

Type Classification

Spectral, spatial, and temporal features play important roles in land cover classification. However, limitations still exist in the integrated application of spectral-spatial-temporal (SST) features for forest type discrimination. This paper proposes a forest type classification framework based on SST features and the random forest (RF) algorithm. The SST features were derived from time-series images using original bands, vegetation index, gray-level co-occurrence matrix, and harmonic analysis. Random forest-recursive feature elimination (RF-RFE) was used to optimize the high-dimensional and correlated feature space and to determine the optimal SST feature set. Then, the classification was carried out using an RF classifier and the optimized SST feature set. This method was applied in the Qinling Mountains using Sentinel-2 time-series images. A total of 21 SST features were obtained through the RF-RFE method, and their importance was evaluated using the Gini index. The results indicated that spectral features contribute the most to separating shrubs, spatial features are more suitable for discrimination among evergreen forest types, and temporal features are more useful for evergreen forest, deciduous forest, and shrub types. The forest type map was generated based on the optimal SST feature set and RF algorithm, and evaluated by its agreement with the validation dataset. The results showed that this integrated method is reliable, with an overall accuracy of 86.88% and kappa coefficient of 0.86, and can support sustainable forest management and forest type mapping at the local scale.
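
The harmonic analysis behind the temporal features can be illustrated with a minimal sketch. It assumes evenly spaced samples covering exactly one year and fits only a single annual harmonic; the abstract does not state the formulation the authors used. The function name `first_harmonic` and the synthetic NDVI-like values are hypothetical.

```python
import math

def first_harmonic(values):
    """Return (mean, amplitude, phase) of the first annual harmonic.

    Assumption: `values` are evenly spaced samples spanning exactly one year,
    so the least-squares fit reduces to simple cosine and sine projections.
    """
    n = len(values)
    mean = sum(values) / n
    # Project the series onto one cosine and one sine cycle per year.
    a = 2.0 / n * sum(v * math.cos(2 * math.pi * i / n) for i, v in enumerate(values))
    b = 2.0 / n * sum(v * math.sin(2 * math.pi * i / n) for i, v in enumerate(values))
    amplitude = math.hypot(a, b)   # strength of the seasonal cycle
    phase = math.atan2(b, a)       # timing of the seasonal peak, in radians
    return mean, amplitude, phase

# Synthetic monthly NDVI-like series (illustrative values only).
peak_phase = 5 * math.pi / 6
ndvi = [0.5 + 0.3 * math.cos(2 * math.pi * m / 12 - peak_phase) for m in range(12)]
mean, amplitude, phase = first_harmonic(ndvi)
```

In this reading, a deciduous pixel would tend to show a larger amplitude than an evergreen one, which is why such coefficients may serve as temporal features.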

Automatic Mapping of Irrigated Areas in Mediteranean Context Using Landsat 8 Time Series Images and Random Forest Algorithm

IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss.2018.8517810 ◽

2018 ◽

Cited By ~ 2

Author(s):

Z. Benbahria ◽

I. Sebari ◽

H. Hajji ◽

M. F. Smiej

Keyword(s):

Time Series ◽

Random Forest ◽

Landsat 8 ◽

Random Forest Algorithm ◽

Automatic Mapping ◽

Time Series Images

Forest-Type Classification Using Time-Weighted Dynamic Time Warping Analysis in Mountain Areas: A Case Study in Southern China

Forests ◽

10.3390/f10111040 ◽

2019 ◽

Vol 10 (11) ◽

pp. 1040 ◽

Cited By ~ 2

Author(s):

Kai Cheng ◽

Juanle Wang

Keyword(s):

Time Series ◽

Dynamic Time Warping ◽

Forest Type ◽

Southern China ◽

Training Data ◽

Mountain Forest ◽

Time Warping ◽

Mountain Areas ◽

Dynamic Time ◽

Time Series Images

Efficient methodologies for mapping forest types in complicated mountain areas are essential for the implementation of sustainable forest management practices and monitoring. Existing solutions dedicated to forest-type mapping are primarily focused on supervised machine learning algorithms (MLAs) using remote sensing time-series images. However, MLAs are challenged by complex and problematic forest type compositions, lack of training data, loss of temporal data caused by cloud obscuration, and selection of input feature sets for mountainous areas. Time-weighted dynamic time warping (TWDTW) is a supervised classifier that adapts the dynamic time warping method for time series analysis to land cover classification. This study evaluates the performance of the TWDTW method, using a combination of Sentinel-2 and Landsat-8 time-series images, when applied to mountain forest-type classification in southern China, where topographic conditions and forest-type compositions are complex. The classification outputs were compared to those produced by MLAs, including random forest (RF) and support vector machine (SVM). The results showed that the three forest-type maps obtained by TWDTW, RF, and SVM have high consistency in spatial distribution. TWDTW outperformed SVM and RF with mean overall accuracy and mean kappa coefficient of 93.81% and 0.93, respectively, followed by RF and SVM. TWDTW achieved higher classification accuracy than RF and SVM even with less training data. This demonstrates the robustness of the TWDTW method and its lower sensitivity to training samples when applied to mountain forest-type classification.
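
The idea of adding a time penalty to dynamic time warping can be sketched as follows. This is a simplified illustration, not the implementation evaluated in the study: it aligns two whole sequences, uses a logistic time weight with assumed parameters `alpha` and `beta`, and ignores the wrap-around of day-of-year values. All names and sample values are hypothetical.

```python
import math

def twdtw_distance(pattern, series, alpha=0.1, beta=100.0):
    """Time-weighted DTW distance between two lists of (day, value) pairs.

    Assumption: the local cost is the absolute value difference plus a
    logistic penalty that grows with the gap in days between observations.
    """
    n, m = len(pattern), len(series)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        day_p, val_p = pattern[i - 1]
        for j in range(1, m + 1):
            day_s, val_s = series[j - 1]
            gap = abs(day_p - day_s)
            # Logistic weight: near 0 for small gaps, near 1 for gaps well above beta.
            weight = 1.0 / (1.0 + math.exp(-alpha * (gap - beta)))
            local = abs(val_p - val_s) + weight
            # Standard DTW recurrence over the three allowed predecessor cells.
            cost[i][j] = local + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]

# Illustrative pattern: 12 observations, 30 days apart.
pattern = [(30 * k, 0.5 + 0.3 * math.sin(2 * math.pi * k / 12)) for k in range(12)]
shift_10 = [(day + 10, value) for day, value in pattern]
shift_150 = [(day + 150, value) for day, value in pattern]

d_same = twdtw_distance(pattern, pattern)
d_near = twdtw_distance(pattern, shift_10)
d_far = twdtw_distance(pattern, shift_150)
```

The same values observed 150 days later score as much more distant than the same values observed 10 days later, which plain DTW would not distinguish as sharply.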

Land Cover Classification using Google Earth Engine and Random Forest Classifier—The Role of Image Composition

Remote Sensing ◽

10.3390/rs12152411 ◽

2020 ◽

Vol 12 (15) ◽

pp. 2411 ◽

Cited By ~ 2

Author(s):

Thanh Noi Phan ◽

Verena Kuch ◽

Lukas W. Lehnert

Keyword(s):

Time Series ◽

Random Forest ◽

Land Cover ◽

Land Cover Classification ◽

High Accuracy ◽

Google Earth ◽

Temporal Aggregation ◽

Google Earth Engine ◽

Time Series Images ◽

Land Cover Maps

Land cover information plays a vital role in many aspects of life, from scientific and economic to political. Accurate information about land cover affects the accuracy of all subsequent applications; therefore, accurate and timely land cover information is in high demand. In land cover classification studies over the past decade, higher accuracies were produced when using time series satellite images than when using single date images. Recently, the availability of the Google Earth Engine (GEE), a cloud-based computing platform, has gained the attention of remote sensing based applications where temporal aggregation methods derived from time series images are widely applied (i.e., the use of metrics such as the mean or median) instead of the time series images themselves. In GEE, many studies simply select as many images as possible to fill gaps without considering how images from different years or seasons might affect the classification accuracy. This study aims to analyze the effect of different composition methods, as well as different input images, on the classification results. We use Landsat 8 surface reflectance (L8sr) data with eight different combination strategies to produce and evaluate land cover maps for a study area in Mongolia. We implemented the experiment on the GEE platform with a widely applied algorithm, the Random Forest (RF) classifier. Our results show that all eight datasets produced moderately to highly accurate land cover maps, with overall accuracy over 84.31%. Among the eight datasets, two time series datasets of summer scenes (images from 1 June to 30 September) produced the highest accuracy (89.80% and 89.70%), followed by the median composite of the same input images (88.74%). The difference between these three classifications was not significant based on the McNemar test (p > 0.05). However, a significant difference (p < 0.05) was observed for all other pairs involving one of these three datasets. The results indicate that temporal aggregation (e.g., median) is a promising method, which not only significantly reduces data volume (resulting in an easier and faster analysis) but also produces accuracy as high as that of time series data. The spatial consistency among the classification results was relatively low compared to the generally high accuracy, showing that the selection of the dataset used in any classification on GEE is a crucial step, because the input images for the composition play an essential role in land cover classification, particularly in snowy, cloudy and expansive areas like Mongolia.
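
The McNemar comparison of two classifications can be sketched in a few lines. The abstract does not say which variant was used; this sketch assumes the chi-square form with continuity correction and one degree of freedom. The labels and sample sizes are invented for illustration.

```python
import math

def mcnemar(reference, pred_a, pred_b):
    """Return (statistic, p_value) comparing two classifiers on the same samples.

    Assumption: continuity-corrected chi-square McNemar test with 1 degree of freedom.
    """
    # b: A correct and B wrong; c: A wrong and B correct.
    b = sum(1 for r, a, p in zip(reference, pred_a, pred_b) if a == r and p != r)
    c = sum(1 for r, a, p in zip(reference, pred_a, pred_b) if a != r and p == r)
    if b + c == 0:
        return 0.0, 1.0  # the classifiers never disagree on correctness
    stat = (abs(b - c) - 1) ** 2 / (b + c)
    # Survival function of chi-square with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value

# Illustrative validation labels and two sets of predictions.
reference = ["forest"] * 20
pred_a = ["forest"] * 10 + ["grass"] * 2 + ["forest"] * 8
pred_b = ["grass"] * 10 + ["forest"] * 2 + ["forest"] * 8
stat, p_value = mcnemar(reference, pred_a, pred_b)
```

Only the samples on which the two maps disagree in correctness enter the statistic, so two maps with similar overall accuracy can still differ significantly.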

Mapping Forest Types in China with 10 m Resolution Based on Spectral–Spatial–Temporal Features

Remote Sensing ◽

10.3390/rs13050973 ◽

2021 ◽

Vol 13 (5) ◽

pp. 973

Author(s):

Kai Cheng ◽

Juanle Wang ◽

Xinrong Yan

Keyword(s):

Remote Sensing ◽

Forest Type ◽

Google Earth ◽

Growth Patterns ◽

Forest Types ◽

Forest Classification ◽

Classification Framework ◽

Temporal Features ◽

Occurrence Matrix ◽

Type Classification

The comprehensive application of spectral, spatial, and temporal (SST) features derived from remote sensing images is a significant technique for classifying and mapping forest types. Facing limitations in the availability of detailed forest type identification processes for large regions, a forest type classification framework based on SST features was developed in this study. The advantages of Sentinel-2 and Landsat series imagery were used to extract SST forest type classification features, using red-edge bands, a gray-level co-occurrence matrix, and harmonic analysis, with the assistance of the Google Earth Engine platform. Considering four representative Chinese geographic regions—middle and high latitudes, complex mountainous areas, cloudy and rainy areas, and the N–S climate transition zone—our method was proven to be effective, with overall classification accuracies > 85%. The scheme to assess the importance of SST features for forest classification in various regions was designed using the Gini criterion in the random forest algorithm and revealed that spectral features were more effective in classifying forest types with complex compositions. Temporal features were found to be favorable for identifying forest types with obvious evergreen and deciduous growth patterns, while spatial features produced better classification results for forest types with different spatial structures, such as needle- or broad-leaved forests. The findings of this study can provide a reference for feature selection in remote sensing forest type classification processes, and identifying forest types in this way could provide support for the accurate and sustainable management of forest resources.
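
As a rough illustration of the spatial features, the sketch below builds a gray-level co-occurrence matrix for one pixel offset and derives its contrast. The offset, the lack of symmetry and quantization, and the choice of contrast as the texture measure are assumptions made here; the abstract does not list which GLCM statistics were used.

```python
from collections import Counter

def glcm_contrast(image, dx=1, dy=0):
    """Contrast of the co-occurrence matrix for the pixel offset (dx, dy).

    Assumption: `image` is a small 2D list of integer gray levels, and pairs
    are counted in one direction only (non-symmetric matrix).
    """
    rows, cols = len(image), len(image[0])
    counts = Counter()
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[(image[r][c], image[r2][c2])] += 1
    total = sum(counts.values())
    # Contrast weights each co-occurrence probability by the squared level difference.
    return sum(n / total * (i - j) ** 2 for (i, j), n in counts.items())

patch = [[0, 0, 1],
         [1, 2, 2]]
contrast = glcm_contrast(patch)
flat_contrast = glcm_contrast([[3, 3], [3, 3]])
```

A uniform canopy yields a contrast near zero, while a patch with frequent gray-level changes between neighbors yields a higher value.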

Assessment of Annual Composite Images Obtained by Google Earth Engine for Urban Areas Mapping Using Random Forest

Remote Sensing ◽

10.3390/rs13040748 ◽

2021 ◽

Vol 13 (4) ◽

pp. 748

Author(s):

Zhaoming Zhang ◽

Mingyue Wei ◽

Dongchuan Pu ◽

Guojin He ◽

Guizhou Wang ◽

...

Keyword(s):

Time Series ◽

Random Forest ◽

Urban Areas ◽

Google Earth ◽

Landsat 8 ◽

Annual Time Series ◽

Google Earth Engine ◽

Time Series Images ◽

The Impact ◽

Annual Time

Urban areas represent the primary source region of greenhouse gas emissions. Mapping urban areas is essential for understanding land cover change, carbon cycles, and climate change (urban areas also refer to impervious surfaces, i.e., artificial cover and structures). Remote sensing has greatly advanced urban area mapping over the last several decades. At present, we have entered the era of big data. Long time series of satellite data such as Landsat and high-performance computing platforms such as Google Earth Engine (GEE) offer new opportunities to map urban areas. The objective of this research was to determine how annual time series images from Landsat 8 Operational Land Imager (OLI) can effectively be composited with the support of GEE to map urban areas in three cities in China. Three reducer functions provided by GEE, ee.Reducer.min(), ee.Reducer.median(), and ee.Reducer.max(), were selected to construct four schemes to synthesize the annual intensive time series Landsat 8 OLI data for the three cities. Then, urban areas were mapped based on the random forest algorithm and the accuracy was evaluated in detail. The results show that (1) the quality of annual composite images was improved significantly, particularly in reducing the impact of cloud and cloud shadows, and (2) the annual composite images obtained by the combination of multiple reducer functions had better performance than those obtained by a single reducer function. Further, the overall accuracy of urban area mapping with the combination of multiple reducer functions exceeded 90% in all three cities. In summary, a suitable combination of reducer functions for synthesizing annual time series images can enhance data quality and ensure differences between characteristics and higher precision for urban area mapping.
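
The effect of combining several reducers can be mimicked outside GEE with a small plain-Python sketch. It is not GEE code and does not reproduce the four schemes, which the abstract does not detail. It assumes that each reducer output becomes a separate band and that cloud-masked pixels are stored as `None`. The function name and pixel values are hypothetical.

```python
from statistics import median

def composite(stack, reducers):
    """Reduce a time stack of images pixel by pixel with each reducer.

    Assumption: `stack` is a list of images (one per date), each a flat list
    of pixel values in which None marks a cloud-masked observation.
    Returns one band (list of pixel values) per reducer.
    """
    n_pixels = len(stack[0])
    bands = []
    for reduce_fn in reducers:
        band = []
        for p in range(n_pixels):
            valid = [image[p] for image in stack if image[p] is not None]
            # A pixel that is masked on every date stays masked in the composite.
            band.append(reduce_fn(valid) if valid else None)
        bands.append(band)
    return bands

# Three dates, three pixels; the third pixel is always cloudy.
stack = [
    [0.25, 0.75, None],
    [0.5, None, None],
    [0.125, 0.25, None],
]
min_band, median_band, max_band = composite(stack, [min, median, max])
```

Stacking the minimum, median, and maximum bands keeps some of the within-year variation that a single median composite would discard.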

Multiple fault diagnosis for hydraulic systems using Nearest-centroid-with-DBA and Random-Forest-based-time-series-classification

2020 39th Chinese Control Conference (CCC) ◽

10.23919/ccc50068.2020.9189401 ◽

2020 ◽

Author(s):

Zhijie Peng ◽

Ke Zhang ◽

Yi Chai

Keyword(s):

Time Series ◽

Fault Diagnosis ◽

Random Forest ◽

Time Series Classification ◽

Hydraulic Systems ◽

Multiple Fault ◽

Multiple Fault Diagnosis

A Novel Time-Sensitive Composite Similarity Model for Multivariate Time-Series Correlation Analysis

Entropy ◽

10.3390/e23060731 ◽

2021 ◽

Vol 23 (6) ◽

pp. 731

Author(s):

Mengxia Liang ◽

Xiaolong Wang ◽

Shaocong Wu

Keyword(s):

Time Series ◽

Correlation Analysis ◽

Stock Price ◽

Multivariate Time Series ◽

Temporal Features ◽

Proposed Model ◽

Time Series Segmentation ◽

Similarity Model ◽

Dynamic Time ◽

Investment Portfolios

Finding the correlation between stocks is an effective method for screening and adjusting investment portfolios for investors. One single temporal feature or static nontemporal features are generally used in most studies to measure the similarity between stocks. However, these features are not sufficient to explore phenomena such as price fluctuations that are similar in shape but unequal in length, which may be caused by multiple temporal features. To study stock price volatility comprehensively, the correlation between stocks should be mined from the point of view of multiple features described as time series, including closing price, etc. In this paper, a time-sensitive composite similarity model designed for multivariate time-series correlation analysis based on dynamic time warping is proposed. First, a stock is chosen as the benchmark, and the multivariate time series are segmented by the peaks and troughs time-series segmentation (PTS) algorithm. Second, similar stocks are screened out by similarity. Finally, the rate of rising or falling together between stock pairs is used to verify the proposed model’s effectiveness. Compared with other models, the composite similarity model brings in multiple temporal features and is generalizable for numerical multivariate time series in different fields. The results show that the proposed model is very promising.
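
One plausible reading of the "rate of rising or falling together" used for verification is the share of periods in which two stocks move in the same direction. The exact definition in the paper is not given in the abstract, so the sketch below is an assumption, and the prices are invented.

```python
def co_movement_rate(prices_a, prices_b):
    """Fraction of consecutive periods in which both series rise or both fall.

    Assumption: both lists have the same length (at least 2), and a period in
    which either price is unchanged does not count as moving together.
    """
    moves_a = [later - earlier for earlier, later in zip(prices_a, prices_a[1:])]
    moves_b = [later - earlier for earlier, later in zip(prices_b, prices_b[1:])]
    # A positive product means the two changes share the same sign.
    together = sum(1 for x, y in zip(moves_a, moves_b) if x * y > 0)
    return together / len(moves_a)

stock_a = [10, 11, 12, 11, 13]
stock_b = [20, 22, 21, 20, 25]
rate = co_movement_rate(stock_a, stock_b)
```

A pair judged similar by the model would be expected to show a higher rate than a pair judged dissimilar.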

PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features

International Journal of Molecular Sciences ◽

10.3390/ijms22052704 ◽

2021 ◽

Vol 22 (5) ◽

pp. 2704

Author(s):

Andi Nur Nilamyani ◽

Firda Nurul Auliah ◽

Mohammad Ali Moni ◽

Watshara Shoombuatong ◽

Md Mehedi Hasan ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Web Application ◽

Computational Prediction ◽

Vital Role ◽

Machine Learning Algorithms ◽

Recursive Feature Elimination ◽

Post Translational Modification ◽

Multiple Sequence ◽

Sequence Features

Nitrotyrosine, which is generated by numerous reactive nitrogen species, is a type of protein post-translational modification. Identification of site-specific nitration modification on tyrosine is a prerequisite to understanding the molecular function of nitrated proteins. Thanks to the progress of machine learning, computational prediction can play a vital role before biological experimentation. Herein, we developed a computational predictor, PredNTS, by integrating multiple sequence features including K-mer, composition of k-spaced amino acid pairs (CKSAAP), AAindex, and binary encoding schemes. The important features were selected by the recursive feature elimination approach using a random forest classifier. Finally, we linearly combined the random forest (RF) probability scores generated by the different RF models, each of which employs a single encoding. The resultant PredNTS predictor achieved an area under the curve (AUC) of 0.910 using five-fold cross validation. It outperformed the existing predictors on a comprehensive and independent dataset. Furthermore, we investigated several machine learning algorithms to demonstrate the superiority of the employed RF algorithm. PredNTS is a useful computational resource for the prediction of nitrotyrosine sites. The web application with the curated datasets of PredNTS is publicly available.
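
The final combination step can be sketched as a weighted average of per-model probability scores, scored by AUC. The weights, scores, and labels below are invented; the abstract does not report the weights PredNTS uses, and equal weights are assumed here purely for illustration.

```python
def combine_scores(score_lists, weights):
    """Weighted linear combination of probability scores from several models.

    Assumption: `score_lists` holds one list of scores per model, all in the
    same sample order, and the weights are normalized by their sum.
    """
    total = sum(weights)
    return [
        sum(w * s for w, s in zip(weights, per_sample)) / total
        for per_sample in zip(*score_lists)
    ]

def auc(labels, scores):
    """Area under the ROC curve as the share of correctly ranked positive/negative pairs."""
    positives = [s for label, s in zip(labels, scores) if label == 1]
    negatives = [s for label, s in zip(labels, scores) if label == 0]
    # Ties between a positive and a negative score count as half a correct ranking.
    wins = sum((p > n) + 0.5 * (p == n) for p in positives for n in negatives)
    return wins / (len(positives) * len(negatives))

labels = [1, 1, 0, 0]
kmer_scores = [0.9, 0.4, 0.5, 0.1]     # hypothetical scores from one encoding
binary_scores = [0.7, 0.8, 0.2, 0.3]   # hypothetical scores from another encoding
combined = combine_scores([kmer_scores, binary_scores], weights=[1.0, 1.0])
auc_single = auc(labels, kmer_scores)
auc_combined = auc(labels, combined)
```

In this toy case, each single model misranks one pair that the other ranks correctly, so the combined score separates the classes better than either alone.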

Improving the Accuracy of Fractional Evergreen Forest Cover Estimation at Subpixel Scale in Cloudy and Rainy Areas by Harmonizing Landsat-8 and Sentinel-2 Time-Series Data

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ◽

10.1109/jstars.2021.3064580 ◽

2021 ◽

Vol 14 ◽

pp. 3373-3385

Author(s):

Taixia Wu ◽

Yuting Zhao ◽

Shudong Wang ◽

Hongjun Su ◽

Yingying Yang ◽

...

Keyword(s):

Time Series ◽

Forest Cover ◽

Time Series Data ◽

Evergreen Forest ◽

Series Data ◽

Landsat 8 ◽

Sentinel 2

Classifying Forest Type in the National Forest Inventory Context with Airborne Hyperspectral and Lidar Data

Remote Sensing ◽

10.3390/rs13101863 ◽

2021 ◽

Vol 13 (10) ◽

pp. 1863

Author(s):

Caileigh Shoot ◽

Hans-Erik Andersen ◽

L. Monika Moskal ◽

Chad Babcock ◽

Bruce D. Cook ◽

...

Keyword(s):

Random Forest ◽

Forest Inventory ◽

Forest Type ◽

Vegetation Indices ◽

National Forest Inventory ◽

Classification Algorithm ◽

Lidar Data ◽

National Forest ◽

Interior Alaska ◽

Type Information

Forest structure and composition regulate a range of ecosystem services, including biodiversity, water and nutrient cycling, and wood volume for resource extraction. Forest type is an important metric measured in the US Forest Service Forest Inventory and Analysis (FIA) program, the national forest inventory of the USA. Forest type information can be used to quantify carbon and other forest resources within specific domains to support ecological analysis and forest management decisions, such as managing for disease and pests. In this study, we developed a methodology that uses a combination of airborne hyperspectral and lidar data to map FIA-defined forest type between sparsely sampled FIA plot data collected in interior Alaska. To determine the best classification algorithm and remote sensing data for this task, five classification algorithms were tested with six different combinations of raw hyperspectral data, hyperspectral vegetation indices, and lidar-derived canopy and topography metrics. Models were trained using forest type information from 632 FIA subplots collected in interior Alaska. Of the thirty model and input combinations tested, the random forest classification algorithm with hyperspectral vegetation indices and lidar-derived topography and canopy height metrics had the highest accuracy (78% overall accuracy). This study supports random forest as a powerful classifier for natural resource data. It also demonstrates the benefits of combining structural (lidar) and spectral (imagery) data for forest type classification.
