The Classification Performance and Mechanism of Machine Learning Algorithms in Winter Wheat Mapping Using Sentinel-2 10 m Resolution Imagery

Peng Fang; Xiwang Zhang; Panpan Wei; Yuanzheng Wang; Huiyi Zhang; Feng Liu; Jun Zhao

doi:10.3390/app10155075

The Classification Performance and Mechanism of Machine Learning Algorithms in Winter Wheat Mapping Using Sentinel-2 10 m Resolution Imagery

Applied Sciences ◽

10.3390/app10155075 ◽

2020 ◽

Vol 10 (15) ◽

pp. 5075 ◽

Cited By ~ 1

Author(s):

Peng Fang ◽

Xiwang Zhang ◽

Panpan Wei ◽

Yuanzheng Wang ◽

Huiyi Zhang ◽

...

Keyword(s):

Machine Learning ◽

Winter Wheat ◽

Learning Algorithms ◽

Classification Performance ◽

Machine Learning Algorithms ◽

Accuracy Score ◽

Grid Search ◽

Maximum Variation ◽

Performance And Mechanism ◽

Sentinel 2

Machine learning algorithms are crucial for crop identification and mapping. However, many works only focus on the identification results of these algorithms, but pay less attention to their classification performance and mechanism. In this paper, based on Google Earth Engine (GEE), Sentinel-2 10 m resolution images during a specific phenological period of winter wheat were obtained. Then, support vector machine (SVM), random forest (RF), and classification and regression tree (CART) machine learning algorithms were employed to identify and map winter wheat in a large-scale area. The hyperparameters of the three machine learning algorithms were tuned by grid search and the 5-fold cross-validation method. The classification performance of the three machine learning algorithms were compared, the results of which demonstrate that SVM achieves best performance in identifying winter wheat, and its overall accuracy (OA), user’s accuracy (UA), producer’s accuracy (PA), and kappa coefficient (Kappa) are 0.94, 0.95, 0.95, and 0.92, respectively. Moreover, 50 various combinations of training and validation sets were used to analyze the generalization ability of the algorithms, and the results show that the average OA of SVM, RF, and CART are 0.93, 0.92, and 0.88, respectively, thus indicating that SVM and RF are more robust than CART. To further explore the sensitivity of SVM, RF, and CART to variations of the algorithm parameters—namely, (C and gamma), (tree and split), and (maxD and minSP)—we employed the grid search method to iterate these parameters, respectively, and to analyze the effect of these parameters on the accuracy scores and classification residuals. It was found that with the change of (C and gamma) in (0.01~1000), SVM’s maximum variation of accuracy score is up to 0.63, and the maximum variation of residuals is 76,215 km2. We concluded that SVM is sensitive to the parameters (C and gamma) and presents a positive correlation. When the parameters (tree and split) change between (100~600) and (1~6), respectively, the RF’s maximum variation of accuracy score is 0.08, and the maximum variation of residuals is 1157 km2, indicating that RF is low in sensitivity toward the parameters (tree and split). When the parameters (maxD and minSP) are between (10~60), the maximum accuracy change value is 0.06, and the maximum variation of residuals is 6943 km2. Therefore, compared to RF, CART is sensitive to the parameters (maxD and minSP) and has poor robustness. In general, under the conditions of the hyperparameters, SVM and RF exhibit optimal classification performance, while CART has relatively inferior performance. Meanwhile, SVM, RF, and CART have different sensitivities toward the algorithm parameters; that is, SVM and CART are more sensitive to the algorithm parameters, while RF has low sensitivity toward changes in the algorithm parameters. The different parameters cause great changes in the accuracy scores and residuals, so it is necessary to determine the algorithm hyperparameters. Generally, default parameters can be used to achieve crop classification, but we recommend the enumeration method, similar to grid search, as a practical way to improve the classification performance of the algorithm if the best classification effect is expected.

Download Full-text

Land cover and land use classification performance of machine learning algorithms in a boreal landscape using Sentinel-2 data

GIScience & Remote Sensing ◽

10.1080/15481603.2019.1650447 ◽

2019 ◽

Vol 57 (1) ◽

pp. 1-20 ◽

Cited By ~ 30

Author(s):

Abdulhakim Mohamed Abdi

Keyword(s):

Machine Learning ◽

Land Use ◽

Land Cover ◽

Learning Algorithms ◽

Classification Performance ◽

Machine Learning Algorithms ◽

Land Use Classification ◽

Boreal Landscape ◽

Sentinel 2

Download Full-text

Digital mapping of soil salinization based on Sentinel-1 and Sentinel-2 data combined with machine learning algorithms

Regional Sustainability ◽

10.1016/j.regsus.2021.06.001 ◽

2021 ◽

Vol 2 (2) ◽

pp. 177-188

Author(s):

Guolin Ma ◽

Jianli Ding ◽

Lijng Han ◽

Zipeng Zhang ◽

Si Ran

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Soil Salinization ◽

Machine Learning Algorithms ◽

Digital Mapping ◽

Sentinel 2

Download Full-text

Exploitation of time series Sentinel-2 data and different machine learning algorithms for detailed tree species classification

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ◽

10.1109/jstars.2021.3098817 ◽

2021 ◽

pp. 1-1

Author(s):

Yanbiao Xi ◽

Chunying Ren ◽

Qingjiu Tian ◽

Yongxing Ren ◽

Xinyu Dong ◽

...

Keyword(s):

Machine Learning ◽

Time Series ◽

Tree Species ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Species Classification ◽

Tree Species Classification ◽

Sentinel 2

Download Full-text

DDoS Detection on Internet of Things using Unsupervised Algorithms

E3S Web of Conferences ◽

10.1051/e3sconf/202129701005 ◽

2021 ◽

Vol 297 ◽

pp. 01005

Author(s):

Hailyie Tekleselassie

Keyword(s):

Learning Algorithms ◽

Classification Performance ◽

Machine Learning Algorithms ◽

Transport Layer ◽

Accuracy Score ◽

Ddos Attacks ◽

Knowledge Based ◽

Ddos Detection ◽

Iot Devices ◽

Unsupervised Algorithms

Through the growth of the fifth-generation networks and artificial intelligence technologies, new threats and challenges have appeared to wireless communication system, especially in cybersecurity. And IoT networks are gradually attractive stages for introduction of DDoS attacks due to integral frailer security and resource-constrained nature of IoT devices. This paper emphases on detecting DDoS attack in wireless networks by categorizing inward network packets on the transport layer as either “abnormal” or “normal” using the integration of machine learning algorithms knowledge-based system. In this paper, deep learning algorithms and CNN were autonomously trained for mitigating DDoS attacks. This paper lays importance on misuse based DDOS attacks which comprise TCP SYN-Flood and ICMP flood. The researcher uses CICIDS2017 and NSL-KDD dataset in training and testing the algorithms (model) while the experimentation phase. accuracy score is used to measure the classification performance of the four algorithms. the results display that the 99.93 performance is recorded.

Download Full-text

Comparison of Machine Learning Algorithms for Wildland-Urban Interface Fuelbreak Planning Integrating ALS and UAV-Borne LiDAR Data and Multispectral Images

Drones ◽

10.3390/drones4020021 ◽

2020 ◽

Vol 4 (2) ◽

pp. 21 ◽

Cited By ~ 1

Author(s):

Francisco Rodríguez-Puerta ◽

Rafael Alonso Ponce ◽

Fernando Pérez-Rodríguez ◽

Beatriz Águeda ◽

Saray Martín-García ◽

...

Keyword(s):

Machine Learning ◽

Remote Sensing ◽

Random Forest ◽

Learning Algorithms ◽

Remote Sensing Data ◽

Machine Learning Algorithms ◽

Data Sources ◽

Lidar Data ◽

Sensing Data ◽

Sentinel 2

Controlling vegetation fuels around human settlements is a crucial strategy for reducing fire severity in forests, buildings and infrastructure, as well as protecting human lives. Each country has its own regulations in this respect, but they all have in common that by reducing fuel load, we in turn reduce the intensity and severity of the fire. The use of Unmanned Aerial Vehicles (UAV)-acquired data combined with other passive and active remote sensing data has the greatest performance to planning Wildland-Urban Interface (WUI) fuelbreak through machine learning algorithms. Nine remote sensing data sources (active and passive) and four supervised classification algorithms (Random Forest, Linear and Radial Support Vector Machine and Artificial Neural Networks) were tested to classify five fuel-area types. We used very high-density Light Detection and Ranging (LiDAR) data acquired by UAV (154 returns·m−2 and ortho-mosaic of 5-cm pixel), multispectral data from the satellites Pleiades-1B and Sentinel-2, and low-density LiDAR data acquired by Airborne Laser Scanning (ALS) (0.5 returns·m−2, ortho-mosaic of 25 cm pixels). Through the Variable Selection Using Random Forest (VSURF) procedure, a pre-selection of final variables was carried out to train the model. The four algorithms were compared, and it was concluded that the differences among them in overall accuracy (OA) on training datasets were negligible. Although the highest accuracy in the training step was obtained in SVML (OA=94.46%) and in testing in ANN (OA=91.91%), Random Forest was considered to be the most reliable algorithm, since it produced more consistent predictions due to the smaller differences between training and testing performance. Using a combination of Sentinel-2 and the two LiDAR data (UAV and ALS), Random Forest obtained an OA of 90.66% in training and of 91.80% in testing datasets. The differences in accuracy between the data sources used are much greater than between algorithms. LiDAR growth metrics calculated using point clouds in different dates and multispectral information from different seasons of the year are the most important variables in the classification. Our results support the essential role of UAVs in fuelbreak planning and management and thus, in the prevention of forest fires.

Download Full-text

A Machine Learning Approach for Mapping Forest Vegetation in Riparian Zones in an Atlantic Biome Environment Using Sentinel-2 Imagery

Remote Sensing ◽

10.3390/rs12244086 ◽

2020 ◽

Vol 12 (24) ◽

pp. 4086

Author(s):

Danielle Elis Garcia Furuya ◽

João Alex Floriano Aguiar ◽

Nayara V. Estrabis ◽

Mayara Maezano Faita Pinheiro ◽

Michelle Taís Garcia Furuya ◽

...

Keyword(s):

Machine Learning ◽

Environmental Planning ◽

Riparian Zone ◽

Learning Algorithms ◽

Vegetation Mapping ◽

Forest Vegetation ◽

Machine Learning Algorithms ◽

Riparian Zones ◽

Support Vector ◽

Sentinel 2

Riparian zones consist of important environmental regions, specifically to maintain the quality of water resources. Accurately mapping forest vegetation in riparian zones is an important issue, since it may provide information about numerous surface processes that occur in these areas. Recently, machine learning algorithms have gained attention as an innovative approach to extract information from remote sensing imagery, including to support the mapping task of vegetation areas. Nonetheless, studies related to machine learning application for forest vegetation mapping in the riparian zones exclusively is still limited. Therefore, this paper presents a framework for forest vegetation mapping in riparian zones based on machine learning models using orbital multispectral images. A total of 14 Sentinel-2 images registered throughout the year, covering a large riparian zone of a portion of a wide river in the Pontal do Paranapanema region, São Paulo state, Brazil, was adopted as the dataset. This area is mainly composed of the Atlantic Biome vegetation, and it is near to the last primary fragment of its biome, being an important region from the environmental planning point of view. We compared the performance of multiple machine learning algorithms like decision tree (DT), random forest (RF), support vector machine (SVM), and normal Bayes (NB). We evaluated different dates and locations with all models. Our results demonstrated that the DT learner has, overall, the highest accuracy in this task. The DT algorithm also showed high accuracy when applied on different dates and in the riparian zone of another river. We conclude that the proposed approach is appropriated to accurately map forest vegetation in riparian zones, including temporal context.

Download Full-text

A Comparison of Hybrid Machine Learning Algorithms for the Retrieval of Wheat Biophysical Variables from Sentinel-2

Remote Sensing ◽

10.3390/rs11050481 ◽

2019 ◽

Vol 11 (5) ◽

pp. 481 ◽

Cited By ~ 26

Author(s):

Deepak Upreti ◽

Wenjiang Huang ◽

Weiping Kong ◽

Simone Pascucci ◽

Stefano Pignatti ◽

...

Keyword(s):

Machine Learning ◽

Least Squares ◽

Chlorophyll Content ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Area Index ◽

Random Forest Tree ◽

Ground Validation ◽

Study Sites ◽

Sentinel 2

This study focuses on the comparison of hybrid methods of estimation of biophysical variables such as leaf area index (LAI), leaf chlorophyll content (LCC), fraction of absorbed photosynthetically active radiation (FAPAR), fraction of vegetation cover (FVC), and canopy chlorophyll content (CCC) from Sentinel-2 satellite data. Different machine learning algorithms were trained with simulated spectra generated by the physically-based radiative transfer model PROSAIL and subsequently applied to Sentinel-2 reflectance spectra. The algorithms were assessed against a standard operational approach, i.e., the European Space Agency (ESA) Sentinel Application Platform (SNAP) toolbox, based on neural networks. Since kernel-based algorithms have a heavy computational cost when trained with large datasets, an active learning (AL) strategy was explored to try to alleviate this issue. Validation was carried out using ground data from two study sites: one in Shunyi (China) and the other in Maccarese (Italy). In general, the performance of the algorithms was consistent for the two study sites, though a different level of accuracy was found between the two sites, possibly due to slightly different ground sampling protocols and the range and variability of the values of the biophysical variables in the two ground datasets. For LAI estimation, the best ground validation results were obtained for both sites using least squares linear regression (LSLR) and partial least squares regression, with the best performances values of R2 of 0.78, rott mean squared error (RMSE) of 0.68 m2 m−2 and a relative RMSE (RRMSE) of 19.48% obtained in the Maccarese site with LSLR. The best results for LCC were obtained using Random Forest Tree Bagger (RFTB) and Bagging Trees (BagT) with the best performances obtained in Maccarese using RFTB (R2 = 0.26, RMSE = 8.88 μg cm−2, RRMSE = 17.43%). Gaussian Process Regression (GPR) was the best algorithm for all variables only in the cross-validation phase, but not in the ground validation, where it ranked as the best only for FVC in Maccarese (R2 = 0.90, RMSE = 0.08, RRMSE = 9.86%). It was found that the AL strategy was more efficient than the random selection of samples for training the GPR algorithm.

Download Full-text

High-resolution digital mapping of soil organic carbon and soil total nitrogen using DEM derivatives, Sentinel-1 and Sentinel-2 data based on machine learning algorithms

The Science of The Total Environment ◽

10.1016/j.scitotenv.2020.138244 ◽

2020 ◽

Vol 729 ◽

pp. 138244 ◽

Cited By ~ 2

Author(s):

Tao Zhou ◽

Yajun Geng ◽

Jie Chen ◽

Jianjun Pan ◽

Dagmar Haase ◽

...

Keyword(s):

Machine Learning ◽

High Resolution ◽

Organic Carbon ◽

Soil Organic Carbon ◽

Total Nitrogen ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Soil Total Nitrogen ◽

Digital Mapping ◽

Sentinel 2

Download Full-text

Mapping spatial variability of foliar nitrogen and carbon in Indian tropical moist deciduous sal (Shorea robusta) forest using machine learning algorithms and Sentinel-2 data

International Journal of Remote Sensing ◽

10.1080/01431161.2020.1823043 ◽

2020 ◽

Vol 42 (3) ◽

pp. 1139-1159

Author(s):

Vaishali Vasudeva ◽

Subrata Nandy ◽

Hitendra Padalia ◽

Ritika Srinet ◽

Prakash Chauhan

Keyword(s):

Machine Learning ◽

Spatial Variability ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Foliar Nitrogen ◽

Shorea Robusta ◽

Sentinel 2

Download Full-text

Remote Sensing of Turbidity for Lakes in Northeast China Using Sentinel-2 Images with Machine Learning Algorithms

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ◽

10.1109/jstars.2021.3109292 ◽

2021 ◽

pp. 1-1

Author(s):

Yue Ma ◽

Kaishan Song ◽

Zhidan Wen ◽

Ge Liu ◽

Yingxin Shang ◽

...

Keyword(s):

Machine Learning ◽

Remote Sensing ◽

Northeast China ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Sentinel 2

Download Full-text