Object-Oriented LULC Classification in Google Earth Engine Combining SNIC, GLCM, and Machine Learning Algorithms

Andrea Tassi; Marco Vizzari

doi:10.3390/rs12223776

Object-Oriented LULC Classification in Google Earth Engine Combining SNIC, GLCM, and Machine Learning Algorithms

Remote Sensing ◽

10.3390/rs12223776 ◽

2020 ◽

Vol 12 (22) ◽

pp. 3776

Author(s):

Andrea Tassi ◽

Marco Vizzari

Keyword(s):

Machine Learning ◽

Central Italy ◽

Object Oriented ◽

Google Earth ◽

Machine Learning Algorithms ◽

Support Vector ◽

Landsat 8 ◽

Good Reliability ◽

Google Earth Engine ◽

Occurrence Matrix

Google Earth Engine (GEE) is a versatile cloud platform in which pixel-based (PB) and object-oriented (OO) Land Use–Land Cover (LULC) classification approaches can be implemented, thanks to the availability of the many state-of-art functions comprising various Machine Learning (ML) algorithms. OO approaches, including both object segmentation and object textural analysis, are still not common in the GEE environment, probably due to the difficulties existing in concatenating the proper functions, and in tuning the various parameters to overcome the GEE computational limits. In this context, this work is aimed at developing and testing an OO classification approach combining the Simple Non-Iterative Clustering (SNIC) algorithm to identify spatial clusters, the Gray-Level Co-occurrence Matrix (GLCM) to calculate cluster textural indices, and two ML algorithms (Random Forest (RF) or Support Vector Machine (SVM)) to perform the final classification. A Principal Components Analysis (PCA) is applied to the main seven GLCM indices to synthesize in one band the textural information used for the OO classification. The proposed approach is implemented in a user-friendly, freely available GEE code useful to perform the OO classification, tuning various parameters (e.g., choose the input bands, select the classification algorithm, test various segmentation scales) and compare it with a PB approach. The accuracy of OO and PB classifications can be assessed both visually and through two confusion matrices that can be used to calculate the relevant statistics (producer’s, user’s, overall accuracy (OA)). The proposed methodology was broadly tested in a 154 km2 study area, located in the Lake Trasimeno area (central Italy), using Landsat 8 (L8), Sentinel 2 (S2), and PlanetScope (PS) data. The area was selected considering its complex LULC mosaic mainly composed of artificial surfaces, annual and permanent crops, small lakes, and wooded areas. In the study area, the various tests produced interesting results on the different datasets (OA: PB RF (L8 = 72.7%, S2 = 82%, PS = 74.2), PB SVM (L8 = 79.1%, S2 = 80.2%, PS = 74.8%), OO RF (L8 = 64%, S2 = 89.3%, PS = 77.9), OO SVM (L8 = 70.4, S2 = 86.9%, PS = 73.9)). The broad code application demonstrated very good reliability of the whole process, even though the OO classification process resulted, sometimes, too demanding on higher resolution data, considering the available computational GEE resources.

Download Full-text

Assessing the performance of machine learning algorithms for soil salinity mapping in Google Earth Engine Platform using Sentinel-2A and Landsat-8 OLI data

Advances in Space Research ◽

10.1016/j.asr.2021.10.024 ◽

2021 ◽

Author(s):

Samet Aksoy ◽

Aylin Yildirim ◽

Taha Gorji ◽

Nikou Hamzehpour ◽

Aysegul Tanik ◽

...

Keyword(s):

Machine Learning ◽

Soil Salinity ◽

Learning Algorithms ◽

Google Earth ◽

Machine Learning Algorithms ◽

Landsat 8 ◽

Landsat 8 Oli ◽

Google Earth Engine ◽

Sentinel 2A

Download Full-text

Mapping Regional Soil Organic Matter Based on Sentinel-2A and MODIS Imagery Using Machine Learning Algorithms and Google Earth Engine

Remote Sensing ◽

10.3390/rs13152934 ◽

2021 ◽

Vol 13 (15) ◽

pp. 2934

Author(s):

Meiwei Zhang ◽

Meinan Zhang ◽

Haoxuan Yang ◽

Yuanliang Jin ◽

Xinle Zhang ◽

...

Keyword(s):

Machine Learning ◽

Organic Matter ◽

Soil Organic Matter ◽

Learning Algorithms ◽

Google Earth ◽

Machine Learning Algorithms ◽

Support Vector ◽

Full Band ◽

Google Earth Engine ◽

Sentinel 2A

Many studies have attempted to predict soil organic matter (SOM), whereas mapping high-precision and high-resolution SOM maps remains a challenge due to the difficulty of selecting appropriate satellite data sources and prediction algorithms. This study aimed to investigate the influence of different remotely sensed images and machine learning algorithms on SOM prediction. We constructed two comparative experiments, i.e., full-band and common-band variable datasets of Sentinel-2A and MODIS images using Google Earth Engine (GEE). The predictive performances of random forest (RF), artificial neural network (ANN), and support vector regression (SVR) algorithms were evaluated, and the SOM map was generated for the Songnen Plain. Results showed that the model based on the full-band Sentinel-2A dataset achieved the best performance. The application of Sentinel-2A data resulted in mean relative improvements (RIs) of 7.67% and 5.87%, respectively. The RF achieved a lower root mean squared error (RMSE = 0.68%) and a higher coefficient of determination (R2 = 0.67) in all of the predicted scenarios than ANN and SVR. The resultant SOM map accurately characterized the SOM spatial distribution. Therefore, the Sentinel-2A data have obvious advantages over MODIS due to their higher spectral and spatial resolutions, and the combination of the RF algorithm and GEE is an effective approach to SOM mapping.

Download Full-text

Monitoring of Vegetation Cover Changes With Geomorphological Forms using Google Earth Engine in Kendari City

Jurnal Geografi Gea ◽

10.17509/gea.v21i2.37070 ◽

2021 ◽

Vol 21 (2) ◽

pp. 159-170

Author(s):

Septianto Aldiansyah ◽

Masita Dwi Mandini Mannesa ◽

Supriatna Supriatna

Keyword(s):

Machine Learning ◽

Human Activities ◽

Vegetation Cover ◽

Open Space ◽

Near Infrared ◽

Google Earth ◽

Machine Learning Algorithms ◽

Landsat 8 ◽

Vegetation Density ◽

Google Earth Engine

Vegetation cover plays an important role in controlling the view, boundaries, air temperature, living place and aesthetics in an area. Vegetation cover changes can be caused by changes in temperature, rainfall and human activities. Google Earth Engine (GEE) provides machine learning algorithms such as NDVI which are very useful in extracting vegetation density levels from imagery. The purpose of this study was to analyze vegetation cover changes by human activities in relation to the geomorphological form of Kendari City. The imagery used in multi-temporal monitoring are Landsat-7 ETM in 2000, Landsat-5 TM in 2005 and 2010 and Landsat 8 OLI in 2015 and 2020. Input machine learning using near infrared (NIR) and red (Red) for the NDVI Algorithm while the geomorphological form uses SRTM imagery. The classification of vegetation cover consists of water bodies, open field, built areas and roads covered with asphalt, paving or soil, plantations/agriculture, bushes, grass, reeds, green open space and forests. Each sub-district experienced a decrease in vegetation cover in the form of plantations/agriculture, bushes, grass, reeds, green open space except for the West Kendari District which tended to be varied. The forest area is getting better every year. The existence of protected forests and geomorphological forms such as lowlands are the driving factors for changes in vegetation cover, while low hills and high hills are flat to steep are contrainst factors. Machine learning in GEE is very helpful in monitoring vegetation cover changes and has an NDVI algorithm that is quite easy to apply.

Download Full-text

Analysis of Land Use and Land Cover Using Machine Learning Algorithms on Google Earth Engine for Munneru River Basin, India

Sustainability ◽

10.3390/su132413758 ◽

2021 ◽

Vol 13 (24) ◽

pp. 13758

Author(s):

Kotapati Narayana Loukika ◽

Venkata Reddy Keesara ◽

Venkataramana Sridhar

Keyword(s):

Machine Learning ◽

Land Use ◽

Land Cover ◽

Learning Algorithms ◽

Google Earth ◽

Machine Learning Algorithms ◽

Landsat 8 ◽

Lulc Change ◽

Google Earth Engine ◽

Sentinel 2

The growing human population accelerates alterations in land use and land cover (LULC) over time, putting tremendous strain on natural resources. Monitoring and assessing LULC change over large areas is critical in a variety of fields, including natural resource management and climate change research. LULC change has emerged as a critical concern for policymakers and environmentalists. As the need for the reliable estimation of LULC maps from remote sensing data grows, it is critical to comprehend how different machine learning classifiers perform. The primary goal of the present study was to classify LULC on the Google Earth Engine platform using three different machine learning algorithms—namely, support vector machine (SVM), random forest (RF), and classification and regression trees (CART)—and to compare their performance using accuracy assessments. The LULC of the study area was classified via supervised classification. For improved classification accuracy, NDVI (normalized difference vegetation index) and NDWI (normalized difference water index) indices were also derived and included. For the years 2016, 2018, and 2020, multitemporal Sentinel-2 and Landsat-8 data with spatial resolutions of 10 m and 30 m were used for the LULC classification. ‘Water bodies’, ‘forest’, ‘barren land’, ‘vegetation’, and ‘built-up’ were the major land use classes. The average overall accuracy of SVM, RF, and CART classifiers for Landsat-8 images was 90.88%, 94.85%, and 82.88%, respectively, and 93.8%, 95.8%, and 86.4% for Sentinel-2 images. These results indicate that RF classifiers outperform both SVM and CART classifiers in terms of accuracy.

Download Full-text

PIXEL BASED LANDSLIDE IDENTIFICATION USING LANDSAT 8 AND GEE

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b3-2021-721-2021 ◽

2021 ◽

Vol XLIII-B3-2021 ◽

pp. 721-726

Author(s):

P. Singh ◽

V. Maurya ◽

R. Dwivedi

Keyword(s):

Machine Learning ◽

Research Work ◽

Volcanic Eruptions ◽

Google Earth ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Geological Survey ◽

Support Vector ◽

Landsat 8 ◽

Multi Band

Abstract. Landslide is one of the most common natural disasters triggered mainly due to heavy rainfall, cloud burst, earthquake, volcanic eruptions, unorganized constructions of roads, and deforestation. In India, field surveying is the most common method used to identify potential landslide regions and update the landslide inventories maintained by the Geological Survey of India, but it is very time-consuming, costly, and inefficient. Alternatively, advanced remote sensing technologies in landslide analysis allow rapid and easy data acquisitions and help to improve the traditional method of landslide detection capabilities. Supervised Machine learning algorithms, for example, Support Vector Machine (SVM), are challenging to conventional techniques by predicting disasters with astounding accuracy. In this research work, we have utilized open-source datasets (Landsat 8 multi-band images and JAXA ALOS DSM) and Google Earth Engine (GEE) to identify landslides in Rudraprayag using machine learning techniques. Rudraprayag is a district of Uttarakhand state in India, which has always been the center of attention of geological studies due to its higher density of landslide-prone zones. For the training and validation purpose, labeled landslide locations obtained from landslide inventory (prepared by the Geological Survey of India) and layers such as NDVI, NDWI, and slope (generated from JAXA ALOS DSM and Landsat 8 satellite multi-band imagery) were used. The landslide identification has been performed using SVM, Classification and Regression Trees (CART), Minimum Distance, Random forest (RF), and Naïve Bayes techniques, in which SVM and RF outperformed all other techniques by achieving an 87.5% true positive rate (TPR).

Download Full-text

Pixel- vs. Object-Based Landsat 8 Data Classification in Google Earth Engine Using Random Forest: The Case Study of Maiella National Park

Remote Sensing ◽

10.3390/rs13122299 ◽

2021 ◽

Vol 13 (12) ◽

pp. 2299

Author(s):

Andrea Tassi ◽

Daniela Gigante ◽

Giuseppe Modica ◽

Luciano Di Martino ◽

Marco Vizzari

Keyword(s):

Land Cover ◽

National Park ◽

Time Window ◽

Central Italy ◽

Google Earth ◽

Data Cube ◽

Landsat 8 ◽

Change Analysis ◽

Object Based ◽

Google Earth Engine

With the general objective of producing a 2018–2020 Land Use/Land Cover (LULC) map of the Maiella National Park (central Italy), useful for a future long-term LULC change analysis, this research aimed to develop a Landsat 8 (L8) data composition and classification process using Google Earth Engine (GEE). In this process, we compared two pixel-based (PB) and two object-based (OB) approaches, assessing the advantages of integrating the textural information in the PB approach. Moreover, we tested the possibility of using the L8 panchromatic band to improve the segmentation step and the object’s textural analysis of the OB approach and produce a 15-m resolution LULC map. After selecting the best time window of the year to compose the base data cube, we applied a cloud-filtering and a topography-correction process on the 32 available L8 surface reflectance images. On this basis, we calculated five spectral indices, some of them on an interannual basis, to account for vegetation seasonality. We added an elevation, an aspect, a slope layer, and the 2018 CORINE Land Cover classification layer to improve the available information. We applied the Gray-Level Co-Occurrence Matrix (GLCM) algorithm to calculate the image’s textural information and, in the OB approaches, the Simple Non-Iterative Clustering (SNIC) algorithm for the image segmentation step. We performed an initial RF optimization process finding the optimal number of decision trees through out-of-bag error analysis. We randomly distributed 1200 ground truth points and used 70% to train the RF classifier and 30% for the validation phase. This subdivision was randomly and recursively redefined to evaluate the performance of the tested approaches more robustly. The OB approaches performed better than the PB ones when using the 15 m L8 panchromatic band, while the addition of textural information did not improve the PB approach. Using the panchromatic band within an OB approach, we produced a detailed, 15-m resolution LULC map of the study area.

Download Full-text

Exploratory Analysis of Driving Force of Wildfires in Australia: An Application of Machine Learning within Google Earth Engine

Remote Sensing ◽

10.3390/rs13010010 ◽

2020 ◽

Vol 13 (1) ◽

pp. 10

Author(s):

Andrea Sulova ◽

Jamal Jokar Arsanjani

Keyword(s):

Climate Change ◽

Machine Learning ◽

Random Forest ◽

Google Earth ◽

Summer Season ◽

Driving Factors ◽

Machine Learning Algorithms ◽

Classification And Regression Tree ◽

Training Dataset ◽

Google Earth Engine

Recent studies have suggested that due to climate change, the number of wildfires across the globe have been increasing and continue to grow even more. The recent massive wildfires, which hit Australia during the 2019–2020 summer season, raised questions to what extent the risk of wildfires can be linked to various climate, environmental, topographical, and social factors and how to predict fire occurrences to take preventive measures. Hence, the main objective of this study was to develop an automatized and cloud-based workflow for generating a training dataset of fire events at a continental level using freely available remote sensing data with a reasonable computational expense for injecting into machine learning models. As a result, a data-driven model was set up in Google Earth Engine platform, which is publicly accessible and open for further adjustments. The training dataset was applied to different machine learning algorithms, i.e., Random Forest, Naïve Bayes, and Classification and Regression Tree. The findings show that Random Forest outperformed other algorithms and hence it was used further to explore the driving factors using variable importance analysis. The study indicates the probability of fire occurrences across Australia as well as identifies the potential driving factors of Australian wildfires for the 2019–2020 summer season. The methodical approach and achieved results and drawn conclusions can be of great importance to policymakers, environmentalists, and climate change researchers, among others.

Download Full-text

Mapeamento da Vegetação Nativa do Cerrado na Região de Três Lagoas-MS com o Google Earth Engine

Revista Brasileira de Cartografia ◽

10.14393/rbcv71n3-47461 ◽

2019 ◽

Vol 71 (3) ◽

pp. 702-725

Author(s):

Nayara Vasconcelos Estrabis ◽

José Marcato Junior ◽

Hemerson Pistori

Keyword(s):

Support Vector Machine ◽

Random Forest ◽

Google Earth ◽

Support Vector ◽

Landsat 8 ◽

Landsat 8 Oli ◽

Google Earth Engine

O Cerrado é um dos biomas existentes no Brasil e o segundo mais extenso da América do Sul. Possui grande importância devido a sua biodiversidade, ecossistema e principalmente por servir como um reservatório, ou “esponja”, que distribui água para os demais biomas, além de ser berço de nascentes de algumas das maiores bacias da América do Sul. No entanto, devido às atividades antrópicas praticadas (com destaque para a pecuária e silvicultura) e a redução da vegetação nativa, este bioma está ameaçado. Considerado como hotspot em biodiversidade, o Cerrado pode não existir em 2050. Com a necessidade de sua preservação, o objetivo desse trabalho consistiu em investigar o uso de algoritmos de aprendizado de máquina para realizar o mapeamento da vegetação nativa existente na região do município de Três Lagoas, utilizando a plataforma em nuvem Google Earth Engine. O processo foi realizado com uma imagem Landsat-8 OLI, datada de 10 de outubro de 2018, e com os algoritmos Random Forest (RF) e Support Vector Machine (SVM). Na validação da classificação, o RF e o SVM apresentaram índices kappa iguais a 0,94 e 0,97, respectivamente. O RF, quando comparado ao SVM, apresentou classificação mais ruidosa. Por fim, verificou-se a existência de vegetação nativa de aproximadamente 2556 km² ao adotar o RF e 2873 km² ao adotar SVM.

Download Full-text

Latest Advances in Fractional Snow Cover Mapping on MODIS Data by Machine Learning Algorithms

10.5194/egusphere-egu2020-13193 ◽

2020 ◽

Author(s):

Semih Kuter ◽

Zuhal Akyurek

Keyword(s):

Machine Learning ◽

Snow Cover ◽

General Circulation ◽

Snow Water Equivalent ◽

Machine Learning Algorithms ◽

Multivariate Adaptive Regression Splines ◽

Support Vector ◽

Landsat 8 ◽

European Alps ◽

Fractional Snow Cover

<p>Spatial extent of snow has been declared as an essential climate variable. Accurate modeling of snow cover is crucial for the better prediction of snow water equivalent and, consequently, for the success of general circulation and weather forecasting models as well as climate change and hydrological studies. This presentation mainly focuses on the representation of the latest findings of our efforts in fractional snow cover mapping on MODIS images by data-driven machine learning methodologies. For this purpose, a dataset composed of 20 MODIS - Landsat 8 image pairs acquired between Apr 2013 and Dec 2016 over European Alps were employed. Artificial neural networks (ANN), multivariate adaptive regression splines (MARS), support vector regression (SVR) and random forest (RF) models were trained and tested by using reference FSC maps generated from higher spatial resolution Landsat 8 binary snow maps. ANN, MARS, SVR and RF models exhibited quite good performance with average R &#8776; 0.93, whereas the agreement between the reference FSC maps and the MODIS&#8217; own product MOD10A1 (C5) was slightly poorer with R &#8776; 0.88.</p>

Download Full-text

A COMPARISON OF MACHINE-LEARNING REGRESSION ALGORITHMS FOR THE ESTIMATION OF LAI USING LANDSAT - 8 SATELLITE DATA

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-4-w16-679-2019 ◽

2019 ◽

Vol XLII-4/W16 ◽

pp. 679-683

Author(s):

V. P. Yadav ◽

R. Prasad ◽

R. Bala ◽

A. K. Vishwakarma ◽

S. A. Yadav ◽

...

Keyword(s):

Machine Learning ◽

Satellite Data ◽

Vegetation Index ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Accurate Estimation ◽

Support Vector ◽

Landsat 8 ◽

Area Index ◽

Global Circulation Models

Abstract. The leaf area index (LAI) is one of key variable of crops which plays important role in agriculture, ecology and climate change for global circulation models to compute energy and water fluxes. In the recent research era, the machine-learning algorithms have provided accurate computational approaches for the estimation of crops biophysical parameters using remotely sensed data. The three machine-learning algorithms, random forest regression (RFR), support vector regression (SVR) and artificial neural network regression (ANNR) were used to estimate the LAI for crops in the present study. The three different dates of Landsat-8 satellite images were used during January 2017 – March 2017 at different crops growth conditions in Varanasi district, India. The sampling regions were fully covered by major Rabi season crops like wheat, barley and mustard etc. In total pooled data, 60% samples were taken for the training of the algorithms and rest 40% samples were taken as testing and validation of the machinelearning regressions algorithms. The highest sensitivity of normalized difference vegetation index (NDVI) with LAI was found using RFR algorithms (R2 = 0.884, RMSE = 0.404) as compared to SVR (R2 = 0.847, RMSE = 0.478) and ANNR (R2 = 0.829, RMSE = 0.404). Therefore, RFR algorithms can be used for accurate estimation of LAI for crops using satellite data.

Download Full-text