Distinguishing Planting Structures of Different Complexity from UAV Multispectral Images

Qian Ma; Wenting Han; Shenjin Huang; Shide Dong; Guang Li; Haipeng Chen

doi:10.3390/s21061994

Distinguishing Planting Structures of Different Complexity from UAV Multispectral Images

Sensors ◽

10.3390/s21061994 ◽

2021 ◽

Vol 21 (6) ◽

pp. 1994

Author(s):

Qian Ma ◽

Wenting Han ◽

Shenjin Huang ◽

Shide Dong ◽

Guang Li ◽

...

Keyword(s):

Remote Sensing ◽

Confusion Matrix ◽

Object Oriented ◽

Low Complexity ◽

Classification Model ◽

Recursive Feature Elimination ◽

Support Vector ◽

Multispectral Images ◽

High Complexity ◽

Structure Complexity

This study explores the classification potential of a multispectral classification model for farmland with planting structures of different complexity. Unmanned aerial vehicle (UAV) remote sensing technology is used to obtain multispectral images of three study areas with low-, medium-, and high-complexity planting structures, containing three, five, and eight types of crops, respectively. The feature subsets of three study areas are selected by recursive feature elimination (RFE). Object-oriented random forest (OB-RF) and object-oriented support vector machine (OB-SVM) classification models are established for the three study areas. After training the models with the feature subsets, the classification results are evaluated using a confusion matrix. The OB-RF and OB-SVM models’ classification accuracies are 97.09% and 99.13%, respectively, for the low-complexity planting structure. The equivalent values are 92.61% and 99.08% for the medium-complexity planting structure and 88.99% and 97.21% for the high-complexity planting structure. For farmland with fragmentary plots and a high-complexity planting structure, as the planting structure complexity changed from low to high, both models’ overall accuracy levels decreased. The overall accuracy of the OB-RF model decreased by 8.1%, and that of the OB-SVM model only decreased by 1.92%. OB-SVM achieves an overall classification accuracy of 97.21%, and a single-crop extraction accuracy of at least 85.65%. Therefore, UAV multispectral remote sensing can be used for classification applications in highly complex planting structures.

Download Full-text

Landslide Mapping Using Multiscale LiDAR Digital Elevation Models

Environmental and Engineering Geoscience ◽

10.2113/eeg-2268 ◽

2020 ◽

Vol 26 (4) ◽

pp. 405-425

Author(s):

Javed Miandad ◽

Margaret M. Darrow ◽

Michael D. Hendricks ◽

Ronald P. Daanen

Keyword(s):

Image Classification ◽

Vegetation Index ◽

Confusion Matrix ◽

Object Oriented ◽

Support Vector ◽

Classification Approach ◽

Resolution Image ◽

Kappa Value ◽

The Individual ◽

Segmentation Image

ABSTRACT This study presents a new methodology to identify landslide and landslide-susceptible locations in Interior Alaska using only geomorphic properties from light detection and ranging (LiDAR) derivatives (i.e., slope, profile curvature, and roughness) and the normalized difference vegetation index (NDVI), focusing on the effect of different resolutions of LiDAR images. We developed a semi-automated object-oriented image classification approach in ArcGIS 10.5 and prepared a landslide inventory from visual observation of hillshade images. The multistage work flow included combining derivatives from 1-, 2.5-, and 5-m-resolution LiDAR, image segmentation, image classification using a support vector machine classifier, and image generalization to clean false positives. We assessed classification accuracy by generating confusion matrix tables. Analysis of the results indicated that LiDAR image scale played an important role in the classification, and the use of NDVI generated better results. Overall, the LiDAR 5-m-resolution image with NDVI generated the best results with a kappa value of 0.55 and an overall accuracy of 83 percent. The LiDAR 1-m-resolution image with NDVI generated the highest producer accuracy of 73 percent in identifying landslide locations. We produced a combined overlay map by summing the individual classified maps that was able to delineate landslide objects better than the individual maps. The combined classified map from 1-, 2.5-, and 5-m-resolution LiDAR with NDVI generated producer accuracies of 60, 80, and 86 percent and user accuracies of 39, 51, and 98 percent for landslide, landslide-susceptible, and stable locations, respectively, with an overall accuracy of 84 percent and a kappa value of 0.58. This semi-automated object-oriented image classification approach demonstrated potential as a viable tool with further refinement and/or in combination with additional data sources.

Download Full-text

KLASIFIKASI TEKS SOSIAL MEDIA TWITTER MENGGUNAKAN SUPPORT VECTOR MACHINE (Studi Kasus Penusukan Wiranto)

Jurnal Informatika dan Rekayasa Elektronik ◽

10.36595/jire.v2i2.117 ◽

2019 ◽

Vol 2 (2) ◽

pp. 43

Author(s):

Lalu Mutawalli ◽

Mohammad Taufan Asri Zaen ◽

Wire Bagye

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Big Data ◽

Mass Communication ◽

Confusion Matrix ◽

Classification Model ◽

Support Vector ◽

Large Set ◽

Appearance Time ◽

Data Production

In the era of technological disruption of mass communication, social media became a reference in absorbing public opinion. The digitalization of data is very rapidly produced by social media users because it is an attempt to represent the feelings of the audience. Data production in question is the user posts the status and comments on social media. Data production by the public in social media raises a very large set of data or can be referred to as big data. Big data is a collection of data sets in very large numbers, complex, has a relatively fast appearance time, so that makes it difficult to handle. Analysis of big data with data mining methods to get knowledge patterns in it. This study analyzes the sentiments of netizens on Twitter social media on Mr. Wiranto stabbing case. The results of the sentiment analysis showed 41% gave positive comments, 29% commented neutrally, and 29% commented negatively on events. Besides, modeling of the data is carried out using a support vector machine algorithm to create a system capable of classifying positive, neutral, and negative connotations. The classification model that has been made is then tested using the confusion matrix technique with each result is a precision value of 83%, a recall value of 80%, and finally, as much as 80% obtained in testing the accuracy.

Download Full-text

Possibilities of milpa identification in Yucatan through remote sensing techniques and Sentinel-2 data

10.29007/hbs2 ◽

2019 ◽

Author(s):

Juan Carlos Valdiviezo-Navarro ◽

Adan Salazar-Garibay ◽

Karla Juliana Rodríguez-Robayo ◽

Lilián Juárez ◽

María Elena Méndez-López ◽

...

Keyword(s):

Remote Sensing ◽

Urban Areas ◽

Food Sovereignty ◽

Region Of Interest ◽

Remote Sensing Data ◽

Support Vector ◽

Multispectral Images ◽

Crop Fields ◽

Remote Sensing Techniques ◽

Sentinel 2

Maya milpa is one of the most important agrifood systems in Mesoamerica, not only because its ancient origin but also due to lead an increase in landscape diversity and to be a relevant source of families food security and food sovereignty. Nowadays, satellite remote sensing data, as the multispectral images of Sentinel-2 platforms, permit us the monitor- ing of different kinds of structures such as water bodies, urban areas, and particularly agricultural fields. Through its multispectral signatures, mono-crop fields or homogeneous vegetation zones like corn fields, barley fields, or other ones, have been successfully detected by using classification techniques with multispectral images. However, Maya milpa is a complex field which is conformed by different kinds of vegetables species and fragments of natural vegetation that in conjunction cannot be considered as a mono-crop field. In this work, we show some preliminary studies on the availability of monitoring this complex system in a region of interest in Yucatan, through a support vector machine (SVM) approach.

Download Full-text

Multi-Class Taxonomy of Well Integrity Anomalies Applying Inductive Learning Algorithms: Analytical Approach for Artificial-Lift Wells

10.2118/206129-ms ◽

2021 ◽

Author(s):

Mostafa Sa'eed Yakoot ◽

Adel Mohamed Salem Ragab ◽

Omar Mahmoud

Keyword(s):

Decision Tree ◽

Confusion Matrix ◽

Learning Algorithms ◽

Oil And Gas Industry ◽

Classification Model ◽

Gradient Boosting ◽

Support Vector ◽

Risk Category ◽

Well Integrity ◽

Extreme Gradient Boosting

Abstract Well integrity has become a crucial field with increased focus and being published intensively in industry researches. It is important to maintain the integrity of the individual well to ensure that wells operate as expected for their designated life (or higher) with all risks kept as low as reasonably practicable, or as specified. Machine learning (ML) and artificial intelligence (AI) models are used intensively in oil and gas industry nowadays. ML concept is based on powerful algorithms and robust database. Developing an efficient classification model for well integrity (WI) anomalies is now feasible because of having enormous number of well failures and well barrier integrity tests, and analyses in the database. Circa 9000 dataset points were collected from WI tests performed for 800 wells in Gulf of Suez, Egypt for almost 10 years. Moreover, those data have been quality-controlled and quality-assured by experienced engineers. The data contain different forms of WI failures. The contributing parameter set includes a total of 23 barrier elements. Data were structured and fed into 11 different ML algorithms to build an automated systematic tool for calculating imposed risk category of any well. Comparison analysis for the deployed models was performed to infer the best predictive model that can be relied on. 11 models include both supervised and ensemble learning algorithms such as random forest, support vector machine (SVM), decision tree and scalable boosting techniques. Out of 11 models, the results showed that extreme gradient boosting (XGB), categorical boosting (CatBoost), and decision tree are the most reliable algorithms. Moreover, novel evaluation metrics for confusion matrix of each model have been introduced to overcome the problem of existing metrics which don't consider domain knowledge during model evaluation. The innovated model will help to utilize company resources efficiently and dedicate personnel efforts to wells with the high-risk. As a result, progressive improvements on business, safety, environment, and performance of the business. This paper would be a milestone in the design and creation of the Well Integrity Database Management Program through the combination of integrity and ML.

Download Full-text

Defect Prediction for Object Oriented Software using Support Vector based Fuzzy Classification Model

International Journal of Computer Applications ◽

10.5120/9766-3114 ◽

2012 ◽

Vol 60 (15) ◽

pp. 8-16 ◽

Cited By ~ 2

Author(s):

Bharavi Mishraand K.K.Shukla

Keyword(s):

Object Oriented ◽

Fuzzy Classification ◽

Classification Model ◽

Defect Prediction ◽

Support Vector

Download Full-text

ENHANCED CLASSIFICATION MODEL FOR MULTISPECTRAL OBSERVATIONS FROM THE EARTH

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-4-221-2018 ◽

2018 ◽

Vol IV-4 ◽

pp. 221-226

Author(s):

I. E. Villalon-Turrubiates ◽

M. J. Llovera-Torres

Keyword(s):

Remote Sensing ◽

Decision Model ◽

Accurate Result ◽

Classification Model ◽

Multispectral Images ◽

Performance Study ◽

Spectral Bands ◽

The Earth ◽

Multispectral Remote Sensing ◽

Decision Making Rule

<p><strong>Abstract.</strong> The image classification procedure to identify remote sensing signatures from a particular geographical region can be performed with an identification model that has the ability to use large datasets to reach an accurate result. This novel methodology is referred to as the Statistical Enhanced Classification algorithm, which has been developed to employ multispectral images based in the statistical supervised learning theory and can be used for applications in environmental monitoring and analysis. This paper presents the performance study of the proposed methodology using both, multispectral synthetic images and multispectral remote sensing images. The obtained results are accurate due to the use of several spectral bands, the use of statistics such as mean and standard deviation for the training classes and for the pixel neighborhood, which provides more robust information, and the decision-making rule that has the ability to decide if the pixel is not belonging to a predefined class, which leads to an accurate decision model.</p>

Download Full-text

Predictive Model and Feature Importance for Early Detection of Type II Diabetes Mellitus

10.21203/rs.3.rs-172421/v1 ◽

2021 ◽

Author(s):

Eric Adua ◽

Emmanuel Awuni Kolog ◽

Ebenezer Afrifa-Yamoah ◽

Bright Amankwah ◽

Christian Obirikorang ◽

...

Keyword(s):

Predictive Model ◽

Test Data ◽

Type Ii Diabetes ◽

Local Community ◽

Confusion Matrix ◽

Recursive Feature Elimination ◽

Support Vector ◽

Svm Classifier ◽

Type Ii ◽

Operating Characteristics

Abstract BackgroundAccurate prediction and early recognition of type II diabetes (T2DM) will lead to timely and meaningful interventions, while preventing T2DM associated complications. In this context, machine learning (ML) is promising, as it can transform vast amount of T2DM data into clinically relevant information. This study compares multiple ML techniques for predictive modelling based on different T2DM associated variables in an African population, Ghana. MethodsThe study involves 219 T2DM patients and 219 healthy individuals who were recruited from the hospital and the local community, respectively. Anthropometric and biochemical information including glycated haemoglobin (HbA1c), body mass index (BMI), blood pressure, fasting blood sugar (FBS), serum lipids [(total cholesterol (TC), triglycerides (TG), high and low-density lipoprotein cholesterol (HDL-c and LDL-c)] were collected. From this data, four ML classification algorithms including Naïve-Bayes (NB), K-Nearest Neighbor (KNN), Support Vector Machines (SVM) and Decision Tree (DT) were used to predict T2DM. Precision, Recall, F1-Scores, Receiver Operating Characteristics (ROC) scores and the confusion matrix were computed to determine the performance of the various algorithms while the importance of the feature attributes was determined by recursive feature elimination technique.ResultsAll the classifiers performed beyond the acceptable threshold of 70% for the Precision, Recall, F-score and Accuracy. After building the predictive model, 82% of diabetic test data was detected by the NB classifier, of which 93% were accurately predicted. The SVM classifier was the second-best performing classifier which yielded an overall accuracy of 84%. The non-T2DM test data yielded an accurate prediction score of 75% from the 98% of the proportion of the non-T2DM test data. KNN and DT yielded accuracies of 83% and 81%, respectively. NB has the best performance (AUC=0.87) followed by SVM (AUC= 0.84), KNN (AUC= 0.85) and DT (AUC= 0.81). The best three feature attributes, in order of importance, are HbA1c, TC and BMI whereas the least three importance of the features are Age, HDL-c and LDL-c.ConclusionBased on the predictive performance and high accuracy, the study has shown the potential of ML as a robust forecasting tool for T2DM. Our results can be a benchmark for guiding policy decisions in T2DM surveillance in resource and medical expertise limited countries such as Ghana.

Download Full-text

Predictive Model and Feature Importance for Early Detection of Type II Diabetes Mellitus

10.21203/rs.3.rs-150169/v1 ◽

2021 ◽

Author(s):

Eric Adua ◽

Emmanuel Awuni Kolog ◽

Ebenezer Afrifa-Yamoah ◽

Bright Amankwah ◽

Christian Obirikorang ◽

...

Keyword(s):

Predictive Model ◽

Test Data ◽

Type Ii Diabetes ◽

Confusion Matrix ◽

Accurate Prediction ◽

Recursive Feature Elimination ◽

Support Vector ◽

Svm Classifier ◽

Type Ii ◽

Operating Characteristics

Abstract Background Accurate prediction and early recognition of type II diabetes (T2DM) will lead to timely and meaningful interventions, while preventing T2DM associated complications. In this context, machine learning (ML) is promising, as it can transform vast amount of T2DM data into clinically relevant information. This study compares multiple ML techniques for predictive modelling based on different T2DM associated variables in an African population, Ghana. Methods The study involves 219 T2DM patients and 219 healthy individuals who were recruited from the hospital and the local community, respectively. Anthropometric and biochemical information including glycated haemoglobin (HbA1c), body mass index (BMI), blood pressure, fasting blood sugar (FBS), serum lipids [(total cholesterol (TC), triglycerides (TG), high and low-density lipoprotein cholesterol (HDL-c and LDL-c)] were collected. From this data, four ML classification algorithms including Naïve-Bayes (NB), K-Nearest Neighbor (KNN), Support Vector Machines (SVM) and Decision Tree (DT) were used to predict T2DM. Precision, Recall, F1-Scores, Receiver Operating Characteristics (ROC) scores and the confusion matrix were computed to determine the performance of the various algorithms while the importance of the feature attributes was determined by recursive feature elimination technique. Results All the classifiers performed beyond the acceptable threshold of 70% for the Precision, Recall, F-score and Accuracy. After building the predictive model, 82% of diabetic test data was detected by the NB classifier, of which 93% were accurately predicted. The SVM classifier was the second-best performing classifier which yielded an overall accuracy of 84%. The non-T2DM test data yielded an accurate prediction score of 75% from the 98% of the proportion of the non-T2DM test data. KNN and DT yielded accuracies of 83% and 81%, respectively. NB has the best performance (AUC = 0.87) followed by SVM (AUC = 0.84), KNN (AUC = 0.85) and DT (AUC = 0.81). The best three feature attributes, in order of importance, are HbA1c, TC and BMI whereas the least three importance of the features are Age, HDL-c and LDL-c. Conclusion Based on the predictive performance and high accuracy, the study has shown the potential of ML as a robust forecasting tool for T2DM. Our results can be a benchmark for guiding policy decisions in T2DM surveillance in resource and medical expertise limited countries such as Ghana.

Download Full-text

Classification model for accuracy and intrusion detection using machine learning approach

PeerJ Computer Science ◽

10.7717/peerj-cs.437 ◽

2021 ◽

Vol 7 ◽

pp. e437

Author(s):

Arushi Agarwal ◽

Purushottam Sharma ◽

Mohammed Alshehri ◽

Ahmed A. Mohamed ◽

Osama Alfarraj

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Nearest Neighbor ◽

Performance Metrics ◽

Detection System ◽

Confusion Matrix ◽

Machine Learning Algorithms ◽

Classification Model ◽

Support Vector ◽

K Nearest Neighbor

In today’s cyber world, the demand for the internet is increasing day by day, increasing the concern of network security. The aim of an Intrusion Detection System (IDS) is to provide approaches against many fast-growing network attacks (e.g., DDoS attack, Ransomware attack, Botnet attack, etc.), as it blocks the harmful activities occurring in the network system. In this work, three different classification machine learning algorithms—Naïve Bayes (NB), Support Vector Machine (SVM), and K-nearest neighbor (KNN)—were used to detect the accuracy and reducing the processing time of an algorithm on the UNSW-NB15 dataset and to find the best-suited algorithm which can efficiently learn the pattern of the suspicious network activities. The data gathered from the feature set comparison was then applied as input to IDS as data feeds to train the system for future intrusion behavior prediction and analysis using the best-fit algorithm chosen from the above three algorithms based on the performance metrics found. Also, the classification reports (Precision, Recall, and F1-score) and confusion matrix were generated and compared to finalize the support-validation status found throughout the testing phase of the model used in this approach.

Download Full-text

Object-oriented remote sensing imagery classification accuracy assessment based on confusion matrix

2012 20th International Conference on Geoinformatics ◽

10.1109/geoinformatics.2012.6270271 ◽

2012 ◽

Cited By ~ 3

Author(s):

Lina Yi ◽

Guifeng Zhang

Keyword(s):

Remote Sensing ◽

Classification Accuracy ◽

Accuracy Assessment ◽

Confusion Matrix ◽

Object Oriented ◽

Remote Sensing Imagery

Download Full-text