scholarly journals Distinguishing Planting Structures of Different Complexity from UAV Multispectral Images

Sensors ◽  
2021 ◽  
Vol 21 (6) ◽  
pp. 1994
Author(s):  
Qian Ma ◽  
Wenting Han ◽  
Shenjin Huang ◽  
Shide Dong ◽  
Guang Li ◽  
...  

This study explores the classification potential of a multispectral classification model for farmland with planting structures of different complexity. Unmanned aerial vehicle (UAV) remote sensing technology is used to obtain multispectral images of three study areas with low-, medium-, and high-complexity planting structures, containing three, five, and eight types of crops, respectively. The feature subsets of three study areas are selected by recursive feature elimination (RFE). Object-oriented random forest (OB-RF) and object-oriented support vector machine (OB-SVM) classification models are established for the three study areas. After training the models with the feature subsets, the classification results are evaluated using a confusion matrix. The OB-RF and OB-SVM models’ classification accuracies are 97.09% and 99.13%, respectively, for the low-complexity planting structure. The equivalent values are 92.61% and 99.08% for the medium-complexity planting structure and 88.99% and 97.21% for the high-complexity planting structure. For farmland with fragmentary plots and a high-complexity planting structure, as the planting structure complexity changed from low to high, both models’ overall accuracy levels decreased. The overall accuracy of the OB-RF model decreased by 8.1%, and that of the OB-SVM model only decreased by 1.92%. OB-SVM achieves an overall classification accuracy of 97.21%, and a single-crop extraction accuracy of at least 85.65%. Therefore, UAV multispectral remote sensing can be used for classification applications in highly complex planting structures.

2020 ◽  
Vol 26 (4) ◽  
pp. 405-425
Author(s):  
Javed Miandad ◽  
Margaret M. Darrow ◽  
Michael D. Hendricks ◽  
Ronald P. Daanen

ABSTRACT This study presents a new methodology to identify landslide and landslide-susceptible locations in Interior Alaska using only geomorphic properties from light detection and ranging (LiDAR) derivatives (i.e., slope, profile curvature, and roughness) and the normalized difference vegetation index (NDVI), focusing on the effect of different resolutions of LiDAR images. We developed a semi-automated object-oriented image classification approach in ArcGIS 10.5 and prepared a landslide inventory from visual observation of hillshade images. The multistage work flow included combining derivatives from 1-, 2.5-, and 5-m-resolution LiDAR, image segmentation, image classification using a support vector machine classifier, and image generalization to clean false positives. We assessed classification accuracy by generating confusion matrix tables. Analysis of the results indicated that LiDAR image scale played an important role in the classification, and the use of NDVI generated better results. Overall, the LiDAR 5-m-resolution image with NDVI generated the best results with a kappa value of 0.55 and an overall accuracy of 83 percent. The LiDAR 1-m-resolution image with NDVI generated the highest producer accuracy of 73 percent in identifying landslide locations. We produced a combined overlay map by summing the individual classified maps that was able to delineate landslide objects better than the individual maps. The combined classified map from 1-, 2.5-, and 5-m-resolution LiDAR with NDVI generated producer accuracies of 60, 80, and 86 percent and user accuracies of 39, 51, and 98 percent for landslide, landslide-susceptible, and stable locations, respectively, with an overall accuracy of 84 percent and a kappa value of 0.58. This semi-automated object-oriented image classification approach demonstrated potential as a viable tool with further refinement and/or in combination with additional data sources.


2019 ◽  
Vol 2 (2) ◽  
pp. 43
Author(s):  
Lalu Mutawalli ◽  
Mohammad Taufan Asri Zaen ◽  
Wire Bagye

In the era of technological disruption of mass communication, social media became a reference in absorbing public opinion. The digitalization of data is very rapidly produced by social media users because it is an attempt to represent the feelings of the audience. Data production in question is the user posts the status and comments on social media. Data production by the public in social media raises a very large set of data or can be referred to as big data. Big data is a collection of data sets in very large numbers, complex, has a relatively fast appearance time, so that makes it difficult to handle. Analysis of big data with data mining methods to get knowledge patterns in it. This study analyzes the sentiments of netizens on Twitter social media on Mr. Wiranto stabbing case. The results of the sentiment analysis showed 41% gave positive comments, 29% commented neutrally, and 29% commented negatively on events. Besides, modeling of the data is carried out using a support vector machine algorithm to create a system capable of classifying positive, neutral, and negative connotations. The classification model that has been made is then tested using the confusion matrix technique with each result is a precision value of 83%, a recall value of 80%, and finally, as much as 80% obtained in testing the accuracy.


10.29007/hbs2 ◽  
2019 ◽  
Author(s):  
Juan Carlos Valdiviezo-Navarro ◽  
Adan Salazar-Garibay ◽  
Karla Juliana Rodríguez-Robayo ◽  
Lilián Juárez ◽  
María Elena Méndez-López ◽  
...  

Maya milpa is one of the most important agrifood systems in Mesoamerica, not only because its ancient origin but also due to lead an increase in landscape diversity and to be a relevant source of families food security and food sovereignty. Nowadays, satellite remote sensing data, as the multispectral images of Sentinel-2 platforms, permit us the monitor- ing of different kinds of structures such as water bodies, urban areas, and particularly agricultural fields. Through its multispectral signatures, mono-crop fields or homogeneous vegetation zones like corn fields, barley fields, or other ones, have been successfully detected by using classification techniques with multispectral images. However, Maya milpa is a complex field which is conformed by different kinds of vegetables species and fragments of natural vegetation that in conjunction cannot be considered as a mono-crop field. In this work, we show some preliminary studies on the availability of monitoring this complex system in a region of interest in Yucatan, through a support vector machine (SVM) approach.


2021 ◽  
Author(s):  
Mostafa Sa'eed Yakoot ◽  
Adel Mohamed Salem Ragab ◽  
Omar Mahmoud

Abstract Well integrity has become a crucial field with increased focus and being published intensively in industry researches. It is important to maintain the integrity of the individual well to ensure that wells operate as expected for their designated life (or higher) with all risks kept as low as reasonably practicable, or as specified. Machine learning (ML) and artificial intelligence (AI) models are used intensively in oil and gas industry nowadays. ML concept is based on powerful algorithms and robust database. Developing an efficient classification model for well integrity (WI) anomalies is now feasible because of having enormous number of well failures and well barrier integrity tests, and analyses in the database. Circa 9000 dataset points were collected from WI tests performed for 800 wells in Gulf of Suez, Egypt for almost 10 years. Moreover, those data have been quality-controlled and quality-assured by experienced engineers. The data contain different forms of WI failures. The contributing parameter set includes a total of 23 barrier elements. Data were structured and fed into 11 different ML algorithms to build an automated systematic tool for calculating imposed risk category of any well. Comparison analysis for the deployed models was performed to infer the best predictive model that can be relied on. 11 models include both supervised and ensemble learning algorithms such as random forest, support vector machine (SVM), decision tree and scalable boosting techniques. Out of 11 models, the results showed that extreme gradient boosting (XGB), categorical boosting (CatBoost), and decision tree are the most reliable algorithms. Moreover, novel evaluation metrics for confusion matrix of each model have been introduced to overcome the problem of existing metrics which don't consider domain knowledge during model evaluation. The innovated model will help to utilize company resources efficiently and dedicate personnel efforts to wells with the high-risk. As a result, progressive improvements on business, safety, environment, and performance of the business. This paper would be a milestone in the design and creation of the Well Integrity Database Management Program through the combination of integrity and ML.


Author(s):  
I. E. Villalon-Turrubiates ◽  
M. J. Llovera-Torres

<p><strong>Abstract.</strong> The image classification procedure to identify remote sensing signatures from a particular geographical region can be performed with an identification model that has the ability to use large datasets to reach an accurate result. This novel methodology is referred to as the Statistical Enhanced Classification algorithm, which has been developed to employ multispectral images based in the statistical supervised learning theory and can be used for applications in environmental monitoring and analysis. This paper presents the performance study of the proposed methodology using both, multispectral synthetic images and multispectral remote sensing images. The obtained results are accurate due to the use of several spectral bands, the use of statistics such as mean and standard deviation for the training classes and for the pixel neighborhood, which provides more robust information, and the decision-making rule that has the ability to decide if the pixel is not belonging to a predefined class, which leads to an accurate decision model.</p>


2021 ◽  
Author(s):  
Eric Adua ◽  
Emmanuel Awuni Kolog ◽  
Ebenezer Afrifa-Yamoah ◽  
Bright Amankwah ◽  
Christian Obirikorang ◽  
...  

Abstract BackgroundAccurate prediction and early recognition of type II diabetes (T2DM) will lead to timely and meaningful interventions, while preventing T2DM associated complications. In this context, machine learning (ML) is promising, as it can transform vast amount of T2DM data into clinically relevant information. This study compares multiple ML techniques for predictive modelling based on different T2DM associated variables in an African population, Ghana. MethodsThe study involves 219 T2DM patients and 219 healthy individuals who were recruited from the hospital and the local community, respectively. Anthropometric and biochemical information including glycated haemoglobin (HbA1c), body mass index (BMI), blood pressure, fasting blood sugar (FBS), serum lipids [(total cholesterol (TC), triglycerides (TG), high and low-density lipoprotein cholesterol (HDL-c and LDL-c)] were collected. From this data, four ML classification algorithms including Naïve-Bayes (NB), K-Nearest Neighbor (KNN), Support Vector Machines (SVM) and Decision Tree (DT) were used to predict T2DM. Precision, Recall, F1-Scores, Receiver Operating Characteristics (ROC) scores and the confusion matrix were computed to determine the performance of the various algorithms while the importance of the feature attributes was determined by recursive feature elimination technique.ResultsAll the classifiers performed beyond the acceptable threshold of 70% for the Precision, Recall, F-score and Accuracy. After building the predictive model, 82% of diabetic test data was detected by the NB classifier, of which 93% were accurately predicted. The SVM classifier was the second-best performing classifier which yielded an overall accuracy of 84%. The non-T2DM test data yielded an accurate prediction score of 75% from the 98% of the proportion of the non-T2DM test data. KNN and DT yielded accuracies of 83% and 81%, respectively. NB has the best performance (AUC=0.87) followed by SVM (AUC= 0.84), KNN (AUC= 0.85) and DT (AUC= 0.81). The best three feature attributes, in order of importance, are HbA1c, TC and BMI whereas the least three importance of the features are Age, HDL-c and LDL-c.ConclusionBased on the predictive performance and high accuracy, the study has shown the potential of ML as a robust forecasting tool for T2DM. Our results can be a benchmark for guiding policy decisions in T2DM surveillance in resource and medical expertise limited countries such as Ghana.


2021 ◽  
Author(s):  
Eric Adua ◽  
Emmanuel Awuni Kolog ◽  
Ebenezer Afrifa-Yamoah ◽  
Bright Amankwah ◽  
Christian Obirikorang ◽  
...  

Abstract Background Accurate prediction and early recognition of type II diabetes (T2DM) will lead to timely and meaningful interventions, while preventing T2DM associated complications. In this context, machine learning (ML) is promising, as it can transform vast amount of T2DM data into clinically relevant information. This study compares multiple ML techniques for predictive modelling based on different T2DM associated variables in an African population, Ghana. Methods The study involves 219 T2DM patients and 219 healthy individuals who were recruited from the hospital and the local community, respectively. Anthropometric and biochemical information including glycated haemoglobin (HbA1c), body mass index (BMI), blood pressure, fasting blood sugar (FBS), serum lipids [(total cholesterol (TC), triglycerides (TG), high and low-density lipoprotein cholesterol (HDL-c and LDL-c)] were collected. From this data, four ML classification algorithms including Naïve-Bayes (NB), K-Nearest Neighbor (KNN), Support Vector Machines (SVM) and Decision Tree (DT) were used to predict T2DM. Precision, Recall, F1-Scores, Receiver Operating Characteristics (ROC) scores and the confusion matrix were computed to determine the performance of the various algorithms while the importance of the feature attributes was determined by recursive feature elimination technique. Results All the classifiers performed beyond the acceptable threshold of 70% for the Precision, Recall, F-score and Accuracy. After building the predictive model, 82% of diabetic test data was detected by the NB classifier, of which 93% were accurately predicted. The SVM classifier was the second-best performing classifier which yielded an overall accuracy of 84%. The non-T2DM test data yielded an accurate prediction score of 75% from the 98% of the proportion of the non-T2DM test data. KNN and DT yielded accuracies of 83% and 81%, respectively. NB has the best performance (AUC = 0.87) followed by SVM (AUC = 0.84), KNN (AUC = 0.85) and DT (AUC = 0.81). The best three feature attributes, in order of importance, are HbA1c, TC and BMI whereas the least three importance of the features are Age, HDL-c and LDL-c. Conclusion Based on the predictive performance and high accuracy, the study has shown the potential of ML as a robust forecasting tool for T2DM. Our results can be a benchmark for guiding policy decisions in T2DM surveillance in resource and medical expertise limited countries such as Ghana.


2021 ◽  
Vol 7 ◽  
pp. e437
Author(s):  
Arushi Agarwal ◽  
Purushottam Sharma ◽  
Mohammed Alshehri ◽  
Ahmed A. Mohamed ◽  
Osama Alfarraj

In today’s cyber world, the demand for the internet is increasing day by day, increasing the concern of network security. The aim of an Intrusion Detection System (IDS) is to provide approaches against many fast-growing network attacks (e.g., DDoS attack, Ransomware attack, Botnet attack, etc.), as it blocks the harmful activities occurring in the network system. In this work, three different classification machine learning algorithms—Naïve Bayes (NB), Support Vector Machine (SVM), and K-nearest neighbor (KNN)—were used to detect the accuracy and reducing the processing time of an algorithm on the UNSW-NB15 dataset and to find the best-suited algorithm which can efficiently learn the pattern of the suspicious network activities. The data gathered from the feature set comparison was then applied as input to IDS as data feeds to train the system for future intrusion behavior prediction and analysis using the best-fit algorithm chosen from the above three algorithms based on the performance metrics found. Also, the classification reports (Precision, Recall, and F1-score) and confusion matrix were generated and compared to finalize the support-validation status found throughout the testing phase of the model used in this approach.


Sign in / Sign up

Export Citation Format

Share Document