Comparing Three Machine Learning Techniques for Building Extraction from a Digital Surface Model

Nicla Maria Notarangelo; Arianna Mazzariello; Raffaele Albano; Aurelia Sole

doi:10.3390/app11136072

Applied Sciences ◽

10.3390/app11136072 ◽

2021 ◽

Vol 11 (13) ◽

pp. 6072

Author(s):

Nicla Maria Notarangelo ◽

Arianna Mazzariello ◽

Raffaele Albano ◽

Aurelia Sole

Keyword(s):

Machine Learning ◽

High Resolution ◽

Urban Morphology ◽

Supervised Machine Learning ◽

Support Vector ◽

Surface Model ◽

Feature Descriptor ◽

Digital Surface Model ◽

Building Extraction ◽

Area Of Interest

Automatic building extraction from high-resolution remotely sensed data is a major area of interest for an extensive range of fields (e.g., urban planning, environmental risk management) but is challenging due to the complexity of urban morphology. Among the different methods proposed, the approaches based on supervised machine learning (ML) achieve the best results. This paper aims to investigate building footprint extraction using only high-resolution raster digital surface model (DSM) data by comparing the performance of three different popular supervised ML models on a benchmark dataset. The first two methods rely on a histogram of oriented gradients (HOG) feature descriptor and a classical ML (support vector machine (SVM)) or a shallow neural network (extreme learning machine (ELM)) classifier, and the third model is a fully convolutional network (FCN) based on deep learning with transfer learning. The data used were obtained from the International Society for Photogrammetry and Remote Sensing (ISPRS) and cover the urban areas of Vaihingen an der Enz, Potsdam, and Toronto. The results indicated that the performance of models based on shallow ML (feature extraction and classifier training) is affected by the urban context investigated (F1 scores from 0.49 to 0.81), whereas the FCN-based model proved to be the most robust and best-performing method for building extraction from a high-resolution raster DSM (F1 scores from 0.80 to 0.86).
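
The first two methods in the abstract above build on a HOG descriptor. A minimal sketch of its core step, an orientation histogram for a single cell of DSM heights, might look like the following. It is not the authors' implementation: the function name, the 9 unsigned orientation bins, the central-difference gradients and the hard binning are all assumptions made for illustration, and full HOG pipelines additionally interpolate votes between bins and normalise over blocks of cells.

```python
import math

def hog_cell_histogram(cell, n_bins=9):
    """Magnitude-weighted histogram of unsigned gradient orientations.

    `cell` is a list of rows of height values (hypothetical DSM patch).
    Border pixels are skipped because central differences need neighbours.
    """
    rows, cols = len(cell), len(cell[0])
    hist = [0.0] * n_bins
    bin_width = 180.0 / n_bins
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            gx = cell[r][c + 1] - cell[r][c - 1]   # horizontal gradient
            gy = cell[r + 1][c] - cell[r - 1][c]   # vertical gradient
            magnitude = math.hypot(gx, gy)
            if magnitude == 0.0:
                continue  # flat surface: no orientation to vote for
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            # The extra modulo guards against angle rounding up to 180.
            hist[int(angle // bin_width) % n_bins] += magnitude
    return hist

# A surface rising along x puts all of its gradient energy in the first bin.
ramp_x = [[float(c) for c in range(4)] for _ in range(4)]
print(hog_cell_histogram(ramp_x))
```

In a classifier pipeline of the kind the abstract describes, histograms like this one would be concatenated over many cells to form the feature vector passed to the SVM or ELM.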

Automated Building Extraction using High Resolution Satellite Imagery though Ensemble Modelling and Machine Learning

10.21523/gcj1.18020103 ◽

2018 ◽

Vol 2 (1) ◽

pp. 31-46

Author(s):

Soumya K. Das ◽

Prakash P. S. ◽

Bharath Aithal

Keyword(s):

Machine Learning ◽

High Resolution ◽

Vegetation Index ◽

Normalized Difference Vegetation Index ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

Support Vector ◽

Building Extraction ◽

Ensemble Technique ◽

Ensemble Algorithm

Building extraction is a challenging task because of complex structures and because features of various land uses have matching spectral and spatial attributes in satellite data. We attempted to extract buildings as features using machine-learning algorithms such as Support Vector Machine (SVM), Random Forests (RF), Artificial Neural Network (ANN) and an improved ensemble technique, Gradient Boosting. The techniques increase their classification accuracy by using spectral properties as well as indices such as the Normalized Difference Vegetation Index (NDVI) as attributes. Of the results extracted by the various methods, the performance of three machine learning methods (the ensemble method, RF and SVM) is analyzed for behavior under different building distributions. The algorithms showed variations in accuracy and performance in different built-up conditions. The ensemble algorithm performed very well in all conditions, followed by RF; SVM performed better at coarse resolution, while ANN performed better at high resolution, and the overall accuracies of all algorithms increased with better spatial resolution. The ensemble algorithm showed relatively efficient performance in regions with extensive heterogeneous features. These analyses can help provide quantitative data for various stocktaking analyses and give city managers better administration capabilities.
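
The abstract above uses NDVI as an extra classification attribute alongside the spectral bands. As a rough illustration, NDVI is computed per pixel as (NIR - Red) / (NIR + Red). The helper names below and the choice to return 0.0 for pixels with no signal are assumptions for this sketch, not details taken from the paper.

```python
def ndvi(nir, red):
    """Normalized Difference Vegetation Index for one pixel."""
    total = nir + red
    if total == 0:
        return 0.0  # assumption: treat no-signal pixels as neutral
    return (nir - red) / total

def pixel_features(bands, nir_index, red_index):
    """Append NDVI to the raw band values of a pixel (hypothetical layout)."""
    return list(bands) + [ndvi(bands[nir_index], bands[red_index])]

# Vegetation reflects strongly in NIR, so its NDVI is high;
# roofs and roads typically sit near zero.
print(pixel_features([0.08, 0.10, 0.50], nir_index=2, red_index=1))
```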

Mapping Heterogeneous Urban Landscapes from the Fusion of Digital Surface Model and Unmanned Aerial Vehicle-Based Images Using Adaptive Multiscale Image Segmentation and Classification

Remote Sensing ◽

10.3390/rs12071081 ◽

2020 ◽

Vol 12 (7) ◽

pp. 1081 ◽

Cited By ~ 5

Author(s):

Mohamed Barakat A. Gibril ◽

Bahareh Kalantar ◽

Rami Al-Ruzouq ◽

Naonori Ueda ◽

Vahideh Saeidi ◽

...

Keyword(s):

Image Segmentation ◽

Unmanned Aerial Vehicle ◽

Urban Areas ◽

Integrated Approach ◽

Supervised Machine Learning ◽

Support Vector ◽

Surface Model ◽

Urban Landscapes ◽

Digital Surface Model ◽

Aerial Vehicle

Considering the high-level details in an ultrahigh-spatial-resolution (UHSR) unmanned aerial vehicle (UAV) dataset, detailed mapping of heterogeneous urban landscapes is extremely challenging because of the spectral similarity between classes. In this study, adaptive hierarchical image segmentation optimization, multilevel feature selection, and multiscale (MS) supervised machine learning (ML) models were integrated to accurately generate detailed maps for heterogeneous urban areas from the fusion of the UHSR orthomosaic and digital surface model (DSM). The integrated approach commenced through a preliminary MS image segmentation parameter selection, followed by the application of three supervised ML models, namely, random forest (RF), support vector machine (SVM), and decision tree (DT). These models were implemented at the optimal MS levels to identify preliminary information, such as the optimal segmentation level(s) and relevant features, for extracting 12 land use/land cover (LULC) urban classes from the fused datasets. Using the information obtained from the first phase of the analysis, detailed MS classification was iteratively conducted to improve the classification accuracy and derive the final urban LULC maps. Two UAV-based datasets were used to develop and assess the effectiveness of the proposed framework. The hierarchical classification of the pilot study area showed that the RF was superior with an overall accuracy (OA) of 94.40% and a kappa coefficient (K) of 0.938, followed by SVM (OA = 92.50% and K = 0.917) and DT (OA = 91.60% and K = 0.908). The classification results of the second dataset revealed that SVM was superior with an OA of 94.45% and K of 0.938, followed by RF (OA = 92.46% and K = 0.916) and DT (OA = 90.46% and K = 0.893). The proposed framework exhibited an excellent potential for the detailed mapping of heterogeneous urban landscapes from the fusion of UHSR orthophoto and DSM images using various ML models.
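
The abstract above reports overall accuracy (OA) and the kappa coefficient (K) for each classifier. Both can be derived from a confusion matrix, as in the sketch below. The function name and the example matrix are made up for illustration; the numbers are not taken from the paper.

```python
def overall_accuracy_and_kappa(confusion):
    """Return (OA, Cohen's kappa) for a square confusion matrix.

    confusion[i][j] counts samples of true class i predicted as class j.
    """
    k = len(confusion)
    total = sum(sum(row) for row in confusion)
    observed = sum(confusion[i][i] for i in range(k)) / total
    row_sums = [sum(row) for row in confusion]
    col_sums = [sum(confusion[r][c] for r in range(k)) for c in range(k)]
    # Agreement expected by chance from the marginal totals.
    expected = sum(row_sums[i] * col_sums[i] for i in range(k)) / (total * total)
    if expected == 1.0:
        return observed, float("nan")  # kappa is undefined in this degenerate case
    return observed, (observed - expected) / (1.0 - expected)

# Hypothetical two-class example: OA is 0.85, chance agreement is 0.5,
# so kappa is (0.85 - 0.5) / (1 - 0.5).
print(overall_accuracy_and_kappa([[45, 5], [10, 40]]))
```

Kappa discounts the agreement expected by chance, which is why it is lower than OA in each of the results quoted above.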

Predictive Modelling of Employee Turnover in Indian IT Industry Using Machine Learning Techniques

Vision The Journal of Business Perspective ◽

10.1177/0972262918821221 ◽

2019 ◽

Vol 23 (1) ◽

pp. 12-21 ◽

Cited By ~ 2

Author(s):

Shikha N. Khera ◽

Divya

Keyword(s):

Machine Learning ◽

Learning Algorithm ◽

Confusion Matrix ◽

Predictive Modelling ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

IT Industry ◽

Knowledge Based ◽

Employee Attrition

The information technology (IT) industry in India has been facing a systemic issue of high attrition in the past few years, resulting in monetary and knowledge-based losses to the companies. The aim of this research is to develop a model to predict employee attrition and give organizations the opportunity to address any issues and improve retention. A predictive model was developed based on a supervised machine learning algorithm, the support vector machine (SVM). Archival employee data (consisting of 22 input features) were collected from the Human Resource databases of three IT companies in India, including the employment status (response variable) at the time of collection. The confusion matrix for the SVM model showed that the model has an accuracy of 85 per cent. The results also show that the model performs better in predicting who will leave the firm than in predicting who will not.

Supervised Machine Learning Methods and Hyperspectral Imaging Techniques Jointly Applied for Brain Cancer Classification

Sensors ◽

10.3390/s21113827 ◽

2021 ◽

Vol 21 (11) ◽

pp. 3827

Author(s):

Gemma Urbanos ◽

Alberto Martín ◽

Guillermo Vázquez ◽

Marta Villanueva ◽

Manuel Villa ◽

...

Keyword(s):

Machine Learning ◽

Blood Vessel ◽

Hyperspectral Imaging ◽

Imaging Techniques ◽

Venous Blood ◽

Healthy Tissue ◽

Supervised Machine Learning ◽

Support Vector ◽

Arterial Blood

Hyperspectral imaging techniques (HSI) do not require contact with patients and are non-ionizing as well as non-invasive. As a consequence, they have been extensively applied in the medical field. HSI is being combined with machine learning (ML) processes to obtain models to assist in diagnosis. In particular, the combination of these techniques has proven to be a reliable aid in the differentiation of healthy and tumor tissue during brain tumor surgery. ML algorithms such as support vector machine (SVM), random forest (RF) and convolutional neural networks (CNN) are used to make predictions and provide in-vivo visualizations that may assist neurosurgeons in being more precise, hence reducing damage to healthy tissue. In this work, thirteen in-vivo hyperspectral images from twelve different patients with high-grade gliomas (grade III and IV) have been selected to train SVM, RF and CNN classifiers. Five different classes have been defined during the experiments: healthy tissue, tumor, venous blood vessel, arterial blood vessel and dura mater. Overall accuracy (OACC) results vary from 60% to 95% depending on the training conditions. Finally, as far as the contribution of each band to the OACC is concerned, the results obtained in this work are 3.81 times greater than those reported in the literature.

Financial Context News Sentiment Analysis for the Lithuanian Language

Applied Sciences ◽

10.3390/app11104443 ◽

2021 ◽

Vol 11 (10) ◽

pp. 4443

Author(s):

Rokas Štrimaitis ◽

Pavel Stefanovič ◽

Simona Ramanauskaitė ◽

Asta Slotkienė

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Short Term Memory ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Experimental Investigations ◽

Support Vector ◽

Applied Machine Learning ◽

Bayes Algorithm ◽

Website Content

Financial area analysis is not limited to enterprise performance analysis. It is worth analyzing as wide an area as possible to obtain a full impression of a specific enterprise. News website content is a data source that expresses the public’s opinion on enterprise operations, status, etc. Therefore, it is worth analyzing the text of news portal articles. Sentiment analysis methods for English texts and for financial texts exist and are accurate, whereas work on the more complex Lithuanian language has mostly concentrated on sentiment analysis of comment texts and does not provide high accuracy. Therefore, in this paper, a supervised machine learning model was implemented to assign sentiment to financial context news gathered from Lithuanian language websites. The analysis was made using three classification algorithms commonly used in the field of sentiment analysis. Hyperparameter optimization using grid search was performed to discover the best parameters of each classifier. All experimental investigations were made using newly collected datasets from four Lithuanian news websites. The results of the applied machine learning algorithms show that the highest accuracy is obtained using a non-balanced dataset, via the multinomial Naive Bayes algorithm (71.1%). The accuracies of the other algorithms were slightly lower: long short-term memory (71%) and support vector machine (70.4%).
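
The abstract above tunes each classifier with a grid search. A minimal sketch of the idea is to try every combination of candidate hyperparameter values and keep the best-scoring one. The parameter names (`alpha`, `fit_prior`) and the toy scoring function are hypothetical stand-ins; in practice the score would be something like cross-validated accuracy of the trained classifier.

```python
import math
from itertools import product

def grid_search(param_grid, score_fn):
    """Exhaustively evaluate every parameter combination; higher score wins."""
    names = sorted(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        score = score_fn(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score

def toy_score(params):
    """Hypothetical stand-in for a validation score; peaks at alpha = 1."""
    bonus = 0.1 if params["fit_prior"] else 0.0
    return -(math.log10(params["alpha"]) ** 2) + bonus

grid = {"alpha": [0.01, 0.1, 1.0, 10.0], "fit_prior": [True, False]}
print(grid_search(grid, toy_score))
```

The cost grows with the product of the list lengths, so grids are usually kept coarse, as here with 4 x 2 = 8 evaluations.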

Drill-Core Mineral Abundance Estimation Using Hyperspectral and High-Resolution Mineralogical Data

Remote Sensing ◽

10.3390/rs12071218 ◽

2020 ◽

Vol 12 (7) ◽

pp. 1218

Author(s):

Laura Tuşa ◽

Mahdi Khodadadzadeh ◽

Cecilia Contreras ◽

Kasra Rafiezadeh Shahi ◽

Margret Fuchs ◽

...

Keyword(s):

Machine Learning ◽

High Resolution ◽

Ore Deposits ◽

Machine Learning Algorithms ◽

Training Data ◽

Support Vector ◽

Drill Core ◽

Data Types ◽

Mineralogical Characterization ◽

Core Samples

Due to the extensive drilling performed every year in exploration campaigns for the discovery and evaluation of ore deposits, drill-core mapping is becoming an essential step. While valuable mineralogical information is extracted during core logging by on-site geologists, the process is time consuming and dependent on the observer and individual background. Hyperspectral short-wave infrared (SWIR) data is used in the mining industry as a tool to complement traditional logging techniques and to provide a rapid and non-invasive analytical method for mineralogical characterization. Additionally, Scanning Electron Microscopy-based image analyses using a Mineral Liberation Analyser (SEM-MLA) provide exhaustive high-resolution mineralogical maps, but can only be performed on small areas of the drill-cores. We propose to use machine learning algorithms to combine the two data types and upscale the quantitative SEM-MLA mineralogical data to drill-core scale. This way, quasi-quantitative maps over entire drill-core samples are obtained. Our upscaling approach increases result transparency and reproducibility by employing physical-based data acquisition (hyperspectral imaging) combined with mathematical models (machine learning). The procedure is tested on 5 drill-core samples with varying training data using random forests, support vector machines and neural network regression models. The obtained mineral abundance maps are further used for the extraction of mineralogical parameters such as mineral association.

Optimizing machine learning models for granular NdFeB magnets by very fast simulated annealing

Scientific Reports ◽

10.1038/s41598-021-83315-9 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Hyeon-Kyu Park ◽

Jae-Hyeok Lee ◽

Jehyun Lee ◽

Sang-Koog Kim

Keyword(s):

Machine Learning ◽

Simulated Annealing ◽

Permanent Magnets ◽

Supervised Machine Learning ◽

Support Vector ◽

Micromagnetic Simulations ◽

NdFeB Magnets ◽

Average Grain Size ◽

Macroscopic Properties ◽

Very Fast Simulated Annealing

The macroscopic properties of permanent magnets and the resultant performance required for real implementations are determined by the magnets’ microscopic features. However, earlier micromagnetic simulations and experimental studies required a relatively large amount of work to gain a complete and comprehensive understanding of the relationships between magnets’ macroscopic properties and their microstructures. Here, by means of supervised learning, we predict reliable values of coercivity (μ0Hc) and maximum magnetic energy product (BHmax) of granular NdFeB magnets according to their microstructural attributes (e.g. inter-grain decoupling, average grain size, and misalignment of easy axes) based on numerical datasets obtained from micromagnetic simulations. We conducted several tests of a variety of supervised machine learning (ML) models including kernel ridge regression (KRR), support vector regression (SVR), and artificial neural network (ANN) regression. The hyper-parameters of these models were optimized by a very fast simulated annealing (VFSA) algorithm with an adaptive cooling schedule. In our datasets of 1,000 randomly generated polycrystalline NdFeB cuboids with different microstructural attributes, all of the models yielded similar results in predicting both μ0Hc and BHmax. Furthermore, some outliers, which deteriorated the normality of residuals in the prediction of BHmax, were detected and further analyzed. Based on all of our results, we can conclude that our ML approach combined with micromagnetic simulations provides a robust framework for optimal design of microstructures for high-performance NdFeB magnets.
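
The abstract above optimizes hyper-parameters with very fast simulated annealing. The sketch below shows the general shape of a VFSA-style search using the commonly cited schedule T_k = T0 · exp(-c · k^(1/D)) and its heavy-tailed step generator. It does not reproduce the paper's adaptive cooling schedule. The function names, the toy objective, the temperature floor and the reuse of one temperature for both generation and acceptance are assumptions made for brevity.

```python
import math
import random

def vfsa_minimize(objective, bounds, n_iter=2000, t0=1.0, c=1.0, seed=0):
    """Minimise objective(x) inside box bounds with a VFSA-style search."""
    rng = random.Random(seed)
    dim = len(bounds)
    x = [rng.uniform(lo, hi) for lo, hi in bounds]
    fx = objective(x)
    best_x, best_f = list(x), fx
    for k in range(1, n_iter + 1):
        # Cooling schedule T_k = T0 * exp(-c * k**(1/D)); the floor is a
        # numerical safeguard (assumption) against underflow to zero.
        temp = max(t0 * math.exp(-c * k ** (1.0 / dim)), 1e-12)
        candidate = []
        for xi, (lo, hi) in zip(x, bounds):
            while True:  # redraw until the new value stays inside the bounds
                u = rng.random()
                step = math.copysign(1.0, u - 0.5) * temp * (
                    (1.0 + 1.0 / temp) ** abs(2.0 * u - 1.0) - 1.0)
                new = xi + step * (hi - lo)
                if lo <= new <= hi:
                    break
            candidate.append(new)
        fc = objective(candidate)
        # Metropolis rule: always accept improvements, sometimes accept worse.
        if fc <= fx or rng.random() < math.exp(-(fc - fx) / temp):
            x, fx = candidate, fc
            if fx < best_f:
                best_x, best_f = list(x), fx
    return best_x, best_f

def toy_loss(x):
    """Hypothetical stand-in for a validation error; minimum at (1, -2)."""
    return (x[0] - 1.0) ** 2 + (x[1] + 2.0) ** 2

print(vfsa_minimize(toy_loss, [(-5.0, 5.0), (-5.0, 5.0)]))
```

For hyper-parameter tuning, `objective` would wrap model training and return a validation error, with `bounds` giving the search range of each hyper-parameter.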

Prediction of CO2 Minimum Miscibility Pressure Using an Augmented Machine-Learning-Based Model

SPE Journal ◽

10.2118/200326-pa ◽

2021 ◽

pp. 1-13

Author(s):

Utkarsh Sinha ◽

Birol Dindoruk ◽

Mohamed Soliman

Keyword(s):

Machine Learning ◽

Phase Behavior ◽

Hybrid Method ◽

Gas Injection ◽

Supervised Machine Learning ◽

Design Parameters ◽

Support Vector ◽

Hydrocarbon Gases ◽

Minimum Miscibility Pressure ◽

Analytical Correlation

Minimum miscibility pressure (MMP) is one of the key design parameters for gas injection projects. It is a physical parameter that is a measure of local displacement efficiency while subject to some constraints due to its definition. Also, the MMP value is used to tune compositional models along with proper fluid description constrained with other available basic phase behavior data, such as bubble point pressure and volumetric properties. In general, carbon dioxide (CO2) and hydrocarbon gases are the most common gases used for (or screened for) gas injection processes, and because of recent focus, they are used to screen for the coupling of CO2-sequestration and CO2-enhanced oil recovery (EOR) projects. Because the CO2/oil phase behavior is quite different than the hydrocarbon gas/oil phase behavior, researchers developed specialized correlations for CO2 or CO2-rich streams. Therefore, there is a need for a tool with expanded range capabilities for the estimation of MMP for CO2 gas streams. The only known and widely accepted measurement technique for MMP that is coherent with its formal definition is the use of a slimtube apparatus. However, the use of slimtube restricts the amount of data available, even though there are other alternative techniques presented over the last three decades, which all have various limitations (Dindoruk et al. 2021). Due to some of the complexities highlighted in Dindoruk et al. (2021) and time and resource requirements, there have been a number of correlations developed in the literature using mostly classical regression techniques with relatively sparse data using various combinations of limited input data (Cronquist 1978; Lee 1979; Yellig and Metcalfe 1980; Alston et al. 1985; Glaso 1985; Jaubert et al. 1998; Emera and Sarma 2005; Yuan et al. 2005; Ahmadi et al. 2010; Ahmadi and Johns 2011).
In this paper, we present two separate approaches for the calculation of the MMP of an oil for CO2 injection: analytical correlation in which the correlation coefficients were tuned using linear support vector machines (SVMs) (Press et al. 2007; MathWorks 2020; RDocumentation 2020b; Cortes and Vapnik 1995) and using a hybrid method (i.e., superlearner model), which consists of the combination of random forest (RF) regression (Breiman 2001) and the proposed analytical correlation. Both models take the compositional analysis of oils up to heptane plus fraction, molecular weight of oil, and the reservoir temperature as input parameters. Based on statistical and data analysis techniques in combination with the help of corresponding crossplots, we showed that the performance of the final proposed method (hybrid method) is superior to all the leading correlations (Cronquist 1978; Lee 1979; Yellig and Metcalfe 1980; Alston et al. 1985; Glaso 1985; Emera and Sarma 2005; Yuan et al. 2005) and supervised machine-learning (Metcalfe 1982) methods considered in the literature (Altman 1992; Chambers and Hastie 1992; Chapelle and Vapnik 2000; Breiman 2001; Press et al. 2007; MathWorks 2020). The proposed model works for the widest spectrum of MMPs from 1,000 to 4,900 psia, which covers the entire range of oils within the scope of CO2 EOR based on the widely used screening criteria (Taber et al. 1997a, 1997b).

Sentiment Analysis using various Machine Learning and Deep Learning Techniques

Journal of the Nigerian Society of Physical Sciences ◽

10.46481/jnsps.2021.308 ◽

2021 ◽

pp. 385-394

Author(s):

V Umarani ◽

A Julian ◽

J Deepa

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

Analysis Process ◽

Learning Techniques

Sentiment analysis has gained a lot of attention from researchers in the last year because it has been widely applied to a variety of application domains such as business, government, education, sports, tourism, biomedicine, and telecommunication services. Sentiment analysis is an automated computational method for studying or evaluating sentiments, feelings, and emotions expressed as comments, feedback, or critiques. The sentiment analysis process can be automated using machine learning techniques, which analyse text patterns faster. Supervised machine learning is the most used mechanism for sentiment analysis. The proposed work discusses the flow of the sentiment analysis process and investigates common supervised machine learning techniques such as multinomial naive Bayes, Bernoulli naive Bayes, logistic regression, support vector machine, random forest, K-nearest neighbor, and decision tree, as well as deep learning techniques such as Long Short-Term Memory and Convolutional Neural Network. The work examines these learning methods on a standard data set. The experimental results demonstrate the performance of the various classifiers in terms of precision, recall, F1-score, ROC curve, accuracy, running time and k-fold cross-validation; they help in appreciating the novelty of several deep learning techniques and give the user an overview for choosing the right technique for their application.

Classifying Lensed Gravitational Waves in the Geometrical Optics Limit with Machine Learning

American Journal of Undergraduate Research ◽

10.33697/ajur.2019.019 ◽

2019 ◽

Vol 16 (2) ◽

pp. 5-16

Author(s):

Amit Singh ◽

Ivan Li ◽

Otto Hannuksela ◽

Tjonnie Li ◽

Kyungmin Kim

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Gravitational Wave ◽

Gravitational Waves ◽

Geometrical Optics ◽

Supervised Machine Learning ◽

Support Vector ◽

Multi Layer Perceptron ◽

Machine Learning Classifiers ◽

Learning Classifiers

Gravitational waves are theorized to be gravitationally lensed when they propagate near massive objects. Such lensing effects cause potentially detectable repeated gravitational wave patterns in ground- and space-based gravitational wave detectors. These effects are difficult to discriminate when the lens is small and the repeated patterns superpose. Traditionally, matched filtering techniques are used to identify gravitational-wave signals, but we instead aim to utilize machine learning techniques to achieve this. In this work, we implement supervised machine learning classifiers (support vector machine, random forest, multi-layer perceptron) to discriminate such lensing patterns in gravitational wave data. We train classifiers with spectrograms of both lensed and unlensed waves using both point-mass and singular isothermal sphere lens models. As a result, classifiers return F1 scores ranging from 0.852 to 0.996, with precisions from 0.917 to 0.992 and recalls ranging from 0.796 to 1.000 depending on the type of classifier and lensing model used. This supports the idea that machine learning classifiers are able to correctly determine lensed gravitational wave signals. This also suggests that in the future, machine learning classifiers may be used as a possible alternative to identify lensed gravitational wave events and to allow us to study gravitational wave sources and massive astronomical objects through further analysis. KEYWORDS: Gravitational Waves; Gravitational Lensing; Geometrical Optics; Machine Learning; Classification; Support Vector Machine; Random Tree Forest; Multi-layer Perceptron
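
The abstract above reports precision, recall and F1 for each classifier. For reference, a small sketch of how the three are related: F1 is the harmonic mean of precision and recall. The function name and the example counts are illustrative assumptions, and returning 0.0 when a denominator is zero is one common convention, not something stated in the paper.

```python
def precision_recall_f1(tp, fp, fn):
    """Return (precision, recall, F1) from true/false positive and false negative counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f1 = 2.0 * precision * recall / (precision + recall)
    return precision, recall, f1

# Hypothetical counts: every flagged event is truly lensed,
# but half of the lensed events are missed.
print(precision_recall_f1(tp=5, fp=0, fn=5))
```

Because the harmonic mean is pulled toward the smaller value, a classifier cannot reach a high F1 by excelling at only one of precision or recall.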
