Deep Learning Segmentation and Classification for Urban Village Using a Worldview Satellite Image Based on U-Net

Zhuokun Pan; Jiashu Xu; Yubin Guo; Yueming Hu; Guangxing Wang

doi:10.3390/rs12101574

Deep Learning Segmentation and Classification for Urban Village Using a Worldview Satellite Image Based on U-Net

Remote Sensing ◽

10.3390/rs12101574 ◽

2020 ◽

Vol 12 (10) ◽

pp. 1574 ◽

Cited By ~ 3

Author(s):

Zhuokun Pan ◽

Jiashu Xu ◽

Yubin Guo ◽

Yueming Hu ◽

Guangxing Wang

Keyword(s):

Deep Learning ◽

Satellite Image ◽

High Density ◽

Learning Method ◽

Geospatial Information ◽

Guangzhou City ◽

Urban Village ◽

Boundary Vector ◽

Object Based ◽

Urban Settlements

Unplanned urban settlements exist worldwide. The geospatial information of these areas is critical for urban management and reconstruction planning but usually unavailable. Automatically characterizing individual buildings in the unplanned urban village using remote sensing imagery is very challenging due to complex landscapes and high-density settlements. The newly emerging deep learning method provides the potential to characterize individual buildings in a complex urban village. This study proposed an urban village mapping paradigm based on U-net deep learning architecture. The study area is located in Guangzhou City, China. The Worldview satellite image with eight pan-sharpened bands at a 0.5-m spatial resolution and building boundary vector file were used as research purposes. There are ten sites of the urban villages included in this scene of the Worldview image. The deep neural network model was trained and tested based on the selected six and four sites of the urban village, respectively. Models for building segmentation and classification were both trained and tested. The results indicated that the U-net model reached overall accuracy over 86% for building segmentation and over 83% for the classification. The F1-score ranged from 0.9 to 0.98 for the segmentation, and from 0.63 to 0.88 for the classification. The Interaction over Union reached over 90% for the segmentation and 86% for the classification. The superiority of the deep learning method has been demonstrated through comparison with Random Forest and object-based image analysis. This study fully showed the feasibility, efficiency, and potential of the deep learning in delineating individual buildings in the high-density urban village. More importantly, this study implied that through deep learning methods, mapping unplanned urban settlements could further characterize individual buildings with considerable accuracy.

Download Full-text

Hand Gesture Recognition Using Instant High-density EMG Graph via Deep Learning Method

2020 Chinese Automation Congress (CAC) ◽

10.1109/cac51589.2020.9326536 ◽

2020 ◽

Author(s):

Dezhen Xiong ◽

Daohui Zhang ◽

Xingang Zhao ◽

Yiwen Zhao

Keyword(s):

Deep Learning ◽

Gesture Recognition ◽

Hand Gesture Recognition ◽

High Density ◽

Hand Gesture ◽

Learning Method

Download Full-text

Land Cover/Land Use Mapping of LISS IV Imagery Using Object-Based Convolutional Neural Network with Deep Features

Journal of the Indian Society of Remote Sensing ◽

10.1007/s12524-019-01064-9 ◽

2019 ◽

Vol 48 (1) ◽

pp. 145-154

Author(s):

S. Rajesh ◽

T. Gladima Nisia ◽

S. Arivazhagan ◽

R. Abisekaraj

Keyword(s):

Deep Learning ◽

Classification Accuracy ◽

Urban Areas ◽

Satellite Images ◽

Feature Learning ◽

Remotely Sensed ◽

Learning Method ◽

Deep Feature ◽

Object Based ◽

Remotely Sensed Images

Abstract The paper proposes a new method for classifying the LISS IV satellite images using deep learning method. Deep learning method is to automatically extract many features without any human intervention. The classification accuracy through deep learning is still improved by including object-based segmentation. The object-based deep feature learning method using CNN is used to accurately classify the remotely sensed images. The method is designed with the technique of extracting the deep features and using it for object-based classification. The proposed system extracts deep features using pre-defined filter values, thus increasing the overall performance of the process compared to randomly initialized filter values. The object-based classification method can preserve edge information in complex satellite images. To improve the classification accuracy and to reduce complexity, object-based deep learning technique is used. The proposed object-based deep learning approach is used to drastically increase the classification accuracy. Here, the remotely sensed images were used to classify the urban areas of Ahmadabad and Madurai cities. Experimental results show a better performance with the object-based classification.

Download Full-text

Mapping Landslides on EO Data: Performance of Deep Learning Models vs. Traditional Machine Learning Models

Remote Sensing ◽

10.3390/rs12030346 ◽

2020 ◽

Vol 12 (3) ◽

pp. 346 ◽

Cited By ~ 17

Author(s):

Nikhil Prakash ◽

Andrea Manconi ◽

Simon Loew

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Regional Scale ◽

Probability Of Detection ◽

Landslide Inventory ◽

Learning Method ◽

Learning Models ◽

Object Based ◽

Landslide Mapping ◽

Conventional Methods

Mapping landslides using automated methods is a challenging task, which is still largely done using human efforts. Today, the availability of high-resolution EO data products is increasing exponentially, and one of the targets is to exploit this data source for the rapid generation of landslide inventory. Conventional methods like pixel-based and object-based machine learning strategies have been studied extensively in the last decade. In addition, recent advances in CNN (convolutional neural network), a type of deep-learning method, has been widely successful in extracting information from images and have outperformed other conventional learning methods. In the last few years, there have been only a few attempts to adapt CNN for landslide mapping. In this study, we introduce a modified U-Net model for semantic segmentation of landslides at a regional scale from EO data using ResNet34 blocks for feature extraction. We also compare this with conventional pixel-based and object-based methods. The experiment was done in Douglas County, a study area selected in the south of Portland in Oregon, USA, and landslide inventory extracted from SLIDO (Statewide Landslide Information Database of Oregon) was considered as the ground truth. Landslide mapping is an imbalanced learning problem with very limited availability of training data. Our network was trained on a combination of focal Tversky loss and cross-entropy loss functions using augmented image tiles sampled from a selected training area. The deep-learning method was observed to have a better performance than the conventional methods with an MCC (Matthews correlation coefficient) score of 0.495 and a POD (probability of detection) rate of 0.72 .

Download Full-text

Comparison between Object-based Method and Deep Learning Method for Extracting Road Features Using Submeter-grade High-resolution Satellite Imagery

Sensors and Materials ◽

10.18494/sam.2019.2472 ◽

2019 ◽

Vol 31 (10) ◽

pp. 3335

Author(s):

Dong Gook Lee ◽

Ji Ho You ◽

Sung Geun Park ◽

Seung Hyub Baeck ◽

Hyun Jik Lee

Keyword(s):

Deep Learning ◽

High Resolution ◽

Satellite Imagery ◽

Learning Method ◽

Object Based ◽

High Resolution Satellite Imagery

Download Full-text

AN EFFICIENT DEEP LEARNING METHOD FOR CUSTOMER BEHAVIOUR PREDICTION USING MOUSE CLICK EVENTS

KỶ YẾU HỘI NGHỊ KHOA HỌC CÔNG NGHỆ QUỐC GIA LẦN THỨ XI NGHIÊN CỨU CƠ BẢN VÀ ỨNG DỤNG CÔNG NGHỆ THÔNG TIN ◽

10.15625/vap.2018.0002 ◽

2018 ◽

Author(s):

Khang Nguyen ◽

Anh V. Nguyen ◽

Lan N. Vu ◽

Nga Mai ◽

Binh P. Nguyen

Keyword(s):

Deep Learning ◽

Learning Method ◽

Customer Behaviour ◽

Mouse Click

Download Full-text

A deep-learning method for tissue volumetry from multi-spectral magnetic resonance imaging in multiple sclerosis

10.26226/morressier.59a3e8b5d462b8028d894cfa ◽

2017 ◽

Author(s):

Richard McKinley

Keyword(s):

Magnetic Resonance Imaging ◽

Multiple Sclerosis ◽

Deep Learning ◽

Magnetic Resonance ◽

Learning Method ◽

Resonance Imaging

Download Full-text

Siamese Reconstruction Network: Accurate Image Reconstruction from Human Brain Activity by Learning to Compare

Applied Sciences ◽

10.3390/app9224749 ◽

2019 ◽

Vol 9 (22) ◽

pp. 4749

Author(s):

Lingyun Jiang ◽

Kai Qiao ◽

Linyuan Wang ◽

Chi Zhang ◽

Jian Chen ◽

...

Keyword(s):

Deep Learning ◽

Human Brain ◽

Brain Activity ◽

Feature Space ◽

Training Data ◽

Reconstruction Method ◽

Learning Method ◽

Training Samples ◽

Visual Reconstruction ◽

Relationship Of

Decoding human brain activities, especially reconstructing human visual stimuli via functional magnetic resonance imaging (fMRI), has gained increasing attention in recent years. However, the high dimensionality and small quantity of fMRI data impose restrictions on satisfactory reconstruction, especially for the reconstruction method with deep learning requiring huge amounts of labelled samples. When compared with the deep learning method, humans can recognize a new image because our human visual system is naturally capable of extracting features from any object and comparing them. Inspired by this visual mechanism, we introduced the mechanism of comparison into deep learning method to realize better visual reconstruction by making full use of each sample and the relationship of the sample pair by learning to compare. In this way, we proposed a Siamese reconstruction network (SRN) method. By using the SRN, we improved upon the satisfying results on two fMRI recording datasets, providing 72.5% accuracy on the digit dataset and 44.6% accuracy on the character dataset. Essentially, this manner can increase the training data about from n samples to 2n sample pairs, which takes full advantage of the limited quantity of training samples. The SRN learns to converge sample pairs of the same class or disperse sample pairs of different class in feature space.

Download Full-text

A Deep Learning Method for Frame Selection in Videos for Structure from Motion Pipelines

10.1109/icip42928.2021.9506227 ◽

2021 ◽

Author(s):

Francesco Banterle ◽

Rui Gong ◽

Massimiliano Corsini ◽

Fabio Ganovelli ◽

Luc Van Gool ◽

...

Keyword(s):

Deep Learning ◽

Structure From Motion ◽

Learning Method ◽

Frame Selection

Download Full-text

Integrating Machine/Deep Learning Methods and Filtering Techniques for Reliable Mineral Phase Segmentation of 3D X-ray Computed Tomography Images

Energies ◽

10.3390/en14154595 ◽

2021 ◽

Vol 14 (15) ◽

pp. 4595

Author(s):

Parisa Asadi ◽

Lauren E. Beckingham

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Random Forest ◽

Ct Images ◽

Ct Imaging ◽

Learning Method ◽

Learning Methods ◽

X Ray ◽

Machine Learning Methods ◽

Filtering Techniques

X-ray CT imaging provides a 3D view of a sample and is a powerful tool for investigating the internal features of porous rock. Reliable phase segmentation in these images is highly necessary but, like any other digital rock imaging technique, is time-consuming, labor-intensive, and subjective. Combining 3D X-ray CT imaging with machine learning methods that can simultaneously consider several extracted features in addition to color attenuation, is a promising and powerful method for reliable phase segmentation. Machine learning-based phase segmentation of X-ray CT images enables faster data collection and interpretation than traditional methods. This study investigates the performance of several filtering techniques with three machine learning methods and a deep learning method to assess the potential for reliable feature extraction and pixel-level phase segmentation of X-ray CT images. Features were first extracted from images using well-known filters and from the second convolutional layer of the pre-trained VGG16 architecture. Then, K-means clustering, Random Forest, and Feed Forward Artificial Neural Network methods, as well as the modified U-Net model, were applied to the extracted input features. The models’ performances were then compared and contrasted to determine the influence of the machine learning method and input features on reliable phase segmentation. The results showed considering more dimensionality has promising results and all classification algorithms result in high accuracy ranging from 0.87 to 0.94. Feature-based Random Forest demonstrated the best performance among the machine learning models, with an accuracy of 0.88 for Mancos and 0.94 for Marcellus. The U-Net model with the linear combination of focal and dice loss also performed well with an accuracy of 0.91 and 0.93 for Mancos and Marcellus, respectively. In general, considering more features provided promising and reliable segmentation results that are valuable for analyzing the composition of dense samples, such as shales, which are significant unconventional reservoirs in oil recovery.

Download Full-text

Motion-shape-based deep learning approach for divergence behavior detection in high-density crowd

The Visual Computer ◽

10.1007/s00371-021-02088-4 ◽

2021 ◽

Author(s):

Muhammad Umer Farooq ◽

Mohamad Naufal M. Saad ◽

Sultan Daud Khan

Keyword(s):

Deep Learning ◽

High Density ◽

Learning Approach ◽

Behavior Detection

Download Full-text