NAS-HRIS: Automatic Design and Architecture Search of Neural Network for Semantic Segmentation in Remote Sensing Images

Mingwei Zhang; Weipeng Jing; Jingbo Lin; Nengzhen Fang; Wei Wei; Marcin Woźniak; Robertas Damaševičius

doi:10.3390/s20185292

NAS-HRIS: Automatic Design and Architecture Search of Neural Network for Semantic Segmentation in Remote Sensing Images

Sensors ◽

10.3390/s20185292 ◽

2020 ◽

Vol 20 (18) ◽

pp. 5292 ◽

Cited By ~ 1

Author(s):

Mingwei Zhang ◽

Weipeng Jing ◽

Jingbo Lin ◽

Nengzhen Fang ◽

Wei Wei ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Network Architecture ◽

Modern Society ◽

Semantic Segmentation ◽

Search Space ◽

Remote Sensing Images ◽

Neural Network Architecture ◽

The Neural Network ◽

Urban Modelling

The segmentation of high-resolution (HR) remote sensing images is very important in modern society, especially in the fields of industry, agriculture and urban modelling. Through the neural network, the machine can effectively and accurately extract the surface feature information. However, using the traditional deep learning methods requires plentiful efforts in order to find a robust architecture. In this paper, we introduce a neural network architecture search (NAS) method, called NAS-HRIS, which can automatically search neural network architecture on the dataset. The proposed method embeds a directed acyclic graph (DAG) into the search space and designs the differentiable searching process, which enables it to learn an end-to-end searching rule by using gradient descent optimization. It uses the Gumbel-Max trick to provide an efficient way when drawing samples from a non-continuous probability distribution, and it improves the efficiency of searching and reduces the memory consumption. Compared with other NAS, NAS-HRIS consumes less GPU memory without reducing the accuracy, which corresponds to a large amount of HR remote sensing imagery data. We have carried out experiments on the WHUBuilding dataset and achieved 90.44% MIoU. In order to fully demonstrate the feasibility of the method, we made a new urban Beijing Building dataset, and conducted experiments on satellite images and non-single source images, achieving better results than SegNet, U-Net and Deeplab v3+ models, while the computational complexity of our network architecture is much smaller.

Download Full-text

A novel convolutional neural network architecture of multispectral remote sensing images for automatic material classification

Signal Processing Image Communication ◽

10.1016/j.image.2021.116329 ◽

2021 ◽

pp. 116329

Author(s):

Chuen-Horng Lin ◽

Ting-You Wang

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Convolutional Neural Network ◽

Network Architecture ◽

Remote Sensing Images ◽

Neural Network Architecture ◽

Material Classification ◽

Multispectral Remote Sensing

Download Full-text

SEMANTIC SEGMENTATION OF CONVOLUTIONAL NEURAL NETWORK FOR SUPERVISED CLASSIFICATION OF MULTISPECTRAL REMOTE SENSING

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-3-2035-2018 ◽

2018 ◽

Vol XLII-3 ◽

pp. 2035-2039

Author(s):

L. Xue ◽

C. Liu ◽

Y. Wu ◽

H. Li

Keyword(s):

Neural Network ◽

Remote Sensing ◽

High Resolution ◽

Network Architecture ◽

Semantic Segmentation ◽

Remote Sensing Imagery ◽

The Neural Network ◽

Ground Object ◽

Remote Sensing Image Processing

Semantic segmentation is a fundamental research in remote sensing image processing. Because of the complex maritime environment, the classification of roads, vegetation, buildings and water from remote Sensing Imagery is a challenging task. Although the neural network has achieved excellent performance in semantic segmentation in the last years, there are a few of works using CNN for ground object segmentation and the results could be further improved. This paper used convolution neural network named U-Net, its structure has a contracting path and an expansive path to get high resolution output. In the network , We added BN layers, which is more conducive to the reverse pass. Moreover, after upsampling convolution , we add dropout layers to prevent overfitting. They are promoted to get more precise segmentation results. To verify this network architecture, we used a Kaggle dataset. Experimental results show that U-Net achieved good performance compared with other architectures, especially in high-resolution remote sensing imagery.

Download Full-text

Effective distributed convolutional neural network architecture for remote sensing images target classification with a pre-training approach

Journal of Systems Engineering and Electronics ◽

10.21629/jsee.2019.02.02 ◽

2019 ◽

Vol 30 (2) ◽

pp. 238

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Convolutional Neural Network ◽

Network Architecture ◽

Target Classification ◽

Remote Sensing Images ◽

Neural Network Architecture ◽

Training Approach

Download Full-text

SCORING MODELING BASED ON NEURAL NETWORKS FOR DETERMINING A BANK BORROWER'S RATING

Economy of Ukraine ◽

10.15407/economyukr.2020.10.054 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 54-62

Author(s):

Oleksii VASYLIEV ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Statistical Data ◽

Activation Function ◽

Decision Making Process ◽

Neural Network Architecture ◽

Acceptable Accuracy ◽

The Neural Network ◽

Sigmoid Activation Function

The problem of applying neural networks to calculate ratings used in banking in the decision-making process on granting or not granting loans to borrowers is considered. The task is to determine the rating function of the borrower based on a set of statistical data on the effectiveness of loans provided by the bank. When constructing a regression model to calculate the rating function, it is necessary to know its general form. If so, the task is to calculate the parameters that are included in the expression for the rating function. In contrast to this approach, in the case of using neural networks, there is no need to specify the general form for the rating function. Instead, certain neural network architecture is chosen and parameters are calculated for it on the basis of statistical data. Importantly, the same neural network architecture can be used to process different sets of statistical data. The disadvantages of using neural networks include the need to calculate a large number of parameters. There is also no universal algorithm that would determine the optimal neural network architecture. As an example of the use of neural networks to determine the borrower's rating, a model system is considered, in which the borrower's rating is determined by a known non-analytical rating function. A neural network with two inner layers, which contain, respectively, three and two neurons and have a sigmoid activation function, is used for modeling. It is shown that the use of the neural network allows restoring the borrower's rating function with quite acceptable accuracy.

Download Full-text

Convolutional Neural Network for the Semantic Segmentation of Remote Sensing Images

Mobile Networks and Applications ◽

10.1007/s11036-020-01703-3 ◽

2021 ◽

Vol 26 (1) ◽

pp. 200-215

Author(s):

Muhammad Alam ◽

Jian-Feng Wang ◽

Cong Guangpei ◽

LV Yunrong ◽

Yuanfang Chen

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Neural Networks ◽

Image Processing ◽

Deep Learning ◽

Semantic Segmentation ◽

Natural Scene ◽

Remote Sensing Images ◽

Advantages And Disadvantages ◽

Target Segmentation

AbstractIn recent years, the success of deep learning in natural scene image processing boosted its application in the analysis of remote sensing images. In this paper, we applied Convolutional Neural Networks (CNN) on the semantic segmentation of remote sensing images. We improve the Encoder- Decoder CNN structure SegNet with index pooling and U-net to make them suitable for multi-targets semantic segmentation of remote sensing images. The results show that these two models have their own advantages and disadvantages on the segmentation of different objects. In addition, we propose an integrated algorithm that integrates these two models. Experimental results show that the presented integrated algorithm can exploite the advantages of both the models for multi-target segmentation and achieve a better segmentation compared to these two models.

Download Full-text

Semantic Segmentation of Remote Sensing Images Using Transfer Learning and Deep Convolutional Neural Network With Dense Connection

IEEE Access ◽

10.1109/access.2020.3003914 ◽

2020 ◽

Vol 8 ◽

pp. 116744-116755 ◽

Cited By ~ 1

Author(s):

Binge Cui ◽

Xin Chen ◽

Yan Lu

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Convolutional Neural Network ◽

Transfer Learning ◽

Semantic Segmentation ◽

Deep Convolutional Neural Network ◽

Remote Sensing Images

Download Full-text

Spatial Variability Aware Deep Neural Networks (SVANN): A General Approach

ACM Transactions on Intelligent Systems and Technology ◽

10.1145/3466688 ◽

2021 ◽

Vol 12 (6) ◽

pp. 1-21

Author(s):

Jayant Gupta ◽

Carl Molnar ◽

Yiqun Xie ◽

Joe Knight ◽

Shashi Shekhar

Keyword(s):

Neural Network ◽

Spatial Variability ◽

Network Architecture ◽

Network Models ◽

Neural Network Architecture ◽

Neural Network Models ◽

Climatic Zones ◽

The Neural Network ◽

Plant Hardiness ◽

Interpretation Model

Spatial variability is a prominent feature of various geographic phenomena such as climatic zones, USDA plant hardiness zones, and terrestrial habitat types (e.g., forest, grasslands, wetlands, and deserts). However, current deep learning methods follow a spatial-one-size-fits-all (OSFA) approach to train single deep neural network models that do not account for spatial variability. Quantification of spatial variability can be challenging due to the influence of many geophysical factors. In preliminary work, we proposed a spatial variability aware neural network (SVANN-I, formerly called SVANN ) approach where weights are a function of location but the neural network architecture is location independent. In this work, we explore a more flexible SVANN-E approach where neural network architecture varies across geographic locations. In addition, we provide a taxonomy of SVANN types and a physics inspired interpretation model. Experiments with aerial imagery based wetland mapping show that SVANN-I outperforms OSFA and SVANN-E performs the best of all.

Download Full-text

Efficient Semantic Segmentation Using Multi-Path Decoder

Applied Sciences ◽

10.3390/app10186386 ◽

2020 ◽

Vol 10 (18) ◽

pp. 6386

Author(s):

Xing Bai ◽

Jun Zhou

Keyword(s):

Neural Network ◽

Real Time ◽

Network Architecture ◽

Resource Constraints ◽

Cost Effective ◽

Semantic Segmentation ◽

Classification Model ◽

Neural Network Architecture ◽

Great Progress ◽

Different Types

Benefiting from the booming of deep learning, the state-of-the-art models achieved great progress. But they are huge in terms of parameters and floating point operations, which makes it hard to apply them to real-time applications. In this paper, we propose a novel deep neural network architecture, named MPDNet, for fast and efficient semantic segmentation under resource constraints. First, we use a light-weight classification model pretrained on ImageNet as the encoder. Second, we use a cost-effective upsampling datapath to restore prediction resolution and convert features for classification into features for segmentation. Finally, we propose to use a multi-path decoder to extract different types of features, which are not ideal to process inside only one convolutional neural network. The experimental results of our model outperform other models aiming at real-time semantic segmentation on Cityscapes. Based on our proposed MPDNet, we achieve 76.7% mean IoU on Cityscapes test set with only 118.84GFLOPs and achieves 37.6 Hz on 768 × 1536 images on a standard GPU.

Download Full-text

Towards Better Interpretability in Deep Q-Networks

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33014561 ◽

2019 ◽

Vol 33 ◽

pp. 4561-4569 ◽

Cited By ~ 2

Author(s):

Raghuram Mandyam Annasamy ◽

Katia Sycara

Keyword(s):

Neural Network ◽

Network Architecture ◽

Empirical Studies ◽

Superior Performance ◽

Training Algorithms ◽

Neural Network Architecture ◽

Q Learning ◽

The Neural Network ◽

Learning Techniques ◽

Out Of Sample

Deep reinforcement learning techniques have demonstrated superior performance in a wide variety of environments. As improvements in training algorithms continue at a brisk pace, theoretical or empirical studies on understanding what these networks seem to learn, are far behind. In this paper we propose an interpretable neural network architecture for Q-learning which provides a global explanation of the model’s behavior using key-value memories, attention and reconstructible embeddings. With a directed exploration strategy, our model can reach training rewards comparable to the state-of-the-art deep Q-learning models. However, results suggest that the features extracted by the neural network are extremely shallow and subsequent testing using out-of-sample examples shows that the agent can easily overfit to trajectories seen during training.

Download Full-text

Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data

BMC Medical Genomics ◽

10.1186/s12920-019-0624-2 ◽

2019 ◽

Vol 12 (S10) ◽

Cited By ~ 1

Author(s):

Jie Hao ◽

Youngsoon Kim ◽

Tejaswini Mallavarapu ◽

Jung Hun Oh ◽

Mingon Kang

Keyword(s):

Neural Network ◽

Survival Analysis ◽

Cancer Patient ◽

Clinical Data ◽

Network Architecture ◽

Deep Neural Network ◽

Patient Survival ◽

Biological Mechanisms ◽

Neural Network Architecture ◽

The Neural Network

Abstract Background Understanding the complex biological mechanisms of cancer patient survival using genomic and clinical data is vital, not only to develop new treatments for patients, but also to improve survival prediction. However, highly nonlinear and high-dimension, low-sample size (HDLSS) data cause computational challenges to applying conventional survival analysis. Results We propose a novel biologically interpretable pathway-based sparse deep neural network, named Cox-PASNet, which integrates high-dimensional gene expression data and clinical data on a simple neural network architecture for survival analysis. Cox-PASNet is biologically interpretable where nodes in the neural network correspond to biological genes and pathways, while capturing the nonlinear and hierarchical effects of biological pathways associated with cancer patient survival. We also propose a heuristic optimization solution to train Cox-PASNet with HDLSS data. Cox-PASNet was intensively evaluated by comparing the predictive performance of current state-of-the-art methods on glioblastoma multiforme (GBM) and ovarian serous cystadenocarcinoma (OV) cancer. In the experiments, Cox-PASNet showed out-performance, compared to the benchmarking methods. Moreover, the neural network architecture of Cox-PASNet was biologically interpreted, and several significant prognostic factors of genes and biological pathways were identified. Conclusions Cox-PASNet models biological mechanisms in the neural network by incorporating biological pathway databases and sparse coding. The neural network of Cox-PASNet can identify nonlinear and hierarchical associations of genomic and clinical data to cancer patient survival. The open-source code of Cox-PASNet in PyTorch implemented for training, evaluation, and model interpretation is available at: https://github.com/DataX-JieHao/Cox-PASNet.

Download Full-text