Remote Sensing Imagery Super Resolution Based on Adaptive Multi-Scale Feature Fusion Network

Xinying Wang; Yingdan Wu; Yang Ming; Hui Lv

doi:10.3390/s20041142

Remote Sensing Imagery Super Resolution Based on Adaptive Multi-Scale Feature Fusion Network

Sensors ◽

10.3390/s20041142 ◽

2020 ◽

Vol 20 (4) ◽

pp. 1142

Author(s):

Xinying Wang ◽

Yingdan Wu ◽

Yang Ming ◽

Hui Lv

Keyword(s):

Remote Sensing ◽

Feature Extraction ◽

Feature Fusion ◽

Super Resolution ◽

Convolutional Network ◽

Remote Sensing Imagery ◽

Resolution Image ◽

Scale Feature ◽

Multi Scale ◽

Key Characteristics

Due to increasingly complex factors of image degradation, inferring high-frequency details of remote sensing imagery is more difficult compared to ordinary digital photos. This paper proposes an adaptive multi-scale feature fusion network (AMFFN) for remote sensing image super-resolution. Firstly, the features are extracted from the original low-resolution image. Then several adaptive multi-scale feature extraction (AMFE) modules, the squeeze-and-excited and adaptive gating mechanisms are adopted for feature extraction and fusion. Finally, the sub-pixel convolution method is used to reconstruct the high-resolution image. Experiments are performed on three datasets, the key characteristics, such as the number of AMFEs and the gating connection way are studied, and super-resolution of remote sensing imagery of different scale factors are qualitatively and quantitatively analyzed. The results show that our method outperforms the classic methods, such as Super-Resolution Convolutional Neural Network(SRCNN), Efficient Sub-Pixel Convolutional Network (ESPCN), and multi-scale residual CNN(MSRN).

Download Full-text

A Novel Effectively Optimized One-Stage Network for Object Detection in Remote Sensing Imagery

Remote Sensing ◽

10.3390/rs11111376 ◽

2019 ◽

Vol 11 (11) ◽

pp. 1376 ◽

Cited By ~ 4

Author(s):

Weiying Xie ◽

Haonan Qin ◽

Yunsong Li ◽

Zhuo Wang ◽

Jie Lei

Keyword(s):

Remote Sensing ◽

Spatial Information ◽

Feature Fusion ◽

Field Enhancement ◽

Feature Representation ◽

Detection Accuracy ◽

Convolutional Network ◽

Remote Sensing Imagery ◽

Multi Scale ◽

One Stage

With great significance in military and civilian applications, the topic of detecting small and densely arranged objects in wide-scale remote sensing imagery is still challenging nowadays. To solve this problem, we propose a novel effectively optimized one-stage network (NEOON). As a fully convolutional network, NEOON consists of four parts: Feature extraction, feature fusion, feature enhancement, and multi-scale detection. To extract effective features, the first part has implemented bottom-up and top-down coherent processing by taking successive down-sampling and up-sampling operations in conjunction with residual modules. The second part consolidates high-level and low-level features by adopting concatenation operations with subsequent convolutional operations to explicitly yield strong feature representation and semantic information. The third part is implemented by constructing a receptive field enhancement (RFE) module and incorporating it into the fore part of the network where the information of small objects exists. The final part is achieved by four detectors with different sensitivities accessing the fused features, all four parallel, to enable the network to make full use of information of objects in different scales. Besides, the Focal Loss is set to enable the cross entropy for classification to solve the tough problem of class imbalance in one-stage methods. In addition, we introduce the Soft-NMS to preserve accurate bounding boxes in the post-processing stage especially for densely arranged objects. Note that the split and merge strategy and multi-scale training strategy are employed in training. Thorough experiments are performed on ACS datasets constructed by us and NWPU VHR-10 datasets to evaluate the performance of NEOON. Specifically, 4.77% and 5.50% improvements in mAP and recall, respectively, on the ACS dataset as compared to YOLOv3 powerfully prove that NEOON can effectually improve the detection accuracy of small objects in remote sensing imagery. In addition, extensive experiments and comprehensive evaluations on the NWPU VHR-10 dataset with 10 classes have illustrated the superiority of NEOON in the extraction of spatial information of high-resolution remote sensing images.

Download Full-text

A Multi-Scale Feature Extraction-Based Normalized Attention Neural Network for Image Denoising

Electronics ◽

10.3390/electronics10030319 ◽

2021 ◽

Vol 10 (3) ◽

pp. 319

Author(s):

Yi Wang ◽

Xiao Song ◽

Guanghong Gong ◽

Ni Li

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Image Denoising ◽

Color Image ◽

Rapid Development ◽

Similarity Index ◽

Structural Similarity ◽

Convolutional Network ◽

Scale Feature ◽

Multi Scale

Due to the rapid development of deep learning and artificial intelligence techniques, denoising via neural networks has drawn great attention due to their flexibility and excellent performances. However, for most convolutional network denoising methods, the convolution kernel is only one layer deep, and features of distinct scales are neglected. Moreover, in the convolution operation, all channels are treated equally; the relationships of channels are not considered. In this paper, we propose a multi-scale feature extraction-based normalized attention neural network (MFENANN) for image denoising. In MFENANN, we define a multi-scale feature extraction block to extract and combine features at distinct scales of the noisy image. In addition, we propose a normalized attention network (NAN) to learn the relationships between channels, which smooths the optimization landscape and speeds up the convergence process for training an attention model. Moreover, we introduce the NAN to convolutional network denoising, in which each channel gets gain; channels can play different roles in the subsequent convolution. To testify the effectiveness of the proposed MFENANN, we used both grayscale and color image sets whose noise levels ranged from 0 to 75 to do the experiments. The experimental results show that compared with some state-of-the-art denoising methods, the restored images of MFENANN have larger peak signal-to-noise ratios (PSNR) and structural similarity index measure (SSIM) values and get better overall appearance.

Download Full-text

Semantic segmentation of remote sensing images based on dual attention and multi-scale feature fusion

Twelfth International Conference on Graphics and Image Processing (ICGIP 2020) ◽

10.1117/12.2589380 ◽

2021 ◽

Author(s):

Mengqian Weng ◽

Zhibo Hu ◽

Xiaopeng Xie ◽

Yunhong Li ◽

Lei Hu

Keyword(s):

Remote Sensing ◽

Feature Fusion ◽

Semantic Segmentation ◽

Remote Sensing Images ◽

Scale Feature ◽

Multi Scale

Download Full-text

IR remote sensing image registration based on multi-scale feature extraction

2014 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn.2014.6889630 ◽

2014 ◽

Author(s):

Jun Kong ◽

Min Jiang ◽

Jun Kong ◽

Yi-Ning Sun

Keyword(s):

Remote Sensing ◽

Feature Extraction ◽

Image Registration ◽

Remote Sensing Image ◽

Scale Feature ◽

Multi Scale

Download Full-text

A Novel 2D-3D CNN with Spectral-Spatial Multi-Scale Feature Fusion for Hyperspectral Image Classification

Remote Sensing ◽

10.3390/rs13224621 ◽

2021 ◽

Vol 13 (22) ◽

pp. 4621

Author(s):

Dongxu Liu ◽

Guangliang Han ◽

Peixun Liu ◽

Hang Yang ◽

Xinglong Sun ◽

...

Keyword(s):

Feature Extraction ◽

Image Classification ◽

Classification Scheme ◽

Hyperspectral Image ◽

Feature Fusion ◽

Spectral Feature ◽

Hyperspectral Image Classification ◽

Scale Feature ◽

Multi Scale ◽

3D Cnn

Multifarious hyperspectral image (HSI) classification methods based on convolutional neural networks (CNN) have been gradually proposed and achieve a promising classification performance. However, hyperspectral image classification still suffers from various challenges, including abundant redundant information, insufficient spectral-spatial representation, irregular class distribution, and so forth. To address these issues, we propose a novel 2D-3D CNN with spectral-spatial multi-scale feature fusion for hyperspectral image classification, which consists of two feature extraction streams, a feature fusion module as well as a classification scheme. First, we employ two diverse backbone modules for feature representation, that is, the spectral feature and the spatial feature extraction streams. The former utilizes a hierarchical feature extraction module to capture multi-scale spectral features, while the latter extracts multi-stage spatial features by introducing a multi-level fusion structure. With these network units, the category attribute information of HSI can be fully excavated. Then, to output more complete and robust information for classification, a multi-scale spectral-spatial-semantic feature fusion module is presented based on a Decomposition-Reconstruction structure. Last of all, we innovate a classification scheme to lift the classification accuracy. Experimental results on three public datasets demonstrate that the proposed method outperforms the state-of-the-art methods.

Download Full-text

A new end-to-end image dehazing algorithm based on residual attention mechanism

Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University ◽

10.1051/jnwpu/20213940901 ◽

2021 ◽

Vol 39 (4) ◽

pp. 901-908

Author(s):

Zhenjian Yang ◽

Jiamei Shang ◽

Zhongwei Zhang ◽

Yan Zhang ◽

Shudong Liu

Keyword(s):

Feature Extraction ◽

Feature Fusion ◽

Attention Mechanism ◽

Image Dehazing ◽

Atmospheric Scattering ◽

Scale Feature ◽

Multi Scale ◽

Color Distortion ◽

End To End ◽

Density Image

Traditional image dehazing algorithms based on prior knowledge and deep learning rely on the atmospheric scattering model and are easy to cause color distortion and incomplete dehazing. To solve these problems, an end-to-end image dehazing algorithm based on residual attention mechanism is proposed in this paper. The network includes four modules: encoder, multi-scale feature extraction, feature fusion and decoder. The encoder module encodes the input haze image into feature map, which is convenient for subsequent feature extraction and reduces memory consumption; the multi-scale feature extraction module includes residual smoothed dilated convolution module, residual block and efficient channel attention, which can expand the receptive field and extract different scale features by filtering and weighting; the feature fusion module with efficient channel attention adjusts the channel weight dynamically, acquires rich context information and suppresses redundant information so as to enhance the ability to extract haze density image of the network; finally, the encoder module maps the fused feature nonlinearly to obtain the haze density image and then restores the haze free image. The qualitative and quantitative tests based on SOTS test set and natural haze images show good objective and subjective evaluation results. This algorithm improves the problems of color distortion and incomplete dehazing effectively.

Download Full-text

Geospatial Object Detection on High Resolution Remote Sensing Imagery Based on Double Multi-Scale Feature Pyramid Network

Remote Sensing ◽

10.3390/rs11070755 ◽

2019 ◽

Vol 11 (7) ◽

pp. 755 ◽

Cited By ~ 20

Author(s):

Xiaodong Zhang ◽

Kun Zhu ◽

Guanzhou Chen ◽

Xiaoliang Tan ◽

Lifei Zhang ◽

...

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Object Detection ◽

Large Scale ◽

Training Data ◽

Validation Dataset ◽

Remote Sensing Imagery ◽

Scale Feature ◽

Multi Scale ◽

Feature Pyramid

Object detection on very-high-resolution (VHR) remote sensing imagery has attracted a lot of attention in the field of image automatic interpretation. Region-based convolutional neural networks (CNNs) have been vastly promoted in this domain, which first generate candidate regions and then accurately classify and locate the objects existing in these regions. However, the overlarge images, the complex image backgrounds and the uneven size and quantity distribution of training samples make the detection tasks more challenging, especially for small and dense objects. To solve these problems, an effective region-based VHR remote sensing imagery object detection framework named Double Multi-scale Feature Pyramid Network (DM-FPN) was proposed in this paper, which utilizes inherent multi-scale pyramidal features and combines the strong-semantic, low-resolution features and the weak-semantic, high-resolution features simultaneously. DM-FPN consists of a multi-scale region proposal network and a multi-scale object detection network, these two modules share convolutional layers and can be trained end-to-end. We proposed several multi-scale training strategies to increase the diversity of training data and overcome the size restrictions of the input images. We also proposed multi-scale inference and adaptive categorical non-maximum suppression (ACNMS) strategies to promote detection performance, especially for small and dense objects. Extensive experiments and comprehensive evaluations on large-scale DOTA dataset demonstrate the effectiveness of the proposed framework, which achieves mean average precision (mAP) value of 0.7927 on validation dataset and the best mAP value of 0.793 on testing dataset.

Download Full-text

BMF-CNN: an object detection method based on multi-scale feature fusion in VHR remote sensing images

Remote Sensing Letters ◽

10.1080/2150704x.2019.1706007 ◽

2019 ◽

Vol 11 (3) ◽

pp. 215-224

Author(s):

Zhong Dong ◽

Baojun Lin

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Detection Method ◽

Feature Fusion ◽

Remote Sensing Images ◽

Scale Feature ◽

Multi Scale

Download Full-text

Pedestrian detection algorithm based on multi-scale feature extraction and attention feature fusion

Digital Signal Processing ◽

10.1016/j.dsp.2021.103311 ◽

2021 ◽

pp. 103311

Author(s):

Hao Xia ◽

Jun Ma ◽

Jiayu Ou ◽

Xinyao Lv ◽

Chengjie Bai

Keyword(s):

Feature Extraction ◽

Feature Fusion ◽

Pedestrian Detection ◽

Detection Algorithm ◽

Scale Feature ◽

Multi Scale

Download Full-text

Road Extraction from GF-1 Remote Sensing Images Based on Dilated Convolution Residual Network with Multi-Scale Feature Fusion

Laser & Optoelectronics Progress ◽

10.3788/lop202158.0228001 ◽

2021 ◽

Vol 58 (2) ◽

pp. 0228001

Author(s):

马天浩 Ma Tianhao ◽

谭海 Tan Hai ◽

李天琪 Li Tianqi ◽

吴雅男 Wu Yanan ◽

刘祺 Liu Qi

Keyword(s):

Remote Sensing ◽

Feature Fusion ◽

Road Extraction ◽

Remote Sensing Images ◽

Residual Network ◽

Scale Feature ◽

Multi Scale ◽

Dilated Convolution

Download Full-text