Integrating Weighted Feature Fusion and the Spatial Attention Module with Convolutional Neural Networks for Automatic Aircraft Detection from SAR Images

2021 ◽  
Vol 13 (5) ◽  
pp. 910
Author(s):  
Jielan Wang ◽  
Hongguang Xiao ◽  
Lifu Chen ◽  
Jin Xing ◽  
Zhouhao Pan ◽  
...  

The automatic detection of aircraft from SAR images is widely applied in both military and civil fields, but considerable challenges remain. To address the high variety of aircraft sizes and the complex background information in SAR images, a new fast detection framework based on convolutional neural networks is proposed, which achieves automatic and rapid detection of aircraft with high accuracy. First, the airport runway areas are detected to generate the airport runway mask and the rectangular contour of the whole airport. Then, a new deep neural network proposed in this paper, named Efficient Weighted Feature Fusion and Attention Network (EWFAN), is used to detect aircraft. EWFAN integrates the weighted feature fusion module, the spatial attention mechanism, and the CIF loss function. It can effectively reduce the interference of negative samples and enhance feature extraction, thereby significantly improving the detection accuracy. Finally, the airport runway mask is applied to the detected results to reduce false alarms and produce the final aircraft detection results. To evaluate the performance of the proposed framework, large-scale Gaofen-3 SAR images with 1 m resolution are utilized in the experiment. The detection rate and false alarm rate of our EWFAN algorithm are 95.4% and 3.3%, respectively, outperforming EfficientDet and YOLOv4. In addition, the average test time of the proposed framework is only 15.40 s, indicating satisfactory efficiency for automatic aircraft detection.
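The abstract does not spell out the weighted feature fusion formula; a common form of such a module (popularized by BiFPN) is fast normalized fusion, sketched here in NumPy under that assumption:

```python
import numpy as np

def weighted_feature_fusion(features, weights, eps=1e-4):
    """Fuse same-shaped feature maps with non-negative learnable weights,
    normalized so the fused map stays on the same scale
    (BiFPN-style fast normalized fusion)."""
    w = np.maximum(np.asarray(weights, dtype=np.float64), 0.0)  # keep weights non-negative
    w = w / (w.sum() + eps)                                     # normalize to sum ~1
    return sum(wi * f for wi, f in zip(w, features))

# Two toy 2x2 "feature maps" fused with weights 3 and 1.
a = np.ones((2, 2))
b = np.zeros((2, 2))
fused = weighted_feature_fusion([a, b], [3.0, 1.0])
```

The normalization keeps activations bounded without the softmax cost, which is why this form is popular in fast detectors.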

2021 ◽  
Author(s):  
Limin Zhang ◽  
Yingjian Liu ◽  
Qingxiang Guo ◽  
Haoyu Yin ◽  
Yue Li ◽  
...  

2021 ◽  
Vol 11 (3) ◽  
pp. 1096
Author(s):  
Qing Li ◽  
Yingcheng Lin ◽  
Wei He

The high requirements for computing and memory are the biggest challenges in deploying existing object detection networks to embedded devices. Existing lightweight object detectors directly use lightweight neural network architectures such as MobileNet or ShuffleNet pre-trained on large-scale classification datasets, which results in poor network structure flexibility and is not suitable for some specific scenarios. In this paper, we propose a lightweight object detection network, SSD7-FFAM, which combines a reduced Single-Shot MultiBox Detector (SSD) with a Feature Fusion and Attention Mechanism (FFAM), saving storage space and reducing the amount of computation by reducing the number of convolutional layers. We propose the novel FFAM method to improve detection accuracy. Firstly, the FFAM method fuses feature maps rich in high-level semantic information with low-level feature maps to improve the detection accuracy of small objects. Secondly, a lightweight attention mechanism that cascades channel and spatial attention modules is employed to enhance the target's contextual information and guide the network to focus on its easily recognizable features. The SSD7-FFAM achieves 83.7% mean Average Precision (mAP) with 1.66 MB of parameters and a 0.033 s average running time on the NWPU VHR-10 dataset. The results indicate that the proposed SSD7-FFAM is well suited for deployment to embedded devices for real-time object detection.
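The exact layers of the cascaded attention mechanism are not given in the abstract; the sketch below illustrates the channel-then-spatial cascade in a parameter-free NumPy form (a real module would use learned convolutions around the pooling steps):

```python
import numpy as np

def channel_attention(x):
    """Squeeze each channel to a scalar (global average pooling) and
    turn it into a per-channel gate in (0, 1)."""
    pooled = x.mean(axis=(1, 2))              # (C,)
    gate = 1.0 / (1.0 + np.exp(-pooled))      # sigmoid gate
    return x * gate[:, None, None]

def spatial_attention(x):
    """Pool across channels to one map and gate every spatial location."""
    pooled = x.mean(axis=0)                   # (H, W)
    gate = 1.0 / (1.0 + np.exp(-pooled))
    return x * gate[None, :, :]

# Cascade: channel attention first, then spatial, as in SSD7-FFAM.
x = np.random.randn(8, 4, 4)
y = spatial_attention(channel_attention(x))
```

Because both gates lie in (0, 1), the cascade can only re-weight features, never amplify them, which keeps the module numerically stable.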


2021 ◽  
Vol 2021 ◽  
pp. 1-19
Author(s):  
Yao Chen ◽  
Tao Duan ◽  
Changyuan Wang ◽  
Yuanyuan Zhang ◽  
Mo Huang

Ship detection in synthetic aperture radar (SAR) imagery has many valuable applications in both civil and military fields and has received extraordinary attention in recent years. Traditional detection methods are insensitive to multiscale ships and are usually time-consuming, resulting in low detection accuracy and limited suitability for real-time processing. To balance accuracy and speed, an end-to-end ship detection method for complex inshore and offshore scenes based on deep convolutional neural networks (CNNs) is proposed in this paper. First, the SAR images are divided into different grids, and the anchor boxes are predefined based on the responsible grids for dense ship prediction. Then, Darknet-53 with residual units is adopted as a backbone to extract features, and a top-down pyramid structure is added for multiscale feature fusion with concatenation. By this means, abundant hierarchical features containing both spatial and semantic information are extracted. Meanwhile, strategies such as soft non-maximum suppression (Soft-NMS), mix-up and mosaic data augmentation, multiscale training, and hybrid optimization are used for performance enhancement. Besides, the model is trained from scratch to avoid the learning objective bias of pretraining. The proposed one-stage method adopts end-to-end inference by a single network, so the detection speed is guaranteed by the concise paradigm. Extensive experiments are performed on the public SAR ship detection dataset (SSDD), and the results show that the method can detect both inshore and offshore ships with higher accuracy than other mainstream methods, yielding an average accuracy of 95.52% and a detection speed of about 72 frames per second (FPS). Actual Sentinel-1 and Gaofen-3 data are utilized for verification, and the detection results also show the effectiveness and robustness of the method.
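Of the listed strategies, Soft-NMS is the most self-contained; a minimal sketch of its Gaussian variant, which decays the scores of overlapping boxes instead of discarding them outright:

```python
import numpy as np

def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def soft_nms(boxes, scores, sigma=0.5, score_thresh=0.001):
    """Gaussian Soft-NMS: repeatedly pick the highest-scoring box and
    decay the scores of the rest by exp(-iou^2 / sigma)."""
    boxes, scores = list(boxes), list(scores)
    keep = []
    while boxes:
        i = int(np.argmax(scores))
        best_box, best_score = boxes.pop(i), scores.pop(i)
        if best_score < score_thresh:
            break
        keep.append((best_box, best_score))
        scores = [s * np.exp(-iou(best_box, b) ** 2 / sigma)
                  for b, s in zip(boxes, scores)]
    return keep

# Two heavily overlapping ship candidates: the second survives with a decayed score.
kept = soft_nms([(0, 0, 10, 10), (1, 1, 11, 11)], [0.9, 0.8])
```

This matters for densely moored inshore ships, where hard NMS would delete true neighbors that overlap the top detection.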


2021 ◽  
Vol 15 ◽  
Author(s):  
Guoqiang Chen ◽  
Bingxin Bai ◽  
Hongpeng Zhou ◽  
Mengchao Liu ◽  
Huailong Yi

Background: The study of facemask detection is of great significance because facemask detection is difficult and the workload is heavy in places with large numbers of people during the COVID-19 outbreak. Objective: The study aims to explore new deep learning networks that can accurately detect facemasks and to improve the network's ability to extract multi-level features and contextual information. In addition, the proposed network effectively avoids interference from mask-like objects and can eventually detect mask wearers in a crowd. Method: A Multi-stage Feature Fusion Block (MFFB) and a Detector Cascade Block (DCB) are proposed and connected to the deep learning network for facemask detection, improving the network's ability to obtain information. The proposed network is a Double Convolutional Neural Network (DCNN), which can fuse mask features and face position information. During facemask detection, the network extracts the feature information of the object and then inputs it into the data fusion layer. Results: The experimental results show that the proposed network can detect masks and faces in complex environments and dense crowds, and the detection accuracy of the network improves effectively. At the same time, the real-time performance of the detection model is excellent. Conclusion: The two branch networks of the DCNN can effectively obtain the feature and position information of facemasks. The network overcomes the disadvantage that a single CNN is susceptible to interference from suspected mask objects. The verification shows that the MFFB and the DCB improve the network's ability to obtain object information, and the proposed DCNN achieves excellent detection performance.
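The abstract does not describe how the DCNN's data fusion layer combines the two branches; one illustrative (hypothetical) scheme is to match mask detections to face boxes by overlap, which suppresses mask-like objects found away from any face:

```python
def overlap(a, b):
    """Intersection-over-union of two (x1, y1, x2, y2) boxes."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union else 0.0

def fuse_branches(face_boxes, mask_boxes, thresh=0.3):
    """Label each detected face as masked only if some mask detection
    overlaps it enough; lone mask-like objects are ignored."""
    return [(f, any(overlap(f, m) > thresh for m in mask_boxes))
            for f in face_boxes]

faces = [(0, 0, 10, 10), (20, 20, 30, 30)]
masks = [(1, 4, 9, 10), (50, 50, 55, 55)]  # the second mask matches no face
result = fuse_branches(faces, masks)
```

All box coordinates and the 0.3 threshold here are made up for illustration; the paper's fusion layer operates on learned features, not final boxes.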


2019 ◽  
Vol 11 (21) ◽  
pp. 2537 ◽  
Author(s):  
Dandan Ma ◽  
Yuan Yuan ◽  
Qi Wang

A hyperspectral image usually covers a large ground scene containing various materials with different spectral properties. When the background information is explored directly using all the image pixels, complex spectral interactions and the inter-/intra-class differences of different samples significantly reduce the accuracy of background evaluation and further affect the detection performance. To address this problem, this paper proposes a novel hyperspectral anomaly detection method based on a separability-aware sample cascade model. By identifying the separability of hyperspectral pixels, background samples are sifted out layer by layer according to their degree of separability from anomalies, which ensures the accuracy and distinctiveness of the background representation. First, as spatial structure is beneficial for recognizing targets, a new spectral-spatial feature extraction technique based on PCA and edge-preserving filtering is used in this work. Second, depending on the separability computed by sparse representation, samples are separated into different sets which effectively and completely reflect the various characteristics of the background across all the cascade layers. Meanwhile, some potential abnormal targets are removed at each selection step to avoid their effects on subsequent layers. Finally, comprehensively taking the complementary properties of all the separability-aware layers into consideration, a simple multilayer anomaly detection strategy is adopted to obtain the final detection map. Extensive experimental results on five real-world hyperspectral images demonstrate our method's superior performance. Compared with seven representative anomaly detection methods, our method improves the average detection accuracy by a clear margin.
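The paper scores separability via sparse representation; as a stand-in, the sketch below uses Mahalanobis distance as the separability score to show the layer-by-layer sifting idea (the layer count and drop fraction are illustrative):

```python
import numpy as np

def sift_background(pixels, n_layers=3, drop_frac=0.1):
    """Layer-by-layer background sifting: at each layer, score every pixel
    by its Mahalanobis distance to the current sample statistics and drop
    the most separable fraction (potential anomalies) before the next layer."""
    samples = np.asarray(pixels, dtype=float)
    for _ in range(n_layers):
        mu = samples.mean(axis=0)
        cov = np.cov(samples.T) + 1e-6 * np.eye(samples.shape[1])
        inv = np.linalg.inv(cov)
        diff = samples - mu
        d = np.einsum('ij,jk,ik->i', diff, inv, diff)  # squared Mahalanobis
        n_keep = max(1, int(round(len(samples) * (1 - drop_frac))))
        samples = samples[np.argsort(d)[:n_keep]]      # keep least separable
    return samples
```

Removing the most separable samples first is what keeps later layers' background statistics from being contaminated by anomalies.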


Crystals ◽  
2019 ◽  
Vol 9 (4) ◽  
pp. 191 ◽  
Author(s):  
Zhuo Cao ◽  
Yabo Dan ◽  
Zheng Xiong ◽  
Chengcheng Niu ◽  
Xiang Li ◽  
...  

Computational prediction of crystal materials properties can help to do large-scale in silico screening. Recent studies of material informatics have focused on expert design of multidimensional interpretable material descriptors/features. However, successes of deep learning such as Convolutional Neural Networks (CNN) in image recognition and speech recognition have demonstrated their automated feature extraction capability to effectively capture the characteristics of the data and achieve superior prediction performance. Here, we propose CNN-OFM-Magpie, a CNN model with OFM (Orbital-field Matrix) and Magpie descriptors to predict the formation energy of 4030 crystal materials by exploiting the complementarity of two-dimensional OFM features and Magpie features. Experiments showed that our method achieves better performance than conventional regression algorithms such as Support Vector Machines and Random Forest. It is also better than CNN models using only the OFM features, the Magpie features, or the basic one-hot encodings. This demonstrates the advantages of CNN and feature fusion for materials property prediction. Finally, we visualized the two-dimensional OFM descriptors and analyzed the features extracted by the CNN to obtain greater understanding of the CNN-OFM model.
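The fusion idea, stripped of the CNN, is to combine the flattened 2D OFM with the Magpie vector into one descriptor; the sketch below pairs that with closed-form ridge regression as a lightweight stand-in for the network head (the real model convolves the 2D OFM first):

```python
import numpy as np

def fuse_descriptors(ofm, magpie):
    """Concatenate a flattened 2D orbital-field matrix with a Magpie
    feature vector into one fused descriptor."""
    return np.concatenate([np.ravel(ofm), np.asarray(magpie)])

def ridge_fit(X, y, lam=1e-3):
    """Closed-form ridge regression: w = (X^T X + lam I)^-1 X^T y."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

# Synthetic check that the regression head recovers a known linear map.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 21))       # 200 fused descriptors of length 21
w_true = rng.normal(size=21)
y = X @ w_true                       # synthetic "formation energies"
w_hat = ridge_fit(X, y, lam=1e-8)
```

All shapes and the synthetic targets are invented for the sketch; a real OFM is a larger square matrix and formation energies come from DFT data.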


Sensors ◽  
2021 ◽  
Vol 21 (21) ◽  
pp. 7279
Author(s):  
Yao Wang ◽  
Peizhi Yu

The efficiency and effectiveness of railway intrusion detection are crucial to the safety of railway transportation. Most current methods of railway intrusion detection or obstacle detection are inappropriate for large-scale applications due to their high cost or limited coverage. In this study, we present a fast and low-cost solution for intrusion detection on high-speed railways. As a solution to the heavy computational burden of current convolutional-neural-network-based detection methods, the proposed method is a novel neural network based on the SSD framework, which includes a feature extractor using an improved MobileNet and a lightweight, efficient feature fusion module. In addition, to improve the detection accuracy of small objects, feature map weights are introduced through a convolution operation to fuse features at different scales. TensorRT is employed to optimize and deploy the proposed network on a low-cost embedded GPU platform, the NVIDIA Jetson TX2, to enhance efficiency. The experimental results show that the proposed method achieved 89% mAP on the railway intrusion detection dataset, and the average processing time for a single frame was 38.6 ms on the Jetson TX2 module, which satisfies the need for real-time processing.
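The parameter savings of an improved MobileNet backbone come from MobileNet's core building block, the depthwise separable convolution; a naive NumPy sketch of that op (explicit loops kept for clarity, not speed):

```python
import numpy as np

def depthwise_separable_conv(x, dw_kernels, pw_weights):
    """MobileNet's core op: a per-channel (depthwise) 3x3 convolution with
    zero padding, followed by a 1x1 (pointwise) convolution mixing channels.
    x: (C, H, W); dw_kernels: (C, 3, 3); pw_weights: (C_out, C)."""
    C, H, W = x.shape
    pad = np.pad(x, ((0, 0), (1, 1), (1, 1)))
    dw = np.zeros_like(x)
    for c in range(C):                       # depthwise: each channel alone
        for i in range(H):
            for j in range(W):
                dw[c, i, j] = (pad[c, i:i+3, j:j+3] * dw_kernels[c]).sum()
    return np.tensordot(pw_weights, dw, axes=([1], [0]))  # pointwise 1x1

# Identity depthwise kernels + identity pointwise weights reproduce the input.
C = 3
x = np.arange(C * 4 * 4, dtype=float).reshape(C, 4, 4)
ident = np.zeros((C, 3, 3))
ident[:, 1, 1] = 1.0
out = depthwise_separable_conv(x, ident, np.eye(C))
```

Splitting a standard convolution this way cuts parameters from C_out*C*9 to C*9 + C_out*C, which is the source of MobileNet's efficiency.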


2021 ◽  
Vol 13 (4) ◽  
pp. 683
Author(s):  
Lang Huyan ◽  
Yunpeng Bai ◽  
Ying Li ◽  
Dongmei Jiang ◽  
Yanning Zhang ◽  
...  

Onboard real-time object detection in remote sensing images is a crucial but challenging task in computation-constrained scenarios. The task not only requires the algorithm to yield excellent performance but also demands limited time and space complexity. However, previous convolutional neural network (CNN) based object detectors for remote sensing images suffer from heavy computational cost, which hinders them from being deployed on satellites. Moreover, an onboard detector is expected to detect objects at vastly different scales. To address these issues, we propose a lightweight one-stage multi-scale feature fusion detector called MSF-SNET for onboard real-time object detection in remote sensing images. Using the lightweight SNET as the backbone network reduces the number of parameters and the computational complexity. To strengthen the detection of small objects, three low-level features are extracted from the three stages of SNET, respectively. In the detection part, another three convolutional layers are designed to further extract deep features with rich semantic information for large-scale object detection. To improve detection accuracy, the deep features and low-level features are fused to enhance the feature representation. Extensive experiments and comprehensive evaluations on the openly available NWPU VHR-10 and DIOR datasets are conducted to evaluate the proposed method. Compared with other state-of-the-art detectors, the proposed detection framework has fewer parameters and calculations while maintaining consistent accuracy.
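Fusing deep and low-level features typically means upsampling the coarse, semantically rich map to the fine map's resolution and combining the two; a minimal NumPy sketch of that top-down step (the paper's exact fusion operator is not specified in the abstract, so elementwise addition is assumed here):

```python
import numpy as np

def upsample_nearest(x, factor):
    """Nearest-neighbor upsampling of a (C, H, W) feature map."""
    return x.repeat(factor, axis=1).repeat(factor, axis=2)

def fuse_scales(deep, shallow):
    """Top-down fusion: bring the semantically rich deep map up to the
    shallow map's resolution, then add, so small objects keep both
    fine spatial detail and semantics."""
    up = upsample_nearest(deep, shallow.shape[1] // deep.shape[1])
    return shallow + up

# A 2x2 deep map fused into a 4x4 shallow map.
deep = 2 * np.ones((1, 2, 2))
shallow = np.ones((1, 4, 4))
fused = fuse_scales(deep, shallow)
```

In a real detector the addition is usually preceded by 1x1 convolutions to align channel counts, omitted here for brevity.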


2021 ◽  
Vol 13 (15) ◽  
pp. 2940
Author(s):  
Ru Luo ◽  
Lifu Chen ◽  
Jin Xing ◽  
Zhihui Yuan ◽  
Siyu Tan ◽  
...  

In aircraft detection from synthetic aperture radar (SAR) images, there are several major challenges: the shattered features of the aircraft, size heterogeneity, and the interference of a complex background. To address these problems, an Efficient Bidirectional Path Aggregation Attention Network (EBPA2N) is proposed. In EBPA2N, YOLOv5s is used as the base network, and the Involution Enhanced Path Aggregation (IEPA) module and the Effective Residual Shuffle Attention (ERSA) module are proposed and systematically integrated to improve the detection accuracy of aircraft. The IEPA module aims to effectively extract advanced semantic and spatial information to better capture the multi-scale scattering features of aircraft. The lightweight ERSA module then further enhances the extracted features to overcome the interference of complex background and speckle noise, so as to reduce false alarms. To verify the effectiveness of the proposed network, Gaofen-3 airport SAR data with 1 m resolution are utilized in the experiment. The detection rate and false alarm rate of our EBPA2N algorithm are 93.05% and 4.49%, respectively, outperforming the recent EfficientDet-D0 and YOLOv5s networks while also holding an advantage in detection speed.
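The abstract does not detail the ERSA module's internals, but shuffle attention builds on ShuffleNet's channel shuffle operation, which can be sketched as:

```python
import numpy as np

def channel_shuffle(x, groups):
    """ShuffleNet-style channel shuffle: view the (C, H, W) channels as
    (groups, C // groups), transpose the two group axes, and flatten back,
    so information mixes across groups at zero parameter cost."""
    C, H, W = x.shape
    return (x.reshape(groups, C // groups, H, W)
             .transpose(1, 0, 2, 3)
             .reshape(C, H, W))

# Channels tagged with their index: [0, 1, 2, 3] shuffles to [0, 2, 1, 3].
x = np.arange(4, dtype=float).reshape(4, 1, 1) * np.ones((4, 2, 2))
y = channel_shuffle(x, groups=2)
```

The shuffle is a pure permutation, which is what keeps attention modules built on it lightweight.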


2021 ◽  
Vol 2021 ◽  
pp. 1-14
Author(s):  
Liming Zhou ◽  
Haoxin Yan ◽  
Chang Zheng ◽  
Xiaohan Rao ◽  
Yahui Li ◽  
...  

Aircraft, as one of the indispensable means of transport, plays an important role in military activities, so locating aircraft in remote sensing images is a significant task. However, current object detection methods suffer from low detection accuracy and a high rate of missed detections when applied to aircraft detection in remote sensing images. To address these problems, an object detection method for remote sensing images based on bidirectional and dense feature fusion is proposed to detect aircraft targets in sophisticated environments. Built on the YOLOv3 detection framework, this method adds a feature fusion module that enriches the details of the feature map by mixing shallow features with deep features. Experimental results on the RSOD-DataSet and NWPU-DataSet indicate that the new method improves detection accuracy and reduces missed detections. Meanwhile, the AP for aircraft increases by 1.57% compared with YOLOv3.
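The reported AP (average precision) gain can be made concrete with a minimal sketch of how AP is computed over a ranked list of detections (IoU matching of detections to ground truth is assumed to have already produced the 0/1 labels):

```python
def average_precision(scores, labels):
    """AP over a ranked detection list: sort by confidence and average the
    precision observed at each true positive.
    labels: 1 for a correct detection, 0 for a false positive."""
    ranked = sorted(zip(scores, labels), reverse=True)
    tp = fp = 0
    total_pos = sum(labels)
    ap = 0.0
    for _, is_tp in ranked:
        if is_tp:
            tp += 1
            ap += tp / (tp + fp)   # precision at this recall point
        else:
            fp += 1
    return ap / total_pos if total_pos else 0.0

# Three detections: hit, miss, hit -> precisions 1.0 and 2/3, AP = 5/6.
ap = average_precision([0.9, 0.8, 0.7], [1, 0, 1])
```

Benchmark AP additionally interpolates the precision-recall curve and averages over classes and IoU thresholds; this sketch shows only the core ranking logic.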

