Multi-Temporal Unmanned Aerial Vehicle Remote Sensing for Vegetable Mapping Using an Attention-Based Recurrent Convolutional Neural Network

Remote Sensing ◽ 
2020 ◽ 
Vol 12 (10) ◽  
pp. 1668 ◽  
Author(s):  
Quanlong Feng ◽  
Jianyu Yang ◽  
Yiming Liu ◽  
Cong Ou ◽  
Dehai Zhu ◽  
...  

Vegetable mapping from remote sensing imagery is important for precision agricultural activities such as automated pesticide spraying. Multi-temporal unmanned aerial vehicle (UAV) data offer both very high spatial resolution and useful phenological information, showing great potential for accurate vegetable classification, especially in complex and fragmented agricultural landscapes. In this study, an attention-based recurrent convolutional neural network (ARCNN) is proposed for accurate vegetable mapping from multi-temporal UAV red-green-blue (RGB) imagery. The proposed model first uses a multi-scale deformable CNN to learn and extract rich spatial features from the UAV data. The extracted features are then fed into an attention-based recurrent neural network (RNN), which establishes the sequential dependency between multi-temporal features. Finally, the aggregated spatial-temporal features are used to predict the vegetable category. Experimental results show that the proposed ARCNN achieves a high overall accuracy of 92.80%. Compared with mono-temporal classification, incorporating multi-temporal UAV imagery boosts accuracy by 24.49% on average, supporting the hypothesis that the low spectral resolution of RGB imagery can be compensated for by multi-temporal observations. In addition, the attention-based RNN in this study outperforms other feature fusion methods such as feature stacking, and the deformable convolution operation yields higher classification accuracy than a standard convolution unit. The results demonstrate that the ARCNN provides an effective way to extract and aggregate discriminative spatial-temporal features for vegetable mapping from multi-temporal UAV RGB imagery.
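As a rough illustration of the attention-based temporal fusion this abstract describes, the sketch below (PyTorch) scores per-date RNN states with a learned attention layer and sums them into a single feature for classification. All module names, layer sizes, and the GRU choice are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of attention-weighted fusion of multi-temporal CNN
# features, in the spirit of the ARCNN's attention-based RNN stage.
import torch
import torch.nn as nn

class TemporalAttentionFusion(nn.Module):
    def __init__(self, feat_dim=256, hidden_dim=128, n_classes=10):
        super().__init__()
        self.rnn = nn.GRU(feat_dim, hidden_dim, batch_first=True)
        self.attn = nn.Linear(hidden_dim, 1)       # scores each time step
        self.head = nn.Linear(hidden_dim, n_classes)

    def forward(self, x):                          # x: (batch, T, feat_dim)
        h, _ = self.rnn(x)                         # (batch, T, hidden_dim)
        w = torch.softmax(self.attn(h), dim=1)     # (batch, T, 1) weights
        fused = (w * h).sum(dim=1)                 # attention-weighted sum
        return self.head(fused)                    # class logits

feats = torch.randn(4, 5, 256)                     # 4 samples, 5 UAV dates
logits = TemporalAttentionFusion()(feats)          # -> shape (4, 10)
```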

Sensors ◽  
2019 ◽  
Vol 19 (19) ◽  
pp. 4115 ◽  
Author(s):  
Yuxia Li ◽  
Bo Peng ◽  
Lei He ◽  
Kunlong Fan ◽  
Zhenxu Li ◽  
...  

Roads are vital components of infrastructure, and their extraction has become a topic of significant interest in remote sensing. Because deep learning is a popular approach to image processing and information extraction, researchers have increasingly turned to neural networks for road extraction. This article proposes improvements to neural networks for extracting roads from Unmanned Aerial Vehicle (UAV) remote sensing images. D-LinkNet was first considered for its high performance; however, the huge scale of the network reduces computational efficiency. Focusing on this problem, this article makes the following improvements: (1) replace the initial block with a stem block; (2) rebuild the entire network from ResNet units with a new structure, yielding the improved network D-LinkNetPlus; (3) add a 1 × 1 convolution layer before the DBlock to reduce the number of input feature maps, cutting parameters and improving computational efficiency, and another 1 × 1 convolution layer after the DBlock to restore the required number of output channels, yielding a further improved network, B-D-LinkNetPlus (see the sketch below). The networks were compared and verified on the Massachusetts Roads Dataset. The results show that the improved networks reduce network size while delivering the precision needed for road extraction.
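Improvement (3) is essentially a channel-bottleneck pattern. The sketch below (PyTorch) shows the idea under assumed channel counts, with a simple stand-in for D-LinkNet's cascaded dilated DBlock; it is an illustration of the pattern, not the paper's network.

```python
# Sketch of the 1x1 bottleneck idea: squeeze channels before a dilated
# block and restore them after, which cuts parameters and computation.
import torch
import torch.nn as nn

class BottleneckedDBlock(nn.Module):
    def __init__(self, channels=512, squeezed=128):
        super().__init__()
        self.squeeze = nn.Conv2d(channels, squeezed, kernel_size=1)
        # stand-in for D-LinkNet's cascaded dilated convolutions (DBlock)
        self.dblock = nn.Sequential(
            nn.Conv2d(squeezed, squeezed, 3, padding=1, dilation=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(squeezed, squeezed, 3, padding=2, dilation=2),
            nn.ReLU(inplace=True),
        )
        self.restore = nn.Conv2d(squeezed, channels, kernel_size=1)

    def forward(self, x):
        return self.restore(self.dblock(self.squeeze(x)))

x = torch.randn(1, 512, 32, 32)
print(BottleneckedDBlock()(x).shape)   # torch.Size([1, 512, 32, 32])
```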


Sensors ◽  
2020 ◽  
Vol 20 (7) ◽  
pp. 1999 ◽  
Author(s):  
Donghang Yu ◽  
Qing Xu ◽  
Haitao Guo ◽  
Chuan Zhao ◽  
Yuzhun Lin ◽  
...  

Classifying remote sensing images is vital for interpreting image content. Present remote sensing image scene classification methods based on convolutional neural networks have drawbacks, including excessive parameters and heavy computational cost; more efficient and lightweight CNNs have fewer parameters and computations, but their classification performance is generally weaker. We propose a more efficient and lightweight convolutional neural network method that improves classification accuracy with a small training dataset. Inspired by fine-grained visual recognition, this study introduces a bilinear convolutional neural network model for scene classification. First, the lightweight network MobileNetV2 is used to extract deep, abstract image features. The features are then transformed into two feature sets by two different convolutional layers, and the transformed features are combined by a Hadamard product to obtain an enhanced bilinear feature. Finally, the bilinear feature, after pooling and normalization, is used for classification. Experiments are performed on three widely used datasets: UC Merced, AID, and NWPU-RESISC45. Compared with other state-of-the-art methods, the proposed method has fewer parameters and computations while achieving higher accuracy. Feature fusion with bilinear pooling can greatly improve performance and accuracy for remote sensing scene classification, and the approach could be applied to any remote sensing image classification task.
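The described bilinear step maps naturally to a few lines of PyTorch. The sketch below is one minimal reading of the abstract (two 1 × 1 projections, Hadamard product, pooling, signed square root, and L2 normalization); the dimensions and normalization details are assumptions for illustration, not the paper's exact configuration.

```python
# Sketch of the Hadamard-product bilinear head on a backbone feature map.
import torch
import torch.nn as nn
import torch.nn.functional as F

class HadamardBilinearHead(nn.Module):
    def __init__(self, in_ch=1280, proj_ch=512, n_classes=45):
        super().__init__()
        self.proj_a = nn.Conv2d(in_ch, proj_ch, 1)   # first transform
        self.proj_b = nn.Conv2d(in_ch, proj_ch, 1)   # second transform
        self.fc = nn.Linear(proj_ch, n_classes)

    def forward(self, fmap):                          # (B, in_ch, H, W)
        z = self.proj_a(fmap) * self.proj_b(fmap)     # Hadamard product
        z = z.mean(dim=(2, 3))                        # global average pool
        z = torch.sign(z) * torch.sqrt(z.abs() + 1e-8)  # signed sqrt
        z = F.normalize(z, dim=1)                     # L2 normalization
        return self.fc(z)

fmap = torch.randn(2, 1280, 7, 7)   # e.g., MobileNetV2's final feature map
print(HadamardBilinearHead()(fmap).shape)   # torch.Size([2, 45])
```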


Author(s):  
W. Yuan ◽  
X. Yuan ◽  
Z. Fan ◽  
Z. Guo ◽  
X. Shi ◽  
...  

Abstract. Building Change Detection (BCD) from multi-temporal remote sensing images is essential for applications such as urban monitoring, urban planning, and disaster assessment. However, most building change detection approaches extract features only from the images themselves for change index determination, and therefore cannot detect subtle changes in small buildings. Given co-registered multi-temporal remote sensing images, illumination variations and misregistration errors often lead to inaccurate change detection results. This study investigates multi-feature fusion that combines 2D features extracted directly from the remote sensing images with 3D features extracted from a point cloud generated by dense image matching (DIM), for accurate building change index generation. The paper introduces a graph neural network (GNN) based end-to-end learning framework for building change detection, comprising feature extraction, feature fusion, and change index prediction. It starts from a pre-trained VGG-16 backbone and uses a five-layer U-Net architecture for feature map extraction. The extracted 2D and 3D features are fed into the GNN-based feature fusion module, which uses a flexible attention-based context aggregation mechanism to address illumination variations and misregistration errors, enabling the framework to reason jointly about image-based texture information and the depth information introduced by the DIM-generated 3D point cloud. The affinity matrix produced by the GNN is then used for change index determination via the Hungarian algorithm. An experiment on a dataset covering the Setagaya-ku area of Tokyo shows that the change map generated by the proposed method achieves a precision of 0.762 and an F1-score of 0.68 at the pixel level. Compared with traditional image-based change detection methods, the approach learns a prior over geometric structure from the real 3D world, making it robust to misregistration errors. Compared with CNN-based methods, it learns to fuse 2D and 3D features into a more comprehensive representation for building change index determination. The experimental comparison demonstrates that the proposed approach outperforms both traditional and CNN-based methods.
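The final assignment step can be illustrated with SciPy's Hungarian solver. In the sketch below a random matrix stands in for the GNN's affinity output, and the change threshold is a hypothetical choice for illustration, not a value from the paper.

```python
# Sketch: optimal matching over an affinity matrix between features of the
# two dates; low-affinity matched pairs can then be flagged as changed.
import numpy as np
from scipy.optimize import linear_sum_assignment

affinity = np.random.rand(6, 6)              # stand-in for GNN affinity output
row, col = linear_sum_assignment(-affinity)  # negate to maximize total affinity
changed = affinity[row, col] < 0.5           # hypothetical change threshold
for r, c, ch in zip(row, col, changed):
    print(f"t1 node {r} <-> t2 node {c}: {'changed' if ch else 'unchanged'}")
```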


Author(s):  
MUHAMMAD EFAN ABDULFATTAH ◽  
LEDYA NOVAMIZANTI ◽  
SYAMSUL RIZAL

Disasters in Indonesia are dominated by hydrometeorological disasters, which cause large-scale damage. Mapping enables comprehensive handling to support analysis and subsequent action, and an Unmanned Aerial Vehicle (UAV) can serve as an aerial mapping tool. However, when the camera or image processing hardware does not meet specifications, the results are less informative. This research proposes super resolution for aerial imagery based on a Convolutional Neural Network (CNN) using the DCSCN model. The model consists of a Feature Extraction Network for extracting image features and a Reconstruction Network for reconstructing the image. DCSCN's performance is compared with the Super Resolution CNN (SRCNN). Experiments were carried out on the Set5 dataset with scale factors of 2, 3, and 4, for which SRCNN produced PSNR/SSIM values of 36.66 dB/0.9542, 32.75 dB/0.9090, and 30.49 dB/0.8628, respectively; DCSCN improved these to 37.614 dB/0.9588, 33.86 dB/0.9225, and 31.48 dB/0.8851.
Keywords: aerial imagery, deep learning, super resolution
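The PSNR figures quoted above follow the standard definition PSNR = 10·log10(MAX²/MSE). The sketch below computes it on toy images; this is the standard formula, not code from the paper.

```python
# Peak signal-to-noise ratio (in dB) between a reference image and a
# reconstruction; higher values indicate a more faithful reconstruction.
import numpy as np

def psnr(reference, reconstructed, max_val=255.0):
    mse = np.mean((reference.astype(np.float64)
                   - reconstructed.astype(np.float64)) ** 2)
    return 10.0 * np.log10(max_val ** 2 / mse)

hr = np.random.randint(0, 256, (64, 64), dtype=np.uint8)   # toy ground truth
sr = np.clip(hr + np.random.normal(0, 3, hr.shape), 0, 255).astype(np.uint8)
print(f"PSNR: {psnr(hr, sr):.2f} dB")
```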

