A Reconfigurable Convolutional Neural Network-Accelerated Coprocessor Based on RISC-V Instruction Set

Ning Wu; Tao Jiang; Lei Zhang; Fang Zhou; Fen Ge

doi:10.3390/electronics9061005

Compact Convolutional Neural Network Accelerator for IoT Endpoint SoC

Electronics ◽

10.3390/electronics8050497 ◽

2019 ◽

Vol 8 (5) ◽

pp. 497 ◽

Cited By ~ 5

Author(s):

Fen Ge ◽

Ning Wu ◽

Hao Xiao ◽

Yuanyuan Zhang ◽

Fang Zhou

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Functional Modules ◽

Computational Power ◽

System A ◽

Unit Design ◽

Artificial Intelligence Algorithm ◽

And Performance ◽

On Chip ◽

The Internet Of Things

As a classical artificial intelligence algorithm, the convolutional neural network (CNN) algorithm plays an important role in image recognition and classification and is gradually being applied in the Internet of Things (IoT) system. A compact CNN accelerator for the IoT endpoint System-on-Chip (SoC) is proposed in this paper to meet the needs of CNN computations. Based on analysis of the CNN structure, basic functional modules of CNN such as convolution circuit and pooling circuit with a low data bandwidth and a smaller area are designed, and an accelerator is constructed in the form of four acceleration chains. After the acceleration unit design is completed, the Cortex-M3 is used to construct a verification SoC and the designed verification platform is implemented on the FPGA to evaluate the resource consumption and performance analysis of the CNN accelerator. The CNN accelerator achieved a throughput of 6.54 GOPS (giga operations per second) by consuming 4901 LUTs without using any hardware multipliers. The comparison shows that the compact accelerator proposed in this paper makes the CNN computational power of the SoC based on the Cortex-M3 kernel two times higher than the quad-core Cortex-A7 SoC and 67% of the computational power of eight-core Cortex-A53 SoC.

Download Full-text

Natural Disasters Intensity Analysis and Classification Based on Multispectral Images Using Multi-Layered Deep Convolutional Neural Network

Sensors ◽

10.3390/s21082648 ◽

2021 ◽

Vol 21 (8) ◽

pp. 2648

Author(s):

Muhammad Aamir ◽

Tariq Ali ◽

Muhammad Irfan ◽

Ahmad Shaf ◽

Muhammad Zeeshan Azam ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Natural Disasters ◽

Deep Convolutional Neural Network ◽

Multispectral Images ◽

Learning Techniques ◽

Proposed Model ◽

Disaster Intensity ◽

And Performance

Natural disasters not only disturb the human ecological system but also destroy the properties and critical infrastructures of human societies and even lead to permanent change in the ecosystem. Disaster can be caused by naturally occurring events such as earthquakes, cyclones, floods, and wildfires. Many deep learning techniques have been applied by various researchers to detect and classify natural disasters to overcome losses in ecosystems, but detection of natural disasters still faces issues due to the complex and imbalanced structures of images. To tackle this problem, we propose a multilayered deep convolutional neural network. The proposed model works in two blocks: Block-I convolutional neural network (B-I CNN), for detection and occurrence of disasters, and Block-II convolutional neural network (B-II CNN), for classification of natural disaster intensity types with different filters and parameters. The model is tested on 4428 natural images and performance is calculated and expressed as different statistical values: sensitivity (SE), 97.54%; specificity (SP), 98.22%; accuracy rate (AR), 99.92%; precision (PRE), 97.79%; and F1-score (F1), 97.97%. The overall accuracy for the whole model is 99.92%, which is competitive and comparable with state-of-the-art algorithms.

Download Full-text

Optimizing Convolutional Neural Network Accelerator on Low-Cost FPGA

Journal of Circuits System and Computers ◽

10.1142/s0218126621501930 ◽

2021 ◽

pp. 2150193

Author(s):

Truong Quang Vinh ◽

Dinh Viet Hai

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Low Cost ◽

Optimal Number ◽

Data Reuse ◽

Logic Element ◽

Input Buffer ◽

Classification Tasks ◽

Processing Engine ◽

Better Than

Convolutional neural network (CNN) is one of the most promising algorithms that outweighs other traditional methods in terms of accuracy in classification tasks. However, several CNNs, such as VGG, demand a huge computation in convolutional layers. Many accelerators implemented on powerful FPGAs have been introduced to address the problems. In this paper, we present a VGG-based accelerator which is optimized for a low-cost FPGA. In order to optimize the FPGA resource of logic element and memory, we propose a dedicated input buffer that maximizes the data reuse. In addition, we design a low resource processing engine with the optimal number of Multiply Accumulate (MAC) units. In the experiments, we use VGG16 model for inference to evaluate the performance of our accelerator and achieve a throughput of 38.8[Formula: see text]GOPS at a clock speed of 150[Formula: see text]MHz on Intel Cyclone V SX SoC. The experimental results show that our design is better than previous works in terms of resource efficiency.

Download Full-text

A multi-objective opposition-based barnacles mating optimization for image super resolution using hyper-Spectral images

Journal of Engineering Design and Technology ◽

10.1108/jedt-01-2021-0030 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Valli Bhasha A. ◽

Venkatramana Reddy B.D.

Keyword(s):

Neural Network ◽

Wavelet Transform ◽

Convolutional Neural Network ◽

Super Resolution ◽

Discrete Wavelet ◽

Content Type ◽

Resolution Model ◽

Deep Cnn ◽

Image Super Resolution ◽

Better Than

Purpose The problems of Super resolution are broadly discussed in diverse fields. Rather than the progression toward the super resolution models for real-time images, operating hyperspectral images still remains a challenging problem. Design/methodology/approach This paper aims to develop the enhanced image super-resolution model using “optimized Non-negative Structured Sparse Representation (NSSR), Adaptive Discrete Wavelet Transform (ADWT), and Optimized Deep Convolutional Neural Network”. Once after converting the HR images into LR images, the NSSR images are generated by the optimized NSSR. Then the ADWT is used for generating the subbands of both NSSR and HRSB images. The residual image with this information is obtained by the optimized Deep CNN. All the improvements on the algorithms are done by the Opposition-based Barnacles Mating Optimization (O-BMO), with the objective of attaining the multi-objective function concerning the “Peak Signal-to-Noise Ratio (PSNR), and Structural similarity (SSIM) index”. Extensive analysis on benchmark hyperspectral image datasets shows that the proposed model achieves superior performance over typical other existing super-resolution models. Findings From the analysis, the overall analysis of the suggested and the conventional super resolution models relies that the PSNR of the improved O-BMO-(NSSR+DWT+CNN) was 38.8% better than bicubic, 11% better than NSSR, 16.7% better than DWT+CNN, 1.3% better than NSSR+DWT+CNN, and 0.5% better than NSSR+FF-SHO-(DWT+CNN). Hence, it has been confirmed that the developed O-BMO-(NSSR+DWT+CNN) is performing well in converting LR images to HR images. Originality/value This paper adopts a latest optimization algorithm called O-BMO with optimized Non-negative Structured Sparse Representation (NSSR), Adaptive Discrete Wavelet Transform (ADWT) and Optimized Deep Convolutional Neural Network for developing the enhanced image super-resolution model. This is the first work that uses O-BMO-based Deep CNN for image super-resolution model enhancement.

Download Full-text

Object Detection in Ground-Penetrating Radar Images Using a Deep Convolutional Neural Network and Image Set Preparation by Migration

International Journal of Geophysics ◽

10.1155/2018/9365184 ◽

2018 ◽

Vol 2018 ◽

pp. 1-8 ◽

Cited By ~ 3

Author(s):

Kazuya Ishitsuka ◽

Shinichiro Iso ◽

Kyosuke Onishi ◽

Toshifumi Matsuoka

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Ground Penetrating Radar ◽

Deep Convolutional Neural Network ◽

Training Data ◽

Neural Network Approach ◽

Radar Images ◽

Image Set ◽

Ground Penetrating ◽

Better Than

Ground-penetrating radar allows the acquisition of many images for investigation of the pavement interior and shallow geological structures. Accordingly, an efficient methodology of detecting objects, such as pipes, reinforcing steel bars, and internal voids, in ground-penetrating radar images is an emerging technology. In this paper, we propose using a deep convolutional neural network to detect characteristic hyperbolic signatures from embedded objects. As a first step, we developed a migration-based method to collect many training data and created 53510 categorized images. We then examined the accuracy of the deep convolutional neural network in detecting the signatures. The accuracy of the classification was 0.945 (94.5%)–0.979 (97.9%) when using several thousands of training images and was much better than the accuracy of the conventional neural network approach. Our results demonstrate the effectiveness of the deep convolutional neural network in detecting characteristic events in ground-penetrating radar images.

Download Full-text

Pedestrian detection based on improved LeNet-5 convolutional neural network

Journal of Algorithms & Computational Technology ◽

10.1177/1748302619873601 ◽

2019 ◽

Vol 13 ◽

pp. 174830261987360 ◽

Cited By ~ 2

Author(s):

Chuan-Wei Zhang ◽

Meng-Yue Yang ◽

Hong-Jun Zeng ◽

Jian-Ping Wen

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Real Time ◽

Network Model ◽

Pedestrian Detection ◽

The Real ◽

Better Than

In this article, according to the real-time and accuracy requirements of advanced vehicle-assisted driving in pedestrian detection, an improved LeNet-5 convolutional neural network is proposed. Firstly, the structure of LeNet-5 network model is analyzed, and the structure and parameters of the network are improved and optimized on the basis of this network to get a new LeNet network model, and then it is used to detect pedestrians. Finally, the miss rate of the improved LeNet convolutional neural network is found to be 25% by contrast and analysis. The experiment proves that this method is better than SA-Fast R-CNN and classical LeNet-5 CNN algorithm.

Download Full-text

ROCK-CNN: a Distributed RockPro64-based Convolutional Neural Network Cluster for IoT. Verification and Performance Analysis

2020 26th Conference of Open Innovations Association (FRUCT) ◽

10.23919/fruct48808.2020.9087457 ◽

2020 ◽

Author(s):

Rezeda Khaydarova ◽

Vladislav Fishchenko ◽

Dmitriy Mouromtsev ◽

Vladislav Shmatkov ◽

Maksim Lapaev

Keyword(s):

Neural Network ◽

Performance Analysis ◽

Convolutional Neural Network ◽

Network Cluster ◽

And Performance

Download Full-text

Deep Learning Convolutional Neural Network for the Retrieval of Land Surface Temperature from AMSR2 Data in China

Sensors ◽

10.3390/s19132987 ◽

2019 ◽

Vol 19 (13) ◽

pp. 2987 ◽

Cited By ~ 7

Author(s):

Jiancan Tan ◽

Nusseiba NourEldeen ◽

Kebiao Mao ◽

Jiancheng Shi ◽

Zhaoliang Li ◽

...

Keyword(s):

Neural Network ◽

Surface Temperature ◽

Convolutional Neural Network ◽

Land Surface Temperature ◽

Land Surface ◽

Measurement Data ◽

Retrieval Algorithm ◽

Observation Data ◽

Moderate Resolution Imaging Spectroradiometer ◽

Better Than

A convolutional neural network (CNN) algorithm was developed to retrieve the land surface temperature (LST) from Advanced Microwave Scanning Radiometer 2 (AMSR2) data in China. Reference data were selected using the Moderate Resolution Imaging Spectroradiometer (MODIS) LST product to overcome the problem related to the need for synchronous ground observation data. The AMSR2 brightness temperature (TB) data and MODIS surface temperature data were randomly divided into training and test datasets, and a CNN was constructed to simulate passive microwave radiation transmission to invert the surface temperature. The twelve V/H channel combinations (7.3, 10.65, 18.7, 23.8, 36.5, 89 GHz) resulted in the most stable and accurate CNN retrieval model. Vertical polarizations performed better than horizontal polarizations; however, because CNNs rely heavily on large amounts of data, the combination of vertical and horizontal polarizations performed better than a single polarization. The retrievals in different regions indicated that the CNN accuracy was highest over large bare land areas. A comparison of the retrieval results with ground measurement data from meteorological stations yielded R2 = 0.987, RMSE = 2.69 K, and an average relative error of 2.57 K, which indicated that the accuracy of the CNN LST retrieval algorithm was high and the retrieval results can be applied to long-term LST sequence analysis in China.

Download Full-text

Fast and Efficient Edge Fusing Network Architectures for Accurate Single Image Super-resolution

10.32920/ryerson.14660967 ◽

2021 ◽

Author(s):

Debjoy Chowdhury

Keyword(s):

Neural Network ◽

Receptive Field ◽

Convolutional Neural Network ◽

Network Architecture ◽

Performance Enhancement ◽

Super Resolution ◽

Main Concept ◽

High Scale ◽

Image Super Resolution ◽

And Performance

Recovering a High-Resolution (HR) image from a Low-Resolution (LR) image is the main concept of image Super-Resolution (SR). Convolution Neural Networks (CNN) are becoming widely adopted in many applications including generation of HR images from LR images. Although CNNs are widely used with great performance improvements, there is still much room for improvement. There has always been a trade-off between the number of parameters and performance enhancement. This thesis presents a novel convolutional neural network architecture for high scale image SR inspired by the DenseNet and ResNet architecture. In particular, modifications can be made to the convolutional layers in the network: stacking the features and reusing the weight layers to increase the receptive field. It is shown how this method can be used to expand the receptive field and performance of super-resolution networks, without increasing the number of trainable parameters and sacrificing the computation time. These modifications can easily be integrated into any convolutional neural network to improve the accuracy by efficient high-level feature extraction while reducing training time and parameter numbers. Proposed methods are especially effective for the challenging high scale SR due to edge and texture recovery through the expanded network receptive field. Experimental results show that the proposed model outperforms the state-of-the-art methods.

Download Full-text

Fast and Efficient Edge Fusing Network Architectures for Accurate Single Image Super-resolution

10.32920/ryerson.14660967.v1 ◽

2021 ◽

Author(s):

Debjoy Chowdhury

Keyword(s):

Neural Network ◽

Receptive Field ◽

Convolutional Neural Network ◽

Network Architecture ◽

Performance Enhancement ◽

Super Resolution ◽

Main Concept ◽

High Scale ◽

Image Super Resolution ◽

And Performance

Recovering a High-Resolution (HR) image from a Low-Resolution (LR) image is the main concept of image Super-Resolution (SR). Convolution Neural Networks (CNN) are becoming widely adopted in many applications including generation of HR images from LR images. Although CNNs are widely used with great performance improvements, there is still much room for improvement. There has always been a trade-off between the number of parameters and performance enhancement. This thesis presents a novel convolutional neural network architecture for high scale image SR inspired by the DenseNet and ResNet architecture. In particular, modifications can be made to the convolutional layers in the network: stacking the features and reusing the weight layers to increase the receptive field. It is shown how this method can be used to expand the receptive field and performance of super-resolution networks, without increasing the number of trainable parameters and sacrificing the computation time. These modifications can easily be integrated into any convolutional neural network to improve the accuracy by efficient high-level feature extraction while reducing training time and parameter numbers. Proposed methods are especially effective for the challenging high scale SR due to edge and texture recovery through the expanded network receptive field. Experimental results show that the proposed model outperforms the state-of-the-art methods.

Download Full-text