A Lightweight Pedestrian Detection Engine with Two-Stage Low-Complexity Detection Network and Adaptive Region Focusing Technique

Luying Que; Teng Zhang; Hongtao Guo; Conghan Jia; Yuchuan Gong; Liang Chang; Jun Zhou

doi:10.3390/s21175851

A Lightweight Pedestrian Detection Engine with Two-Stage Low-Complexity Detection Network and Adaptive Region Focusing Technique

Sensors ◽

10.3390/s21175851 ◽

2021 ◽

Vol 21 (17) ◽

pp. 5851

Author(s):

Luying Que ◽

Teng Zhang ◽

Hongtao Guo ◽

Conghan Jia ◽

Yuchuan Gong ◽

...

Keyword(s):

Deep Learning ◽

Computational Complexity ◽

Video Surveillance ◽

Mobile Applications ◽

Pedestrian Detection ◽

Low Complexity ◽

Detection Accuracy ◽

Two Stage ◽

Intelligent Robots ◽

Xilinx Fpga

Pedestrian detection has been widely used in applications such as video surveillance and intelligent robots. Recently, deep learning-based pedestrian detection engines have attracted lots of attention. However, the computational complexity of these engines is high, which makes them unsuitable for hardware- and power-constrained mobile applications, such as drones for surveillance. In this paper, we propose a lightweight pedestrian detection engine with a two-stage low-complexity detection network and adaptive region focusing technique, to reduce the computational complexity in pedestrian detection, while maintaining sufficient detection accuracy. The proposed pedestrian detection engine has significantly reduced the number of parameters (0.73 M) and operations (1.04 B), while achieving a comparable precision (85.18%) and miss rate (25.16%) to many existing designs. Moreover, the proposed engine, together with YOLOv3 and YOLOv3-Tiny, has been implemented on a Xilinx FPGA Zynq7020 for comparison. It is able to achieve 16.3 Fps while consuming 0.59 W, which outperforms the results of YOLOv3 (5.3 Fps, 2.43 W) and YOLOv3-Tiny (12.8 Fps, 0.95 W).

Download Full-text

A Review of Intelligent Driving Pedestrian Detection Based on Deep Learning

Computational Intelligence and Neuroscience ◽

10.1155/2021/5410049 ◽

2021 ◽

Vol 2021 ◽

pp. 1-16

Author(s):

Di Tian ◽

Yi Han ◽

Biyao Wang ◽

Tian Guan ◽

Wei Wei

Keyword(s):

Deep Learning ◽

Object Detection ◽

Rapid Development ◽

Pedestrian Detection ◽

Evaluation Criteria ◽

Human Perception ◽

Detection Accuracy ◽

Learning Stage ◽

Detection Technology ◽

General Object

Pedestrian detection is a specific application of object detection. Compared with general object detection, it shows similarities and unique characteristics. In addition, it has important application value in the fields of intelligent driving and security monitoring. In recent years, with the rapid development of deep learning, pedestrian detection technology has also made great progress. However, there still exists a huge gap between it and human perception. Meanwhile, there are still a lot of problems, and there remains a lot of room for research. Regarding the application of pedestrian detection in intelligent driving technology, it is of necessity to ensure its real-time performance. Additionally, it is necessary to lighten the model while ensuring detection accuracy. This paper first briefly describes the development process of pedestrian detection and then concentrates on summarizing the research results of pedestrian detection technology in the deep learning stage. Subsequently, by summarizing the pedestrian detection dataset and evaluation criteria, the core issues of the current development of pedestrian detection are analyzed. Finally, the next possible development direction of pedestrian detection technology is explained at the end of the paper.

Download Full-text

A Deep Learning-Aided Detection Method for FTN-Based NOMA

Wireless Communications and Mobile Computing ◽

10.1155/2020/5684851 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11

Author(s):

Jianxiong Pan ◽

Neng Ye ◽

Aihua Wang ◽

Xiangming Li

Keyword(s):

Deep Learning ◽

Computational Complexity ◽

Smart City ◽

Mean Squared Error ◽

Detection Method ◽

High Reliability ◽

Detection Methods ◽

Low Latency ◽

Detection Accuracy ◽

Minimum Mean Squared Error

The rapid booming of future smart city applications and Internet of things (IoT) has raised higher demands on the next-generation radio access technologies with respect to connection density, spectral efficiency (SE), transmission accuracy, and detection latency. Recently, faster-than-Nyquist (FTN) and nonorthogonal multiple access (NOMA) have been regarded as promising technologies to achieve higher SE and massive connections, respectively. In this paper, we aim to exploit the joint benefits of FTN and NOMA by superimposing multiple FTN-based transmission signals on the same physical recourses. Considering the complicated intra- and interuser interferences introduced by the proposed transmission scheme, the conventional detection methods suffer from high computational complexity. To this end, we develop a novel sliding-window detection method by incorporating the state-of-the-art deep learning (DL) technology. The data-driven offline training is first applied to derive a near-optimal receiver for FTN-based NOMA, which is deployed online to achieve high detection accuracy as well as low latency. Monte Carlo simulation results validate that the proposed detector achieves higher detection accuracy than minimum mean squared error-frequency domain equalization (MMSE-FDE) and can even approach the performance of the maximum likelihood-based receiver with greatly reduced computational complexity, which is suitable for IoT applications in smart city with low latency and high reliability requirements.

Download Full-text

Two-stage pyramidal convolutional neural networks for image colorization

APSIPA Transactions on Signal and Information Processing ◽

10.1017/atsip.2021.13 ◽

2021 ◽

Vol 10 ◽

Author(s):

Yu-Jen Wei ◽

Tsu-Tsai Wei ◽

Tien-Ying Kuo ◽

Po-Chyi Su

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Computational Complexity ◽

Color Space ◽

Research Trend ◽

Two Stage ◽

Map Generation ◽

Generation Network ◽

Chroma Components

The development of colorization algorithms through deep learning has become the current research trend. These algorithms colorize grayscale images automatically and quickly, but the colors produced are usually subdued and have low saturation. This research addresses this issue of existing algorithms by presenting a two-stage convolutional neural network (CNN) structure with the first and second stages being a chroma map generation network and a refinement network, respectively. To begin, we convert the color space of an image from RGB to HSV to predict its low-resolution chroma components and therefore reduce the computational complexity. Following that, the first-stage output is zoomed in and its detail is enhanced with a pyramidal CNN, resulting in a colorized image. Experiments show that, while using fewer parameters, our methodology produces results with more realistic color and higher saturation than existing methods.

Download Full-text

Real-Time Pedestrian Detection Approach with an Efficient Data Communication Bandwidth Strategy

Transportation Research Record Journal of the Transportation Research Board ◽

10.1177/0361198119843255 ◽

2019 ◽

Vol 2673 (6) ◽

pp. 129-139 ◽

Cited By ~ 3

Author(s):

Mizanur Rahman ◽

Mhafuzul Islam ◽

Jon Calhoun ◽

Mashrur Chowdhury

Keyword(s):

Deep Learning ◽

Real Time ◽

Pedestrian Detection ◽

Data Communication ◽

Signalized Intersections ◽

Detection Accuracy ◽

Signal To Noise ◽

Detection Strategy ◽

Efficient Data ◽

Computing Infrastructure

Vehicle-to-pedestrian communication could significantly improve pedestrian safety at signalized intersections. However, it is unlikely that pedestrians will typically be carrying a low latency communication-enabled device with an activated pedestrian safety application in their hand-held device all the time. Because of this, multiple traffic cameras at a signalized intersection could be used to accurately detect and locate pedestrians using deep learning, and broadcast safety alerts related to pedestrians to warn connected and automated vehicles around signalized intersections. However, the unavailability of high-performance roadside computing infrastructure and the limited network bandwidth between traffic cameras and the computing infrastructure limits the ability of real-time data streaming and processing for pedestrian detection. In this paper, we describe an edge computing-based real-time pedestrian detection strategy that combines a pedestrian detection algorithm using deep learning and an efficient data communication approach to reduce bandwidth requirements while maintaining high pedestrian detection accuracy. We utilize a lossy compression technique on traffic camera data to determine the tradeoff between the reduction of the communication bandwidth requirements and a defined pedestrian detection accuracy. The performance of the pedestrian detection strategy is measured in relation to pedestrian classification accuracy with varying peak signal-to-noise ratios. The analyses reveal that we detect pedestrians by maintaining a defined detection accuracy with a peak signal-to-noise ratio 43 dB while reducing the communication bandwidth from 9.82 Mbits/sec to 0.31 Mbits/sec, a 31× reduction.

Download Full-text

A Low-Complexity Pedestrian Detection Framework for Smart Video Surveillance Systems

IEEE Transactions on Circuits and Systems for Video Technology ◽

10.1109/tcsvt.2016.2581660 ◽

2017 ◽

Vol 27 (10) ◽

pp. 2260-2273 ◽

Cited By ~ 32

Author(s):

Muhammad Bilal ◽

Asim Khan ◽

Muhammad Umar Karim Khan ◽

Chong-Min Kyung

Keyword(s):

Video Surveillance ◽

Pedestrian Detection ◽

Low Complexity ◽

Surveillance Systems

Download Full-text

An Efficient Road Surveillance Approach to Detect, Recognize & Tracking Vehicles Using Deep Learning Methods

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit2174106 ◽

2021 ◽

pp. 503-512

Author(s):

Vinod Kumar Yadav ◽

Dr. Pritaj Yadav ◽

Dr. Shailja Sharma

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Vehicle Detection ◽

Machine Learning Algorithms ◽

Detection Methods ◽

Motor Vehicles ◽

Detection Accuracy ◽

Two Stage ◽

One Stage ◽

Traffic Regulation

In the current scenario on the increasing number of motor vehicles day by day, so traffic regulation faces many challenges on intelligent road surveillance and governance, this is one of the important research areas in the artificial intelligence or deep learning. Among various technologies, computer vision and machine learning algorithms have the most efficient, as a huge vehicles video or image data on road is available for study. In this paper, we proposed computer vision-based an efficient approach to vehicle detection, recognition and Tracking. We merge with one-stage (YOLOv4) and two-stage (R-FCN) detectors methods to improve vehicle detection accuracy and speed results. Two-stage object detection methods provide high localization and object recognition precision, even as one-stage detectors achieve high inference and test speed. Deep-SORT tracker method applied for detects bounding boxes to estimate trajectories. We analyze the performance of the Mask RCNN benchmark, YOLOv3 and Proposed YOLOv4 + R-FCN on the UA-DETRAC dataset and study with certain parameters like Mean Average Precisions (mAP), Precision recall.

Download Full-text

Deep Learning for Pedestrian Detection

International Journal of Scientific and Research Publications (IJSRP) ◽

10.29322/ijsrp.9.08.2019.p9212 ◽

2019 ◽

Vol 9 (8) ◽

pp. p9212

Author(s):

Utkarsha Sagar ◽

Ravi Raja ◽

Himanshu Shekhar

Keyword(s):

Deep Learning ◽

Pedestrian Detection

Download Full-text

COVID-19 pneumonia diagnosis using a simple 2D deep learning framework with a single chest CT image (Preprint)

10.2196/preprints.19407 ◽

2020 ◽

Author(s):

Jinseok Lee

Keyword(s):

Deep Learning ◽

Diagnostic Performance ◽

Ct Images ◽

Chest Ct ◽

University Hospital ◽

Detection Accuracy ◽

Ct Image ◽

Test Dataset ◽

Learning Framework ◽

Testing Dataset

BACKGROUND The coronavirus disease (COVID-19) has explosively spread worldwide since the beginning of 2020. According to a multinational consensus statement from the Fleischner Society, computed tomography (CT) can be used as a relevant screening tool owing to its higher sensitivity for detecting early pneumonic changes. However, physicians are extremely busy fighting COVID-19 in this era of worldwide crisis. Thus, it is crucial to accelerate the development of an artificial intelligence (AI) diagnostic tool to support physicians. OBJECTIVE We aimed to quickly develop an AI technique to diagnose COVID-19 pneumonia and differentiate it from non-COVID pneumonia and non-pneumonia diseases on CT. METHODS A simple 2D deep learning framework, named fast-track COVID-19 classification network (FCONet), was developed to diagnose COVID-19 pneumonia based on a single chest CT image. FCONet was developed by transfer learning, using one of the four state-of-art pre-trained deep learning models (VGG16, ResNet50, InceptionV3, or Xception) as a backbone. For training and testing of FCONet, we collected 3,993 chest CT images of patients with COVID-19 pneumonia, other pneumonia, and non-pneumonia diseases from Wonkwang University Hospital, Chonnam National University Hospital, and the Italian Society of Medical and Interventional Radiology public database. These CT images were split into a training and a testing set at a ratio of 8:2. For the test dataset, the diagnostic performance to diagnose COVID-19 pneumonia was compared among the four pre-trained FCONet models. In addition, we tested the FCONet models on an additional external testing dataset extracted from the embedded low-quality chest CT images of COVID-19 pneumonia in recently published papers. RESULTS Of the four pre-trained models of FCONet, the ResNet50 showed excellent diagnostic performance (sensitivity 99.58%, specificity 100%, and accuracy 99.87%) and outperformed the other three pre-trained models in testing dataset. In additional external test dataset using low-quality CT images, the detection accuracy of the ResNet50 model was the highest (96.97%), followed by Xception, InceptionV3, and VGG16 (90.71%, 89.38%, and 87.12%, respectively). CONCLUSIONS The FCONet, a simple 2D deep learning framework based on a single chest CT image, provides excellent diagnostic performance in detecting COVID-19 pneumonia. Based on our testing dataset, the ResNet50-based FCONet might be the best model, as it outperformed other FCONet models based on VGG16, Xception, and InceptionV3.

Download Full-text

Deep Transfer Learning Based Intrusion Detection System for Electric Vehicular Networks

Sensors ◽

10.3390/s21144736 ◽

2021 ◽

Vol 21 (14) ◽

pp. 4736

Author(s):

Sk. Tanzir Mehedi ◽

Adnan Anwar ◽

Ziaur Rahman ◽

Kawsar Ahmed

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Intrusion Detection ◽

Real Time ◽

Transfer Learning ◽

Security Requirements ◽

Detection Accuracy ◽

Area Network ◽

Complex Data ◽

Network Intrusion

The Controller Area Network (CAN) bus works as an important protocol in the real-time In-Vehicle Network (IVN) systems for its simple, suitable, and robust architecture. The risk of IVN devices has still been insecure and vulnerable due to the complex data-intensive architectures which greatly increase the accessibility to unauthorized networks and the possibility of various types of cyberattacks. Therefore, the detection of cyberattacks in IVN devices has become a growing interest. With the rapid development of IVNs and evolving threat types, the traditional machine learning-based IDS has to update to cope with the security requirements of the current environment. Nowadays, the progression of deep learning, deep transfer learning, and its impactful outcome in several areas has guided as an effective solution for network intrusion detection. This manuscript proposes a deep transfer learning-based IDS model for IVN along with improved performance in comparison to several other existing models. The unique contributions include effective attribute selection which is best suited to identify malicious CAN messages and accurately detect the normal and abnormal activities, designing a deep transfer learning-based LeNet model, and evaluating considering real-world data. To this end, an extensive experimental performance evaluation has been conducted. The architecture along with empirical analyses shows that the proposed IDS greatly improves the detection accuracy over the mainstream machine learning, deep learning, and benchmark deep transfer learning models and has demonstrated better performance for real-time IVN security.

Download Full-text

An Empirical Study on Deployment Faults of Deep Learning Based Mobile Applications

2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE) ◽

10.1109/icse43902.2021.00068 ◽

2021 ◽

Author(s):

Zhenpeng Chen ◽

Huihan Yao ◽

Yiling Lou ◽

Yanbin Cao ◽

Yuanqiang Liu ◽

...

Keyword(s):

Deep Learning ◽

Empirical Study ◽

Mobile Applications

Download Full-text