A Rapid Recognition Method for Electronic Components Based on the Improved YOLO-V3 Network

Electronics ◽  
2019 ◽  
Vol 8 (8) ◽  
pp. 825 ◽  
Author(s):  
Rui Huang ◽  
Jinan Gu ◽  
Xiaohong Sun ◽  
Yongtao Hou ◽  
Saad Uddin

Rapid object recognition in the industrial field is key to intelligent manufacturing. Fast recognition methods based on deep learning have been a research focus in recent years, but the trade-off between detection speed and accuracy has not been well resolved. In this paper, a fast recognition method for electronic components against complex backgrounds is presented. Firstly, we built an image dataset, covering image acquisition, image augmentation, and image labeling. Secondly, a fast recognition method based on deep learning was proposed, balancing detection accuracy and detection speed through a lightweight improvement of the YOLO (You Only Look Once)-V3 network model. Finally, experiments were conducted and the proposed method was compared with several popular detection methods. The results showed that the accuracy reached 95.21% with a detection time of 0.0794 s per image, demonstrating the method's advantage for electronic component detection.
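The light-weighting trade-off this abstract points at can be made concrete with a back-of-the-envelope parameter count. The paper's exact modification is not specified here; depthwise-separable convolutions are one common device for slimming a YOLO-V3 backbone, assumed below purely for illustration:

```python
def conv_params(c_in, c_out, k):
    """Parameters of a standard k x k convolution (bias omitted)."""
    return k * k * c_in * c_out

def dw_separable_params(c_in, c_out, k):
    """Depthwise k x k conv + 1 x 1 pointwise conv (bias omitted)."""
    return k * k * c_in + c_in * c_out

# Example: a single 3x3 layer mapping 256 -> 512 channels.
standard = conv_params(256, 512, 3)            # 1,179,648 parameters
separable = dw_separable_params(256, 512, 3)   # 133,376 parameters
reduction = standard / separable               # roughly 8.8x fewer weights
```

Applied across a backbone, reductions of this order are what make detection times below 0.1 s per image plausible on modest hardware.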

2020 ◽  
Vol 2020 ◽  
pp. 1-12
Author(s):  
Zhenbo Lu ◽  
Wei Zhou ◽  
Shixiang Zhang ◽  
Chen Wang

Quick and accurate crash detection is important for saving lives and for improving traffic incident management. In this paper, a feature-fusion-based deep learning framework was developed for video-based urban traffic crash detection, aiming to balance detection speed and accuracy with limited computing resources. In this framework, a residual neural network (ResNet) combined with attention modules was proposed to extract crash-related appearance features from urban traffic videos (i.e., a crash appearance feature extractor), which were then fed to a spatiotemporal feature fusion model, Conv-LSTM (Convolutional Long Short-Term Memory), to capture appearance (static) and motion (dynamic) crash features simultaneously. The proposed model was trained on a set of video clips covering 330 crash and 342 noncrash events. Overall, the model achieved an accuracy of 87.78% on the testing dataset and an acceptable detection speed (FPS > 30 on a GTX 1060). Thanks to the attention module, the proposed model captures localized appearance features of crashes (e.g., vehicle damage and fallen pedestrians) better than conventional convolutional neural networks. The Conv-LSTM module outperformed a conventional LSTM in capturing motion features of crashes, such as roadway congestion and pedestrians gathering after a crash. Compared with traditional motion-based crash detection models, the proposed model achieved higher detection accuracy; moreover, it detects crashes much faster than other feature-fusion-based models (e.g., C3D). The results show that the proposed model is a promising video-based urban traffic crash detection algorithm that could be used in practice.
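The Conv-LSTM fusion step can be sketched in miniature. The cell below uses 1x1 kernels (real models use 3x3 or larger) purely so the convolutions reduce to channel mixing, and random stand-in weights rather than the trained model; it shows only the gating arithmetic that lets the memory accumulate motion cues across frames:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

class ConvLSTMCell:
    """Conv-LSTM gate equations; 1x1 kernels for brevity (real models use 3x3)."""
    def __init__(self, in_ch, hid_ch, seed=0):
        rng = np.random.default_rng(seed)
        # One weight matrix per gate, stacked: input, forget, output, candidate.
        self.Wx = rng.standard_normal((4, hid_ch, in_ch)) * 0.1
        self.Wh = rng.standard_normal((4, hid_ch, hid_ch)) * 0.1
        self.b = np.zeros((4, hid_ch, 1, 1))

    def step(self, x, h, c):
        # A 1x1 convolution is channel mixing at every spatial location.
        z = (np.einsum("gdc,chw->gdhw", self.Wx, x)
             + np.einsum("gdc,chw->gdhw", self.Wh, h) + self.b)
        i, f, o = sigmoid(z[0]), sigmoid(z[1]), sigmoid(z[2])
        g = np.tanh(z[3])
        c_new = f * c + i * g          # cell memory accumulates motion history
        h_new = o * np.tanh(c_new)     # hidden state carries the fused features
        return h_new, c_new

def fuse_sequence(cell, frames, hid_ch):
    """Feed per-frame appearance maps through the cell; the last h is the fused feature."""
    _, H, W = frames[0].shape
    h = np.zeros((hid_ch, H, W))
    c = np.zeros((hid_ch, H, W))
    for x in frames:
        h, c = cell.step(x, h, c)
    return h
```

In the framework above, `frames` would be the ResNet appearance features of consecutive video frames rather than raw pixels.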


2021 ◽  
Vol 13 (10) ◽  
pp. 1909
Author(s):  
Jiahuan Jiang ◽  
Xiongjun Fu ◽  
Rui Qin ◽  
Xiaoyan Wang ◽  
Zhifeng Ma

Synthetic Aperture Radar (SAR) has become an important technical means of marine monitoring in remote sensing, owing to its all-day, all-weather capability. Ship monitoring in national territorial waters supports maritime law enforcement, maritime traffic control, and national maritime security, so ship detection has long been a research hotspot. As the field has moved from traditional detection methods to deep learning, most research has leaned on ever-growing Graphics Processing Unit (GPU) computing power to propose increasingly complex and computationally intensive strategies, while transplanting optical-image detectors without accounting for the low signal-to-noise ratio, low resolution, and single-channel nature imposed by the SAR imaging principle. By pursuing detection accuracy while neglecting detection speed and practical deployment, almost all of these algorithms depend on powerful desktop GPU clusters and cannot be fielded on the front line of marine monitoring to cope with changing conditions. To address these issues, this paper proposes a multi-channel fusion SAR image processing method that makes full use of the image information and the network's feature extraction ability; modeling and training are based on the latest You Only Look Once version 4 (YOLO-V4) deep learning framework. The YOLO-V4-light network was tailored for real-time deployment, significantly reducing model size, detection time, number of computational parameters, and memory consumption, and the network was refined for three-channel images to compensate for the accuracy lost to light-weighting.
The test experiments were completed entirely on a portable computer and achieved an Average Precision (AP) of 90.37% on the SAR Ship Detection Dataset (SSDD), simplifying the model while maintaining a lead over most existing methods. The proposed YOLO-V4-light ship detection algorithm has great practical value for maritime safety monitoring and emergency rescue.
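The multi-channel fusion idea, compensating for SAR's single-channel imagery by stacking derived channels, might look like the sketch below. The abstract does not name the exact channels used; a box-smoothed copy and a gradient-magnitude map are illustrative choices only:

```python
import numpy as np

def to_three_channel(sar):
    """Stack raw intensity, a 3x3 box-smoothed copy, and a gradient-magnitude
    map into a three-channel image. This is a hedged sketch; the paper's
    actual channel construction is not specified in the abstract."""
    sar = sar.astype(np.float64)
    H, W = sar.shape
    # 3x3 box filter via zero-padded shifted sums (cheap speckle smoothing).
    p = np.pad(sar, 1)
    smooth = sum(p[i:i + H, j:j + W] for i in range(3) for j in range(3)) / 9.0
    gy, gx = np.gradient(sar)
    edges = np.hypot(gx, gy)           # highlights ship/sea boundaries
    chans = [sar, smooth, edges]
    # Normalize each channel to [0, 1] so the detector sees comparable ranges.
    chans = [(c - c.min()) / (c.max() - c.min() + 1e-8) for c in chans]
    return np.stack(chans, axis=-1)    # H x W x 3, ready for a YOLO-style backbone
```

The point of the stack is that a three-channel detector backbone then receives complementary views of the same scene instead of a triplicated grayscale image.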


2020 ◽  
Vol 12 (14) ◽  
pp. 2229
Author(s):  
Haojie Liu ◽  
Hong Sun ◽  
Minzan Li ◽  
Michihisa Iida

Maize plant detection was conducted in this study with the goals of targeted fertilization and reduced fertilizer waste in weed spots and gaps between maize plants. Two kinds of methods were used: color-index-based methods and deep learning (DL). The four color indices were excess green (ExG), excess red (ExR), ExG minus ExR, and the hue value from the HSV (hue, saturation, and value) color space, while the DL methods were YOLOv3 and YOLOv3_tiny. With practical application in mind, this study focused on comparing detection accuracy, robustness to complex field conditions, and detection speed. Detection accuracy was evaluated from the resulting images, which were divided into three categories: true positive, false positive, and false negative. Robustness was evaluated by comparing the average intersection over union of each detection method across different sub-datasets, namely the original subset and the blur-processed, increased-brightness, and reduced-brightness subsets. Detection speed was evaluated in frames per second. Results demonstrated that the DL methods outperformed the color-index-based methods in detection accuracy and robustness to complex conditions, while they were inferior in detection speed. This research shows the potential of deep learning technology for maize plant detection; future efforts are needed to improve its detection speed for practical applications.
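The four color indices are simple per-pixel arithmetic and can be computed directly. The ExR coefficient of 1.4 follows the common excess-red definition, which the abstract itself does not spell out:

```python
import numpy as np
import colorsys

def color_indices(rgb):
    """Per-pixel vegetation indices for an RGB image with values in [0, 1]."""
    r, g, b = rgb[..., 0], rgb[..., 1], rgb[..., 2]
    exg = 2.0 * g - r - b          # excess green
    exr = 1.4 * r - g              # excess red (common 1.4R - G form)
    exgr = exg - exr               # ExG minus ExR
    return exg, exr, exgr

def hue_of(r, g, b):
    """Hue channel (in [0, 1)) of one RGB pixel via the HSV color space."""
    return colorsys.rgb_to_hsv(r, g, b)[0]
```

A pure-green pixel, for example, gives ExG = 2, ExR = -1, ExG - ExR = 3, and a hue of 1/3; thresholding any of these maps separates vegetation from soil background.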


2019 ◽  
Vol 2019 ◽  
pp. 1-9 ◽  
Author(s):  
Hai Wang ◽  
Xinyu Lou ◽  
Yingfeng Cai ◽  
Yicheng Li ◽  
Long Chen

Vehicle detection is one of the most important environment perception tasks for autonomous vehicles. Traditional vision-based vehicle detection methods are not accurate enough, especially for small and occluded targets, while light detection and ranging- (lidar-) based methods detect obstacles well but are time-consuming and classify target types poorly. To address these shortcomings and make full use of the depth information from lidar and the obstacle classification ability of vision, this work proposes a real-time vehicle detection algorithm that fuses vision and lidar point cloud information. Firstly, obstacles are detected by the grid projection method using the lidar point cloud. Then, the obstacles are mapped onto the image to obtain several separate regions of interest (ROIs). After that, the ROIs are expanded based on a dynamic threshold and merged to generate the final ROI. Finally, a deep learning method named You Only Look Once (YOLO) is applied to the ROI to detect vehicles. Experimental results on the KITTI dataset demonstrate that the proposed algorithm has high detection accuracy and good real-time performance. Compared with detection based on the YOLO deep learning method alone, the mean average precision (mAP) is increased by 17%.
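The ROI expansion-and-merge step reduces to plain box arithmetic. In the sketch below the paper's dynamic threshold is replaced by a fixed margin for illustration:

```python
def expand_roi(roi, margin, img_w, img_h):
    """Grow a box (x1, y1, x2, y2) by a margin, clipped to the image bounds.
    A fixed margin stands in for the paper's dynamic threshold."""
    x1, y1, x2, y2 = roi
    return (max(0, x1 - margin), max(0, y1 - margin),
            min(img_w, x2 + margin), min(img_h, y2 + margin))

def overlaps(a, b):
    """True if two (x1, y1, x2, y2) boxes intersect."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]

def merge_rois(rois):
    """Repeatedly union overlapping boxes until no pair overlaps."""
    boxes = list(rois)
    changed = True
    while changed:
        changed = False
        for i in range(len(boxes)):
            for j in range(i + 1, len(boxes)):
                if overlaps(boxes[i], boxes[j]):
                    a, b = boxes[i], boxes[j]
                    boxes[i] = (min(a[0], b[0]), min(a[1], b[1]),
                                max(a[2], b[2]), max(a[3], b[3]))
                    del boxes[j]
                    changed = True
                    break
            if changed:
                break
    return boxes
```

Running YOLO only on the merged ROI rather than the full frame is what buys the real-time performance the abstract reports.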


2021 ◽  
Vol 12 ◽  
Author(s):  
Xuewei Wang ◽  
Jun Liu ◽  
Guoxu Liu

Background: Given the light and shadow, branch occlusion, and overlapping leaves found in real natural environments, plant disease and pest detection suffers from slow detection speed, low detection accuracy, high missed-detection rates, and poor robustness.
Results: Based on the YOLOv3-tiny network architecture, and borrowing the idea of the inverse-residual block to reduce layer-by-layer information loss during network transmission, this study proposes a YOLOv3-tiny-IRB algorithm that optimizes the feature extraction network, mitigates the vanishing-gradient problem as the network deepens, avoids feature information loss, and realizes multilayer feature reuse and fusion. The network is trained with dataset expansion and multiscale strategies to obtain the optimal weight model.
Conclusion: Experimental results show that, when tested on the self-built tomato disease and pest dataset while maintaining detection speed (206 frames per second), the method achieves a mean average precision (mAP) of 98.3%, 92.1%, and 90.2% under three conditions: (a) deep separation, (b) debris occlusion, and (c) overlapping leaves, respectively. Compared with current mainstream object detection methods, the proposed method improves the detection accuracy of tomato diseases and pests under occlusion and overlap in real natural environments.
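The inverse-residual block the Results section names follows the MobileNetV2-style pattern: expand channels with a 1x1 conv, filter with a depthwise 3x3 conv, project back down, and add a skip connection. The numpy version below uses random stand-in weights and is a shape-level illustration, not the trained YOLOv3-tiny-IRB:

```python
import numpy as np

def depthwise3x3(x, k):
    """Per-channel 3x3 'same' convolution. x: (C, H, W), k: (C, 3, 3)."""
    C, H, W = x.shape
    p = np.pad(x, ((0, 0), (1, 1), (1, 1)))
    out = np.zeros_like(x, dtype=np.float64)
    for i in range(3):
        for j in range(3):
            out += k[:, i, j][:, None, None] * p[:, i:i + H, j:j + W]
    return out

def relu(x):
    return np.maximum(x, 0.0)

class InvertedResidual:
    """Inverted residual: 1x1 expand -> depthwise 3x3 -> 1x1 project + skip."""
    def __init__(self, ch, expand=4, seed=0):
        rng = np.random.default_rng(seed)
        hid = ch * expand
        self.w_expand = rng.standard_normal((hid, ch)) * 0.05   # 1x1 conv
        self.k_dw = rng.standard_normal((hid, 3, 3)) * 0.05     # depthwise 3x3
        self.w_project = rng.standard_normal((ch, hid)) * 0.05  # 1x1 conv

    def forward(self, x):
        h = relu(np.einsum("dc,chw->dhw", self.w_expand, x))
        h = relu(depthwise3x3(h, self.k_dw))
        h = np.einsum("dc,chw->dhw", self.w_project, h)  # linear bottleneck
        return x + h  # the skip is what reuses features across layers
```

The skip connection is the mechanism behind the abstract's "multilayer feature multiplexing and fusion": shallow features pass through unchanged and are refined, not replaced.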


2019 ◽  
Vol 11 (7) ◽  
pp. 786 ◽  
Author(s):  
Yang-Lang Chang ◽  
Amare Anagaw ◽  
Lena Chang ◽  
Yi Wang ◽  
Chih-Yu Hsiao ◽  
...  

Synthetic aperture radar (SAR) imagery has been a promising data source for monitoring maritime activities, and its application to oil and ship detection has been the focus of many previous studies. Many object detection methods, from traditional approaches to deep learning, have been proposed, but the majority are computationally intensive and have accuracy problems. The huge volume of remote sensing data also poses a challenge for real-time object detection. To mitigate this, a high-performance computing (HPC) method has been proposed to accelerate SAR imagery analysis using GPU-based computing. In this paper, we propose an enhanced GPU-based deep learning method to detect ships in SAR images. The You Only Look Once version 2 (YOLOv2) deep learning framework is used to model the architecture and train the model. YOLOv2 is a state-of-the-art real-time object detection system that outperforms the Faster Region-Based Convolutional Network (Faster R-CNN) and Single Shot MultiBox Detector (SSD) methods. Additionally, to reduce computational time while retaining competitive detection accuracy, we develop a new architecture with fewer layers, called YOLOv2-reduced. In the experiments, we use two datasets for training and testing: the SAR ship detection dataset (SSDD) and a Diversified SAR Ship Detection Dataset (DSSDD). YOLOv2 test results showed an increase in ship detection accuracy as well as a noticeable reduction in computational time compared to Faster R-CNN. The proposed YOLOv2 architecture achieves accuracies of 90.05% and 89.13% on the SSDD and DSSDD datasets, respectively. The proposed YOLOv2-reduced architecture has similarly competent detection performance to YOLOv2, but with less computational time on an NVIDIA TITAN X GPU.
The experimental results show that deep learning can bring a major leap forward in SAR image ship detection performance.
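The accuracy figures quoted across these ship-detection abstracts are typically Average Precision over a ranked list of detections, which reduces to precision averaged over the recall steps:

```python
def average_precision(scores, labels):
    """AP over ranked detections: labels[i] is 1 for a true positive,
    0 for a false positive; scores rank the detections (higher = earlier)."""
    total_pos = sum(labels)
    if total_pos == 0:
        return 0.0
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    tp = fp = 0
    ap = 0.0
    for i in order:
        if labels[i]:
            tp += 1
            ap += tp / (tp + fp) / total_pos  # precision at each recall step
        else:
            fp += 1
    return ap
```

A perfectly ranked list (all true positives first) gives AP = 1.0; each false positive ranked above a true positive pulls the score down, which is why AP rewards both accuracy and ranking quality.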


2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Jianxiong Pan ◽  
Neng Ye ◽  
Aihua Wang ◽  
Xiangming Li

The rapid growth of future smart city applications and the Internet of Things (IoT) has raised higher demands on next-generation radio access technologies with respect to connection density, spectral efficiency (SE), transmission accuracy, and detection latency. Recently, faster-than-Nyquist (FTN) signaling and nonorthogonal multiple access (NOMA) have been regarded as promising technologies for achieving higher SE and massive connectivity, respectively. In this paper, we exploit the joint benefits of FTN and NOMA by superimposing multiple FTN-based transmission signals on the same physical resources. Because of the complicated intra- and inter-user interference introduced by the proposed transmission scheme, conventional detection methods suffer from high computational complexity. To this end, we develop a novel sliding-window detection method incorporating state-of-the-art deep learning (DL) technology. Data-driven offline training is first applied to derive a near-optimal receiver for FTN-based NOMA, which is then deployed online to achieve high detection accuracy with low latency. Monte Carlo simulation results validate that the proposed detector achieves higher detection accuracy than minimum mean squared error frequency-domain equalization (MMSE-FDE) and can even approach the performance of the maximum-likelihood receiver with greatly reduced computational complexity, making it suitable for IoT applications in smart cities with low-latency and high-reliability requirements.
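The sliding-window mechanics can be sketched independently of the network itself: slide a fixed window over the received samples, detect each window, and keep only the centre decisions, whose interference context the window fully covers. Here `detector` stands in for the offline-trained network; a hard threshold is used only to exercise the windowing, and edge windows are elided for brevity:

```python
def sliding_window_detect(rx, detector, win=8, keep=4):
    """Slide a length-`win` window over received samples `rx`, run `detector`
    on each window, and keep only the `keep` centre decisions, discarding
    edge symbols whose ISI context is truncated. `detector` is a stand-in
    for the offline-trained receiver; first/last edges are ignored here."""
    edge = (win - keep) // 2
    out = []
    pos = 0
    while pos + win <= len(rx):
        decisions = detector(rx[pos:pos + win])
        out.extend(decisions[edge:edge + keep])  # centre decisions only
        pos += keep                              # windows tile the kept region
    return out
```

Because each window is a fixed-size input, the trained network's per-window cost is constant, which is where the low-latency claim comes from.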


Sensors ◽  
2019 ◽  
Vol 19 (17) ◽  
pp. 3768 ◽  
Author(s):  
Kong ◽  
Chen ◽  
Wang ◽  
Chen ◽  
Meng ◽  
...  

Vision-based fall-detection methods have been studied before, but many have practical limitations. Because rooms differ, users do not mount the camera or sensors at the same height, yet few studies have taken this into consideration. Moreover, some fall-detection methods lack practicality because only standing, sitting, and falling are considered. Hence, this study constructs a dataset consisting of various daily activities and fall events and studies the effect of camera/sensor height on fall-detection accuracy. Each activity in the dataset was performed by eight participants in eight directions and captured with a depth camera at five different heights. Many related studies have depended heavily on human segmentation using the Kinect SDK, which is not reliable enough. To address this, this study proposes Enhanced Tracking and Denoising Alex-Net (ETDA-Net) to improve tracking and denoising performance and to classify fall and non-fall events. Experimental results indicate that fall-detection accuracy is affected by camera height, that ETDA-Net is robust to this effect, and that it outperforms traditional deep-learning-based fall-detection methods.
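Depth frames from consumer sensors contain invalid (zero) pixels that any tracking/denoising front end must clean before classification. The sketch below fills such holes from valid neighbours; it is a generic depth-cleaning illustration, not ETDA-Net's actual architecture:

```python
import numpy as np

def denoise_depth(depth):
    """Fill invalid (zero) depth pixels with the median of their valid 3x3
    neighbours. Single pass; neighbour values are read from the original
    frame, so fill order does not matter. A hedged sketch of the kind of
    pre-cleaning a denoising front end performs, not ETDA-Net itself."""
    H, W = depth.shape
    out = depth.astype(np.float64).copy()
    p = np.pad(depth.astype(np.float64), 1)   # zero padding marks borders invalid
    for y in range(H):
        for x in range(W):
            if out[y, x] == 0:
                patch = p[y:y + 3, x:x + 3]
                valid = patch[patch > 0]
                if valid.size:
                    out[y, x] = np.median(valid)
    return out
```

Cleaner depth maps make the subsequent tracking and fall/non-fall classification far less sensitive to sensor noise and mounting height.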


Sensors ◽  
2021 ◽  
Vol 21 (9) ◽  
pp. 2902
Author(s):  
Wenting Qiao ◽  
Qiangwei Liu ◽  
Xiaoguang Wu ◽  
Biao Ma ◽  
Gang Li

Pavement crack detection is essential for safe driving. Traditional manual crack detection is highly subjective and time-consuming, so an automatic pavement crack detection system is needed. This remains a challenging task, however, due to the complex topology of cracks and the heavy noise in crack images. Although deep learning technologies have recently achieved breakthrough progress in crack detection, challenges remain, such as large parameter counts and low detection efficiency; most deep-learning-based crack detection algorithms also struggle to strike a good balance between detection accuracy and detection speed. Inspired by the latest deep learning technology in image processing, this paper proposes a novel crack detection algorithm based on a deep feature aggregation network with the spatial-channel squeeze-and-excitation (scSE) attention module, called CrackDFANet. Firstly, we cut the collected crack images into 512 × 512 pixel blocks to build a crack dataset. Then, through iterative optimization on the training and validation sets, we obtained a crack detection model with good robustness. Finally, the CrackDFANet model was verified on a total of 3516 images from five datasets of different sizes containing different noise interference. Experimental results show that the trained CrackDFANet has strong anti-interference ability and better robustness and generalization under light variation, parking lines, water stains, plant disturbance, oil stains, and shadows. Furthermore, CrackDFANet outperforms other state-of-the-art algorithms with more accurate detection and faster speed, while its parameter count and error rate are significantly reduced.
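The scSE attention module combines two recalibrations of a feature map: channel gating computed from a globally pooled descriptor, and spatial gating from a 1x1 convolution across channels, merged elementwise. A numpy sketch with random stand-in weights (not the trained CrackDFANet):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def scse(x, w_sq, w_ex, w_spatial):
    """Concurrent spatial & channel squeeze-and-excitation on x: (C, H, W).
    cSE: global pool -> two-layer bottleneck -> per-channel gates.
    sSE: 1x1 conv across channels -> per-pixel gate map.
    The two recalibrated maps are merged with an elementwise max."""
    # Channel squeeze-and-excitation.
    z = x.mean(axis=(1, 2))                               # squeeze: (C,)
    gates = sigmoid(w_ex @ np.maximum(w_sq @ z, 0.0))     # excite: (C,)
    cse = gates[:, None, None] * x
    # Spatial squeeze-and-excitation (1x1 conv = channel-weighted sum).
    smap = sigmoid(np.einsum("c,chw->hw", w_spatial, x))  # gate map: (H, W)
    sse = smap[None, :, :] * x
    return np.maximum(cse, sse)
```

Because both gates lie in (0, 1), the module can only attenuate features, emphasizing crack-like responses relative to background noise rather than inventing new activations.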


2021 ◽  
Vol 2021 ◽  
pp. 1-19
Author(s):  
Yao Chen ◽  
Tao Duan ◽  
Changyuan Wang ◽  
Yuanyuan Zhang ◽  
Mo Huang

Ship detection in synthetic aperture radar (SAR) imagery has many valuable civil and military applications and has received extraordinary attention in recent years. Traditional detection methods are insensitive to multiscale ships and usually time-consuming, resulting in low detection accuracy and limited real-time capability. To balance accuracy and speed, an end-to-end ship detection method for complex inshore and offshore scenes, based on deep convolutional neural networks (CNNs), is proposed in this paper. First, the SAR images are divided into grids, and anchor boxes are predefined on the responsible grids for dense ship prediction. Then, Darknet-53 with residual units is adopted as the backbone to extract features, and a top-down pyramid structure is added for multiscale feature fusion by concatenation. In this way, abundant hierarchical features containing both spatial and semantic information are extracted. Meanwhile, strategies such as soft non-maximum suppression (Soft-NMS), mix-up and mosaic data augmentation, multiscale training, and hybrid optimization are used to enhance performance. The model is also trained from scratch to avoid the learning-objective bias of pretraining. The proposed one-stage method performs end-to-end inference with a single network, so its concise paradigm guarantees detection speed. Extensive experiments on the public SAR ship detection dataset (SSDD) show that the method detects both inshore and offshore ships more accurately than other mainstream methods, with an average accuracy of 95.52% and a fast detection speed of about 72 frames per second (FPS). Verification on actual Sentinel-1 and Gaofen-3 data further demonstrates the method's effectiveness and robustness.
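Soft-NMS, one of the enhancement strategies listed, replaces hard suppression with score decay, which helps retain densely packed inshore ships that classic NMS would delete. The Gaussian variant can be sketched as:

```python
import math

def iou(a, b):
    """Intersection over union of two boxes (x1, y1, x2, y2)."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    return inter / (area(a) + area(b) - inter + 1e-9)

def soft_nms(boxes, scores, sigma=0.5, score_thresh=0.001):
    """Gaussian Soft-NMS: instead of deleting neighbours of the top-scoring
    box, decay their scores by exp(-IoU^2 / sigma). Returns (box, score)
    pairs in the order they were selected."""
    boxes = list(boxes)
    scores = list(scores)
    keep = []
    while boxes:
        m = max(range(len(scores)), key=scores.__getitem__)
        if scores[m] < score_thresh:
            break
        top = boxes.pop(m)
        keep.append((top, scores.pop(m)))
        # Decay every remaining score by its overlap with the kept box.
        scores = [s * math.exp(-iou(top, b) ** 2 / sigma)
                  for b, s in zip(boxes, scores)]
    return keep
```

With hard NMS, a second ship overlapping the first beyond the IoU threshold is removed outright; here its score merely shrinks, so it can still be kept if it was confident enough.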

