Object detection method on station logo with single shot multi-box detector

Fei Rong; Li Shasha; Xu Qingzheng; Liu Kun

doi:10.1049/joe.2019.1213

CP-SSD: Context Information Scene Perception Object Detection Based on SSD

Applied Sciences ◽

10.3390/app9142785 ◽

2019 ◽

Vol 9 (14) ◽

pp. 2785 ◽

Cited By ~ 2

Author(s):

Yun Jiang ◽

Tingting Peng ◽

Ning Tan

Keyword(s):

Object Detection ◽

Semantic Information ◽

Detection Method ◽

Scene Perception ◽

Context Information ◽

Single Shot ◽

Deep Layers ◽

Feature Information ◽

The Mean ◽

Detection Effect

Single Shot MultiBox Detector (SSD) has achieved good results in object detection but there are problems such as insufficient understanding of context information and loss of features in deep layers. In order to alleviate these problems, we propose a single-shot object detection network Context Perception-SSD (CP-SSD). CP-SSD promotes the network’s understanding of context information by using context information scene perception modules, so as to capture context information for objects of different scales. Deep layer feature map used semantic activation module, through self-supervised learning to adjust the context feature information and channel interdependence, and enhance useful semantic information. CP-SSD was validated on benchmark dataset PASCAL VOC 2007. The experimental results show that, compared with SSD, the mean Average Precision (mAP) of the CP-SSD detection method reaches 77.8%, which is 0.6% higher than that of SSD, and the detection effect was significantly improved in images with difficult to distinguish the object from the background.

Download Full-text

A Moving Object Detection Method Based on Sliding Window Gaussian Mixture Model

JOURNAL OF ELECTRONICS INFORMATION TECHNOLOGY ◽

10.3724/sp.j.1146.2012.01449 ◽

2014 ◽

Vol 35 (7) ◽

pp. 1650-1656 ◽

Cited By ~ 3

Author(s):

Jian-ying Zhou ◽

Xiao-pei Wu ◽

Chao Zhang ◽

Zhao Lü

Keyword(s):

Object Detection ◽

Gaussian Mixture Model ◽

Mixture Model ◽

Detection Method ◽

Sliding Window ◽

Gaussian Mixture ◽

Moving Object Detection ◽

Moving Object

Download Full-text

Design of Desktop Audiovisual Entertainment System with Deep Learning and Haptic Sensations

Symmetry ◽

10.3390/sym12101718 ◽

2020 ◽

Vol 12 (10) ◽

pp. 1718

Author(s):

Chien-Hsing Chou ◽

Yu-Sheng Su ◽

Che-Ju Hsu ◽

Kong-Chang Lee ◽

Ping-Hsuan Han

Keyword(s):

Deep Learning ◽

Object Detection ◽

User Experience ◽

Recognition System ◽

Scene Recognition ◽

Single Shot ◽

Auditory Signals ◽

Hot Weather ◽

Viewing Experience ◽

At Home

In this study, we designed a four-dimensional (4D) audiovisual entertainment system called Sense. This system comprises a scene recognition system and hardware modules that provide haptic sensations for users when they watch movies and animations at home. In the scene recognition system, we used Google Cloud Vision to detect common scene elements in a video, such as fire, explosions, wind, and rain, and further determine whether the scene depicts hot weather, rain, or snow. Additionally, for animated videos, we applied deep learning with a single shot multibox detector to detect whether the animated video contained scenes of fire-related objects. The hardware module was designed to provide six types of haptic sensations set as line-symmetry to provide a better user experience. After the system considers the results of object detection via the scene recognition system, the system generates corresponding haptic sensations. The system integrates deep learning, auditory signals, and haptic sensations to provide an enhanced viewing experience.

Download Full-text

A Robust Thermal Infrared Vehicle and Pedestrian Detection Method in Complex Scenes

Sensors ◽

10.3390/s21041240 ◽

2021 ◽

Vol 21 (4) ◽

pp. 1240

Author(s):

Yang Liu ◽

Hailong Su ◽

Cao Zeng ◽

Xiaoli Li

Keyword(s):

Detection Method ◽

Selection Process ◽

Pedestrian Detection ◽

Thermal Infrared ◽

Single Shot ◽

Detection Algorithms ◽

Complex Scenes ◽

Thermal Infrared Images ◽

Online Feature Selection ◽

Huge Challenge

In complex scenes, it is a huge challenge to accurately detect motion-blurred, tiny, and dense objects in the thermal infrared images. To solve this problem, robust thermal infrared vehicle and pedestrian detection method is proposed in this paper. An important weight parameter β is first proposed to reconstruct the loss function of the feature selective anchor-free (FSAF) module in its online feature selection process, and the FSAF module is optimized to enhance the detection performance of motion-blurred objects. The proposal of parameter β provides an effective solution to the challenge of motion-blurred object detection. Then, the optimized anchor-free branches of the FSAF module are plugged into the YOLOv3 single-shot detector and work jointly with the anchor-based branches of the YOLOv3 detector in both training and inference, which efficiently improves the detection precision of the detector for tiny and dense objects. Experimental results show that the method proposed is superior to other typical thermal infrared vehicle and pedestrian detection algorithms due to 72.2% mean average precision (mAP).

Download Full-text

A Two-Phase Fashion Apparel Detection Method Based on YOLOv4

Applied Sciences ◽

10.3390/app11093782 ◽

2021 ◽

Vol 11 (9) ◽

pp. 3782

Author(s):

Chu-Hui Lee ◽

Chen-Wei Lin

Keyword(s):

Object Detection ◽

Transfer Learning ◽

Detection Method ◽

Phase Transfer ◽

Recognition Task ◽

Phase Detection ◽

Target Domain ◽

Two Phase ◽

Detection Technology ◽

Fashion Apparel

Object detection is one of the important technologies in the field of computer vision. In the area of fashion apparel, object detection technology has various applications, such as apparel recognition, apparel detection, fashion recommendation, and online search. The recognition task is difficult for a computer because fashion apparel images have different characteristics of clothing appearance and material. Currently, fast and accurate object detection is the most important goal in this field. In this study, we proposed a two-phase fashion apparel detection method named YOLOv4-TPD (YOLOv4 Two-Phase Detection), based on the YOLOv4 algorithm, to address this challenge. The target categories for model detection were divided into the jacket, top, pants, skirt, and bag. According to the definition of inductive transfer learning, the purpose was to transfer the knowledge from the source domain to the target domain that could improve the effect of tasks in the target domain. Therefore, we used the two-phase training method to implement the transfer learning. Finally, the experimental results showed that the mAP of our model was better than the original YOLOv4 model through the two-phase transfer learning. The proposed model has multiple potential applications, such as an automatic labeling system, style retrieval, and similarity detection.

Download Full-text

Underwater Object Detection Based on Improved Single Shot MultiBox Detector

2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence ◽

10.1145/3446132.3446170 ◽

2020 ◽

Author(s):

Zhongyun Jiang ◽

Rongrong Wang

Keyword(s):

Object Detection ◽

Single Shot ◽

Underwater Object

Download Full-text

A motion based object detection method

2020 2nd International Conference on Information Technology and Computer Application (ITCA) ◽

10.1109/itca52113.2020.00067 ◽

2020 ◽

Author(s):

Chen Zhaoyang ◽

Gao Haolin ◽

Wang Kun

Keyword(s):

Object Detection ◽

Detection Method

Download Full-text

Research on object detection method based on FF-YOLO for complex scenes

IEEE Access ◽

10.1109/access.2021.3108398 ◽

2021 ◽

pp. 1-1

Author(s):

Chen Baoyuan ◽

Liu Yitong ◽

Sun Kun

Keyword(s):

Object Detection ◽

Detection Method ◽

Complex Scenes

Download Full-text

Investigating the Potential of Network Optimization for a Constrained Object Detection Problem

Journal of Imaging ◽

10.3390/jimaging7040064 ◽

2021 ◽

Vol 7 (4) ◽

pp. 64

Author(s):

Tanguy Ophoff ◽

Cédric Gullentops ◽

Kristof Van Beeck ◽

Toon Goedemé

Keyword(s):

Computational Complexity ◽

Object Detection ◽

Network Optimization ◽

Real Life ◽

Optimization Techniques ◽

Training Data ◽

Single Shot ◽

Standard Object ◽

Number Of Classes

Object detection models are usually trained and evaluated on highly complicated, challenging academic datasets, which results in deep networks requiring lots of computations. However, a lot of operational use-cases consist of more constrained situations: they have a limited number of classes to be detected, less intra-class variance, less lighting and background variance, constrained or even fixed camera viewpoints, etc. In these cases, we hypothesize that smaller networks could be used without deteriorating the accuracy. However, there are multiple reasons why this does not happen in practice. Firstly, overparameterized networks tend to learn better, and secondly, transfer learning is usually used to reduce the necessary amount of training data. In this paper, we investigate how much we can reduce the computational complexity of a standard object detection network in such constrained object detection problems. As a case study, we focus on a well-known single-shot object detector, YoloV2, and combine three different techniques to reduce the computational complexity of the model without reducing its accuracy on our target dataset. To investigate the influence of the problem complexity, we compare two datasets: a prototypical academic (Pascal VOC) and a real-life operational (LWIR person detection) dataset. The three optimization steps we exploited are: swapping all the convolutions for depth-wise separable convolutions, perform pruning and use weight quantization. The results of our case study indeed substantiate our hypothesis that the more constrained a problem is, the more the network can be optimized. On the constrained operational dataset, combining these optimization techniques allowed us to reduce the computational complexity with a factor of 349, as compared to only a factor 9.8 on the academic dataset. When running a benchmark on an Nvidia Jetson AGX Xavier, our fastest model runs more than 15 times faster than the original YoloV2 model, whilst increasing the accuracy by 5% Average Precision (AP).

Download Full-text

Deep Sensor Fusion Based on Frustum Point Single Shot Multibox Detector for 3D Object Detection

10.1109/icip42928.2021.9506167 ◽

2021 ◽

Author(s):

Yu Wang ◽

Ye Zhang ◽

Shaohua Zhai ◽

Hao Chen ◽

Shaoqi Shi ◽

...

Keyword(s):

Object Detection ◽

Sensor Fusion ◽

Single Shot ◽

3D Object ◽

3D Object Detection

Download Full-text