Object Detection Based on Multiple Information Fusion Net

2020, Vol 10 (1), pp. 418
Author(s): Yanni Zhang, Jun Kong, Miao Qi, Yunpeng Liu, Jianzhong Wang, ...

Object detection has long played a significant role in computer vision, but it remains full of challenges. In this paper, we propose a novel object detection framework that exploits the relationships among different objects and the scene-level information of the whole image to cope with the problem that some strongly correlated objects are difficult to recognize. Our motivation is to enrich the semantics of the object detection features with a scene-level information branch and a relationship branch. Our framework differs from traditional detection methods in three important ways: the representation of relationships, the use of scene-level information as prior knowledge, and the fusion of these two kinds of information. Extensive experiments are carried out on the PASCAL VOC and MS COCO databases. The experimental results show that detection performance can be improved by introducing relationship and scene-level information, and our proposed model achieves better performance than several classical and state-of-the-art methods.
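The fusion idea described above can be sketched roughly as follows. Everything here is an illustrative assumption rather than the paper's actual architecture: the feature dimensions, the plain concatenation, and the pairwise relation weights are all stand-ins for learned components.

```python
import numpy as np

def relation_feature(roi_feats, pair_weights):
    """Aggregate the other objects' features with pairwise relation weights."""
    n = roi_feats.shape[0]
    out = np.zeros_like(roi_feats)
    for i in range(n):
        others = np.arange(n) != i
        w = pair_weights[i, others]
        w = w / (w.sum() + 1e-8)             # normalise over the other objects
        out[i] = (w[:, None] * roi_feats[others]).sum(axis=0)
    return out

def fuse(roi_feats, scene_feat, pair_weights):
    """Concatenate per-object ROI, scene-level, and relation features."""
    n = roi_feats.shape[0]
    scene = np.tile(scene_feat, (n, 1))      # image-level prior for every object
    rel = relation_feature(roi_feats, pair_weights)
    return np.concatenate([roi_feats, scene, rel], axis=1)

rng = np.random.default_rng(0)
rois = rng.normal(size=(4, 256))             # 4 detected objects, 256-d each
scene = rng.normal(size=(256,))              # global scene descriptor
pairw = rng.uniform(size=(4, 4))             # pairwise relation scores
fused = fuse(rois, scene, pairw)
print(fused.shape)                           # (4, 768)
```

Each object's descriptor is thus widened with the same scene prior and an object-specific relation summary before the classification head sees it.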

2021, Vol 11 (23), pp. 11241
Author(s): Ling Li, Fei Xue, Dong Liang, Xiaofei Chen

Concealed object detection in terahertz imaging is an urgent need for public security and counter-terrorism. So far, there has been no public terahertz imaging dataset for the evaluation of object detection algorithms. This paper provides a public dataset for evaluating multi-object detection algorithms in active terahertz imaging. Due to high sample similarity and poor imaging quality, object detection on this dataset is much more difficult than on the commonly used public object detection datasets in the computer vision field. Since the traditional hard example mining approach is designed around two-stage detectors and cannot be directly applied to one-stage detectors, this paper designs an image-based Hard Example Mining (HEM) scheme based on RetinaNet. Several state-of-the-art detectors, including YOLOv3, YOLOv4, FRCN-OHEM, and RetinaNet, are evaluated on this dataset. Experimental results show that RetinaNet achieves the best mAP and that HEM further enhances the performance of the model. The parameters affecting the detection metrics of individual images are summarized and analyzed in the experiments.
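Image-level hard example mining of the kind described can be sketched as below: after an epoch, images ranked by their detection loss are oversampled for the next epoch. The loss bookkeeping, hard fraction, and repeat count are illustrative assumptions, not the paper's settings.

```python
def mine_hard_examples(image_losses, hard_fraction=0.25, repeat=2):
    """Build a sampling list in which the hardest images appear `repeat` times."""
    ranked = sorted(image_losses, key=image_losses.get, reverse=True)
    n_hard = max(1, int(len(ranked) * hard_fraction))
    hard = ranked[:n_hard]                     # highest-loss images of the epoch
    schedule = list(image_losses)              # every image sampled once...
    for img in hard:
        schedule.extend([img] * (repeat - 1))  # ...hard images extra times
    return schedule, hard

losses = {"img_a": 0.9, "img_b": 0.2, "img_c": 0.7, "img_d": 0.1}
schedule, hard = mine_hard_examples(losses)
print(hard)                                    # ['img_a']
```

Because the ranking uses whole-image losses, the scheme needs no per-anchor bookkeeping and therefore fits a one-stage detector such as RetinaNet.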


Electronics, 2021, Vol 10 (19), pp. 2444
Author(s): Mazhar Javed Awan, Osama Ahmed Masood, Mazin Abed Mohammed, Awais Yasin, Azlan Mohd Zain, ...

In recent years, the amount of malware spreading through the internet and infecting computers and other communication devices has increased tremendously. To date, countless techniques and methodologies have been proposed to detect and neutralize these malicious agents. However, as new and automated malware generation techniques emerge, a lot of malware continues to be produced that can bypass some state-of-the-art malware detection methods. Therefore, there is a need for the classification and detection of these adversarial agents, which can compromise the security of people, organizations, and countless other forms of digital assets. In this paper, we propose a spatial attention and convolutional neural network (SACNN) based on a deep learning framework for image-based classification of 25 well-known malware families, with and without class balancing. Performance was evaluated on the Malimg benchmark dataset using precision, recall, specificity, precision, and F1 score, on which our proposed model with class balancing reached 97.42%, 97.95%, 97.33%, 97.11%, and 97.32%. We also conducted experiments on SACNN with class balancing that included a benign class, which likewise produced scores above 97%. The results indicate that our proposed model can be used for image-based malware detection with high performance, despite being simpler than other available solutions.
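A spatial attention gate of the kind SACNN's name suggests can be sketched as below: channel-wise average and max pooling produce two spatial maps whose combination is squashed to (0, 1) and used to reweight the feature map. The sum standing in for a learned convolution is an illustrative reconstruction, not the authors' exact module.

```python
import numpy as np

def spatial_attention(feat):
    """feat: (C, H, W) feature map -> attention-weighted map of same shape."""
    avg_map = feat.mean(axis=0)              # (H, W) channel-wise average pool
    max_map = feat.max(axis=0)               # (H, W) channel-wise max pool
    logits = avg_map + max_map               # stand-in for a learned 2D conv
    gate = 1.0 / (1.0 + np.exp(-logits))     # sigmoid -> values in (0, 1)
    return feat * gate[None, :, :]           # reweight every channel spatially

feat = np.random.default_rng(1).normal(size=(8, 4, 4))
out = spatial_attention(feat)
print(out.shape)                             # (8, 4, 4)
```

The gate leaves the tensor shape unchanged; it only amplifies or suppresses spatial positions, which is what lets malware-texture regions dominate the classification.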


Author(s): Tannistha Pal

Introduction: Moving object detection in videos is among the most difficult tasks in many areas of computer vision applications. Among the traditional object detection methods, researchers have concluded that the Background Subtraction method performs better in terms of execution time and output quality. Method: Visual Background Extractor (ViBe) is a renowned Background Subtraction algorithm for detecting moving objects in various applications. In recent years, much work has been carried out to improve the existing Visual Background Extractor algorithm. Result: After investigating many state-of-the-art techniques and identifying the research gaps, this paper presents an improved background subtraction technique based on morphological operations and a 2D median filter for detecting moving objects, which reduces the noise in the output video and also enhances its accuracy at a very limited additional cost. Experimental results on several benchmark datasets confirm the superiority of the proposed method over state-of-the-art object detection methods. Conclusion: In this article, a method has been proposed for moving object detection that enhances the quality of the output and achieves good accuracy, which helps in efficient object detection. The proposed technique combines the Visual Background Extractor algorithm with image enhancement procedures, namely morphological operations and 2D filtering, at a limited additional cost. Discussion: This article addresses specific issues, namely noise reduction and image enhancement of the output of the existing ViBe algorithm. The technique proposed in this article will be beneficial for various computer vision applications such as video surveillance, road condition monitoring, airport safety, human activity analysis, and monitoring marine borders for security purposes.
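The cleanup stage described above can be sketched as follows: a raw foreground mask (as ViBe might output) is passed through a 3x3 median filter and a morphological opening (erosion then dilation). The naive NumPy operators and the toy mask are illustrative stand-ins, not the paper's implementation.

```python
import numpy as np

def _neighborhood(mask, pad_value):
    # Stack each pixel's 3x3 neighborhood along a new leading axis.
    p = np.pad(mask, 1, constant_values=pad_value)
    h, w = mask.shape
    return np.stack([p[i:i + h, j:j + w] for i in range(3) for j in range(3)])

def median3(mask):
    # 3x3 median filter: removes isolated noise pixels.
    return np.median(_neighborhood(mask, 0), axis=0).astype(mask.dtype)

def opening(mask):
    # Morphological opening: erosion followed by dilation with a 3x3 kernel.
    eroded = _neighborhood(mask, 0).min(axis=0)
    return _neighborhood(eroded, 0).max(axis=0)

mask = np.zeros((10, 10), dtype=np.uint8)
mask[2:8, 2:8] = 1                           # a solid foreground blob
mask[0, 9] = 1                               # isolated single-pixel noise
clean = opening(median3(mask))
print(int(clean[0, 9]), int(clean[4, 4]))    # 0 1
```

The speckle vanishes while the blob's interior survives, which is the "limited additional cost" noise reduction the abstract claims.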


2018, Vol 232, pp. 04036
Author(s): Jun Yin, Huadong Pan, Hui Su, Zhonggeng Liu, Zhirong Peng

We propose an object detection method that predicts oriented bounding boxes (OBB) to estimate object locations, scales, and orientations, based on YOLO (You Only Look Once), one of the top detection algorithms, performing well in both accuracy and speed. Existing object detection methods use horizontal bounding boxes (HBB), which are not robust to orientation variance. The proposed orientation-invariant YOLO (OIYOLO) detector can effectively deal with bird's-eye-view images, in which the orientation angles of the objects are arbitrary. To estimate the rotated angle of objects, we design a new angle loss function; training OIYOLO therefore forces the network to learn the annotated orientation angles of objects, making OIYOLO orientation invariant. The proposed approach of predicting OBB can also be applied in other detection frameworks. In addition, to evaluate the proposed OIYOLO detector, we create a UAV-DAHUA dataset accurately annotated with object locations, scales, and orientation angles. Extensive experiments conducted on the UAV-DAHUA and DOTA datasets demonstrate that OIYOLO achieves state-of-the-art detection performance with high efficiency compared with the baseline YOLO algorithms.
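An angle regression loss of the kind motivated above can be sketched as follows: the raw difference between predicted and annotated angles is first wrapped so that, e.g., 179° versus -179° counts as a 2° error, then passed through smooth L1. Both the wrapping period and the smooth-L1 form are illustrative assumptions, not the paper's exact loss.

```python
import math

def wrap_angle(delta_deg, period=360.0):
    """Map an angle difference into [-period/2, period/2)."""
    return delta_deg - period * math.floor(delta_deg / period + 0.5)

def angle_loss(pred_deg, target_deg):
    """Smooth-L1 on the wrapped angle error, taken in radians."""
    d = abs(math.radians(wrap_angle(pred_deg - target_deg)))
    return 0.5 * d * d if d < 1.0 else d - 0.5

# 179 deg vs -179 deg is really a 2 deg error once the period is respected.
print(wrap_angle(179.0 - (-179.0)))          # -2.0
```

Without the wrap, a naive L1 loss would charge the network 358° for a nearly perfect prediction, which is exactly the discontinuity an orientation-aware loss has to avoid.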


2020, Vol 9 (6), pp. 370
Author(s): Atakan Körez, Necaattin Barışçı, Aydın Çetin, Uçman Ergün

The detection of objects in very high-resolution (VHR) remote sensing images has become increasingly popular with the enhancement of remote sensing technologies. High-resolution images from aircraft or satellites contain highly detailed and mixed backgrounds that decrease the success of object detection in remote sensing images. In this study, a model that performs weighted ensemble object detection using optimized coefficients is proposed. This model uses the outputs of three different object detection models trained on the same dataset. The model's structure takes two or more object detection methods as its input and produces an output weighted by the optimized coefficients. The Northwestern Polytechnical University Very High Resolution 10 (NWPU-VHR-10) and Remote Sensing Object Detection (RSOD) datasets were used to measure the object detection success of the proposed model. Our experiments reveal that the proposed model improved Mean Average Precision (mAP) by 0.78%–16.5% compared to stand-alone models and achieves better mean average precision than other state-of-the-art methods (3.55% higher on the NWPU-VHR-10 dataset and 1.49% higher on the RSOD dataset).
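Coefficient-weighted ensembling of this kind can be sketched as below. The exhaustive grid search and the toy validation metric (with a planted optimum) are stand-ins for the paper's actual coefficient optimizer, which would score detection mAP on a validation set instead.

```python
import itertools

def ensemble_score(scores, weights):
    """Coefficient-weighted average of the detectors' confidences."""
    return sum(w * s for w, s in zip(weights, scores)) / sum(weights)

def grid_search(eval_fn, step=0.25):
    """Exhaustively pick the weight triple maximising a validation metric."""
    grid = [i * step for i in range(1, int(1 / step) + 1)]
    return max(itertools.product(grid, repeat=3), key=eval_fn)

# Toy validation metric with a known optimum at (0.75, 0.5, 0.25).
best = grid_search(lambda w: -((w[0] - 0.75) ** 2 +
                               (w[1] - 0.5) ** 2 +
                               (w[2] - 0.25) ** 2))
score = ensemble_score([0.9, 0.6, 0.3], best)
print(best, round(score, 3))                 # (0.75, 0.5, 0.25) 0.7
```

The point of optimizing the coefficients, rather than averaging uniformly, is that a detector that is weak on a given dataset is automatically down-weighted.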


Entropy, 2021, Vol 23 (12), pp. 1587
Author(s): Mingfeng Zha, Wenbin Qian, Wenlong Yi, Jing Hua

Traditional pest detection methods are challenging to use in complex forestry environments due to their low accuracy and speed. To address this issue, this paper proposes the YOLOv4_MF model. The YOLOv4_MF model uses MobileNetv2 as the feature extraction block and replaces traditional convolutions with depthwise separable convolutions to reduce the model parameters. In addition, a coordinate attention mechanism is embedded in MobileNetv2 to enhance feature information. A symmetric structure consisting of a three-layer spatial pyramid pooling block is presented, and an improved feature fusion structure is designed to fuse the target information. For the loss function, focal loss is used instead of cross-entropy loss to enhance the network's learning of small targets. The experimental results show that the YOLOv4_MF model achieves 4.24% higher mAP, 4.37% higher precision, and 6.68% higher recall than the YOLOv4 model, while its size is reduced to 1/6 of that of YOLOv4. Moreover, the proposed algorithm achieved 38.62% mAP on the COCO dataset, which is competitive with some state-of-the-art algorithms.
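The focal loss mentioned above has the standard closed form FL(p_t) = -α(1 - p_t)^γ log(p_t). A quick numeric check, using the commonly cited defaults α = 0.25 and γ = 2 (which may differ from the paper's settings), shows how it damps easy examples relative to cross-entropy:

```python
import math

def cross_entropy(p_t):
    return -math.log(p_t)

def focal_loss(p_t, alpha=0.25, gamma=2.0):
    # FL(p_t) = -alpha * (1 - p_t)^gamma * log(p_t)
    return -alpha * (1.0 - p_t) ** gamma * math.log(p_t)

easy, hard = 0.9, 0.1                        # well-classified vs hard target
print(round(focal_loss(easy) / cross_entropy(easy), 4))   # 0.0025
print(round(focal_loss(hard) / cross_entropy(hard), 4))   # 0.2025
```

The easy example's contribution shrinks by a factor of 400 while the hard example keeps roughly a fifth of its cross-entropy weight, which is what pushes the network toward small, hard targets.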


2021, Vol 11 (23), pp. 11174
Author(s): Shashank Mishra, Khurram Azeem Hashmi, Alain Pagani, Marcus Liwicki, Didier Stricker, ...

Object detection is one of the most critical tasks in the field of computer vision; it comprises identifying and localizing an object in an image. Architectural floor plans represent the layout of buildings and apartments and consist of walls, windows, stairs, and other furniture objects. While recognizing floor plan objects is straightforward for humans, automatically processing floor plans and recognizing objects is challenging. In this work, we investigate the performance of the recently introduced Cascade Mask R-CNN network for object detection in floor plan images. Furthermore, we experimentally establish that deformable convolutions work better than conventional convolutions in the proposed framework. Prior datasets for object detection in floor plan images are either publicly unavailable or contain few samples. We introduce SFPI, a novel synthetic floor plan dataset consisting of 10,000 images, to address this issue. Our proposed method comfortably exceeds the previous state-of-the-art results on the SESYD dataset with an mAP of 98.1% and sets an impressive baseline on our novel SFPI dataset with an mAP of 99.8%. We believe that introducing this modern dataset will enable researchers to advance work in this domain.


Author(s): Rishipal Singh, Rajneesh Rani, Aman Kamboj

Fruit classification is one of the influential applications of computer vision. Traditional classification models are trained on features such as color, shape, and texture, but these features are common to different varieties of the same fruit; a new set of features is therefore required to classify fruits belonging to the same class. In this paper, we propose an optimized method to classify intra-class fruits using deep convolutional layers. The proposed architecture is capable of solving the challenges of a commercial tray-based system in the supermarket. As research on intra-class classification is still in its infancy, there are challenges that have not yet been tackled, and the proposed method is specifically designed to overcome those related to intra-class fruit classification. The proposed method shows impressive performance on intra-class classification, achieved with fewer parameters than existing methods. The proposed model consists of Inception blocks, residual connections, and various other layers in a very precise order. To validate its performance, the proposed method is compared with state-of-the-art models and performs best in terms of accuracy, loss, parameters, and depth.


Sensors, 2021, Vol 21 (10), pp. 3374
Author(s): Hansen Liu, Kuangang Fan, Qinghua Ouyang, Na Li

To address the threat of drones intruding into high-security areas, real-time drone detection is urgently required to protect these areas. There are two main difficulties in the real-time detection of drones. One is that drones move quickly, which demands faster detectors. The other is that small drones are difficult to detect. In this paper, we first achieve high detection accuracy by evaluating four state-of-the-art object detection methods: RetinaNet, FCOS, YOLOv3, and YOLOv4. Then, to address the first problem, we prune the convolutional channels and shortcut layers of YOLOv4 to develop thinner and shallower models. Furthermore, to improve the accuracy of small drone detection, we implement a special augmentation for small object detection by copying and pasting small drones. Experimental results verify that, compared to YOLOv4, our pruned-YOLOv4 model, with a 0.8 channel prune rate and 24 layers pruned, achieves 90.5% mAP and its processing speed is increased by 60.4%. Additionally, after small object augmentation, the precision and recall of the pruned-YOLOv4 increase by almost 22.8% and 12.7%, respectively. These results verify that our pruned-YOLOv4 is an effective and accurate approach for drone detection.
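The copy-paste augmentation described above can be sketched as follows: a small drone patch is copied to a new location and a matching box is appended to the labels. Placement and overlap checks are omitted for brevity, and all names here are illustrative rather than the authors' pipeline.

```python
import numpy as np

def paste_small_object(image, boxes, box, dest_xy):
    """Copy the patch inside `box` to `dest_xy` and append a matching label."""
    x1, y1, x2, y2 = box
    patch = image[y1:y2, x1:x2].copy()
    dx, dy = dest_xy
    h, w = patch.shape[:2]
    out = image.copy()
    out[dy:dy + h, dx:dx + w] = patch        # paste (no overlap check here)
    return out, boxes + [(dx, dy, dx + w, dy + h)]

img = np.zeros((32, 32), dtype=np.uint8)
img[4:8, 4:8] = 255                          # a tiny 4x4 "drone"
aug, boxes = paste_small_object(img, [(4, 4, 8, 8)], (4, 4, 8, 8), (20, 10))
print(len(boxes), int(aug[12, 22]))          # 2 255
```

Duplicating small instances this way raises the number of small-object anchors the detector sees per image, which is what drives the reported precision and recall gains.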


AI, 2021, Vol 2 (3), pp. 413-428
Author(s): Arunabha M. Roy, Jayabrata Bhaduri

In this paper, a deep-learning-based object detection model for multi-class plant disease has been proposed, built on a state-of-the-art computer vision algorithm. While most existing models are limited to disease detection on a large scale, the current model addresses accurate, fine-grained, multi-scale early disease detection. The proposed model has been improved to optimize both detection speed and accuracy and applied to multi-class apple plant disease detection in a real environment. The mean average precision (mAP) and F1-score of the detection model reached 91.2% and 95.9%, respectively, at a detection rate of 56.9 FPS. The overall detection results demonstrate that the current algorithm significantly outperforms the state-of-the-art detection model, with a 9.05% increase in precision and a 7.6% increase in F1-score. The proposed model can be employed as an effective and efficient method to detect different apple plant diseases under complex orchard scenarios.

