Hierarchical Open-Set Object Detection in Unseen Data

Yeong Hyeon Kim; Dong Kyun Shin; Minhaz Uddin Ahmed; Phill Kyu Rhee

doi:10.3390/sym11101271

Hierarchical Open-Set Object Detection in Unseen Data

Symmetry ◽

10.3390/sym11101271 ◽

2019 ◽

Vol 11 (10) ◽

pp. 1271

Author(s):

Yeong Hyeon Kim ◽

Dong Kyun Shin ◽

Minhaz Uddin Ahmed ◽

Phill Kyu Rhee

Keyword(s):

Object Detection ◽

Clustering Algorithm ◽

Feature Model ◽

Ensemble Prediction ◽

Object Class ◽

Open Set ◽

Regular Object ◽

Unseen Data ◽

Object Proposals ◽

Object Classes

In this paper, we propose an open-set object detection framework based on a dynamic hierarchical structure with incremental learning capabilities for unseen object classes. We were motivated by the observation that deep features extracted from visual objects show a strong hierarchical clustering property. The hierarchical feature model (HFM) was used to learn a new object class by using collaborative sampling (CS) and open-set-aware active semi-supervised learning (ASSL) algorithms. We divided object proposals into superclasses by using the agglomerative clustering algorithm. Data samples in each superclass node were classified into multiple augmented class nodes instead of being directly associated with regular object classes. One or more augmented class nodes are related to a regular object class, and each augmented class has only one superclass. Object proposals from a previously unseen data distribution are assigned to an augmented class node. Dynamic HFM nodes in the decision path are assembled to constitute an ensemble prediction, and the new augmented object is associated with a new regular object class. Our experimental results on standard benchmark datasets such as PASCAL VOC, MS COCO, and ILSVRC DET, as well as on local datasets, showed that the proposed method performs better than state-of-the-art techniques.
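
The superclass step described above can be illustrated with a minimal sketch of agglomerative clustering. This is not the authors' implementation: the average-linkage criterion, the Euclidean distance, the toy 2-D "features", and the function name `agglomerative_superclasses` are all assumptions chosen for illustration.

```python
import math
from itertools import combinations


def euclidean(a, b):
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def average_linkage(c1, c2, points):
    """Mean pairwise distance between two clusters of point indices."""
    total = sum(euclidean(points[i], points[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))


def agglomerative_superclasses(points, num_superclasses):
    """Merge the closest pair of clusters until num_superclasses remain.

    Hypothetical helper: the linkage rule and stopping criterion are
    assumptions, not details taken from the paper.
    """
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > num_superclasses:
        # combinations yields index pairs (a, b) with a < b
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: average_linkage(clusters[p[0]], clusters[p[1]], points),
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters


# Toy stand-ins for deep features of five object proposals.
features = [(0.0, 0.1), (0.2, 0.0), (5.0, 5.1), (5.2, 4.9), (9.0, 0.0)]
superclasses = agglomerative_superclasses(features, 3)
```

Each returned list holds the indices of the proposals grouped into one superclass; in a real detector the inputs would be high-dimensional CNN features rather than 2-D points.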

Malaria parasite detection in thick blood smear microscopic images using modified YOLOV3 and YOLOV4 models

BMC Bioinformatics ◽

10.1186/s12859-021-04036-4 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Fetulhak Abdurahman ◽

Kinde Anlay Fante ◽

Mohammed Aliy

Keyword(s):

Object Detection ◽

Malaria Parasite ◽

Blood Smear ◽

Clustering Algorithm ◽

State Of The Art ◽

Detection Accuracy ◽

Small Object ◽

Thick Blood Smear ◽

Malaria Parasites ◽

Microscopic Images

Background: Manual microscopic examination of Leishman/Giemsa-stained thin and thick blood smears is still the “gold standard” for malaria diagnosis. One of the drawbacks of this method is that its accuracy, consistency, and diagnosis speed depend on microscopists’ diagnostic and technical skills. It is difficult to find highly skilled microscopists in remote areas of developing countries. To alleviate this problem, we investigate state-of-the-art one-stage and two-stage object detection algorithms for automated malaria parasite screening from microscopic images of thick blood smears.
Results: YOLOV3 and YOLOV4 models, which are state-of-the-art object detectors in accuracy and speed, are not optimized for detecting small objects such as malaria parasites in microscopic images. We modify these models by increasing feature scale and adding more detection layers to enhance their capability of detecting small objects without notably decreasing detection speed. We propose one modified YOLOV4 model, called YOLOV4-MOD, and two modified YOLOV3 models, called YOLOV3-MOD1 and YOLOV3-MOD2. In addition, new anchor box sizes are generated using the K-means clustering algorithm to exploit the potential of these models in small object detection. The performance of the modified YOLOV3 and YOLOV4 models was evaluated on a publicly available malaria dataset. These models achieved state-of-the-art accuracy, exceeding the performance of their original versions, Faster R-CNN, and SSD in terms of mean average precision (mAP), recall, precision, F1 score, and average IOU. YOLOV4-MOD achieved the best detection accuracy among all models, with a mAP of 96.32%. YOLOV3-MOD2 and YOLOV3-MOD1 achieved mAPs of 96.14% and 95.46%, respectively.
Conclusions: The experimental results of this study demonstrate that the performance of the modified YOLOV3 and YOLOV4 models is highly promising for detecting malaria parasites from images captured by a smartphone camera over the microscope eyepiece. The proposed system is suitable for deployment in low-resource settings.
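
The anchor-box step mentioned in the results can be sketched as K-means over box widths and heights. The abstract only names K-means; assigning boxes by highest IoU (the convention commonly used for YOLO anchors), the deterministic initialisation, the toy box sizes, and the names `iou_wh` and `kmeans_anchors` are assumptions for illustration.

```python
def iou_wh(box, anchor):
    """IoU of two (width, height) boxes aligned at a shared corner."""
    inter = min(box[0], anchor[0]) * min(box[1], anchor[1])
    union = box[0] * box[1] + anchor[0] * anchor[1] - inter
    return inter / union


def kmeans_anchors(boxes, k, iterations=50):
    """Cluster (width, height) pairs into k anchors.

    Hypothetical helper: boxes are assigned to the anchor with the
    highest IoU, and anchors are updated to the mean of their group.
    """
    # Deterministic start: pick k boxes spread across the area ordering.
    ordered = sorted(boxes, key=lambda b: b[0] * b[1])
    step = len(ordered) / k
    anchors = [ordered[int(i * step + step / 2)] for i in range(k)]

    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for box in boxes:
            best = max(range(k), key=lambda i: iou_wh(box, anchors[i]))
            groups[best].append(box)

        new_anchors = []
        for anchor, group in zip(anchors, groups):
            if group:
                mean_w = sum(b[0] for b in group) / len(group)
                mean_h = sum(b[1] for b in group) / len(group)
                new_anchors.append((mean_w, mean_h))
            else:
                new_anchors.append(anchor)  # keep an anchor that attracted no boxes

        if new_anchors == anchors:
            break
        anchors = new_anchors

    return sorted(anchors, key=lambda a: a[0] * a[1])


# Toy ground-truth box sizes in pixels: three small and three larger objects.
boxes = [(8, 9), (10, 10), (9, 11), (30, 28), (32, 31), (29, 33)]
anchors = kmeans_anchors(boxes, k=2)
```

With real annotations, the box list would come from the training labels, and `k` would match the number of anchors the detector expects across its detection layers.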

Object Detection Using Clustering Algorithm Adaptive Searching Regions in Aerial Images

Computer Vision – ECCV 2020 Workshops - Lecture Notes in Computer Science ◽

10.1007/978-3-030-66823-5_39 ◽

2020 ◽

pp. 651-664

Author(s):

Yi Wang ◽

Youlong Yang ◽

Xi Zhao

Keyword(s):

Object Detection ◽

Clustering Algorithm ◽

Aerial Images

Uncertainty for Identifying Open-Set Errors in Visual Object Detection

IEEE Robotics and Automation Letters ◽

10.1109/lra.2021.3123374 ◽

2021 ◽

pp. 1-1

Author(s):

Dimity Miller ◽

Niko Sunderhauf ◽

Michael J Milford ◽

Feras Dayoub

Keyword(s):

Object Detection ◽

Visual Object ◽

Open Set

Object Detection Based on Region Decomposition and Assembly

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33018094 ◽

2019 ◽

Vol 33 ◽

pp. 8094-8101 ◽

Cited By ~ 4

Author(s):

Seung-Hwan Bae

Keyword(s):

Neural Networks ◽

Object Detection ◽

Performance Improvement ◽

Semantic Relations ◽

Detection Accuracy ◽

Semantic Features ◽

Multi Scale ◽

Object Proposals ◽

Object Region ◽

High Level

Region-based object detection infers object regions for one or more categories in an image. Due to recent advances in deep learning and region proposal methods, object detectors based on convolutional neural networks (CNNs) have been flourishing and have provided promising detection results. However, detection accuracy is often degraded because of the low discriminability of object CNN features caused by occlusions and inaccurate region proposals. In this paper, we therefore propose a region decomposition and assembly detector (R-DAD) for more accurate object detection. In the proposed R-DAD, we first decompose an object region into multiple small regions. To capture the entire appearance and the part details of the object jointly, we extract CNN features within the whole object region and the decomposed regions. We then learn the semantic relations between the object and its parts by combining the multi-region features stage by stage with region assembly blocks, and use the combined high-level semantic features for object classification and localization. In addition, for more accurate region proposals, we propose a multi-scale proposal layer that can generate object proposals of various scales. We integrate the R-DAD into several feature extractors and demonstrate a distinct performance improvement on PASCAL07/12 and MSCOCO18 compared to recent convolutional detectors.
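
The decomposition step can be pictured with a small sketch that splits one proposal box into part regions. The abstract only says "multiple small regions"; splitting into left, right, upper, and bottom halves, and the name `decompose_region`, are assumptions made here for illustration.

```python
def decompose_region(box):
    """Split an (x1, y1, x2, y2) box into four overlapping half regions.

    Hypothetical helper: the choice of halves is an assumption, not a
    detail stated in the abstract.
    """
    x1, y1, x2, y2 = box
    cx, cy = (x1 + x2) / 2, (y1 + y2) / 2
    return {
        "left": (x1, y1, cx, y2),
        "right": (cx, y1, x2, y2),
        "upper": (x1, y1, x2, cy),
        "bottom": (x1, cy, x2, y2),
    }


# Part regions for a proposal 10 pixels wide and 20 pixels tall.
parts = decompose_region((0, 0, 10, 20))
```

In a detector, features would then be pooled from the whole box and from each part region before being combined.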

Object Detection Algorithm Based on Multiheaded Attention

Applied Sciences ◽

10.3390/app9091829 ◽

2019 ◽

Vol 9 (9) ◽

pp. 1829 ◽

Cited By ~ 1

Author(s):

Jie Jiang ◽

Hui Xu ◽

Shichang Zhang ◽

Yujie Fang

Keyword(s):

Object Detection ◽

Linear Interpolation ◽

Detection Algorithm ◽

Attention Mechanism ◽

Visual Object ◽

Single Shot ◽

Object Class ◽

Feature Information ◽

Base Network ◽

Detector Model

This study proposes a multiheaded object detection algorithm referred to as MANet. The main purpose of the study is to integrate feature layers of different scales based on the attention mechanism and to enhance contextual connections. To achieve this, we first replaced the feed-forward base network of the single-shot detector with ResNet-101 (inspired by the Deconvolutional Single-Shot Detector) and then applied linear interpolation and the attention mechanism. The information of the feature layers at different scales was fused to improve the accuracy of target detection. The primary contributions of this study are (a) a fusion attention mechanism and (b) a multiheaded attention fusion method. Our final MANet detector model effectively unifies the feature information among the feature layers at different scales, thus enabling it to detect objects of different sizes with higher precision. We used the 512 × 512 input MANet (with a ResNet-101 backbone) to obtain a mean accuracy of 82.7% on the PASCAL Visual Object Classes (VOC) 2007 test set. These results demonstrate that our proposed method yields better accuracy than the conventional single-shot detector (SSD) and other advanced detectors.
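
The general idea of fusing feature layers of different scales can be sketched in one dimension: a coarse feature row is upsampled by linear interpolation to the length of a finer row, and the two are combined with attention weights. This is not MANet's architecture; the softmax weighting, the fixed `scores` (which a real model would learn), and all function names are assumptions.

```python
import math


def upsample_linear(values, out_len):
    """Linearly interpolate a 1-D feature row to out_len samples (endpoints aligned)."""
    if out_len == 1 or len(values) == 1:
        return [values[0]] * out_len
    scale = (len(values) - 1) / (out_len - 1)
    out = []
    for i in range(out_len):
        pos = i * scale
        lo = int(math.floor(pos))
        hi = min(lo + 1, len(values) - 1)
        frac = pos - lo
        out.append(values[lo] * (1 - frac) + values[hi] * frac)
    return out


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_with_attention(fine, coarse, scores):
    """Weighted sum of a fine row and an upsampled coarse row.

    Hypothetical helper: `scores` stands in for learned attention logits.
    """
    coarse_up = upsample_linear(coarse, len(fine))
    w_fine, w_coarse = softmax(scores)
    return [w_fine * f + w_coarse * c for f, c in zip(fine, coarse_up)]


# Equal scores give each scale a weight of 0.5.
fused = fuse_with_attention([1.0] * 5, [0.0, 2.0], scores=[0.0, 0.0])
```

Real feature maps are multi-channel 2-D tensors, so the same pattern would use bilinear interpolation and per-location or per-channel weights.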

Robust object proposals re-ranking for object detection in autonomous driving using convolutional neural networks

Signal Processing Image Communication ◽

10.1016/j.image.2017.02.007 ◽

2017 ◽

Vol 53 ◽

pp. 110-122 ◽

Cited By ~ 31

Author(s):

Cuong Cao Pham ◽

Jae Wook Jeon

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Autonomous Driving ◽

Object Proposals

Wireless multimedia sensor network video object detection method using dynamic clustering algorithm

Multimedia Tools and Applications ◽

10.1007/s11042-019-7474-y ◽

2019 ◽

Vol 79 (23-24) ◽

pp. 16927-16940

Author(s):

Yilin Shao

Keyword(s):

Object Detection ◽

Sensor Network ◽

Clustering Algorithm ◽

Detection Method ◽

Dynamic Clustering ◽

Video Object ◽

Wireless Multimedia ◽

Wireless Multimedia Sensor Network

Vehicle Pedestrian Detection Method Based on Spatial Pyramid Pooling and Attention Mechanism

Information ◽

10.3390/info11120583 ◽

2020 ◽

Vol 11 (12) ◽

pp. 583

Author(s):

Mingtao Guo ◽

Donghui Xue ◽

Peng Li ◽

He Xu

Keyword(s):

Object Detection ◽

Clustering Algorithm ◽

Pedestrian Detection ◽

Attention Mechanism ◽

Complex Environments ◽

Global Features ◽

Feature Map ◽

Spatial Pyramid Pooling ◽

Institute Of Technology ◽

Spatial Pyramid

Object detection for vehicles and pedestrians is extremely difficult to achieve in autopilot applications for the Internet of vehicles, and it is a task that requires the ability to locate and identify smaller targets even in complex environments. This paper proposes a single-stage object detection network (YOLOv3-promote) for the detection of vehicles and pedestrians in complex environments in cities, which improves on the traditional You Only Look Once version 3 (YOLOv3). First, spatial pyramid pooling is used to fuse local and global features in an image to better enrich the expression ability of the feature map and to more effectively detect targets with large size differences in the image; second, an attention mechanism is added to the feature map to weight each channel, thereby enhancing key features and removing redundant features, which allows for strengthening the ability of the feature network to discriminate between target objects and backgrounds; lastly, the anchor box derived from the K-means clustering algorithm is fitted to the final prediction box to complete the positioning and identification of target vehicles and pedestrians. The experimental results show that the proposed method achieved 91.4 mAP (mean average precision), 83.2 F1 score, and 43.7 frames per second (FPS) on the KITTI (Karlsruhe Institute of Technology and Toyota Technological Institute) dataset, and the detection performance was superior to the conventional YOLOv3 algorithm in terms of both accuracy and speed.
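
The channel-weighting idea in the second step can be sketched as follows: each channel is summarised by its global average, passed through a gate, and rescaled. This is a simplified, squeeze-and-excitation-style illustration rather than the paper's module; the sigmoid gate applied directly to the channel mean (with no learned layers) and the name `channel_attention` are assumptions.

```python
import math


def channel_attention(feature_map):
    """Reweight each channel of a feature map by a gate in (0, 1).

    feature_map: list of channels, each a 2-D list of floats.
    Hypothetical helper: a trained module would map the channel
    descriptors through learned layers before the sigmoid.
    """
    # Squeeze: one global-average descriptor per channel.
    means = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]
    # Gate: sigmoid turns each descriptor into a weight between 0 and 1.
    gates = [1 / (1 + math.exp(-m)) for m in means]
    # Scale: multiply every value in a channel by that channel's gate.
    weighted = [
        [[value * gate for value in row] for row in channel]
        for channel, gate in zip(feature_map, gates)
    ]
    return weighted, gates


# Two 2x2 channels: one strongly activated, one suppressed.
fmap = [
    [[2.0, 2.0], [2.0, 2.0]],
    [[-2.0, -2.0], [-2.0, -2.0]],
]
weighted, gates = channel_attention(fmap)
```

Channels with larger average activation receive gates closer to 1, so their features are preserved, while weakly activated channels are scaled down.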

Salient Object Detection Based on Background Feature Clustering

Advances in Multimedia ◽

10.1155/2017/4183986 ◽

2017 ◽

Vol 2017 ◽

pp. 1-9 ◽

Cited By ~ 3

Author(s):

Kan Huang ◽

Yong Zhang ◽

Bo Lv ◽

Yongbiao Shi

Keyword(s):

Object Detection ◽

Clustering Algorithm ◽

Geodesic Distance ◽

Saliency Map ◽

Salient Object Detection ◽

Salient Object ◽

Edge Preserving ◽

Feature Clustering ◽

Background Distribution ◽

Extensive Evaluation

Automatic estimation of salient objects without any prior knowledge can greatly enhance many computer vision tasks. This paper proposes a novel bottom-up framework for salient object detection that first models the background and then separates salient objects from it. We model the background distribution with a feature clustering algorithm, which allows statistical and structural information of the background to be fully exploited. A coarse saliency map is then generated according to the background distribution. To be more discriminative, the coarse saliency map is enhanced by a two-step refinement composed of edge-preserving element-level filtering and upsampling based on geodesic distance. We provide an extensive evaluation and show that our proposed method performs favorably against other outstanding methods on two of the most commonly used datasets. Most importantly, the proposed approach is shown to highlight the salient object more uniformly and to be robust to background noise.

Object detection in hyperspectral imagery by using K-means clustering algorithm with pre-processing

10.1117/12.717926 ◽

2007 ◽

Cited By ~ 2

Author(s):

M. S. Alam ◽

M. I. Elbakary ◽

M. S. Aslan

Keyword(s):

Object Detection ◽

Clustering Algorithm ◽

Hyperspectral Imagery
