scholarly journals Detection of Parking Slots Based on Mask R-CNN

2020 ◽  
Vol 10 (12) ◽  
pp. 4295
Author(s):  
Shaokang Jiang ◽  
Haobin Jiang ◽  
Shidian Ma ◽  
Zhongxu Jiang

Obtaining information on parking slots is a prerequisite for the development of automatic parking systems, which is an essential part of the automatic driving processes. In this paper, we proposed a parking-slot-marking detection approach based on deep learning. The detection process involves the generation of mask of the marking-points by using the Mask R-CNN algorithm, extracting parking guidelines and parallel lines on the mask using the line segment detection (LSD) to determine the candidate parking slots. The experimental results show that the proposed method works well under the condition of complex illumination and around-view images from different sources, with a precision of 94.5% and a recall of 92.7%. The results also indicate that it can be applied to diverse slot types, including vertical, parallel and slanted slots, which is superior to previous methods.

2021 ◽  
Vol 2021 ◽  
pp. 1-8
Author(s):  
Wenjing Lu

This paper proposes a deep learning-based method for mitosis detection in breast histopathology images. A main problem in mitosis detection is that most of the datasets only have weak labels, i.e., only the coordinates indicating the center of the mitosis region. This makes most of the existing powerful object detection methods hardly be used in mitosis detection. Aiming at solving this problem, this paper firstly applies a CNN-based algorithm to pixelwisely segment the mitosis regions, based on which bounding boxes of mitosis are generated as strong labels. Based on the generated bounding boxes, an object detection network is trained to accomplish mitosis detection. Experimental results show that the proposed method is effective in detecting mitosis, and the accuracies outperform state-of-the-art literatures.


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 42595-42607
Author(s):  
Qida Yu ◽  
Guili Xu ◽  
Yuehua Cheng ◽  
Zheng H. Zhu

2014 ◽  
Vol 631-632 ◽  
pp. 602-605 ◽  
Author(s):  
Xin Ying Liu ◽  
Ping Ping Liu

Currently, there has been growing interest in unmanned aerial vehicle (UAV) during the landing. With the widespread use of the UAVs, a more precise estimation on pose and position in the process of landing is required to support the higher-level applications. In this paper, the estimation of pose and position based on the line segment detection (LSD) is proposed. By applying a vision camera, a landmark is detected using the effective LSD algorithm. Then a line-based vision model is built to calculate the pose and position of the UAV. Experimental results show that the state solutions of the proposed method are effective with different shape of landmarks, and the accuracy is minute-level in pose angle error and centimeter-level in position error.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Changyong Li ◽  
Yongxian Fan ◽  
Xiaodong Cai

Abstract Background With the development of deep learning (DL), more and more methods based on deep learning are proposed and achieve state-of-the-art performance in biomedical image segmentation. However, these methods are usually complex and require the support of powerful computing resources. According to the actual situation, it is impractical that we use huge computing resources in clinical situations. Thus, it is significant to develop accurate DL based biomedical image segmentation methods which depend on resources-constraint computing. Results A lightweight and multiscale network called PyConvU-Net is proposed to potentially work with low-resources computing. Through strictly controlled experiments, PyConvU-Net predictions have a good performance on three biomedical image segmentation tasks with the fewest parameters. Conclusions Our experimental results preliminarily demonstrate the potential of proposed PyConvU-Net in biomedical image segmentation with resources-constraint computing.


Sensors ◽  
2021 ◽  
Vol 21 (8) ◽  
pp. 2595
Author(s):  
Balakrishnan Ramalingam ◽  
Abdullah Aamir Hayat ◽  
Mohan Rajesh Elara ◽  
Braulio Félix Gómez ◽  
Lim Yi ◽  
...  

The pavement inspection task, which mainly includes crack and garbage detection, is essential and carried out frequently. The human-based or dedicated system approach for inspection can be easily carried out by integrating with the pavement sweeping machines. This work proposes a deep learning-based pavement inspection framework for self-reconfigurable robot named Panthera. Semantic segmentation framework SegNet was adopted to segment the pavement region from other objects. Deep Convolutional Neural Network (DCNN) based object detection is used to detect and localize pavement defects and garbage. Furthermore, Mobile Mapping System (MMS) was adopted for the geotagging of the defects. The proposed system was implemented and tested with the Panthera robot having NVIDIA GPU cards. The experimental results showed that the proposed technique identifies the pavement defects and litters or garbage detection with high accuracy. The experimental results on the crack and garbage detection are presented. It is found that the proposed technique is suitable for deployment in real-time for garbage detection and, eventually, sweeping or cleaning tasks.


2021 ◽  
Vol 54 (6) ◽  
pp. 1-35
Author(s):  
Ninareh Mehrabi ◽  
Fred Morstatter ◽  
Nripsuta Saxena ◽  
Kristina Lerman ◽  
Aram Galstyan

With the widespread use of artificial intelligence (AI) systems and applications in our everyday lives, accounting for fairness has gained significant importance in designing and engineering of such systems. AI systems can be used in many sensitive environments to make important and life-changing decisions; thus, it is crucial to ensure that these decisions do not reflect discriminatory behavior toward certain groups or populations. More recently some work has been developed in traditional machine learning and deep learning that address such challenges in different subdomains. With the commercialization of these systems, researchers are becoming more aware of the biases that these applications can contain and are attempting to address them. In this survey, we investigated different real-world applications that have shown biases in various ways, and we listed different sources of biases that can affect AI applications. We then created a taxonomy for fairness definitions that machine learning researchers have defined to avoid the existing bias in AI systems. In addition to that, we examined different domains and subdomains in AI showing what researchers have observed with regard to unfair outcomes in the state-of-the-art methods and ways they have tried to address them. There are still many future directions and solutions that can be taken to mitigate the problem of bias in AI systems. We are hoping that this survey will motivate researchers to tackle these issues in the near future by observing existing work in their respective fields.


2013 ◽  
Vol 385-386 ◽  
pp. 1429-1433 ◽  
Author(s):  
Zhong Yan Liang ◽  
San Yuan Zhang

The tilt license plate correction is an important part of the license plate recognition system. Traditional correction methods are based on one theory. It is difficult to use the advantages of different approaches. We propose some methods to help improve the tile license plate correction: a bounding box selection method based on similar height and a mutual correction method based on fitted parallel straight lines. Moreover, we use wide bounding boxes to segment touched characters. If the method based on parallel lines fails, another method, such as PCA-based one, can be used for complement. Experimental results show the proposed method outperforms others.


2018 ◽  
Vol 32 (14) ◽  
pp. 1850166 ◽  
Author(s):  
Lilin Fan ◽  
Kaiyuan Song ◽  
Dong Liu

Semi-supervised community detection is an important research topic in the field of complex network, which incorporates prior knowledge and topology to guide the community detection process. However, most of the previous work ignores the impact of the noise from prior knowledge during the community detection process. This paper proposes a novel strategy to identify and remove the noise from prior knowledge based on harmonic function, so as to make use of prior knowledge more efficiently. Finally, this strategy is applied to three state-of-the-art semi-supervised community detection methods. A series of experiments on both real and artificial networks demonstrate that the accuracy of semi-supervised community detection approach can be further improved.


2021 ◽  
pp. 1-11
Author(s):  
Oscar Herrera ◽  
Belém Priego

Traditionally, a few activation functions have been considered in neural networks, including bounded functions such as threshold, sigmoidal and hyperbolic-tangent, as well as unbounded ReLU, GELU, and Soft-plus, among other functions for deep learning, but the search for new activation functions still being an open research area. In this paper, wavelets are reconsidered as activation functions in neural networks and the performance of Gaussian family wavelets (first, second and third derivatives) are studied together with other functions available in Keras-Tensorflow. Experimental results show how the combination of these activation functions can improve the performance and supports the idea of extending the list of activation functions to wavelets which can be available in high performance platforms.


Author(s):  
Yu-Xiang Zhao ◽  
Yi-Zeng Hsieh ◽  
Shih-Syun Lin

With advances in technology, photo booths equipped with automatic capturing systems have gradually replaced the identification (ID) photo service provided by photography studios, thereby enabling consumers to save a considerable amount of time and money. Common automatic capturing systems employ text and voice instructions to guide users in capturing their ID photos; however, the capturing results may not conform to ID photo specifications. To address this issue, this study proposes an ID photo capturing algorithm that can automatically detect facial contours and adjust the size of captured images. The authors adopted a deep learning method (You Only Look Once) to detect the face and applied a semi-automatic annotation technique of facial landmarks to find the lip and chin regions from the facial region. In the experiments, subjects were seated at various distances and heights for testing the performance of the proposed algorithm. The experimental results show that the proposed algorithm can effectively and accurately capture ID photos that satisfy the required specifications.


Sign in / Sign up

Export Citation Format

Share Document