scholarly journals PPANet: Point-Wise Pyramid Attention Network for Semantic Segmentation

2021 ◽  
Vol 2021 ◽  
pp. 1-16
Author(s):  
Mohammed A. M. Elhassan ◽  
YuXuan Chen ◽  
Yunyi Chen ◽  
Chenxi Huang ◽  
Jane Yang ◽  
...  

In recent years, convolutional neural networks (CNNs) have been at the centre of the advances and progress of advanced driver assistance systems and autonomous driving. This paper presents a point-wise pyramid attention network, namely, PPANet, which employs an encoder-decoder approach for semantic segmentation. Specifically, the encoder adopts a novel squeeze nonbottleneck module as a base module to extract feature representations, where squeeze and expansion are utilized to obtain high segmentation accuracy. An upsampling module is designed to work as a decoder; its purpose is to recover the lost pixel-wise representations from the encoding part. The middle part consists of two parts point-wise pyramid attention (PPA) module and an attention-like module connected in parallel. The PPA module is proposed to utilize contextual information effectively. Furthermore, we developed a combined loss function from dice loss and binary cross-entropy to improve accuracy and get faster training convergence in KITTI road segmentation. The paper conducted the training and testing experiments on KITTI road segmentation and Camvid datasets, and the evaluation results show that the proposed method proved its effectiveness in road semantic segmentation.

2020 ◽  
Vol 9 (1) ◽  
pp. 2698-2704

Advanced Driving Assistance System (ADAS) has seen tremendous growth over the past 10 years. In recent times, luxury cars, as well as some newly emerging cars, come with ADAS application. From 2014, Because of the entry of the European new car assessment programme (EuroNCAP) [1] in the AEBS test, it helped gain momentum the introduction of ADAS in Europe [1]. Most OEMs and research institutes have already demonstrated on the self-driving cars [1]. So here, a focus is made on road segmentation where LiDAR sensor takes in the image of the surrounding and where the vehicle should know its path, it is fulfilled by processing a convolutional neural network called semantic segmentation on an FPGA board in 16.9ms [3]. Further, a traffic light detection model is also developed by using NVidia Jetson and 2 FPGA boards, collectively named as 'Driving brain' which acts as a super computer for such networks. The results are obtained at higher accuracy by processing the obtained traffic light images into the CNN classifier [5]. Overall, this paper gives a brief idea of the technical trend of autonomous driving which throws light on algorithms and for advanced driver-assistance systems used for road segmentation and traffic light detection


2021 ◽  
Vol 69 (6) ◽  
pp. 511-523
Author(s):  
Henrietta Lengyel ◽  
Viktor Remeli ◽  
Zsolt Szalay

Abstract The emergence of new autonomous driving systems and functions – in particular, systems that base their decisions on the output of machine learning subsystems responsible for environment perception – brings a significant change in the risks to the safety and security of transportation. These kinds of Advanced Driver Assistance Systems are vulnerable to new types of malicious attacks, and their properties are often not well understood. This paper demonstrates the theoretical and practical possibility of deliberate physical adversarial attacks against deep learning perception systems in general, with a focus on safety-critical driver assistance applications such as traffic sign classification in particular. Our newly developed traffic sign stickers are different from other similar methods insofar that they require no special knowledge or precision in their creation and deployment, thus they present a realistic and severe threat to traffic safety and security. In this paper we preemptively point out the dangers and easily exploitable weaknesses that current and future systems are bound to face.


2020 ◽  
Vol 25 (3) ◽  
pp. 83-92
Author(s):  
Bong-Seo Park ◽  
Hyun-cheol Park ◽  
Jung-jun Her

With the development of advanced driver assistance systems, the more reliable the autonomous driving technology is, the more the rest and entertainment times of the driver of the car increases. Hence, the importance of the entertainment function of automotive audio-video navigation (AVN) systems is increasing. Currently, the AVN system of automobiles has a monitoring function for fault diagnosis and a combination of functions. Applying these technologies is challenging for drivers who want to tune the audio quality to their musical taste. In this study, a method for upgrading the sound quality using a power supply noise filter without deforming the AVN system was developed. The low-pass attenuation that appeared as a side effect was solved by applying a filter using the loudness isotropic curve. In the installation method of the filter, the method of using a fuse holder minimized the inconvenience of AVN detachment and wiring. Based on the results obtained in this study, further research and improvement of the filter are required for audio tuning of various models.


2020 ◽  
Vol 8 (3) ◽  
pp. 188
Author(s):  
Fangfang Liu ◽  
Ming Fang

Image semantic segmentation technology has been increasingly applied in many fields, for example, autonomous driving, indoor navigation, virtual reality and augmented reality. However, underwater scenes, where there is a huge amount of marine biological resources and irreplaceable biological gene banks that need to be researched and exploited, are limited. In this paper, image semantic segmentation technology is exploited to study underwater scenes. We extend the current state-of-the-art semantic segmentation network DeepLabv3 + and employ it as the basic framework. First, the unsupervised color correction method (UCM) module is introduced to the encoder structure of the framework to improve the quality of the image. Moreover, two up-sampling layers are added to the decoder structure to retain more target features and object boundary information. The model is trained by fine-tuning and optimizing relevant parameters. Experimental results indicate that the image obtained by our method demonstrates better performance in improving the appearance of the segmented target object and avoiding its pixels from mingling with other class’s pixels, enhancing the segmentation accuracy of the target boundaries and retaining more feature information. Compared with the original method, our method improves the segmentation accuracy by 3%.


2019 ◽  
Vol 10 (1) ◽  
pp. 13 ◽  
Author(s):  
Shichao Zhang ◽  
Zhe Zhang ◽  
Libo Sun ◽  
Wenhu Qin

Generally, most approaches using methods such as cropping, rotating, and flipping achieve more data to train models for improving the accuracy of detection and segmentation. However, due to the difficulties of labeling such data especially semantic segmentation data, those traditional data augmentation methodologies cannot help a lot when the training set is really limited. In this paper, a model named OFA-Net (One For All Network) is proposed to combine object detection and semantic segmentation tasks. Meanwhile, using a strategy called “1-N Alternation” to train the OFA-Net model, which can make a fusion of features from detection and segmentation data. The results show that object detection data can be recruited to better the segmentation accuracy performance, and furthermore, segmentation data assist a lot to enhance the confidence of predictions for object detection. Finally, the OFA-Net model is trained without traditional data augmentation methodologies and tested on the KITTI test server. The model works well on the KITTI Road Segmentation challenge and can do a good job on the object detection task.


2020 ◽  
Vol 10 (9) ◽  
pp. 3289
Author(s):  
Hanwool Woo ◽  
Mizuki Sugimoto ◽  
Hirokazu Madokoro ◽  
Kazuhito Sato ◽  
Yusuke Tamura ◽  
...  

In this paper, we propose a novel method to estimate a goal of surround vehicles to perform a lane change at a merging section. Recently, autonomous driving and advance driver-assistance systems are attracting great attention as a solution to substitute human drivers and to decrease accident rates. For example, a warning system to alert a lane change performed by surrounding vehicles to the front space of the host vehicle can be considered. If it is possible to forecast the intention of the interrupting vehicle in advance, the host driver can easily respond to the lane change with sufficient reaction time. This paper assumes a mandatory situation where two lanes are merged. The proposed method assesses the interaction between the lane-changing vehicle and the host vehicle on the mainstream lane. Then, the lane-change goal is estimated based on the interaction under the assumption that the lane-changing driver decides to minimize the collision risk. The proposed method applies the dynamic potential field method, which changes the distribution according to the relative speed and distance between two subject vehicles, to assess the interaction. The performance of goal estimation is evaluated using real traffic data, and it is demonstrated that the estimation can be successfully performed by the proposed method.


Sensors ◽  
2021 ◽  
Vol 21 (22) ◽  
pp. 7730
Author(s):  
◽  

Semantic segmentation is one of the most active research topics in computer vision with the goal to assign dense semantic labels for all pixels in a given image. In this paper, we introduce HFEN (Hierarchical Feature Extraction Network), a lightweight network to reach a balance between inference speed and segmentation accuracy. Our architecture is based on an encoder-decoder framework. The input images are down-sampled through an efficient encoder to extract multi-layer features. Then the extracted features are fused via a decoder, where the global contextual information and spatial information are aggregated for final segmentations with real-time performance. Extensive experiments have been conducted on two standard benchmarks, Cityscapes and Camvid, where our network achieved superior performance on NVIDIA 2080Ti.


Sign in / Sign up

Export Citation Format

Share Document