Efficient Semantic Segmentation Using Multi-Path Decoder

Xing Bai; Jun Zhou

doi:10.3390/app10186386

Efficient Semantic Segmentation Using Multi-Path Decoder

Applied Sciences ◽

10.3390/app10186386 ◽

2020 ◽

Vol 10 (18) ◽

pp. 6386

Author(s):

Xing Bai ◽

Jun Zhou

Keyword(s):

Neural Network ◽

Real Time ◽

Network Architecture ◽

Resource Constraints ◽

Cost Effective ◽

Semantic Segmentation ◽

Classification Model ◽

Neural Network Architecture ◽

Great Progress ◽

Different Types

Benefiting from the booming of deep learning, the state-of-the-art models achieved great progress. But they are huge in terms of parameters and floating point operations, which makes it hard to apply them to real-time applications. In this paper, we propose a novel deep neural network architecture, named MPDNet, for fast and efficient semantic segmentation under resource constraints. First, we use a light-weight classification model pretrained on ImageNet as the encoder. Second, we use a cost-effective upsampling datapath to restore prediction resolution and convert features for classification into features for segmentation. Finally, we propose to use a multi-path decoder to extract different types of features, which are not ideal to process inside only one convolutional neural network. The experimental results of our model outperform other models aiming at real-time semantic segmentation on Cityscapes. Based on our proposed MPDNet, we achieve 76.7% mean IoU on Cityscapes test set with only 118.84GFLOPs and achieves 37.6 Hz on 768 × 1536 images on a standard GPU.

Download Full-text

A Deep Neural Network Architecture for Real-Time Semantic Segmentation on Embedded Board

Journal of KIISE ◽

10.5626/jok.2018.45.1.94 ◽

2018 ◽

Vol 45 (1) ◽

pp. 94-98

Author(s):

Junyeop Lee ◽

Youngwan Lee

Keyword(s):

Neural Network ◽

Real Time ◽

Network Architecture ◽

Deep Neural Network ◽

Semantic Segmentation ◽

Neural Network Architecture

Download Full-text

Experimentally Defined Convolutional Neural Network Architecture Variants for Non-Temporal Real-Time Fire Detection

2018 25th IEEE International Conference on Image Processing (ICIP) ◽

10.1109/icip.2018.8451657 ◽

2018 ◽

Cited By ~ 8

Author(s):

Andrew J. Dunnings ◽

Toby P. Breckon

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Real Time ◽

Network Architecture ◽

Fire Detection ◽

Neural Network Architecture

Download Full-text

Comparative Performance Analysis of Neural Network Real-Time Object Detections in Different Implementations

EPJ Web of Conferences ◽

10.1051/epjconf/202022602020 ◽

2020 ◽

Vol 226 ◽

pp. 02020

Author(s):

Alexey V. Stadnik ◽

Pavel S. Sazhin ◽

Slavomir Hnatic

Keyword(s):

Neural Network ◽

Neural Networks ◽

Computer Vision ◽

Performance Analysis ◽

Object Detection ◽

Real Time ◽

Network Architecture ◽

Neural Network Architecture ◽

Comparative Performance

The performance of neural networks is one of the most important topics in the field of computer vision. In this work, we analyze the speed of object detection using the well-known YOLOv3 neural network architecture in different frameworks under different hardware requirements. We obtain results, which allow us to formulate preliminary qualitative conclusions about the feasibility of various hardware scenarios to solve tasks in real-time environments.

Download Full-text

MYOLOv3-Tiny: A new convolutional neural network architecture for real-time detection of track fasteners

Computers in Industry ◽

10.1016/j.compind.2020.103303 ◽

2020 ◽

Vol 123 ◽

pp. 103303

Author(s):

Hangyu Qi ◽

Tianhua Xu ◽

Guang Wang ◽

Yu Cheng ◽

Cong Chen

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Real Time ◽

Network Architecture ◽

Neural Network Architecture ◽

Real Time Detection

Download Full-text

ORTHOSEG: A DEEP MULTIMODAL CONVOLUTONAL NEURAL NETWORK ARCHITECTURE FOR SEMANTIC SEGMENTATION OF ORTHOIMAGERY

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-5-621-2018 ◽

2018 ◽

Vol XLII-5 ◽

pp. 621-628 ◽

Cited By ~ 2

Author(s):

P. Bodani ◽

K. Shreshtha ◽

S. Sharma

Keyword(s):

Neural Network ◽

Network Architecture ◽

Multiple Scales ◽

Semantic Segmentation ◽

Effective Field ◽

Surface Model ◽

Training Procedure ◽

Feature Maps ◽

Neural Network Architecture ◽

Wide Range

<p><strong>Abstract.</strong> This paper addresses the task of semantic segmentation of orthoimagery using multimodal data e.g. optical RGB, infrared and digital surface model. We propose a deep convolutional neural network architecture termed OrthoSeg for semantic segmentation using multimodal, orthorectified and coregistered data. We also propose a training procedure for supervised training of OrthoSeg. The training procedure complements the inherent architectural characteristics of OrthoSeg for preventing complex co-adaptations of learned features, which may arise due to probable high dimensionality and spatial correlation in multimodal and/or multispectral coregistered data. OrthoSeg consists of parallel encoding networks for independent encoding of multimodal feature maps and a decoder designed for efficiently fusing independently encoded multimodal feature maps. A softmax layer at the end of the network uses the features generated by the decoder for pixel-wise classification. The decoder fuses feature maps from the parallel encoders locally as well as contextually at multiple scales to generate per-pixel feature maps for final pixel-wise classification resulting in segmented output. We experimentally show the merits of OrthoSeg by demonstrating state-of-the-art accuracy on the ISPRS Potsdam 2D Semantic Segmentation dataset. Adaptability is one of the key motivations behind OrthoSeg so that it serves as a useful architectural option for a wide range of problems involving the task of semantic segmentation of coregistered multimodal and/or multispectral imagery. Hence, OrthoSeg is designed to enable independent scaling of parallel encoder networks and decoder network to better match application requirements, such as the number of input channels, the effective field-of-view, and model capacity.</p>

Download Full-text

Application of deep learning methods to predict ionosphere parameters in real time

E3S Web of Conferences ◽

10.1051/e3sconf/202019602007 ◽

2020 ◽

Vol 196 ◽

pp. 02007

Author(s):

Vladimir Mochalov ◽

Anastasia Mochalova

Keyword(s):

Neural Network ◽

Deep Learning ◽

Real Time ◽

Network Architecture ◽

Short Term Memory ◽

Neural Network Architecture ◽

Short Term ◽

Learning Methods ◽

Term Memory ◽

Long Short Term Memory

In this paper, the previously obtained results on recognition of ionograms using deep learning are expanded to predict the parameters of the ionosphere. After the ionospheric parameters have been identified on the ionogram using deep learning in real time, we can predict the parameters for some time ahead on the basis of the new data obtained Examples of predicting the ionosphere parameters using an artificial recurrent neural network architecture long short-term memory are given. The place of the block for predicting the parameters of the ionosphere in the system for analyzing ionospheric data using deep learning methods is shown.

Download Full-text

REAL-TIME VIDEO SCALING BASED ON CONVOLUTION NEURAL NETWORK ARCHITECTURE

ICTACT Journal on Image and Video Processing ◽

10.21917/ijivp.2017.0218 ◽

2017 ◽

Vol 8 (1) ◽

pp. 1533-1542

Author(s):

S Safinaz ◽

◽

A V Ravi Kumar ◽

Keyword(s):

Neural Network ◽

Real Time ◽

Network Architecture ◽

Convolution Neural Network ◽

Neural Network Architecture

Download Full-text

Spectral Flux-Based Convolutional Neural Network Architecture for Speech Source Localization and its Real-Time Implementation

IEEE Access ◽

10.1109/access.2020.3033533 ◽

2020 ◽

Vol 8 ◽

pp. 197047-197058

Author(s):

Yiya Hao ◽

Abdullah Kucuk ◽

Anshuman Ganguly ◽

Issa M. S. Panahi

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Real Time ◽

Source Localization ◽

Network Architecture ◽

Neural Network Architecture ◽

Spectral Flux ◽

Speech Source Localization

Download Full-text

Real-Time Video Scaling Based on Convolution Neural Network Architecture

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v7.i2.pp381-394 ◽

2017 ◽

Vol 7 (2) ◽

pp. 381

Author(s):

S Safinaz ◽

AV Ravi kumar

Keyword(s):

Neural Network ◽

High Resolution ◽

Real Time ◽

Network Architecture ◽

High Efficiency ◽

Super Resolution ◽

Reconstruction Error ◽

Convolution Neural Network ◽

Neural Network Architecture ◽

Video Frames

In recent years, video super resolution techniques becomes mandatory requirements to get high resolution videos. Many super resolution techniques researched but still video super resolution or scaling is a vital challenge. In this paper, we have presented a real-time video scaling based on convolution neural network architecture to eliminate the blurriness in the images and video frames and to provide better reconstruction quality while scaling of large datasets from lower resolution frames to high resolution frames. We compare our outcomes with multiple exiting algorithms. Our extensive results of proposed technique RemCNN (Reconstruction error minimization Convolution Neural Network) shows that our model outperforms the existing technologies such as bicubic, bilinear, MCResNet and provide better reconstructed motioning images and video frames. The experimental results shows that our average PSNR result is 47.80474 considering upscale-2, 41.70209 for upscale-3 and 36.24503 for upscale-4 for Myanmar dataset which is very high in contrast to other existing techniques. This results proves our proposed model real-time video scaling based on convolution neural network architecture’s high efficiency and better performance.

Download Full-text

Real-time sign language recognition based on neural network architecture

2011 IEEE 43rd Southeastern Symposium on System Theory ◽

10.1109/ssst.2011.5753805 ◽

2011 ◽

Cited By ~ 15

Author(s):

Priyanka Mekala ◽

Ying Gao ◽

Jeffrey Fan ◽

Asad Davari

Keyword(s):

Neural Network ◽

Real Time ◽

Sign Language ◽

Network Architecture ◽

Language Recognition ◽

Sign Language Recognition ◽

Neural Network Architecture

Download Full-text