End to end multi-scale convolutional neural network for crowd counting

Crowd counting via Multi-Scale Adversarial Convolutional Neural Networks

Journal of Intelligent Systems ◽

10.1515/jisys-2019-0157 ◽

2020 ◽

Vol 30 (1) ◽

pp. 180-191

Author(s):

Liping Zhu ◽

Hong Zhang ◽

Sikandar Ali ◽

Baoli Yang ◽

Chengyang Li

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Density Estimation ◽

Large Scale ◽

Receptive Fields ◽

Crowd Counting ◽

Multi Scale ◽

Training Scheme ◽

Joint Training ◽

End To End

Abstract The purpose of crowd counting is to estimate the number of pedestrians in crowd images. Crowd counting or density estimation is an extremely challenging task in computer vision, due to large scale variations and dense scene. Current methods solve these issues by compounding multi-scale Convolutional Neural Network with different receptive fields. In this paper, a novel end-to-end architecture based on Multi-Scale Adversarial Convolutional Neural Network (MSA-CNN) is proposed to generate crowd density and estimate the amount of crowd. Firstly, a multi-scale network is used to extract the globally relevant features in the crowd image, and then fractionally-strided convolutional layers are designed for up-sampling the output to recover the loss of crucial details caused by the earlier max pooling layers. An adversarial loss is directly employed to shrink the estimated value into the realistic subspace to reduce the blurring effect of density estimation. Joint training is performed in an end-to-end fashion using a combination of Adversarial loss and Euclidean loss. The two losses are integrated via a joint training scheme to improve density estimation performance.We conduct some extensive experiments on available datasets to show the significant improvements and supremacy of the proposed approach over the available state-of-the-art approaches.

Download Full-text

Waveform-based End-to-end Deep Convolutional Neural Network with Multi-scale Sliding Windows for Weakly Labeled Sound Event Detection

2020 International Conference on Artificial Intelligence in Information and Communication (ICAIIC) ◽

10.1109/icaiic48513.2020.9064985 ◽

2020 ◽

Author(s):

Seokjin Lee ◽

Minhan Kim

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Event Detection ◽

Deep Convolutional Neural Network ◽

Sliding Windows ◽

Multi Scale ◽

Sound Event ◽

Sound Event Detection ◽

End To End

Download Full-text

Multi-scale dilated convolution of convolutional neural network for crowd counting

Multimedia Tools and Applications ◽

10.1007/s11042-019-08208-6 ◽

2019 ◽

Vol 79 (1-2) ◽

pp. 1057-1073 ◽

Cited By ~ 8

Author(s):

Yanjie Wang ◽

Shiyu Hu ◽

Guodong Wang ◽

Chenglizhao Chen ◽

Zhenkuan Pan

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Crowd Counting ◽

Multi Scale ◽

Dilated Convolution

Download Full-text

Improving Crowd Counting with Multi-Task Multi-Scale Convolutional Neural Network

2018 Eighth International Conference on Instrumentation & Measurement, Computer, Communication and Control (IMCCC) ◽

10.1109/imccc.2018.00104 ◽

2018 ◽

Author(s):

Siqi Tang ◽

Yijia Wu ◽

Wei Bai ◽

Zhisong Pan

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Crowd Counting ◽

Multi Scale

Download Full-text

Crowd Counting via Residual Multi-Scale Convolutional Neural Network

2019 Seventh International Conference on Advanced Cloud and Big Data (CBD) ◽

10.1109/cbd.2019.00063 ◽

2019 ◽

Author(s):

Jingang Lu ◽

Li Zhang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Crowd Counting ◽

Multi Scale

Download Full-text

A novel multi-scale convolutional neural network for motor imagery classification

Biomedical Signal Processing and Control ◽

10.1016/j.bspc.2021.102747 ◽

2021 ◽

Vol 68 ◽

pp. 102747

Author(s):

Mouad Riyad ◽

Mohammed Khalil ◽

Abdellah Adib

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Motor Imagery ◽

Multi Scale

Download Full-text

Pixel-level Diabetic Retinopathy Lesion Detection Using Multi-scale Convolutional Neural Network

2021 IEEE 3rd Global Conference on Life Sciences and Technologies (LifeTech) ◽

10.1109/lifetech52111.2021.9391891 ◽

2021 ◽

Author(s):

Qi Li ◽

Chenglei Peng ◽

Yazhen Ma ◽

Sidan Du ◽

Bin Guo ◽

...

Keyword(s):

Neural Network ◽

Diabetic Retinopathy ◽

Convolutional Neural Network ◽

Lesion Detection ◽

Multi Scale

Download Full-text

Matching Large Baseline Oblique Stereo Images Using an End-to-End Convolutional Neural Network

Remote Sensing ◽

10.3390/rs13020274 ◽

2021 ◽

Vol 13 (2) ◽

pp. 274

Author(s):

Guobiao Yao ◽

Alper Yilmaz ◽

Li Zhang ◽

Fei Meng ◽

Haibin Ai ◽

...

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Stereo Matching ◽

Least Square ◽

Affine Invariant ◽

Stereo Images ◽

Distance Ratio ◽

Matching Algorithm ◽

End To End

The available stereo matching algorithms produce large number of false positive matches or only produce a few true-positives across oblique stereo images with large baseline. This undesired result happens due to the complex perspective deformation and radiometric distortion across the images. To address this problem, we propose a novel affine invariant feature matching algorithm with subpixel accuracy based on an end-to-end convolutional neural network (CNN). In our method, we adopt and modify a Hessian affine network, which we refer to as IHesAffNet, to obtain affine invariant Hessian regions using deep learning framework. To improve the correlation between corresponding features, we introduce an empirical weighted loss function (EWLF) based on the negative samples using K nearest neighbors, and then generate deep learning-based descriptors with high discrimination that is realized with our multiple hard network structure (MTHardNets). Following this step, the conjugate features are produced by using the Euclidean distance ratio as the matching metric, and the accuracy of matches are optimized through the deep learning transform based least square matching (DLT-LSM). Finally, experiments on Large baseline oblique stereo images acquired by ground close-range and unmanned aerial vehicle (UAV) verify the effectiveness of the proposed approach, and comprehensive comparisons demonstrate that our matching algorithm outperforms the state-of-art methods in terms of accuracy, distribution and correct ratio. The main contributions of this article are: (i) our proposed MTHardNets can generate high quality descriptors; and (ii) the IHesAffNet can produce substantial affine invariant corresponding features with reliable transform parameters.

Download Full-text

Convolutional Neural Network for Crowd Counting on Metro Platforms

Symmetry ◽

10.3390/sym13040703 ◽

2021 ◽

Vol 13 (4) ◽

pp. 703

Author(s):

Jun Zhang ◽

Jiaze Liu ◽

Zhizhong Wang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Estimation Error ◽

Image Features ◽

Urban Rail Transit ◽

Crowd Counting ◽

Passenger Flow ◽

Urban Rail ◽

Density Map ◽

Flow Detection

Owing to the increased use of urban rail transit, the flow of passengers on metro platforms tends to increase sharply during peak periods. Monitoring passenger flow in such areas is important for security-related reasons. In this paper, in order to solve the problem of metro platform passenger flow detection, we propose a CNN (convolutional neural network)-based network called the MP (metro platform)-CNN to accurately count people on metro platforms. The proposed method is composed of three major components: a group of convolutional neural networks is used on the front end to extract image features, a multiscale feature extraction module is used to enhance multiscale features, and transposed convolution is used for upsampling to generate a high-quality density map. Currently, existing crowd-counting datasets do not adequately cover all of the challenging situations considered in this study. Therefore, we collected images from surveillance videos of a metro platform to form a dataset containing 627 images, with 9243 annotated heads. The results of the extensive experiments showed that our method performed well on the self-built dataset and the estimation error was minimum. Moreover, the proposed method could compete with other methods on four standard crowd-counting datasets.

Download Full-text

Automatic Driving of End-to-end Convolutional Neural Network Based on MobileNet-V2 Migration Learning

Proceedings of the 12th International Symposium on Visual Information Communication and Interaction - VINCI'2019 ◽

10.1145/3356422.3356458 ◽

2019 ◽

Author(s):

Minghong Hu ◽

Hui Guo ◽

Xuyuan Ji

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Automatic Driving ◽

End To End

Download Full-text