Counting Crowds with Perspective Distortion Correction via Adaptive Learning

Yixuan Sun; Jian Jin; Xingjiao Wu; Tianlong Ma; Jing Yang

doi:10.3390/s20133781

Counting Crowds with Perspective Distortion Correction via Adaptive Learning

Sensors ◽

10.3390/s20133781 ◽

2020 ◽

Vol 20 (13) ◽

pp. 3781

Author(s):

Yixuan Sun ◽

Jian Jin ◽

Xingjiao Wu ◽

Tianlong Ma ◽

Jing Yang

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Adaptive Learning ◽

Perspective Transformation ◽

Distortion Correction ◽

Data Sets ◽

Original Image ◽

Learning Network ◽

Perspective Distortion ◽

Counting Data

The goal of crowd counting is to estimate the number of people in the image. Presently, use regression to count people number became a mainstream method. It is worth noting that, with the development of convolutional neural networks (CNN), methods that are based on CNN have become a research hotspot. It is a more interesting topic that how to locate the site of the person in the image than simply predicting the number of people in the image. The perspective transformation present is still a challenge, because perspective distortion will cause differences in the size of the crowd in the image. To devote perspective distortion and locate the site of the person more accuracy, we design a novel framework named Adaptive Learning Network (CAL). We use the VGG as the backbone. After each pooling layer is output, we collect the 1/2, 1/4, 1/8, and 1/16 features of the original image and combine them with the weights learned by an adaptive learning branch. The object of our adaptive learning branch is each image in the datasets. By combining the output features of different sizes of each image, the challenge of drastic changes in the size of the image crowd due to perspective transformation is reduced. We conducted experiments on four population counting data sets (i.e., ShanghaiTech Part A, ShanghaiTech Part B, UCF_CC_50 and UCF-QNRF), and the results show that our model has a good performance.

Download Full-text

Blind First-Order Perspective Distortion Correction Using Parallel Convolutional Neural Networks

Sensors ◽

10.3390/s20174898 ◽

2020 ◽

Vol 20 (17) ◽

pp. 4898

Author(s):

Neil Patrick Del Gallego ◽

Joel Ilao ◽

Macario Cordel

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Network Architecture ◽

Input Image ◽

Distortion Correction ◽

Generative Adversarial Networks ◽

Adversarial Networks ◽

Perspective Distortion ◽

Corrected Image ◽

Perspective Distortions

In this work, we present a network architecture with parallel convolutional neural networks (CNN) for removing perspective distortion in images. While other works generate corrected images through the use of generative adversarial networks or encoder-decoder networks, we propose a method wherein three CNNs are trained in parallel, to predict a certain element pair in the 3×3 transformation matrix, M^. The corrected image is produced by transforming the distorted input image using M^−1. The networks are trained from our generated distorted image dataset using KITTI images. Experimental results show promise in this approach, as our method is capable of correcting perspective distortions on images and outperforms other state-of-the-art methods. Our method also recovers the intended scale and proportion of the image, which is not observed in other works.

Download Full-text

RSPCN: Super-Resolution of Digital Elevation Model Based on Recursive Sub-Pixel Convolutional Neural Networks

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10080501 ◽

2021 ◽

Vol 10 (8) ◽

pp. 501

Author(s):

Ruichen Zhang ◽

Shaofeng Bian ◽

Houpu Li

Keyword(s):

Neural Networks ◽

Digital Elevation Model ◽

Convolutional Neural Networks ◽

Adaptive Learning ◽

Nearest Neighbor ◽

Super Resolution ◽

Recursion Theory ◽

Research Issues ◽

Digital Elevation ◽

Elevation Model

The digital elevation model (DEM) is known as one kind of the most significant fundamental geographical data models. The theory, method and application of DEM are hot research issues in geography, especially in geomorphology, hydrology, soil and other related fields. In this paper, we improve the efficient sub-pixel convolutional neural networks (ESPCN) and propose recursive sub-pixel convolutional neural networks (RSPCN) to generate higher-resolution DEMs (HRDEMs) from low-resolution DEMs (LRDEMs). Firstly, the structure of RSPCN is described in detail based on recursion theory. This paper explores the effects of different training datasets, with the self-adaptive learning rate Adam algorithm optimizing the model. Furthermore, the adding-“zero” boundary method is introduced into the RSPCN algorithm as a data preprocessing method, which improves the RSPCN method’s accuracy and convergence. Extensive experiments are conducted to train the method till optimality. Finally, comparisons are made with other traditional interpolation methods, such as bicubic, nearest-neighbor and bilinear methods. The results show that our method has obvious improvements in both accuracy and robustness and further illustrate the feasibility of deep learning methods in the DEM data processing area.

Download Full-text

Hardware‐Friendly Stochastic and Adaptive Learning in Memristor Convolutional Neural Networks

Advanced Intelligent Systems ◽

10.1002/aisy.202100041 ◽

2021 ◽

pp. 2100041

Author(s):

Wei Zhang ◽

Lunshuai Pan ◽

Xuelong Yan ◽

Guangchao Zhao ◽

Hong Chen ◽

...

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Adaptive Learning

Download Full-text

Convolutional Neural Networks for Scientific Images and Other Large Data Sets

10.1007/978-3-030-70388-2_6 ◽

2021 ◽

pp. 149-172

Author(s):

Ryan G. McClarren

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Large Data ◽

Large Data Sets ◽

Data Sets ◽

Scientific Images

Download Full-text

Preprocessing for Enhancing the Classification of Pulmonary Data Sets using Convolutional Neural Networks

PROGRAMMNAYA INGENERIA ◽

10.17587/prin.9.369-374 ◽

2018 ◽

Vol 9 (10) ◽

pp. 369-374

Author(s):

N. Esmaeilishahmirzadi ◽

◽

H. Mortezapour ◽

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Data Sets

Download Full-text

Multi-Branch-CNN: classification of ion channel interacting peptides using parallel convolutional neural networks

10.1101/2021.11.13.468342 ◽

2021 ◽

Author(s):

Jielu Yan ◽

Bob Zhang ◽

Mingliang Zhou ◽

Hang Fai Kwok ◽

Shirley W.I. Siu

Keyword(s):

Neural Networks ◽

Ion Channels ◽

Ion Channel ◽

Convolutional Neural Networks ◽

Data Sets ◽

The Novel ◽

Sodium Potassium ◽

Test Set ◽

Drug Candidates

Ligand peptides that have high affinity for ion channels are critical for regulating ion flux across the plasma membrane. These peptides are now being considered as potential drug candidates for many diseases, such as cardiovascular disease and cancers. There are several studies to identify ion channel interacting peptides computationally, but, to the best of our knowledge, none of them published available tools for prediction. To provide a solution, we present Multi-branch-CNN, a parallel convolutional neural networks (CNNs) method for identifying three types of ion channel peptide binders (sodium, potassium, and calcium). Our experiment shows that the Multi-Branch-CNN method performs comparably to thirteen traditional ML algorithms (TML13) on the test sets of three ion channels. To evaluate the predictive power of our method with respect to novel sequences, as is the case in real-world applications, we created an additional test set for each ion channel, called the novel-test set, which has little or no similarities to the sequences in either the sequences of the train set or the test set. In the novel-test experiment, Multi-Branch-CNN performs significantly better than TML13, showing an improvement in accuracy of 6%, 14%, and 15% for sodium, potassium, and calcium channels, respectively. We confirmed the effectiveness of Multi-Branch-CNN by comparing it to the standard CNN method with one input branch (Single-Branch-CNN) and an ensemble method (TML13-Stack). To facilitate applications, the data sets, script files to reproduce the experiments, and the final predictive models are freely available at https://github.com/jieluyan/Multi-Branch-CNN.

Download Full-text

DeepInterface: Protein-protein interface validation using 3D Convolutional Neural Networks

10.1101/617506 ◽

2019 ◽

Cited By ~ 2

Author(s):

A.T. Balci ◽

C. Gumeli ◽

A. Hakouz ◽

D. Yuret ◽

O. Keskin ◽

...

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Protein Interactions ◽

Data Bank ◽

Validation Dataset ◽

Data Sets ◽

Protein Protein Interactions ◽

3 Dimensional ◽

Almost All ◽

The Given

AbstractMotivationProtein–protein interactions are crucial in almost all biological processes. Proteins interact through their interfaces. It is important to determine how proteins interact through interfaces to understand protein binding mechanisms and to predict new protein-protein interactions.ResultsWe present DeepInterface, a deep learning based method which predicts, for a given protein complex, if the interface between the proteins of a complex is a true interface or not. The model is a 3-dimensional convolutional neural networks model and the positive datasets are obtained from all complexes in the Protein Data Bank, the negative datasets are the incorrect solutions of the docking decoys. The model analyzes a given interface structure and outputs the probability of the given structure being an interface. The accuracy of the model for several interface data sets, including PIFACE, PPI4DOCK, DOCKGROUND is approximately 88% in the validation dataset and 75% in the test dataset. The method can be used to improve the accuracy of template based PPI predictions.

Download Full-text

Plant Diseases Identification through a Discount Momentum Optimizer in Deep Learning

Applied Sciences ◽

10.3390/app11209468 ◽

2021 ◽

Vol 11 (20) ◽

pp. 9468

Author(s):

Yunyun Sun ◽

Yutong Liu ◽

Haocheng Zhou ◽

Huijuan Hu

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Adaptive Learning ◽

Learning Rate ◽

Plant Diseases ◽

Stochastic Gradient Descent ◽

Automatic Identification ◽

Deep Convolutional Neural Networks ◽

Adaptive Learning Rate

Deep learning proves its promising results in various domains. The automatic identification of plant diseases with deep convolutional neural networks attracts a lot of attention at present. This article extends stochastic gradient descent momentum optimizer and presents a discount momentum (DM) deep learning optimizer for plant diseases identification. To examine the recognition and generalization capability of the DM optimizer, we discuss the hyper-parameter tuning and convolutional neural networks models across the plantvillage dataset. We further conduct comparison experiments on popular non-adaptive learning rate methods. The proposed approach achieves an average validation accuracy of no less than 97% for plant diseases prediction on several state-of-the-art deep learning models and holds a low sensitivity to hyper-parameter settings. Experimental results demonstrate that the DM method can bring a higher identification performance, while still maintaining a competitive performance over other non-adaptive learning rate methods in terms of both training speed and generalization.

Download Full-text

Introspective analysis of convolutional neural networks for improving discrimination performance and feature visualisation

PeerJ Computer Science ◽

10.7717/peerj-cs.497 ◽

2021 ◽

Vol 7 ◽

pp. e497

Author(s):

Shakeel Shafiq ◽

Tayyaba Azim

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Convolutional Neural Networks ◽

Discrimination Performance ◽

Input Image ◽

Data Sets ◽

Discrimination Power ◽

Level Information ◽

Fully Connected

Deep neural networks have been widely explored and utilised as a useful tool for feature extraction in computer vision and machine learning. It is often observed that the last fully connected (FC) layers of convolutional neural network possess higher discrimination power as compared to the convolutional and maxpooling layers whose goal is to preserve local and low-level information of the input image and down sample it to avoid overfitting. Inspired from the functionality of local binary pattern (LBP) operator, this paper proposes to induce discrimination into the mid layers of convolutional neural network by introducing a discriminatively boosted alternative to pooling (DBAP) layer that has shown to serve as a favourable replacement of early maxpooling layer in a convolutional neural network (CNN). A thorough research of the related works show that the proposed change in the neural architecture is novel and has not been proposed before to bring enhanced discrimination and feature visualisation power achieved from the mid layer features. The empirical results reveal that the introduction of DBAP layer in popular neural architectures such as AlexNet and LeNet produces competitive classification results in comparison to their baseline models as well as other ultra-deep models on several benchmark data sets. In addition, better visualisation of intermediate features can allow one to seek understanding and interpretation of black box behaviour of convolutional neural networks, used widely by the research community.

Download Full-text

Deep Convolutional Neural Networks for Hyperspectral Image Classification

Journal of Sensors ◽

10.1155/2015/258619 ◽

2015 ◽

Vol 2015 ◽

pp. 1-12 ◽

Cited By ~ 419

Author(s):

Wei Hu ◽

Yangyu Huang ◽

Li Wei ◽

Fan Zhang ◽

Hengchao Li

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Hyperspectral Image ◽

Image Data ◽

Classification Performance ◽

Spectral Signature ◽

Support Vector ◽

Data Sets ◽

Deep Convolutional Neural Networks ◽

Visual Tasks

Recently, convolutional neural networks have demonstrated excellent performance on various visual tasks, including the classification of common two-dimensional images. In this paper, deep convolutional neural networks are employed to classify hyperspectral images directly in spectral domain. More specifically, the architecture of the proposed classifier contains five layers with weights which are the input layer, the convolutional layer, the max pooling layer, the full connection layer, and the output layer. These five layers are implemented on each spectral signature to discriminate against others. Experimental results based on several hyperspectral image data sets demonstrate that the proposed method can achieve better classification performance than some traditional methods, such as support vector machines and the conventional deep learning-based methods.

Download Full-text