Multiactivation Pooling Method in Convolutional Neural Networks for Image Recognition

Wireless Communications and Mobile Computing ◽

10.1155/2018/8196906 ◽

2018 ◽

Vol 2018 ◽

pp. 1-15 ◽

Cited By ~ 5

Author(s):

Qi Zhao ◽

Shuchang Lyu ◽

Boxue Zhang ◽

Wenquan Feng

Keyword(s):

Neural Networks ◽

Image Processing ◽

Big Data ◽

Convolutional Neural Networks ◽

Image Recognition ◽

Large Scale ◽

Fog Computing ◽

Feature Extractor ◽

Benchmark Datasets ◽

Classification Tasks

Convolutional neural networks (CNNs) are becoming more and more popular today. CNNs now have become a popular feature extractor applying to image processing, big data processing, fog computing, etc. CNNs usually consist of several basic units like convolutional unit, pooling unit, activation unit, and so on. In CNNs, conventional pooling methods refer to 2×2 max-pooling and average-pooling, which are applied after the convolutional or ReLU layers. In this paper, we propose a Multiactivation Pooling (MAP) Method to make the CNNs more accurate on classification tasks without increasing depth and trainable parameters. We add more convolutional layers before one pooling layer and expand the pooling region to 4×4, 8×8, 16×16, and even larger. When doing large-scale subsampling, we pick top-k activation, sum up them, and constrain them by a hyperparameter σ. We pick VGG, ALL-CNN, and DenseNets as our baseline models and evaluate our proposed MAP method on benchmark datasets: CIFAR-10, CIFAR-100, SVHN, and ImageNet. The classification results are competitive.

Download Full-text

Fish Recognition Model for Fraud Prevention using Convolutional Neural Networks

10.21203/rs.3.rs-849174/v1 ◽

2021 ◽

Author(s):

Rhayane Monteiro ◽

Morgana Ribeiro ◽

Calebi Viana ◽

Mario Wedney de Lima Moreira ◽

Glacio Araújo ◽

...

Keyword(s):

Neural Networks ◽

Species Identification ◽

Convolutional Neural Networks ◽

Image Recognition ◽

Fish Species ◽

Effective Solution ◽

Food Fraud ◽

Fraud Prevention ◽

Recognition Model ◽

Classification Tasks

Abstract Fraud, misidentification, and adulteration of food, whether unintentional or purposeful, are a worldwide and growing concern. Aquaculture and fisheries are recognized as one of the sectors most vulnerable to food fraud. Besides, a series of risks related to health and distrust between consumer and popular market that this sector develop an effective solution for fraud control. Species identification is an essential aspect to expose commercial fraud. Convolutional neural networks (CNNs) are one of the most powerful tools for image recognition and classification tasks. Thus, the objective of this study is to propose a model of recognition of fish species based on CNNs. The results obtained show an algorithm with an accuracy of 86%, providing an effective solution to prevent fish fraud.

Download Full-text

Spatio–Temporal Image Representation of 3D Skeletal Movements for View-Invariant Action Recognition with Deep Convolutional Neural Networks

Sensors ◽

10.3390/s19081932 ◽

2019 ◽

Vol 19 (8) ◽

pp. 1932 ◽

Cited By ~ 4

Author(s):

Huy Hieu Pham ◽

Houssam Salmane ◽

Louahdi Khoudour ◽

Alain Crouzil ◽

Pablo Zegers ◽

...

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Action Recognition ◽

Large Scale ◽

Image Representation ◽

Human Action ◽

Computational Time ◽

Deep Convolutional Neural Networks ◽

Classification Tasks ◽

Spatio Temporal

Designing motion representations for 3D human action recognition from skeleton sequences is an important yet challenging task. An effective representation should be robust to noise, invariant to viewpoint changes and result in a good performance with low-computational demand. Two main challenges in this task include how to efficiently represent spatio–temporal patterns of skeletal movements and how to learn their discriminative features for classification tasks. This paper presents a novel skeleton-based representation and a deep learning framework for 3D action recognition using RGB-D sensors. We propose to build an action map called SPMF (Skeleton Posture-Motion Feature), which is a compact image representation built from skeleton poses and their motions. An Adaptive Histogram Equalization (AHE) algorithm is then applied on the SPMF to enhance their local patterns and form an enhanced action map, namely Enhanced-SPMF. For learning and classification tasks, we exploit Deep Convolutional Neural Networks based on the DenseNet architecture to learn directly an end-to-end mapping between input skeleton sequences and their action labels via the Enhanced-SPMFs. The proposed method is evaluated on four challenging benchmark datasets, including both individual actions, interactions, multiview and large-scale datasets. The experimental results demonstrate that the proposed method outperforms previous state-of-the-art approaches on all benchmark tasks, whilst requiring low computational time for training and inference.

Download Full-text

A novel MapReduce-based deep convolutional neural network algorithm

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-201790 ◽

2021 ◽

pp. 1-13

Author(s):

Xiang-Min Liu ◽

Jian Hu ◽

Deborah Simon Mwakapesa ◽

Y.A. Nanehkaran ◽

Yi-Min Mao ◽

...

Keyword(s):

Neural Networks ◽

Big Data ◽

Convolutional Neural Networks ◽

Large Scale ◽

Feature Learning ◽

Deep Convolutional Neural Networks ◽

Network Training ◽

Load Rate ◽

Data Environment ◽

Neural Networks Optimization

Deep convolutional neural networks (DCNNs), with their complex network structure and powerful feature learning and feature expression capabilities, have been remarkable successes in many large-scale recognition tasks. However, with the expectation of memory overhead and response time, along with the increasing scale of data, DCNN faces three non-rival challenges in a big data environment: excessive network parameters, slow convergence, and inefficient parallelism. To tackle these three problems, this paper develops a deep convolutional neural networks optimization algorithm (PDCNNO) in the MapReduce framework. The proposed method first pruned the network to obtain a compressed network in order to effectively reduce redundant parameters. Next, a conjugate gradient method based on modified secant equation (CGMSE) is developed in the Map phase to further accelerate the convergence of the network. Finally, a load balancing strategy based on regulate load rate (LBRLA) is proposed in the Reduce phase to quickly achieve equal grouping of data and thus improving the parallel performance of the system. We compared the PDCNNO algorithm with other algorithms on three datasets, including SVHN, EMNIST Digits, and ISLVRC2012. The experimental results show that our algorithm not only reduces the space and time overhead of network training but also obtains a well-performing speed-up ratio in a big data environment.

Download Full-text

Reducing weight precision of convolutional neural networks towards large-scale on-chip image recognition

10.1117/12.2176598 ◽

2015 ◽

Author(s):

Zhengping Ji ◽

Ilia Ovsiannikov ◽

Yibing Wang ◽

Lilong Shi ◽

Qiang Zhang

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Image Recognition ◽

Large Scale ◽

On Chip

Download Full-text

Forest fire image recognition based on convolutional neural network

Journal of Algorithms & Computational Technology ◽

10.1177/1748302619887689 ◽

2019 ◽

Vol 13 ◽

pp. 174830261988768 ◽

Cited By ~ 3

Author(s):

Yuanbin Wang ◽

Langfei Dang ◽

Jieying Ren

Keyword(s):

Neural Network ◽

Neural Networks ◽

Image Processing ◽

Convolutional Neural Network ◽

Convolutional Neural Networks ◽

Image Recognition ◽

Forest Fire ◽

Recognition Rate ◽

Extraction Process ◽

Fire Flame

In order to detect fire automatically, a forest fire image recognition method based on convolutional neural networks is proposed in this paper. There are two main types of fire recognition algorithms. One is based on traditional image processing technology and the other is based on convolutional neural network technology. The former is easy to lead in false detection because of blindness and randomness in the stage of feature selection, while for the latter the unprocessed convolutional neural network is applied directly, so that the characteristics learned by the network are not accurate enough, and recognition rate may be affected. In view of these problems, conventional image processing techniques and convolutional neural networks are combined, and an adaptive pooling approach is introduced. The fire flame area can be segmented and the characteristics can be learned by this algorithm ahead. At the same time, the blindness in the traditional feature extraction process is avoided, and the learning of invalid features in the convolutional neural network is also avoided. Experiments show that the convolutional neural network method based on adaptive pooling method has better performance and has higher recognition rate.

Download Full-text