Acoustic Scene Classification with Mismatched Devices Using CliqueNets and Mixup Data Augmentation

PulseNetOne: Fast Unsupervised Pruning of Convolutional Neural Networks for Remote Sensing

Remote Sensing ◽

10.3390/rs12071092 ◽

2020 ◽

Vol 12 (7) ◽

pp. 1092

Author(s):

David Browne ◽

Michael Giering ◽

Steven Prestwich

Keyword(s):

Remote Sensing ◽

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Data Augmentation ◽

Recognition Task ◽

Scene Recognition ◽

Training Data ◽

Learning Approach ◽

Scene Classification

Scene classification is an important aspect of image/video understanding and segmentation. However, remote-sensing scene classification is a challenging image recognition task, partly due to the limited training data, which causes deep-learning Convolutional Neural Networks (CNNs) to overfit. Another difficulty is that images often have very different scales and orientation (viewing angle). Yet another is that the resulting networks may be very large, again making them prone to overfitting and unsuitable for deployment on memory- and energy-limited devices. We propose an efficient deep-learning approach to tackle these problems. We use transfer learning to compensate for the lack of data, and data augmentation to tackle varying scale and orientation. To reduce network size, we use a novel unsupervised learning approach based on k-means clustering, applied to all parts of the network: most network reduction methods use computationally expensive supervised learning methods, and apply only to the convolutional or fully connected layers, but not both. In experiments, we set new standards in classification accuracy on four remote-sensing and two scene-recognition image datasets.

Download Full-text

LDA-based data augmentation algorithm for acoustic scene classification

Knowledge-Based Systems ◽

10.1016/j.knosys.2020.105600 ◽

2020 ◽

Vol 195 ◽

pp. 105600

Author(s):

Yan Leng ◽

Weiwei Zhao ◽

Chan Lin ◽

Chengli Sun ◽

Rongyan Wang ◽

...

Keyword(s):

Data Augmentation ◽

Scene Classification ◽

Augmentation Algorithm

Download Full-text

Deep learning in remote sensing scene classification: a data augmentation enhanced convolutional neural network framework

GIScience & Remote Sensing ◽

10.1080/15481603.2017.1323377 ◽

2017 ◽

Vol 54 (5) ◽

pp. 741-758 ◽

Cited By ~ 58

Author(s):

Xingrui Yu ◽

Xiaomin Wu ◽

Chunbo Luo ◽

Peng Ren

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Deep Learning ◽

Convolutional Neural Network ◽

Data Augmentation ◽

Scene Classification

Download Full-text

Learning Robust Scene Classification Model with Data Augmentation Based on Xception

Journal of Physics Conference Series ◽

10.1088/1742-6596/1575/1/012009 ◽

2020 ◽

Vol 1575 ◽

pp. 012009

Author(s):

Haiyan Chen ◽

Yu Yang ◽

Suning Zhang

Keyword(s):

Data Augmentation ◽

Classification Model ◽

Scene Classification

Download Full-text

Scene Classification for Sports Video Summarization Using Transfer Learning

Sensors ◽

10.3390/s20061702 ◽

2020 ◽

Vol 20 (6) ◽

pp. 1702 ◽

Cited By ~ 2

Author(s):

Muhammad Rafiq ◽

Ghazala Rafiq ◽

Rockson Agyeman ◽

Gyu Sang Choi ◽

Seong-Il Jin

Keyword(s):

Data Augmentation ◽

Video Summarization ◽

Performance Comparison ◽

Practical Implementation ◽

Scene Classification ◽

Advertising Agencies ◽

Sports Video ◽

Performance Results ◽

Implementation Gaps

This paper proposes a novel method for sports video scene classification with the particular intention of video summarization. Creating and publishing a shorter version of the video is more interesting than a full version due to instant entertainment. Generating shorter summaries of the videos is a tedious task that requires significant labor hours and unnecessary machine occupation. Due to the growing demand for video summarization in marketing, advertising agencies, awareness videos, documentaries, and other interest groups, researchers are continuously proposing automation frameworks and novel schemes. Since the scene classification is a fundamental component of video summarization and video analysis, the quality of scene classification is particularly important. This article focuses on various practical implementation gaps over the existing techniques and presents a method to achieve high-quality of scene classification. We consider cricket as a case study and classify five scene categories, i.e., batting, bowling, boundary, crowd and close-up. We employ our model using pre-trained AlexNet Convolutional Neural Network (CNN) for scene classification. The proposed method employs new, fully connected layers in an encoder fashion. We employ data augmentation to achieve a high accuracy of 99.26% over a smaller dataset. We conduct a performance comparison against baseline approaches to prove the superiority of the method as well as state-of-the-art models. We evaluate our performance results on cricket videos and compare various deep-learning models, i.e., Inception V3, Visual Geometry Group (VGGNet16, VGGNet19), Residual Network (ResNet50), and AlexNet. Our experiments demonstrate that our method with AlexNet CNN produces better results than existing proposals.

Download Full-text

A Convolutional Neural Network Based on Grouping Structure for Scene Classification

Remote Sensing ◽

10.3390/rs13132457 ◽

2021 ◽

Vol 13 (13) ◽

pp. 2457

Author(s):

Xuan Wu ◽

Zhijie Zhang ◽

Wanchang Zhang ◽

Yaning Yi ◽

Chuanrong Zhang ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Feature Extraction ◽

Convolutional Neural Network ◽

Data Augmentation ◽

Remote Sensing Image ◽

Difficult Problem ◽

Image Features ◽

Attention Mechanism ◽

Scene Classification

Convolutional neural network (CNN) is capable of automatically extracting image features and has been widely used in remote sensing image classifications. Feature extraction is an important and difficult problem in current research. In this paper, data augmentation for avoiding over fitting was attempted to enrich features of samples to improve the performance of a newly proposed convolutional neural network with UC-Merced and RSI-CB datasets for remotely sensed scene classifications. A multiple grouped convolutional neural network (MGCNN) for self-learning that is capable of promoting the efficiency of CNN was proposed, and the method of grouping multiple convolutional layers capable of being applied elsewhere as a plug-in model was developed. Meanwhile, a hyper-parameter C in MGCNN is introduced to probe into the influence of different grouping strategies for feature extraction. Experiments on the two selected datasets, the RSI-CB dataset and UC-Merced dataset, were carried out to verify the effectiveness of this newly proposed convolutional neural network, the accuracy obtained by MGCNN was 2% higher than the ResNet-50. An algorithm of attention mechanism was thus adopted and incorporated into grouping processes and a multiple grouped attention convolutional neural network (MGCNN-A) was therefore constructed to enhance the generalization capability of MGCNN. The additional experiments indicate that the incorporation of the attention mechanism to MGCNN slightly improved the accuracy of scene classification, but the robustness of the proposed network was enhanced considerably in remote sensing image classifications.

Download Full-text

Vision Transformers for Remote Sensing Image Classification

Remote Sensing ◽

10.3390/rs13030516 ◽

2021 ◽

Vol 13 (3) ◽

pp. 516

Author(s):

Yakoub Bazi ◽

Laila Bashmal ◽

Mohamad M. Al Rahhal ◽

Reham Al Dayil ◽

Naif Al Ajlan

Keyword(s):

Remote Sensing ◽

Language Processing ◽

Additional Data ◽

Data Augmentation ◽

State Of The Art ◽

Remote Sensing Image ◽

Classification Performance ◽

Scene Classification ◽

Remote Sensing Image Classification ◽

Augmentation Strategies

In this paper, we propose a remote-sensing scene-classification method based on vision transformers. These types of networks, which are now recognized as state-of-the-art models in natural language processing, do not rely on convolution layers as in standard convolutional neural networks (CNNs). Instead, they use multihead attention mechanisms as the main building block to derive long-range contextual relation between pixels in images. In a first step, the images under analysis are divided into patches, then converted to sequence by flattening and embedding. To keep information about the position, embedding position is added to these patches. Then, the resulting sequence is fed to several multihead attention layers for generating the final representation. At the classification stage, the first token sequence is fed to a softmax classification layer. To boost the classification performance, we explore several data augmentation strategies to generate additional data for training. Moreover, we show experimentally that we can compress the network by pruning half of the layers while keeping competing classification accuracies. Experimental results conducted on different remote-sensing image datasets demonstrate the promising capability of the model compared to state-of-the-art methods. Specifically, Vision Transformer obtains an average classification accuracy of 98.49%, 95.86%, 95.56% and 93.83% on Merced, AID, Optimal31 and NWPU datasets, respectively. While the compressed version obtained by removing half of the multihead attention layers yields 97.90%, 94.27%, 95.30% and 93.05%, respectively.

Download Full-text

Long-term scalogram integrated with an iterative data augmentation scheme for acoustic scene classification

The Journal of the Acoustical Society of America ◽

10.1121/10.0005202 ◽

2021 ◽

Vol 149 (6) ◽

pp. 4198-4213

Author(s):

Hangting Chen ◽

Zuozhen Liu ◽

Zongming Liu ◽

Pengyuan Zhang

Keyword(s):

Data Augmentation ◽

Scene Classification

Download Full-text

Multi-scale semantic feature fusion and data augmentation for acoustic scene classification

Applied Acoustics ◽

10.1016/j.apacoust.2020.107238 ◽

2020 ◽

Vol 163 ◽

pp. 107238 ◽

Cited By ~ 3

Author(s):

Liping Yang ◽

Lianjie Tao ◽

Xinxing Chen ◽

Xiaohua Gu

Keyword(s):

Data Augmentation ◽

Feature Fusion ◽

Semantic Feature ◽

Scene Classification ◽

Multi Scale

Download Full-text

A Novel Data Augmentation Method for Sea Ice Scene Classification of Arctic Aerial Images

IEEE Access ◽

10.1109/access.2019.2929861 ◽

2019 ◽

Vol 7 ◽

pp. 104241-104249

Author(s):

Yiming Yan ◽

Chunming Zhang ◽

Nan Su

Keyword(s):

Sea Ice ◽

Data Augmentation ◽

Aerial Images ◽

Scene Classification

Download Full-text