Acoustic Scenery Recognition Using CWT and Deep Neural Network

2021 ◽  
Author(s):  
Francisco Mondragon ◽  
Jonathan Jimenez ◽  
Mariko Nakano ◽  
Toru Nakashika ◽  
Hector Perez-Meana

The development of acoustic scenes recognition systems has been a topic of extensive research due to its applications in several fields of science and engineering. This paper proposes an environmental system in which firstly a time-frequency representation is obtained using the Continuous Wavelet Transform (CWT). The time frequency representation is then represented as a color image using the Viridis color map, which is then inserted into a Deep Neural Network (DNN) to carry out the classification task. Evaluation results using several public data bases show that proposed scheme provides a classification performance better than the performance provided by other previously proposed schemes.

Biometrics provides greater security and usability than conventional personal authentication methods. Fingerprints, facial identification systems and voice recognition systems are the features that biometric systems can use. To improve biometric authentication, the proposed method considered that the input image is iris and fingerprint; at first, pre-processing is performed through histogram equalization for all image inputs to enhance the image quality. Then the extraction process of the feature will be performed. The suggested method uses modified Local Binary Pattern (MLBP), GLCM with orientation transformation, and DWT features next to the extracted features to be combined for feature extraction. Then the optimum function is found with the Rider Optimization Algorithm (ROA) for all MLBP, GLCM and DWT. Eventually, the approach suggested is accepted. Deep Neural Network (DNN) performs the proposed authentication process. A DNN is a multilayered artificial neural network between the layers of input and output. The DNN finds the right mathematical manipulation to turn the input into the output, whether it is an acknowledged image or not. Suggested process quality is measured in terms of reliability recognition. In the MATLAB platform, the suggested approach is implemented.


Entropy ◽  
2019 ◽  
Vol 21 (12) ◽  
pp. 1199 ◽  
Author(s):  
Hyeon Kyu Lee ◽  
Young-Seok Choi

The motor imagery-based brain-computer interface (BCI) using electroencephalography (EEG) has been receiving attention from neural engineering researchers and is being applied to various rehabilitation applications. However, the performance degradation caused by motor imagery EEG with very low single-to-noise ratio faces several application issues with the use of a BCI system. In this paper, we propose a novel motor imagery classification scheme based on the continuous wavelet transform and the convolutional neural network. Continuous wavelet transform with three mother wavelets is used to capture a highly informative EEG image by combining time-frequency and electrode location. A convolutional neural network is then designed to both classify motor imagery tasks and reduce computation complexity. The proposed method was validated using two public BCI datasets, BCI competition IV dataset 2b and BCI competition II dataset III. The proposed methods were found to achieve improved classification performance compared with the existing methods, thus showcasing the feasibility of motor imagery BCI.


2017 ◽  
Vol 1 (4) ◽  
pp. 271-277 ◽  
Author(s):  
Abdullah Caliskan ◽  
Mehmet Emin Yuksel

Abstract In this study, a deep neural network classifier is proposed for the classification of coronary artery disease medical data sets. The proposed classifier is tested on reference CAD data sets from the literature and also compared with popular representative classification methods regarding its classification performance. Experimental results show that the deep neural network classifier offers much better accuracy, sensitivity and specificity rates when compared with other methods. The proposed method presents itself as an easily accessible and cost-effective alternative to currently existing methods used for the diagnosis of CAD and it can be applied for easily checking whether a given subject under examination has at least one occluded coronary artery or not.


2021 ◽  
Author(s):  
Rakesh Kumar Jha ◽  
Preety D Swami

Abstract Time-frequency analysis plays a vital role in fault diagnosis of nonstationary vibration signals acquired from mechanical systems. However, the practical applications face the challenges of continuous variation in speed and load. Apart from this, the disturbances introduced by noise are inevitable. This paper aims to develop a robust method for fault identification in bearings under varying speed, load and noisy conditions. An Optimal Wavelet Subband Deep Neural Network (OWS-DNN) technique is proposed that automatically extracts features from an optimal wavelet subband selected on the basis of Shannon entropy. After denoising the optimal subband, the optimal subbands are dimensionally reduced by the encoder section of an autoencoder. The output of the encoder can be considered as data features. Finally, softmax classifier is employed to classify the encoder output. The vibration signals were recorded on a machinery fault simulator setup for various combinations of speed and load for healthy and faulty bearings. The signals were subjected to various noise levels and the deep neural network was trained. The achieved experimental results reveal high accuracy in fault classification as compared to other techniques under comparison.


Author(s):  
Yong Du ◽  
Yangyang Xu ◽  
Taizhong Ye ◽  
Qiang Wen ◽  
Chufeng Xiao ◽  
...  

Color dimensionality reduction is believed as a non-invertible process, as re-colorization results in perceptually noticeable and unrecoverable distortion. In this article, we propose to convert a color image into a grayscale image that can fully recover its original colors, and more importantly, the encoded information is discriminative and sparse, which saves storage capacity. Particularly, we design an invertible deep neural network for color encoding and decoding purposes. This network learns to generate a residual image that encodes color information, and it is then combined with a base grayscale image for color recovering. In this way, the non-differentiable compression process (e.g., JPEG) of the base grayscale image can be integrated into the network in an end-to-end manner. To further reduce the size of the residual image, we present a specific layer to enhance Sparsity Enforcing Priors (SEP), thus leading to negligible storage space. The proposed method allows color embedding on a sparse residual image while keeping a high, 35dB PSNR on average. Extensive experiments demonstrate that the proposed method outperforms state-of-the-arts in terms of image quality and tolerability to compression.


Sign in / Sign up

Export Citation Format

Share Document