Acoustic Scenery Recognition Using CWT and Deep Neural Network

Mapping Intimacies ◽

10.3233/faia210029 ◽

2021 ◽

Author(s):

Francisco Mondragon ◽

Jonathan Jimenez ◽

Mariko Nakano ◽

Toru Nakashika ◽

Hector Perez-Meana

Keyword(s):

Neural Network ◽

Deep Neural Network ◽

Color Image ◽

Classification Performance ◽

Science And Engineering ◽

Time Frequency ◽

Frequency Representation ◽

Public Data ◽

Recognition Systems ◽

Color Map

The development of acoustic scenes recognition systems has been a topic of extensive research due to its applications in several fields of science and engineering. This paper proposes an environmental system in which firstly a time-frequency representation is obtained using the Continuous Wavelet Transform (CWT). The time frequency representation is then represented as a color image using the Viridis color map, which is then inserted into a Deep Neural Network (DNN) to carry out the classification task. Evaluation results using several public data bases show that proposed scheme provides a classification performance better than the performance provided by other previously proposed schemes.

Download Full-text

Time—Frequency Mask Estimation based on Deep Neural Network for Flexible Load Disaggregation in Buildings

IEEE Transactions on Smart Grid ◽

10.1109/tsg.2021.3066547 ◽

2021 ◽

pp. 1-1

Author(s):

Junho Song ◽

Yonggu Lee ◽

Euiseok Hwang

Keyword(s):

Neural Network ◽

Deep Neural Network ◽

Time Frequency ◽

Load Disaggregation ◽

Flexible Load ◽

Mask Estimation

Download Full-text

Optimized Multi-Model Biometric Based Human Authentication using Deep Neural Network

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c1066.1183s319 ◽

2019 ◽

Vol 8 (3S3) ◽

pp. 280-290

Keyword(s):

Neural Network ◽

Deep Neural Network ◽

Histogram Equalization ◽

Extraction Process ◽

Input Image ◽

Suggested Approach ◽

Biometric Systems ◽

Recognition Systems ◽

The Right ◽

Facial Identification

Biometrics provides greater security and usability than conventional personal authentication methods. Fingerprints, facial identification systems and voice recognition systems are the features that biometric systems can use. To improve biometric authentication, the proposed method considered that the input image is iris and fingerprint; at first, pre-processing is performed through histogram equalization for all image inputs to enhance the image quality. Then the extraction process of the feature will be performed. The suggested method uses modified Local Binary Pattern (MLBP), GLCM with orientation transformation, and DWT features next to the extracted features to be combined for feature extraction. Then the optimum function is found with the Rider Optimization Algorithm (ROA) for all MLBP, GLCM and DWT. Eventually, the approach suggested is accepted. Deep Neural Network (DNN) performs the proposed authentication process. A DNN is a multilayered artificial neural network between the layers of input and output. The DNN finds the right mathematical manipulation to turn the input into the output, whether it is an acknowledged image or not. Suggested process quality is measured in terms of reliability recognition. In the MATLAB platform, the suggested approach is implemented.

Download Full-text

Application of Continuous Wavelet Transform and Convolutional Neural Network in Decoding Motor Imagery Brain-Computer Interface

Entropy ◽

10.3390/e21121199 ◽

2019 ◽

Vol 21 (12) ◽

pp. 1199 ◽

Cited By ~ 5

Author(s):

Hyeon Kyu Lee ◽

Young-Seok Choi

Keyword(s):

Neural Network ◽

Wavelet Transform ◽

Convolutional Neural Network ◽

Motor Imagery ◽

Continuous Wavelet Transform ◽

Brain Computer Interface ◽

Classification Performance ◽

Computer Interface ◽

Continuous Wavelet ◽

Time Frequency

The motor imagery-based brain-computer interface (BCI) using electroencephalography (EEG) has been receiving attention from neural engineering researchers and is being applied to various rehabilitation applications. However, the performance degradation caused by motor imagery EEG with very low single-to-noise ratio faces several application issues with the use of a BCI system. In this paper, we propose a novel motor imagery classification scheme based on the continuous wavelet transform and the convolutional neural network. Continuous wavelet transform with three mother wavelets is used to capture a highly informative EEG image by combining time-frequency and electrode location. A convolutional neural network is then designed to both classify motor imagery tasks and reduce computation complexity. The proposed method was validated using two public BCI datasets, BCI competition IV dataset 2b and BCI competition II dataset III. The proposed methods were found to achieve improved classification performance compared with the existing methods, thus showcasing the feasibility of motor imagery BCI.

Download Full-text

Classification of coronary artery disease data sets by using a deep neural network

The EuroBiotech Journal ◽

10.24190/issn2564-615x/2017/04.03 ◽

2017 ◽

Vol 1 (4) ◽

pp. 271-277 ◽

Cited By ~ 8

Author(s):

Abdullah Caliskan ◽

Mehmet Emin Yuksel

Keyword(s):

Neural Network ◽

Coronary Artery Disease ◽

Coronary Artery ◽

Deep Neural Network ◽

Cost Effective ◽

Classification Performance ◽

Data Sets ◽

Neural Network Classifier ◽

Artery Disease

Abstract In this study, a deep neural network classifier is proposed for the classification of coronary artery disease medical data sets. The proposed classifier is tested on reference CAD data sets from the literature and also compared with popular representative classification methods regarding its classification performance. Experimental results show that the deep neural network classifier offers much better accuracy, sensitivity and specificity rates when compared with other methods. The proposed method presents itself as an easily accessible and cost-effective alternative to currently existing methods used for the diagnosis of CAD and it can be applied for easily checking whether a given subject under examination has at least one occluded coronary artery or not.

Download Full-text

Rolling bearings fault diagnosis and classification based on optimal wavelet subband selection and deep neural network

10.21203/rs.3.rs-342393/v1 ◽

2021 ◽

Author(s):

Rakesh Kumar Jha ◽

Preety D Swami

Keyword(s):

Neural Network ◽

Fault Diagnosis ◽

Deep Neural Network ◽

Vital Role ◽

Fault Classification ◽

Vibration Signals ◽

Softmax Classifier ◽

Practical Applications ◽

Time Frequency ◽

Diagnosis And Classification

Abstract Time-frequency analysis plays a vital role in fault diagnosis of nonstationary vibration signals acquired from mechanical systems. However, the practical applications face the challenges of continuous variation in speed and load. Apart from this, the disturbances introduced by noise are inevitable. This paper aims to develop a robust method for fault identification in bearings under varying speed, load and noisy conditions. An Optimal Wavelet Subband Deep Neural Network (OWS-DNN) technique is proposed that automatically extracts features from an optimal wavelet subband selected on the basis of Shannon entropy. After denoising the optimal subband, the optimal subbands are dimensionally reduced by the encoder section of an autoencoder. The output of the encoder can be considered as data features. Finally, softmax classifier is employed to classify the encoder output. The vibration signals were recorded on a machinery fault simulator setup for various combinations of speed and load for healthy and faulty bearings. The signals were subjected to various noise levels and the deep neural network was trained. The achieved experimental results reveal high accuracy in fault classification as compared to other techniques under comparison.

Download Full-text

Deep Neural Network Quantizers Outperforming Continuous Speech Recognition Systems

Speech and Computer - Lecture Notes in Computer Science ◽

10.1007/978-3-030-26061-3_54 ◽

2019 ◽

pp. 530-539

Author(s):

Tobias Watzel ◽

Lujun Li ◽

Ludwig Kürzinger ◽

Gerhard Rigoll

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Deep Neural Network ◽

Continuous Speech ◽

Continuous Speech Recognition ◽

Recognition Systems

Download Full-text

Object classification with convolution neural network based on the time-frequency representation of their echo

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) ◽

10.1109/mlsp.2017.8168134 ◽

2017 ◽

Cited By ~ 2

Author(s):

Mariia Dmitrieva ◽

Matias Valdenegro-Toro ◽

Keith Brown ◽

Gary Heald ◽

David Lane

Keyword(s):

Neural Network ◽

Object Classification ◽

Convolution Neural Network ◽

Time Frequency ◽

Frequency Representation

Download Full-text

Detection of ventricular arrhythmia using hybrid time–frequency-based features and deep neural network

Physical and Engineering Sciences in Medicine ◽

10.1007/s13246-020-00964-2 ◽

2021 ◽

Author(s):

Sukanta Sabut ◽

Om Pandey ◽

B. S. P. Mishra ◽

Monalisa Mohanty

Keyword(s):

Neural Network ◽

Ventricular Arrhythmia ◽

Deep Neural Network ◽

Time Frequency

Download Full-text

Invertible Grayscale with Sparsity Enforcing Priors

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3451993 ◽

2021 ◽

Vol 17 (3) ◽

pp. 1-17

Author(s):

Yong Du ◽

Yangyang Xu ◽

Taizhong Ye ◽

Qiang Wen ◽

Chufeng Xiao ◽

...

Keyword(s):

Neural Network ◽

Deep Neural Network ◽

Storage Capacity ◽

Color Image ◽

Grayscale Image ◽

Storage Space ◽

Color Information ◽

Compression Process ◽

The Arts ◽

Residual Image

Color dimensionality reduction is believed as a non-invertible process, as re-colorization results in perceptually noticeable and unrecoverable distortion. In this article, we propose to convert a color image into a grayscale image that can fully recover its original colors, and more importantly, the encoded information is discriminative and sparse, which saves storage capacity. Particularly, we design an invertible deep neural network for color encoding and decoding purposes. This network learns to generate a residual image that encodes color information, and it is then combined with a base grayscale image for color recovering. In this way, the non-differentiable compression process (e.g., JPEG) of the base grayscale image can be integrated into the network in an end-to-end manner. To further reduce the size of the residual image, we present a specific layer to enhance Sparsity Enforcing Priors (SEP), thus leading to negligible storage space. The proposed method allows color embedding on a sparse residual image while keeping a high, 35dB PSNR on average. Extensive experiments demonstrate that the proposed method outperforms state-of-the-arts in terms of image quality and tolerability to compression.

Download Full-text

Crossterm-Free Time-Frequency Representation Exploiting Deep Convolutional Neural Network

Signal Processing ◽

10.1016/j.sigpro.2021.108372 ◽

2021 ◽

pp. 108372

Author(s):

Shuimei Zhang ◽

Md. Saidur Rahman Pavel ◽

Yimin D. Zhang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Free Time ◽

Deep Convolutional Neural Network ◽

Time Frequency ◽

Frequency Representation

Download Full-text