scholarly journals Voice Pathology Detection and Classification Using Convolutional Neural Network Model

2020 ◽  
Vol 10 (11) ◽  
pp. 3723 ◽  
Author(s):  
Mazin Abed Mohammed ◽  
Karrar Hameed Abdulkareem ◽  
Salama A. Mostafa ◽  
Mohd Khanapi Abd Ghani ◽  
Mashael S. Maashi ◽  
...  

Voice pathology disorders can be effectively detected using computer-aided voice pathology classification tools. These tools can diagnose voice pathologies at an early stage and offering appropriate treatment. This study aims to develop a powerful feature extraction voice pathology detection tool based on Deep Learning. In this paper, a pre-trained Convolutional Neural Network (CNN) was applied to a dataset of voice pathology to maximize the classification accuracy. This study also proposes a distinguished training method combined with various training strategies in order to generalize the application of the proposed system on a wide range of problems related to voice disorders. The proposed system has tested using a voice database, namely the Saarbrücken voice database (SVD). The experimental results show the proposed CNN method for speech pathology detection achieves accuracy up to 95.41%. It also obtains 94.22% and 96.13% for F1-Score and Recall. The proposed system shows a high capability of the real-clinical application that offering a fast-automatic diagnosis and treatment solutions within 3 s to achieve the classification accuracy.

In this paper, the classification of normal controls (NC), very mild cognitive impairment and the early stage of Alzheimer’s disease (AD) known as mild cognitive impairment (MCI) from magnetic resonance imaging (MRI) is proposed, based on the two dimensional variational mode decomposition (2D-VMD) and deep convolutional neural network (DCNN). The 2D-VMD is applied to decompose the MRI scans into a discrete number of band limited intrinsic mode functions (BLIMFs). The automatic feature extraction, selection and optimization are performed using the proposed DCNN. The classification accuracy and learning speed of the 2D-VMD-DCNN method are compared with DCNN by taking the MRI data as input. The superior classification accuracy of the proposed 2D-VMD-DCNN method over DCNN method as well as other recently introduced prevalent methods is the major advantage for analyzing the biomedical images in the field of health care


Author(s):  
Wanli Wang ◽  
Botao Zhang ◽  
Kaiqi Wu ◽  
Sergey A Chepinskiy ◽  
Anton A Zhilenkov ◽  
...  

In this paper, a hybrid method based on deep learning is proposed to visually classify terrains encountered by mobile robots. Considering the limited computing resource on mobile robots and the requirement for high classification accuracy, the proposed hybrid method combines a convolutional neural network with a support vector machine to keep a high classification accuracy while improve work efficiency. The key idea is that the convolutional neural network is used to finish a multi-class classification and simultaneously the support vector machine is used to make a two-class classification. The two-class classification performed by the support vector machine is aimed at one kind of terrain that users are mostly concerned with. Results of the two classifications will be consolidated to get the final classification result. The convolutional neural network used in this method is modified for the on-board usage of mobile robots. In order to enhance efficiency, the convolutional neural network has a simple architecture. The convolutional neural network and the support vector machine are trained and tested by using RGB images of six kinds of common terrains. Experimental results demonstrate that this method can help robots classify terrains accurately and efficiently. Therefore, the proposed method has a significant potential for being applied to the on-board usage of mobile robots.


2021 ◽  
Vol 13 (3) ◽  
pp. 335
Author(s):  
Yuhao Qing ◽  
Wenyi Liu

In recent years, image classification on hyperspectral imagery utilizing deep learning algorithms has attained good results. Thus, spurred by that finding and to further improve the deep learning classification accuracy, we propose a multi-scale residual convolutional neural network model fused with an efficient channel attention network (MRA-NET) that is appropriate for hyperspectral image classification. The suggested technique comprises a multi-staged architecture, where initially the spectral information of the hyperspectral image is reduced into a two-dimensional tensor, utilizing a principal component analysis (PCA) scheme. Then, the constructed low-dimensional image is input to our proposed ECA-NET deep network, which exploits the advantages of its core components, i.e., multi-scale residual structure and attention mechanisms. We evaluate the performance of the proposed MRA-NET on three public available hyperspectral datasets and demonstrate that, overall, the classification accuracy of our method is 99.82 %, 99.81%, and 99.37, respectively, which is higher compared to the corresponding accuracy of current networks such as 3D convolutional neural network (CNN), three-dimensional residual convolution structure (RES-3D-CNN), and space–spectrum joint deep network (SSRN).


2018 ◽  
Vol 2018 ◽  
pp. 1-10 ◽  
Author(s):  
Saad Albawi ◽  
Oguz Bayat ◽  
Saad Al-Azawi ◽  
Osman N. Ucan

Recently, social touch gesture recognition has been considered an important topic for touch modality, which can lead to highly efficient and realistic human-robot interaction. In this paper, a deep convolutional neural network is selected to implement a social touch recognition system for raw input samples (sensor data) only. The touch gesture recognition is performed using a dataset previously measured with numerous subjects that perform varying social gestures. This dataset is dubbed as the corpus of social touch, where touch was performed on a mannequin arm. A leave-one-subject-out cross-validation method is used to evaluate system performance. The proposed method can recognize gestures in nearly real time after acquiring a minimum number of frames (the average range of frame length was from 0.2% to 4.19% from the original frame lengths) with a classification accuracy of 63.7%. The achieved classification accuracy is competitive in terms of the performance of existing algorithms. Furthermore, the proposed system outperforms other classification algorithms in terms of classification ratio and touch recognition time without data preprocessing for the same dataset.


2021 ◽  
Author(s):  
Yuki Shimizu ◽  
Shigeo Morimoto ◽  
Masayuki Sanada ◽  
Yukinori Inoue

The optimal design of interior permanent magnet synchronous motors requires a long time because finite element analysis (FEA) is performed repeatedly. To solve this problem, many researchers have used artificial intelligence to construct a prediction model that can replace FEA. However, because the training data are generated by FEA, it takes a very long time to obtain a sufficient amount of data, making it impossible to train a large-scale prediction model. Here, we propose a method for generating a large amount of data from a small number of FEA results using machine learning. An automatic design system with a deep generative model and a convolutional neural network is then constructed. With its sufficient data, the proposed system can handle three topologies and three motor parameters in a wide range of current vector regions. The proposed system was applied to multi-objective optimization design, with the optimization completed in 13-15 seconds.


2021 ◽  
Author(s):  
Yuki Shimizu ◽  
Shigeo Morimoto ◽  
Masayuki Sanada ◽  
Yukinori Inoue

The optimal design of interior permanent magnet synchronous motors requires a long time because finite element analysis (FEA) is performed repeatedly. To solve this problem, many researchers have used artificial intelligence to construct a prediction model that can replace FEA. However, because the training data are generated by FEA, it takes a very long time to obtain a sufficient amount of data, making it impossible to train a large-scale prediction model. Here, we propose a method for generating a large amount of data from a small number of FEA results using machine learning. An automatic design system with a deep generative model and a convolutional neural network is then constructed. With its sufficient data, the proposed system can handle three topologies and three motor parameters in a wide range of current vector regions. The proposed system was applied to multi-objective optimization design, with the optimization completed in 13-15 seconds.


2019 ◽  
Vol 2019 ◽  
pp. 1-16 ◽  
Author(s):  
Lian Zou ◽  
Shaode Yu ◽  
Tiebao Meng ◽  
Zhicheng Zhang ◽  
Xiaokun Liang ◽  
...  

This study reviews the technique of convolutional neural network (CNN) applied in a specific field of mammographic breast cancer diagnosis (MBCD). It aims to provide several clues on how to use CNN for related tasks. MBCD is a long-standing problem, and massive computer-aided diagnosis models have been proposed. The models of CNN-based MBCD can be broadly categorized into three groups. One is to design shallow or to modify existing models to decrease the time cost as well as the number of instances for training; another is to make the best use of a pretrained CNN by transfer learning and fine-tuning; the third is to take advantage of CNN models for feature extraction, and the differentiation of malignant lesions from benign ones is fulfilled by using machine learning classifiers. This study enrolls peer-reviewed journal publications and presents technical details and pros and cons of each model. Furthermore, the findings, challenges and limitations are summarized and some clues on the future work are also given. Conclusively, CNN-based MBCD is at its early stage, and there is still a long way ahead in achieving the ultimate goal of using deep learning tools to facilitate clinical practice. This review benefits scientific researchers, industrial engineers, and those who are devoted to intelligent cancer diagnosis.


2020 ◽  
Vol 10 (2) ◽  
pp. 84 ◽  
Author(s):  
Atif Mehmood ◽  
Muazzam Maqsood ◽  
Muzaffar Bashir ◽  
Yang Shuyuan

Alzheimer’s disease (AD) may cause damage to the memory cells permanently, which results in the form of dementia. The diagnosis of Alzheimer’s disease at an early stage is a problematic task for researchers. For this, machine learning and deep convolutional neural network (CNN) based approaches are readily available to solve various problems related to brain image data analysis. In clinical research, magnetic resonance imaging (MRI) is used to diagnose AD. For accurate classification of dementia stages, we need highly discriminative features obtained from MRI images. Recently advanced deep CNN-based models successfully proved their accuracy. However, due to a smaller number of image samples available in the datasets, there exist problems of over-fitting hindering the performance of deep learning approaches. In this research, we developed a Siamese convolutional neural network (SCNN) model inspired by VGG-16 (also called Oxford Net) to classify dementia stages. In our approach, we extend the insufficient and imbalanced data by using augmentation approaches. Experiments are performed on a publicly available dataset open access series of imaging studies (OASIS), by using the proposed approach, an excellent test accuracy of 99.05% is achieved for the classification of dementia stages. We compared our model with the state-of-the-art models and discovered that the proposed model outperformed the state-of-the-art models in terms of performance, efficiency, and accuracy.


Sign in / Sign up

Export Citation Format

Share Document