Personalized HRTF Modeling Based on Deep Neural Network Using Anthropometric Measurements and Images of the Ear

Geon Lee; Hong Kim

doi:10.3390/app8112180

Personalized HRTF Modeling Based on Deep Neural Network Using Anthropometric Measurements and Images of the Ear

Applied Sciences ◽

10.3390/app8112180 ◽

2018 ◽

Vol 8 (11) ◽

pp. 2180 ◽

Cited By ~ 3

Author(s):

Geon Lee ◽

Hong Kim

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Neural Network ◽

Subjective Evaluation ◽

Estimation Method ◽

Objective Evaluation ◽

Anthropometric Measurements ◽

Sound Sources ◽

Subjective Evaluations ◽

Localization Test

This paper proposes a personalized head-related transfer function (HRTF) estimation method based on deep neural networks by using anthropometric measurements and ear images. The proposed method consists of three sub-networks for representing personalized features and estimating the HRTF. As input features for neural networks, the anthropometric measurements regarding the head and torso are used for a feedforward deep neural network (DNN), and the ear images are used for a convolutional neural network (CNN). After that, the outputs of these two sub-networks are merged into another DNN for estimation of the personalized HRTF. To evaluate the performance of the proposed method, objective and subjective evaluations are conducted. For the objective evaluation, the root mean square error (RMSE) and the log spectral distance (LSD) between the reference HRTF and the estimated one are measured. Consequently, the proposed method provides the RMSE of −18.40 dB and LSD of 4.47 dB, which are lower by 0.02 dB and higher by 0.85 dB than the DNN-based method using anthropometric data without pinna measurements, respectively. Next, a sound localization test is performed for the subjective evaluation. As a result, it is shown that the proposed method can localize sound sources with higher accuracy of around 11% and 6% than the average HRTF method and DNN-based method, respectively. In addition, the reductions of the front/back confusion rate by 12.5% and 2.5% are achieved by the proposed method, compared to the average HRTF method and DNN-based method, respectively.

Download Full-text

Performance Estimation Method on IoT-Cloud Environments Using Hybrid Deep Neural Network

The International Journal of Internet of Things and its Applications ◽

10.21742/ijiota.2018.2.2.01 ◽

2018 ◽

Vol 2 (2) ◽

pp. 1-6

Author(s):

Yunsik Son ◽

Keyword(s):

Neural Network ◽

Deep Neural Network ◽

Estimation Method ◽

Performance Estimation ◽

Cloud Environments

Download Full-text

Deep neural networks using a single neuron: folded-in-time architecture using feedback-modulated delay loops

Nature Communications ◽

10.1038/s41467-021-25427-4 ◽

2021 ◽

Vol 12 (1) ◽

Author(s):

Florian Stelzer ◽

André Röhm ◽

Raul Vicente ◽

Ingo Fischer ◽

Serhiy Yanchuk

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Neural Network ◽

Single Neuron ◽

Deep Neural Networks ◽

Back Propagation ◽

Local Network ◽

Multiple Time ◽

Learning Tools ◽

Back Propagation Algorithm

AbstractDeep neural networks are among the most widely applied machine learning tools showing outstanding performance in a broad range of tasks. We present a method for folding a deep neural network of arbitrary size into a single neuron with multiple time-delayed feedback loops. This single-neuron deep neural network comprises only a single nonlinearity and appropriately adjusted modulations of the feedback signals. The network states emerge in time as a temporal unfolding of the neuron’s dynamics. By adjusting the feedback-modulation within the loops, we adapt the network’s connection weights. These connection weights are determined via a back-propagation algorithm, where both the delay-induced and local network connections must be taken into account. Our approach can fully represent standard Deep Neural Networks (DNN), encompasses sparse DNNs, and extends the DNN concept toward dynamical systems implementations. The new method, which we call Folded-in-time DNN (Fit-DNN), exhibits promising performance in a set of benchmark tasks.

Download Full-text

Deep neural network analysis of nanoparticle ordering to identify defects in layered carbon materials

Chemical Science ◽

10.1039/d0sc05696k ◽

2021 ◽

Author(s):

Daniil A. Boiko ◽

Evgeniy O. Pentsak ◽

Vera A. Cherepanova ◽

Evgeniy G. Gordeev ◽

Valentine P. Ananikov

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Analysis ◽

Carbon Material ◽

Deep Neural Network ◽

Carbon Materials ◽

Material Surface ◽

Neural Network Analysis ◽

Pd Nanoparticle ◽

Sem Imaging

Defectiveness of carbon material surface is a key issue for many applications. Pd-nanoparticle SEM imaging was used to highlight “hidden” defects and analyzed by neural networks to solve order/disorder classification and defect segmentation tasks.

Download Full-text

Estimating Traffic Volume on Minor Roads at Rural Stop-Controlled Intersections using Deep Learning

Transportation Research Record Journal of the Transportation Research Board ◽

10.1177/0361198119837236 ◽

2019 ◽

Vol 2673 (4) ◽

pp. 108-116 ◽

Cited By ~ 1

Author(s):

Mostafa H. Tawfeek ◽

Karim El-Basyouny

Keyword(s):

Neural Network ◽

Linear Regression ◽

Network Model ◽

Neural Network Model ◽

Deep Neural Network ◽

Safety Management ◽

Estimation Method ◽

Estimation Model ◽

The Neural Network ◽

Annual Average Daily Traffic

Safety Performance Functions (SPFs) are regression models used to predict the expected number of collisions as a function of various traffic and geometric characteristics. One of the integral components in developing SPFs is the availability of accurate exposure factors, that is, annual average daily traffic (AADT). However, AADTs are not often available for minor roads at rural intersections. This study aims to develop a robust AADT estimation model using a deep neural network. A total of 1,350 rural four-legged, stop-controlled intersections from the Province of Alberta, Canada, were used to train the neural network. The results of the deep neural network model were compared with the traditional estimation method, which uses linear regression. The results indicated that the deep neural network model improved the estimation of minor roads’ AADT by 35% when compared with the traditional method. Furthermore, SPFs developed using linear regression resulted in models with statistically insignificant AADTs on minor roads. Conversely, the SPF developed using the neural network provided a better fit to the data with both AADTs on minor and major roads being statistically significant variables. The findings indicated that the proposed model could enhance the predictive power of the SPF and therefore improve the decision-making process since SPFs are used in all parts of the safety management process.

Download Full-text

Inclusion of Multiple Cycling of the Potential into Deep Neural Network Classification of Voltammetric Reaction Mechanisms

Faraday Discussions ◽

10.1039/d1fd00050k ◽

2021 ◽

Author(s):

Luke Gundry ◽

Gareth Kennedy ◽

Alan Bond ◽

Jie Zhang

Keyword(s):

Neural Network ◽

Neural Networks ◽

Reaction Mechanisms ◽

Deep Neural Network ◽

Deep Neural Networks ◽

Neural Network Classification ◽

Initial Cycle

The use of Deep Neural Networks (DNNs) for the classification of electrochemical mechanisms based on training with simulations of the initial cycle of potential have been reported. In this paper,...

Download Full-text

Automated vessel segmentation in lung CT and CTA images via deep neural networks

Journal of X-Ray Science and Technology ◽

10.3233/xst-210955 ◽

2021 ◽

pp. 1-15

Author(s):

Wenjun Tan ◽

Luyu Zhou ◽

Xiaoshuo Li ◽

Xiaoyu Yang ◽

Yufei Chen ◽

...

Keyword(s):

Neural Network ◽

Computed Tomography ◽

Neural Networks ◽

Deep Neural Network ◽

Deep Neural Networks ◽

Vessel Segmentation ◽

Multi Scale ◽

Vascular Segmentation ◽

Study Results ◽

Lung Ct

BACKGROUND: The distribution of pulmonary vessels in computed tomography (CT) and computed tomography angiography (CTA) images of lung is important for diagnosing disease, formulating surgical plans and pulmonary research. PURPOSE: Based on the pulmonary vascular segmentation task of International Symposium on Image Computing and Digital Medicine 2020 challenge, this paper reviews 12 different pulmonary vascular segmentation algorithms of lung CT and CTA images and then objectively evaluates and compares their performances. METHODS: First, we present the annotated reference dataset of lung CT and CTA images. A subset of the dataset consisting 7,307 slices for training and 3,888 slices for testing was made available for participants. Second, by analyzing the performance comparison of different convolutional neural networks from 12 different institutions for pulmonary vascular segmentation, the reasons for some defects and improvements are summarized. The models are mainly based on U-Net, Attention, GAN, and multi-scale fusion network. The performance is measured in terms of Dice coefficient, over segmentation ratio and under segmentation rate. Finally, we discuss several proposed methods to improve the pulmonary vessel segmentation results using deep neural networks. RESULTS: By comparing with the annotated ground truth from both lung CT and CTA images, most of 12 deep neural network algorithms do an admirable job in pulmonary vascular extraction and segmentation with the dice coefficients ranging from 0.70 to 0.85. The dice coefficients for the top three algorithms are about 0.80. CONCLUSIONS: Study results show that integrating methods that consider spatial information, fuse multi-scale feature map, or have an excellent post-processing to deep neural network training and optimization process are significant for further improving the accuracy of pulmonary vascular segmentation.

Download Full-text

Full-Reference and No-Reference Objective Evaluation of Deep Neural Network Speech

2021 13th International Conference on Quality of Multimedia Experience (QoMEX) ◽

10.1109/qomex51781.2021.9465433 ◽

2021 ◽

Author(s):

Stephen Voran

Keyword(s):

Neural Network ◽

Deep Neural Network ◽

Objective Evaluation ◽

Full Reference

Download Full-text

A new deep neural network based on a stack of single-hidden-layer feedforward neural networks with randomly fixed hidden neurons

Neurocomputing ◽

10.1016/j.neucom.2015.06.017 ◽

2016 ◽

Vol 171 ◽

pp. 63-72 ◽

Cited By ~ 19

Author(s):

Junying Hu ◽

Jiangshe Zhang ◽

Chunxia Zhang ◽

Juan Wang

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Neural Network ◽

Feedforward Neural Networks ◽

Hidden Layer ◽

Hidden Neurons

Download Full-text

Web application to support evidence of individual emotional impact evoked by COVID-19 pandemic restrictions (Preprint)

10.2196/preprints.33021 ◽

2021 ◽

Author(s):

Hugo Mitre-Hernandez ◽

Rodolfo Ferro-Perez ◽

Francisco Gonzalez-Hernandez

Keyword(s):

Neural Network ◽

Mental Health ◽

Neural Networks ◽

Emotion Recognition ◽

Web Application ◽

Deep Neural Network ◽

Data Transfer ◽

Computational Cost ◽

Low Computational Cost ◽

The Web

BACKGROUND Mental health effects during COVID-19 quarantine need to be handled because patients, relatives, and healthcare workers are living with negative emotional behaviors. The clinical disorders of depression and anxiety are evoking anger, fear, sadness, disgust, and reducing happiness. Therefore, track emotions with the help of psychologists on online consultations –to reduce the risk of contagion– will go a long way in assisting with mental health. The human micro-expressions can describe genuine emotions of people and can be captured by Deep Neural Networks (DNNs) models. But the challenge is to implement it under the poor performance of a part of society's computers and the low speed of internet connection. OBJECTIVE This study aimed to create a useful and usable web application to record emotions in a patient’s card in real-time, achieving a small data transfer, and a Convolutional Neural Networks (CNN) model with a low computational cost. METHODS To validate the low computational cost premise, firstly, we compare DNN architectures results, collecting the floating-point operations per second (FLOPS), the Number of Parameters (NP) and accuracy from the MobileNet, PeleeNet, Extended Deep Neural Network (EDNN), Inception- Based Deep Neural Network (IDNN) and our proposed Residual mobile-based Network (ResmoNet) model. Secondly, we compare the trained models' results in terms of Main Memory Utilization (MMU) and Response Time to complete the Emotion recognition (RTE). Finally, we design a data transfer that includes the raw data of emotions and the basic text information of the patient. The web application was evaluated with the System Usability Scale (SUS) and a utility questionnaire by psychologists and psychiatrists (experts). RESULTS All CNN models were set up using 150 epochs for training and testing comparing the results for each variable in ResmoNet with the best model. It was obtained that ResmoNet has 115,976 NP less than MobileNet, 243,901 FLOPS less than MobileNet, and 5% less accuracy than EDNN (95%). Moreover, ResmoNet used less MMU than any model, only EDNN overcomes ResmoNet in 0.01 seconds for RTE. Finally, with our model, we develop a web application to collect emotions in real-time during a psychological consultation. For data transfer, the patient’s card and raw emotional data have 2 kb with a UTF-8 encoding approximately. Finally, according to the experts, the web application has good usability (73.8 of 100) and utility (3.94 of 5). CONCLUSIONS A usable and useful web application for psychologists and psychiatrists is presented. This tool includes an efficient and light facial emotion recognition model. Its purpose is to be a complementary tool for diagnostic processes.

Download Full-text

Extensive deep neural networks for transferring small scale learning to large scale systems

Chemical Science ◽

10.1039/c8sc04578j ◽

2019 ◽

Vol 10 (15) ◽

pp. 4129-4140 ◽

Cited By ~ 9

Author(s):

Kyle Mills ◽

Kevin Ryczko ◽

Iryna Luchak ◽

Adam Domurad ◽

Chris Beeler ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Large Scale ◽

Deep Neural Network ◽

Deep Neural Networks ◽

Small Scale ◽

Large Scale Systems ◽

Energy Entropy ◽

Large Systems ◽

Number Of Particles

We present a physically-motivated topology of a deep neural network that can efficiently infer extensive parameters (such as energy, entropy, or number of particles) of arbitrarily large systems, doing so with scaling.

Download Full-text