Smartphone-based single-channel speech enhancement application for hearing aids

2021 ◽ Vol 150 (3) ◽ pp. 1663-1673
Author(s): Nikhil Shankar, Gautam Shreedhar Bhat, Issa M. S. Panahi, Stephanie Tittle, Linda M. Thibodeau

Electronics ◽ 2020 ◽ Vol 9 (10) ◽ pp. 1698
Author(s): Iordanis Thoidis, Lazaros Vrysis, Dimitrios Markou, George Papanikolaou

Perceptually motivated audio signal processing and feature extraction have played a key role in the analysis of high-level semantic processes and in the development of emerging systems and applications, such as mobile telecommunication and hearing aids. In the era of deep learning, speech enhancement methods based on neural networks have seen great success, operating mainly on log-power spectra. Although these approaches obviate the need for exhaustive feature extraction and selection, it remains unclear whether they target the sound characteristics that matter most for speech perception. In this study, we propose a novel set of auditory-motivated features for single-channel speech enhancement that fuses temporal envelope and temporal fine structure information in the context of vocoder-like processing. A causal gated recurrent unit (GRU) neural network is employed to recover the low-frequency amplitude modulations of speech. Experimental results indicate that the proposed system achieves considerable gains for normal-hearing and hearing-impaired listeners in terms of objective intelligibility and quality metrics. The proposed auditory-motivated feature set achieved better objective intelligibility than conventional log-magnitude spectrogram features, while mixed results were observed for simulated listeners with hearing loss. Finally, we demonstrate that the proposed analysis/synthesis framework provides satisfactory reconstruction accuracy of speech signals.
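As a rough illustration of the envelope/fine-structure decomposition behind such auditory-motivated features, the Python sketch below splits a signal into acoustic subbands and extracts each band's temporal envelope and temporal fine structure (TFS) via the Hilbert transform, then resynthesizes the signal vocoder-style. The band edges, filter order, and helper names (envelope_tfs_features, resynthesize) are illustrative assumptions, not the authors' exact configuration.

```python
# Minimal sketch of subband envelope / temporal-fine-structure (TFS) analysis.
# Band edges and filter order are assumed for illustration only.
import numpy as np
from scipy.signal import butter, sosfiltfilt, hilbert

def envelope_tfs_features(x, fs=16000, band_edges=(100, 500, 1000, 2000, 4000)):
    """Split x into subbands; return per-band temporal envelopes and TFS carriers."""
    envelopes, tfs = [], []
    for lo, hi in zip(band_edges[:-1], band_edges[1:]):
        sos = butter(4, [lo, hi], btype="bandpass", fs=fs, output="sos")
        band = sosfiltfilt(sos, x)
        analytic = hilbert(band)            # analytic signal of the subband
        envelopes.append(np.abs(analytic))  # temporal envelope (slow amplitude modulations)
        tfs.append(np.cos(np.angle(analytic)))  # temporal fine structure (carrier)
    return np.stack(envelopes), np.stack(tfs)

def resynthesize(envelopes, tfs):
    # Vocoder-like synthesis: modulate each TFS carrier by its (possibly enhanced)
    # envelope and sum across bands.
    return np.sum(envelopes * tfs, axis=0)
```

In this framing, a denoising network such as the causal GRU mentioned above would operate on the envelope trajectories, leaving the TFS carriers for resynthesis.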


2021 ◽  
Author(s):  
Maryam Hosseini ◽  
Luca Celotti ◽  
Eric Plourde

Single-channel speech enhancement algorithms have seen great improvements over the past few years. Despite these improvements, they still lack the efficiency of the auditory system in extracting attended auditory information in the presence of competing speakers. Recently, it has been shown that the attended auditory information can be decoded from the brain activity of the listener. In this paper, we propose two novel deep learning methods, referred to as the Brain Enhanced Speech Denoiser (BESD) and the U-shaped Brain Enhanced Speech Denoiser (U-BESD), which take advantage of this fact to denoise a multi-talker speech mixture. We use Feature-wise Linear Modulation (FiLM) between the brain activity and the sound mixture to better extract the features of the attended speaker and perform speech enhancement. We show, using electroencephalography (EEG) signals recorded from the listener, that U-BESD outperforms a current autoencoder approach in enhancing a speech mixture, as well as a speech separation approach that uses brain activity. Moreover, we show that both BESD and U-BESD successfully extract the attended speaker without any prior information about this speaker. This makes both algorithms strong candidates for realistic applications where no prior information about the attended speaker is available, such as hearing aids, cellphones, or noise-cancelling headphones. All procedures were performed in accordance with the Declaration of Helsinki and were approved by the Ethics Committees of the School of Psychology and the Health Sciences Faculty at Trinity College Dublin.
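For readers unfamiliar with FiLM, the sketch below shows the conditioning idea in isolation: EEG-derived features produce per-channel scale (gamma) and shift (beta) parameters that modulate an intermediate audio representation. The layer sizes, channel counts, and module name (FiLMConditioning) are illustrative assumptions and do not reproduce the BESD/U-BESD architecture in detail.

```python
# Minimal sketch of FiLM conditioning of audio features on EEG features.
# Dimensions and the simple linear projections are assumed for illustration only.
import torch
import torch.nn as nn

class FiLMConditioning(nn.Module):
    def __init__(self, eeg_channels=64, audio_channels=128):
        super().__init__()
        # Map EEG features to per-channel scale (gamma) and shift (beta).
        self.to_gamma = nn.Linear(eeg_channels, audio_channels)
        self.to_beta = nn.Linear(eeg_channels, audio_channels)

    def forward(self, audio_feats, eeg_feats):
        # audio_feats: (batch, audio_channels, time)
        # eeg_feats:   (batch, time, eeg_channels), time-aligned with the audio frames
        gamma = self.to_gamma(eeg_feats).transpose(1, 2)  # (batch, audio_channels, time)
        beta = self.to_beta(eeg_feats).transpose(1, 2)
        # Feature-wise affine modulation of the audio representation.
        return gamma * audio_feats + beta

# Usage sketch with hypothetical tensors: modulate an encoder output by the
# listener's EEG before a decoder reconstructs the attended speaker.
film = FiLMConditioning(eeg_channels=64, audio_channels=128)
audio = torch.randn(2, 128, 400)   # hypothetical audio encoder output
eeg = torch.randn(2, 400, 64)      # hypothetical EEG feature sequence
modulated = film(audio, eeg)       # (2, 128, 400)
```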


