Single channel speech enhancement using MMSE estimation of short-time modulation magnitude spectrum

Single-channel speech enhancement using spectral subtraction in the short-time modulation domain

Speech Communication ◽

10.1016/j.specom.2010.02.004 ◽

2010 ◽

Vol 52 (5) ◽

pp. 450-475 ◽

Cited By ~ 80

Author(s):

Kuldip Paliwal ◽

Kamil Wójcicki ◽

Belinda Schwerin

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Spectral Subtraction ◽

Time Modulation ◽

Modulation Domain ◽

Short Time

Download Full-text

Phase-Sensitive Decision-Directed SNR Estimator for Single-Channel Speech Enhancement

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001417580034 ◽

2017 ◽

Vol 31 (08) ◽

pp. 1758003

Author(s):

Shifeng Ou ◽

Peng Song ◽

Ying Gao

Keyword(s):

Speech Enhancement ◽

Speech Processing ◽

Single Channel ◽

Signal To Noise Ratio ◽

A Priori ◽

Processing System ◽

Phase Information ◽

Amplitude Spectra ◽

Phase Sensitive ◽

Short Time

The a priori signal-to-noise ratio (SNR) plays an essential role in many speech enhancement systems. Most of the existing approaches to estimate the a priori SNR only exploit the amplitude spectra while making the phase neglected. Considering the fact that incorporating phase information into a speech processing system can significantly improve the speech quality, this paper proposes a phase-sensitive decision-directed (DD) approach for the a priori SNR estimate. By representing the short-time discrete Fourier transform (STFT) signal spectra geometrically in a complex plane, the proposed approach estimates the a priori SNR using both the magnitude and phase information while making no assumptions about the phase difference between clean speech and noise spectra. Objective evaluations in terms of the spectrograms, segmental SNR, log-spectral distance (LSD) and short-time objective intelligibility (STOI) measures are presented to demonstrate the superiority of the proposed approach compared to several competitive methods at different noise conditions and input SNR levels.

Download Full-text

Single Channel Speech Enhancement Using Adaptive Soft-Thresholding with Bivariate EMD

ISRN Signal Processing ◽

10.1155/2013/724378 ◽

2013 ◽

Vol 2013 ◽

pp. 1-8 ◽

Cited By ~ 5

Author(s):

Md. Ekramul Hamid ◽

Md. Khademul Islam Molla ◽

Xin Dang ◽

Takayoshi Nakai

Keyword(s):

Speech Enhancement ◽

Speech Signal ◽

Single Channel ◽

Complex Signal ◽

Intrinsic Mode Functions ◽

Noisy Speech ◽

Mode Decomposition ◽

Data Adaptive ◽

Improved Performance ◽

Short Time

This paper presents a novel data adaptive thresholding approach to single channel speech enhancement. The noisy speech signal and fractional Gaussian noise (fGn) are combined to produce the complex signal. The fGn is generated using the noise variance roughly estimated from the noisy speech signal. Bivariate empirical mode decomposition (bEMD) is employed to decompose the complex signal into a finite number of complex-valued intrinsic mode functions (IMFs). The real and imaginary parts of the IMFs represent the IMFs of observed speech and fGn, respectively. Each IMF is divided into short time frames for local processing. The variance of IMF of fGn calculated within a frame is used as the reference term to classify corresponding noisy speech frame into noise and signal dominant frames. Only the noise dominant frames are soft-thresholded to reduce the noise effects. Then, all the frames as well as IMFs of speech are combined, yielding the enhanced speech signal. The experimental results show the improved performance of the proposed algorithm compared to the recently reported methods.

Download Full-text

Speech enhancement based on β-order MMSE estimation of Short Time Spectral Amplitude and Laplacian speech modeling

Speech Communication ◽

10.1016/j.specom.2014.12.002 ◽

2015 ◽

Vol 67 ◽

pp. 92-101 ◽

Cited By ~ 7

Author(s):

Hamid Reza Abutalebi ◽

Mehdi Rashidinejad

Keyword(s):

Speech Enhancement ◽

Spectral Amplitude ◽

Speech Modeling ◽

Mmse Estimation ◽

Short Time

Download Full-text

Data-Dependent Ensemble of Magnitude Spectrum Predictions for Single Channel Speech Enhancement

2019 IEEE 21st International Workshop on Multimedia Signal Processing (MMSP) ◽

10.1109/mmsp.2019.8901800 ◽

2019 ◽

Author(s):

Pasi Pertila

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Magnitude Spectrum

Download Full-text

Adaptive Single-Channel Speech Enhancement Method for a Push-To-Talk Enabled Wireless Communication Device

IEICE Transactions on Communications ◽

10.1587/transcom.2015ccp0023 ◽

2016 ◽

Vol E99.B (8) ◽

pp. 1745-1753

Author(s):

Hyoung-Gook KIM ◽

Jin Young KIM

Keyword(s):

Wireless Communication ◽

Speech Enhancement ◽

Single Channel ◽

Communication Device ◽

Enhancement Method

Download Full-text

Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement

10.21437/interspeech.2018-1439 ◽

2018 ◽

Author(s):

Li Chai ◽

Jun Du ◽

Chin-Hui Lee

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Deep Neural Network ◽

Single Channel ◽

Laplace Distribution ◽

Error Modeling ◽

Asymmetric Laplace Distribution

Download Full-text

Perceptual weighting deep neural networks for single-channel speech enhancement

2016 12th World Congress on Intelligent Control and Automation (WCICA) ◽

10.1109/wcica.2016.7578300 ◽

2016 ◽

Cited By ~ 2

Author(s):

Wei Han ◽

Xiongwei Zhang ◽

Gang Min ◽

Xingyu Zhou ◽

Wei Zhang

Keyword(s):

Neural Networks ◽

Speech Enhancement ◽

Deep Neural Networks ◽

Single Channel ◽

Perceptual Weighting

Download Full-text

A New Weighted Loss for Single Channel Speech Enhancement under Low Signal-to-Noise Ratio Environment

2020 15th IEEE International Conference on Signal Processing (ICSP) ◽

10.1109/icsp48669.2020.9320989 ◽

2020 ◽

Author(s):

Jian Xiao ◽

Hongqing Liu ◽

Yi Zhou ◽

Zhen Luo

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Signal To Noise Ratio ◽

Signal To Noise ◽

Noise Ratio

Download Full-text

Robust Constrained MFMVDR Filters for Single-Channel Speech Enhancement based on Spherical Uncertainty Set

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2020.3042013 ◽

2020 ◽

pp. 1-1

Author(s):

Doerte Fischer ◽

Simon Doclo

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Uncertainty Set

Download Full-text