The Effect of a Voice Activity Detector on the Speech Enhancement Performance of the Binaural Multichannel Wiener Filter

EURASIP Journal on Audio Speech and Music Processing ◽

10.1186/1687-4722-2010-840294 ◽

2010 ◽

Vol 2010 (1) ◽

pp. 840294 ◽

Cited By ~ 3

Author(s):

Jasmina Catic ◽

Torsten Dau ◽

JörgM Buchholz ◽

Fredrik Gran

Keyword(s):

Speech Enhancement ◽

Wiener Filter ◽

Voice Activity Detector ◽

Voice Activity

Download Full-text

A hybrid noise canceller with a real-time adaptive Wiener filter and a geometric-based voice-activity detector for an automotive application

International Journal of Adaptive Control and Signal Processing ◽

10.1002/acs.1146 ◽

2009 ◽

Vol 24 (6) ◽

pp. 508-522 ◽

Cited By ~ 3

Author(s):

T. Z. Qi ◽

T. J. Moir

Keyword(s):

Real Time ◽

Wiener Filter ◽

Automotive Application ◽

Voice Activity Detector ◽

Voice Activity

Download Full-text

Speech enhancement via Mel-scale Wiener filtering with a frequency-wise voice activity detector

Journal of Mechanical Science and Technology ◽

10.1007/bf02916349 ◽

2007 ◽

Vol 21 (5) ◽

pp. 708-722 ◽

Cited By ~ 3

Author(s):

Hwa Soo Kim ◽

Young Man Cho ◽

Han-Jun Kim

Keyword(s):

Speech Enhancement ◽

Wiener Filtering ◽

Voice Activity Detector ◽

Voice Activity

Download Full-text

Single-channel speech enhancement: Using recurrent neuro-fuzzy voice activity detector and spectral subtraction algorithms

2008 IEEE International Conference on Systems, Man and Cybernetics ◽

10.1109/icsmc.2008.4811764 ◽

2008 ◽

Author(s):

Fang-Chen Chuang ◽

Jeen-Shing Wang ◽

Li-Ying Wu

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Spectral Subtraction ◽

Voice Activity Detector ◽

Neuro Fuzzy ◽

Voice Activity

Download Full-text

Speech Enhancement for Secure Communication Using Coupled Spectral Subtraction and Wiener Filter

Electronics ◽

10.3390/electronics8080897 ◽

2019 ◽

Vol 8 (8) ◽

pp. 897 ◽

Cited By ~ 2

Author(s):

Hilman Pardede ◽

Kalamullah Ramli ◽

Yohan Suryanto ◽

Nur Hayati ◽

Alfan Presekal

Keyword(s):

Speech Enhancement ◽

Communication System ◽

Secure Communication ◽

Wiener Filter ◽

Speech Quality ◽

Spectral Subtraction ◽

Speech Signals ◽

Voice Activity Detector ◽

Noise Estimate

The encryption process for secure voice communication may degrade the speech quality when it is applied to the speech signals before encoding them through a conventional communication system such as GSM or radio trunking. This is because the encryption process usually includes a randomization of the speech signals, and hence, when the speech is decrypted, it may perceptibly be distorted, so satisfactory speech quality for communication is not achieved. To deal with this, we could apply a speech enhancement method to improve the quality of decrypted speech. However, many speech enhancement methods work by assuming noise is present all the time, so the voice activity detector (VAD) is applied to detect the non-speech period to update the noise estimate. Unfortunately, this assumption is not valid for the decrypted speech. Since the encryption process is applied only when speech is detected, distortions from the secure communication system are characteristically different. They exist when speech is present. Therefore, a noise estimator that is able to update noise even when speech is present is needed. However, most noise estimator techniques only adapt to slow changes of noise to avoid over-estimation of noise, making them unsuitable for this task. In this paper, we propose a speech enhancement technique to improve the quality of speech from secure communication. We use a combination of the Wiener filter and spectral subtraction for the noise estimator, so our method is better at tracking fast changes of noise without over-estimating them. Our experimental results on various communication channels indicate that our method is better than other popular noise estimators and speech enhancement methods.

Download Full-text

FPGA implementation of voice activity detector for efficient speech enhancement

2014 IEEE 12th International New Circuits and Systems Conference (NEWCAS) ◽

10.1109/newcas.2014.6934042 ◽

2014 ◽

Cited By ~ 3

Author(s):

Mourad Oukherfellah ◽

Mohammed Bahoura

Keyword(s):

Speech Enhancement ◽

Fpga Implementation ◽

Voice Activity Detector ◽

Voice Activity

Download Full-text

A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement

The Scientific World JOURNAL ◽

10.1155/2014/723643 ◽

2014 ◽

Vol 2014 ◽

pp. 1-8 ◽

Cited By ~ 2

Author(s):

Yan Zhang ◽

Zhen-min Tang ◽

Yan-ping Li ◽

Yang Luo

Keyword(s):

Speech Enhancement ◽

Speaker Recognition ◽

Wiener Filter ◽

Voice Activity Detection ◽

Activity Detection ◽

Hierarchical Framework ◽

Framework Approach ◽

Noisy Conditions ◽

Voice Activity ◽

Timit Database

Accurate and effective voice activity detection (VAD) is a fundamental step for robust speech or speaker recognition. In this study, we proposed a hierarchical framework approach for VAD and speech enhancement. The modified Wiener filter (MWF) approach is utilized for noise reduction in the speech enhancement block. For the feature selection and voting block, several discriminating features were employed in a voting paradigm for the consideration of reliability and discriminative power. Effectiveness of the proposed approach is compared and evaluated to other VAD techniques by using two well-known databases, namely, TIMIT database and NOISEX-92 database. Experimental results show that the proposed method performs well under a variety of noisy conditions.

Download Full-text

Hidden-Markov-model-based voice activity detector with high speech detection rate for speech enhancement

IET Signal Processing ◽

10.1049/iet-spr.2010.0282 ◽

2012 ◽

Vol 6 (1) ◽

pp. 54 ◽

Cited By ~ 13

Author(s):

H. Veisi ◽

H. Sameti

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Speech Enhancement ◽

Detection Rate ◽

Hidden Markov ◽

Speech Detection ◽

Voice Activity Detector ◽

Model Based ◽

Voice Activity

Download Full-text

Prediction of NMF-based Wiener Filter for Speech Enhancement Using Deep Neural Networks

2020 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) ◽

10.1109/icspcc50002.2020.9259477 ◽

2020 ◽

Author(s):

Zhigang Bai ◽

Changchun Bao ◽

Zihao Cui

Keyword(s):

Neural Networks ◽

Speech Enhancement ◽

Deep Neural Networks ◽

Wiener Filter

Download Full-text

Dual-Mic Speech Enhancement Based on TF-GSC with Leakage Suppression and Signal Recovery

Applied Sciences ◽

10.3390/app11062816 ◽

2021 ◽

Vol 11 (6) ◽

pp. 2816

Author(s):

Hansol Kim ◽

Jong Won Shin

Keyword(s):

Speech Enhancement ◽

Wiener Filter ◽

Signal Recovery ◽

Gain Function ◽

Microphone Signal ◽

Perceptual Evaluation ◽

Blocking Matrix ◽

Adaptive Noise ◽

Adaptive Noise Canceller ◽

Sidelobe Canceller

The transfer function-generalized sidelobe canceller (TF-GSC) is one of the most popular structures for the adaptive beamformer used in multi-channel speech enhancement. Although the TF-GSC has shown decent performance, a certain amount of steering error is inevitable, which causes leakage of speech components through the blocking matrix (BM) and distortion in the fixed beamformer (FBF) output. In this paper, we propose to suppress the leaked signal in the output of the BM and restore the desired signal in the FBF output of the TF-GSC. To reduce the risk of attenuating speech in the adaptive noise canceller (ANC), the speech component in the output of the BM is suppressed by applying a gain function similar to the square-root Wiener filter, assuming that a certain portion of the desired speech should be leaked into the BM output. Additionally, we propose to restore the attenuated desired signal in the FBF output by adding some of the microphone signal components back, depending on how microphone signals are related to the FBF and BM outputs. The experimental results showed that the proposed TF-GSC outperformed conventional TF-GSC in terms of the perceptual evaluation of speech quality (PESQ) scores under various noise conditions and the direction of arrivals for the desired and interfering sources.

Download Full-text