A new time-frequency binary mask estimation method based on convex optimization of speech power

In computational auditory scene analysis, the accurate estimation of binary mask or ratio mask plays a key role in noise masking. An inaccurate estimation often leads to some artifacts and temporal discontinuity in the synthesized speech. To overcome this problem, we propose a new ratio mask estimation method in terms of Wiener filtering in each Gammatone channel. In the reconstruction of Wiener filter, we utilize the relationship of the speech and noise power spectra in each Gammatone channel to build the objective function for the convex optimization of speech power. To improve the accuracy of estimation, the estimated ratio mask is further modified based on its adjacent time–frequency units, and then smoothed by interpolating with the estimated binary masks. The objective tests including the signal-to-noise ratio improvement, spectral distortion and intelligibility, and subjective listening test demonstrate the superiority of the proposed method compared with the reference methods.

Download Full-text

A convex optimization approach for time-frequency mask estimation

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) ◽

10.1109/waspaa.2017.8169989 ◽

2017 ◽

Author(s):

Feng Bao ◽

Waleed H. Abdulla

Keyword(s):

Convex Optimization ◽

Optimization Approach ◽

Time Frequency ◽

Mask Estimation

Download Full-text

A New Time-Frequency Attention Mechanism for TDNN and CNN-LSTM-TDNN, with Application to Language Identification

10.21437/interspeech.2019-1256 ◽

2019 ◽

Cited By ~ 3

Author(s):

Xiaoxiao Miao ◽

Ian McLoughlin ◽

Yonghong Yan

Keyword(s):

Attention Mechanism ◽

Language Identification ◽

Time Frequency ◽

New Time

Download Full-text

Time—Frequency Mask Estimation based on Deep Neural Network for Flexible Load Disaggregation in Buildings

IEEE Transactions on Smart Grid ◽

10.1109/tsg.2021.3066547 ◽

2021 ◽

pp. 1-1

Author(s):

Junho Song ◽

Yonggu Lee ◽

Euiseok Hwang

Keyword(s):

Neural Network ◽

Deep Neural Network ◽

Time Frequency ◽

Load Disaggregation ◽

Flexible Load ◽

Mask Estimation

Download Full-text

New Time-Frequency Transient Features for Nonintrusive Load Monitoring

Energies ◽

10.3390/en14051437 ◽

2021 ◽

Vol 14 (5) ◽

pp. 1437

Author(s):

Mahfoud Drouaz ◽

Bruno Colicchio ◽

Ali Moukadem ◽

Alain Dieterlen ◽

Djafar Ould-Abdeslam

Keyword(s):

Transient Signal ◽

Learning Tools ◽

Time Frequency ◽

Energy Measure ◽

Load Monitoring ◽

Feature Based ◽

Stockwell Transform ◽

Processing Techniques ◽

Equipment Laboratory ◽

New Time

A crucial step in nonintrusive load monitoring (NILM) is feature extraction, which consists of signal processing techniques to extract features from voltage and current signals. This paper presents a new time-frequency feature based on Stockwell transform. The extracted features aim to describe the shape of the current transient signal by applying an energy measure on the fundamental and the harmonic frequency voices. In order to validate the proposed methodology, classical machine learning tools are applied (k-NN and decision tree classifiers) on two existing datasets (Controlled On/Off Loads Library (COOLL) and Home Equipment Laboratory Dataset (HELD1)). The classification rates achieved are clearly higher than that for other related studies in the literature, with 99.52% and 96.92% classification rates for the COOLL and HELD1 datasets, respectively.

Download Full-text