A Survey of Polyphonic Sound Event Detection Based on Non-Negative Matrix Factorization

Non-Negative Matrix Factorization-Convolutional Neural Network (NMF-CNN) for Sound Event Detection

Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019) ◽

10.33682/50ef-dx29 ◽

2019 ◽

Cited By ~ 1

Author(s):

Teck Kai Chan ◽

Cheng Siong Chin ◽

Ye Li

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Event Detection ◽

Matrix Factorization ◽

Sound Event ◽

Sound Event Detection ◽

Non Negative Matrix Factorization

Data-Dependent Feature Extraction Method Based on Non-Negative Matrix Factorization for Weakly Supervised Domestic Sound Event Detection

Applied Sciences ◽

10.3390/app11031040 ◽

2021 ◽

Vol 11 (3) ◽

pp. 1040

Author(s):

Seokjin Lee ◽

Minhan Kim ◽

Seunghyeon Shin ◽

Sooyoung Park ◽

Youngho Jeong

Keyword(s):

Feature Extraction ◽

Event Detection ◽

Matrix Factorization ◽

Extraction Method ◽

Extraction Methods ◽

Feature Extraction Method ◽

Sound Event ◽

Sound Event Detection ◽

Weakly Supervised ◽

Non Negative Matrix Factorization

In this paper, feature extraction methods based on the non-negative matrix factorization (NMF) algorithm are developed for weakly supervised sound event detection. Recently, various features and systems have been developed to tackle the problems of acoustic scene classification and sound event detection. However, most of these systems use data-independent spectral features, e.g., the Mel-spectrogram, log-Mel-spectrum, and gammatone filterbank. Some data-dependent feature extraction methods, including NMF-based methods, have recently demonstrated the potential to tackle these problems for long-term acoustic signals. We further develop the recently proposed NMF-based feature extraction method to enable its application in weakly supervised sound event detection. To achieve this goal, we develop a strategy for training the frequency basis matrix using a heterogeneous database consisting of strongly and weakly labeled data. Moreover, we develop a non-iterative version of the NMF-based feature extraction method so that it can be applied as part of the model structure, similar to the modern “on-the-fly” transform method for the Mel-spectrogram. To detect sound events, the temporal basis is calculated using the NMF method and then used as a feature for the mean-teacher-model-based classifier. The results are improved when the event-wise post-processing method is applied. To evaluate the proposed system, simulations of weakly supervised sound event detection were conducted using the Detection and Classification of Acoustic Scenes and Events 2020 Task 4 database. The results show that the proposed system achieves an F1-score comparable to that of the Mel-spectrogram and gammatonegram and performs 3–5% better than the log-Mel-spectrum and constant-Q transform.
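The idea of using the temporal basis as a feature can be illustrated with a minimal sketch. It assumes a magnitude spectrogram `V` (frequency × time) and an already trained frequency basis `W`, and estimates the activations `H` with the standard multiplicative update for the Euclidean cost. The function name `nmf_activations`, the matrix sizes, and the synthetic data are illustrative assumptions; the abstract's non-iterative variant and its basis-training strategy are not reproduced here.

```python
import numpy as np

def nmf_activations(V, W, n_iter=200, eps=1e-10, seed=0):
    """Estimate non-negative H with V ≈ W @ H while keeping W fixed.

    Uses the classic multiplicative update for the Euclidean cost.
    This is a generic iterative sketch, not the paper's non-iterative method.
    """
    rng = np.random.default_rng(seed)
    # Random positive initialisation of the temporal basis (activations).
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
    return H

# Synthetic example (assumed sizes): 64 frequency bins, 8 bases, 100 frames.
rng = np.random.default_rng(1)
W = rng.random((64, 8))          # stand-in for a trained frequency basis
H_true = rng.random((8, 100))
V = W @ H_true                   # stand-in for a magnitude spectrogram

H = nmf_activations(V, W)        # rows of H act as per-frame features
rel_err = np.linalg.norm(V - W @ H) / np.linalg.norm(V)

# Reconstruction error of the untouched random initialisation, for comparison.
H0 = nmf_activations(V, W, n_iter=0)
rel_err_init = np.linalg.norm(V - W @ H0) / np.linalg.norm(V)
```

In a detection pipeline, `H` (one column per frame) would be passed to the classifier in place of a fixed spectral feature such as the Mel-spectrogram.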

Overlapping sound event detection with supervised Nonnegative Matrix Factorization

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2017.7951792 ◽

2017 ◽

Cited By ~ 2

Author(s):

Victor Bisot ◽

Slim Essid ◽

Gael Richard

Keyword(s):

Event Detection ◽

Matrix Factorization ◽

Nonnegative Matrix Factorization ◽

Nonnegative Matrix ◽

Sound Event ◽

Sound Event Detection

Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2015.7177950 ◽

2015 ◽

Cited By ~ 35

Author(s):

Annamaria Mesaros ◽

Toni Heittola ◽

Onur Dikmen ◽

Tuomas Virtanen

Keyword(s):

Event Detection ◽

Matrix Factorization ◽

Real Life ◽

Sound Event ◽

Sound Event Detection ◽

Spectral Representations

Adaptive Noise Reduction for Sound Event Detection Using Subband-Weighted NMF

Sensors ◽

10.3390/s19143206 ◽

2019 ◽

Vol 19 (14) ◽

pp. 3206 ◽

Cited By ~ 8

Author(s):

Qing Zhou ◽

Zuren Feng ◽

Emmanouil Benetos

Keyword(s):

Noise Reduction ◽

Event Detection ◽

Reduction Method ◽

Time Varying ◽

Relative Importance ◽

Noisy Signal ◽

Sound Event ◽

Sound Event Detection ◽

Adaptive Noise ◽

Non Negative Matrix Factorization

Sound event detection in real-world environments suffers from interference by non-stationary and time-varying noise. This paper presents an adaptive noise reduction method for sound event detection based on non-negative matrix factorization (NMF). First, a noise dictionary is learned from the noisy input signal using robust NMF, which supports adaptation to noise variations. The estimated noise dictionary is combined with a pre-trained event dictionary to form a supervised source separation framework. Second, to improve the separation quality, we extend the basic NMF model to a weighted form that varies the relative importance of the different components when separating a target sound event from noise. With properly designed weights, the separation process is forced to rely more on the dominant event components, while the noise is strongly suppressed. The proposed method is evaluated on the dataset of the rare sound event detection task of the DCASE 2017 challenge and achieves results comparable to the top-ranking system based on convolutional recurrent neural networks (CRNNs). The proposed weighted NMF method shows excellent noise reduction ability and improves the F-score by 5% compared to the unweighted approach.
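The supervised separation step with a weighted NMF model can be sketched as follows. This is a minimal illustration, assuming a weighted Euclidean cost with one weight per frequency band, fixed event and noise dictionaries, and a soft mask for reconstruction. The function name `weighted_nmf_separate`, the weight values, and the synthetic data are hypothetical; the paper's actual weight design, cost function, and robust-NMF noise dictionary learning are not reproduced.

```python
import numpy as np

def weighted_nmf_separate(V, W_event, W_noise, band_weights,
                          n_iter=200, eps=1e-10, seed=0):
    """Separate an event estimate from V using fixed dictionaries.

    Minimises a per-band weighted Euclidean cost over the activations H
    (assumed cost; the paper's exact formulation may differ).
    """
    W = np.hstack([W_event, W_noise])        # combined dictionary
    lam = band_weights[:, None]              # one weight per frequency bin
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        # Multiplicative update for the weighted Euclidean cost.
        H *= (W.T @ (lam * V)) / (W.T @ (lam * (W @ H)) + eps)
    k = W_event.shape[1]
    event_part = W_event @ H[:k]
    noise_part = W_noise @ H[k:]
    # Soft mask keeps the event share of each time-frequency bin.
    mask = event_part / (event_part + noise_part + eps)
    return mask * V, H

# Synthetic example (assumed sizes): 32 bins, 4 event and 3 noise bases.
rng = np.random.default_rng(2)
W_event = rng.random((32, 4))
W_noise = rng.random((32, 3))
V = W_event @ rng.random((4, 50)) + W_noise @ rng.random((3, 50))

# Hypothetical weights that emphasise the lower bands.
band_weights = np.linspace(2.0, 0.5, 32)

event_est, H = weighted_nmf_separate(V, W_event, W_noise, band_weights)
```

Setting `band_weights` to all ones reduces this sketch to the unweighted case, which gives a simple baseline for comparison.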

A Transfer Learning Based Feature Extractor for Polyphonic Sound Event Detection Using Connectionist Temporal Classification

10.21437/interspeech.2017-1469 ◽

2017 ◽

Cited By ~ 2

Author(s):

Yun Wang ◽

Florian Metze

Keyword(s):

Transfer Learning ◽

Event Detection ◽

Sound Event ◽

Feature Extractor ◽

Sound Event Detection ◽

Connectionist Temporal Classification

An Effective Perturbation Based Semi-Supervised Learning Method for Sound Event Detection

10.21437/interspeech.2020-2329 ◽

2020 ◽

Author(s):

Xu Zheng ◽

Yan Song ◽

Jie Yan ◽

Li-Rong Dai ◽

Ian McLoughlin ◽

...

Keyword(s):

Supervised Learning ◽

Event Detection ◽

Learning Method ◽

Sound Event ◽

Sound Event Detection

Neural Network Distillation on IoT Platforms for Sound Event Detection

10.21437/interspeech.2019-2394 ◽

2019 ◽

Cited By ~ 3

Author(s):

Gianmarco Cerutti ◽

Rahul Prasad ◽

Alessio Brutti ◽

Elisabetta Farella

Keyword(s):

Neural Network ◽

Event Detection ◽

Sound Event ◽

Iot Platforms ◽

Sound Event Detection

Self-Training for Sound Event Detection in Audio Mixtures

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414450 ◽

2021 ◽

Author(s):

Sangwook Park ◽

Ashwin Bellur ◽

David K. Han ◽

Mounya Elhilali

Keyword(s):

Event Detection ◽

Sound Event ◽

Sound Event Detection

Sound Event Detection Based on Curriculum Learning Considering Learning Difficulty of Events

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414184 ◽

2021 ◽

Author(s):

Noriyuki Tonami ◽

Keisuke Imoto ◽

Yuki Okamoto ◽

Takahiro Fukumori ◽

Yoichi Yamashita

Keyword(s):

Event Detection ◽

Learning Difficulty ◽

Sound Event ◽

Sound Event Detection
