Daubechies Wavelets Based Robust Audio Fingerprinting for Content-Based Audio Retrieval

2012 ◽  
Vol 4 (2) ◽  
pp. 49-69
Author(s):  
Wei Sun ◽  
Zhe-Ming Lu ◽  
Fa-Xin Yu ◽  
Rong-Jun Shen

Audio fingerprinting is the process to obtain a compact content-based signature that summarizes the essence of an audio clip. In general, existing audio fingerprinting schemes based on wavelet transforms are not robust against large linear speed changes. The authors present a novel framework for content-based audio retrieval based on the audio fingerprinting scheme that is robust against large linear speed changes. In the proposed scheme, 8 levels Daubechies wavelet decomposition is adopted for extracting time-frequency features and two fingerprint extraction algorithms are designed. The experimental results from this study are discussed further into the article.

Author(s):  
Wei Sun ◽  
Zhe-Ming Lu ◽  
Fa-Xin Yu ◽  
Rong-Jun Shen

Audio fingerprinting is the process to obtain a compact content-based signature that summarizes the essence of an audio clip. In general, existing audio fingerprinting schemes based on wavelet transforms are not robust against large linear speed changes. The authors present a novel framework for content-based audio retrieval based on the audio fingerprinting scheme that is robust against large linear speed changes. In the proposed scheme, 8 levels Daubechies wavelet decomposition is adopted for extracting time-frequency features and two fingerprint extraction algorithms are designed. The experimental results from this study are discussed further into the article.


Author(s):  
Eirik Berge

AbstractWe investigate the wavelet spaces $$\mathcal {W}_{g}(\mathcal {H}_{\pi })\subset L^{2}(G)$$ W g ( H π ) ⊂ L 2 ( G ) arising from square integrable representations $$\pi :G \rightarrow \mathcal {U}(\mathcal {H}_{\pi })$$ π : G → U ( H π ) of a locally compact group G. We show that the wavelet spaces are rigid in the sense that non-trivial intersection between them imposes strong restrictions. Moreover, we use this to derive consequences for wavelet transforms related to convexity and functions of positive type. Motivated by the reproducing kernel Hilbert space structure of wavelet spaces we examine an interpolation problem. In the setting of time–frequency analysis, this problem turns out to be equivalent to the HRT-conjecture. Finally, we consider the problem of whether all the wavelet spaces $$\mathcal {W}_{g}(\mathcal {H}_{\pi })$$ W g ( H π ) of a locally compact group G collectively exhaust the ambient space $$L^{2}(G)$$ L 2 ( G ) . We show that the answer is affirmative for compact groups, while negative for the reduced Heisenberg group.


2021 ◽  
Author(s):  
Jing Yuan ◽  
Zijie Wang ◽  
Dehe Yang ◽  
Qiao Wang ◽  
Zeren Zima ◽  
...  

<p>Lightning whistlers, found frequently in electromagnetic satellite observation, are the important tool to study electromagnetic environment of the earth space. With the increasing data from electromagnetic satellites, a considerable amount of time and human efforts are needed to detect lightning whistlers from these tremendous data. In recent years, algorithms for lightning whistlers automatic detection have been conducted. However, these methods can only work in the time-frequency profile (image) of the electromagnetic satellites data with two major limitations: vast storage memory for the time-frequency profile (image) and expensive computation for employing the methods to detect automatically the whistler from the time-frequency profile. These limitations hinder the methods work efficiently on ZH-1 satellite. To overcome the limitations and realize the real-time whistler detection automatically on board satellite, we propose a novel algorithm for detecting lightning whistler from the original observed data without transforming it to the time-frequency profile (image).</p><p>The motivation is that the frequency of lightning whistler is in the audio frequency range. It encourages us to utilize the speech recognition techniques to recognize the whistler in the original data \of SCM VLF Boarded on ZH-1. Firstly, we averagely move a 0.16 seconds window on the original data to obtain the patch data as the audio clip. Secondly, we extract the Mel-frequency cepstral coefficients (MFCCs) of the patch data as a type of cepstral representation of the audio clip. Thirdly, the MFCCs are input to the Long Short-Term Memory (LSTM) recurrent neutral networks to classification. To evaluate the proposed method, we construct the dataset composed of 10000 segments of SCM wave data observed from ZH-1 satellite(5000 segments which involving whistler and 5000 segments without any whistler). The proposed method can achieve 84% accuracy, 87% in recall, 85.6% in F1score.Furthermore, it can save more than 126.7MB and 0.82 seconds compared to the method employing the YOLOv3 neutral network for detecting whistler on each time-frequency profile.</p><p> </p><p>Key words: ZH-1 satellite, SCM,lightning whistler, MFCC, LSTM</p>


2009 ◽  
Vol 24 (4) ◽  
pp. 1083-1092 ◽  
Author(s):  
Ru Yang ◽  
Bo Zhang ◽  
Dongyuan Qiu ◽  
Zuolian Liu

2021 ◽  
Vol 315 ◽  
pp. 03022
Author(s):  
Ivan Chicherin ◽  
Boris Fedosenkov ◽  
Dmitriy Dubinkin ◽  
Wang Zhenbo

Introduction. Purpose of the work. Within the framework of the computer-aided system, a technology has been formed for the method of controlling the current trajectories (CTs) of unmanned vehicles (UMVs) when they move along routes in a quarry in open pit mining. The purpose of the presented studies is to analyze the application of a wavelet transforms technique to the problem of routing unmanned vehicles when they move along routes within open pit roads. Methodology. The results of modeling certain one-dimensional signals corresponding to the UMV current trajectories when they deviate to the left / right from a nominal axial trajectory (NAT), as well as their time-frequency representations in a wavelet medium are presented. An algorithm of the procedure for displaying scalar UMV CT control signals in a complex medium of time-frequency wavelet transforms has been developed and described. Such a transformation allows for a functionally transparent and information-capacious monitoring of the UMV movement and efficiently manage the processes of trajectory routing dump trucks in an open pit. Research results, analysis. The processes of modifying the UMV movement current trajectories under the control of the computer-aided system are generated using wavelet transforms methods. They are based on algorithms for projecting the trajectory signals with a time-dependent frequency (chirp signals) onto a set of wavelet functions as part of a wavelet thesaurus (wavelet dictionary), executing certain wavelet matching pursuit procedures, and displaying the CT scalar signals in a specific multidimensional medium of Cohen’s class time-frequency distributions. The simulation results in the form of the current trajectory (CT-) signals waveforms and their three-dimensional time-frequency representations as Wigner maps showing the UMV movement in a start-stop mode, as well as the signals of formed continuous deviation trajectories when they leave to the left and to the right from the NAT, are presented. An algorithm for the formation of 3D-representations of UMV current trajectory one-dimensional signals is presented. Conclusion. The conclusion is made that the mathematical technique of wavelet transforms is the most expedient and effective means for computer-aided monitoring and controlling the dynamics of UMV movement along routes within open pit roads.


Sign in / Sign up

Export Citation Format

Share Document