Short‐Time Spectrum and “Cepstrum” Techniques for Vocal‐Pitch Detection

Abstract: A method termed harmonic tracking is developed to recover time-dependent gear motion from machine casing vibration. The harmonic tracking method uses short-time spectral generation and a subsequent set of algorithms to locate and track gear meshing frequencies as functions of time. The meshing frequencies are then integrated with respect to time to obtain the rotation of individual gears. More specifically, spectral generation is performed using the discrete Fourier transform, and the locating and tracking algorithms locate tones in each short-time spectrum and track them through successive spectra to recover gear meshing harmonics. The harmonic tracking method is found to be more robust than demodulation-based methods in the presence of measurement noise and signal distortion from the structural transfer function between the gears and the casing. The harmonic tracking method is tested both through simulation and through experiments involving motor-operated valves (MOVs), as part of the development of a diagnostic system for MOVs. In all cases, the harmonic tracking method is found to recover gear motion with sufficient accuracy to perform diagnostics. The harmonic tracking method should be generally applicable to situations in which a non-invasive technique is required for determining the time-dependent angular speeds and displacements of gearbox input, intermediary, and output shafts.
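The locate, track, and integrate steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the frame length, the search width, the synthetic test tone, and the tooth count `TEETH` are all assumptions chosen for illustration, and tracking is reduced to picking the strongest DFT bin near the previous frame's peak.

```python
import cmath
import math

def dft_mag(frame, k):
    """Magnitude of DFT bin k of a Hann-windowed frame (direct summation)."""
    n_len = len(frame)
    acc = 0j
    for n, x in enumerate(frame):
        w = 0.5 - 0.5 * math.cos(2 * math.pi * n / n_len)  # Hann window
        acc += w * x * cmath.exp(-2j * math.pi * k * n / n_len)
    return abs(acc)

def track_tone(signal, fs, frame_len, search=3):
    """Return one frequency estimate (Hz) per non-overlapping frame."""
    freqs = []
    prev_bin = None
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        if prev_bin is None:
            # First frame: locate the strongest tone over all positive bins.
            bins = range(1, frame_len // 2)
        else:
            # Later frames: track by searching only near the previous peak.
            lo = max(1, prev_bin - search)
            hi = min(frame_len // 2 - 1, prev_bin + search)
            bins = range(lo, hi + 1)
        prev_bin = max(bins, key=lambda k: dft_mag(frame, k))
        freqs.append(prev_bin * fs / frame_len)
    return freqs

def integrate_cycles(freqs, fs, frame_len):
    """Integrate frequency over time (rectangle rule) to get elapsed cycles."""
    return sum(freqs) * frame_len / fs

# Synthetic "meshing" tone sweeping from 100 Hz to 140 Hz over 2 s (assumed data).
fs = 1000
signal = [math.cos(2 * math.pi * (100 * (n / fs) + 10 * (n / fs) ** 2))
          for n in range(2 * fs)]

FRAME_LEN = 200   # 0.2 s frames, 5 Hz bin spacing (assumption)
TEETH = 24        # hypothetical tooth count of the tracked gear

freqs = track_tone(signal, fs, FRAME_LEN)
cycles = integrate_cycles(freqs, fs, FRAME_LEN)
revolutions = cycles / TEETH  # meshing frequency = teeth x shaft frequency
print(freqs, cycles, revolutions)
```

The exact integral of the sweep is 240 meshing cycles, so the estimate should land near that value, limited by the 5 Hz bin spacing. A practical system would need overlapping frames, peak interpolation, and handling of several harmonics, none of which are shown here.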

Short‐Time “Cepstrum” Pitch Detection

The Journal of the Acoustical Society of America ◽

10.1121/1.2143271 ◽

1964 ◽

Vol 36 (5) ◽

pp. 1030-1030 ◽

Cited By ~ 19

Author(s):

A. M. Noll ◽

M. R. Schroeder

Keyword(s):

Pitch Detection ◽

Short Time

A new spectrum feature discovered for a category of signals produced by massive and random micro-sources

MATEC Web of Conferences ◽

10.1051/matecconf/201821005010 ◽

2018 ◽

Vol 210 ◽

pp. 05010

Author(s):

Xiaodong Zhuang ◽

Nikos Mastorakis

Keyword(s):

Statistical Study ◽

Amplitude Spectrum ◽

Time Spectrum ◽

Frequency Component ◽

Main Category ◽

Frequency Components ◽

Spectrum Feature ◽

Short Time ◽

The Relationship ◽

System Characteristics

A statistical study is performed on the short-time spectrum of one main category of random signals. For signals produced by massive and random micro-sources, a new statistical feature of the short-time amplitude spectrum is discovered, which reveals the relationship between the average of the amplitude and its standard deviation for each frequency component. Moreover, the association between the amplitude distributions of different frequency components is also studied. A model representing this association is presented, which accords well with the statistical feature discovered. The analysis result has potential applications in signal classification and in the study of the system characteristics underlying the observed signal.

Research on denoising method based on improved short-time spectrum estimation

2016 5th International Conference on Computer Science and Network Technology (ICCSNT) ◽

10.1109/iccsnt.2016.8070263 ◽

2016 ◽

Cited By ~ 1

Author(s):

Jian Kang ◽

Hongbo Wang

Keyword(s):

Time Spectrum ◽

Spectrum Estimation ◽

Denoising Method ◽

Short Time

Correction to "The 'unity-lagged' short-time spectrum of a narrow-band Gaussian process"

IEEE Transactions on Acoustics Speech and Signal Processing ◽

10.1109/tassp.1984.1164281 ◽

1984 ◽

Vol 32 (1) ◽

pp. 189-189

Keyword(s):

Gaussian Process ◽

Narrow Band ◽

Time Spectrum ◽

Short Time

The short-time spectrum analysis of real-time sampling speech with DSP TMS320VC5416 chip

10.1117/12.2030641 ◽

2013 ◽

Author(s):

Qinru Fan ◽

Wen-hua Ren

Keyword(s):

Real Time ◽

Spectrum Analysis ◽

Time Spectrum ◽

Time Sampling ◽

Short Time

Pitch Estimation using Time Domain Algorithm

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.j9733.0981119 ◽

2019 ◽

Vol 8 (11) ◽

pp. 1391-1394

Keyword(s):

Time Domain ◽

Vocal Folds ◽

Detection Algorithm ◽

Pitch Detection ◽

Pitch Estimation ◽

The Time Domain ◽

Short Time ◽

The Voice ◽

Magnitude Difference ◽

Average Magnitude Difference Function

Speech is classified into voiced, unvoiced, and silence segments. Voiced speech is produced by the periodic vibration of the vocal folds. Background noise affects speech signals. Pitch calculation plays a major role in many speech applications. The paper proposes a pitch detection algorithm based on the short-time average magnitude difference function (AMDF) and the short-term autocorrelation function (ACF). Detecting the pitch within the speech signal is important in most speech-related applications, and it is useful for speaker identification. One way to detect the pitch is to use time-domain algorithms. This paper describes the estimation and detection of pitch with time-domain algorithms for different voice samples.
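The two time-domain functions named in the abstract can be sketched as follows. This is a minimal illustration and not the paper's algorithm: the function names, the search range `fmin`/`fmax`, and the synthetic 200 Hz test frame are assumptions. The AMDF has a minimum, and the ACF a maximum, at a lag equal to the pitch period.

```python
import math

def amdf(frame, lag):
    """Average magnitude difference between the frame and a copy shifted by `lag`."""
    n = len(frame) - lag
    return sum(abs(frame[i] - frame[i + lag]) for i in range(n)) / n

def acf(frame, lag):
    """Short-time autocorrelation at `lag`, normalised by the overlap length."""
    n = len(frame) - lag
    return sum(frame[i] * frame[i + lag] for i in range(n)) / n

def estimate_pitch(frame, fs, fmin, fmax):
    """Return (f0 from AMDF, f0 from ACF) in Hz for one voiced frame.

    fmin and fmax bound the lag search; the caller chooses them (assumption).
    """
    lag_min = int(fs / fmax)
    lag_max = int(fs / fmin)
    lags = range(lag_min, lag_max + 1)
    best_amdf = min(lags, key=lambda k: amdf(frame, k))  # deepest valley
    best_acf = max(lags, key=lambda k: acf(frame, k))    # highest peak
    return fs / best_amdf, fs / best_acf

# Synthetic voiced frame: a 200 Hz sinusoid, 50 ms at 8 kHz (assumed test data).
fs = 8000
frame = [math.sin(2 * math.pi * 200.0 * n / fs) for n in range(400)]

f0_amdf, f0_acf = estimate_pitch(frame, fs, fmin=110.0, fmax=400.0)
print(f0_amdf, f0_acf)
```

The search range here is kept narrower than one octave below the true pitch on purpose: for a perfectly periodic frame, every multiple of the period is an equally good AMDF minimum, so a wider range would need an extra rule to prefer the shortest lag.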

An Investigation on Rolling Element Bearing Fault and Real-Time Spectrum Analysis by Using Short-Time Fourier Transform

Proceedings of International Conference on Recent Trends in Machine Learning, IoT, Smart Cities and Applications - Advances in Intelligent Systems and Computing ◽

10.1007/978-981-15-7234-0_52 ◽

2020 ◽

pp. 561-567

Author(s):

M. Siva Santhoshi ◽

K. Sharath Babu ◽

Sanjeev Kumar ◽

Durgesh Nandan

Keyword(s):

Fourier Transform ◽

Real Time ◽

Spectrum Analysis ◽

Time Spectrum ◽

Rolling Element Bearing ◽

Short Time Fourier Transform ◽

Bearing Fault ◽

Rolling Element ◽

Element Bearing ◽

Short Time

Robustness of Auditory Teager Energy Cepstrum Coefficients for Classification of Pathological and Normal Voices in Noisy Environments

The Scientific World JOURNAL ◽

10.1155/2013/435729 ◽

2013 ◽

Vol 2013 ◽

pp. 1-8 ◽

Cited By ~ 4

Author(s):

Lotfi Salhi ◽

Adnane Cherif

Keyword(s):

Auditory Processing ◽

Filter Bank ◽

Energy Operator ◽

Time Spectrum ◽

Neural System ◽

Noisy Environments ◽

Robust Feature Extraction ◽

Short Time ◽

Teager Energy

This paper focuses on a robust feature extraction algorithm for automatic classification of pathological and normal voices in noisy environments. The proposed algorithm is based on human auditory processing and the nonlinear Teager-Kaiser energy operator. The robust features, labeled Teager Energy Cepstrum Coefficients (TECCs), are computed in three steps. First, each speech signal frame is passed through a Gammatone or Mel-scale triangular filter bank. Then, the absolute value of the Teager energy operator of the short-time spectrum is calculated. Finally, the discrete cosine transform of the log-filtered Teager energy spectrum is applied. This feature is proposed to identify pathological voices using a neural system based on a multilayer perceptron (MLP). We evaluate the developed method using a mixed voice database composed of recorded voice samples from normophonic and dysphonic speakers. To show the robustness of the proposed feature in detecting pathological voices at different white Gaussian noise levels, we compare its performance with results for clean environments. The experimental results show that TECCs computed from the Gammatone filter bank are more robust in noisy environments than the other extracted features, while their performance is practically similar to that in clean environments.
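The operator and the final transform in the three steps above can be sketched in a few lines. This is a simplified illustration and not the paper's pipeline: the filter bank is not implemented, the pure tones standing in for band outputs and their centre frequencies are hypothetical, and the operator is applied to time-domain band signals, which may differ from the ordering the abstract describes. The discrete Teager-Kaiser operator is psi[n] = x[n]^2 - x[n-1] * x[n+1].

```python
import math

def teager_energy(x):
    """Discrete Teager-Kaiser operator: psi[n] = x[n]^2 - x[n-1] * x[n+1]."""
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]

def dct2(v):
    """Unnormalised DCT-II computed by direct summation."""
    size = len(v)
    return [
        sum(v[n] * math.cos(math.pi * k * (2 * n + 1) / (2 * size))
            for n in range(size))
        for k in range(size)
    ]

def teager_cepstrum(bands, eps=1e-12):
    """Mean |Teager energy| per band, then log, then DCT-II.

    `bands` is a list of band-limited signals. The Gammatone or Mel filter
    bank that would produce them is NOT implemented here (assumption).
    """
    energies = []
    for band in bands:
        psi = teager_energy(band)
        energies.append(sum(abs(p) for p in psi) / len(psi))
    return dct2([math.log(e + eps) for e in energies])

# Stand-in band outputs: one pure tone per band (hypothetical centres, in Hz).
fs = 8000
centres = [300.0, 600.0, 1200.0, 2400.0]
bands = [[math.cos(2 * math.pi * fc * n / fs) for n in range(240)]
         for fc in centres]

coeffs = teager_cepstrum(bands)

# For a tone A*cos(w*n) the operator output is the constant A^2 * sin(w)^2.
w = 2 * math.pi * centres[0] / fs
psi = teager_energy(bands[0])
print(coeffs, psi[0], math.sin(w) ** 2)
```

Because the operator output for a pure tone depends on both amplitude and frequency, the resulting energies weight higher-frequency bands more heavily than a plain squared-magnitude energy would.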

Acoustic signal processing based on the short‐time spectrum

The Journal of the Acoustical Society of America ◽

10.1121/1.2015754 ◽

1977 ◽

Vol 61 (S1) ◽

pp. S51-S52

Author(s):

M. W. Callahan

Keyword(s):

Signal Processing ◽

Acoustic Signal ◽

Time Spectrum ◽

Short Time
