Comparison of Voice Analysis Systems for Perturbation Measurement

1996 ◽  
Vol 39 (1) ◽  
pp. 126-134 ◽  
Author(s):  
Steven Bielamowicz ◽  
Jody Kreiman ◽  
Bruce R. Gerratt ◽  
Marc S. Dauer ◽  
Gerald S. Berke

Dysphonic voices are often analyzed using automated voice analysis software. However, the reliability of acoustic measures obtained from these programs remains unknown, particularly when they are applied to pathological voices. This study compared perturbation measures from CSpeech, Computerized Speech Laboratory, SoundScope, and a hand marking voice analysis system. Sustained vowels from 29 male and 21 female speakers with mild to severe dysphonia were digitized, and fundamental frequency (F0), jitter, shimmer, and harmonics- or signal-to-noise ratios were computed. Commercially available acoustical analysis programs agreed well, but not perfectly, in their measures of F0. Measures of perturbation in the various analysis packages use different algorithms, provide results in different units, and often yield values for voices that violate the assumption of quasi-periodicity. As a result, poor rank order correlations between programs using similar measures of perturbation were noted. Because measures of aperiodicity apparently cannot be reliably applied to voices that are even mildly aperiodic, we question their utility in quantifying vocal quality, especially in pathological voices.
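To make the quantities in this abstract concrete, the following is a minimal sketch of one common textbook definition of cycle-to-cycle perturbation (mean absolute difference between consecutive cycles, relative to the mean) and of a Spearman rank-order correlation between two programs' outputs. It is not the algorithm of CSpeech, Computerized Speech Laboratory, or SoundScope; the abstract stresses that those packages use different algorithms and units. All function names and all data values are hypothetical.

```python
from statistics import mean


def local_perturbation_percent(cycle_values):
    """Mean absolute cycle-to-cycle difference, as a percentage of the mean value.

    Applied to cycle periods this gives a jitter-like figure; applied to
    cycle peak amplitudes it gives a shimmer-like figure (assumed definition).
    """
    if len(cycle_values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(cycle_values, cycle_values[1:])]
    return 100.0 * mean(diffs) / mean(cycle_values)


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rank-order correlation: Pearson correlation of the ranks."""
    if len(x) != len(y):
        raise ValueError("inputs must have the same length")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (var_x * var_y) ** 0.5


# Hypothetical per-cycle measurements from one sustained vowel.
periods_ms = [10.0, 10.2, 9.9, 10.1, 10.0]
peak_amplitudes = [0.80, 0.78, 0.82, 0.79, 0.81]
jitter_percent = local_perturbation_percent(periods_ms)
shimmer_percent = local_perturbation_percent(peak_amplitudes)

# Hypothetical jitter values for five voices from two different programs.
program_a = [0.4, 1.1, 2.5, 0.9, 3.0]
program_b = [0.5, 0.7, 2.0, 1.5, 2.8]
rho = spearman_rho(program_a, program_b)
```

A rank-order statistic suits this kind of comparison because it does not depend on the units each program reports, only on whether the programs order the voices the same way.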

1994 ◽  
Vol 103 (2) ◽  
pp. 145-148 ◽  
Author(s):  
Jack J. Jiang ◽  
David B. Wexler ◽  
Ingo R. Titze ◽  
Steven D. Gray

A submucosal fat autograft was implanted within the cover of injured vocal folds of 5 dogs. The implant occurred 6 weeks after unilateral mucosal excision had been performed. Three months postoperatively the larynges of these animals were excised and their phonation was compared to that of normal dog larynges and to other larynges with mucosal excision (but without fat grafting). Radiated acoustic pressure from the artificially driven larynges was recorded and digitized at 20 kHz with 16-bit resolution. Amplitude and fundamental frequency perturbations were extracted from a segment of phonation to assess the stability of the acoustic signals from the 3 groups. It was found that fat augmentation after mucosal excision reduced amplitude and frequency perturbation measures. There was no significant difference between fat-augmented and normal vocal folds. The acoustic measures were also positively correlated with phonation threshold and phonation efficiency measures reported earlier. The results suggest that submucosal fat autograft implantation within an injured vocal fold cover can restore not only the “ease” of phonation, but also the stability of phonation, which is a component of vocal quality.


1990 ◽  
Vol 33 (4) ◽  
pp. 761-775 ◽  
Author(s):  
Susan Nittrouer ◽  
Richard S. McGowan ◽  
Paul H. Milenkovic ◽  
Donna Beehler

Several acoustic measures of laryngeal activity were made on adult speech to help answer two questions left unresolved by previous work: (1) how each measure varies, if at all, with phonetic structure, and (2) what aspect of laryngeal activity each measure specifies. Speech samples of 15 syllables (three vowels in five prevocalic consonantal contexts) were collected from men and women at two times of the day (early morning and late afternoon). Eight measurements were made, mainly on slices extracted from the middle of the vocalic portions, and inferential and correlational statistics were applied to these measures. Results of the inferential tests indicated differences between men and women in how laryngeal adjustments are made, affecting relative amounts of vocal jitter and spectral tilt of the voicing source. In addition, the voicing and manner characteristics of the prevocalic consonant were found to affect fundamental frequency, cycle-to-cycle perturbations, and amount of aspiration noise. To a lesser extent, vowel height and front/back tongue placement also affected these acoustic source characteristics. Results of the correlational tests showed that different laryngeal mechanisms contributed differentially to signal-to-noise ratios for men and women, and these mechanisms were more greatly affected by fundamental frequency for men’s samples. Finally, various acoustic measures of laryngeal noise were found to be related to the same underlying mechanism.


2014 ◽  
Vol 57 (1) ◽  
pp. 26-45 ◽  
Author(s):  
Carlyn Burris ◽  
Houri K. Vorperian ◽  
Marios Fourakis ◽  
Ray D. Kent ◽  
Daniel M. Bolt

Purpose: This study examines accuracy and comparability of 4 trademarked acoustic analysis software packages (AASPs): Praat, WaveSurfer, TF32, and CSL by using synthesized and natural vowels. Features of AASPs are also described. Method: Synthesized and natural vowels were analyzed using each of the AASP's default settings to secure 9 acoustic measures: fundamental frequency (F0), formant frequencies (F1–F4), and formant bandwidths (B1–B4). The discrepancy between the software measured values and the input values (synthesized, previously reported, and manual measurements) was used to assess comparability and accuracy. Basic AASP features are described. Results: Results indicate that Praat, WaveSurfer, and TF32 generate accurate and comparable F0 and F1–F4 data for synthesized vowels and adult male natural vowels. Results varied by vowel for women and children, with some serious errors. Bandwidth measurements by AASPs were highly inaccurate as compared with manual measurements and published data on formant bandwidths. Conclusions: Values of F0 and F1–F4 are generally consistent and fairly accurate for adult vowels and for some child vowels using the default settings in Praat, WaveSurfer, and TF32. Manipulation of default settings yields improved output values in TF32 and CSL. Caution is recommended especially before accepting F1–F4 results for children and B1–B4 results for all speakers.


2020 ◽  
Vol 63 (11) ◽  
pp. 3855-3864
Author(s):  
Wanting Huang ◽  
Lena L. N. Wong ◽  
Fei Chen ◽  
Haihong Liu ◽  
Wei Liang

Purpose: Fundamental frequency (F0) is the primary acoustic cue for lexical tone perception in tonal languages but is processed in a limited way in cochlear implant (CI) systems. The aim of this study was to evaluate the importance of F0 contours in sentence recognition in Mandarin-speaking children with CIs and to determine whether it differs from that in age-matched normal-hearing (NH) peers. Method: Age-appropriate sentences, with F0 contours manipulated to be either natural or flattened, were randomly presented to preschool children with CIs and their age-matched peers with NH under three test conditions: in quiet, in white noise, and with competing sentences at 0 dB signal-to-noise ratio. Results: The neutralization of F0 contours resulted in a significant reduction in sentence recognition. While this was seen only in noise conditions among NH children, it was observed throughout all test conditions among children with CIs. Moreover, the F0 contour-induced accuracy reduction ratios (i.e., the reduction in sentence recognition resulting from the neutralization of F0 contours compared to the normal F0 condition) were significantly greater in children with CIs than in NH children in all test conditions. Conclusions: F0 contours play a major role in sentence recognition in both quiet and noise among pediatric implantees, and the contribution of the F0 contour is even more salient than that in age-matched NH children. These results also suggest that there may be differences between children with CIs and NH children in how F0 contours are processed.
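The abstract describes the accuracy reduction ratio only in words, so the sketch below assumes one plausible reading: the drop in recognition score from the natural to the flattened F0 condition, divided by the natural-condition score. The exact formula used in the study is not given here, and the function name and scores are hypothetical.

```python
def f0_reduction_ratio(natural_score, flattened_score):
    """Relative drop in sentence recognition when F0 contours are flattened.

    Assumed definition: (natural - flattened) / natural. Scores may be
    percent correct or proportions, as long as both use the same scale.
    """
    if natural_score <= 0:
        raise ValueError("natural-condition score must be positive")
    return (natural_score - flattened_score) / natural_score


# Hypothetical percent-correct scores for one listener in one test condition.
ratio = f0_reduction_ratio(natural_score=80.0, flattened_score=60.0)
```

Normalizing by the natural-condition score lets listeners with different baseline accuracy be compared on the same scale, which matters when comparing children with CIs against NH peers.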


Animals ◽  
2021 ◽  
Vol 11 (5) ◽  
pp. 1242
Author(s):  
Georg Haider ◽  
Ursula Schulz ◽  
Nikola Katic ◽  
Christian Peham ◽  
Gilles Dupré

Single-port access systems (SPASs) are currently used in human and veterinary surgeries. However, they pose technical challenges, such as instrument crowding, intra- and extracorporeal instrument collision, and reduced maneuverability. Studies comparing the maneuverability of the scopes and instruments in different SPASs are lacking. This study aimed to compare the maneuverability of three different SPASs: the Covidien SILS-port, Storz Endocone, and glove port. A clear acrylic box with artificial skin placed at the bottom was used to mimic the abdominal wall and cavity. The three SPASs were placed from below, and a 10-mm endoscope and 5-mm instrument were introduced. A motion analysis system consisting of 18 cameras and motion analysis software were used to track the movement of the endoscope and instrument, to determine the volume of the cone-shaped, three-dimensional figures over which movement was possible, with higher values indicating greater maneuverability. The Mann–Whitney U test was used for the analysis. The maneuverability of the endoscope alone was significantly higher in the glove port system than in the other two SPASs. When inserting an additional instrument, the maneuverability significantly decreased in the SILS-port and Endocone, but not in the glove port. The highest maneuverability overall was found in the glove port.


2011 ◽  
Vol 130-134 ◽  
pp. 2404-2408
Author(s):  
Jun Ma ◽  
Wen Ying Su

To reduce the heavy workload and the interference with normal traffic flow that performance testing of road traffic signs can involve, this paper presents a vehicle-mounted system that automatically tracks and analyzes the performance of traffic signs. The system consists of a carrying vehicle, GPS, IMU, area-array cameras, frame grabbers, data acquisition software, and data analysis software. Based on close-range photogrammetry, the system includes a set of automatic road traffic sign detection algorithms that measure and analyze properties of the signs, such as dimensions, headroom, and verticality of the column.


2021 ◽  
Author(s):  
Matthew S Binder ◽  
Zachary P Pranske ◽  
Joaquin N Lugo

Vocal communication is an essential behavior in mammals and is relevant to human neurodevelopmental conditions. Mice produce communicative vocalizations, known as ultrasonic vocalizations (USVs), that can be recorded with various programs. The Mouse Song Analyzer is an automated analysis system, while DeepSqueak is a semi-automated system. We used data from C57BL/6J, FVB.129, and FVB mice to compare whether the DeepSqueak and Mouse Song Analyzer systems measure a similar total number, duration, and fundamental frequency of USVs. We found that the two systems detected a similar quantity of USVs for FVB.129 mice (r = .90, p < .001), but displayed lower correlations for C57BL/6J (r = .76, p < .001) and FVB mice (r = .60, p < .001). We also found that DeepSqueak detected significantly more USVs for C57BL/6J mice than the Mouse Song Analyzer. The two systems detected a similar duration of USVs for C57BL/6J (r = .82, p < .001), but lower correlations for FVB.129 (r = .13, p < .001) and FVB mice (r = .51, p < .01) were found, with DeepSqueak detecting significantly more USVs for each strain. We found lower than acceptable correlations for fundamental frequency in C57BL/6J (r = .54, p < .01), FVB.129 (r = .76, p < .001), and FVB mice (r = .07, p = .76), with the Mouse Song Analyzer detecting a significantly higher fundamental frequency for FVB.129 mice. These findings demonstrate that the strain of mouse used significantly affects the number, duration, and fundamental frequency of USVs that are detected between programs. Overall, we found that DeepSqueak is more accurate than the Mouse Song Analyzer.


OTO Open ◽  
2019 ◽  
Vol 3 (3) ◽  
pp. 2473974X1986638
Author(s):  
Jacob I. Tower ◽  
Lynn Acton ◽  
Jessica Wolf ◽  
Walton Wilson ◽  
Nwanmegha Young

Objective: The purpose of this study was to investigate the effect of vocal training on acoustic and aerodynamic characteristics of student actors’ voices. Study Design: Prospective cohort study. Setting: Tertiary medical facility speech and swallow center. Subjects and Methods: Acoustic, aerodynamic, and Voice Handicap Index–10 measures were collected from 14 first-year graduate-level drama students before and after a standard vocal training program and analyzed for changes over time. Results: Among the aerodynamic measures that were collected, mean expiratory airflow was significantly reduced after vocal training. Among the acoustic measures that were collected, mean fundamental frequency was significantly increased after vocal training. On average, Voice Handicap Index–10 scores were unchanged after vocal training. Conclusion: The cohort of drama students undergoing vocal training demonstrated improvements in voice aerodynamics, which indicate enhanced glottal efficiency after training. The present study also found an increased average fundamental frequency among the actors during sustained voicing and no changes in jitter and shimmer despite frequent performance.


1991 ◽  
Vol 22 (3) ◽  
pp. 179-188 ◽  
Author(s):  
Richard D. Saniga ◽  
Margaret F. Carlin

In our society competing noise has become part of most everyday activities. Vocal abusers need to learn to compensate for this auditory distractor. The present paper describes a voice therapy program for adolescent vocal abusers that utilizes a varying signal-to-noise ratio. Once this compensation is learned, vocal abusers can maintain an appropriate fundamental frequency and vocal intensity in their speaking voice.


2019 ◽  
Vol 81 (6) ◽  
Author(s):  
A. Nazifah Abdullah ◽  
S. H. K. Hamadi ◽  
M. Isa ◽  
B. Ismail ◽  
A. N. Nanyan ◽  
...  

Partial discharge (PD) measurement is essential for detecting and diagnosing PD. However, this measurement is affected by noise in industrial environments. Thus, a PD analysis system using a discrete wavelet transform (DWT) denoising technique implemented in Laboratory Virtual Instrument Engineering Workbench (LabVIEW) software is proposed to distinguish noise from the measured PD signal. In this work, the performance of the denoising process is analyzed based on the calculated mean square error (MSE) and signal-to-noise ratio (SNR). Results are evaluated for Haar, Daubechies, Coiflets, Symlets, and Biorthogonal mother wavelets at different decomposition levels. Judged by SNR, all of these mother wavelets are suitable for denoising, since the SNR values are large and positive. Further study found that the db14, coif3, sym5, and bior5.5 wavelets, which gave the lowest MSE values, are good choices for the denoising technique. Of these, bior5.5 is proposed as the optimal mother wavelet because it consistently produces the minimum MSE, followed by db14.
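As a rough illustration of the evaluation described in this abstract, the sketch below applies a single-level Haar DWT with soft thresholding to a short signal and scores the result with MSE and SNR. It is written in plain Python, not LabVIEW, and is not the authors' implementation: the single decomposition level, the threshold value, the SNR definition (reference power over residual power, in dB), and the sample data are all assumptions made for the example.

```python
import math


def haar_forward(x):
    """One level of the orthonormal Haar DWT: (approximation, detail) coefficients."""
    if len(x) % 2:
        raise ValueError("signal length must be even")
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return approx, detail


def haar_inverse(approx, detail):
    """Invert one level of the orthonormal Haar DWT."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) / s, (a - d) / s])
    return out


def soft_threshold(coeffs, threshold):
    """Shrink each coefficient toward zero by `threshold`; small ones become zero."""
    return [math.copysign(max(abs(c) - threshold, 0.0), c) for c in coeffs]


def denoise(x, threshold):
    """Threshold the detail coefficients only, then reconstruct the signal."""
    approx, detail = haar_forward(x)
    return haar_inverse(approx, soft_threshold(detail, threshold))


def mse(reference, estimate):
    """Mean square error between a reference signal and an estimate."""
    return sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)


def snr_db(reference, estimate):
    """SNR in dB: reference power over residual power (assumed definition)."""
    noise = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    signal = sum(r * r for r in reference)
    if noise == 0:
        return math.inf
    return 10.0 * math.log10(signal / noise)


# Hypothetical clean signal and a noisy measurement of it.
clean = [2.0, 2.0, 2.0, 2.0, 5.0, 5.0, 5.0, 5.0]
noisy = [2.1, 1.9, 2.0, 2.2, 5.1, 4.8, 5.0, 5.1]
restored = denoise(noisy, threshold=0.5)  # threshold chosen arbitrarily

mse_before, mse_after = mse(clean, noisy), mse(clean, restored)
snr_before, snr_after = snr_db(clean, noisy), snr_db(clean, restored)
```

Repeating this comparison for each candidate mother wavelet and decomposition level, then ranking by MSE, mirrors the selection procedure the abstract describes; wavelets other than Haar would need a wavelet library or explicit filter coefficients.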

