scholarly journals Comparison of Acoustic and Auditory-Perceptual Evaluation in Sustained Vowel and Connected Speech by Glottal Closure Pattern in Voice Disorders

2020 ◽  
Vol 25 (3) ◽  
pp. 738-749
Author(s):  
Eun-Jung Lee ◽  
Jaeock Kim
1981 ◽  
Vol 46 (4) ◽  
pp. 428-432 ◽  
Author(s):  
David E. Hartman ◽  
Arnold E. Aronson

Case history and voice data for 17 patients who presented with intermittent moments of breathy dysphonia (IBD) in contextual speech were analyzed. From recorded samples of vowel prolongation four variants of phonation were identified. Of 13 patients examined neurologically, 10 (77%) had positive neurologic signs. Psychiatric and physical illnesses were often associated with onset of the voice disorder. Sex ratio of the patients in this study was nearly the same as that for adductor spastic dysphonia. Age at onset was similar to that which has been described for "functional" voice disorders. Different therapies were generally ineffective in alleviating the disorder. Although not conclusive, patients presenting with IBD in connected speech may have an underlying neurologic or psychologic disorders.


2020 ◽  
Vol 29 (3) ◽  
pp. 1596-1607 ◽  
Author(s):  
Olivia Murton ◽  
Robert Hillman ◽  
Daryush Mehta

Purpose The goal of this study was to employ frequently used analysis methods and tasks to identify values for cepstral peak prominence (CPP) that can aid clinical voice evaluation. Experiment 1 identified CPP values to distinguish speakers with and without voice disorders. Experiment 2 was an initial attempt to estimate auditory-perceptual ratings of overall dysphonia severity using CPP values. Method CPP was computed using the Analysis of Dysphonia in Speech and Voice (ADSV) program and Praat. Experiment 1 included recordings from 295 patients with medically diagnosed voice disorders and 50 vocally healthy control speakers. Speakers produced sustained /a/ vowels and the English language Rainbow Passage. CPP cutoff values that best distinguished patient and control speakers were identified. Experiment 2 analyzed recordings from 32 English speakers with varying dysphonia severity and provided preliminary validation of the Experiment 1 cutoffs. Speakers sustained the /a/ vowel and read four sentences from the Consensus Auditory-Perceptual Evaluation of Voice protocol. Trained listeners provided auditory-perceptual ratings of overall dysphonia for the recordings, which were estimated using CPP values in a linear regression model whose performance was evaluated using the coefficient of determination ( r 2 ). Results Experiment 1 identified CPP cutoff values of 11.46 dB (ADSV) and 14.45 dB (Praat) for the sustained /a/ vowels and 6.11 dB (ADSV) and 9.33 dB (Praat) for the Rainbow Passage. CPP values below those thresholds indicated the presence of a voice disorder with up to 94.5% accuracy. In Experiment 2, CPP values estimated ratings of overall dysphonia with r 2 values up to .74. Conclusions The CPP cutoff values identified in Experiment 1 provide normative reference points for clinical voice evaluation based on sustained /a/ vowels and the Rainbow Passage. Experiment 2 provides an initial predictive framework that can be used to relate CPP values to the auditory perception of overall dysphonia severity based on sustained /a/ vowels and Consensus Auditory-Perceptual Evaluation of Voice sentences.


2014 ◽  
Vol 28 (2) ◽  
pp. 210-215 ◽  
Author(s):  
Susana Vaz Freitas ◽  
Pedro Melo Pestana ◽  
Vítor Almeida ◽  
Aníbal Ferreira

2000 ◽  
Vol 109 (2) ◽  
pp. 204-209 ◽  
Author(s):  
Stellan Hertegård ◽  
Svante Granqvist ◽  
Per-Åke Lindestad

Fifteen patients, 13 women and 2 men, with a mean age of 72.7 years (56 to 86 years) and a clinical diagnosis of essential voice tremor, were treated with botulinum injections to the thyroarytenoid muscles, and in some cases, to the cricothyroid or thyrohyoid muscles. Evaluations were based on subjective judgments by the patients, and on perceptual and acoustic analysis of voice recordings. Subjective evaluations indicated that the treatment had a beneficial effect in 67% of the patients. Perceptual evaluations showed a significant decrease in voice tremor during connected speech (p < .05). Acoustic analysis showed a nearly significant decrease in the fundamental frequency variations (p = .06) and a significant decrease in fundamental frequency during sustained vowel phonation (p < .01). The results of perceptual evaluation coincided most closely with the subjective judgments. It was concluded that the treatment was successful in 50% to 65% of the patients, depending on the method of evaluation.


2019 ◽  
Vol 33 (5) ◽  
pp. 809.e1-809.e10 ◽  
Author(s):  
Shaheen N. Awan ◽  
Jackie L. Gartner-Schmidt ◽  
Lauren K. Timmons ◽  
Amanda I. Gillespie

2012 ◽  
Vol 24 (2) ◽  
pp. 107-112 ◽  
Author(s):  
Youri Maryn ◽  
Nelson Roy

PURPOSE: Auditory-perceptual evaluation of dysphonia may be influenced by the type of speech/voice task used to render judgements during the clinical evaluation, i.e., sustained vowels versus continuous speech. This study explored (a) differences in listener dysphonia severity ratings on the basis of speech/voice tasks, (b) the influence of speech/voice task on dysphonia severity ratings of stimuli that combined sustained vowels and continuous speech, and (c) the differences in inter-rater reliability of dysphonia severity ratings between both speech tasks. METHODS: Five experienced listeners rated overall dysphonia severity in sustained vowels, continuous speech and concatenated speech samples elicited by 39 subjects with various voice disorders and degrees of hoarseness. RESULTS: Data confirmed that sustained vowels are rated significantly more dysphonic than continuous speech. Furthermore, dysphonia severity in concatenated speech samples is least determined by the sustained vowel. Finally, no significant difference was found in inter-rater reliability between dysphonia severity ratings of sustained vowels versus continuous speech. CONCLUSION: Based upon the results, both types of speech/voice tasks (i.e., sustained vowel and continuous speech) should be elicited and judged by clinicians in the auditory-perceptual rating of dysphonia severity.


2015 ◽  
Vol 2015 ◽  
pp. 1-11 ◽  
Author(s):  
Tino Haderlein ◽  
Cornelia Schwemmle ◽  
Michael Döllinger ◽  
Václav Matoušek ◽  
Martin Ptok ◽  
...  

Due to low intra- and interrater reliability, perceptual voice evaluation should be supported by objective, automatic methods. In this study, text-based, computer-aided prosodic analysis and measurements of connected speech were combined in order to model perceptual evaluation of the German Roughness-Breathiness-Hoarseness (RBH) scheme. 58 connected speech samples (43 women and 15 men;48.7±17.8years) containing the German version of the text “The North Wind and the Sun” were evaluated perceptually by 19 speech and voice therapy students according to the RBH scale. For the human-machine correlation, Support Vector Regression with measurements of the vocal fold cycle irregularities (CFx) and the closed phases of vocal fold vibration (CQx) of the Laryngograph and 33 features from a prosodic analysis module were used to model the listeners’ ratings. The best human-machine results for roughness were obtained from a combination of six prosodic features and CFx (r=0.71,ρ=0.57). These correlations were approximately the same as the interrater agreement among human raters (r=0.65,ρ=0.61). CQx was one of the substantial features of the hoarseness model. For hoarseness and breathiness, the human-machine agreement was substantially lower. Nevertheless, the automatic analysis method can serve as the basis for a meaningful objective support for perceptual analysis.


1994 ◽  
Vol 37 (5) ◽  
pp. 985-1000 ◽  
Author(s):  
Guus de Krom

This study describes a perception experiment in which listeners were asked to rate voice fragments obtained from a variety of speakers on grade, breathiness, and roughness. Four different types of stimuli were presented to each listener. One type of stimulus was based on connected speech fragments; the other three were based on different segments of a sustained vowel, yielding a 200 msec vowel onset stimulus, a 200 msec post-onset stimulus, and a 1000 msec whole vowel stimulus. Analyses focused on the consistency and reliability of grade, roughness, and breathiness ratings. Results indicated that stimulus type had virtually no effect on either within- or between-listener consistency of the grade, breathiness, or roughness ratings. Rating reliability too was hardly influenced by stimulus type. When determined as a function of the overall degree of deviance of a voice, the reliability of breathiness and roughness ratings was slightly higher for whole vowel and vowel onset stimuli than for connected speech and post-onset stimuli. It is concluded that connected speech stimuli are not necessarily to be preferred over vowel-type stimuli for a perceptual evaluation of grade, roughness, or breathiness. The somewhat higher reliability of ratings on vowel onset and whole vowel stimuli as compared to the post-onset stimuli is taken as an indication that the onset part of a vowel may contain voice quality cues that are less salient in the most stable part of a vowel.


Sign in / Sign up

Export Citation Format

Share Document