Comparison of Acoustic and Auditory-Perceptual Evaluation in Sustained Vowel and Connected Speech by Glottal Closure Pattern in Voice Disorders

Case history and voice data for 17 patients who presented with intermittent moments of breathy dysphonia (IBD) in contextual speech were analyzed. From recorded samples of vowel prolongation four variants of phonation were identified. Of 13 patients examined neurologically, 10 (77%) had positive neurologic signs. Psychiatric and physical illnesses were often associated with onset of the voice disorder. Sex ratio of the patients in this study was nearly the same as that for adductor spastic dysphonia. Age at onset was similar to that which has been described for "functional" voice disorders. Different therapies were generally ineffective in alleviating the disorder. Although not conclusive, patients presenting with IBD in connected speech may have an underlying neurologic or psychologic disorders.

Download Full-text

Cepstral Peak Prominence Values for Clinical Voice Evaluation

American Journal of Speech-Language Pathology ◽

10.1044/2020_ajslp-20-00001 ◽

2020 ◽

Vol 29 (3) ◽

pp. 1596-1607 ◽

Cited By ~ 3

Author(s):

Olivia Murton ◽

Robert Hillman ◽

Daryush Mehta

Keyword(s):

English Language ◽

Voice Disorders ◽

Reference Points ◽

Voice Disorder ◽

Coefficient Of Determination ◽

Initial Attempt ◽

Perceptual Evaluation ◽

Cutoff Values ◽

Preliminary Validation ◽

Cepstral Peak Prominence

Purpose The goal of this study was to employ frequently used analysis methods and tasks to identify values for cepstral peak prominence (CPP) that can aid clinical voice evaluation. Experiment 1 identified CPP values to distinguish speakers with and without voice disorders. Experiment 2 was an initial attempt to estimate auditory-perceptual ratings of overall dysphonia severity using CPP values. Method CPP was computed using the Analysis of Dysphonia in Speech and Voice (ADSV) program and Praat. Experiment 1 included recordings from 295 patients with medically diagnosed voice disorders and 50 vocally healthy control speakers. Speakers produced sustained /a/ vowels and the English language Rainbow Passage. CPP cutoff values that best distinguished patient and control speakers were identified. Experiment 2 analyzed recordings from 32 English speakers with varying dysphonia severity and provided preliminary validation of the Experiment 1 cutoffs. Speakers sustained the /a/ vowel and read four sentences from the Consensus Auditory-Perceptual Evaluation of Voice protocol. Trained listeners provided auditory-perceptual ratings of overall dysphonia for the recordings, which were estimated using CPP values in a linear regression model whose performance was evaluated using the coefficient of determination ( r 2 ). Results Experiment 1 identified CPP cutoff values of 11.46 dB (ADSV) and 14.45 dB (Praat) for the sustained /a/ vowels and 6.11 dB (ADSV) and 9.33 dB (Praat) for the Rainbow Passage. CPP values below those thresholds indicated the presence of a voice disorder with up to 94.5% accuracy. In Experiment 2, CPP values estimated ratings of overall dysphonia with r 2 values up to .74. Conclusions The CPP cutoff values identified in Experiment 1 provide normative reference points for clinical voice evaluation based on sustained /a/ vowels and the Rainbow Passage. Experiment 2 provides an initial predictive framework that can be used to relate CPP values to the auditory perception of overall dysphonia severity based on sustained /a/ vowels and Consensus Auditory-Perceptual Evaluation of Voice sentences.

Download Full-text

Fundamental Frequency and Intensity Effects on Cepstral Measures in Vowels from Connected Speech of Speakers with Voice Disorders

Journal of Voice ◽

10.1016/j.jvoice.2019.11.014 ◽

2019 ◽

Cited By ~ 1

Author(s):

Marília Carvalho Sampaio ◽

Jörg Edgar Bohlender ◽

Meike Brockmann-Bauser

Keyword(s):

Fundamental Frequency ◽

Voice Disorders ◽

Connected Speech

Download Full-text

Audio-Perceptual Evaluation of Portuguese Voice Disorders—An Inter- and Intrajudge Reliability Study

Journal of Voice ◽

10.1016/j.jvoice.2013.08.001 ◽

2014 ◽

Vol 28 (2) ◽

pp. 210-215 ◽

Cited By ~ 8

Author(s):

Susana Vaz Freitas ◽

Pedro Melo Pestana ◽

Vítor Almeida ◽

Aníbal Ferreira

Keyword(s):

Voice Disorders ◽

Reliability Study ◽

Perceptual Evaluation

Download Full-text

Botulinum Toxin Injections for Essential Voice Tremor

Annals of Otology Rhinology & Laryngology ◽

10.1177/000348940010900216 ◽

2000 ◽

Vol 109 (2) ◽

pp. 204-209 ◽

Cited By ~ 64

Author(s):

Stellan Hertegård ◽

Svante Granqvist ◽

Per-Åke Lindestad

Keyword(s):

Botulinum Toxin ◽

Fundamental Frequency ◽

Acoustic Analysis ◽

Connected Speech ◽

Perceptual Evaluation ◽

Subjective Evaluations ◽

Subjective Judgments ◽

Method Of Evaluation ◽

Botulinum Toxin Injections ◽

Frequency Variations

Fifteen patients, 13 women and 2 men, with a mean age of 72.7 years (56 to 86 years) and a clinical diagnosis of essential voice tremor, were treated with botulinum injections to the thyroarytenoid muscles, and in some cases, to the cricothyroid or thyrohyoid muscles. Evaluations were based on subjective judgments by the patients, and on perceptual and acoustic analysis of voice recordings. Subjective evaluations indicated that the treatment had a beneficial effect in 67% of the patients. Perceptual evaluations showed a significant decrease in voice tremor during connected speech (p < .05). Acoustic analysis showed a nearly significant decrease in the fundamental frequency variations (p = .06) and a significant decrease in fundamental frequency during sustained vowel phonation (p < .01). The results of perceptual evaluation coincided most closely with the subjective judgments. It was concluded that the treatment was successful in 50% to 65% of the patients, depending on the method of evaluation.

Download Full-text

Effects of a Variably Occluded Face Mask on the Aerodynamic and Acoustic Characteristics of Connected Speech in Patients With and Without Voice Disorders

Journal of Voice ◽

10.1016/j.jvoice.2018.03.002 ◽

2019 ◽

Vol 33 (5) ◽

pp. 809.e1-809.e10 ◽

Cited By ~ 3

Author(s):

Shaheen N. Awan ◽

Jackie L. Gartner-Schmidt ◽

Lauren K. Timmons ◽

Amanda I. Gillespie

Keyword(s):

Voice Disorders ◽

Face Mask ◽

Connected Speech ◽

Acoustic Characteristics

Download Full-text

Sustained vowels and continuous speech in the auditory-perceptual evaluation of dysphonia severity

Jornal da Sociedade Brasileira de Fonoaudiologia ◽

10.1590/s2179-64912012000200003 ◽

2012 ◽

Vol 24 (2) ◽

pp. 107-112 ◽

Cited By ~ 48

Author(s):

Youri Maryn ◽

Nelson Roy

Keyword(s):

Clinical Evaluation ◽

Voice Disorders ◽

Continuous Speech ◽

Rater Reliability ◽

Perceptual Evaluation ◽

Significant Difference

PURPOSE: Auditory-perceptual evaluation of dysphonia may be influenced by the type of speech/voice task used to render judgements during the clinical evaluation, i.e., sustained vowels versus continuous speech. This study explored (a) differences in listener dysphonia severity ratings on the basis of speech/voice tasks, (b) the influence of speech/voice task on dysphonia severity ratings of stimuli that combined sustained vowels and continuous speech, and (c) the differences in inter-rater reliability of dysphonia severity ratings between both speech tasks. METHODS: Five experienced listeners rated overall dysphonia severity in sustained vowels, continuous speech and concatenated speech samples elicited by 39 subjects with various voice disorders and degrees of hoarseness. RESULTS: Data confirmed that sustained vowels are rated significantly more dysphonic than continuous speech. Furthermore, dysphonia severity in concatenated speech samples is least determined by the sustained vowel. Finally, no significant difference was found in inter-rater reliability between dysphonia severity ratings of sustained vowels versus continuous speech. CONCLUSION: Based upon the results, both types of speech/voice tasks (i.e., sustained vowel and continuous speech) should be elicited and judged by clinicians in the auditory-perceptual rating of dysphonia severity.

Download Full-text

Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis

Computational and Mathematical Methods in Medicine ◽

10.1155/2015/316325 ◽

2015 ◽

Vol 2015 ◽

pp. 1-11 ◽

Cited By ~ 4

Author(s):

Tino Haderlein ◽

Cornelia Schwemmle ◽

Michael Döllinger ◽

Václav Matoušek ◽

Martin Ptok ◽

...

Keyword(s):

Vocal Fold ◽

Voice Quality ◽

Support Vector ◽

Analysis Method ◽

Prosodic Features ◽

Connected Speech ◽

Perceptual Evaluation ◽

Vocal Fold Vibration ◽

The North ◽

Automatic Methods

Due to low intra- and interrater reliability, perceptual voice evaluation should be supported by objective, automatic methods. In this study, text-based, computer-aided prosodic analysis and measurements of connected speech were combined in order to model perceptual evaluation of the German Roughness-Breathiness-Hoarseness (RBH) scheme. 58 connected speech samples (43 women and 15 men;48.7±17.8years) containing the German version of the text “The North Wind and the Sun” were evaluated perceptually by 19 speech and voice therapy students according to the RBH scale. For the human-machine correlation, Support Vector Regression with measurements of the vocal fold cycle irregularities (CFx) and the closed phases of vocal fold vibration (CQx) of the Laryngograph and 33 features from a prosodic analysis module were used to model the listeners’ ratings. The best human-machine results for roughness were obtained from a combination of six prosodic features and CFx (r=0.71,ρ=0.57). These correlations were approximately the same as the interrater agreement among human raters (r=0.65,ρ=0.61). CQx was one of the substantial features of the hoarseness model. For hoarseness and breathiness, the human-machine agreement was substantially lower. Nevertheless, the automatic analysis method can serve as the basis for a meaningful objective support for perceptual analysis.

Download Full-text

Consistency and Reliability of Voice Quality Ratings for Different Types of Speech Fragments

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.3705.985 ◽

1994 ◽

Vol 37 (5) ◽

pp. 985-1000 ◽

Cited By ~ 76

Author(s):

Guus de Krom

Keyword(s):

Stimulus Type ◽

Voice Quality ◽

The Other ◽

Connected Speech ◽

Speech Stimuli ◽

Perceptual Evaluation ◽

Quality Ratings ◽

Different Types ◽

Stable Part ◽

Post Onset

This study describes a perception experiment in which listeners were asked to rate voice fragments obtained from a variety of speakers on grade, breathiness, and roughness. Four different types of stimuli were presented to each listener. One type of stimulus was based on connected speech fragments; the other three were based on different segments of a sustained vowel, yielding a 200 msec vowel onset stimulus, a 200 msec post-onset stimulus, and a 1000 msec whole vowel stimulus. Analyses focused on the consistency and reliability of grade, roughness, and breathiness ratings. Results indicated that stimulus type had virtually no effect on either within- or between-listener consistency of the grade, breathiness, or roughness ratings. Rating reliability too was hardly influenced by stimulus type. When determined as a function of the overall degree of deviance of a voice, the reliability of breathiness and roughness ratings was slightly higher for whole vowel and vowel onset stimuli than for connected speech and post-onset stimuli. It is concluded that connected speech stimuli are not necessarily to be preferred over vowel-type stimuli for a perceptual evaluation of grade, roughness, or breathiness. The somewhat higher reliability of ratings on vowel onset and whole vowel stimuli as compared to the post-onset stimuli is taken as an indication that the onset part of a vowel may contain voice quality cues that are less salient in the most stable part of a vowel.

Download Full-text