Predicting Voice Quality of Deaf Speakers on the Basis of Glottal Characteristics

1990 ◽  
Vol 33 (1) ◽  
pp. 116-122 ◽  
Author(s):  
Nico Arends ◽  
Dirk-Jan Povel ◽  
Edward Van Os ◽  
Leo Speth

Twenty profoundly deaf and 5 normal-hearing subjects produced 225 sustained vowels /a, i, u/ at different pitch levels and 75 phonetically balanced sentences, while vocal fold vibration was recorded through an electrolaryngograph (ELG). The utterances recorded on audio tape were judged by 10 experienced listeners on general voice quality, breathiness, hoarseness, and laryngeal strain on a 5-point scale. Seven parameters describing time-domain characteristics of the ELG-signal were extracted online by a special purpose computer system. Measurements were made over 500 consecutive vibratory cycles (10-cycle window), yielding a mean and standard deviation for each parameter per utterance. All data were submitted to analysis of variance and multiple regression analyses. Multiple correlations between glottal parameters and judged voice deviations varied between .46 and .70 indicating that overall prediction cannot reliably be based on these parameters, although severe cases of deaf voice deviations may be detectable.

Author(s):  
Christine Ericsdotter Nordgren

Speech sounds are commonly divided into two main categories in human languages: vowels, such as ‘e’, ‘a’, ‘o’, and consonants, such as ‘k’, ‘n’, ‘s’. This division is made on the basis of both phonetic and phonological principles, which is useful from a general linguistic point of view but problematic for detailed description and analysis. The main differences between vowels and consonants are that (1) vowels are sounds produced with an open airway between the larynx and the lips, at least along the midline, whereas consonants are produced with a stricture or closure somewhere along it; and (2) that vowels tend to be syllabic in languages, meaning that they embody a sonorous peak in a syllable, whereas only some kinds of consonants tend to be syllabic. There are two main physical components needed to produce a vowel: a sound source, typically a tone produced by vocal fold vibration at the larynx, and a resonator, typically the upper airways. When the tone resonates in the upper airways, it gets a specific quality of sound, perceived and interpreted as a vowel quality, for example, ‘e’ or ‘a’. Which vowel quality is produced is determined by the shape of the inner space of the throat and mouth, the vocal tract shape, created by the speaker’s configuration of the articulators, which include the lips, tongue, jaw, hard and soft palate, pharynx, and larynx. Which vowel is perceived is determined by the auditory and visual input as well as by the listener’s expectations and language experience. Diachronic and synchronic studies on vowel typology show main trends in the vowel inventories in the worlds’ languages, which can be associated with human phonetic aptitude.


1999 ◽  
Vol 52 (1-3) ◽  
pp. 32-38 ◽  
Author(s):  
Seiji Niimi ◽  
Mamiko Miyaji

2018 ◽  
Vol 7 (4) ◽  
pp. 1-7
Author(s):  
Anna Kuligowska ◽  
Barbara Jamróz ◽  
Joanna Chmielewska ◽  
Katarzyna Jędra ◽  
Tomasz Czernicki ◽  
...  

Aim of study: Evaluation of the speech therapy on voice quality in patients with unilateral vocal fold palsy. Material and methods: The study group included 11 patients, 8 women and 3 men, in age between 16 to 72 years, with unilateral vocal fold palsy, diagnosed in ENT Department of Warsaw Medical University between 2017-2018. Each person completed questionnaires: the voice disability self-assessment scale (VHI), the voice-based quality of life (VRQoL) scale, the vocal tract discomfort scale (VTD). All questionnaires were completed twice, before and after the voice therapy. In addition, the acoustic analysis of the voice, the assessment of the maximum phonation time and the breathing tract were performed twice in each patient. Each of the patients had a voice rehabilitation consisting of a series of 10 meetings. Results: Statistical analysis of the results of maximum phonation time, the self-assessment of voice disability, the quality of life depending on the voice, discomfort of the vocal tract voice acoustic analysis showed statistically significant differences in the results before and after rehabilitation (p <0.005). In addition, the improvement of the respiratory tract was observed in the majority of patients. Conclusions: Speech therapy significantly affects the voice quality of patients with unilateral laryngeal nerve palsy.


2009 ◽  
Vol 123 (8) ◽  
pp. 873-876 ◽  
Author(s):  
R Pratap ◽  
P Mehta ◽  
B Blagnys ◽  
P Q Montgomery

AbstractBackground:The diagnosis and treatment of unilateral vocal fold palsy is a common part of otolaryngology practice. In those patients in whom resolution of symptoms is slow, the resulting dysphonia can have a dramatic effect on the patient's quality of voice and life. We have previously described the procedure of direct phonoplasty under local anaesthesia using the transnasal laryngoesophagoscope.Objective:To examine the subjective and objective data for the first five patients to undergo this procedure, in the form of laryngographic speech analysis, perceptual assessment and therapy outcome measures.Results:Analysis showed a statistically significant improvement in voice quality, in all the above assessment categories, following local anaesthetic direct phonoplasty using the transnasal laryngoesophagoscope.Conclusion:Collagen injection via transnasal flexible laryngoesophagoscopy is a particularly useful technique for treating vocal fold medialisation, especially in palliative care patients and those with shortened life expectancy.


2021 ◽  
Vol 26 (4) ◽  
pp. 921-932
Author(s):  
Ji Sung Kim ◽  
Seong Hee Choi ◽  
Kyoungjae Lee ◽  
Chul-Hee Choi ◽  
Soo-Geun Wang ◽  
...  

Objectives: The purpose of this study is to investigate the characteristics of vocal fold vibration during sustained vowel /a/ phonation and various semi-occluded vocal tract exercise (SOVTEs) using a vibration simulator and digital kymography (DKG).Methods: A total of 12 normal young speakers (6 males, 6 females) aged 20-30 years participated in the study. They phonated a sustained /a/ vowel and performed SOVTE. The vocal fold vibration characteristics were measured according to the number of vibration sources (single vs. double), and vocal tract occlusion degree using a vibration simulator and DKG. Glottal gap quotient (GQ, %), speed quotient (SQ, %) and amplitude (pixel) were estimated quantitatively from the DKG image.Results: The results showed that significantly higher GQ (p = .000) and SQ (p = .000) were observed in the humming and bilabial fricative /β/ compared to open vowels. The amplitude was significantly higher in the open vowel /a/ than in humming (p = .018) and bilabial fricative /β/ (p = .003). Also, when comparing the vocal fold vibration parameters according to vibration type (single source: straw phonation vs. double source: straw phonation with water), the double source presented a significantly higher GQ (p = .000) as well as SQ (p = .008) in comparison with a single source.Conclusion: SOVTE showed a glottal gap that is different from the opened vowel /a/. It also had a longer opening of the vocal fold and a smaller amplitude than the vowel. This suggests that SOVTE may be helpful for facilitating vocal fold vibration and good voice quality in clinical practice. The current study can be meaningful in providing theoretical and clinical evidence for SOVTE.


2002 ◽  
Vol 111 (6) ◽  
pp. 523-529 ◽  
Author(s):  
Petri Reijonen ◽  
Sari Lehikoinen-Söderlund ◽  
Heikki Rihkanen

The objective of this study was to evaluate the effects on voice quality of augmentation by injection of minced fascia in patients with unilateral vocal fold paralysis. Preoperative and postoperative voice samples from 14 patients (6 men and 8 women; mean age, 59 years) were analyzed by computerized acoustic analysis and blinded perceptual evaluation. Statistically significant improvements were seen in perturbation measurements (jitter and shimmer), noise-to-harmonics ratio, and maximum phonation time. A panel of evaluators rated 10 of the 14 postoperative voices as normal or near-normal. Injection laryngoplasty with minced fascia offers a new, effective, well-tolerated, and inexpensive method to medialize a paralyzed vocal fold. The graft seems to survive well, as indicated by good vocal results with a follow-up ranging from 5 to 32 months.


2016 ◽  
Vol 59 (5) ◽  
pp. 994-1001 ◽  
Author(s):  
Bruce R. Gerratt ◽  
Jody Kreiman ◽  
Marc Garellek

Purpose The question of what type of utterance—a sustained vowel or continuous speech—is best for voice quality analysis has been extensively studied but with equivocal results. This study examines whether previously reported differences derive from the articulatory and prosodic factors occurring in continuous speech versus sustained phonation. Method Speakers with voice disorders sustained vowels and read sentences. Vowel samples were excerpted from the steadiest portion of each vowel in the sentences. In addition to sustained and excerpted vowels, a 3rd set of stimuli was created by shortening sustained vowel productions to match the duration of vowels excerpted from continuous speech. Acoustic measures were made on the stimuli, and listeners judged the severity of vocal quality deviation. Results Sustained vowels and those extracted from continuous speech contain essentially the same acoustic and perceptual information about vocal quality deviation. Conclusions Perceived and/or measured differences between continuous speech and sustained vowels derive largely from voice source variability across segmental and prosodic contexts and not from variations in vocal fold vibration in the quasisteady portion of the vowels. Approaches to voice quality assessment by using continuous speech samples average across utterances and may not adequately quantify the variability they are intended to assess.


2015 ◽  
Vol 2015 ◽  
pp. 1-11 ◽  
Author(s):  
Tino Haderlein ◽  
Cornelia Schwemmle ◽  
Michael Döllinger ◽  
Václav Matoušek ◽  
Martin Ptok ◽  
...  

Due to low intra- and interrater reliability, perceptual voice evaluation should be supported by objective, automatic methods. In this study, text-based, computer-aided prosodic analysis and measurements of connected speech were combined in order to model perceptual evaluation of the German Roughness-Breathiness-Hoarseness (RBH) scheme. 58 connected speech samples (43 women and 15 men;48.7±17.8years) containing the German version of the text “The North Wind and the Sun” were evaluated perceptually by 19 speech and voice therapy students according to the RBH scale. For the human-machine correlation, Support Vector Regression with measurements of the vocal fold cycle irregularities (CFx) and the closed phases of vocal fold vibration (CQx) of the Laryngograph and 33 features from a prosodic analysis module were used to model the listeners’ ratings. The best human-machine results for roughness were obtained from a combination of six prosodic features and CFx (r=0.71,ρ=0.57). These correlations were approximately the same as the interrater agreement among human raters (r=0.65,ρ=0.61). CQx was one of the substantial features of the hoarseness model. For hoarseness and breathiness, the human-machine agreement was substantially lower. Nevertheless, the automatic analysis method can serve as the basis for a meaningful objective support for perceptual analysis.


Sign in / Sign up

Export Citation Format

Share Document