Education in acoustics and speech science using vocal-tract models

2012 ◽  
Vol 131 (3) ◽  
pp. 2444-2454 ◽  
Author(s):  
Takayuki Arai
Keyword(s):  
Author(s):  
Asterios Toutios ◽  
Shrikanth S. Narayanan

Real-time magnetic resonance imaging (rtMRI) of the moving vocal tract during running speech production is an important emerging tool for speech production research providing dynamic information of a speaker's upper airway from the entire midsagittal plane or any other scan plane of interest. There have been several advances in the development of speech rtMRI and corresponding analysis tools, and their application to domains such as phonetics and phonological theory, articulatory modeling, and speaker characterization. An important recent development has been the open release of a database that includes speech rtMRI data from five male and five female speakers of American English each producing 460 phonetically balanced sentences. The purpose of the present paper is to give an overview and outlook of the advances in rtMRI as a tool for speech research and technology development.


Author(s):  
Tanner Sorensen ◽  
Zisis Skordilis ◽  
Asterios Toutios ◽  
Yoon-Chul Kim ◽  
Yinghua Zhu ◽  
...  
Keyword(s):  

Author(s):  
Byron D. Erath ◽  
Sean D. Peterson ◽  
Matias Zañartu ◽  
Michael W. Plesniak

Voiced speech involves complex fluid-structure-acoustic interactions. When a critical lung pressure is achieved, the vocal folds are pushed apart inciting self-sustained oscillations. The interplay between the aerodynamic forces and the myoelastic tissue properties produces robust oscillation of the vocal folds. The pulsatile nature of the flow as it emanates from vocal folds creates an oscillatory pressure field which acoustically excites the vocal tract and ultimately forms intelligible sound. Recently, it has been shown that the acoustic pressures are high enough in magnitude that they modulate the static fluid pressures which drive the flow.1 This coupling effect creates a feedback loop with the fluids, acoustics, and vocal fold dynamics becoming interconnected. Consequently, speech science investigations that aim to capture the relevant physics must consider all three components to yield credible, clinically-relevant results.


2020 ◽  
Vol 63 (4) ◽  
pp. 931-947
Author(s):  
Teresa L. D. Hardy ◽  
Carol A. Boliek ◽  
Daniel Aalto ◽  
Justin Lewicke ◽  
Kristopher Wells ◽  
...  

Purpose The purpose of this study was twofold: (a) to identify a set of communication-based predictors (including both acoustic and gestural variables) of masculinity–femininity ratings and (b) to explore differences in ratings between audio and audiovisual presentation modes for transgender and cisgender communicators. Method The voices and gestures of a group of cisgender men and women ( n = 10 of each) and transgender women ( n = 20) communicators were recorded while they recounted the story of a cartoon using acoustic and motion capture recording systems. A total of 17 acoustic and gestural variables were measured from these recordings. A group of observers ( n = 20) rated each communicator's masculinity–femininity based on 30- to 45-s samples of the cartoon description presented in three modes: audio, visual, and audio visual. Visual and audiovisual stimuli contained point light displays standardized for size. Ratings were made using a direct magnitude estimation scale without modulus. Communication-based predictors of masculinity–femininity ratings were identified using multiple regression, and analysis of variance was used to determine the effect of presentation mode on perceptual ratings. Results Fundamental frequency, average vowel formant, and sound pressure level were identified as significant predictors of masculinity–femininity ratings for these communicators. Communicators were rated significantly more feminine in the audio than the audiovisual mode and unreliably in the visual-only mode. Conclusions Both study purposes were met. Results support continued emphasis on fundamental frequency and vocal tract resonance in voice and communication modification training with transgender individuals and provide evidence for the potential benefit of modifying sound pressure level, especially when a masculine presentation is desired.


2020 ◽  
Vol 63 (1) ◽  
pp. 109-124
Author(s):  
Carly Jo Hosbach-Cannon ◽  
Soren Y. Lowell ◽  
Raymond H. Colton ◽  
Richard T. Kelley ◽  
Xue Bao

Purpose To advance our current knowledge of singer physiology by using ultrasonography in combination with acoustic measures to compare physiological differences between musical theater (MT) and opera (OP) singers under controlled phonation conditions. Primary objectives addressed in this study were (a) to determine if differences in hyolaryngeal and vocal fold contact dynamics occur between two professional voice populations (MT and OP) during singing tasks and (b) to determine if differences occur between MT and OP singers in oral configuration and associated acoustic resonance during singing tasks. Method Twenty-one singers (10 MT and 11 OP) were included. All participants were currently enrolled in a music program. Experimental procedures consisted of sustained phonation on the vowels /i/ and /ɑ/ during both a low-pitch task and a high-pitch task. Measures of hyolaryngeal elevation, tongue height, and tongue advancement were assessed using ultrasonography. Vocal fold contact dynamics were measured using electroglottography. Simultaneous acoustic recordings were obtained during all ultrasonography procedures for analysis of the first two formant frequencies. Results Significant oral configuration differences, reflected by measures of tongue height and tongue advancement, were seen between groups. Measures of acoustic resonance also showed significant differences between groups during specific tasks. Both singer groups significantly raised their hyoid position when singing high-pitched vowels, but hyoid elevation was not statistically different between groups. Likewise, vocal fold contact dynamics did not significantly differentiate the two singer groups. Conclusions These findings suggest that, under controlled phonation conditions, MT singers alter their oral configuration and achieve differing resultant formants as compared with OP singers. Because singers are at a high risk of developing a voice disorder, understanding how these two groups of singers adjust their vocal tract configuration during their specific singing genre may help to identify risky vocal behavior and provide a basis for prevention of voice disorders.


2020 ◽  
Vol 63 (4) ◽  
pp. 1018-1032
Author(s):  
Chia-Hsin Wu ◽  
Roger W. Chan

Purpose Semi-occluded vocal tract (SOVT) exercises with tubes or straws have been widely used for a variety of voice disorders. Yet, the effects of longer periods of SOVT exercises (lasting for weeks) on the aging voice are not well understood. This study investigated the effects of a 6-week straw phonation in water (SPW) exercise program. Method Thirty-seven elderly subjects with self-perceived voice problems were assigned into two groups: (a) SPW exercises with six weekly sessions and home practice (experimental group) and (b) vocal hygiene education (control group). Before and after intervention (2 weeks after the completion of the exercise program), acoustic analysis, auditory–perceptual evaluation, and self-assessment of vocal impairment were conducted. Results Analysis of covariance revealed significant differences between the two groups in smoothed cepstral peak prominence measures, harmonics-to-noise ratio, the auditory–perceptual parameter of breathiness, and Voice Handicap Index-10 scores postintervention. No significant differences between the two groups were found for other measures. Conclusions Our results supported the positive effects of SOVT exercises for the aging voice, with a 6-week SPW exercise program being a clinical option. Future studies should involve long-term follow-up and additional outcome measures to better understand the efficacy of SOVT exercises, particularly SPW exercises, for the aging voice.


2008 ◽  
Vol 18 (1) ◽  
pp. 31-40 ◽  
Author(s):  
David J. Zajac

Abstract The purpose of this opinion article is to review the impact of the principles and technology of speech science on clinical practice in the area of craniofacial disorders. Current practice relative to (a) speech aerodynamic assessment, (b) computer-assisted single-word speech intelligibility testing, and (c) behavioral management of hypernasal resonance are reviewed. Future directions and/or refinement of each area are also identified. It is suggested that both challenging and rewarding times are in store for clinical researchers in craniofacial disorders.


Sign in / Sign up

Export Citation Format

Share Document