Computational Models for Speech Production

2018 ◽  
pp. 359-367
Phonology ◽  
2011 ◽  
Vol 28 (2) ◽  
pp. 197-234 ◽  
Author(s):  
Robert Daland ◽  
Bruce Hayes ◽  
James White ◽  
Marc Garellek ◽  
Andrea Davis ◽  
...  

AbstractThe term sonority projection refers to behavioural distinctions speakers make between unattested phonological sequences on the basis of sonority. For example, among onset clusters, the well-formedness relation [bn]>[lb] is observed in speech perception, speech production and non-word acceptability (Davidson 2006, 2007, Berent et al.2007, Albright, ms). We begin by replicating the sonority projection effects in a non-word acceptability study. Then we evaluate the extent to which sonority projection is predicted by existing computational models of phonotactics (Coleman & Pierrehumbert 1997, Hayes & Wilson 2008, inter alia). We show that a model based only on lexical statistics can explain sonority projection in English without a pre-existing sonority sequencing principle. To do this, a model must possess (i) a featural system supporting sonority-based generalisations, and (ii) a context representation including syllabification or equivalent information.


2010 ◽  
Vol 22 (7) ◽  
pp. 1504-1529 ◽  
Author(s):  
Jason W. Bohland ◽  
Daniel Bullock ◽  
Frank H. Guenther

Speakers plan the phonological content of their utterances before their release as speech motor acts. Using a finite alphabet of learned phonemes and a relatively small number of syllable structures, speakers are able to rapidly plan and produce arbitrary syllable sequences that fall within the rules of their language. The class of computational models of sequence planning and performance termed competitive queuing models have followed K. S. Lashley [The problem of serial order in behavior. In L. A. Jeffress (Ed.), Cerebral mechanisms in behavior (pp. 112–136). New York: Wiley, 1951] in assuming that inherently parallel neural representations underlie serial action, and this idea is increasingly supported by experimental evidence. In this article, we developed a neural model that extends the existing DIVA model of speech production in two complementary ways. The new model includes paired structure and content subsystems [cf. MacNeilage, P. F. The frame/content theory of evolution of speech production. Behavioral and Brain Sciences, 21, 499–511, 1998 ] that provide parallel representations of a forthcoming speech plan as well as mechanisms for interfacing these phonological planning representations with learned sensorimotor programs to enable stepping through multisyllabic speech plans. On the basis of previous reports, the model's components are hypothesized to be localized to specific cortical and subcortical structures, including the left inferior frontal sulcus, the medial premotor cortex, the basal ganglia, and the thalamus. The new model, called gradient order DIVA, thus fills a void in current speech research by providing formal mechanistic hypotheses about both phonological and phonetic processes that are grounded by neuroanatomy and physiology. This framework also generates predictions that can be tested in future neuroimaging and clinical case studies.


1992 ◽  
Vol 23 (3) ◽  
pp. 261-268 ◽  
Author(s):  
Alan G. Kamhi

My response to Fey’s article (1985; reprinted 1992, this issue) focuses on the confusion caused by the application of simplistic phonological definitions and models to the assessment and treatment of children with speech delays. In addition to having no explanatory adequacy, such definitions/models lead either to assessment and treatment procedures that are similarly focused or to procedures that have no clear logical ties to the models with which they supposedly are linked. Narrowly focused models and definitions also usually include no mention of speech production processes. Bemoaning this state of affairs, I attempt to show why it is important for clinicians to embrace broad-based models of phonological disorders that have some explanatory value. Such models are consistent with assessment procedures that are comprehensive in nature and treatment procedures that focus on linguistic, as well as motoric, aspects of speech.


1996 ◽  
Vol 5 (1) ◽  
pp. 23-32 ◽  
Author(s):  
Chris Halpin ◽  
Barbara Herrmann ◽  
Margaret Whearty

The family described in this article provides an unusual opportunity to relate findings from genetic, histological, electrophysiological, psychophysical, and rehabilitative investigation. Although the total number evaluated is large (49), the known, living affected population is smaller (14), and these are spread from age 20 to age 59. As a result, the findings described above are those of a large-scale case study. Clearly, more data will be available through longitudinal study of the individuals documented in the course of this investigation but, given the slow nature of the progression in this disease, such studies will be undertaken after an interval of several years. The general picture presented to the audiologist who must rehabilitate these cases is that of a progressive cochlear degeneration that affects only thresholds at first, and then rapidly diminishes speech intelligibility. The expected result is that, after normal language development, the patient may accept hearing aids well, encouraged by the support of the family. Performance and satisfaction with the hearing aids is good, until the onset of the speech intelligibility loss, at which time the patient will encounter serious difficulties and may reject hearing aids as unhelpful. As the histological and electrophysiological results indicate, however, the eighth nerve remains viable, especially in the younger affected members, and success with cochlear implantation may be expected. Audiologic counseling efforts are aided by the presence of role models and support from the other affected members of the family. Speech-language pathology services were not considered important by the members of this family since their speech production developed normally and has remained very good. Self-correction of speech was supported by hearing aids and cochlear implants (Case 5’s speech production was documented in Perkell, Lane, Svirsky, & Webster, 1992). These patients received genetic counseling and, due to the high penetrance of the disease, exhibited serious concerns regarding future generations and the hope of a cure.


2019 ◽  
Vol 62 (7) ◽  
pp. 2099-2117 ◽  
Author(s):  
Jason A. Whitfield ◽  
Zoe Kriegel ◽  
Adam M. Fullenkamp ◽  
Daryush D. Mehta

Purpose Prior investigations suggest that simultaneous performance of more than 1 motor-oriented task may exacerbate speech motor deficits in individuals with Parkinson disease (PD). The purpose of the current investigation was to examine the extent to which performing a low-demand manual task affected the connected speech in individuals with and without PD. Method Individuals with PD and neurologically healthy controls performed speech tasks (reading and extemporaneous speech tasks) and an oscillatory manual task (a counterclockwise circle-drawing task) in isolation (single-task condition) and concurrently (dual-task condition). Results Relative to speech task performance, no changes in speech acoustics were observed for either group when the low-demand motor task was performed with the concurrent reading tasks. Speakers with PD exhibited a significant decrease in pause duration between the single-task (speech only) and dual-task conditions for the extemporaneous speech task, whereas control participants did not exhibit changes in any speech production variable between the single- and dual-task conditions. Conclusions Overall, there were little to no changes in speech production when a low-demand oscillatory motor task was performed with concurrent reading. For the extemporaneous task, however, individuals with PD exhibited significant changes when the speech and manual tasks were performed concurrently, a pattern that was not observed for control speakers. Supplemental Material https://doi.org/10.23641/asha.8637008


1971 ◽  
Vol 36 (3) ◽  
pp. 397-409 ◽  
Author(s):  
Rachel E. Stark

Real-time amplitude contour and spectral displays were used in teaching speech production skills to a profoundly deaf, nonspeaking boy. This child had a visual attention problem, a behavior problem, and a poor academic record. In individual instruction, he was first taught to produce features of speech, for example, friction, nasal, and stop, which are present in vocalizations of 6- to 9-month-old infants, and then to combine these features in syllables and words. He made progress in speech, although sign language and finger spelling were taught at the same time. Speech production skills were retained after instruction was terminated. The results suggest that deaf children are able to extract information about the features of speech from visual displays, and that a developmental sequence should be followed as far as possible in teaching speech production skills to them.


Sign in / Sign up

Export Citation Format

Share Document