Computational Models for Speech Production

Computational Models of Speech Pattern Processing ◽

10.1007/978-3-642-60087-6_20 ◽

1999 ◽

pp. 199-213 ◽

Cited By ~ 22

Author(s):

Li Deng

Keyword(s):

Speech Production ◽

Computational Models

Download Full-text

Explaining sonority projection effects

Phonology ◽

10.1017/s0952675711000145 ◽

2011 ◽

Vol 28 (2) ◽

pp. 197-234 ◽

Cited By ~ 53

Author(s):

Robert Daland ◽

Bruce Hayes ◽

James White ◽

Marc Garellek ◽

Andrea Davis ◽

...

Keyword(s):

Speech Perception ◽

Speech Production ◽

Computational Models ◽

Lexical Statistics ◽

Model Based ◽

Onset Clusters

AbstractThe term sonority projection refers to behavioural distinctions speakers make between unattested phonological sequences on the basis of sonority. For example, among onset clusters, the well-formedness relation [bn]>[lb] is observed in speech perception, speech production and non-word acceptability (Davidson 2006, 2007, Berent et al.2007, Albright, ms). We begin by replicating the sonority projection effects in a non-word acceptability study. Then we evaluate the extent to which sonority projection is predicted by existing computational models of phonotactics (Coleman & Pierrehumbert 1997, Hayes & Wilson 2008, inter alia). We show that a model based only on lexical statistics can explain sonority projection in English without a pre-existing sonority sequencing principle. To do this, a model must possess (i) a featural system supporting sonority-based generalisations, and (ii) a context representation including syllabification or equivalent information.

Download Full-text

Neural Representations and Mechanisms for the Performance of Simple Speech Sequences

Journal of Cognitive Neuroscience ◽

10.1162/jocn.2009.21306 ◽

2010 ◽

Vol 22 (7) ◽

pp. 1504-1529 ◽

Cited By ~ 162

Author(s):

Jason W. Bohland ◽

Daniel Bullock ◽

Frank H. Guenther

Keyword(s):

New York ◽

Speech Production ◽

Computational Models ◽

Premotor Cortex ◽

Serial Order ◽

Neural Model ◽

Finite Alphabet ◽

Subcortical Structures ◽

New Model ◽

Neural Representations

Speakers plan the phonological content of their utterances before their release as speech motor acts. Using a finite alphabet of learned phonemes and a relatively small number of syllable structures, speakers are able to rapidly plan and produce arbitrary syllable sequences that fall within the rules of their language. The class of computational models of sequence planning and performance termed competitive queuing models have followed K. S. Lashley [The problem of serial order in behavior. In L. A. Jeffress (Ed.), Cerebral mechanisms in behavior (pp. 112–136). New York: Wiley, 1951] in assuming that inherently parallel neural representations underlie serial action, and this idea is increasingly supported by experimental evidence. In this article, we developed a neural model that extends the existing DIVA model of speech production in two complementary ways. The new model includes paired structure and content subsystems [cf. MacNeilage, P. F. The frame/content theory of evolution of speech production. Behavioral and Brain Sciences, 21, 499–511, 1998 ] that provide parallel representations of a forthcoming speech plan as well as mechanisms for interfacing these phonological planning representations with learned sensorimotor programs to enable stepping through multisyllabic speech plans. On the basis of previous reports, the model's components are hypothesized to be localized to specific cortical and subcortical structures, including the left inferior frontal sulcus, the medial premotor cortex, the basal ganglia, and the thalamus. The new model, called gradient order DIVA, thus fills a void in current speech research by providing formal mechanistic hypotheses about both phonological and phonetic processes that are grounded by neuroanatomy and physiology. This framework also generates predictions that can be tested in future neuroimaging and clinical case studies.

Download Full-text

The Need for a Broad-Based Model of Phonological Disorders

Language Speech and Hearing Services in Schools ◽

10.1044/0161-1461.2303.261 ◽

1992 ◽

Vol 23 (3) ◽

pp. 261-268 ◽

Cited By ~ 24

Author(s):

Alan G. Kamhi

Keyword(s):

Speech Production ◽

Phonological Disorders ◽

Production Processes ◽

Assessment And Treatment ◽

Speech Delays ◽

Explanatory Adequacy ◽

State Of Affairs ◽

Assessment Procedures ◽

Treatment Procedures

My response to Fey’s article (1985; reprinted 1992, this issue) focuses on the confusion caused by the application of simplistic phonological definitions and models to the assessment and treatment of children with speech delays. In addition to having no explanatory adequacy, such definitions/models lead either to assessment and treatment procedures that are similarly focused or to procedures that have no clear logical ties to the models with which they supposedly are linked. Narrowly focused models and definitions also usually include no mention of speech production processes. Bemoaning this state of affairs, I attempt to show why it is important for clinicians to embrace broad-based models of phonological disorders that have some explanatory value. Such models are consistent with assessment procedures that are comprehensive in nature and treatment procedures that focus on linguistic, as well as motoric, aspects of speech.

Download Full-text

A Family With Autosomal-Dominant Progressive Sensorineural Hearing Loss

American Journal of Audiology ◽

10.1044/1059-0889.0501.23 ◽

1996 ◽

Vol 5 (1) ◽

pp. 23-32 ◽

Cited By ~ 3

Author(s):

Chris Halpin ◽

Barbara Herrmann ◽

Margaret Whearty

Keyword(s):

Speech Production ◽

Hearing Aids ◽

Role Models ◽

Speech Intelligibility ◽

Large Scale ◽

Speech Language Pathology ◽

The Family ◽

Patient Will ◽

Language Pathology

The family described in this article provides an unusual opportunity to relate findings from genetic, histological, electrophysiological, psychophysical, and rehabilitative investigation. Although the total number evaluated is large (49), the known, living affected population is smaller (14), and these are spread from age 20 to age 59. As a result, the findings described above are those of a large-scale case study. Clearly, more data will be available through longitudinal study of the individuals documented in the course of this investigation but, given the slow nature of the progression in this disease, such studies will be undertaken after an interval of several years. The general picture presented to the audiologist who must rehabilitate these cases is that of a progressive cochlear degeneration that affects only thresholds at first, and then rapidly diminishes speech intelligibility. The expected result is that, after normal language development, the patient may accept hearing aids well, encouraged by the support of the family. Performance and satisfaction with the hearing aids is good, until the onset of the speech intelligibility loss, at which time the patient will encounter serious difficulties and may reject hearing aids as unhelpful. As the histological and electrophysiological results indicate, however, the eighth nerve remains viable, especially in the younger affected members, and success with cochlear implantation may be expected. Audiologic counseling efforts are aided by the presence of role models and support from the other affected members of the family. Speech-language pathology services were not considered important by the members of this family since their speech production developed normally and has remained very good. Self-correction of speech was supported by hearing aids and cochlear implants (Case 5’s speech production was documented in Perkell, Lane, Svirsky, & Webster, 1992). These patients received genetic counseling and, due to the high penetrance of the disease, exhibited serious concerns regarding future generations and the hope of a cure.

Download Full-text

Application of concepts from Cross-Recurrence Analysis in speech production: An overview and a comparison to other nonlinear methods

Journal of Speech Language and Hearing Research ◽

10.1044/2013_jslhr-s-12-0223 ◽

2013 ◽

Author(s):

L. Lancia ◽

S. Fuchs ◽

M. Tiede

Keyword(s):

Speech Production ◽

Recurrence Analysis ◽

Nonlinear Methods

Download Full-text

Effects of Concurrent Manual Task Performance on Connected Speech Acoustics in Individuals With Parkinson Disease

Journal of Speech Language and Hearing Research ◽

10.1044/2019_jslhr-s-msc18-18-0190 ◽

2019 ◽

Vol 62 (7) ◽

pp. 2099-2117 ◽

Cited By ~ 2

Author(s):

Jason A. Whitfield ◽

Zoe Kriegel ◽

Adam M. Fullenkamp ◽

Daryush D. Mehta

Keyword(s):

Task Performance ◽

Speech Production ◽

Dual Task ◽

Motor Task ◽

Task Condition ◽

Connected Speech ◽

Speech Acoustics ◽

Single Task ◽

Manual Task ◽

Task Conditions

Purpose Prior investigations suggest that simultaneous performance of more than 1 motor-oriented task may exacerbate speech motor deficits in individuals with Parkinson disease (PD). The purpose of the current investigation was to examine the extent to which performing a low-demand manual task affected the connected speech in individuals with and without PD. Method Individuals with PD and neurologically healthy controls performed speech tasks (reading and extemporaneous speech tasks) and an oscillatory manual task (a counterclockwise circle-drawing task) in isolation (single-task condition) and concurrently (dual-task condition). Results Relative to speech task performance, no changes in speech acoustics were observed for either group when the low-demand motor task was performed with the concurrent reading tasks. Speakers with PD exhibited a significant decrease in pause duration between the single-task (speech only) and dual-task conditions for the extemporaneous speech task, whereas control participants did not exhibit changes in any speech production variable between the single- and dual-task conditions. Conclusions Overall, there were little to no changes in speech production when a low-demand oscillatory motor task was performed with concurrent reading. For the extemporaneous task, however, individuals with PD exhibited significant changes when the speech and manual tasks were performed concurrently, a pattern that was not observed for control speakers. Supplemental Material https://doi.org/10.23641/asha.8637008

Download Full-text

Functional Neuroimaging of Speech Production in People Who Stutter

Perspectives on Fluency and Fluency Disorders ◽

10.1044/ffd12.2.3 ◽

2002 ◽

Vol 12 (2) ◽

pp. 3-6

Author(s):

Michael Blomgren ◽

Srikantan Nagarajan

Keyword(s):

Speech Production ◽

Functional Neuroimaging

Download Full-text

Speech Production as a Measure of Hearing Aid Benefit in Infants and Young Children With Hearing Loss

Perspectives on Hearing and Hearing Disorders in Childhood ◽

10.1044/hhdc17.1.15 ◽

2007 ◽

Vol 17 (1) ◽

pp. 15-20 ◽

Cited By ~ 1

Author(s):

Sheila R. Pratt ◽

Kelly Schnoor ◽

Melissa Friedman

Keyword(s):

Hearing Loss ◽

Young Children ◽

Speech Production ◽

Hearing Aid ◽

Children With Hearing Loss ◽

Hearing Aid Benefit ◽

Infants And Young Children

Download Full-text

The Use of Real-Time Visual Displays of Speech in the Training of a Profoundly Deaf, Nonspeaking Child: A Case Report

Journal of Speech and Hearing Disorders ◽

10.1044/jshd.3603.397 ◽

1971 ◽

Vol 36 (3) ◽

pp. 397-409 ◽

Cited By ~ 10

Author(s):

Rachel E. Stark

Keyword(s):

Case Report ◽

Real Time ◽

Speech Production ◽

Behavior Problem ◽

Deaf Children ◽

Developmental Sequence ◽

Visual Displays ◽

Individual Instruction ◽

Academic Record ◽

Extract Information

Real-time amplitude contour and spectral displays were used in teaching speech production skills to a profoundly deaf, nonspeaking boy. This child had a visual attention problem, a behavior problem, and a poor academic record. In individual instruction, he was first taught to produce features of speech, for example, friction, nasal, and stop, which are present in vocalizations of 6- to 9-month-old infants, and then to combine these features in syllables and words. He made progress in speech, although sign language and finger spelling were taught at the same time. Speech production skills were retained after instruction was terminated. The results suggest that deaf children are able to extract information about the features of speech from visual displays, and that a developmental sequence should be followed as far as possible in teaching speech production skills to them.

Download Full-text