Continuous speech processing

Phonetic Analysis and the Automatic Segmentation and Labeling of Speech Sounds

Journal of the International Phonetic Association ◽

10.1017/s002510030000400x ◽

1990 ◽

Vol 20 (1) ◽

pp. 15-21 ◽

Cited By ~ 3

Author(s):

Peter Roach ◽

Helen Roach ◽

Andrea Dew ◽

Paul Rowlands

Keyword(s):

Speech Processing ◽

Acoustic Analysis ◽

Automatic Segmentation ◽

Research Work ◽

Speech Sounds ◽

Computer Algorithms ◽

Continuous Speech ◽

Considerable Debate ◽

Theoretical Issues

A fundamentally important practice in phonetics is the analysis of continuous speech into a sequence of discrete segments. There has been considerable debate about the theoretical validity of this practice within classical auditory/kinaesthetic phonetics and in phonology as well as in the context of the acoustic analysis of speech; in recent years the issue has become more widely important as research work in automatic speech processing has resulted in computer algorithms for segmenting speech and assigning phonetic labels to the segments. Work on the automatic segmentation and labeling of speech sounds has been carried on in our department since 1980. This paper begins by examining the theoretical issues involved in segmentation and labeling, then describes our own work.

Download Full-text

Hearing loss is associated with delayed neural responses to continuous speech

10.1101/2021.01.21.427550 ◽

2021 ◽

Author(s):

Marlies Gillis ◽

Lien Decruy ◽

Jonas Vanthornhout ◽

Tom Francart

Keyword(s):

Hearing Loss ◽

Speech Processing ◽

Noise Level ◽

Normal Hearing ◽

Hearing Impaired ◽

Neural Responses ◽

Continuous Speech ◽

Processing Efficiency ◽

Sound Amplification ◽

The Impact

AbstractWe investigated the impact of hearing loss on the neural processing of speech. Using a forward modelling approach, we compared the neural responses to continuous speech of 14 adults with sensorineural hearing loss with those of age-matched normal-hearing peers.Compared to their normal-hearing peers, hearing-impaired listeners had increased neural tracking and delayed neural responses to continuous speech in quiet. The latency also increased with the degree of hearing loss. As speech understanding decreased, neural tracking decreased in both population; however, a significantly different trend was observed for the latency of the neural responses. For normal-hearing listeners, the latency increased with increasing background noise level. However, for hearing-impaired listeners, this increase was not observed.Our results support that the neural response latency indicates the efficiency of neural speech processing. Hearing-impaired listeners process speech in silence less efficiently then normal-hearing listeners. Our results suggest that this reduction in neural speech processing efficiency is a gradual effect which occurs as hearing deteriorates. Moreover, the efficiency of neural speech processing in hearing-impaired listeners is already at its lowest level when listening to speech in quiet, while normal-hearing listeners show a further decrease in efficiently when the noise level increases.From our results, it is apparent that sound amplification does not solve hearing loss. Even when intelligibility is apparently perfect, hearing-impaired listeners process speech less efficiently.

Download Full-text

Acoustic Characteristics of Speech Produced Without Oral Sensation

Journal of Speech and Hearing Research ◽

10.1044/jshr.1601.67 ◽

1973 ◽

Vol 16 (1) ◽

pp. 67-77 ◽

Cited By ~ 9

Author(s):

Yoshiyuki Horii ◽

Arthur S. House ◽

Kung-Pu Li ◽

Robert L. Ringel

Keyword(s):

Speech Processing ◽

High Frequency ◽

Acoustic Analysis ◽

Continuous Speech ◽

Acoustic Characteristics ◽

Frequency Distributions ◽

Fundamental Frequencies ◽

Long Time ◽

Short Time ◽

Nerve Block Anesthesia

The acoustic characteristics of continuous speech produced by an adult male talker with and without oral (nerve block) anesthesia were investigated using digital speech processing procedures. Vowel-to-consonant ratios, long-time and short-time spectra, fundamental frequency distributions, phonation-time ratios, and rate of utterances were calculated and compared for the normal and anesthetized conditions. The results showed that the speech produced without oral sensation was characterized by a reduction and shift of high-frequency energy, temporal disorganization primarily manifested as prolongation of utterance, and higher and more variable fundamental frequencies. The study also demonstrated applicability of computer techniques on general acoustic analysis of continuous speech.

Download Full-text

Combining predictive coding with neural oscillations optimizes on-line speech processing

10.1101/477588 ◽

2018 ◽

Cited By ~ 1

Author(s):

Sevada Hovsepyan ◽

Itsaso Olasagasti ◽

Anne-Lise Giraud

Keyword(s):

Sensory Processing ◽

Speech Processing ◽

Predictive Coding ◽

Theta Oscillations ◽

Neural Oscillations ◽

Speech Comprehension ◽

Contextual Cues ◽

Continuous Speech ◽

Neural Representations ◽

On Line

ABSTRACTSpeech comprehension requires segmenting continuous speech to connect it on-line with discrete linguistic neural representations. This process relies on theta-gamma oscillation coupling, which tracks syllables and encodes them in decipherable neural activity. Speech comprehension also strongly depends on contextual cues predicting speech structure and content. To explore the effects of theta-gamma coupling on bottom-up/top-down dynamics during on-line speech perception, we designed a generative model that can recognize syllable sequences in continuous speech. The model uses theta oscillations to detect syllable onsets and align both gamma-rate encoding activity with syllable boundaries and predictions with speech input. We observed that the model performed best when theta oscillations were used to align gamma units with input syllables, i.e. when bidirectional information flows were coordinated, and internal timing knowledge was exploited. This work demonstrates that notions of predictive coding and neural oscillations can usefully be brought together to account for dynamic on-line sensory processing.

Download Full-text

An ERP study of continuous speech processing

Cognitive Brain Research ◽

10.1016/s0926-6410(02)00194-5 ◽

2003 ◽

Vol 15 (3) ◽

pp. 214-227 ◽

Cited By ~ 47

Author(s):

Lisa D Sanders ◽

Helen J Neville

Keyword(s):

Speech Processing ◽

Continuous Speech

Download Full-text

Activation–verification in continuous speech processing: Interaction of cognitive strategies as a possible theoretical approach

Journal of Neurolinguistics ◽

10.1016/j.jneuroling.2007.06.001 ◽

2008 ◽

Vol 21 (1) ◽

pp. 1-17 ◽

Cited By ~ 8

Author(s):

Kuzma Strelnikov

Keyword(s):

Theoretical Approach ◽

Speech Processing ◽

Cognitive Strategies ◽

Continuous Speech

Download Full-text

An ERP study of continuous speech processing

Cognitive Brain Research ◽

10.1016/s0926-6410(02)00195-7 ◽

2003 ◽

Vol 15 (3) ◽

pp. 228-240 ◽

Cited By ~ 70

Author(s):

Lisa D Sanders ◽

Helen J Neville

Keyword(s):

Speech Processing ◽

Continuous Speech

Download Full-text

Isolating Neural Indices of Continuous Speech Processing at the Phonetic Level

Advances in Experimental Medicine and Biology - Physiology, Psychoacoustics and Cognition in Normal and Impaired Hearing ◽

10.1007/978-3-319-25474-6_35 ◽

2016 ◽

pp. 337-345 ◽

Cited By ~ 6

Author(s):

Giovanni M. Di Liberto ◽

Edmund C. Lalor

Keyword(s):

Speech Processing ◽

Continuous Speech

Download Full-text

Age-Related Speech Processing Decline May Account for Comprehension Problems

ASHA Leader ◽

10.1044/leader.rib2.22012017.15 ◽

2017 ◽

Vol 22 (1) ◽

pp. 15-15

Keyword(s):

Speech Processing ◽

Age Related

Download Full-text

What Is Left Is Right

European Psychologist ◽

10.1027/1016-9040.14.1.78 ◽

2009 ◽

Vol 14 (1) ◽

pp. 78-89 ◽

Cited By ~ 8

Author(s):

Kenneth Hugdahl ◽

René Westerhausen

Keyword(s):

Speech Processing ◽

Information Transfer ◽

Hemispheric Asymmetry ◽

Functional Neuroimaging ◽

Gray Matter Volume ◽

Functional Anatomy ◽

Posterior Part ◽

Functional Asymmetry ◽

Planum Temporale ◽

Evolution Of Speech

The present paper is based on a talk on hemispheric asymmetry given by Kenneth Hugdahl at the Xth European Congress of Psychology, Praha July 2007. Here, we propose that hemispheric asymmetry evolved because of a left hemisphere speech processing specialization. The evolution of speech and the need for air-based communication necessitated division of labor between the hemispheres in order to avoid having duplicate copies in both hemispheres that would increase processing redundancy. It is argued that the neuronal basis of this labor division is the structural asymmetry observed in the peri-Sylvian region in the posterior part of the temporal lobe, with a left larger than right planum temporale area. This is the only example where a structural, or anatomical, asymmetry matches a corresponding functional asymmetry. The increase in gray matter volume in the left planum temporale area corresponds to a functional asymmetry of speech processing, as indexed from both behavioral, dichotic listening, and functional neuroimaging studies. The functional anatomy of the corpus callosum also supports such a view, with regional specificity of information transfer between the hemispheres.

Download Full-text