scholarly journals Continuous speech processing

2020 ◽  
Vol 18 ◽  
pp. 25-31 ◽  
Author(s):  
Christian Brodbeck ◽  
Jonathan Z Simon
1990 ◽  
Vol 20 (1) ◽  
pp. 15-21 ◽  
Author(s):  
Peter Roach ◽  
Helen Roach ◽  
Andrea Dew ◽  
Paul Rowlands

A fundamentally important practice in phonetics is the analysis of continuous speech into a sequence of discrete segments. There has been considerable debate about the theoretical validity of this practice within classical auditory/kinaesthetic phonetics and in phonology as well as in the context of the acoustic analysis of speech; in recent years the issue has become more widely important as research work in automatic speech processing has resulted in computer algorithms for segmenting speech and assigning phonetic labels to the segments. Work on the automatic segmentation and labeling of speech sounds has been carried on in our department since 1980. This paper begins by examining the theoretical issues involved in segmentation and labeling, then describes our own work.


2021 ◽  
Author(s):  
Marlies Gillis ◽  
Lien Decruy ◽  
Jonas Vanthornhout ◽  
Tom Francart

AbstractWe investigated the impact of hearing loss on the neural processing of speech. Using a forward modelling approach, we compared the neural responses to continuous speech of 14 adults with sensorineural hearing loss with those of age-matched normal-hearing peers.Compared to their normal-hearing peers, hearing-impaired listeners had increased neural tracking and delayed neural responses to continuous speech in quiet. The latency also increased with the degree of hearing loss. As speech understanding decreased, neural tracking decreased in both population; however, a significantly different trend was observed for the latency of the neural responses. For normal-hearing listeners, the latency increased with increasing background noise level. However, for hearing-impaired listeners, this increase was not observed.Our results support that the neural response latency indicates the efficiency of neural speech processing. Hearing-impaired listeners process speech in silence less efficiently then normal-hearing listeners. Our results suggest that this reduction in neural speech processing efficiency is a gradual effect which occurs as hearing deteriorates. Moreover, the efficiency of neural speech processing in hearing-impaired listeners is already at its lowest level when listening to speech in quiet, while normal-hearing listeners show a further decrease in efficiently when the noise level increases.From our results, it is apparent that sound amplification does not solve hearing loss. Even when intelligibility is apparently perfect, hearing-impaired listeners process speech less efficiently.


1973 ◽  
Vol 16 (1) ◽  
pp. 67-77 ◽  
Author(s):  
Yoshiyuki Horii ◽  
Arthur S. House ◽  
Kung-Pu Li ◽  
Robert L. Ringel

The acoustic characteristics of continuous speech produced by an adult male talker with and without oral (nerve block) anesthesia were investigated using digital speech processing procedures. Vowel-to-consonant ratios, long-time and short-time spectra, fundamental frequency distributions, phonation-time ratios, and rate of utterances were calculated and compared for the normal and anesthetized conditions. The results showed that the speech produced without oral sensation was characterized by a reduction and shift of high-frequency energy, temporal disorganization primarily manifested as prolongation of utterance, and higher and more variable fundamental frequencies. The study also demonstrated applicability of computer techniques on general acoustic analysis of continuous speech.


2018 ◽  
Author(s):  
Sevada Hovsepyan ◽  
Itsaso Olasagasti ◽  
Anne-Lise Giraud

ABSTRACTSpeech comprehension requires segmenting continuous speech to connect it on-line with discrete linguistic neural representations. This process relies on theta-gamma oscillation coupling, which tracks syllables and encodes them in decipherable neural activity. Speech comprehension also strongly depends on contextual cues predicting speech structure and content. To explore the effects of theta-gamma coupling on bottom-up/top-down dynamics during on-line speech perception, we designed a generative model that can recognize syllable sequences in continuous speech. The model uses theta oscillations to detect syllable onsets and align both gamma-rate encoding activity with syllable boundaries and predictions with speech input. We observed that the model performed best when theta oscillations were used to align gamma units with input syllables, i.e. when bidirectional information flows were coordinated, and internal timing knowledge was exploited. This work demonstrates that notions of predictive coding and neural oscillations can usefully be brought together to account for dynamic on-line sensory processing.


2003 ◽  
Vol 15 (3) ◽  
pp. 214-227 ◽  
Author(s):  
Lisa D Sanders ◽  
Helen J Neville

2003 ◽  
Vol 15 (3) ◽  
pp. 228-240 ◽  
Author(s):  
Lisa D Sanders ◽  
Helen J Neville

2009 ◽  
Vol 14 (1) ◽  
pp. 78-89 ◽  
Author(s):  
Kenneth Hugdahl ◽  
René Westerhausen

The present paper is based on a talk on hemispheric asymmetry given by Kenneth Hugdahl at the Xth European Congress of Psychology, Praha July 2007. Here, we propose that hemispheric asymmetry evolved because of a left hemisphere speech processing specialization. The evolution of speech and the need for air-based communication necessitated division of labor between the hemispheres in order to avoid having duplicate copies in both hemispheres that would increase processing redundancy. It is argued that the neuronal basis of this labor division is the structural asymmetry observed in the peri-Sylvian region in the posterior part of the temporal lobe, with a left larger than right planum temporale area. This is the only example where a structural, or anatomical, asymmetry matches a corresponding functional asymmetry. The increase in gray matter volume in the left planum temporale area corresponds to a functional asymmetry of speech processing, as indexed from both behavioral, dichotic listening, and functional neuroimaging studies. The functional anatomy of the corpus callosum also supports such a view, with regional specificity of information transfer between the hemispheres.


Sign in / Sign up

Export Citation Format

Share Document