A challenge to Articulation Theory: Narrow-band acoustic signals and visual speech cues

Research has shown that speaking in a deliberately clear manner can improve the accuracy of auditory speech recognition. Allowing listeners access to visual speech cues also enhances speech understanding. Whether the information provided by speaking clearly and by using visual speech cues is redundant has not been determined. This study examined how speaking mode (clear vs. conversational) and presentation mode (auditory vs. auditory-visual) influenced the perception of words within nonsense sentences. In Experiment 1, 30 young listeners with normal hearing responded to videotaped stimuli presented audiovisually in the presence of background noise at one of three signal-to-noise ratios. In Experiment 2, 9 participants returned for an additional assessment using auditory-only presentation. Results of these experiments showed significant effects of speaking mode (clear speech was easier to understand than was conversational speech) and presentation mode (auditory-visual presentation led to better performance than did auditory-only presentation). The benefit of clear speech was greater for words occurring in the middle of sentences than for words at either the beginning or end of sentences for both auditory-only and auditory-visual presentation, whereas the greatest benefit from supplying visual cues was for words at the end of sentences spoken both clearly and conversationally. The total benefit from speaking clearly and supplying visual cues was equal to the sum of each of these effects. Overall, the results suggest that speaking clearly and providing visual speech information provide complementary (rather than redundant) information.

Evaluating the Effort Expended to Understand Speech in Noise Using a Dual-Task Paradigm: The Effects of Providing Visual Speech Cues

Journal of Speech Language and Hearing Research ◽

10.1044/1092-4388(2009/08-0140) ◽

2010 ◽

Vol 53 (1) ◽

pp. 18-33 ◽

Cited By ~ 97

Author(s):

Sarah Fraser ◽

Jean-Pierre Gagné ◽

Marjolaine Alepins ◽

Pascale Dubois

Keyword(s):

Dual Task ◽

Visual Speech ◽

Speech In Noise ◽

Speech Cues ◽

Dual Task Paradigm

Audio-visual speech perception without speech cues

Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96 ◽

10.1109/icslp.1996.607238 ◽

2002 ◽

Cited By ~ 1

Author(s):

H.M. Saldana ◽

D.B. Pisoni ◽

J.M. Fellowes ◽

R.E. Remez

Keyword(s):

Speech Perception ◽

Visual Speech ◽

Speech Cues ◽

Visual Speech Perception

Relationships between acoustic reflex patterns elicited by unfiltered white noise and narrow band white noise stimuli of different duration but of the same intensity

The Journal of Laryngology & Otology ◽

10.1017/s0022215100097814 ◽

1985 ◽

Vol 99 (9) ◽

pp. 857-863 ◽

Cited By ~ 3

Author(s):

Giovanni Rossi ◽

Paolo Solero ◽

M. Rolando

Keyword(s):

White Noise ◽

Narrow Band ◽

Acoustic Signals ◽

Acoustic Reflex ◽

Recruitment Time

For the purpose of this study, acoustic signals were generated by an Amplaid MK VI. An Amplaid 702 impedance meter was connected to its averaging section and to its computer. The stimuli were bursts of unfiltered white noise (UWN) and of narrow band white noise (NBWN; 30 dB/oct slope; central frequencies 1,000, 2,000, 4,000 Hz) lasting 3–1,000 msec, at an intensity of 105 dB SPL p.e. The following parameters were evaluated: stapedius contraction latency, amplitude, duration and recruitment time. It was found that latency was independent of the spectrum of the stimulus and its duration. Amplitude and recruitment time, on the other hand, were related to spectrum and duration, while duration of contraction was directly related to the duration of the stimulus only.

The role of visual speech cues in sound change: A study of the cot-caught contrast among Michigan speakers

The Journal of the Acoustical Society of America ◽

10.1121/1.4970151 ◽

2016 ◽

Vol 140 (4) ◽

pp. 3219-3219

Author(s):

Jonathan Havenhill ◽

Youngah Do

Keyword(s):

Sound Change ◽

Visual Speech ◽

Speech Cues

Spectral Distribution of Prosodic Information

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.3902.228 ◽

1996 ◽

Vol 39 (2) ◽

pp. 228-238 ◽

Cited By ~ 31

Author(s):

Ken W. Grant ◽

Brian E. Walden

Keyword(s):

High Frequency ◽

Low Frequency ◽

Visual Speech ◽

Prosodic Features ◽

Temporal Properties ◽

Prosodic Cues ◽

Visual Speech Recognition ◽

Speech Cues ◽

Boundary Location ◽

Speech Spectrum

Prosodic speech cues for rhythm, stress, and intonation are related primarily to variations in intensity, duration, and fundamental frequency. Because these cues make use of temporal properties of the speech waveform, they are likely to be represented broadly across the speech spectrum. In order to determine the relative importance of different frequency regions for the recognition of prosodic cues, identification of four prosodic features (syllable number, syllabic stress, sentence intonation, and phrase boundary location) was evaluated under six filter conditions spanning the range from 200–6100 Hz. Each filter condition had equal articulation index (AI) weights, AI ≈ 0.10; p(C) isolated words ≈ 0.40. Results obtained with normally hearing subjects showed that there was an interaction between filter condition and the identification of specific prosodic features. For example, information from high-frequency regions of speech was particularly useful in the identification of syllable number and stress, whereas information from low-frequency regions was helpful in identifying intonation patterns. In spite of these spectral differences, listeners overall performed remarkably well in identifying prosodic patterns, although individual differences were apparent. For some subjects, equivalent levels of performance across the six filter conditions were achieved. These results are discussed in relation to auditory and auditory-visual speech recognition.

Phase variance of narrow-band acoustic signals on near-surface paths

Atmospheric and Oceanic Optics ◽

10.1134/s1024856017030101 ◽

2017 ◽

Vol 30 (3) ◽

pp. 236-242 ◽

Cited By ~ 1

Author(s):

V. P. Mamyshev ◽

S. L. Odintsov

Keyword(s):

Narrow Band ◽

Acoustic Signals ◽

Near Surface ◽

Phase Variance

Modeling the spatial and frequency distribution of narrow‐band acoustic signals scattering from the ocean surface

The Journal of the Acoustical Society of America ◽

10.1121/1.412095 ◽

1995 ◽

Vol 97 (3) ◽

pp. 1559-1565 ◽

Cited By ~ 2

Author(s):

Michael Wild ◽

Robert Joyce

Keyword(s):

Frequency Distribution ◽

Narrow Band ◽

Ocean Surface ◽

Acoustic Signals

Should visual speech cues (speechreading) be considered when fitting hearing aids?

The Journal of the Acoustical Society of America ◽

10.1121/1.4777903 ◽

2002 ◽

Vol 111 (5) ◽

pp. 2354 ◽

Cited By ~ 1

Author(s):

Ken Grant

Keyword(s):

Hearing Aids ◽

Visual Speech ◽

Speech Cues ◽

Fitting Hearing Aids

Audiovisual speech perception in infancy: The influence of vowel identity and infants’ productive abilities on sensitivity to (mis)matches between auditory and visual speech cues.

Developmental Psychology ◽

10.1037/a0039964 ◽

2016 ◽

Vol 52 (2) ◽

pp. 191-204 ◽

Cited By ~ 10

Author(s):

Nicole Altvater-Mackensen ◽

Nivedita Mani ◽

Tobias Grossmann

Keyword(s):

Speech Perception ◽

Visual Speech ◽

Audiovisual Speech ◽

Audiovisual Speech Perception ◽

Speech Cues
