Distinct cortical locations for integration of audiovisual speech and the McGurk effect

Audiovisual Speech Perception: Acoustic and Visual Phonetic Features Contributing to the McGurk Effect

i-Perception ◽

10.1068/ic768 ◽

2011 ◽

Vol 2 (8) ◽

pp. 768-768

Author(s):

Kaisa Tiippana ◽

Martti Vainio ◽

Mikko Tiainen

Keyword(s):

Speech Perception ◽

McGurk Effect ◽

Audiovisual Speech ◽

Audiovisual Speech Perception ◽

Phonetic Features


Brain activity during audiovisual speech perception: An fMRI study of the McGurk effect

Neuroreport ◽

10.1097/00001756-200306110-00006 ◽

2003 ◽

Vol 14 (8) ◽

pp. 1129-1133 ◽

Cited By ~ 97

Author(s):

Jeffery A. Jones ◽

Daniel E. Callan

Keyword(s):

Speech Perception ◽

Brain Activity ◽

McGurk Effect ◽

Audiovisual Speech ◽

Audiovisual Speech Perception ◽

fMRI Study


Cultural and linguistic factors in audiovisual speech processing: The McGurk effect in Chinese subjects

Perception & Psychophysics ◽

10.3758/bf03206849 ◽

1997 ◽

Vol 59 (1) ◽

pp. 73-80 ◽

Cited By ~ 62

Author(s):

Kaoru Sekiyama

Keyword(s):

Speech Processing ◽

McGurk Effect ◽

Audiovisual Speech ◽

Chinese Subjects


Sound Location Can Influence Audiovisual Speech Perception When Spatial Attention Is Manipulated

Seeing and Perceiving ◽

10.1163/187847511x557308 ◽

2011 ◽

Vol 24 (1) ◽

pp. 67-90 ◽

Cited By ~ 11

Author(s):

Riikka Möttönen ◽

Kaisa Tiippana ◽

Mikko Sams ◽

Hanna Puharinen

Keyword(s):

Speech Perception ◽

Spatial Attention ◽

Reaction Times ◽

McGurk Effect ◽

Visual Speech ◽

Audiovisual Speech ◽

Sound Location ◽

Audiovisual Speech Perception ◽

The Right ◽

Talking Face

Audiovisual speech perception has been considered to operate independent of sound location, since the McGurk effect (altered auditory speech perception caused by conflicting visual speech) has been shown to be unaffected by whether speech sounds are presented in the same or different location as a talking face. Here we show that sound location effects arise with manipulation of spatial attention. Sounds were presented from loudspeakers in five locations: the centre (location of the talking face) and 45°/90° to the left/right. Auditory spatial attention was focused on a location by presenting the majority (90%) of sounds from this location. In Experiment 1, the majority of sounds emanated from the centre, and the McGurk effect was enhanced there. In Experiment 2, the major location was 90° to the left, causing the McGurk effect to be stronger on the left and centre than on the right. Under control conditions, when sounds were presented with equal probability from all locations, the McGurk effect tended to be stronger for sounds emanating from the centre, but this tendency was not reliable. Additionally, reaction times were the shortest for a congruent audiovisual stimulus, and this was the case independent of location. Our main finding is that sound location can modulate audiovisual speech perception, and that spatial attention plays a role in this modulation.


Reducing Playback Rate of Audiovisual Speech Leads to a Surprising Decrease in the McGurk Effect

Multisensory Research ◽

10.1163/22134808-00002586 ◽

2018 ◽

Vol 31 (1-2) ◽

pp. 19-38 ◽

Cited By ~ 5

Author(s):

John F. Magnotti ◽

Debshila Basu Mallick ◽

Michael S. Beauchamp

Keyword(s):

Visual Information ◽

McGurk Effect ◽

Visual Speech ◽

Large Individual ◽

Audiovisual Speech ◽

Unexpected Finding ◽

Video Playback ◽

Natural Rate ◽

Audiovisual Speech Perception ◽

Bayesian Integration

We report the unexpected finding that slowing video playback decreases perception of the McGurk effect. This reduction is counter-intuitive because the illusion depends on visual speech influencing the perception of auditory speech, and slowing speech should increase the amount of visual information available to observers. We recorded perceptual data from 110 subjects viewing audiovisual syllables (either McGurk or congruent control stimuli) played back at one of three rates: the rate used by the talker during recording (the natural rate), a slow rate (50% of natural), or a fast rate (200% of natural). We replicated previous studies showing dramatic variability in McGurk susceptibility at the natural rate, ranging from 0–100% across subjects and from 26–76% across the eight McGurk stimuli tested. Relative to the natural rate, slowed playback reduced the frequency of McGurk responses by 11% (79% of subjects showed a reduction) and reduced congruent accuracy by 3% (25% of subjects showed a reduction). Fast playback rate had little effect on McGurk responses or congruent accuracy. To determine whether our results are consistent with Bayesian integration, we constructed a Bayes-optimal model that incorporated two assumptions: individuals combine auditory and visual information according to their reliability, and changing playback rate affects sensory reliability. The model reproduced both our findings of large individual differences and the playback rate effect. This work illustrates that surprises remain in the McGurk effect and that Bayesian integration provides a useful framework for understanding audiovisual speech perception.
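The first modelling assumption above, that auditory and visual information are combined according to their reliability, can be illustrated with the textbook reliability-weighted (inverse-variance) combination of two Gaussian cues. This is a minimal sketch and not the authors' model: the one-dimensional "phonetic axis", the cue positions, the variances, and the function name `combine_cues` are all hypothetical choices made for illustration.

```python
def combine_cues(mu_a, var_a, mu_v, var_v):
    """Fuse an auditory and a visual Gaussian cue by inverse-variance weighting.

    Returns (fused_mean, fused_variance, auditory_weight).
    Reliability is defined here as 1 / variance (an assumption of this sketch).
    """
    if var_a <= 0 or var_v <= 0:
        raise ValueError("variances must be positive")
    rel_a = 1.0 / var_a          # auditory reliability
    rel_v = 1.0 / var_v          # visual reliability
    w_a = rel_a / (rel_a + rel_v)
    fused_mean = w_a * mu_a + (1.0 - w_a) * mu_v
    fused_var = 1.0 / (rel_a + rel_v)
    return fused_mean, fused_var, w_a


if __name__ == "__main__":
    # Hypothetical axis: 0.0 = auditory syllable, 1.0 = visual syllable.
    # Equally reliable cues: the fused estimate sits midway between them.
    print(combine_cues(0.0, 1.0, 1.0, 1.0))
    # Less reliable visual cue (variance 4.0): the estimate moves toward
    # the auditory syllable, i.e. the visual influence shrinks.
    print(combine_cues(0.0, 1.0, 1.0, 4.0))
```

Under this sketch, any manipulation that lowers visual reliability pulls the fused estimate toward the auditory cue, which is one way a change in playback rate could reduce visually driven responses if, as the abstract assumes, playback rate affects sensory reliability.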


“Paying” attention to audiovisual speech: Do incongruent stimuli incur greater costs?

10.31234/osf.io/37yfs ◽

2019 ◽

Author(s):

Violet Aurora Brown ◽

Julia Feld Strand

Keyword(s):

Speech Processing ◽

Response Times ◽

Audiovisual Integration ◽

McGurk Effect ◽

Visual Speech ◽

Visual Signal ◽

Audiovisual Speech ◽

Dual Task Paradigm ◽

Cortical Regions ◽

Illusory Percept

The McGurk effect is a multisensory phenomenon in which discrepant auditory and visual speech signals typically result in an illusory percept (McGurk & MacDonald, 1976). McGurk stimuli are often used in studies assessing the attentional requirements of audiovisual integration (e.g., Alsius et al., 2005), but no study has directly compared the costs associated with integrating congruent versus incongruent audiovisual speech. Some evidence suggests that the McGurk effect may not be representative of naturalistic audiovisual speech processing—susceptibility to the McGurk effect is not associated with the ability to derive benefit from the addition of the visual signal (Van Engen et al., 2017), and distinct cortical regions are recruited when processing congruent versus incongruent speech (Erickson et al., 2014). In two experiments, one using response times to identify congruent and incongruent syllables and one using a dual-task paradigm, we assessed whether congruent and incongruent audiovisual speech incur different attentional costs. We demonstrated that response times to both the speech task (Experiment 1) and a secondary vibrotactile task (Experiment 2) were indistinguishable for congruent compared to incongruent syllables, but McGurk fusions were responded to more quickly than McGurk non-fusions. These results suggest that despite documented differences in how congruent and incongruent stimuli are processed (Erickson et al., 2014; Van Engen, Xie, & Chandrasekaran, 2017), they do not appear to differ in terms of processing time or effort. However, responses that result in McGurk fusions are processed more quickly than those that result in non-fusions, though attentional cost is comparable for the two response types.


Acoustic and visual phonetic features in the McGurk effect - an audiovisual speech illusion

10.21437/interspeech.2013-424 ◽

2013 ◽

Author(s):

Kaisa Tiippana ◽

Mikko Tiainen ◽

Lari Vainio ◽

Martti Vainio

Keyword(s):

McGurk Effect ◽

Audiovisual Speech ◽

Phonetic Features


Own-race faces promote integrated audiovisual speech information

Quarterly Journal of Experimental Psychology ◽

10.1177/17470218211044480 ◽

2021 ◽

pp. 174702182110444

Author(s):

Yuta Ujiie ◽

Kohske Takahashi

Keyword(s):

Speech Perception ◽

McGurk Effect ◽

The Other ◽

Emotional Expressions ◽

Audiovisual Speech ◽

Audiovisual Speech Perception ◽

Facial Identity ◽

Race Effect ◽

Speech Information ◽

Effect Experiment

The other-race effect indicates a perceptual advantage when processing own-race faces. This effect has been demonstrated in individuals’ recognition of facial identity and emotional expressions. However, it remains unclear whether the other-race effect also exists in multisensory domains. We conducted two experiments to provide evidence for the other-race effect in facial speech recognition, using the McGurk effect. Experiment 1 tested this issue among East Asian adults, examining the magnitude of the McGurk effect during stimuli using speakers from two different races (own-race vs. other-race). We found that own-race faces induced a stronger McGurk effect than other-race faces. Experiment 2 indicated that the other-race effect was not simply due to different levels of attention being paid to the mouths of own- and other-race speakers. Our findings demonstrated that own-race faces enhance the weight of visual input during audiovisual speech perception, and they provide evidence of the own-race effect in the audiovisual interaction for speech perception in adults.


The other-race effect on the McGurk effect in infancy

Attention, Perception, & Psychophysics ◽

10.3758/s13414-021-02342-w ◽

2021 ◽

Author(s):

Yuta Ujiie ◽

So Kanazawa ◽

Masami K. Yamaguchi

Keyword(s):

Processing System ◽

McGurk Effect ◽

The Other ◽

Audiovisual Speech ◽

Race Effect ◽

The Difference ◽

Face Stimuli ◽

Perceptual Narrowing ◽

Sensory Process

This study investigated the difference in the McGurk effect between own-race-face and other-race-face stimuli among Japanese infants from 5 to 9 months of age. The McGurk effect results from infants using information from a speaker’s face in audiovisual speech integration. We hypothesized that the McGurk effect varies with the speaker’s race because of the other-race effect, which indicates an advantage for own-race faces in our face processing system. Experiment 1 demonstrated the other-race effect on audiovisual speech integration such that the infants ages 5–6 months and 8–9 months are likely to perceive the McGurk effect when observing an own-race-face speaker, but not when observing an other-race-face speaker. Experiment 2 found the other-race effect on audiovisual speech integration regardless of irrelevant speech identity cues. Experiment 3 confirmed the infants’ ability to differentiate two auditory syllables. These results showed that infants are likely to integrate voice with an own-race-face, but not with an other-race-face. This implies the role of experiences with own-race-faces in the development of audiovisual speech integration. Our findings also contribute to the discussion of whether perceptual narrowing is a modality-general, pan-sensory process.


Hearing Lips and Seeing Voices: the Origins and Development of the ‘McGurk Effect’ and Reflections on Audio–Visual Speech Perception Over the Last 40 Years

Multisensory Research ◽

10.1163/22134808-00002548 ◽

2018 ◽

Vol 31 (1-2) ◽

pp. 7-18 ◽

Cited By ~ 3

Author(s):

John MacDonald

Keyword(s):

Speech Perception ◽

Visual Illusion ◽

Simultaneous Presentation ◽

McGurk Effect ◽

Visual Speech ◽

Audiovisual Speech ◽

Audiovisual Speech Perception ◽

Profound Impact ◽

Visual Speech Perception

In 1976 Harry McGurk and I published a paper in Nature, entitled ‘Hearing Lips and Seeing Voices’. The paper described a new audio–visual illusion we had discovered that showed the perception of auditorily presented speech could be influenced by the simultaneous presentation of incongruent visual speech. This hitherto unknown effect has since had a profound impact on audiovisual speech perception research. The phenomenon has come to be known as the ‘McGurk effect’, and the original paper has been cited in excess of 4800 times. In this paper I describe the background to the discovery of the effect, the rationale for the generation of the initial stimuli, the construction of the exemplars used and the serendipitous nature of the finding. The paper will also cover the reaction (and non-reaction) to the Nature publication, the growth of research on, and utilizing the ‘McGurk effect’ and end with some reflections on the significance of the finding.
