Audiovisual integration as conflict resolution: The conflict of the McGurk illusion

2017 ◽  
Vol 38 (11) ◽  
pp. 5691-5705 ◽  
Author(s):  
Luis Morís Fernández ◽  
Emiliano Macaluso ◽  
Salvador Soto-Faraco


2012 ◽  
Vol 25 (0) ◽  
pp. 105 ◽  
Author(s):  
Tobias Søren Andersen

Seeing the talking face can influence the phoneme perceived from the voice. This facilitates speech perception in the natural case where the face and voice are congruent and can cause the McGurk illusion when they are not. The classical example of the McGurk illusion is when acoustic /aba/ is perceived as /ada/ when dubbed onto a face articulating /aga/. In order to fully understand the underlying process of integrating information across the senses, we need a computational account with predictive power. The Fuzzy Logical Model of Perception is one computational account of audiovisual integration in speech perception. Here we describe alternative accounts in which integration is based on an early continuous internal representation on which the phonetic classes fall. We show that these alternative accounts can provide just as good a fit when corrected for the number of free parameters. We also show, using cross-validation, that they have greater, but not great, predictive power. Finally, we show that introducing a regularization term can remedy the lack of predictive power. With regularization, models based on continuous representations have the highest predictive power.
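The interplay of cross-validation and regularization described above is generic: a flexible model can fit the training data well yet predict held-out data poorly, and a penalty on parameter magnitude trades fit for predictive power. The sketch below is not the authors' perceptual models; it is a minimal, hypothetical illustration using ridge regression, where the regularization strength `lam` and the synthetic data are assumptions for demonstration only.

```python
import numpy as np

def fit_ridge(X, y, lam):
    """Closed-form ridge solution: w = (X'X + lam*I)^{-1} X'y."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

def cv_mse(X, y, lam, k=5):
    """k-fold cross-validated mean squared error (a proxy for predictive power)."""
    n = len(y)
    folds = np.array_split(np.arange(n), k)
    errs = []
    for test_idx in folds:
        train_idx = np.setdiff1d(np.arange(n), test_idx)
        w = fit_ridge(X[train_idx], y[train_idx], lam)
        errs.append(np.mean((X[test_idx] @ w - y[test_idx]) ** 2))
    return float(np.mean(errs))

rng = np.random.default_rng(0)
x = np.linspace(-1, 1, 40)
y = 0.8 * x + rng.normal(0, 0.3, size=x.shape)  # simple ground truth plus noise
X = np.vander(x, 9)                             # degree-8 features: prone to overfitting

w_free = fit_ridge(X, y, lam=0.0)  # unregularized fit
w_reg = fit_ridge(X, y, lam=1.0)   # regularized fit shrinks the parameters

cv_free = cv_mse(X, y, lam=0.0)
cv_reg = cv_mse(X, y, lam=1.0)
print(f"CV error without regularization: {cv_free:.3f}, with: {cv_reg:.3f}")
```

The regularized solution always has a smaller parameter norm than the unregularized one; whether it also predicts better depends on how overparameterized the model is relative to the data, which is exactly what cross-validation measures.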


2018 ◽  
Vol 31 (1-2) ◽  
pp. 111-144 ◽  
Author(s):  
Agnès Alsius ◽  
Martin Paré ◽  
Kevin G. Munhall

Since its discovery 40 years ago, the McGurk illusion has usually been cited as a paradigmatic case of multisensory binding in humans, and has been extensively used in speech perception studies as a proxy measure for audiovisual integration mechanisms. Despite the well-established practice of using the McGurk illusion as a tool for studying the mechanisms underlying audiovisual speech integration, the magnitude of the illusion varies enormously across studies. Furthermore, the processing of McGurk stimuli differs from congruent audiovisual processing at both phenomenological and neural levels. This calls into question the suitability of this illusion as a tool to quantify the necessary and sufficient conditions under which audiovisual integration occurs in natural conditions. In this paper, we review some of the practical and theoretical issues related to the use of the McGurk illusion as an experimental paradigm. We believe that, without a richer understanding of the mechanisms involved in the processing of the McGurk effect, experimenters should be particularly cautious when generalizing data generated by McGurk stimuli to matching audiovisual speech events.


2015 ◽  
Vol 113 (7) ◽  
pp. 2342-2350 ◽  
Author(s):  
Yadira Roa Romero ◽  
Daniel Senkowski ◽  
Julian Keil

The McGurk illusion is a prominent example of audiovisual speech perception and the influence that visual stimuli can have on auditory perception. In this illusion, a visual speech stimulus influences the perception of an incongruent auditory stimulus, resulting in a fused novel percept. In this high-density electroencephalography (EEG) study, we were interested in the neural signatures of the subjective percept of the McGurk illusion as a phenomenon of speech-specific multisensory integration. Therefore, we examined the role of cortical oscillations and event-related responses in the perception of congruent and incongruent audiovisual speech. We compared the cortical activity elicited by objectively congruent syllables with incongruent audiovisual stimuli. Importantly, the latter elicited a subjectively congruent percept: the McGurk illusion. We found that early event-related responses (N1) to audiovisual stimuli were reduced during the perception of the McGurk illusion compared with congruent stimuli. Most interestingly, our study showed a stronger poststimulus suppression of beta-band power (13–30 Hz) at short (0–500 ms) and long (500–800 ms) latencies during the perception of the McGurk illusion compared with congruent stimuli. Our study demonstrates that auditory perception is influenced by visual context and that the subsequent formation of a McGurk illusion requires stronger audiovisual integration even at early processing stages. Our results provide evidence that beta-band suppression at early stages reflects stronger stimulus processing in the McGurk illusion. Moreover, stronger late beta-band suppression in the McGurk illusion indicates the resolution of incongruent physical audiovisual input and the formation of a coherent, illusory multisensory percept.
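Beta-band (13–30 Hz) power suppression of the kind reported above is typically quantified by band-pass filtering the EEG and taking the amplitude envelope. This is not the study's analysis pipeline; it is a minimal sketch of the general technique on a synthetic signal, where the sampling rate, filter order, and simulated amplitude drop are all illustrative assumptions.

```python
import numpy as np
from scipy.signal import butter, filtfilt, hilbert

def band_power(signal, fs, low=13.0, high=30.0):
    """Instantaneous band-limited power via band-pass filter + Hilbert envelope."""
    b, a = butter(4, [low / (fs / 2), high / (fs / 2)], btype="band")
    filtered = filtfilt(b, a, signal)  # zero-phase filtering
    envelope = np.abs(hilbert(filtered))
    return envelope ** 2

fs = 500  # Hz, assumed sampling rate
t = np.arange(0, 2.0, 1 / fs)
rng = np.random.default_rng(1)

# Synthetic signal: a 20 Hz (beta) oscillation whose amplitude drops
# after t = 1 s, mimicking poststimulus beta suppression.
amp = np.where(t < 1.0, 1.0, 0.3)
eeg = amp * np.sin(2 * np.pi * 20 * t) + 0.1 * rng.standard_normal(t.size)

power = band_power(eeg, fs)
pre = power[(t > 0.2) & (t < 0.8)].mean()
post = power[(t > 1.2) & (t < 1.8)].mean()
print(f"relative beta-power change: {(post - pre) / pre:.2f}")
```

Comparing such pre/post power averages across conditions (here, the windows are chosen away from filter edge artifacts) is the basic building block behind contrasts like "stronger beta suppression for illusion than for congruent trials."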


Author(s):  
Basil Wahn ◽  
Laura Schmitz ◽  
Alan Kingstone ◽  
Anne Böckler-Raettig

Abstract. Eye contact is a dynamic social signal that captures attention and plays a critical role in human communication. In particular, direct gaze often accompanies communicative acts in an ostensive function: a speaker directs her gaze towards the addressee to highlight the fact that this message is being intentionally communicated to her. The addressee, in turn, integrates the speaker’s auditory and visual speech signals (i.e., her vocal sounds and lip movements) into a unitary percept. It is an open question whether the speaker’s gaze affects how the addressee integrates the speaker’s multisensory speech signals. We investigated this question using the classic McGurk illusion, an illusory percept created by presenting mismatching auditory (vocal sounds) and visual information (speaker’s lip movements). Specifically, we manipulated whether the speaker (a) moved his eyelids up/down (i.e., opened/closed his eyes) prior to speaking or did not show any eye motion, and (b) spoke with open or closed eyes. When the speaker’s eyes moved (i.e., opened or closed) before an utterance, and when the speaker spoke with closed eyes, the McGurk illusion was weakened (i.e., addressees reported significantly fewer illusory percepts). In line with previous research, this suggests that motion (opening or closing), as well as the closed state of the speaker’s eyes, captured addressees’ attention, thereby reducing the influence of the speaker’s lip movements on the addressees’ audiovisual integration process. Our findings reaffirm the power of speaker gaze to guide attention, showing that its dynamics can modulate low-level processes such as the integration of multisensory speech signals.


PLoS ONE ◽  
2021 ◽  
Vol 16 (2) ◽  
pp. e0246986
Author(s):  
Alma Lindborg ◽  
Tobias S. Andersen

Speech is perceived with both the ears and the eyes. Adding congruent visual speech improves the perception of a faint auditory speech stimulus, whereas adding incongruent visual speech can alter the perception of the utterance. The latter phenomenon is the case of the McGurk illusion, where an auditory stimulus such as “ba” dubbed onto a visual stimulus such as “ga” produces the illusion of hearing “da”. Bayesian models of multisensory perception suggest that both the enhancement and the illusion can be described as a two-step process of binding (informed by prior knowledge) and fusion (informed by the information reliability of each sensory cue). However, to date no study has accounted for how each stage contributes to audiovisual speech perception. In this study, we expose subjects to both congruent and incongruent audiovisual speech, manipulating the binding and the fusion stages simultaneously. This is done by varying both temporal offset (binding) and auditory and visual signal-to-noise ratio (fusion). We fit two Bayesian models to the behavioural data and show that they can both account for the enhancement effect in congruent audiovisual speech, as well as the McGurk illusion. This modelling approach allows us to disentangle the effects of binding and fusion on behavioural responses. Moreover, we find that these models have greater predictive power than a forced fusion model. This study provides a systematic and quantitative approach to measuring audiovisual integration in the perception of the McGurk illusion as well as congruent audiovisual speech, which we hope will inform future work on audiovisual speech perception.
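The two-step binding/fusion logic described above can be sketched in a few lines. This is not the paper's fitted model; it is a hypothetical illustration in which binding probability decays with temporal offset (a Gaussian falloff with made-up parameters `p_common` and `tau`), and fusion is standard reliability-weighted (inverse-variance) cue combination of Gaussian estimates on an arbitrary phonetic axis.

```python
import numpy as np

def fuse(mu_a, var_a, mu_v, var_v):
    """Reliability-weighted (maximum-likelihood) fusion of two Gaussian cues."""
    w_a = (1 / var_a) / (1 / var_a + 1 / var_v)
    mu = w_a * mu_a + (1 - w_a) * mu_v
    var = 1 / (1 / var_a + 1 / var_v)  # fused estimate is more reliable than either cue
    return mu, var

def binding_prob(offset_ms, p_common=0.7, tau=100.0):
    """Hypothetical prior probability that the cues bind, decaying with temporal offset."""
    return p_common * np.exp(-0.5 * (offset_ms / tau) ** 2)

def percept(mu_a, var_a, mu_v, var_v, offset_ms):
    """Model-averaged percept: fuse with the binding probability, else keep the auditory cue."""
    p = binding_prob(offset_ms)
    fused_mu, _ = fuse(mu_a, var_a, mu_v, var_v)
    return p * fused_mu + (1 - p) * mu_a

# Auditory "ba" at 0 and visual "ga" at 1 on an arbitrary phonetic axis,
# with the visual cue twice as reliable (half the variance).
synchronous = percept(0.0, 1.0, 1.0, 0.5, offset_ms=0.0)
large_offset = percept(0.0, 1.0, 1.0, 0.5, offset_ms=400.0)
print(f"percept, synchronous: {synchronous:.3f}; 400 ms offset: {large_offset:.3f}")
```

In this toy setup, a synchronous presentation pulls the percept toward the visual cue (a McGurk-like shift), while a large temporal offset reduces binding and leaves the percept close to the auditory stimulus, which is the qualitative pattern the binding/fusion decomposition is designed to capture.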


2016 ◽  
Vol 37 (3) ◽  
pp. 145-151 ◽  
Author(s):  
Daniel L. Gadke ◽  
Renée M. Tobin ◽  
W. Joel Schneider

Abstract. This study examined the association between Agreeableness and children’s selection of conflict resolution tactics and their overt behaviors at school. A total of 157 second graders responded to a series of conflict resolution vignettes and were observed three times during physical education classes at school. We hypothesized that Agreeableness would be inversely related to the endorsement of power assertion tactics and to displays of problem behaviors, and positively related to the endorsement of negotiation tactics and to displays of adaptive behaviors. Consistent with hypotheses, Agreeableness was inversely related to power assertion tactics and to displays of off-task, disruptive, and verbally aggressive behaviors. There was no evidence that Agreeableness was related to more socially sophisticated responses to conflict, such as negotiation, with our sample of second grade students; however, it was related to displays of adaptive behaviors, specifically on-task behaviors. Limitations, including potential reactivity effects and the restriction of observational data collection to one school-based setting, are discussed. Future researchers are encouraged to collect data from multiple sources in more than one setting over time.


1987 ◽  
Vol 32 (7) ◽  
pp. 602-603
Author(s):  
Sheldon Stryker
