Speaker Variations Influence Speechreading Speed for Dynamic Faces

Perception ◽  
10.1068/p5104 ◽  
2005 ◽  
Vol 34 (5) ◽  
pp. 595-610 ◽  
Author(s):  
Jürgen M Kaufmann ◽  
Stefan R Schweinberger

We investigated the influence of task-irrelevant speaker variations on speechreading performance. In three experiments with video digitised faces presented either in dynamic, static-sequential, or static mode, participants performed speeded classifications on vowel utterances (German vowels /u/ and /i/). A Garner interference paradigm was used, in which speaker identity was task-irrelevant but could be either correlated, constant, or orthogonal to the vowel uttered. Reaction times for facial speech classifications were slowed by task-irrelevant speaker variations for dynamic stimuli. The results are discussed with reference to distributed models of face perception (Haxby et al, 2000 Trends in Cognitive Sciences4 223–233) and the relevance of both dynamic information and speaker characteristics for speechreading.

2021 ◽  
Vol 35 (1) ◽  
pp. 15-22
Author(s):  
Kohei Fuseda ◽  
Jun’ichi Katayama

Abstract. Interest is a positive emotion related to attention. The event-related brain potential (ERP) probe technique is a useful method to evaluate the level of interest in dynamic stimuli. However, even in the irrelevant probe technique, the probe is presented as a physical stimulus and steals the observer’s attentional resources, although no overt response is required. Therefore, the probe might become a problematic distractor, preventing deep immersion of participants. Heartbeat-evoked brain potential (HEP) is a brain activity, time-locked to a cardiac event. No probe is required to obtain HEP data. Thus, we aimed to investigate whether the HEP can be used to evaluate the level of interest. Twenty-four participants (12 males and 12 females) watched attractive and unattractive individuals of the opposite sex in interesting and uninteresting videos (7 min each), respectively. We performed two techniques each for both the interesting and the uninteresting videos: the ERP probe and the HEP techniques. In the former, somatosensory stimuli were presented as task-irrelevant probes while participants watched videos: frequent (80%) and infrequent (20%) stimuli were presented at each wrist in random order. In the latter, participants watched videos without the probe. The P2 amplitude in response to the somatosensory probe was smaller and the positive wave amplitudes of HEP were larger while watching the videos of attractive individuals than while watching the videos of unattractive ones. These results indicate that the HEP technique is a useful method to evaluate the level of interest without an external probe stimulus.


2022 ◽  
Vol 12 (1) ◽  
Author(s):  
Nihan Alp ◽  
Huseyin Ozkan

AbstractIntegrating the spatiotemporal information acquired from the highly dynamic world around us is essential to navigate, reason, and decide properly. Although this is particularly important in a face-to-face conversation, very little research to date has specifically examined the neural correlates of temporal integration in dynamic face perception. Here we present statistically robust observations regarding the brain activations measured via electroencephalography (EEG) that are specific to the temporal integration. To that end, we generate videos of neutral faces of individuals and non-face objects, modulate the contrast of the even and odd frames at two specific frequencies ($$f_1$$ f 1 and $$f_2$$ f 2 ) in an interlaced manner, and measure the steady-state visual evoked potential as participants view the videos. Then, we analyze the intermodulation components (IMs: ($$nf_1\pm mf_2$$ n f 1 ± m f 2 ), a linear combination of the fundamentals with integer multipliers) that consequently reflect the nonlinear processing and indicate temporal integration by design. We show that electrodes around the medial temporal, inferior, and medial frontal areas respond strongly and selectively when viewing dynamic faces, which manifests the essential processes underlying our ability to perceive and understand our social world. The generation of IMs is only possible if even and odd frames are processed in succession and integrated temporally, therefore, the strong IMs in our frequency spectrum analysis show that the time between frames (1/60 s) is sufficient for temporal integration.


2012 ◽  
Vol 25 (0) ◽  
pp. 99
Author(s):  
Samuel Couth ◽  
Ellen Poliakoff ◽  
Emma Gowen

Reaching and grasping requires integration of visual, proprioceptive and somatosensory inputs. Previous research has shown that manipulating the ‘graspabilty’ of a visual stimulus influences reaction times to that stimulus (e.g., Tucker and Ellis, 1998). Here we explored whether this same effect can be extended to the planning and online control of arm movements. Participants made a mimed reaching movement with their left or right hand depending on the colour of images of affordance (door handles) and control stimuli (a row of dots of similar size and orientation as the door handle). Stimulus onset was manipulated by changing when the grey stimulus changed colour. Stimuli either pointed towards (compatible) or pointed away from (incompatible) the responding hand. Spatially compatible affordance stimuli facilitated reach onset compared to other stimuli and compatibility combinations, replicating previous reaction time studies. This can be attributed to a priming of the motor system by spatially compatible affording items. Results also indicated a larger outwards deviation of reach trajectory for spatially incompatible control stimuli compared to spatially compatible control stimuli, which waned with stimulus onset delay. This reveals an immediate inhibitory effect on reach trajectory, such that outwards movement is over-compensated to negate this incompatible orientation. Overall, we observed that the effect of visual spatial compatibility on reach kinematics differs with the action relevance of the stimulus. We are currently exploring how this multisensory visuomotor effect changes with age.


2021 ◽  
Author(s):  
Rafael Román-Caballero ◽  
Andrea Marotta ◽  
Juan Lupiáñez

Recent research has found that eye gaze and arrows yield opposite congruency effects in a spatial interference paradigm, arrows eliciting faster responses when their direction is congruent with their position (standard congruency effect), and gaze producing faster reaction times for incongruent conditions (reversed congruency effect). But social stimuli observable in ecological contexts tend to be more perceptually complex than simple arrows. The present study aimed to replicate this dissociation using whole faces and a comparable non-social target, formed by arrows embedded in a colored geometric background. Whereas the reversed congruency effect with gaze was replicated, the standard spatial interference with arrows was surprisingly absent. A similar outcome appeared when the contrast between the arrows and the task-irrelevant background increased. The results confirm the robustness of the reversed congruency effect with eyes, regardless of whether they are presented alone or within a face. In addition, the unexpected absence of the spatial conflict with complex arrow targets is a consequence of higher figure-ground segregation demands, which extend the processing of the task-relevant spatial dimension and, in turn, cause the decay of the location code.


2012 ◽  
Vol 25 (0) ◽  
pp. 70
Author(s):  
Zohar Eitan ◽  
Lawrence E. Marks

Garner’s speeded discrimination paradigm is a central tool in studying crossmodal interaction, revealing automatic perceptual correspondences between dimensions in different modalities. To date, however, the paradigm has been used solely with static, unchanging stimuli, limiting its ecological validity. Here, we use Garner’s paradigm to examine interactions between dynamic (time-varying) audiovisual dimensions — pitch direction and vertical visual motion. In Experiment 1, 32 participants rapidly discriminated ascending vs. descending pitch glides, ignoring concurrent visual motion (auditory task), and ascending vs. descending visual motion, ignoring pitch change (visual task). Results in both tasks revealed strong congruence effects, but no Garner interference, an unusual pattern inconsistent with some interpretations of Garner interference. To examine whether this pattern of results is specific to dynamic stimuli, Experiment 2 (testing another 64 participants) used a modified Garner design with two baseline conditions: The irrelevant stimuli were dynamic in one baseline and static in the other, the test stimuli always being dynamic. The results showed significant Garner interference relative to the static baseline (for both the auditory and visual tasks), but not relative to the dynamic baseline. Congruence effects were evident throughout. We suggest that dynamic stimuli reduce attention to and memory of between-trial variation, thereby reducing Garner interference. Because congruence effects depend primarily on within-trial relations, however, congruence effects are unaffected. Results indicate how a classic tool such as Garner’s paradigm, used productively to examine dimensional interactions between static stimuli, may be readily adapted to probe the radically different behavior of dynamic, time-varying multisensory stimuli.


2005 ◽  
Vol 22 (3) ◽  
pp. 371-388 ◽  
Author(s):  
Amandine Penel ◽  
Mari Riess Jones

One reason why music features temporal regularities is that they elicit expectancies about when an event will occur, focusing a listener�s attention around certain points in time. Evidence comes from phoneme monitoring tasks (using reaction times, J. G. Martin, 1979) and pitch and time judgment tasks (using accuracy measures, M. R. Jones, H. Moynihan, N. MacKenzie,& J. Puente, 2002; E. W. Large & M. R. Jones, 1999). Reaction times were faster and accuracy was higher for rhythmically expected elements than for unexpected elements. By contrast, A. Penel and M. R. Jones (2004) recently reported an inversely related finding: faster reaction times for rhythmically unexpected tones, which they labeled a temporal capture effect. The present research examines expectancy versus capture phenomena by using a speeded detection task in which listeners must respond to a lower pitched target located within monotone and isochronous sequences. One interonset interval was shortened or lengthened independently of the target�s position. Temporal irregularities tended to trigger false alarms, suggesting capture effects. Patterns of reaction times showed expectancy effects when the temporally perturbed event preceded the target, but these effects seemed to decrease with time in the sequence. When the target itself was temporally perturbed, some capture was observed, but only when the target came early in the sequence. We conclude that Martin�s (1979) expectancy effects in phoneme monitoring were coarticulatory rather than rhythmical.


2019 ◽  
Author(s):  
Sangtae Ahn ◽  
Caroline Lustenberger ◽  
L. Fredrik Jarskog ◽  
Flavio Fröhlich

AbstractFace perception is a highly developed function of the human visual system. Previous studies of event-related potentials (ERPs) have identified a face-selective ERP component (negative peak at about 170 milliseconds after stimulation onset, N170) in healthy participants. In contrast, patients with schizophrenia exhibit reduced amplitude of the N170, which may represent a pathological deficit in the neurophysiology of face perception. Interestingly, healthy humans with schizophrenia-like experiences (schizotypy) also exhibit abnormal processing of face perception. Yet, it has remained unknown how schizotypy in healthy humans is associated with the neurophysiological substrate of face perception. Here, we recruited 35 participants and assessed their schizotypy by the magical ideation rating scale. We used high-density electroencephalography to obtain ERPs elicited by a set of Mooney faces (face and non-face conditions). We divided the participants into two groups (high and low schizotypy) by a median split of schizotypy scores. We investigated mean reaction times and the N170 component in response to the stimuli. We found significant slowed reaction times and reduced amplitude of the N170 component in response to the face stimuli in the high-schizotypy group. In addition, across the full data set, we found that the schizotypy scores were significantly correlated with both the reaction times and the N170 amplitude. Our results thus support the model of schizotypy as a manifestation of a continuum between healthy individuals and patients with schizophrenia, where the N170 impairment serves as a biomarker for the degree of pathology along this continuum.


Author(s):  
Nadine Lavan ◽  
Harriet Smith ◽  
Li Jiang ◽  
Carolyn McGettigan

AbstractPrevious studies have shown that face-voice matching accuracy is more consistently above chance for dynamic (i.e. speaking) faces than for static faces. This suggests that dynamic information can play an important role in informing matching decisions. We initially asked whether this advantage for dynamic stimuli is due to shared information across modalities that is encoded in articulatory mouth movements. Participants completed a sequential face-voice matching task with (1) static images of faces, (2) dynamic videos of faces, (3) dynamic videos where only the mouth was visible, and (4) dynamic videos where the mouth was occluded, in a well-controlled stimulus set. Surprisingly, after accounting for random variation in the data due to design choices, accuracy for all four conditions was at chance. Crucially, however, exploratory analyses revealed that participants were not responding randomly, with different patterns of response biases being apparent for different conditions. Our findings suggest that face-voice identity matching may not be possible with above-chance accuracy but that analyses of response biases can shed light upon how people attempt face-voice matching. We discuss these findings with reference to the differential functional roles for faces and voices recently proposed for multimodal person perception.


2020 ◽  
Author(s):  
Sabrina Sghirripa ◽  
Lynton Graetz ◽  
Ashley Merkin ◽  
Nigel C Rogasch ◽  
Michael C Ridding ◽  
...  

AbstractAs working memory (WM) is limited in capacity, it is important to direct neural resources towards processing task-relevant information while ignoring distractors. Neural oscillations in the alpha frequency band (8-12 Hz) have been suggested to play a role in the inhibition of task-irrelevant information during WM, although results are mixed, possibly due to differences in the type of WM task employed. Here, we examined the role of alpha power in inhibition of anticipated distractors of varying strength using a modified Sternberg task where the encoding and retention periods were temporally separated. We recorded EEG while 20 young adults completed the task and found: 1) slower reaction times in strong distractor trials compared to weak distractor trials; 2) increased alpha power in posterior regions from baseline prior to presentation of a distractor regardless of condition; and 3) no differences in alpha power between strong and weak distractor conditions. Our results suggest that parieto-occipital alpha power is increased prior to a distractor. However we could not find evidence that alpha power is further modulated by distractor strength.


Sign in / Sign up

Export Citation Format

Share Document