Divided auditory attention with up to three sound sources: A cocktail party

William A. Yost; Stanley Sheft; Raymond (Toby) Dye

doi:10.1121/1.409277

An fMRI Study to Investigate Auditory Attention: A Model of the Cocktail Party Phenomenon

Magnetic Resonance in Medical Sciences, 2005, Vol 4 (2), pp. 75-82

doi:10.2463/mrms.4.75

Cited By ~ 29

Author(s): Toshiharu NAKAI; Chikako KATO; Kayako MATSUO

Keyword(s): Auditory Attention; Cocktail Party; fMRI Study

Visually guided auditory attention in a dynamic “cocktail-party” speech perception task: ERP evidence for age-related differences

Hearing Research, 2017, Vol 344, pp. 98-108

doi:10.1016/j.heares.2016.11.001

Cited By ~ 5

Author(s): Stephan Getzmann; Edmund Wascher

Keyword(s): Speech Perception; Auditory Attention; Visually Guided; Cocktail Party; Perception Task; Age Related

Auditory attention tracking states in a cocktail party environment can be decoded by deep convolutional neural networks

Journal of Neural Engineering, 2020, Vol 17 (3), pp. 036013

doi:10.1088/1741-2552/ab92b2

Cited By ~ 1

Author(s): Yin Tian; Liang Ma

Keyword(s): Neural Networks; Convolutional Neural Networks; Auditory Attention; Cocktail Party; Deep Convolutional Neural Networks

The Combination of Neural Tracking and Alpha Power Lateralization for Auditory Attention Detection

Journal of Speech Language and Hearing Research, 2021, pp. 1-14

doi:10.1044/2021_jslhr-20-00608

Author(s): Szymon Drgas; Magdalena Blaszak; Anna Przekoracka-Krawczyk

Keyword(s): Auditory Attention; Natural Speech; Alpha Power; Cocktail Party; Combined Method; Acoustic Source; Accurate Identification; Speech Stimuli; Listening Task; Similar Accuracy

Purpose: The acoustic source that is attended to by the listener in a mixture can be identified with a certain accuracy on the basis of their neural response recorded during listening, and various phenomena may be used to detect attention. For example, neural tracking (NT) and alpha power lateralization (APL) may be utilized in order to obtain information concerning attention. However, these methods of auditory attention detection (AAD) are typically tested in different experimental setups, which makes it impossible to compare their accuracy. The aim of this study is to compare the accuracy of AAD based on NT, APL, and their combination for a dichotic natural speech listening task.

Method: Thirteen adult listeners were presented with dichotic speech stimuli and instructed to attend to one of them. Electroencephalogram of the subjects was continuously recorded during the experiment using a set of 32 active electrodes. The accuracy of AAD was evaluated for trial lengths of 50, 25, and 12.5 s. AAD was tested for various parameters of NT- and APL-based modules.

Results: The obtained results suggest that NT of natural running speech provides similar accuracy to APL. The statistically significant improvement of the accuracy of AAD using a combined method has been observed not only for the longest duration of test samples (50 s, p = .005) but also for shorter ones (25 s, p = .011).

Conclusions: It seems that the combination of standard NT and APL significantly increases the effectiveness of accurate identification of the traced signal perceived by a listener under dichotic conditions. It has been demonstrated that, under certain conditions, the combination of NT and APL may provide a benefit for AAD in cocktail party scenarios.

Comparison of speech envelope extraction methods for EEG-based auditory attention detection in a cocktail party scenario

2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2015

doi:10.1109/embc.2015.7319552

Cited By ~ 10

Author(s): Wouter Biesmans; Jonas Vanthornhout; Jan Wouters; Marc Moonen; Tom Francart; ...

Keyword(s): Extraction Methods; Auditory Attention; Cocktail Party; Envelope Extraction; Speech Envelope

Where is the cocktail party? Decoding locations of attended and unattended moving sound sources using EEG

NeuroImage, 2020, Vol 205, pp. 116283

doi:10.1016/j.neuroimage.2019.116283

Cited By ~ 7

Author(s): Adam Bednar; Edmund C. Lalor

Keyword(s): Cocktail Party; Sound Sources

A simulated “cocktail party” with up to three sound sources

Perception & Psychophysics, 1996, Vol 58 (7), pp. 1026-1036

doi:10.3758/bf03206830

Cited By ~ 73

Author(s): William A. Yost; Raymond H. Dye; Stanley Sheft

Keyword(s): Cocktail Party; Sound Sources

Dynamic Oscillatory Processes Governing Cued Orienting and Allocation of Auditory Attention

Journal of Cognitive Neuroscience, 2013, Vol 25 (11), pp. 1926-1943

doi:10.1162/jocn_a_00452

Cited By ~ 47

Author(s): Jyrki Ahveninen; Samantha Huang; John W. Belliveau; Wei-Tang Chang; Matti Hämäläinen

Keyword(s): Frontal Cortex; Correlation Effect; Posterior Cingulate Cortex; Anterior Insula; Auditory Attention; Gamma Activity; Sound Sources; Voluntary Engagement; Beta Power; The Right

In everyday listening situations, we need to constantly switch between alternative sound sources and engage attention according to cues that match our goals and expectations. The exact neuronal bases of these processes are poorly understood. We investigated oscillatory brain networks controlling auditory attention using cortically constrained fMRI-weighted magnetoencephalography/EEG source estimates. During consecutive trials, participants were instructed to shift attention based on a cue, presented in the ear where a target was likely to follow. To promote audiospatial attention effects, the targets were embedded in streams of dichotically presented standard tones. Occasionally, an unexpected novel sound occurred opposite to the cued ear to trigger involuntary orienting. According to our cortical power correlation analyses, increased frontoparietal/temporal 30–100 Hz gamma activity at 200–1400 msec after cued orienting predicted fast and accurate discrimination of subsequent targets. This sustained correlation effect, possibly reflecting voluntary engagement of attention after the initial cue-driven orienting, spread from the TPJ, anterior insula, and inferior frontal cortices to the right FEFs. Engagement of attention to one ear resulted in a significantly stronger increase of 7.5–15 Hz alpha in the ipsilateral than contralateral parieto-occipital cortices 200–600 msec after the cue onset, possibly reflecting cross-modal modulation of the dorsal visual pathway during audiospatial attention. Comparisons of cortical power patterns also revealed significant increases of sustained right medial frontal cortex theta power, right dorsolateral pFC and anterior insula/inferior frontal cortex beta power, and medial parietal cortex and posterior cingulate cortex gamma activity after cued versus novelty-triggered orienting (600–1400 msec). 
Our results reveal sustained oscillatory patterns associated with voluntary engagement of auditory spatial attention, with the frontoparietal and temporal gamma increases being best predictors of subsequent behavioral performance.

Auditory attention switching with listening difficulty: Behavioral and pupillometric measures

2019

doi:10.31234/osf.io/2ubyj

Author(s): Daniel McCloy; Eric Larson; Adrian K.C. Lee

Keyword(s): Auditory Attention; Stream Segregation; Self Report; Attention Switching; Listening Effort; Sound Sources; Experimental Conditions; Switching Attention; Crowded Environments; Target Detection Task

Pupillometry has emerged as a useful tool for studying listening effort. Past work involving listeners with normal audiological thresholds has shown that switching attention between competing talker streams evokes pupil dilation indicative of listening effort [McCloy et al. (2017), J. Acoust. Soc. Am. 141(4):2440]. The current experiment examines behavioral and pupillometric data from a two-stream target detection task requiring attention-switching between auditory streams, in two participant groups: audiometrically normal listeners who self-report difficulty localizing sound sources and/or understanding speech in reverberant or acoustically crowded environments, and their age-matched controls who do not report such problems. Three experimental conditions varied the number and type of stream segregation cues available. Participants who reported listening difficulty showed both behavioral and pupillometric signs of increased effort compared to controls, especially in trials where listeners had to switch attention between streams, or trials where only a single stream segregation cue was available.

Selective auditory attention detection using dynamic learning systems: The study of RNN and reinforcement learning

2021

doi:10.1101/2021.02.18.431748

Author(s): Masoud Geravanchizadeh; Hossein Roushan

Keyword(s): Reinforcement Learning; Detection System; Auditory Attention; Final Decision; Learning Approaches; Cocktail Party; Dynamic Learning; Learning Stage; Q Learning; Markov Decision

The cocktail party phenomenon describes the ability of the human brain to focus auditory attention on a particular stimulus while ignoring other acoustic events. Selective auditory attention detection (SAAD) is an important issue in the development of brain-computer interface systems and cocktail party processors. This paper proposes a new dynamic attention detection system to process the temporal evolution of the input signal. In the proposed dynamic system, after preprocessing of the input signals, the probabilistic state space of the system is formed. Then, in the learning stage, different dynamic learning methods, including recurrent neural network (RNN) and reinforcement learning (Markov decision process (MDP) and deep Q-learning), are applied to make the final decision as to the attended speech. Among the different dynamic learning approaches, the evaluation results show that the deep Q-learning approach (MDP+RNN) provides the highest classification accuracy (94.2%) with the least detection delay. The proposed SAAD system is advantageous in the sense that the detection of attention is performed dynamically for the sequential inputs. The system also has the potential to be used in scenarios where the listener's attention may switch over time in the presence of various acoustic events.
