scholarly journals Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales

2019 ◽  
Author(s):  
Dimitrije Marković ◽  
Thomas Goschke ◽  
Stefan J. Kiebel

AbstractCognitive control is typically understood as a set of mechanisms which enable humans to reach goals that require integrating the consequences of actions over longer time scales. Importantly, using routine beheavior or making choices beneficial only at a short time scales would prevent one from attaining these goals. During the past two decades, researchers have proposed various computational cognitive models that successfully account for behaviour related to cognitive control in a wide range of laboratory tasks. As humans operate in a dynamic and uncertain environment, making elaborate plans and integrating experience over multiple time scales is computationally expensive, the specific question of how uncertain consequences at different time scales are integrated into adaptive decisions remains poorly understood. Here, we propose that precisely the problem of integrating experience and forming elaborate plans over multiple time scales is a key component for better understanding how human agents solve cognitive control dilemmas such as the exploration-exploitation dilemma. In support of this conjecture, we present a computational model of probabilistic inference over hidden states and actions, which are represented as a hierarchy of time scales. Simulations of goal-reaching agents instantiating the model in an uncertain and dynamic task environment show how the exploration-exploitation dilemma may be solved by inferring meta-control states which adapt behaviour to changing contexts.

Author(s):  
Dimitrije Marković ◽  
Thomas Goschke ◽  
Stefan J. Kiebel

AbstractCognitive control is typically understood as a set of mechanisms that enable humans to reach goals that require integrating the consequences of actions over longer time scales. Importantly, using routine behaviour or making choices beneficial only at short time scales would prevent one from attaining these goals. During the past two decades, researchers have proposed various computational cognitive models that successfully account for behaviour related to cognitive control in a wide range of laboratory tasks. As humans operate in a dynamic and uncertain environment, making elaborate plans and integrating experience over multiple time scales is computationally expensive. Importantly, it remains poorly understood how uncertain consequences at different time scales are integrated into adaptive decisions. Here, we pursue the idea that cognitive control can be cast as active inference over a hierarchy of time scales, where inference, i.e., planning, at higher levels of the hierarchy controls inference at lower levels. We introduce the novel concept of meta-control states, which link higher-level beliefs with lower-level policy inference. Specifically, we conceptualize cognitive control as inference over these meta-control states, where solutions to cognitive control dilemmas emerge through surprisal minimisation at different hierarchy levels. We illustrate this concept using the exploration-exploitation dilemma based on a variant of a restless multi-armed bandit task. We demonstrate that beliefs about contexts and meta-control states at a higher level dynamically modulate the balance of exploration and exploitation at the lower level of a single action. Finally, we discuss the generalisation of this meta-control concept to other control dilemmas.


2006 ◽  
Vol 96 (6) ◽  
pp. 3448-3464 ◽  
Author(s):  
Giancarlo La Camera ◽  
Alexander Rauch ◽  
David Thurbon ◽  
Hans-R. Lüscher ◽  
Walter Senn ◽  
...  

Neural dynamic processes correlated over several time scales are found in vivo, in stimulus-evoked as well as spontaneous activity, and are thought to affect the way sensory stimulation is processed. Despite their potential computational consequences, a systematic description of the presence of multiple time scales in single cortical neurons is lacking. In this study, we injected fast spiking and pyramidal (PYR) neurons in vitro with long-lasting episodes of step-like and noisy, in-vivo-like current. Several processes shaped the time course of the instantaneous spike frequency, which could be reduced to a small number (1–4) of phenomenological mechanisms, either reducing (adapting) or increasing (facilitating) the neuron's firing rate over time. The different adaptation/facilitation processes cover a wide range of time scales, ranging from initial adaptation (<10 ms, PYR neurons only), to fast adaptation (<300 ms), early facilitation (0.5–1 s, PYR only), and slow (or late) adaptation (order of seconds). These processes are characterized by broad distributions of their magnitudes and time constants across cells, showing that multiple time scales are at play in cortical neurons, even in response to stationary stimuli and in the presence of input fluctuations. These processes might be part of a cascade of processes responsible for the power-law behavior of adaptation observed in several preparations, and may have far-reaching computational consequences that have been recently described.


2021 ◽  
Vol 17 (3) ◽  
pp. e1008866
Author(s):  
Amadeus Maes ◽  
Mauricio Barahona ◽  
Claudia Clopath

Sequential behaviour is often compositional and organised across multiple time scales: a set of individual elements developing on short time scales (motifs) are combined to form longer functional sequences (syntax). Such organisation leads to a natural hierarchy that can be used advantageously for learning, since the motifs and the syntax can be acquired independently. Despite mounting experimental evidence for hierarchical structures in neuroscience, models for temporal learning based on neuronal networks have mostly focused on serial methods. Here, we introduce a network model of spiking neurons with a hierarchical organisation aimed at sequence learning on multiple time scales. Using biophysically motivated neuron dynamics and local plasticity rules, the model can learn motifs and syntax independently. Furthermore, the model can relearn sequences efficiently and store multiple sequences. Compared to serial learning, the hierarchical model displays faster learning, more flexible relearning, increased capacity, and higher robustness to perturbations. The hierarchical model redistributes the variability: it achieves high motif fidelity at the cost of higher variability in the between-motif timings.


2020 ◽  
Author(s):  
Amadeus Maes ◽  
Mauricio Barahona ◽  
Claudia Clopath

ABSTRACTSequential behaviour is often compositional and organised across multiple time scales: a set of individual elements developing on short time scales (motifs) are combined to form longer functional sequences (syntax). Such organisation leads to a natural hierarchy that can be used advantageously for learning, since the motifs and the syntax can be acquired independently. Despite mounting experimental evidence for hierarchical structures in neuroscience, models for temporal learning based on neuronal networks have mostly focused on serial methods. Here, we introduce a network model of spiking neurons with a hierarchical organisation aimed at sequence learning on multiple time scales. Using biophysically motivated neuron dynamics and local plasticity rules, the model can learn motifs and syntax independently. Furthermore, the model can relearn sequences efficiently and store multiple sequences. Compared to serial learning, the hierarchical model displays faster learning, more flexible relearning, increased capacity, and higher robustness to perturbations. The hierarchical model redistributes the variability: it achieves high motif fidelity at the cost of higher variability in the between-motif timings.


2018 ◽  
Author(s):  
Yan Liang ◽  
◽  
Daniele J. Cherniak ◽  
Chenguang Sun

2019 ◽  
Vol 11 (4) ◽  
pp. 1163 ◽  
Author(s):  
Melissa Bedinger ◽  
Lindsay Beevers ◽  
Lila Collet ◽  
Annie Visser

Climate change is a product of the Anthropocene, and the human–nature system in which we live. Effective climate change adaptation requires that we acknowledge this complexity. Theoretical literature on sustainability transitions has highlighted this and called for deeper acknowledgment of systems complexity in our research practices. Are we heeding these calls for ‘systems’ research? We used hydrohazards (floods and droughts) as an example research area to explore this question. We first distilled existing challenges for complex human–nature systems into six central concepts: Uncertainty, multiple spatial scales, multiple time scales, multimethod approaches, human–nature dimensions, and interactions. We then performed a systematic assessment of 737 articles to examine patterns in what methods are used and how these cover the complexity concepts. In general, results showed that many papers do not reference any of the complexity concepts, and no existing approach addresses all six. We used the detailed results to guide advancement from theoretical calls for action to specific next steps. Future research priorities include the development of methods for consideration of multiple hazards; for the study of interactions, particularly in linking the short- to medium-term time scales; to reduce data-intensivity; and to better integrate bottom–up and top–down approaches in a way that connects local context with higher-level decision-making. Overall this paper serves to build a shared conceptualisation of human–nature system complexity, map current practice, and navigate a complexity-smart trajectory for future research.


2021 ◽  
Vol 40 (9) ◽  
pp. 2139-2154
Author(s):  
Caroline E. Weibull ◽  
Paul C. Lambert ◽  
Sandra Eloranta ◽  
Therese M. L. Andersson ◽  
Paul W. Dickman ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document