scholarly journals Learning to select actions shapes recurrent dynamics in the corticostriatal system

2019 ◽  
Author(s):  
Christian D. Márton ◽  
Simon R. Schultz ◽  
Bruno B. Averbeck

AbstractLearning to select appropriate actions based on their values is fundamental to adaptive behavior. This form of learning is supported by fronto-striatal systems. The dorsal-lateral prefrontal cortex (dlPFC) and the dorsal striatum (dSTR), which are strongly interconnected, are key nodes in this circuitry. Substantial experimental evidence, including neurophysiological recordings, have shown that neurons in these structures represent key aspects of learning. The computational mechanisms that shape the neurophysiological responses, however, are not clear. To examine this, we developed a recurrent neural network (RNN) model of the dlPFC-dSTR circuit and trained it on an oculomotor sequence learning task. We compared the activity generated by the model to activity recorded from monkey dlPFC and dSTR in the same task. This network consisted of a striatal component which encoded action values, and a prefrontal component which selected appropriate actions. After training, this system was able to autonomously represent and update action values and select actions, thus being able to closely approximate the representational structure in corticostriatal recordings. We found that learning to select the correct actions drove action-sequence representations further apart in activity space, both in the model and in the neural data. The model revealed that learning proceeds by increasing the distance between sequence-specific representations. This makes it more likely that the model will select the appropriate action sequence as learning develops. Our model thus supports the hypothesis that learning in networks drives the neural representations of actions further apart, increasing the probability that the network generates correct actions as learning proceeds. Altogether, this study advances our understanding of how neural circuit dynamics are involved in neural computation, showing how dynamics in the corticostriatal system support task learning.

NeuroImage ◽  
1998 ◽  
Vol 7 (4) ◽  
pp. S883
Author(s):  
P. Peigneux ◽  
P. Maquet ◽  
M. Van der Linden ◽  
T. Meulemans ◽  
C. Degueldre ◽  
...  

2014 ◽  
Vol 111 (3) ◽  
pp. 628-640 ◽  
Author(s):  
Fatemeh Noohi ◽  
Nate B. Boyden ◽  
Youngbin Kwak ◽  
Jennifer Humfleet ◽  
David T. Burke ◽  
...  

Individuals learn new skills at different rates. Given the involvement of corticostriatal pathways in some types of learning, variations in dopaminergic transmission may contribute to these individual differences. Genetic polymorphisms of the catechol- O-methyltransferase (COMT) enzyme and dopamine receptor D2 (DRD2) genes partially determine cortical and striatal dopamine availability, respectively. Individuals who are homozygous for the COMT methionine ( met) allele show reduced cortical COMT enzymatic activity, resulting in increased dopamine levels in the prefrontal cortex as opposed to individuals who are carriers of the valine ( val) allele. DRD2 G-allele homozygotes benefit from a higher striatal dopamine level compared with T-allele carriers. We hypothesized that individuals who are homozygous for COMT met and DRD2 G alleles would show higher rates of motor learning. Seventy-two young healthy females (20 ± 1.9 yr) performed a sensorimotor adaptation task and a motor sequence learning task. A nonparametric mixed model ANOVA revealed that the COMT val-val group demonstrated poorer performance in the sequence learning task compared with the met-met group and showed a learning deficit in the visuomotor adaptation task compared with both met-met and val-met groups. The DRD2 TT group showed poorer performance in the sequence learning task compared with the GT group, but there was no difference between DRD2 genotype groups in adaptation rate. Although these results did not entirely come out as one might predict based on the known contribution of corticostriatal pathways to motor sequence learning, they support the role of genetic polymorphisms of COMT val158met (rs4680) and DRD2 G>T (rs 1076560) in explaining individual differences in motor performance and motor learning, dependent on task type.


2020 ◽  
Author(s):  
Reza Saadati Fard ◽  
Kensuke Arai ◽  
Uri T. Eden ◽  
Emery N. Brown ◽  
Ali Yousefi

AbstractEstablished methods to track the dynamics of neural representations focus at the level of individual neurons for spiking data, and individual or pair of channels for local field potentials. However, our understanding of neural function and computation has moved toward an integrative view, based upon coordinated activity of multiple neural populations across brain areas. To draw network-level inferences of brain function, we propose a new modeling framework that combines the state-space model and cross-spectral matrix estimates – this is called state-space coherence (SSCoh). We define elements of the SSCoh and derive system identification and approximate filter solution for multivariate space processes. We expand SCoh for mixed observation processes, where the observation includes different modalities of neural data including local filed potential and spiking activity. Finally, we show an application of the framework to study neural synchrony across different brain nodes of a task participant performing Stroop task under different distraction levels.


Cortex ◽  
2018 ◽  
Vol 100 ◽  
pp. 84-94 ◽  
Author(s):  
Ádám Takács ◽  
Andrea Kóbor ◽  
Júlia Chezan ◽  
Noémi Éltető ◽  
Zsanett Tárnok ◽  
...  

2015 ◽  
Vol 8 (2) ◽  
pp. 277-282 ◽  
Author(s):  
Karolina Janacsek ◽  
Geza Gergely Ambrus ◽  
Walter Paulus ◽  
Andrea Antal ◽  
Dezso Nemeth

2015 ◽  
Vol 113 (1) ◽  
pp. 4-13 ◽  
Author(s):  
Elizabeth Tricomi ◽  
Karolina M. Lempert

For the consequences of our actions to guide behavior, the brain must represent different types of outcome-related information. For example, an outcome can be construed as negative because an expected reward was not delivered or because an outcome of low value was delivered. Thus behavioral consequences can differ in terms of the information they provide about outcome probability and value. We investigated the role of the striatum in processing probability-based and value-based negative feedback by training participants to associate cues with food rewards and then employing a selective satiety procedure to devalue one food outcome. Using functional magnetic resonance imaging, we examined brain activity related to receipt of expected rewards, receipt of devalued outcomes, omission of expected rewards, omission of devalued outcomes, and expected omissions of an outcome. Nucleus accumbens activation was greater for rewarding outcomes than devalued outcomes, but activity in this region did not correlate with the probability of reward receipt. Activation of the right caudate and putamen, however, was largest in response to rewarding outcomes relative to expected omissions of reward. The dorsal striatum (caudate and putamen) at the time of feedback also showed a parametric increase correlating with the trialwise probability of reward receipt. Our results suggest that the ventral striatum is sensitive to the motivational relevance, or subjective value, of the outcome, while the dorsal striatum codes for a more complex signal that incorporates reward probability. Value and probability information may be integrated in the dorsal striatum, to facilitate action planning and allocation of effort.


Sign in / Sign up

Export Citation Format

Share Document