Joint Modeling of Reaction Times and Choice Improves Parameter Identifiability in Reinforcement Learning Models

2018 ◽  
Author(s):  
Ian C. Ballard ◽  
Samuel M. McClure

Abstract

Background: Reinforcement learning models provide excellent descriptions of learning in multiple species across a variety of tasks. Many researchers are interested in relating parameters of reinforcement learning models to neural measures, psychological variables, or experimental manipulations. We demonstrate that parameter identification is difficult because a range of parameter values provide approximately equal-quality fits to data. This identification problem has a large impact on power: we show that a researcher who wants to detect a medium-sized correlation (r = .3) with 80% power between a variable and learning rate must collect 60% more subjects than specified by a typical power analysis in order to account for the noise introduced by model fitting.

New Method: We derive a Bayesian optimal model-fitting technique that takes advantage of information contained in choices and reaction times to constrain parameter estimates.

Results: We show using simulation and empirical data that this method substantially improves the ability to recover learning rates.

Comparison with Existing Methods: We compare this method against the use of Bayesian priors. We show in simulations that the combined use of Bayesian priors and reaction times confers the highest parameter identifiability. However, in real data, where the priors may have been misspecified, the use of Bayesian priors interferes with the ability of reaction time data to improve parameter identifiability.

Conclusions: We present a simple technique that takes advantage of readily available data to substantially improve the quality of inferences that can be drawn from parameters of reinforcement learning models.

Highlights:
- Parameters of reinforcement learning models are particularly difficult to estimate
- Incorporating reaction times into model fitting improves parameter identifiability
- Bayesian weighting of choice and reaction times improves the power of analyses assessing learning rate
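The core idea of constraining parameter estimates with reaction times can be sketched as a joint likelihood. The following is a minimal illustration, not the authors' derivation: it pairs a standard softmax choice likelihood for a two-armed-bandit Q-learning agent with a hypothetical log-normal reaction-time likelihood whose mean decreases with the absolute value difference (easier choices are faster). The function name and the specific RT model are assumptions for illustration only.

```python
import numpy as np

def joint_nll(params, choices, rewards, rts):
    """Negative log-likelihood combining choice and RT data for a
    two-armed bandit. A simplified sketch: choices follow a softmax
    over Q-values, and RTs are log-normal with a mean that shrinks
    as |Q0 - Q1| grows (harder choices are slower)."""
    alpha, beta, rt_base, rt_slope, rt_sd = params
    q = np.zeros(2)                         # action values
    nll = 0.0
    for c, r, rt in zip(choices, rewards, rts):
        # softmax choice likelihood
        p = np.exp(beta * q) / np.exp(beta * q).sum()
        nll -= np.log(p[c])
        # log-normal RT log-likelihood; mean decreases with value gap
        mu = rt_base - rt_slope * abs(q[0] - q[1])
        z = (np.log(rt) - mu) / rt_sd
        nll -= -0.5 * z**2 - np.log(rt * rt_sd * np.sqrt(2 * np.pi))
        # prediction-error update of the chosen option's value
        q[c] += alpha * (r - q[c])
    return nll
```

In practice this objective would be minimized (e.g., with `scipy.optimize.minimize`) so that both data streams jointly constrain the learning rate `alpha`, which is the mechanism the abstract describes.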

eLife ◽  
2021 ◽  
Vol 10 ◽  
Author(s):  
Nina Rouhani ◽  
Yael Niv

Memory helps guide behavior, but which experiences from the past are prioritized? Classic models of learning posit that events associated with unpredictable outcomes, and, paradoxically, events associated with predictable outcomes, attract more attention and learning. Here, we test reinforcement learning and subsequent memory for such events, treating signed and unsigned reward prediction errors (RPEs), experienced at the reward-predictive cue or at reward outcome, as drivers of these two seemingly contradictory signals. By fitting reinforcement learning models to behavior, we find that both types of RPE contribute to learning by modulating a dynamically changing learning rate. We further characterize the effects of these RPE signals on memory and show that both signed and unsigned RPEs enhance memory, in line with midbrain dopamine and locus coeruleus modulation of hippocampal plasticity, thereby reconciling separate findings in the literature.
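A dynamically changing learning rate driven by unsigned RPEs is commonly formalized in Pearce-Hall style, where a running "associability" term tracks recent surprise and scales the value update. The sketch below is a generic illustration of that idea under stated assumptions, not the authors' fitted model; the parameter names `kappa` and `eta` are hypothetical.

```python
def pearce_hall_update(v, associability, reward, kappa=0.3, eta=0.5):
    """One Pearce-Hall-style update. The effective learning rate is
    kappa * associability, and associability itself is updated toward
    the unsigned prediction error |RPE| at rate eta, so surprising
    outcomes speed up subsequent learning."""
    rpe = reward - v                              # signed RPE
    v_new = v + kappa * associability * rpe       # value update
    # associability tracks recent unsigned surprise
    assoc_new = (1 - eta) * associability + eta * abs(rpe)
    return v_new, assoc_new
```

Under this scheme the signed RPE drives the direction of learning while the unsigned RPE modulates its speed, matching the two roles the abstract attributes to these signals.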


Decision ◽  
2016 ◽  
Vol 3 (2) ◽  
pp. 115-131 ◽  
Author(s):  
Helen Steingroever ◽  
Ruud Wetzels ◽  
Eric-Jan Wagenmakers

2019 ◽  
Author(s):  
Laura Weidinger ◽  
Andrea Gradassi ◽  
Lucas Molleman ◽  
Wouter van den Bos
