Humans are primarily model-based learners in the two-stage task

Mapping Intimacies ◽

10.1101/682922 ◽

2019 ◽

Cited By ~ 7

Author(s):

Carolina Feher da Silva ◽

Todd A. Hare

Keyword(s):

Simple Model ◽

Habit Formation ◽

Learning Processes ◽

Reward Learning ◽

Learning Models ◽

Two Stage ◽

Model Based ◽

Model Free ◽

Task Instructions ◽

Versus Model

AbstractDistinct model-free and model-based learning processes are thought to drive both typical and dysfunctional behaviours. Data from two-stage decision tasks have seemingly shown that human behaviour is driven by both processes operating in parallel. However, in this study, we show that more detailed task instructions lead participants to make primarily model-based choices that have little, if any, simple model-free influence. We also demonstrate that behaviour in the two-stage task may falsely appear to be driven by a combination of simple model-free and model-based learning if purely model-based agents form inaccurate models of the task because of misconceptions. Furthermore, we report evidence that many participants do misconceive the task in important ways. Overall, we argue that humans formulate a wide variety of learning models. Consequently, the simple dichotomy of model-free versus model-based learning is inadequate to explain behaviour in the two-stage task and connections between reward learning, habit formation, and compulsivity.

Download Full-text

Experimental validation of robot-assisted cardiovascular catheterization: model-based versus model-free control

International Journal of Computer Assisted Radiology and Surgery ◽

10.1007/s11548-018-1757-z ◽

2018 ◽

Vol 13 (6) ◽

pp. 797-804

Author(s):

Xiaomei Wang ◽

Kit-Hang Lee ◽

Denny K. C. Fu ◽

Ziyang Dong ◽

Kui Wang ◽

...

Keyword(s):

Experimental Validation ◽

Robot Assisted ◽

Model Based ◽

Model Free ◽

Versus Model ◽

Model Free Control

Download Full-text

Model-based versus model-free control designs for improving microalgae growth in a closed photobioreactor: Some preliminary comparisons

2016 24th Mediterranean Conference on Control and Automation (MED) ◽

10.1109/med.2016.7535870 ◽

2016 ◽

Cited By ~ 8

Author(s):

Sihem Tebbani ◽

Mariana Titica ◽

Cedric Join ◽

Michel Fliess ◽

Didier Dumur

Keyword(s):

Model Based ◽

Model Free ◽

Versus Model ◽

Model Free Control ◽

Control Designs ◽

Microalgae Growth

Download Full-text

Parallel model-based and model-free reinforcement learning for card sorting performance

Scientific Reports ◽

10.1038/s41598-020-72407-7 ◽

2020 ◽

Vol 10 (1) ◽

Cited By ~ 1

Author(s):

Alexander Steinke ◽

Florian Lange ◽

Bruno Kopp

Keyword(s):

Reinforcement Learning ◽

Cognitive Flexibility ◽

Theoretical Perspective ◽

Behavioral Flexibility ◽

Wisconsin Card Sorting Test ◽

Card Sorting ◽

Learning Models ◽

Model Based ◽

Model Free ◽

Reinforcement Learning Models

Abstract The Wisconsin Card Sorting Test (WCST) is considered a gold standard for the assessment of cognitive flexibility. On the WCST, repeating a sorting category following negative feedback is typically treated as indicating reduced cognitive flexibility. Therefore such responses are referred to as ‘perseveration’ errors. Recent research suggests that the propensity for perseveration errors is modulated by response demands: They occur less frequently when their commitment repeats the previously executed response. Here, we propose parallel reinforcement-learning models of card sorting performance, which assume that card sorting performance can be conceptualized as resulting from model-free reinforcement learning at the level of responses that occurs in parallel with model-based reinforcement learning at the categorical level. We compared parallel reinforcement-learning models with purely model-based reinforcement learning, and with the state-of-the-art attentional-updating model. We analyzed data from 375 participants who completed a computerized WCST. Parallel reinforcement-learning models showed best predictive accuracies for the majority of participants. Only parallel reinforcement-learning models accounted for the modulation of perseveration propensity by response demands. In conclusion, parallel reinforcement-learning models provide a new theoretical perspective on card sorting and it offers a suitable framework for discerning individual differences in latent processes that subserve behavioral flexibility.

Download Full-text

Model-free versus Model-based Volatility Prediction

Journal of Financial Econometrics ◽

10.1093/jjfinec/nbm004 ◽

2007 ◽

Vol 5 (3) ◽

pp. 358-359 ◽

Cited By ~ 16

Author(s):

D. N. Politis

Keyword(s):

Model Based ◽

Model Free ◽

Versus Model

Download Full-text

Model-Based versus Model-Free Implied Volatility: Evidence from North American, European, and Asian Index Option Markets

The Journal of Derivatives ◽

10.3905/jod.2017.24.3.042 ◽

2017 ◽

Vol 24 (3) ◽

pp. 42-68 ◽

Cited By ~ 4

Author(s):

Ernest N. Biktimirov ◽

Chunrong Wang

Keyword(s):

North American ◽

Implied Volatility ◽

Option Markets ◽

Model Based ◽

Model Free ◽

Versus Model ◽

Index Option

Download Full-text

Model-based and model-free Pavlovian reward learning: Revaluation, revision, and revelation

Cognitive Affective & Behavioral Neuroscience ◽

10.3758/s13415-014-0277-8 ◽

2014 ◽

Vol 14 (2) ◽

pp. 473-492 ◽

Cited By ~ 165

Author(s):

Peter Dayan ◽

Kent C. Berridge

Keyword(s):

Reward Learning ◽

Model Based ◽

Model Free

Download Full-text

Acute Stress Effects on Model-Based versus Model-Free Reinforcement Learning

PsycEXTRA Dataset ◽

10.1037/e633262013-263 ◽

2013 ◽

Cited By ~ 1

Author(s):

A. Ross Otto ◽

Candace M. Raio ◽

Elizabeth A. Phelps ◽

Nathaniel Daw

Keyword(s):

Reinforcement Learning ◽

Acute Stress ◽

Stress Effects ◽

Model Based ◽

Model Free ◽

Versus Model

Download Full-text

Model-Based versus Model-Free Implied Volatility: Evidence from US, European, and Asian Index Option Markets

SSRN Electronic Journal ◽

10.2139/ssrn.2080403 ◽

2012 ◽

Author(s):

Ernest N. Biktimirov ◽

Chunrong Wang

Keyword(s):

Implied Volatility ◽

Option Markets ◽

Model Based ◽

Model Free ◽

Versus Model ◽

Index Option

Download Full-text

A note on the analysis of two-stage task results: How changes in task structure affect what model-free and model-based strategies predict about the effects of reward and transition on the stay probability

PLoS ONE ◽

10.1371/journal.pone.0195328 ◽

2018 ◽

Vol 13 (4) ◽

pp. e0195328 ◽

Cited By ~ 8

Author(s):

Carolina Feher da Silva ◽

Todd A. Hare

Keyword(s):

Task Structure ◽

Two Stage ◽

Model Based ◽

Model Free

Download Full-text

A note on the analysis of two-stage task results: how changes in task structure affect what model-free and model-based strategies predict about the effects of reward and transition on the stay probability

10.1101/187856 ◽

2017 ◽

Author(s):

Carolina Feher da Silva ◽

Todd A. Hare

Keyword(s):

Scientific Progress ◽

Task Structure ◽

Decision Task ◽

Two Stage ◽

Model Based ◽

Model Free ◽

Task Dependence ◽

Design Choice ◽

Important Means ◽

The Impact

AbstractMany studies that aim to detect model-free and model-based influences on behavior employ two-stage behavioral tasks of the type pioneered by Daw and colleagues in 2011. Such studies commonly modify existing two-stage decision paradigms in order to better address a given hypothesis, which is an important means of scientific progress. It is, however, critical to fully appreciate the impact of any modified or novel experimental design features on the expected results. Here, we use two concrete examples to demonstrate that relatively small changes in the two-stage task design can substantially change the pattern of actions taken by model-free and model-based agents as a function of the reward outcomes and transitions on previous trials. In the first, we show that, under specific conditions, purely model-free agents will produce the reward by transition interactions typically thought to characterize model-based behavior on a two-stage task. The second example shows that model-based agents’ behavior is driven by a main effect of transition-type in addition to the canonical reward by transition interaction whenever the reward probabilities of the final states do not sum to one. Together, these examples emphasize the task-dependence of model-free and model-based behavior and highlight the benefits of using computer simulations to determine what pattern of results to expect from both model-free and model-based agents performing a given two-stage decision task in order to design choice paradigms and analysis strategies best suited to the current question.

Download Full-text