Individual differences in dopamine function underlying the balance between model-based and model-free control

Mapping Intimacies ◽

10.1101/860361 ◽

2019 ◽

Cited By ~ 1

Author(s):

Ying Lee ◽

Lorenz Deserno ◽

Nils B. Kroemer ◽

Shakoor Pooseh ◽

Liane Oehme ◽

...

Keyword(s):

Individual Differences ◽

Control Individual ◽

General Notion ◽

Sequential Decision ◽

Model Based ◽

Model Free ◽

Positron Emission ◽

Model Free Control ◽

Induced Changes ◽

The Relationship

AbstractReinforcement learning involves a balance between model-free (MF) and model-based (MB) systems. Recent studies suggest that individuals with either pharmacologically enhanced levels of dopamine (DA) or higher baseline levels of DA exhibit more MB control. However, it remains unknown whether such pharmacological effects depend on baseline DA.Here, we investigated whether effects of L-DOPA on the balance of MB/MF control depend on ventral striatal baseline DA. Sixty participants had two functional magnetic resonance imaging (fMRI) scans while performing a two-stage sequential decision-making task under 150 mg L-DOPA or placebo (counterbalanced), followed by a 4-hour 18F-DOPA positron emission tomography (PET) scan (on a separate occasion).We found an interaction between baseline DA levels and L-DOPA induced changes in MB control. Individuals with higher baseline DA levels showed a greater L-DOPA induced enhancement in MB control. Surprisingly, we found a corresponding drug-by-baseline DA interaction on MF, but not MB learning signals in the ventromedial prefrontal cortex. We did not find a significant interaction between baseline DA levels and L-DOPA effects on MF control or MB/MF balance.In sum, our findings point to a baseline dependency of L-DOPA effects on differential aspects of MB and MF control. Individual differences in DA washout may be an important moderator of L-DOPA effects. Overall, our findings complement the general notion where higher DA levels is related to a greater reliance on MB control. Although the relationship between phasic DA firing and MF learning is conventionally assumed in the animal literature, the relationship between DA and MF control is not as straightforward and requires further clarification.

Download Full-text

Hunger improves reinforcement-driven but not planned action

10.1101/2021.03.24.436435 ◽

2021 ◽

Author(s):

Maaike M.H. van Swieten ◽

Rafal Bogacz ◽

Sanjay G. Manohar

Keyword(s):

Decision Making ◽

Sequential Decision ◽

Context Sensitive ◽

Decision Systems ◽

Model Based ◽

Model Free ◽

Prevalence Of Obesity ◽

Model Free Control ◽

Decision Mode

AbstractHuman decisions can be reflexive or planned, being governed respectively by model-free and model-based learning systems. These two systems might differ in their responsiveness to our needs. Hunger drives us to specifically seek food rewards, but here we ask whether it might have more general effects on these two decision systems. On one hand, the model-based system is often considered flexible and context-sensitive, and might therefore be modulated by metabolic needs. On the other hand, the model-free system’s primitive reinforcement mechanisms may have closer ties to biological drives. Here, we tested participants on a well-established two-stage sequential decision-making task that dissociates the contribution of model-based and model-free control. Hunger enhanced overall performance by increasing model-free control, without affecting model-based control. These results demonstrate a generalised effect of hunger on decision-making that enhances reliance on primitive reinforcement learning, which in some situations translates into adaptive benefits.Significance statementThe prevalence of obesity and eating disorder is steadily increasing. To counteract problems related to eating, people need to make rational decisions. However, appetite may switch us to a different decision mode, making it harder to achieve long-term goals. Here we show that planned and reinforcement-driven actions are differentially sensitive to hunger. Hunger specifically affected reinforcement-driven actions, and did not affect the planning of actions. Our data shows that people behave differently when they are hungry. We also provide a computational model of how the behavioural changes might arise.

Download Full-text

Hunger improves reinforcement-driven but not planned action

Cognitive Affective & Behavioral Neuroscience ◽

10.3758/s13415-021-00921-w ◽

2021 ◽

Author(s):

Maaike M.H. van Swieten ◽

Rafal Bogacz ◽

Sanjay G. Manohar

Keyword(s):

Decision Making ◽

Learning Systems ◽

Sequential Decision Making ◽

Sequential Decision ◽

Context Sensitive ◽

Decision Systems ◽

Model Based ◽

Model Free ◽

Model Free Control ◽

Overall Performance

Download Full-text

Author Correction: Reliance on model-based and model-free control in obesity

Scientific Reports ◽

10.1038/s41598-021-83028-z ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Lieneke K. Janssen ◽

Florian P. Mahner ◽

Florian Schlagenhauf ◽

Lorenz Deserno ◽

Annette Horstmann

Keyword(s):

Model Based ◽

Model Free ◽

Model Free Control

An amendment to this paper has been published and can be accessed via a link at the top of the paper.

Download Full-text

Model-Based and Model-Free Control of DC-DC Converters with High-Order Dynamics and Limited Measurements

IEEE Transactions on Industrial Electronics ◽

10.1109/tie.2020.3001845 ◽

2020 ◽

pp. 1-1

Author(s):

Javier Loranca ◽

Jonathan Carlos Mayo Maldonado ◽

Gerardo Escobar ◽

Carlos Villarreal-Hernandez ◽

Thabiso Maupong ◽

...

Keyword(s):

High Order ◽

Model Based ◽

Model Free ◽

Model Free Control

Download Full-text

Experimental validation of robot-assisted cardiovascular catheterization: model-based versus model-free control

International Journal of Computer Assisted Radiology and Surgery ◽

10.1007/s11548-018-1757-z ◽

2018 ◽

Vol 13 (6) ◽

pp. 797-804

Author(s):

Xiaomei Wang ◽

Kit-Hang Lee ◽

Denny K. C. Fu ◽

Ziyang Dong ◽

Kui Wang ◽

...

Keyword(s):

Experimental Validation ◽

Robot Assisted ◽

Model Based ◽

Model Free ◽

Versus Model ◽

Model Free Control

Download Full-text

Trajectory Tracking With an Aggregation of Domestic Hot Water Heaters: Combining Model-Based and Model-Free Control in a Commercial Deployment

IEEE Transactions on Smart Grid ◽

10.1109/tsg.2018.2890275 ◽

2019 ◽

Vol 10 (5) ◽

pp. 5686-5695 ◽

Cited By ~ 7

Author(s):

Mingxi Liu ◽

Stef Peeters ◽

Duncan S. Callaway ◽

Bert J. Claessens

Keyword(s):

Trajectory Tracking ◽

Hot Water ◽

Domestic Hot Water ◽

Model Based ◽

Water Heaters ◽

Model Free ◽

Model Free Control

Download Full-text

Model-based versus model-free control designs for improving microalgae growth in a closed photobioreactor: Some preliminary comparisons

2016 24th Mediterranean Conference on Control and Automation (MED) ◽

10.1109/med.2016.7535870 ◽

2016 ◽

Cited By ~ 8

Author(s):

Sihem Tebbani ◽

Mariana Titica ◽

Cedric Join ◽

Michel Fliess ◽

Didier Dumur

Keyword(s):

Model Based ◽

Model Free ◽

Versus Model ◽

Model Free Control ◽

Control Designs ◽

Microalgae Growth

Download Full-text

Model-based and model-free control of flexible-link robots: A comparison between representative methods

Applied Mathematical Modelling ◽

10.1016/j.apm.2009.01.012 ◽

2009 ◽

Vol 33 (10) ◽

pp. 3906-3925 ◽

Cited By ~ 35

Author(s):

Gerasimos G. Rigatos

Keyword(s):

Flexible Link ◽

Model Based ◽

Model Free ◽

Model Free Control ◽

Flexible Link Robots

Download Full-text

Increased temporal discounting and reduced model-based control in problem gambling are not substantially modulated by exposure to virtual gambling environments.

10.1101/2021.09.16.459889 ◽

2021 ◽

Author(s):

Luca Rene Bruder ◽

Ben Wagner ◽

David Mathar ◽

Jan Peters

Keyword(s):

Pathological Gambling ◽

Electrodermal Activity ◽

Temporal Discounting ◽

Sports Betting ◽

Reduced Model ◽

Sequential Decision ◽

Model Based Control ◽

Model Based ◽

Model Free ◽

Environment Exposure

High-performance virtual reality (VR) technology has opened new possibilities for the examination of the reactivity towards addiction-related cues (cue-reactivity) in addiction. In this preregistered study (https://osf.io/4mrta), we investigated the subjective, physiological, and behavioral effects of gambling-related VR environment exposure in participants reporting frequent or pathological gambling (n=31) as well as non-gambling controls (n=29). On two separate days, participants explored two rich and navigable VR-environments (neutral: cafe vs. gambling-related: casino/sports-betting facility), while electrodermal activity and heart rate were continuously measured using remote sensors. Within VR, participants performed a temporal discounting task and a sequential decision-making task designed to assess model-based and model-free contributions to behavior. Replicating previous findings, we found strong evidence for increased temporal discounting and reduced model-based control in participants reporting frequent or pathological gambling. Although VR gambling environment exposure increased subjective craving, there was if anything inconclusive evidence for further behavioral or physiological effects. Instead, VR exposure substantially increased physiological arousal (electrodermal activity), across groups and conditions. VR is a promising tool for the investigation of context effects in addiction, but some caution is warranted since effects of real gambling environments might not generally replicate in VR. Future studies should delineate how factors such as cognitive load and ecological validity could be balanced to create a more naturalistic VR experience.

Download Full-text

Έλεγχος εξασφάλισης κύλισης σε σφαιρικά ρομποτικά ακροδάκτυλα

10.12681/eadd/38166 ◽

2016 ◽

Author(s):

Λεωνίδας Δρούκας

Keyword(s):

Performance Control ◽

Model Based Control ◽

Prescribed Performance ◽

Model Based ◽

Model Free ◽

Prescribed Performance Control ◽

Model Free Control ◽

Control Methodology

Όπως στην περίπτωση του ανθρώπου έτσι και στην περίπτωση των δακτύλων ενός ρομποτικού χεριού, η κίνηση μέσω κύλισης των ακροδακτύλων είναι καθοριστική για την επίτευξη της ευσταθούς σύλληψης και του ομαλού χειρισμού ενός αντικειμένου από το ρομποτικό χέρι. Σε αντίθεση με μια κίνηση ολίσθησης πάνω στην επιφάνεια με την οποία έρχεται σε επαφή το ρομποτικό δάκτυλο, η κύλιση του ακροδακτύλου βοηθάει στην ακριβέστερη τοποθέτησή του πάνω στην επιφάνεια αυτή, συμβάλλοντας έτσι στον ομαλότερο συνολικά χειρισμό του εκάστοτε αντικειμένου όπως για παράδειγμα η μετακίνηση ή η περιστροφή του. Στην υπάρχουσα βιβλιογραφία, οι περισσότεροι ελεγκτικοί νόμοι που έχουν προταθεί σχεδιάζονται με βάση ένα εξιδανικευμένο μοντέλο του συστήματος ρομποτικού δακτύλου - επιφάνειας επαφής. Το μοντέλο αυτό περιλαμβάνει ενσωματωμένους τους δεσμούς κύλισης του ακροδακτύλου, με αποτέλεσμα η κίνηση κύλισης να θεωρείται επί της ουσίας ως κάτι δεδομένο και εκ των προτέρων εξασφαλισμένο. Στην πραγματικότητα φυσικά το παραπάνω δεν ισχύει, καθώς οι εκάστοτε υπάρχουσες συνθήκες τριβής που εξαρτώνται από τα υλικά του ακροδακτύλου και της επιφάνειας μπορεί να ευνοούν λίγο ή και καθόλου την κύλιση του πρώτου πάνω στην δεύτερη. Έτσι στις περιπτώσεις αυτές, οι παραπάνω ελεγκτικές μεθοδολογίες είναι πολύ πιθανό να οδηγήσουν το ακροδάκτυλο σε μια κίνηση που θα αποτελείται από συνδυασμένη κύλιση και ολίσθηση ή ακόμη και μόνον από ολίσθηση. Στην διδακτορική αυτή διατριβή, η εξασφάλιση της κύλισης του ακροδακτύλου πάνω σε μια επιφάνεια επαφής δεν θεωρείται δεδομένη από πριν, αλλά λαμβάνεται υπόψιν ως ένας επιπλέον στόχος ελέγχου. Οι ελεγκτές που προτείνονται επιτυγχάνουν την διασφάλιση της κύλισης του σφαιρικού άκρου ενός ρομποτικού δακτύλου σε μια οποιαδήποτε επιφάνεια με άγνωστες ή και μη ευνοϊκές συνθήκες τριβής, συνδυάζοντας παράλληλα την επίτευξη του ευρέως διαδεδομένου στη ρομποτική ελέγχου της θέσης του ακροδακτύλου και του μέτρου της ασκούμενης από αυτό κάθετης δύναμης. Σχεδιάζονται ελεγκτές οι οποίοι είτε βασίζονται στην γνώση του μοντέλου του ρομποτικού δακτύλου (model based control), είτε δεν απαιτούν καμία τέτοια πληροφορία (model free control). Για την σχεδίαση των τελευταίων χρησιμοποιείται η Μέθοδος Ελέγχου Προδιαγεγραμμένης Επίδοσης (Prescribed Performance Control Methodology). Για όλους τους προτεινόμενους ελεγκτές, παρουσιάζεται η θεωρητική σχεδίαση και η προσομοιακή τους αξιολόγηση που αποδεικνύουν την εξασφάλιση του βασικότερου ελεγκτικού μας στόχου, δηλαδή της κύλισης του ρομποτικού ακροδακτύλου πάνω στην επιφάνεια επαφής, ενώ σε ορισμένες περιπτώσεις παρουσιάζονται επιπροσθέτως και πειραματικά αποτελέσματα που ενισχύουν ακόμα περισσότερο την αποτελεσματικότητα των αντίστοιχων προτεινόμενων ελεγκτών. Τέλος, η εξασφάλιση της κύλισης του ακροδακτύλου μέσω ενός από τους προτεινόμενους ελεγκτές αξιοποιείται για την εφαρμογή κατάλληλης εφαπτομενικής δύναμης ώστε να επιτευχθεί ο επιθυμητός χειρισμός χωρίς σύλληψη ενός αντικειμένου από ένα ρομποτικό δάκτυλο. Ειδικότερα, παρουσιάζεται μέσω προσομοιώσεων η επιτυχής μετακίνηση ή/και περιστροφή ενός επίπεδου και ενός κυλινδρικού αντικειμένου μέσω χειρισμού χωρίς σύλληψη από ένα ρομποτικό δάκτυλο με εκμετάλλευση της κύλισης του άκρου του πάνω στο εκάστοτε αντικείμενο.

Download Full-text