scholarly journals Biased belief updating and suboptimal choice in foraging decisions

2019 ◽  
Author(s):  
Neil Garrett ◽  
Nathaniel D. Daw

AbstractIn many choice scenarios, including prey, employment, and mate search, options are not encountered simultaneously and so cannot be directly compared. Deciding which ones optimally to engage, and which to forego, requires developing accurate beliefs about the overall distribution of prospects. However, the role of learning in this process – and how biases due to learning may affect choice – are poorly understood. In three experiments, we adapted a classic prey selection task from foraging theory to examine how individuals kept track of an environment’s reward rate and adjusted their choices in response to its fluctuations. In accord with qualitative predictions from optimal foraging models, participants adjusted their selectivity to the richness of the environment: becoming less selective in poorer environments and increasing acceptance of less profitable options. These preference shifts were observed not just in response to global (between block) manipulations of the offer distributions, but also to local, trial-by-trial offer variation within a block, suggesting an incremental learning rule. Further offering evidence into the learning process, these preference changes were more pronounced when the environment improved compared to when it deteriorated. All these observations were best explained by a trial-by-trial learning model in which participants estimate the overall reward rate, but with upward vs. downward changes controlled by separate learning rates. A failure to adjust expectations sufficiently when an environment becomes worse leads to suboptimal choices: options that are valuable given the environmental conditions are rejected in the false expectation that better options will materialize. These findings offer a previously unappreciated parallel in the serial choice setting of observations of asymmetric updating and resulting biased (often overoptimistic) estimates in other domains.

Author(s):  
Ziqing Yao ◽  
Xuanyi Lin ◽  
Xiaoqing Hu

Abstract When people are confronted with feedback that counters their prior beliefs, they preferentially rely on desirable rather than undesirable feedback in belief updating, i.e. an optimism bias. In two pre-registered EEG studies employing an adverse life event probability estimation task, we investigated the neurocognitive processes that support the formation and the change of optimism biases in immediate and 24 h delayed tests. We found that optimistic belief updating biases not only emerged immediately but also became significantly larger after 24 h, suggesting an active role of valence-dependent offline consolidation processes in the change of optimism biases. Participants also showed optimistic memory biases: they were less accurate in remembering undesirable than desirable feedback probabilities, with inferior memories of undesirable feedback associated with lower belief updating in the delayed test. Examining event-related brain potentials (ERPs) revealed that desirability of feedback biased initial encoding: desirable feedback elicited larger P300s than undesirable feedback, with larger P300 amplitudes predicting both higher belief updating and memory accuracies. These results suggest that desirability of feedback could bias both online and offline memory-related processes such as encoding and consolidation, with both processes contributing to the formation and change of optimism biases.


1994 ◽  
Vol 6 (2) ◽  
pp. 255-269 ◽  
Author(s):  
Geoffrey J. Goodhill ◽  
Harry G. Barrow

The effect of different kinds of weight normalization on the outcome of a simple competitive learning rule is analyzed. It is shown that there are important differences in the representation formed depending on whether the constraint is enforced by dividing each weight by the same amount (“divisive enforcement”) or subtracting a fixed amount from each weight (“subtractive enforcement”). For the divisive cases weight vectors spread out over the space so as to evenly represent “typical” inputs, whereas for the subtractive cases the weight vectors tend to the axes of the space, so as to represent “extreme” inputs. The consequences of these differences are examined.


2018 ◽  
Vol 30 (12) ◽  
pp. 1803-1820 ◽  
Author(s):  
Marieke Jepma ◽  
Stephen B. R. E. Brown ◽  
Peter R. Murphy ◽  
Stephany C. Koelewijn ◽  
Boukje de Vries ◽  
...  

To make optimal predictions in a dynamic environment, the impact of new observations on existing beliefs—that is, the learning rate—should be guided by ongoing estimates of change and uncertainty. Theoretical work has proposed specific computational roles for various neuromodulatory systems in the control of learning rate, but empirical evidence is still sparse. The aim of the current research was to examine the role of the noradrenergic and cholinergic systems in learning rate regulation. First, we replicated our recent findings that the centroparietal P3 component of the EEG—an index of phasic catecholamine release in the cortex—predicts trial-to-trial variability in learning rate and mediates the effects of surprise and belief uncertainty on learning rate (Study 1, n = 17). Second, we found that pharmacological suppression of either norepinephrine or acetylcholine activity produced baseline-dependent effects on learning rate following nonobvious changes in an outcome-generating process (Study 1). Third, we identified two genes, coding for α2A receptor sensitivity ( ADRA2A) and norepinephrine reuptake ( NET), as promising targets for future research on the genetic basis of individual differences in learning rate (Study 2, n = 137). Our findings suggest a role for the noradrenergic and cholinergic systems in belief updating and underline the importance of studying interactions between different neuromodulatory systems.


2005 ◽  
Vol 17 (4) ◽  
pp. 859-879 ◽  
Author(s):  
Rudy Guyonneau ◽  
Rufin VanRullen ◽  
Simon J. Thorpe

Spike timing-dependent plasticity (STDP) is a learning rule that modifies the strength of a neuron's synapses as a function of the precise temporal relations between input and output spikes. In many brains areas, temporal aspects of spike trains have been found to be highly reproducible. How will STDP affect a neuron's behavior when it is repeatedly presented with the same input spike pattern? We show in this theoretical study that repeated inputs systematically lead to a shaping of the neuron's selectivity, emphasizing its very first input spikes, while steadily decreasing the postsynaptic response latency. This was obtained under various conditions of background noise, and even under conditions where spiking latencies and firing rates, or synchrony, provided conflicting informations. The key role of first spikes demonstrated here provides further support for models using a single wave of spikes to implement rapid neural processing.


2014 ◽  
Vol 68 (7) ◽  
pp. 1205-1213 ◽  
Author(s):  
Pau Sunyer ◽  
Josep Maria Espelta ◽  
Raúl Bonal ◽  
Alberto Muñoz

2006 ◽  
Vol 209 (21) ◽  
pp. 4295-4303 ◽  
Author(s):  
J. E. Layne ◽  
P. W. Chen ◽  
C. Gilbert

BIOS ◽  
2013 ◽  
Vol 84 (1) ◽  
pp. 8-13
Author(s):  
Harrison Taylor ◽  
John P. Ludlam

Cognition ◽  
2007 ◽  
Vol 105 (3) ◽  
pp. 704-714 ◽  
Author(s):  
José C. Perales ◽  
Andrés Catena ◽  
Antonio Maldonado ◽  
Antonio Cándido

Sign in / Sign up

Export Citation Format

Share Document