scholarly journals Learning fast and slow: deviations from the matching law can reflect an optimal strategy under uncertainty

2017 ◽  
Author(s):  
Kiyohito Iigaya ◽  
Yashar Ahmadian ◽  
Leo P. Sugrue ◽  
Greg S. Corrado ◽  
Yonatan Loewenstein ◽  
...  

AbstractBehavior which deviates from our normative expectations often appears irrational. A classic example concerns the question of how choice should be distributed among multiple alternatives. The so-called matching law predicts that the fraction of choices made to any option should match the fraction of total rewards earned from the option. This choice strategy can maximize reward in a stationary reward schedule. Empirically, however, behavior often deviates from this ideal. While such deviations have often been interpreted as reflecting ‘noisy’, suboptimal, decision-making, here we instead suggest that they reflect a strategy which is adaptive in nonstationary and uncertain environments. We analyze the results of a dynamic foraging task. Animals exhibited significant deviations from matching, and animals turned out to be able to collect more rewards when deviation was larger. We show that this behavior can be understood if one considers that animals had incomplete information about the environments dynamics. In particular, using computational models, we show that in such nonstationary environments, learning on both fast and slow timescales is beneficial. Learning on fast timescales means that an animal can react to sudden changes in the environment, though this inevitably introduces large fluctuations (variance) in value estimates. Concurrently, learning on slow timescales reduces the amplitude of these fluctuations at the price of introducing a bias that causes systematic deviations. We confirm this prediction in data – monkeys indeed solved the bias-variance tradeoff by combining learning on both fast and slow timescales. Our work suggests that multi-timescale learning could be a biologically plausible mechanism for optimizing decisions under uncertainty.

2010 ◽  
Vol 10 (11) ◽  
pp. 26931-26959
Author(s):  
J.-P. Chen ◽  
T.-S. Tsai ◽  
S.-C. Liu

Abstract. Photochemically driven nucleation bursts, which typically occur in a few hours after sunrise, often produce strong aerosol number concentration (ANC) fluctuations. The causes of such ANC spikes were investigated using a detailed aerosol model running in the parcel mode. Two potential mechanisms for the ANC spikes are proposed and simulated. The blocking of actinic flux by scattered clouds can significantly influence new particle production, but this does not cause strong fluctuations in the number of aerosols within sizes greater than the detection limit of our measurements. A more plausible mechanism is the turbulence eddy effect. Strong aerosol nucleation may occur in both updrafts and downdrafts, while the cloud formation at the boundary layer top strongly reduces the number of aerosols. As the number of aerosols is sensitive to turbulence eddy and cloud formation properties, a changing turbulence condition would result in large fluctuations in the evolution of ANC similar to that observed at the surface.


2011 ◽  
Vol 11 (14) ◽  
pp. 7171-7184 ◽  
Author(s):  
J.-P. Chen ◽  
T.-S. Tsai ◽  
S.-C. Liu

Abstract. Photochemically driven nucleation bursts, which typically occur within a few hours after sunrise, often produce strong aerosol number concentration (ANC) fluctuations. The causes of such ANC spikes were investigated using a detailed aerosol model running in the parcel mode. Two potential mechanisms for the ANC spikes were proposed and simulated. The blocking of actinic flux by scattered clouds can significantly influence new particle production, but this does not cause strong fluctuations in the number of aerosols within sizes greater than the detection limit of our measurements. A more plausible mechanism is the turbulence eddy effect. Strong aerosol nucleation may occur in both updrafts and downdrafts, while the cloud formation at the boundary layer top strongly reduces the number of aerosols. As the number of aerosols is sensitive to turbulence eddy and cloud formation properties, a changing turbulence condition would result in large fluctuations in the evolution of ANC similar to that observed at the surface.


2001 ◽  
Vol 13 (12) ◽  
pp. 2743-2761 ◽  
Author(s):  
Ole Jensen

There are numerous reports on rhythmic coupling between separate brain networks. It has been proposed that this rhythmic coupling indicates exchange of information. So far, few computational models have been proposed that explore this principle and its potential computational benefits. Recent results on hippocampal place cells of the rat provide new insight; it has been shown that information about space is encoded by the firing of place cells with respect to the phase of the ongoing theta rhythm. This principle is termed phase coding and suggests that upcoming locations (predicted by the hippocampus) are encoded by cells firing late in the theta cycle, whereas current location is encoded by early firing in the theta cycle. A network reading the hippocampal output must inevitably also receive an oscillatory theta input in order to decipher the phase-coded firing patterns. In this article, I propose a simple physiologically plausible mechanism implemented as an oscillatory network that can decode the hippocampal output. By changing only the phase of the theta input to the decoder, qualitatively different information is transferred: the theta phase determines whether representations of current or upcoming locations are read by the decoder. The proposed mechanism provides a computational principle for information transfer between oscillatory networks and might generalize to brain networks beyond the hippocampal region.


2018 ◽  
Author(s):  
Shiva Farashahi ◽  
Habiba Azab ◽  
Benjamin Hayden ◽  
Alireza Soltani

ABSTRACTMonkeys and other animals appear to share with humans two risk attitudes predicted by prospect theory: an inverse-S-shaped probability weighting function and a steeper utility curve for losses than for gains. These findings suggest that such preferences are stable traits with common neural substrates. We hypothesized instead that animals tailor their preferences to subtle changes in task contexts, making risk attitudes flexible. Previous studies used a limited number of outcomes, trial types, and contexts. To gain a broader perspective, we examined two large datasets of male macaques’ risky choices: one from a task with real (juice) gains and another from a token task with gains and losses. In contrast to previous findings, monkeys were risk-seeking for both gains and losses (i.e. lacked a reflection effect) and showed steeper gain than loss curves (loss-seeking). Utility curves for gains were substantially different in the two tasks. Monkeys showed nearly linear probability weightings in one task and S-shaped ones in the other; neither task produced a consistent inverse-S-shaped curve. To account for these observations, we developed and tested various computational models of the processes involved in the construction of reward value. We found that adaptive differential weighting of prospective gamble outcomes could partially account for the observed differences in the utility functions across the two experiments and thus, provide a plausible mechanism underlying flexible risk attitudes. Together, our results support the idea that risky choices are flexibly constructed at the time of elicitation and place important constraints on neural models of economic choice.


2021 ◽  
Author(s):  
Rui Su ◽  
Jin Zeng ◽  
Ben O'Shaughnessy

Cell entry of SARS-CoV-2 is accomplished by the S2 subunit of the spike S protein on the virion surface by fusion of viral and host cell membranes. Fusion requires the prefusion S2 to transit to its potent, fusogenic form, the fusion intermediate (FI). However, the FI structure is unknown, detailed computational models of the FI are not available, and the mechanisms of fusion and entry remain unclear. Here, we constructed a full-length model of the CoV-2 FI by extrapolating from known CoV-2 pre- and postfusion structures. Atomistic and coarse-grained simulations showed the FI is a remarkably flexible mechanical assembly executing large orientational and extensional fluctuations due to three hinges in the C-terminal base. Fluctuations lead to a large fusion peptide exploration volume and may aid capture of the host cell target membrane and define the clock for fluctuation-triggered refolding and membrane fusion. This work suggests several novel potential drug targets.


F1000Research ◽  
2015 ◽  
Vol 4 ◽  
pp. 147 ◽  
Author(s):  
Jan Kubanek ◽  
Lawrence H. Snyder

When faced with a choice, humans and animals commonly distribute their behavior in proportion to the frequency of payoff of each option. Such behavior is referred to as matching and has been captured by the matching law. However, matching is not a general law of economic choice. Matching in its strict sense seems to be specifically observed in tasks whose properties make matching an optimal or a near-optimal strategy. We engaged monkeys in a foraging task in which matching was not the optimal strategy. Over-matching the proportions of the mean offered reward magnitudes that would yield more reward than matching, yet, surprisingly, the animals almost exactly matched them. To gain insight into this phenomenon, we modeled the animals' decision-making using a mechanistic model. The model accounted for the animals' macroscopic and microscopic choice behavior. When the models' three parameters were not constrained to mimic the monkeys' behavior, the model over-matched the reward proportions and in doing so, harvested substantially more reward than the monkeys. This optimized model revealed a marked bottleneck in the monkeys' choice function that compares the value of the two options. The model featured a very steep value comparison function relative to that of the monkeys. The steepness of the value comparison function had a profound effect on the earned reward and on the level of matching. We implemented this value comparison function through responses of simulated biological neurons. We found that due to the presence of neural noise, steepening the value comparison requires an exponential increase in the number of value-coding neurons. Matching may be a compromise between harvesting satisfactory reward and the high demands placed by neural noise on optimal neural computation.


2016 ◽  
Vol 7 (1) ◽  
Author(s):  
Ken-Ichiro Tsutsui ◽  
Fabian Grabenhorst ◽  
Shunsuke Kobayashi ◽  
Wolfram Schultz

Abstract Neuronal reward valuations provide the physiological basis for economic behaviour. Yet, how such valuations are converted to economic decisions remains unclear. Here we show that the dorsolateral prefrontal cortex (DLPFC) implements a flexible value code based on object-specific valuations by single neurons. As monkeys perform a reward-based foraging task, individual DLPFC neurons signal the value of specific choice objects derived from recent experience. These neuronal object values satisfy principles of competitive choice mechanisms, track performance fluctuations and follow predictions of a classical behavioural model (Herrnstein’s matching law). Individual neurons dynamically encode both, the updating of object values from recently experienced rewards, and their subsequent conversion to object choices during decision-making. Decoding from unselected populations enables a read-out of motivational and decision variables not emphasized by individual neurons. These findings suggest a dynamic single-neuron and population value code in DLPFC that advances from reward experiences to economic object values and future choices.


F1000Research ◽  
2015 ◽  
Vol 4 ◽  
pp. 147
Author(s):  
Jan Kubanek ◽  
Lawrence H. Snyder

When faced with a choice, humans and animals commonly distribute their behavior in proportion to the frequency of payoff of each option. Such behavior is referred to as matching and has been captured by the matching law. However, matching is not a general law of economic choice. Matching in its strict sense seems to be specifically observed in tasks whose properties make matching an optimal or a near-optimal strategy. We engaged monkeys in a foraging task in which matching was not the optimal strategy. Over-matching the proportions of the mean offered reward magnitudes would yield more reward than matching, yet, surprisingly, the animals almost exactly matched them. To gain insight into this phenomenon, we modeled the animals' decision-making using a mechanistic model. The model accounted for the animals' macroscopic and microscopic choice behavior. When the models' three parameters were not constrained to mimic the monkeys' behavior, the model over-matched the reward proportions and in doing so, harvested substantially more reward than the monkeys. This optimized model revealed a marked bottleneck in the monkeys' choice function that compares the value of the two options. The model featured a very steep value comparison function relative to that of the monkeys. The steepness of the value comparison function had a profound effect on the earned reward and on the level of matching. We implemented this value comparison function through responses of simulated biological neurons. We found that due to the presence of neural noise, steepening the value comparison requires an exponential increase in the number of value-coding neurons. Matching may be a compromise between harvesting satisfactory reward and the high demands placed by neural noise on optimal neural computation.


1999 ◽  
Vol 58 (4) ◽  
pp. 273-286 ◽  
Author(s):  
Regula P. Berger ◽  
Alexander Grob ◽  
August Flammer

This study focuses on the importance of social developmental expectations, assessed as emotional and cognitive evaluations regarding the timing and the gender-role conformity of normative developmental tasks. Two central questions were raised. First, to what degree do the timing and the gender-role conformity affect the adults' expectations? Second, how much does the adults' own gender-role orientation (GRO), classified as traditional vs. liberal, affect their expectations? A 4 (timing modus) × 2 (developmental task) × 2 (gender-role conformity)-factorial design was administered to a sample of 140 adults of both sexes, 20 to 81 years old. Coping in time and with gender-role typical career received the most approval. Typical developmental tasks were more approved by persons with a traditional than with a liberal GRO. However, the evaluation of non-typical developmental tasks was not affected by the GRO. The possibility of a shift in normative expectations toward more liberal, diverse, and self-defined female gender-roles is discussed.


Sign in / Sign up

Export Citation Format

Share Document