Context-dependent multiplexing by individual VTA dopamine neurons
AbstractDopamine (DA) neurons of the ventral tegmental area (VTA) track external cues and rewards to generate a reward prediction error (RPE) signal during Pavlovian conditioning. Here we explored how RPE is implemented for a self-paced, operant task in freely moving mice. The animal could trigger a reward-predicting cue by remaining in a specific location of an operant box for a brief time before moving to a spout for reward collection. In vivo single-unit recordings revealed phasic responses to the cue and reward in correct trials, while with failures the activity paused, reflecting positive and negative error signals of a reward prediction. In addition, a majority of VTA DA neurons also encoded parameters of the goal-directed action (e.g. movement velocity, acceleration, distance to goal and licking) by changes in tonic firing rate. Such multiplexing of individual neurons was only apparent while the mouse was engaged in the task. We conclude that a multiplexed internal representation during the task modulates VTA DA neuron activity, indicating a multimodal prediction error that shapes behavioral adaptation of a self-paced goal-directed action.