Monitoring Dopamine Release During Reward Learning
published: Aug. 9, 2010, recorded: May 2009, views: 213
Report a problem or upload filesIf you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
In the process of learning, we “sometimes make more deliberative choices, and sometimes make more visceral ones,” says Paul Phillips. These are “semantic terms we intuitively know,” and scientists have become well-versed in creating tasks for animals and humans that demonstrate how these different kinds of learning (analytical- reflective vs. impulsive -reflexive) play out. Phillips has been trying to track dopamine release (a neurotransmitter linked to learning) in such divergent learning processes.
The model-based learning system pairs a stimulus with a reward, and after training, a subject creates a “model representation of the world” that allows it to predict the appearance of the reward after the stimulus. In contrast, the model-free system of learning “uses a one dimensional value that gets updated” as the subject accumulates experience and begins to weigh the difference between expectation and the reward that’s actually delivered.
One of Phillips’ studies involved implanting electrodes for measuring dopamine release in the striatum of rats participating in different types of learning tasks. Phillips work shows time-dependent changes in the release of dopamine during classic conditioning tasks. At first the dopamine spikes only after the reward, but over time, the animal learns it will receive the reward after the stimulus (a light cue), and soon, the cue alone elicits the dopamine response. Phillips has also found that two distinct parts of the striatum register increased dopamine at different points in the training. “This is quite interesting in terms of thinking about what these brain regions have been implicated in, and specifically the idea of habits in the dorsal striatum.”
The results of some research suggest that during these learning processes, all the dopamine neurons should be firing. But Phillips says this doesn’t explain why “we’re getting signals in (one) part of the brain but not in the other.” Phillips speculates that dopamine’s “arch nemesis acetylcholine” might be inhibiting dopamine release in certain parts of the striatum during specific phases of reinforcement learning.
Phillips has also been working with selectively bred lines of rats, which seem to exhibit behaviors, and dopamine release patterns, suggestive of two distinct learning strategies. He concludes that “associations between stimuli and rewards can be learned through multiple strategies with different computational demands,” and he doesn’t believe that animals “are locked into one strategy or another.”
Link this pageWould you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !