, 11 tweets, 4 min read Read on Twitter
Extremely proud of @bari_bilal & colleagues' work:
cell.com/neuron/fulltex…

How does the nervous system maximize reward in a dynamic world? In the brain's panoply of functions, these are among its most wonderful. Theory says that decision policies are updated by feedback.
The link between this feedback (discrepancies between predicted and obtained rewards) and firing rates of dopamine neurons is one of systems neuroscience's success stories. But where are these mental models actually stored? How are they remembered in the time between decisions?
@bari_bilal discovered two variables from the theory, relative value and total value, are represented by long-lasting firing rate changes in medial prefrontal cortex neurons. This was very surprising to see at first; I used to think neurons had "baseline" firing rates.
These neurons' relative-value firing rates were used to bias choices. Total-value firing rates were used to bias vigor (response time). Remarkably, relative-value activity was extremely stable, together with choices themselves (no "forgetting").
In contrast, total-value activity decayed slowly, together with response time! These are exactly the ingredients you would need to tell the rest of the brain what choice it should make and how much energy it should expend.
What is the "rest of the brain" in this case? It turned out that these decision variables were weakly represented in premotor cortex, which is part of the final common pathway for action selection. So which part of the brain listens directly to these variables?
@bari_bilal recorded specifically from neurons that projected to dorsomedial striatum (using antidromic stimulation with collision tests; a beautiful technique, in its ability to yield positive evidence from a lack of response).
Both decision variables made it into dorsomedial striatum, which is thought to combine cortical, thalamic, and neuromodulatory input to balance exploration and exploitation of alternatives. Some puzzles: (1) How do the basal ganglia convert decision variables into actions?
(2) How can a network of cortical neurons maintain persistent activity over tens of seconds, carefully tuning the variables stored in particular neurons? (This balance of robustness and flexibility is amazing.)
(3) How can the nervous system adjust how quickly to learn from feedback, and how to balance exploration with exploitation (perhaps using neuromodulators such as serotonin and norepinephrine)?

Contact me if you're interested in solving any of these puzzles together.
We have received generous support from @HopkinsNeuro, @HopkinsMedicine, @NIH, @MQmentalhealth, @BBRFoundation, Klingenstein-Simons, and Whitehall.
Missing some Tweet in this thread?
You can try to force a refresh.

Like this thread? Get email updates or save it to PDF!

Subscribe to Jeremiah Cohen
Profile picture

Get real-time email alerts when new unrolls are available from this author!

This content may be removed anytime!

Twitter may remove this content at anytime, convert it as a PDF, save and print for later use!

Try unrolling a thread yourself!

how to unroll video

1) Follow Thread Reader App on Twitter so you can easily mention us!

2) Go to a Twitter thread (series of Tweets by the same owner) and mention us with a keyword "unroll" @threadreaderapp unroll

You can practice here first or read more on our help page!

Follow Us on Twitter!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just three indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3.00/month or $30.00/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!