How does the nervous system maximize reward in a dynamic world? In the brain's panoply of functions, these are among its most wonderful. Theory says that decision policies are updated by feedback.
The link between this feedback (discrepancies between predicted and obtained rewards) and firing rates of dopamine neurons is one of systems neuroscience's success stories. But where are these mental models actually stored? How are they remembered in the time between decisions?