↓ Skip to main content

Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes

Overview of attention for article published in Frontiers in Computational Neuroscience, January 2010
Altmetric Badge

Mentioned by

twitter
2 X users

Citations

dimensions_citation
161 Dimensions

Readers on

mendeley
450 Mendeley
citeulike
5 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes
Published in
Frontiers in Computational Neuroscience, January 2010
DOI 10.3389/fncom.2010.00146
Pubmed ID
Authors

Rajesh P. N. Rao

Abstract

A fundamental problem faced by animals is learning to select actions based on noisy sensory information and incomplete knowledge of the world. It has been suggested that the brain engages in Bayesian inference during perception but how such probabilistic representations are used to select actions has remained unclear. Here we propose a neural model of action selection and decision making based on the theory of partially observable Markov decision processes (POMDPs). Actions are selected based not on a single "optimal" estimate of state but on the posterior distribution over states (the "belief" state). We show how such a model provides a unified framework for explaining experimental results in decision making that involve both information gathering and overt actions. The model utilizes temporal difference (TD) learning for maximizing expected reward. The resulting neural architecture posits an active role for the neocortex in belief computation while ascribing a role to the basal ganglia in belief representation, value computation, and action selection. When applied to the random dots motion discrimination task, model neurons representing belief exhibit responses similar to those of LIP neurons in primate neocortex. The appropriate threshold for switching from information gathering to overt actions emerges naturally during reward maximization. Additionally, the time course of reward prediction error in the model shares similarities with dopaminergic responses in the basal ganglia during the random dots task. For tasks with a deadline, the model learns a decision making strategy that changes with elapsed time, predicting a collapsing decision threshold consistent with some experimental studies. The model provides a new framework for understanding neural decision making and suggests an important role for interactions between the neocortex and the basal ganglia in learning the mapping between probabilistic sensory representations and actions that maximize rewards.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 450 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 16 4%
Germany 9 2%
United Kingdom 9 2%
Netherlands 4 <1%
Switzerland 3 <1%
France 3 <1%
Iran, Islamic Republic of 2 <1%
Japan 2 <1%
Portugal 2 <1%
Other 8 2%
Unknown 392 87%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 138 31%
Researcher 82 18%
Student > Master 54 12%
Student > Doctoral Student 28 6%
Student > Bachelor 25 6%
Other 85 19%
Unknown 38 8%
Readers by discipline Count As %
Neuroscience 89 20%
Psychology 69 15%
Computer Science 68 15%
Agricultural and Biological Sciences 64 14%
Engineering 37 8%
Other 72 16%
Unknown 51 11%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 14 January 2012.
All research outputs
#18,345,702
of 23,573,357 outputs
Outputs from Frontiers in Computational Neuroscience
#982
of 1,380 outputs
Outputs of similar age
#152,568
of 167,085 outputs
Outputs of similar age from Frontiers in Computational Neuroscience
#11
of 14 outputs
Altmetric has tracked 23,573,357 research outputs across all sources so far. This one is in the 19th percentile – i.e., 19% of other outputs scored the same or lower than it.
So far Altmetric has tracked 1,380 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 6.3. This one is in the 22nd percentile – i.e., 22% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 167,085 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 8th percentile – i.e., 8% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 14 others from the same source and published within six weeks on either side of this one. This one is in the 21st percentile – i.e., 21% of its contemporaries scored the same or lower than it.