↓ Skip to main content

Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals

Overview of attention for article published in Frontiers in Neurorobotics, April 2017
Altmetric Badge

About this Attention Score

  • Average Attention Score compared to outputs of the same age
  • Average Attention Score compared to outputs of the same age and source

Mentioned by

twitter
5 X users

Citations

dimensions_citation
14 Dimensions

Readers on

mendeley
29 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Improving Robot Motor Learning with Negatively Valenced Reinforcement Signals
Published in
Frontiers in Neurorobotics, April 2017
DOI 10.3389/fnbot.2017.00010
Pubmed ID
Authors

Nicolás Navarro-Guerrero, Robert J. Lowe, Stefan Wermter

Abstract

Both nociception and punishment signals have been used in robotics. However, the potential for using these negatively valenced types of reinforcement learning signals for robot learning has not been exploited in detail yet. Nociceptive signals are primarily used as triggers of preprogrammed action sequences. Punishment signals are typically disembodied, i.e., with no or little relation to the agent-intrinsic limitations, and they are often used to impose behavioral constraints. Here, we provide an alternative approach for nociceptive signals as drivers of learning rather than simple triggers of preprogrammed behavior. Explicitly, we use nociception to expand the state space while we use punishment as a negative reinforcement learning signal. We compare the performance-in terms of task error, the amount of perceived nociception, and length of learned action sequences-of different neural networks imbued with punishment-based reinforcement signals for inverse kinematic learning. We contrast the performance of a version of the neural network that receives nociceptive inputs to that without such a process. Furthermore, we provide evidence that nociception can improve learning-making the algorithm more robust against network initializations-as well as behavioral performance by reducing the task error, perceived nociception, and length of learned action sequences. Moreover, we provide evidence that punishment, at least as typically used within reinforcement learning applications, may be detrimental in all relevant metrics.

X Demographics

X Demographics

The data shown below were collected from the profiles of 5 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 29 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
India 1 3%
Germany 1 3%
Unknown 27 93%

Demographic breakdown

Readers by professional status Count As %
Researcher 6 21%
Student > Ph. D. Student 5 17%
Student > Doctoral Student 3 10%
Student > Bachelor 3 10%
Student > Master 3 10%
Other 4 14%
Unknown 5 17%
Readers by discipline Count As %
Computer Science 9 31%
Engineering 8 28%
Psychology 2 7%
Neuroscience 2 7%
Sports and Recreations 1 3%
Other 2 7%
Unknown 5 17%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 2. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 27 July 2017.
All research outputs
#13,546,553
of 22,963,381 outputs
Outputs from Frontiers in Neurorobotics
#269
of 872 outputs
Outputs of similar age
#158,944
of 308,920 outputs
Outputs of similar age from Frontiers in Neurorobotics
#8
of 16 outputs
Altmetric has tracked 22,963,381 research outputs across all sources so far. This one is in the 39th percentile – i.e., 39% of other outputs scored the same or lower than it.
So far Altmetric has tracked 872 research outputs from this source. They receive a mean Attention Score of 4.1. This one has gotten more attention than average, scoring higher than 66% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 308,920 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 46th percentile – i.e., 46% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 16 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 50% of its contemporaries.