Rein Houthooft – Jay van Zyl @ ecosystem.Ai

Evolved Policy Gradients

OpenAI April 18, 2018

We’re releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, which can enable fast training on novel tasks. Agents trained with EPG can succeed at basic tasks at test time that were outside their training regime, like learning to navigate

Marcin Andrychowicz Matthias Plappert Pieter Abbeel Prafulla Dhariwal Rein Houthooft Richard Chen Szymon Sidor Tamim Asfour Xi Chen

Better Exploration with Parameter Noise

OpenAI July 27, 2017

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it’s worth trying on any problem.
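As a rough illustration of the idea in the excerpt above, the sketch below contrasts noise added to a policy's actions with noise added to its parameters, and adapts the noise scale over time. Everything here is an assumption made for illustration rather than code from the post: the tiny tanh policy, the function names, and the adaptation rule (grow `sigma` when the perturbed policy's actions stay close to the unperturbed ones, shrink it otherwise) are all hypothetical choices.

```python
import math
import random

random.seed(0)

def policy(weights, obs):
    """Deterministic toy policy: one tanh unit per row of `weights`."""
    return [math.tanh(sum(w * o for w, o in zip(row, obs))) for row in weights]

def act_with_action_noise(weights, obs, sigma):
    # Action-space noise: fresh Gaussian noise is added to the output at every step.
    return [a + random.gauss(0.0, sigma) for a in policy(weights, obs)]

def perturb_parameters(weights, sigma):
    # Parameter-space noise: perturb the weights once (e.g. per episode);
    # the agent then acts deterministically with the perturbed copy.
    return [[w + random.gauss(0.0, sigma) for w in row] for row in weights]

def adapt_sigma(sigma, weights, perturbed, obs_batch, target=0.1, factor=1.01):
    # Assumed adaptation rule (hypothetical): measure the RMS distance between
    # the actions of the perturbed and unperturbed policies on a batch of
    # observations, then grow sigma if the distance is below `target` and
    # shrink it otherwise.
    squared_error, count = 0.0, 0
    for obs in obs_batch:
        for a, b in zip(policy(weights, obs), policy(perturbed, obs)):
            squared_error += (a - b) ** 2
            count += 1
    distance = math.sqrt(squared_error / count)
    return sigma * factor if distance < target else sigma / factor

# Toy setup: 2 actions, 4 observation features, random observations.
weights = [[random.gauss(0.0, 1.0) for _ in range(4)] for _ in range(2)]
obs_batch = [[random.gauss(0.0, 1.0) for _ in range(4)] for _ in range(32)]
sigma = 0.05

for episode in range(200):
    perturbed = perturb_parameters(weights, sigma)
    # A real agent would roll out an episode with policy(perturbed, obs) here.
    sigma = adapt_sigma(sigma, weights, perturbed, obs_batch)
```

The point of the adaptation step in this sketch is that the same parameter perturbation can change a policy's behaviour by very different amounts depending on the network, so the scale is tuned against its effect on actions rather than fixed in advance.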


[Figure panels: Action Space Noise; Parameter Space Noise]

*Parameter noise helps algorithms more
