Jonathan Ho – Jay van Zyl @ ecosystem.Ai

Evolved Policy Gradients

OpenAI April 18, 2018 April 18, 2018

We’re releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, which can enable fast training on novel tasks. Agents trained with EPG can succeed at basic tasks at test time that were outside their training regime, like learning to navigate

John Schulman Jonathan Ho Kevin Frans Peter Chen Pieter Abbeel

Learning a Hierarchy

OpenAI October 26, 2017 October 26, 2017

We’ve developed a hierarchical reinforcement learning algorithm that learns high-level actions useful for solving a range of tasks, allowing fast solving of tasks requiring thousands of timesteps. Our algorithm, when applied to a set of navigation problems, discovers a set of high-level actions for walking and crawling in different directions,

John Schulman Jonathan Ho Kevin Frans Peter Chen Pieter Abbeel

Learning a Hierarchy

OpenAI October 26, 2017 October 26, 2017

We’ve developed a hierarchical reinforcement learning algorithm that learns high-level actions useful for solving a range of tasks, allowing fast solving of tasks requiring thousands of timesteps. Our algorithm, when applied to a set of navigation problems, discovers a set of high-level actions for walking and crawling in different directions,

Alex Ray Jonas Schneider Jonathan Ho Peter Welinder Wojciech Zaremba

Faster Physics in Python

OpenAI June 28, 2017 June 28, 2017

We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics research.

View Code View Docs

This library is one of our core tools for deep learning robotics research, which we’ve now released as a major version of mujoco-py, our Python

Alex Ray Jonas Schneider Jonathan Ho Peter Welinder Wojciech Zaremba

Faster Physics in Python

OpenAI June 28, 2017 June 28, 2017

We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics research.

View Code View Docs

This library is one of our core tools for deep learning robotics research, which we’ve now released as a major version of mujoco-py, our Python

Share this:

Share this:

Share this:

Share this:

Share this: