Jay van Zyl @ ecosystem.Ai

DeepMind Makes AI Walk & Run (Serious & Funny)

Steve Digital July 29, 2017 July 29, 2017

The agility and flexibility of a monkey swinging through the trees or a
football player dodging opponents and scoring a goal can be breathtaking.
Mastering this kind of sophisticated motor control is a hallmark of
physical intelligence, and is a crucial part of AI research.

Better Exploration with Parameter Noise

OpenAI July 27, 2017 July 27, 2017

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it’s worth trying on any problem.

View on GitHub View on arXiv Read more

Action Space Noise

Parameter Space Noise

Parameter

Marcin Andrychowicz Matthias Plappert Pieter Abbeel Prafulla Dhariwal Rein Houthooft Richard Chen Szymon Sidor Tamim Asfour Xi Chen

Better Exploration with Parameter Noise

OpenAI July 27, 2017 July 27, 2017

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it’s worth trying on any problem.

View Code Read Paper

Action Space Noise

Parameter Space Noise

*Parameter noise helps algorithms more

Research

Going beyond average for reinforcement learning

Latest Post July 24, 2017 July 24, 2017

Consider the commuter who toils backwards and forwards each day on a train. Most mornings, her train runs on time and she reaches her first meeting relaxed and ready. But she knows that once in awhile the unexpected happens: a mechanical problem, a sig…

Research

Going beyond average for reinforcement learning

Latest Post July 23, 2017 July 23, 2017

90064 Los Angeles, CA United States

Better Banking with help of Analytics and Machine learning

balram prasad July 21, 2017 July 21, 2017

In 2015, I was working at Diebold where we build ATM machine hardware and software and complete ecosystem around the ATM. When we talk about ATM machine, it is a collection of very complex small hardware which collectively performs tasks. And typically, when we think ATM is only used for cash withdraw and that is not true. When we talk about ATM it is a Bank branch itself. You can deposit cash, withdraw cash, deposit cheques. And Whatever we can do in the branch we can do with…

Alec Radford Filip Wolski John Schulman Oleg Klimov Prafulla Dhariwal

Proximal Policy Optimization

OpenAI July 20, 2017 July 20, 2017

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinforcement learning algorithm at OpenAI because of its ease of use and good performance.

View

Alec Radford Filip Wolski John Schulman Oleg Klimov Prafulla Dhariwal

Proximal Policy Optimization

OpenAI July 20, 2017 July 20, 2017

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinforcement learning algorithm at OpenAI because of its ease of use and good performance.

PPO

Uncategorized

Artificial intelligence suggests recipes based on food photos

Adam Conner-Simons | Rachel Gordon | CSAIL July 20, 2017 July 20, 2017

Given a still image of a dish filled with food, CSAIL team’s deep-learning algorithm recommends ingredients and recipes.

Research

Agents that imagine and plan

Latest Post July 20, 2017 July 20, 2017

Imagining the consequences of your actions before you take them is a powerful tool of human cognition. When placing a glass on the edge of a table, for example, we will likely pause to consider how stable it is and whether it might fall. On the basis o…

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this: