Jay van Zyl @ ecosystem.Ai

Jay van Zyl @ ecosystem.Ai

DeepMind Makes AI Walk & Run (Serious & Funny)

The agility and flexibility of a monkey swinging through the trees or a
football player dodging opponents and scoring a goal can be breathtaking.
Mastering this kind of sophisticated motor control is a hallmark of
physical intelligence, and is a crucial part of AI research.

Better Exploration with Parameter Noise

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it’s worth trying on any problem.

Action Space Noise Parameter Space Noise

Parameter

Better Exploration with Parameter Noise

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it’s worth trying on any problem.

Action Space Noise Parameter Space Noise

*Parameter noise helps algorithms more

Going beyond average for reinforcement learning

Consider the commuter who toils backwards and forwards each day on a train. Most mornings, her train runs on time and she reaches her first meeting relaxed and ready. But she knows that once in awhile the unexpected happens: a mechanical problem, a sig…

Better Banking with help of Analytics and Machine learning

In 2015, I was working at Diebold where we build ATM machine hardware and software and complete ecosystem around the ATM. When we talk about ATM machine, it is a collection of very complex small hardware which collectively performs tasks. And typically, when we think ATM is only used for cash withdraw and that is not true. When we talk about ATM it is a Bank branch itself. You can deposit cash, withdraw cash, deposit cheques. And Whatever we can do in the branch we can do with…

Proximal Policy Optimization

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinforcement learning algorithm at OpenAI because of its ease of use and good performance.

Proximal Policy Optimization

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinforcement learning algorithm at OpenAI because of its ease of use and good performance.

PPO

Artificial intelligence suggests recipes based on food photos

Given a still image of a dish filled with food, CSAIL team’s deep-learning algorithm recommends ingredients and recipes.

Agents that imagine and plan

Imagining the consequences of your actions before you take them is a powerful tool of human cognition. When placing a glass on the edge of a table, for example, we will likely pause to consider how stable it is and whether it might fall. On the basis o…