Jay van Zyl @ ecosystem.Ai

Jay van Zyl @ ecosystem.Ai

AI and Neuroscience: A virtuous circle

Recent progress in AI has been remarkable.Artificial systems now outperform expert humans at Atari video games, the ancient board game Go, and high-stakes matches of heads-up poker. They can also produce handwriting and speech indistinguishable from th…

Musk vs Zuck vs Trump

Who knows less about AI and wants more than they should. Recently a nice
little “row” emerged over who can beat the other at AI know-how in order to
have or have-not get politicians involved in AI regulation.

DeepMind Makes AI Walk & Run (Serious & Funny)

The agility and flexibility of a monkey swinging through the trees or a
football player dodging opponents and scoring a goal can be breathtaking.
Mastering this kind of sophisticated motor control is a hallmark of
physical intelligence, and is a crucial part of AI research.

Better Exploration with Parameter Noise

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it’s worth trying on any problem.

Action Space Noise Parameter Space Noise

Parameter

Better Exploration with Parameter Noise

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely decreases performance, so it’s worth trying on any problem.

Action Space Noise Parameter Space Noise

*Parameter noise helps algorithms more

Going beyond average for reinforcement learning

Consider the commuter who toils backwards and forwards each day on a train. Most mornings, her train runs on time and she reaches her first meeting relaxed and ready. But she knows that once in awhile the unexpected happens: a mechanical problem, a sig…

Better Banking with help of Analytics and Machine learning

In 2015, I was working at Diebold where we build ATM machine hardware and software and complete ecosystem around the ATM. When we talk about ATM machine, it is a collection of very complex small hardware which collectively performs tasks. And typically, when we think ATM is only used for cash withdraw and that is not true. When we talk about ATM it is a Bank branch itself. You can deposit cash, withdraw cash, deposit cheques. And Whatever we can do in the branch we can do with…

Proximal Policy Optimization

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinforcement learning algorithm at OpenAI because of its ease of use and good performance.