Gathering Human Feedback
RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.
This simulated
Gathering Human Feedback
RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.
This simulated
Automatic image retouching on your phone
System can apply a range of styles in real-time, so that the viewfinder displays the enhanced image.
AI and Neuroscience: A virtuous circle
Recent progress in AI has been remarkable.Artificial systems now outperform expert humans at Atari video games, the ancient board game Go, and high-stakes matches of heads-up poker. They can also produce handwriting and speech indistinguishable from th…
Musk vs Zuck vs Trump
Who knows less about AI and wants more than they should. Recently a nice
little “row” emerged over who can beat the other at AI know-how in order to
have or have-not get politicians involved in AI regulation.
DeepMind Makes AI Walk & Run (Serious & Funny)
The agility and flexibility of a monkey swinging through the trees or a
football player dodging opponents and scoring a goal can be breathtaking.
Mastering this kind of sophisticated motor control is a hallmark of
physical intelligence, and is a crucial part of AI research.