Tom Brown – Jay van Zyl @ ecosystem.Ai

Gathering Human Feedback

OpenAI August 3, 2017 August 3, 2017

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.

View Code

This simulated

Dario Amodei Paul Christiano Tom Brown

Gathering Human Feedback

OpenAI August 3, 2017 August 3, 2017

View Code

This simulated

Share this:

Share this: