Dario Amodei

Learning Complex Goals with Iterated Amplification

We’re proposing an AI safety technique called iterated amplification that lets us specify complicated behaviors and goals that are beyond human scale, by demonstrating how to decompose a task into simpler sub-tasks, rather than by providing labeled data or a reward function.
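A minimal sketch of the decomposition idea on a toy task (summing a list); the function names and the task itself are illustrative assumptions, not the paper's actual setup or any released code:

```python
# Toy sketch of decomposition-based amplification (illustrative only).

def weak_agent(task):
    """A hypothetical limited agent: it can only handle very small sub-tasks."""
    assert len(task) <= 2, "the weak agent only solves tiny sub-tasks"
    return sum(task)

def amplify(agent, task):
    """Solve a task beyond the agent's scale by decomposing it into
    sub-tasks the agent can handle, then combining the sub-answers."""
    if len(task) <= 2:
        return agent(task)
    mid = len(task) // 2
    left = amplify(agent, task[:mid])    # recursively decompose
    right = amplify(agent, task[mid:])
    return left + right                  # combine sub-answers

# The human never labels the full task or writes a reward function;
# they only specify how to break it into simpler pieces.
print(amplify(weak_agent, list(range(100))))  # -> 4950
```

In the full method the combined behavior would also be distilled back into the learned agent, which can then be amplified again; the sketch above shows only a single decomposition step.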

Preparing for Malicious Uses of AI

We’ve co-authored a paper that forecasts how malicious actors could misuse AI technology and suggests ways to prevent and mitigate these threats. This paper is the outcome of almost a year of sustained work with our colleagues at the Future of Humanity Institute, the Centre for the Study of Existential Risk, and others.

Gathering Human Feedback

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.
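A minimal sketch of the underlying idea, fitting a reward model to pairwise human comparisons of trajectory clips; the linear model, synthetic "human" labels, and all names below are illustrative assumptions, not RL-Teacher's actual API:

```python
# Toy sketch: learn a reward model from pairwise preferences (illustrative only).
import numpy as np

rng = np.random.default_rng(0)

def clip_features(clip):
    """Summarize a trajectory clip (array of state vectors) as a feature vector."""
    return np.mean(clip, axis=0)

def human_prefers_first(clip_a, clip_b):
    """Synthetic stand-in for a human rater: prefers clips with a higher first feature."""
    return clip_features(clip_a)[0] > clip_features(clip_b)[0]

# Reward model: r(clip) = w . features(clip), trained with a logistic
# (Bradley-Terry) loss on the rater's pairwise comparisons.
w = np.zeros(4)
lr = 0.5
for _ in range(2000):
    clip_a = rng.normal(size=(10, 4))   # two random 10-step clips
    clip_b = rng.normal(size=(10, 4))
    fa, fb = clip_features(clip_a), clip_features(clip_b)
    label = 1.0 if human_prefers_first(clip_a, clip_b) else 0.0
    p_a = 1.0 / (1.0 + np.exp(-(w @ fa - w @ fb)))   # P(clip_a preferred)
    w += lr * (label - p_a) * (fa - fb)              # gradient step on the loss

print(w)  # the first weight dominates, matching the rater's preferences
```

The learned reward model then stands in for a hand-crafted reward function when training the policy with ordinary reinforcement learning, so only occasional comparisons are needed from the human.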
