Research – Page 28 – Jay van Zyl @ ecosystem.Ai

Our approach to alignment research

Jan Leike August 24, 2022 August 24, 2022

Our approach to aligning AGI is empirical and iterative. We are improving our AI systems’ ability to learn from human feedback and to assist humans at evaluating AI. Our goal is to build a sufficiently aligned AI system that can help us solve all other alignment problems.

Our

Research

DALL·E 2 Pre-Training Mitigations

Alex Nichol June 28, 2022 June 28, 2022

In order to share the magic of DALL·E 2 with a broad audience, we needed to reduce the risks associated with powerful image generation models. To this end, we put various guardrails in place to prevent generated images from violating our content policy. This post focuses on pre-training

Research

Learning to Play Minecraft with Video PreTraining (VPT)

Bowen Baker June 23, 2022 June 23, 2022

We trained a neural network to play Minecraft by Video PreTraining (VPT) on a massive unlabeled video dataset of human Minecraft play, while using only a small amount of labeled contractor data. With fine-tuning, our model can learn to craft diamond tools, a task that usually takes proficient humans over

Research

AI-Written Critiques Help Humans Notice Flaws

Jan Leike June 13, 2022 June 13, 2022

Showing model-generated critical comments to humans helps them find flaws in summaries.

Research

Techniques for Training Large Neural Networks

Lilian Weng June 9, 2022 June 9, 2022

Large neural networks are at the core of many recent advances in AI, but training them is a difficult engineering and research challenge which requires orchestrating a cluster of GPUs to perform a single synchronized calculation. As cluster and model sizes have grown, machine learning practitioners have developed an increasing

Research

Solving (Some) Formal Math Olympiad Problems

Stanislas Polu February 2, 2022 February 2, 2022

We built a neural theorem prover for Lean that learned to solve a variety of challenging high-school olympiad problems, including problems from the AMC12 and AIME competitions, as well as two problems adapted from the IMO.^[1] The prover uses a language model to find proofs of formal statements. Each

Research

Aligning Language Models to Follow Instructions

Ryan Lowe January 27, 2022 January 27, 2022

We’ve trained language models that are much better at following user intentions than GPT-3 while also making them more truthful and less toxic, using techniques developed through our alignment research. These InstructGPT models, which are trained with humans in the loop, are now deployed as the default language

Research

WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing

Jacob Hilton December 16, 2021 December 16, 2021

We’ve fine-tuned GPT-3 to more accurately answer open-ended questions using a text-based web browser. Our prototype copies how humans research answers to questions online—it submits search queries, follows links, and scrolls up and down web pages. It is trained to cite its sources, which makes it

Research

Solving Math Word Problems

Karl Cobbe October 29, 2021 October 29, 2021

We’ve trained a system that solves grade school math problems with nearly twice the accuracy of a fine-tuned GPT-3 model. It solves about 90% as many problems as real kids: a small sample of 9-12 year olds scored 60% on a test from our dataset, while our

Research

Summarizing Books with Human Feedback

OpenAI September 23, 2021 September 23, 2021

Scaling human oversight of AI systems for tasks that are difficult to evaluate.

Introduction

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this: