Research – Page 29 – Jay van Zyl @ ecosystem.Ai

Solving (Some) Formal Math Olympiad Problems

Stanislas Polu February 2, 2022 February 2, 2022

We built a neural theorem prover for Lean that learned to solve a variety of challenging high-school olympiad problems, including problems from the AMC12 and AIME competitions, as well as two problems adapted from the IMO.^[1] The prover uses a language model to find proofs of formal statements. Each

Research

Aligning Language Models to Follow Instructions

Ryan Lowe January 27, 2022 January 27, 2022

We’ve trained language models that are much better at following user intentions than GPT-3 while also making them more truthful and less toxic, using techniques developed through our alignment research. These InstructGPT models, which are trained with humans in the loop, are now deployed as the default language

Research

WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing

Jacob Hilton December 16, 2021 December 16, 2021

We’ve fine-tuned GPT-3 to more accurately answer open-ended questions using a text-based web browser. Our prototype copies how humans research answers to questions online—it submits search queries, follows links, and scrolls up and down web pages. It is trained to cite its sources, which makes it

Research

Solving Math Word Problems

Karl Cobbe October 29, 2021 October 29, 2021

We’ve trained a system that solves grade school math problems with nearly twice the accuracy of a fine-tuned GPT-3 model. It solves about 90% as many problems as real kids: a small sample of 9-12 year olds scored 60% on a test from our dataset, while our

Research

Summarizing Books with Human Feedback

OpenAI September 23, 2021 September 23, 2021

Scaling human oversight of AI systems for tasks that are difficult to evaluate.

Research

Introducing Triton: Open-Source GPU Programming for Neural Networks

Philippe Tillet July 28, 2021 July 28, 2021

We’re releasing Triton 1.0, an open-source Python-like programming language which enables researchers with no CUDA experience to write highly efficient GPU code—most of the time on par with what an expert would be able to produce. Triton makes it possible to reach peak hardware performance

Research

Improving Language Model Behavior by Training on a Curated Dataset

Irene Solaiman June 10, 2021 June 10, 2021

Our latest research finds we can improve language model behavior with respect to specific behavioral values by fine-tuning on a small, curated dataset.

Milestones Multimodal Research

Multimodal Neurons in Artificial Neural Networks

OpenAI March 4, 2021 March 4, 2021

We’ve discovered neurons in CLIP that respond to the same concept whether presented literally, symbolically, or conceptually.

Research

Scaling Kubernetes to 7,500 Nodes

OpenAI January 25, 2021 January 25, 2021

We’ve scaled Kubernetes clusters to 7,500 nodes, producing a scalable infrastructure for large models like GPT-3, CLIP, and DALL·E, but also for rapid small-scale iterative research such as Scaling Laws for Neural Language Models. Scaling a single Kubernetes cluster to this size is rarely done

Milestones Multimodal Research

DALL·E: Creating Images from Text

OpenAI January 5, 2021 January 5, 2021

We’ve trained a neural network called DALL·E that creates images from text captions for a wide range of concepts expressible in natural language.

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this: