Karl Cobbe

Solving Math Word Problems

Karl Cobbe October 29, 2021 October 29, 2021

We’ve trained a system that solves grade school math problems with nearly twice the accuracy of a fine-tuned GPT-3 model. It solves about 90% as many problems as real kids: a small sample of 9-12 year olds scored 60% on a test from our dataset, while our

Uncategorized

Procgen Benchmark

Karl Cobbe December 3, 2019 December 3, 2019

We’re releasing Procgen Benchmark, 16 simple-to-use procedurally-generated environments which provide a direct measure of how quickly a reinforcement learning agent learns generalizable skills.

Share this:

Share this: