<span class="vcard">Ashley Pilipiszyn</span>
Ashley Pilipiszyn

Reinforcement Learning with Prediction-Based Rewards

We’ve developed Random Network Distillation (RND), a prediction-based method for encouraging reinforcement learning agents to explore their environments through curiosity, which for the first time exceeds average human performance on Montezuma’s Revenge.

OpenAI 2019 Winter Scholars Application Open

We are now accepting applications for our second cohort of OpenAI Scholars, a program where we provide 6-10 stipends and mentorship to individuals from underrepresented groups to study deep learning full-time for 3 months and open-source a project. The first cohort of Scholars recently released their projects and presented at