Maruan Al-Shedivat – Jay van Zyl @ ecosystem.Ai

Meta-Learning for Wrestling

OpenAI October 11, 2017 October 11, 2017

We show that for the task of simulated robot wrestling, a meta-learning agent can learn to quickly defeat a stronger non-meta-learning agent.

Igor Mordatch Ilya Sutskever Maruan Al-Shedivat Pieter Abbeel Trapit Bansal Yura Burda

Meta-Learning for Wrestling

OpenAI October 11, 2017 October 11, 2017

We show that for the task of simulated robot wrestling, a meta-learning agent can learn to quickly defeat a stronger non-meta-learning agent.

Igor Mordatch Jakob Foerster Maruan Al-Shedivat Pieter Abbeel Richard Chen Shimon Whiteson

Learning to Model Other Minds

OpenAI September 14, 2017 September 14, 2017

We’re releasing an algorithm which accounts for the fact that other agents are learning too, and discovers self-interested yet collaborative strategies like tit-for-tat in the iterated prisoner’s dilemma. This algorithm, Learning with Opponent-Learning Awareness (LOLA), is a small step towards agents that model other minds.

Read Paper

LOLA, a collaboration

Igor Mordatch Jakob Foerster Maruan Al-Shedivat Pieter Abbeel Richard Chen Shimon Whiteson

Learning to Model Other Minds

OpenAI September 14, 2017 September 14, 2017

We’re releasing an algorithm which accounts for the fact that other agents are learning too, and discovers self-interested yet collaborative strategies like tit-for-tat in the iterated prisoner’s dilemma. This algorithm, Learning with Opponent-Learning Awareness (LOLA), is a small step towards agents that model other minds.

Read Paper

LOLA, a collaboration

Share this:

Share this:

Share this:

Share this: