DeepMind Blog

Open-sourcing MuJoCo

DeepMind Blog, May 23, 2022

In October 2021, we announced that we acquired the MuJoCo physics simulator, and made it freely available for everyone to support research everywhere. We also committed to developing and maintaining MuJoCo as a free, open-source, community-driven proje…

From LEGO competitions to DeepMind’s robotics lab

DeepMind Blog, May 19, 2022

If you want to be at DeepMind, go for it. Apply, interview, and just try. You might not get it the first time but that doesn’t mean you can’t try again. I never thought DeepMind would accept me, and when they did, I thought it was a mistake. Everyone d…

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

DeepMind Blog, May 16, 2022

In our recent paper, we explore how populations of deep reinforcement learning (deep RL) agents can learn microeconomic behaviours, such as production, consumption, and trading of goods. We find that artificial agents learn to make economically rationa…

A Generalist Agent

DeepMind Blog, May 12, 2022

Inspired by progress in large-scale language modelling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment …

Active offline policy selection

DeepMind Blog, May 6, 2022

To make RL more applicable to real-world applications like robotics, we propose using an intelligent evaluation procedure to select the policy for deployment, called active offline policy selection (A-OPS). In A-OPS, we make use of the prerecorded data…
