DeepMind Blog

An empirical analysis of compute-optimal large language model training

We ask the question: “What is the optimal model size and number of training tokens for a given compute budget?” To answer this question, we train models of various sizes and with various numbers of tokens, and estimate this trade-off empirically. Our m…
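As a rough illustration of the trade-off the post estimates (a sketch, not the paper's method): assuming the common approximation that training a model of N parameters on D tokens costs about C ≈ 6ND FLOPs, and using the paper's headline finding that model size and token count should scale in equal proportion (roughly 20 tokens per parameter), one can back out a compute-optimal split from a FLOP budget:

```python
import math

def compute_optimal_split(flop_budget: float, tokens_per_param: float = 20.0):
    """Split a training FLOP budget C into a parameter count N and a
    token count D, assuming C ~= 6 * N * D and a fixed tokens-per-parameter
    ratio (the paper found roughly 20:1 at the optimum)."""
    # C = 6 * N * D with D = r * N  =>  N = sqrt(C / (6 * r))
    n_params = math.sqrt(flop_budget / (6.0 * tokens_per_param))
    n_tokens = tokens_per_param * n_params
    return n_params, n_tokens

# Example: the Chinchilla budget of ~5.76e23 FLOPs comes out to roughly
# a 70B-parameter model trained on ~1.4T tokens.
n, d = compute_optimal_split(5.76e23)
print(f"params ~ {n:.3g}, tokens ~ {d:.3g}")
```

Running this recovers the paper's well-known result: at Gopher's compute budget, the compute-optimal model (Chinchilla) is about 70B parameters trained on about 1.4T tokens, i.e. a much smaller model trained on far more data.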


GopherCite: Teaching language models to support answers with verified quotes

Language models like Gopher can “hallucinate” facts that appear plausible but are actually fake. Those who are familiar with this problem know to do their own fact-checking, rather than trusting what language models say. Those who are not may end up b…
