Ryan Lowe

Aligning Language Models to Follow Instructions

Ryan Lowe January 27, 2022 January 27, 2022

We’ve trained language models that are much better at following user intentions than GPT-3 while also making them more truthful and less toxic, using techniques developed through our alignment research. These InstructGPT models, which are trained with humans in the loop, are now deployed as the default language

Share this: