Exploring Inverse Scaling

I recently submitted an entry for the Inverse Scaling Prize, and while it wasn’t selected, I think it still reveals some interesting properties of model scaling that are worth exploring (and are similar to those analyzed in one winning submission from Cavendish Labs).  The goal of the competition is to identify machine learning tasks where …

Examining AGI Timelines

During a recent email exchange, a reader of the blog brought up the question of technology/AI timelines and what progress in that domain might look like. His question struck a chord, reminding me of the uneasy feeling I often get when reading the optimistic predictions of others (including futurist Ray Kurzweil and philosopher …

Experimenting with GPT-3

About a year and a half ago, OpenAI rolled out GPT-3, a massive text-prediction transformer model that shattered many assumptions about the difficulty of understanding and creating written language. GPT-3’s predecessors (GPT and GPT-2) had shown that generating sensible responses to a variety of input texts was possible with enough data, but GPT-3 took that …
