Finding signal on Twitter is more difficult than it used to be. We curate the best tweets on topics like AI, startups, and product development every weekday so you can focus on what matters.

GPT 5.2 achieves higher accuracy with more compute time

OpenAI just announced that GPT 5.2 ranks #1 on the METR eval, completing tasks that take humans 6.5 hours about half of the time. The catch: it takes ~10x the working time per task of Gemini 3 Pro and ~25x that of Opus 4.5. The models are getting smarter, but they get there by spending enormous amounts of test-time compute on reasoning.


Read the stories that matter.

Save hours a day in 5 minutes