Finding signal on Twitter is more difficult than it used to be. We curate the best tweets on topics like AI, startups, and product development every weekday so you can focus on what matters.

Claude Opus 4.6 Performance on ARC-AGI Benchmarks

Claude Opus 4.6 (120K Thinking) on ARC-AGI Semi-Private Eval Max Effort: - ARC-AGI-1: 93.0%, $1.88/task - ARC-AGI-2: 68.8% $3.64/task New ARC-AGI SOTA model from @AnthropicAI

Content
164
4
25
15

Topics

Read the stories that matter.

Save hours a day in 5 minutes