Finding signal on Twitter is more difficult than it used to be. We curate the best tweets on topics like AI, startups, and product development every weekday so you can focus on what matters.
@arcprize
A North Star for open AGI. Co-founders: @fchollet @mikeknoop. President: @gregkamradt. We're hiring mission-driven builders: arcprize.org/jobs
GPT-5.4 and GPT-5.4 Pro from @OpenAI on ARC-AGI Semi Private ARC-AGI-2: - GPT-5.4: 74.0%, $1.52/task - GPT-5.4 Pro: 83.3%, $16.41/task
ARC-AGI-3 Launch Party March...
International models on ARC-AGI-2 Semi Private - Kimi K2.5 (@Kimi_Moonshot): 12%, $0.28 - Minimax M2.5 (@MiniMax_AI): 5%, $0.17 - GLM-5...
Gemini 3.1 Pro on ARC-AGI Semi-Private Eval @GoogleDeepMind - ARC-AGI-1: 98%, $0.52/task - ARC-AGI-2: 77%, $0.96/task Gemini to push the Pareto Frontier of performance and efficiency

Claude Opus 4.6 (120K Thinking) on ARC-AGI Semi-Private Eval Max Effort: - ARC-AGI-1: 93.0%, $1.88/task - ARC-AGI-2: 68.8% $3.64/task New ARC-AGI SOTA model from @AnthropicAI
Claude Sonnet 4.6 (120K Thinking) on ARC-AGI Semi-Private Eval @AnthropicAI Max Effort: - ARC-AGI-1: 86%, $1.45/task - ARC-AGI-2: 58% $2.72/task

ARC Prize submitted a response to @NSF 's new Tech Labs program which funds teams breaking down barriers for emerging tech. We believe NSF will be a neutral anchor for this work. Tech Labs will accelerate the path from AI research to real-world deployment.
Gemini 3 Deep Think (2/26) Semi Private Eval - ARC-AGI-1: 96.0%, $7.17/task - ARC-AGI-2: 84.6% $13.62/task New ARC-AGI SOTA model from @GoogleDeepMind

A year ago, we verified a preview of an unreleased version of @OpenAI o3 (High) that scored 88% on ARC-AGI-1 at est. $4.5k/task Today, we’ve verified a new GPT-5.2 Pro (X-High) SOTA score of 90.5% at $11.64/task This represents a ~390X efficiency improvement in one year