Finding signal on Twitter is more difficult than it used to be. We curate the best tweets on topics like AI, startups, and product development every weekday so you can focus on what matters.

AI Judgment Limits in Long Agentic Tasks

Building smarter models is increasingly important as larger models have better “judgment” As agentic task length increases the number of required judgement calls that the AI needs to make based on user intent scales faster Judgement may be a bigger limiter than hallucinations

Topics

Read the stories that matter.

Save hours a day in 5 minutes