We doomscroll, you upskill.
Finding signal on X is harder than ever. We curate high-value insights on AI, Startups, and Product so you can focus on what matters.
93 tweets
The @ilyasut episode 0:00:00 – Explaining model jaggedness 0:09:39 - Emotions and value functions 0:18:49 – What are we scaling? 0:25:13 – Why humans generalize better than models 0:35:45 – Straight-shotting superintelligence 0:46:47 – SSI’s model will learn from deployment 0:55:07 – Alignment 1:18:13 – “We are squarely an age of research company” 1:29:23 – Self-play and multi-agent 1:32:42 – Research taste Look up Dwarkesh Podcast on YouTube, Apple Podcasts, or Spotify. Enjoy!
A number of people are talking about implications of AI to schools. I spoke about some of my thoughts to a school board earlier, some highlights: 1. You will never be able to detect the use of AI in homework. Full stop. All "detectors" of AI imo don't really work, can be defeated in various ways, and are in principle doomed to fail. You have to assume that any work done outside classroom has used AI. 2. Therefore, the majority of grading has to shift to in-class work (instead of at-home assignments), in settings where teachers can physically monitor students. The students remain motivated to learn how to solve problems without AI because they know they will be evaluated without it in class later. 3. We want students to be able to use AI, it is here to stay and it is extremely powerful, but we also don't want students to be naked in the world without it. Using the calculator as an example of a historically disruptive technology, school teaches you how to do all the basic math & arithmetic so that you can in principle do it by hand, even if calculators are pervasive and greatly speed up work in practical settings. In addition, you understand what it's doing for you, so should it give you a wrong answer (e.g. you mistyped "prompt"), you should be able to notice it, gut check it, verify it in some other way, etc. The verification ability is especially important in the case of AI, which is presently a lot more fallible in a great variety of ways compared to calculators. 4. A lot of the evaluation settings remain at teacher's discretion and involve a creative design space of no tools, cheatsheets, open book, provided AI responses, direct internet/AI access, etc. TLDR the goal is that the students are proficient in the use of AI, but can also exist without it, and imo the only way to get there is to flip classes around and move the majority of testing to in class settings.
JUST IN: Elon Musk says AI and humanoid robots will "eliminate poverty" and "make everyone wealthy."
Gemini Nano Banana Pro can solve exam questions *in* the exam page image. With doodles, diagrams, all that. ChatGPT thinks these solutions are all correct except Se_2P_2 should be "diselenium diphosphide" and a spelling mistake (should be "thiocyanic acid" not "thoicyanic") :O
>be Andrej Karpathy >studied computer science from Toronto to Stanford, specializing in deep learning >became Tesla’s director of AI in his early 30s, leading the Autopilot vision team >helped build the foundations of OpenAI as one of its earliest researchers >teaches millions through free lectures, notebooks, and open-source work >keeps his life simple, quiet, and focused on learning >steps away from big titles when he feels the need to reset >builds small AI projects for fun, shares them openly >lives calmly, thinking deeply, working on what he believes matters most Has Karpathy quietly optimized life in a way most people never figure out?
This weekend, a Medium article called "I Reverse-Engineered 200 AI Startups. 146 Are Selling You Repackaged ChatGPT and Claude with New UI" made it to the front page of Hacker News. The author found that out of 200 AI startups, "73% are running third-party APIs with extra steps." The author also found that at least one company was putting a 1000x markup on model API costs. We discuss at length:
Holy shit. I’ve used ChatGPT every day for 3 years. Just spent 2 hours on Gemini 3. I’m not going back. The leap is insane — reasoning, speed, images, video… everything is sharper and faster. It feels like the world just changed, again.
How Google Finally Leapfrogged Rivals With New Gemini Rollout
I’m starting to get into a habit of reading everything (blogs, articles, book chapters,…) with LLMs. Usually pass 1 is manual, then pass 2 “explain/summarize”, pass 3 Q&A. I usually end up with a better/deeper understanding than if I moved on. Growing to among top use cases. On the flip side, if you’re a writer trying to explain/communicate something, we may increasingly see less of a mindset of “I’m writing this for another human” and more “I’m writing this for an LLM”. Because once an LLM “gets it”, it can then target, personalize and serve the idea to its user.
guys what gpt-5 model should i use in cursor gpt-5.1 codex gpt-5.1 codex mini gpt 5.1 codex high gpt 5.1 codex fast gpt 5.1 codex high fast gpt 5.1 codex low gpt 5.1 codex low fast gpt 5.1 fast gpt 5.1 high gpt 5.1 high fast gpt 5.1 low gpt 5.1 low fast gpt 5 codex high gpt 5 codex fast gpt 5 codex high gpt 5 codex high fast gpt-5.1 gpt 5 codex gpt-5 gpt 5 fast gpt 5 medium gpt 5 medium fast gpt 5 high gpt 5 high fast gpt 5 low gpt 5 low fast gpt 5.1 codex mini high gpt 5.1 codex mini low gpt-5-mini gpt-5-nano gpt-5-pro thanks in advance
AI Agents will fade away, like microservices did. Painful to scale, and difficult to deploy. Eventually, you will see them hidden behind a wall of well-engineered solutions. Hype doesn't survive complexity.
Upload a meeting transcript to NotebookLM and get it to turn it into a slide deck using Nano Banana Pro. Absolutely insane.
JUST IN: Report shows OpenAI needs to raise at least $207 billion by 2030 to stay in business, but they will still be losing money if they do so.
Absolutely insane stat. Opus 4.5 outperformed EVERY SINGLE HUMAN CANDIDATE EVER in Anthropic's notoriously difficult take-home exam for prospective performance engineering candidates.
Engineers have Cursor. Writers have ChatGPT. FINALLY there’s an AI platform for marketers. Introducing Mopac Software.