We doomscroll, you upskill.
Finding signal on X is harder than ever. We curate high-value insights on AI, Startups, and Product so you can focus on what matters.
Page 1 • Showing 5 tweets
gemini 3 works with images video audio the way people do 〜 all at once, connected helped me make this music video, fully generated with soundtrack, in a single html file wait till the end - in 4K a true multi-modal model just *`'~gets it~'`* better @GoogleDeepMind building serious momentum
it's always been about the data. AI currently: 1. get tons of data → baseline intelligence 2. get high quality data → shape it into the perfect tool 3. find better ways to measure intelligence so you know what good data is 4. improve & repeat great data envs are invaluable.
new research on 445 ai benchmarks • 48% disagree on what they measure • 39% use convenient, not correct, data • 16% test statistical significance we still don't know how to measure our most powerful tools IMO treat evals like sports, not the SAT competition > tests clear rules -> human-understandable results
Today, we're launching Good Start Labs w/ $3.6M from amazing investors including @Inovia & @generalcatalyst My whole life I've been learning from games Over the past five years, I've dreamt about how AI learn with me. Today we're launching LOL Arena, the first AI benchmark for humor, informed by millions of human votes. We are also launching Diplomacy Arena ranking strategy, betrayal, and prompt impact across models. In the coming years we hope to lead at the intersection of Gen AI & Games and define what it means to do alignment via entertainment. Ensuring everyone can share their voice and help AI become a tool that really is custom built to help bring our dreams to life. If that inspires you, join us! We're hiring. Here's what we're shipping today: