Finding signal on Twitter is more difficult than it used to be. We curate the best tweets on topics like AI, startups, and product development every weekday so you can focus on what matters.
We have been shipping π³οΈβ€οΈ
π¦ Community Evals & Benchmark Datasets: Benchmark datasets host benchmark leaderboards, you can now contribute eval results by opening a PR to model repositories, all PRs are fed to benchmark datasets
π¦ Chat with datasets: agents live in Data Studio, you can ask questions about datasets
π¦ Select sections in datasets: Data Studio now has a spreadsheet-like UX, allowing quick selections
π¦ MLX compatibility: Find hardware compatible for MLX models and quantized versions in model repositories
π¦ You can now save blog drafts and access them from the editor π
π¦ Datasets now support LanceDB format
π¦ Model repositories show snippets for SGLang
We just shipped Community Evals and Benchmark repositories for decentralized evals π€ > Scores you and model authors report are on leaderboards ππ» > Benchmark datasets host live leaderboards of reported results π > You can open PRs to add scores, they live in model repositories. Community Evals will expose scores currently distributed across model cards, papers, and benchmarks. It wonβt solve the differences in scores, but it is transparent!