Tech Twitter

We doomscroll, you upskill.

Finding signal on X is harder than ever. We curate high-value insights on AI, Startups, and Product so you can focus on what matters.

Explainable AI: Unlock How Small Models Actually Work

We’ve developed a new way to train small AI models with internal mechanisms that are easier for humans to understand. Language models like the ones behind ChatGPT have complex, sometimes surprising structures, and we don’t yet fully understand how they work. This approach helps us begin to close that gap.

Understanding neural networks through sparse circuits

Understanding neural networks through sparse circuits

1.0K
77
151
511

Topics

interpretable aimodel interpretabilityneural network transparencylanguage model explainabilityai alignmentmechanistic interpretabilitymodel transparency