Running 1 CorrSteer: Correlation-Based Steering of Language Models via Sparse Autoencoders - Steer language model output by clicking visual layers
Running Featured 49 Porting nanochat to Transformers: an AI modeling history lesson - Learn about ML and Transformers through nanochat
Running 11 FAT5 (Flash Attention T5) report - English version of the blog post introducing FAT5 model
Running 72 Unfolding Robotics: Open-Source Shirt Folding from Data to Deployment - Explore the open-source guide to robot shirt folding
Running on CPU Upgrade 222 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens - Explore synthetic data experiments on a virtual bookshelf
Sleeping 5 Robotics research should think (and do) more about sustainability! - Explore robotics papers by sustainability goals
Running Featured 24 Chasing the Counting Manifold in Open LLMs - Counting manifolds in open LLMs from behavior to SAEs.
Running Featured 72 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems - Who needs 1T parameters? Olympiad proofs with a 4B model
Running Featured 88 Parakeet STT Progressive Transcription - Transcribe speech to text instantly with WebGPU acceleration