Sharad Vikram
sharadmv.bsky.social
Sharad Vikram
@sharadmv.bsky.social
JAX, Pallas, Gemini @ Google Deepmind
Reposted by Sharad Vikram
Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n
February 4, 2025 at 6:54 PM