jacobaustin123.bsky.social
@jacobaustin123.bsky.social
Researcher at Google DeepMind. I make LLMs go fast. I also play piano and climb sometimes. Opinions my own.
Reposted
Training our most capable Gemini models relies heavily on our JAX software stack + Google's TPU hardware platforms.

If you want to learn more, see this awesome book "How to Scale Your Model":

jax-ml.github.io/scaling-book/

Put together by several of my Google DeepMind colleagues listed below 🎉.
February 4, 2025 at 7:51 PM
Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems view” of LLMs and wrote a little textbook called “How To Scale Your Model” which we’re releasing today. 1/n
February 4, 2025 at 6:54 PM
Excited to be here! Hopefully the skies are brighter on this side of the fence. Will be posting research stuff here, mostly
November 24, 2024 at 4:54 PM