Spent the weekend reading the paper and sorting through the intuitions. Here's a visual guide and the main intuitions to understand the model and the process that created it.
newsletter.languagemodels.co/p/the-illust...
New blog post!
NeurIPS 2025 papers are out—and it’s a lot to take in. This visualization lets you explore the entire research landscape interactively, with clusters and
@cohere.com LLM-generated explanations that make it easier to grasp.
New blog post!
NeurIPS 2025 papers are out—and it’s a lot to take in. This visualization lets you explore the entire research landscape interactively, with clusters and
@cohere.com LLM-generated explanations that make it easier to grasp.
Our new book will contain chapters on the fundamentals of agents (memory, tools, and planning), alongside more advanced concepts like RL and reasoning LLMs.
Our new book will contain chapters on the fundamentals of agents (memory, tools, and planning), alongside more advanced concepts like RL and reasoning LLMs.
New book announcement!
Thrilled that together with @maartengr.bsky.social , we're writing a new book titled “An Illustrated Guide to AI Agents” and published by @oreilly.bsky.social.
New book announcement!
Thrilled that together with @maartengr.bsky.social , we're writing a new book titled “An Illustrated Guide to AI Agents” and published by @oreilly.bsky.social.
Trained on 15T tokens in 1,000+ languages, it’s built for transparency, responsibility & the public good.
Read more: actu.epfl.ch/news/apertus...
New post! A visual tour of the architecture, message formatting, and reasoning of the latest GPT.
newsletter.languagemodels.co/p/the-illust...
New post! A visual tour of the architecture, message formatting, and reasoning of the latest GPT.
newsletter.languagemodels.co/p/the-illust...
- Current AI focus is RL (with Richard Sutton) solving Atari games
- Thinking in line with the Alberta Plan.
- It was a misstep to start working too low-level (e.g., at the cuda level). I kept stepping up the stack chain until now in pytorch
- Current AI focus is RL (with Richard Sutton) solving Atari games
- Thinking in line with the Alberta Plan.
- It was a misstep to start working too low-level (e.g., at the cuda level). I kept stepping up the stack chain until now in pytorch
#pydata #datascience
We have 3 top flight keynotes lined up for you this year from @jayalammar.bsky.social, Leanne Kim Fitzpatrick and Tony Mears.
Just 17 days left. Book your tickets now!
pydata.org/london2025
#pydata #datascience
We have 3 top flight keynotes lined up for you this year from @jayalammar.bsky.social, Leanne Kim Fitzpatrick and Tony Mears.
Just 17 days left. Book your tickets now!
pydata.org/london2025
We have 3 top flight keynotes lined up for you this year from @jayalammar.bsky.social, Leanne Kim Fitzpatrick and Tony Mears.
Just 17 days left. Book your tickets now!
pydata.org/london2025
It's a huge collaboration between 56 universities, labs, and organizations, resulting in a massive benchmark of 1000+ languages, 500+ tasks, and a dozen+ domains.
Details in 🧵
It's a huge collaboration between 56 universities, labs, and organizations, resulting in a massive benchmark of 1000+ languages, 500+ tasks, and a dozen+ domains.
Details in 🧵
There's now even a free course available with
@deeplearningai.bsky.social!
There's now even a free course available with
@deeplearningai.bsky.social!
Science Advances (@science.org) suggests they do—following linguistic laws seen in human speech. 🧵 www.science.org/doi/10.1126/...
Science Advances (@science.org) suggests they do—following linguistic laws seen in human speech. 🧵 www.science.org/doi/10.1126/...
www.science.org/doi/10.1126/...
www.science.org/doi/10.1126/...
Spent the weekend reading the paper and sorting through the intuitions. Here's a visual guide and the main intuitions to understand the model and the process that created it.
newsletter.languagemodels.co/p/the-illust...
Spent the weekend reading the paper and sorting through the intuitions. Here's a visual guide and the main intuitions to understand the model and the process that created it.
newsletter.languagemodels.co/p/the-illust...
www.youtube.com/watch?v=-Kwl...
www.youtube.com/watch?v=-Kwl...
Details in 🧵
Details in 🧵
And excited that professors are starting to use the book to teach LLM courses. Reach out to us if we can be of assistance!
And if you've liked the book, leave us a review on Amazon or Goodreads!
And excited that professors are starting to use the book to teach LLM courses. Reach out to us if we can be of assistance!
And if you've liked the book, leave us a review on Amazon or Goodreads!
www.youtube.com/watch?v=bivZ...
www.youtube.com/watch?v=bivZ...
A step change as influential as the release of GPT-4. Reasoning language models are the current and next big thing.
I explain:
* The ARC prize
* o3 model size / cost
* Dispelling training myths
* Extreme benchmark progress
A step change as influential as the release of GPT-4. Reasoning language models are the current and next big thing.
I explain:
* The ARC prize
* o3 model size / cost
* Dispelling training myths
* Extreme benchmark progress
Come early as quantities are limited!
Come early as quantities are limited!
Tomorrow I'll be signing copies of my book at 3PM! Limited copies available!
Tomorrow I'll be signing copies of my book at 3PM! Limited copies available!
Explore ~4,500 NeurIPS papers in this interactive visualization:
jalammar.github.io/assets/neuri...
(Click on a point to see the paper on the website)
Uses @cohere.com models and @lelandmcinnes.bsky.social's datamapplot/umap to help make sense of the overwhelming scale of NeurIPS.
Explore ~4,500 NeurIPS papers in this interactive visualization:
jalammar.github.io/assets/neuri...
(Click on a point to see the paper on the website)
Uses @cohere.com models and @lelandmcinnes.bsky.social's datamapplot/umap to help make sense of the overwhelming scale of NeurIPS.
youtu.be/34_Tub6vXDk