@AnthropicAI’s new research paper shows that not only do AI models not use CoT like we thought, they might not use it at all for reasoning.
In fact, they might be lying to us in their CoT.
What you need to know: 🧵
I'm now 100x more productive than I ever was.
How do I use it? Which tools do I use?
Here are my actual use cases for AI: 👇
It’s much smaller (32B vs 671B params) but delivers comparable results. You can run it on your PC!
Insanely fast, thinking-focused, and agent-capable.
Let’s dive in.
They're 10x faster and 10x cheaper than traditional LLMs.
Here's everything you need to know:
This is their "largest and best model for chat yet"
Here's what you need to know...
OpenAI's SWE-Lancer benchmark seems like the most "real world" coding benchmark to date.
This might be the biggest arbitrage opportunity on the planet.
A thread on what they found 👇🧵
But here’s the kicker: this strategy isn’t just for coding—it’s the clearest path to AGI and beyond.
Let’s break it down 🧵👇
Unlike Chain of Thought, this "latent reasoning" happens in the model's hidden space.
TONS of benefits from this approach.
Let me break down this fascinating paper...
The crazy part?
The code for the boost was WRITTEN BY R1 itself!
Self-improving AI is here.
Imagine an AI model that competes with OpenAI’s best—trained for just $5M, open-source, and free.
Here's the wild story so far 🧵👇
For the first time, OpenAI's agents can directly impact the real world.
The AI industry had strong reactions!
Here’s a roundup of reactions and incredible use cases. 🧵👇
It's a self-adaptive architecture that allows AI to evolve at inference time.
Model weights are no longer "static"
Let’s break it down: 🧵
Thank you to @NVIDIA_AI_PC for inviting me. What an incredible team!
Check out our wrap up vid on IG (link below)
#NVIDIAPartner
Biden Admin drops major AI chip rules today!
The 200+ page "AI Diffusion" framework completely reshapes global AI tech trade.
Key goal: Keep advanced AI development running on "American rails"
But not everyone is happy... 🧵
I found a great prediction on LessWrong by L Rudolf L (link below).
Capital will matter MORE after AGI.
A thread on the future of wealth, power & human agency 🧵
Singularity, superintelligence, slow vs. fast takeoff...
Here's what you need to know 🧵
NotDiamond CEO @tomas_hk and former OpenAI exec @iamthezack just dropped an article showing a clear vision of AI for 2025.
For most use cases, having multiple smaller, more specialized models will be the right solution.
Here’s everything you need to know: 🧵
They are moving to a more closed and for-profit model while doubling down on AGI safety and scalability.
Is this the right balance of ethics and ambition, or is it a departure from their ideals?
Let’s unpack. 🧵
I've collected some of the reactions from the biggest names in AI: 🧵
This is AGI (not clickbait)
o3 is the best AI ever created, and its performance is WILD.
Here's everything you need to know: 🧵
A groundbreaking framework for creating, training, & deploying embodied agents in simulated environments!
And it's open-source!
Here's why you should care: 🧵
Not only does it crack EVERY frontier model, but it's also super easy to do.
ThIS iZ aLL iT TakE$ 🔥
Here's everything you need to know: 🧵
"Apps as we know them are going away in favor of agents."
Is this the end of traditional software?
Here's what you need to know: 🧵