Phydu
phydu.bsky.social
Phydu
@phydu.bsky.social
A Londoner who does theoretical physics research, amongst other things.
Reposted by Phydu
Deepseek R1 for Everyone by neuralnets

Discusses how the Deepseek R1 model actually works in detail but with very less math!

The blog will have 3 main parts

1. **Chain of Thought Reasoning**
2. **Reinforcement Learning**
3. **GRPO**
4. **Distillation**

trite-song-d6a.notion.site/Deepseek-R1-...
Deepseek R1 for Everyone | Notion
made by neuralnets
trite-song-d6a.notion.site
January 26, 2025 at 1:06 AM
If anyone's curious why Musk is attacking the UK:
January 7, 2025 at 7:03 PM
London really is beautiful in autumn
November 17, 2024 at 2:12 PM