banner
ariveram2111.bsky.social
@ariveram2111.bsky.social
ML engineer | Professor | Tech speaker
#AI #ML #MachineLearning #Software #tech
Embracing new challenges | Tech-driven mindset 🚀
But the most remarkable part from my point of view? 🤯
DeepSeek-R1-Zero is trained with almost pure RL, without supervised fine-tuning (SFT) as a preliminary step. The result? Outstanding reasoning capabilities with minimal labeled data

#AI #ML #MachineLearning #DL #DeepLearning
January 20, 2025 at 5:53 PM
🚀 New DeepSeek models are here!

🔥 Key highlights:
-Performance on par with OpenAI GPT-4o
-Open weights, fully accessible
-MIT license, allowing commercial use
-API available – 27x cheaper than OpenAI GPT-4o
-Distilled from DeepSeek-R1, 6 small models fully open-sourced
#ML #AI #DeepLearning #DL
January 20, 2025 at 5:51 PM
But the most remarkable part from my point of view? 🤯
DeepSeek-R1-Zero is trained with almost pure RL, without supervised fine-tuning (SFT) as a preliminary step. The result? Outstanding reasoning capabilities with minimal labeled data
January 20, 2025 at 5:35 PM