The theme: 8 feathers for our 8 incredible Olympians. Let's cheer them on!
#IOAI2025 #TeamIndia #AI
The theme: 8 feathers for our 8 incredible Olympians. Let's cheer them on!
#IOAI2025 #TeamIndia #AI
meresophistry.substack.com/p/the-mental...
meresophistry.substack.com/p/the-mental...
What is the fundamental difference between active learning and data filtering?
Well, obviously, the difference is that:
1/11
What is the fundamental difference between active learning and data filtering?
Well, obviously, the difference is that:
1/11
arxiv.org/abs/2504.12501
arxiv.org/abs/2504.12501
142-page report diving into the reasoning chains of R1. It spans 9 unique axes: safety, world modeling, faithfulness, long context, etc.
142-page report diving into the reasoning chains of R1. It spans 9 unique axes: safety, world modeling, faithfulness, long context, etc.
Check it out here: arxiv.org/pdf/2409.14509
Check it out here: arxiv.org/pdf/2409.14509
- 1.4M high-quality reasoning problems with verified solutions
- 900K entries distilled from DeepSeek-R1-671B
- Covers math, code, and complex reasoning tasks
- Bilingual (Chinese/English)
huggingface.co/datasets/a-m...
- 1.4M high-quality reasoning problems with verified solutions
- 900K entries distilled from DeepSeek-R1-671B
- Covers math, code, and complex reasoning tasks
- Bilingual (Chinese/English)
huggingface.co/datasets/a-m...
drubinstein.github.io/pokerl/
drubinstein.github.io/pokerl/
- Performance surpasses models like Llama3.1-8B and Qwen2.5-7B
- Capable of deep reasoning with system prompts
- Trained only on 4T high-quality tokens
huggingface.co/collections/...
- Performance surpasses models like Llama3.1-8B and Qwen2.5-7B
- Capable of deep reasoning with system prompts
- Trained only on 4T high-quality tokens
huggingface.co/collections/...