We’re building cutting-edge, open-source AI tutoring models for high-quality, adaptive learning for all pupils with support from the Hector Foundation.
👉 forms.gle/sxvXbJhZSccr...
Find out more: institute-tue.ellis.eu/en/news/from...
How does LLM training loss translate to downstream performance?
We show that pretraining data and tokenizer shape loss-to-loss scaling, while architecture and other factors play a surprisingly minor role!
brendel-group.github.io/llm-line/ 🧵1/8
Since DeepSeek-R1 introduced reasoning-based RL, datasets like Open-R1 & OpenThoughts have emerged for fine-tuning & GRPO. Our deep dive found major flaws: 25% of OpenThoughts had to be removed during data curation.
Here's why 👇🧵
arxiv.org/pdf/2502.05151
📍 Tübingen
📅 Apply by March 3rd, 2025
📌 Get details: tuebingen.ai/careers
From tomorrow, February 11, to Saturday, February 15, the Didacta education fair will take place in Stuttgart:
📍 Find BWKI in Hall 7, Stand 7D75, in the AI & Digital Workshop area. Workshops are offered!