It's a quick little game about laying tiles in a grid, inspired by a mathematical puzzle.
Play free on itch, in the browser: illomens.itch.io/sqorp #pico8
We've revamped it to make continuous control with multi-agent RL even more dynamic & competitive.
Link: github.com/proroklab/VectorizedMultiAgentSimulator/releases/tag/1.5.0
Here’s what’s new: 🧵👇
#AI #ReinforcementLearning #MARL #MachineLearning
arxiv.org/abs/2501.16937
In our #ICLR2025 paper, we try to push the limits of model distillation! We also trained a SOTA ‘smol’ 🇯🇵LLM that works entirely on-device, and runs inside a web browser.
sakana.ai/taid-jp
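We have developed "TAID", a new knowledge distillation method that efficiently transfers the knowledge of large language models (LLMs) to small models. By transferring the large model's knowledge in step with the small model's learning progress, the method achieves effective knowledge transfer. This work has been accepted to ICLR 2025, an international conference on machine learning.
Paper: arxiv.org/abs/2501.16937
Demo: pub.sakana.ai/tinyswallow/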