We’re looking to hire experienced Software Engineers to join our R&D team. You will help productionize our advanced AI-driven discovery platform and our model-development efforts.
sakana.ai/careers/#sof...
Japanese language fluency is not required for this role.
We’re looking to hire experienced Software Engineers to join our R&D team. You will help productionize our advanced AI-driven discovery platform and our model-development efforts.
sakana.ai/careers/#sof...
Japanese language fluency is not required for this role.
sakana.ai/series-b
From day one, Sakana AI has done things differently. Our research has always focused on developing efficient AI technology sustainably, driven by the belief that resource constraints—not limitless compute—are key to true innovation.
sakana.ai/series-b
From day one, Sakana AI has done things differently. Our research has always focused on developing efficient AI technology sustainably, driven by the belief that resource constraints—not limitless compute—are key to true innovation.
www.technologyreview.com/2025/08/06/1...
www.technologyreview.com/2025/08/06/1...
「試行錯誤」と「集合知」で新たな推論時スケーリングへ
ブログ: sakana.ai/ab-mcts-jp/
論文: arxiv.org/abs/2503.04412
このたびSakana AIは新アルゴリズム「AB-MCTS」を開発し、ARC-AGI-2ベンチマークで有望な結果を得ました。
「試行錯誤」と「集合知」で新たな推論時スケーリングへ
ブログ: sakana.ai/ab-mcts-jp/
論文: arxiv.org/abs/2503.04412
このたびSakana AIは新アルゴリズム「AB-MCTS」を開発し、ARC-AGI-2ベンチマークで有望な結果を得ました。
wired.jp/article/saka...
フロンティアモデルと呼ばれるAIを単体ではなく“混ぜて”使えば、個々のモデル─ChatGPT、Gemini、DeepSeek─を使うよりも大幅に上回る成績を出すことが可能だと、日本発AIスタートアップのSakana AIが発表した。
wired.jp/article/saka...
フロンティアモデルと呼ばれるAIを単体ではなく“混ぜて”使えば、個々のモデル─ChatGPT、Gemini、DeepSeek─を使うよりも大幅に上回る成績を出すことが可能だと、日本発AIスタートアップのSakana AIが発表した。
3人集まれば1人よりも優れた知恵が出るということわざ「3人寄れば文殊の知恵」が、AIにも当てはまった格好だ。 🐡🐟🐠
xtech.nikkei.com/atcl/nxt/new...
3人集まれば1人よりも優れた知恵が出るということわざ「3人寄れば文殊の知恵」が、AIにも当てはまった格好だ。 🐡🐟🐠
xtech.nikkei.com/atcl/nxt/new...
arxiv.org/abs/2503.04412
arxiv.org/abs/2503.04412
sakana.ai/careers/#app...
正社員だけでなく学生インターンシップも歓迎です✨
金融・保険などのエンタープライズ分野から政府・防衛などの公共分野での業務に興味のある方
最先端のAI技術を実社会に導入してインパクトを出したい方
雇用期間や勤務スタイルの相談もできますのでぜひご応募ください!
sakana.ai/careers/#app...
正社員だけでなく学生インターンシップも歓迎です✨
金融・保険などのエンタープライズ分野から政府・防衛などの公共分野での業務に興味のある方
最先端のAI技術を実社会に導入してインパクトを出したい方
雇用期間や勤務スタイルの相談もできますのでぜひご応募ください!
Our new inference-time scaling algorithm enables collective intelligence for AI by allowing multiple frontier models (like Gemini 2.5 Pro, o4-mini, DeepSeek-R1-0528) to cooperate.
Blog: sakana.ai/ab-mcts
Paper: arxiv.org/abs/2503.04412
Our new inference-time scaling algorithm enables collective intelligence for AI by allowing multiple frontier models (like Gemini 2.5 Pro, o4-mini, DeepSeek-R1-0528) to cooperate.
Blog: sakana.ai/ab-mcts
Paper: arxiv.org/abs/2503.04412
Our new inference-time scaling algorithm enables collective intelligence for AI by allowing multiple frontier models (like Gemini 2.5 Pro, o4-mini, DeepSeek-R1-0528) to cooperate.
Blog: sakana.ai/ab-mcts
Paper: arxiv.org/abs/2503.04412
At Sakana AI, we remain committed to pioneering novel AI systems by applying nature-inspired principles such as evolution and collective intelligence.
At Sakana AI, we remain committed to pioneering novel AI systems by applying nature-inspired principles such as evolution and collective intelligence.
Algorithm (TreeQuest): github.com/SakanaAI/tre...
Algorithm (TreeQuest): github.com/SakanaAI/tre...
sakana.ai/ab-mcts/
We developed AB-MCTS, a new inference-time scaling algorithm that enables multiple frontier AI models to cooperate, achieving promising initial results on the ARC-AGI-2 benchmark.
sakana.ai/ab-mcts/
We developed AB-MCTS, a new inference-time scaling algorithm that enables multiple frontier AI models to cooperate, achieving promising initial results on the ARC-AGI-2 benchmark.
日本語ブログ : sakana.ai/ctm-jp
インタラクティブレポート : pub.sakana.ai/ctm
Sakana AIは、時間情報を明示的に扱う新しいAIモデル「Continuous Thought Machine(CTM)」を発表しました 。
日本語ブログ : sakana.ai/ctm-jp
インタラクティブレポート : pub.sakana.ai/ctm
Sakana AIは、時間情報を明示的に扱う新しいAIモデル「Continuous Thought Machine(CTM)」を発表しました 。
Blog → sakana.ai/ctm
Modern AI is powerful, but it's still distinct from human-like flexible intelligence. We believe neural timing is key. Our Continuous Thought Machine is built from the ground up to use neural dynamics as a powerful representation for intelligence.
Blog → sakana.ai/ctm
Modern AI is powerful, but it's still distinct from human-like flexible intelligence. We believe neural timing is key. Our Continuous Thought Machine is built from the ground up to use neural dynamics as a powerful representation for intelligence.
Interactive Paper (with web-demo): pub.sakana.ai/ctm/
Full Paper: arxiv.org/abs/2505.05522
GitHub Project: github.com/SakanaAI/con...
Interactive Paper (with web-demo): pub.sakana.ai/ctm/
Full Paper: arxiv.org/abs/2505.05522
GitHub Project: github.com/SakanaAI/con...
“While there are many AI companies in the US and China, Japanese firms have had little global presence. We believe there’s a demand—particularly among government agencies—for domestically developed AI solutions.”
asia.nikkei.com/Business/Tec...
“While there are many AI companies in the US and China, Japanese firms have had little global presence. We believe there’s a demand—particularly among government agencies—for domestically developed AI solutions.”
asia.nikkei.com/Business/Tec...
Paper: openreview.net/forum?id=dh4...
Transformer-Squared adapts its weights on the fly for each query, achieving strong performance across tasks and enabling parameter-efficient life-long learning.
Paper: openreview.net/forum?id=dh4...
Transformer-Squared adapts its weights on the fly for each query, achieving strong performance across tasks and enabling parameter-efficient life-long learning.
Paper: openreview.net/forum?id=Kvd...
CycleQD is an ecological niche-inspired model-merging approach that achieves great performance on computer science tasks while retaining language capabilities.
Paper: openreview.net/forum?id=Kvd...
CycleQD is an ecological niche-inspired model-merging approach that achieves great performance on computer science tasks while retaining language capabilities.
Paper: openreview.net/forum?id=s1k...
Neural Attention Memory Models (NAMMs) is an evolved memory system trained to improve performance and efficiency on language transformers, and zero-shot transferring to vision and RL foundation models.
Paper: openreview.net/forum?id=s1k...
Neural Attention Memory Models (NAMMs) is an evolved memory system trained to improve performance and efficiency on language transformers, and zero-shot transferring to vision and RL foundation models.
Paper: openreview.net/forum?id=cqs...
TAID is a novel knowledge distillation method that uses a time-dependent intermediate distribution addressing common challenges in distilling LLMs.
Paper: openreview.net/forum?id=cqs...
TAID is a novel knowledge distillation method that uses a time-dependent intermediate distribution addressing common challenges in distilling LLMs.