→ Real-world validation, delivering up to 2.25× better performance per watt compared to GPUs
→ Major SDK advances across multi-chip scaling and tensor parallelism
→ Introduction of the NXT RNGD Server
→ A $125M Series C bridge to accelerate our next phase of execution
Read the paper here: arxiv.org/abs/2507.06996
www.crn.com/news/compone...
Interested in the future of efficient compute? Come talk to our team to learn more.
✅ Llama 3.1 8B: Up to 4.5% average throughput boost and up to 55% average reduction in Time-to-First-Token (TTFT)
✅ Llama 3.1 70B: Up to 3× average throughput improvement and up to 35% TTFT reduction