Omer Celik
banner
omercelik.com
Omer Celik
@omercelik.com
Engineer at Amazon &
Developer of tureng.com
Here is another test with Llama 4 Scout - 6 bit quantized version on M4 Max. Got ~30t/s which is faster than most of the hosted models.
April 6, 2025 at 10:03 PM