Colin White
crwhite-ml.bsky.social
Colin White
@crwhite-ml.bsky.social
LLM evaluation
Head of Research at Abacus.AI. PhD from CMU
https://crwhite.ml
Reposted by Colin White
Shiny! The newly released Llama 3.3 LLM leads the LiveBench ranking for instruction following¹, beating Claude 3.5, GPT-4o, OpenAI o1, and you can run it on your local² machine.

> ollama run llama3.3

livebench.ai#/?IF=as
December 9, 2024 at 8:50 PM