Lightnews — Scholar-powered news

Colin White

@crwhite-ml.bsky.social

15 followers 16 following 1 post

LLM evaluation
Head of Research at Abacus.AI. PhD from CMU
https://crwhite.ml

Reposted by Colin White

Henry Garner

@hendroid.io

Shiny! The newly released Llama 3.3 LLM leads the LiveBench ranking for instruction following¹, beating Claude 3.5, GPT-4o, OpenAI o1, and you can run it on your local² machine.

> ollama run llama3.3

livebench.ai#/?IF=as

[Image: The LiveBench leaderboard showing llama-3.3-70b-instruct-turbo in the leading position by average instruction-following performance]

December 9, 2024 at 8:50 PM
