🔸10% Pledge at GivingWhatWeCan.
Love how clicking @METR_Evals's new Notes page changes the whole site to handwritten font and chalk background.
Strong visual screaming "no seriously, this is rough".
Today we're launching STREAM: a checklist for more transparent eval results.
I read a lot of model reports. Often they miss important details, like human baselines. STREAM helps make peer review more systematic.
• 4 cases of late safety results (out of 27, so ~15%)
• Notably, 2 were cases where late results showed increases in risk
• The most recent set of releases in August were all on time
x.com/HarryBooth5...
If LLMs do very well on a virology eval, human-caused epidemics could increase 2-5x.
Most thought this was >5yrs away. In fact, the threshold was hit just *months* after the survey. 🧵
Now court documents against his accomplice show the terrorist asked AI to help build the bomb.
A thread on what I think those documents do and don't show 🧵…
x.com/CNBC/status...
Kudos to OpenAI for consistently publishing these eval results, and great to see Anthropic now sharing a lot more too.
Now that o1 is out, how does it stack up?
Better! (Though there’s still room for improvement.)
Here’s my new o1 scorecard. 🧵👇
Framing climate change as an inequality problem, not an extinction risk, highlights the need for global aid, LMIC growth, and valuing all lives equally.