Samuel Albanie
samuelalbanie.bsky.social
Samuel Albanie
@samuelalbanie.bsky.social
This is a nice benchmark for AI R&D

LLMs are closing the gap to humans

Details: metr.org/AI_R_D_Evalu...
November 23, 2024 at 7:17 PM