tremere01.bsky.social
@tremere01.bsky.social
Reposted
I wrote about a new AI evaluation called "Humanity's Last Exam," a collection of 3,000 questions submitted by leading academics to try to stump leading AI models, which mostly find today's college-level tests too easy.

www.nytimes.com/2025/01/23/t...
A Test So Hard No AI System Can Pass It — Yet (Gift Article)
The creators of a new test called “Humanity’s Last Exam” argue we may soon lose the ability to create tests hard enough for A.I. models.
www.nytimes.com
January 23, 2025 at 4:41 PM