Researcher @aial.ie @tcddublin.bsky.social | Formerly Research Engineer @ DeepMind
The results in this post are entirely predicated on those definitions which, if shoddy, could completely undermine their reported progress.
openai.com/index/streng...
The results in this post are entirely predicated on those definitions which, if shoddy, could completely undermine their reported progress.
openai.com/index/streng...