Also @pekka on T2 / Pebble.
Yep, it's that same old HLE. They have submitted the paper 07 May 2025. And no, I don't know what the point of publishing it like that is either. Looks good on CVs, I guess.
Yep, it's that same old HLE. They have submitted the paper 07 May 2025. And no, I don't know what the point of publishing it like that is either. Looks good on CVs, I guess.
All progress is significant, as humanity's baseline is also at zero. The very best humans are estimated to have 50% chance of solving these with weeks or years of full-time work.
AI hasn’t solved any of these yet, but the game is young!
All progress is significant, as humanity's baseline is also at zero. The very best humans are estimated to have 50% chance of solving these with weeks or years of full-time work.
Said a human who doesn't know and can't explain what that actually means.
Said a human who just learned that statement from the Internet.
Said a human who doesn't know and can't explain what that actually means.
Why not force the sale of TikTok USDS to some private Chinese company, like ByteDance, which has expertise running that sort of thing.
Why not force the sale of TikTok USDS to some private Chinese company, like ByteDance, which has expertise running that sort of thing.
But their demo app indicates it's not perfect yet.
blog.google/innovation-a...
But their demo app indicates it's not perfect yet.
blog.google/innovation-a...
"AI is teaching us...our idea of what intelligence is is not really accurate"
"we were looking for some elusive intelligent way of of thinking and we don't see it in the tools that actually solve our goals...maybe it's actually because intelligence is not what we think it is"
"AI is teaching us...our idea of what intelligence is is not really accurate"
"we were looking for some elusive intelligent way of of thinking and we don't see it in the tools that actually solve our goals...maybe it's actually because intelligence is not what we think it is"
The US is not one of them.
(There are no penguins in Greenland.)
The US is not one of them.
(There are no penguins in Greenland.)
10 of those were in the held-out set OpenAI doesn't have access to.
There's only 2 problems some other model has solved but it hasn't.
10 of those were in the held-out set OpenAI doesn't have access to.
There's only 2 problems some other model has solved but it hasn't.
"other known psychedelic compounds also usually produce idiosyncratic trips that vary not only from person to person but also from one experience to the next within the same individual. With L. asiatica, though, "the perception of little people is very reliably and repeatedly reported"
"other known psychedelic compounds also usually produce idiosyncratic trips that vary not only from person to person but also from one experience to the next within the same individual. With L. asiatica, though, "the perception of little people is very reliably and repeatedly reported"
Said a human who just learned that statement from the Internet.
Said a human who just learned that statement from the Internet.
"The White House...posted an altered photo of an attorney arrested after a Minnesota church protest"
"Abigail Jackson, a White House spokeswoman, mocked people questioning the image with an X post that said: “uM, eXCuSe mE??? iS tHAt DiGiTAlLy AlTeReD?!?!?!?!?!”"
@donmoyn.bsky.social told me this moves us "closer to the Stalinesque manipulation of images that we think about with authoritarian propaganda, where you really cannot trust materials the state is putting out"
Gift link: wapo.st/4sUEnmr
"The White House...posted an altered photo of an attorney arrested after a Minnesota church protest"
"Abigail Jackson, a White House spokeswoman, mocked people questioning the image with an X post that said: “uM, eXCuSe mE??? iS tHAt DiGiTAlLy AlTeReD?!?!?!?!?!”"
I asked Gemini to do it based on the limited information on their site, and it basically ended up agreeing with what I thought.
I asked Gemini to do it based on the limited information on their site, and it basically ended up agreeing with what I thought.
"The best" of the "current LLMs" in that study was GPT-4. Results would likely be different with actual current LLMs.
Traditional publishing is just too slow for stuff like that.
"The best" of the "current LLMs" in that study was GPT-4. Results would likely be different with actual current LLMs.
Traditional publishing is just too slow for stuff like that.
ICE then hid the memo from the public, passing it along by word of mouth and private conversation.
ICE then hid the memo from the public, passing it along by word of mouth and private conversation.
"Speaking in general terms about the development cycle, Bosworth said: "There's a tremendous amount of work to do post-training" for AI, "to actually deliver the model in a way that's usable internally and by consumers.""
"Speaking in general terms about the development cycle, Bosworth said: "There's a tremendous amount of work to do post-training" for AI, "to actually deliver the model in a way that's usable internally and by consumers.""
"Trump, who has demanded permission from Denmark to take control of Greenland"
"Trump, who has demanded permission from Denmark to take control of Greenland"