Do we care at all about generalizability, viz. novel problem-solving, which human brains can do but where Claude 3.7 just falls down? Language is not enough.
bsky.app/profile/fcho...