I also give a proposed timeline with powerful AI in early 2027 so we can (hopefully) update before it is too late.
I also give a proposed timeline with powerful AI in early 2027 so we can (hopefully) update before it is too late.
I'm skeptical, though I agree that AIs are writing a high fraction of code at Anthropic.
I'm skeptical, though I agree that AIs are writing a high fraction of code at Anthropic.
I discuss how promising this is and how we might do this in a new post: www.lesswrong.com/posts/v6K3hn...
I discuss how promising this is and how we might do this in a new post: www.lesswrong.com/posts/v6K3hn...
AI companies should be transparent about whether (and how) they train against CoT. While OpenAI is doing better, all AI companies should say more. 1/
AI companies should be transparent about whether (and how) they train against CoT. While OpenAI is doing better, all AI companies should say more. 1/
I've found it helpful to separate this out into Plan A/B/C/D and to plan for these situations somewhat separately.
I say more in a new post:
I've found it helpful to separate this out into Plan A/B/C/D and to plan for these situations somewhat separately.
I say more in a new post:
At the start of 2025, I thought full automation of AI R&D before 2029 was ~25% likely, now I think it's only ~15% likely.
At the start of 2025, I thought full automation of AI R&D before 2029 was ~25% likely, now I think it's only ~15% likely.
My bluesky account will cross post my posts from X/twitter starting with some of my historical posts that people might be interested in.
My bluesky account will cross post my posts from X/twitter starting with some of my historical posts that people might be interested in.