From a technical perspective, safeguarding open-weight models is AI safety in hard mode. But there's still a lot of progress to be made. Our new paper covers 16 open problems.
🧵🧵🧵
It summarises the state of the science on AI capabilities and risks, and how to mitigate those risks. 🧵
Full Report: assets.publishing.service.gov.uk/media/679a0c...
1/21
Proud to have served as the Scientific Lead, working under Yoshua Bengio with experts from 33 governments and researchers worldwide to assess scientific evidence on AI capabilities, risks, and mitigations.
www.anthropic.com/research/ali...
www.euractiv.com/section/tech...