AI Risk Hub
@airiskhub.bsky.social
The latest research and policy news on AI risks.

Tag to share your work.

AIRiskHub.org
Reposted by AI Risk Hub
December 19, 2024 at 12:08 AM
Reposted by AI Risk Hub
An extraordinarily difficult math AI benchmark: epoch.ai/frontiermath
FrontierMath
FrontierMath is a benchmark of hundreds of unpublished and extremely challenging math problems to help us to understand the limits of artificial intelligence.
epoch.ai
December 14, 2024 at 6:46 PM
Reposted by AI Risk Hub
I’m keen to dig more into safety cases. There’s something ‘proving a negative’ about them, but equally it’s good to see a really concrete attempt to tether speculation. Here’s a new piece from UK AISI (@girving.bsky.social) and GovAI attempting to provide a template:

arxiv.org/abs/2411.08088
Safety case template for frontier AI: A cyber inability argument
Frontier artificial intelligence (AI) systems pose increasing risks to society, making it essential for developers to provide assurances about their safety. One approach to offering such assurances is...
arxiv.org
November 17, 2024 at 2:18 PM