Sabri Pllana
@sabripllana.bsky.social
EIS interactive tool 2025
projects.research-and-innovation.ec.europa.eu
August 29, 2025 at 7:04 AM
It was done “Without thinking” 🙂
August 8, 2025 at 6:41 AM
GPT-5 (#GPT5) reliability and accuracy
August 7, 2025 at 5:28 PM
EIS interactive tool 2025
projects.research-and-innovation.ec.europa.eu
July 15, 2025 at 1:11 PM
“Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs”

Mantas Mazeika, Xuwang Yin, Rishub Tamirisa, Jaehyuk Lim, Bruce W. Lee, Richard Ren, Long Phan, Norman Mu, Adam Khoja, Oliver Zhang, Dan Hendrycks

arxiv.org/abs/2502.08640
Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs
As AIs rapidly advance and become more agentic, the risk they pose is governed not only by their capabilities but increasingly by their propensities, including goals and values. Tracking the emergence...
arxiv.org
June 16, 2025 at 3:38 PM