lucasmpaes.bsky.social
@lucasmpaes.bsky.social
PhD candidate @Harvard | @Apple Scholar | Trustworthy AI and Information Theory |

Reposted
AI is built to “be helpful” or “avoid harm”, but which principles should it prioritize and when? We call this alignment discretion. As Asimov's stories show: balancing such principles for AI behavior is tricky. In fact, we find that AI has its own set of priorities. (comic by @xkcd.com)🧵👇
February 19, 2025 at 9:08 PM