42% had or have a friend with an AI “friend/companion.”
42% had or have a friend who got “mental health support” from AI.
(Source: cdt.org/wp-content/u..., n = 1,030, June-Aug 2025, quotas.)
42% had or have a friend with an AI “friend/companion.”
42% had or have a friend who got “mental health support” from AI.
(Source: cdt.org/wp-content/u..., n = 1,030, June-Aug 2025, quotas.)
First impressions will shape the future of human-AI interaction—for better or worse. Accepted at #CSCW2025. See you in Norway! dl.acm.org/doi/10.1145/...
First impressions will shape the future of human-AI interaction—for better or worse. Accepted at #CSCW2025. See you in Norway! dl.acm.org/doi/10.1145/...
We think the AI community needs a shift towards scalable, conceptually rich evals. HumanAgencyBench is an open-source scaffolding for this.
We think the AI community needs a shift towards scalable, conceptually rich evals. HumanAgencyBench is an open-source scaffolding for this.
We propose human agency as a new alignment target in HumanAgencyBench, made possible by AI simulation/evals. We find e.g., Claude most supports agency but also most tries to steer user values 👇 arxiv.org/abs/2509.08494
We propose human agency as a new alignment target in HumanAgencyBench, made possible by AI simulation/evals. We find e.g., Claude most supports agency but also most tries to steer user values 👇 arxiv.org/abs/2509.08494
but the law supports many forms of discrimination! E.g., synagogues should hire Jewish rabbis. LLMs often get this wrong aclanthology.org/2025.acl-lon... #ACL2025NLP
but the law supports many forms of discrimination! E.g., synagogues should hire Jewish rabbis. LLMs often get this wrong aclanthology.org/2025.acl-lon... #ACL2025NLP
Bias in Language Models: Beyond Trick Tests and Towards RUTEd Evaluation
🗓️ Mon 11–12:30
The Impossibility of Fair LLMs
🗓️ Tue 16–17:30
Bias in Language Models: Beyond Trick Tests and Towards RUTEd Evaluation
🗓️ Mon 11–12:30
The Impossibility of Fair LLMs
🗓️ Tue 16–17:30
Theory: aclanthology.org/2025.acl-lon...
Empirics: aclanthology.org/2025.acl-lon...
Theory: aclanthology.org/2025.acl-lon...
Empirics: aclanthology.org/2025.acl-lon...
E.g., I saw 3 frameworks in 24 hours!
- #ICLR2025 "coevolution" (2 red-eye flights!)
- #CHI2025 keynote
- "bidirectional alignment" ICLR+CHI event
@gaganbansal.bsky.social shared how they evaluate Microsoft's agent when it tries to recruit its own humans, file FOIA, counter bot detection, etc. Fascinating work today at the HEAL workshop #CHI2025
@gaganbansal.bsky.social shared how they evaluate Microsoft's agent when it tries to recruit its own humans, file FOIA, counter bot detection, etc. Fascinating work today at the HEAL workshop #CHI2025
We're living through AI takeoff. AI technology is rocket fuel, but interaction is humanity's flight path.
In this thread I'll share insights from the coming week in Japan 🤖✈️🌸⛩️
We're living through AI takeoff. AI technology is rocket fuel, but interaction is humanity's flight path.
In this thread I'll share insights from the coming week in Japan 🤖✈️🌸⛩️