Dylan Hilier
dashillier.bsky.social
PhD @SMU/ASTAR doing social learning stuff
Reposted by Dylan Hilier
TextArena is live on arXiv❗
We present a benchmark of 57+ competitive text-based games to evaluate & train LLMs,
including negotiation, deception, theory of mind...
Multiplayer support
Human-vs-models
Model-vs-model

Perfect for social interaction, multi-agent, multi-turn reasoning and planning
🤖📈
April 16, 2025 at 11:44 AM
Yiiiikes,
ok TBF we did implicitly prompt this a little by asking why the punchable people it generated were always white buuuut not sure it was making quite the right inference after that
April 14, 2025 at 9:48 AM