I’m delighted to tell you about our new paper, Soft Condorcet Optimization (SCO) for Ranking of General Agents, to be presented at AAMAS 2025! 🧵 1/N
I’m delighted to tell you about our new paper, Soft Condorcet Optimization (SCO) for Ranking of General Agents, to be presented at AAMAS 2025! 🧵 1/N
It is SOTA on every planning benchmark we tried.
In self-play, it goes 20 years between collisions.
It is SOTA on every planning benchmark we tried.
In self-play, it goes 20 years between collisions.
www.metacareers.com/jobs/1459691...
www.metacareers.com/jobs/1459691...
www.metacareers.com/jobs/1459691...
www.metacareers.com/jobs/1459691...
📚 Our new paper (w. Q. Zheng, @mikaelhenaff.bsky.social, A. Zhang, A. Grover) studies LLM-driven feedback for NetHack!
Paper: arxiv.org/abs/2410.23022
Code: github.com/facebookrese...
📚 Our new paper (w. Q. Zheng, @mikaelhenaff.bsky.social, A. Zhang, A. Grover) studies LLM-driven feedback for NetHack!
Paper: arxiv.org/abs/2410.23022
Code: github.com/facebookrese...
Please apply here and message me:
www.metacareers.com/jobs/3950223...
Please apply here and message me:
www.metacareers.com/jobs/3950223...
pdf: drive.google.com/file/d/1CTGo...
ipynb: github.com/kuleshov/cor...
pdf: drive.google.com/file/d/1CTGo...
ipynb: github.com/kuleshov/cor...