Léo Boisvert
leo-boisvert.bsky.social
Léo Boisvert
@leo-boisvert.bsky.social
PhD student @ MILA, ServiceNow Research
Reposted by Léo Boisvert
We’re really excited to release this large collaborative work for unifying web agent benchmarks under the same roof.

In this TMLR paper, we dive in-depth into #BrowserGym and #AgentLab. We also present some unexpected performances from Claude 3.5-Sonnet
December 12, 2024 at 5:55 PM
Reposted by Léo Boisvert
🧵-1
We are thrilled to release #AgentLab, a new open-source package for developing and evaluating web agents. This builds on the new #BrowserGym package which supports 10 different benchmarks, including #WebArena.
December 3, 2024 at 9:02 PM