Gradient-descent enthusiast building LLM agents.
Formerly Mila, Deepmind, Amazon, ElemenAI, Spotify
Feel free to mention yourself and others. :)
go.bsky.app/LUrLWXe
#LLMAgents #LLMReasoning
Feel free to mention yourself and others. :)
go.bsky.app/LUrLWXe
#LLMAgents #LLMReasoning
In this TMLR paper, we dive in-depth into #BrowserGym and #AgentLab. We also present some unexpected performances from Claude 3.5-Sonnet
In this TMLR paper, we dive in-depth into #BrowserGym and #AgentLab. We also present some unexpected performances from Claude 3.5-Sonnet
If you are building AI agents stuff, I'd be happy to include you in 😁
💡 Share what you're building in the comment
🧡 Like and repost for visibility
go.bsky.app/JPx5hfV
If you are building AI agents stuff, I'd be happy to include you in 😁
💡 Share what you're building in the comment
🧡 Like and repost for visibility
go.bsky.app/JPx5hfV
I am missing a lot, and many are not on bsky yet, so if I missed you or someone you know, please send me a DM with the link to a relevant paper and I will update the starter pack!
I am missing a lot, and many are not on bsky yet, so if I missed you or someone you know, please send me a DM with the link to a relevant paper and I will update the starter pack!
If you are building AI stuff, I'd be happy to include you in 😁
💡 Share what you're building in the comment
🧡 Like and repost for visibility
go.bsky.app/UcofkF4
If you are building AI stuff, I'd be happy to include you in 😁
💡 Share what you're building in the comment
🧡 Like and repost for visibility
go.bsky.app/UcofkF4
We are thrilled to release #AgentLab, a new open-source package for developing and evaluating web agents. This builds on the new #BrowserGym package which supports 10 different benchmarks, including #WebArena.
We are thrilled to release #AgentLab, a new open-source package for developing and evaluating web agents. This builds on the new #BrowserGym package which supports 10 different benchmarks, including #WebArena.