Massimo Caccia
banner
masscaccia.bsky.social
Massimo Caccia
@masscaccia.bsky.social
Research Scientist at ServiceNow

Gradient-descent enthusiast building LLM agents.

Formerly Mila, Deepmind, Amazon, ElemenAI, Spotify
Reposted by Massimo Caccia
Pretty cool people are being added to the LLM Agent & LLM Reasoning group. Thanks @lisaalaz.bsky.social for suggesting @jhamrick.bsky.social @gabepsilon.bsky.social and others.

Feel free to mention yourself and others. :)

go.bsky.app/LUrLWXe

#LLMAgents #LLMReasoning
November 23, 2024 at 7:36 PM
Reposted by Massimo Caccia
We’re really excited to release this large collaborative work for unifying web agent benchmarks under the same roof.

In this TMLR paper, we dive in-depth into #BrowserGym and #AgentLab. We also present some unexpected performances from Claude 3.5-Sonnet
December 12, 2024 at 5:55 PM
Reposted by Massimo Caccia
I finally created my first starter pack for #buildinpublic #indiehacker and #founder who are building in the AI Agent Space

If you are building AI agents stuff, I'd be happy to include you in 😁

💡 Share what you're building in the comment
🧡 Like and repost for visibility

go.bsky.app/JPx5hfV
November 30, 2024 at 1:24 PM
Reposted by Massimo Caccia
I've created a starter pack of researchers working on digital agents (focusing on web, mobile and OS agents).

I am missing a lot, and many are not on bsky yet, so if I missed you or someone you know, please send me a DM with the link to a relevant paper and I will update the starter pack!
December 5, 2024 at 7:21 PM
Reposted by Massimo Caccia
I finally created my first starter pack for #buildinpublic #indiehacker and #founder who are building tools related to AI and LLM.

If you are building AI stuff, I'd be happy to include you in 😁

💡 Share what you're building in the comment
🧡 Like and repost for visibility

go.bsky.app/UcofkF4
November 25, 2024 at 10:23 PM
Reposted by Massimo Caccia
🧵-1
We are thrilled to release #AgentLab, a new open-source package for developing and evaluating web agents. This builds on the new #BrowserGym package which supports 10 different benchmarks, including #WebArena.
December 3, 2024 at 9:02 PM