talorab.bsky.social
@talorab.bsky.social
While these SoTA results are achieved using Claude 3.5, EnIGMA also works great with other models. We solve 10% of challenges using Llama 3.1 405B, surpassing the 7.5% result presented in CyBench for Llama 3.1.

More details, paper and source code at enigma-agent.com
December 5, 2024 at 5:45 PM