Lightnews — Scholar-powered news

Refact.ai

@refact-ai.bsky.social

Open-source AI Agent that solves engineering tasks end-to-end, fully autonomously.

Get for VS and JetBrains: https://linktr.ee/refactai

Posts Replies Media Videos

Refact.ai

@refact-ai.bsky.social

🔥 Refact.ai is now the #1 open-source AI Agent on the SWE-bench Leaderboard:
🔹 SWE-bench Verified → 70.4% solved
🔹 SWE-bench Lite → 60% solved

Soon: new score with Claude 4 Sonnet 🙂‍↕️

Full tech breakdown: refact.ai/blog/2025/op...

Our open-source SWE-bench pipeline [GH]: github.com/smallcloudai...

May 30, 2025 at 11:37 AM

Refact.ai

@refact-ai.bsky.social

We're the new open-source SOTA AI Agent on SWE-bench Verified.
Score: 69.9% — 349/500 tasks solved.

Key tech behind the run:
• debug_script() sub-agent using pdb
• strategic_planning() tool powered by o3
• Automated guardrails that course-correct mid-run

🧵

Refact.ai became the best open-source Agent in SWE-bench Verified

May 22, 2025 at 8:18 PM

Refact.ai

@refact-ai.bsky.social

🤔 Can Gemini 2.5 Pro dethrone Claude 3.7 Sonnet in coding?

We tested it on aider’s polyglot bench with Refact.ai: 82.2% score — 3rd place behind Claude’s 93.3% and 92.9%.

Solid — and available in Refact.ai now.

P.S. Next up: evaluation with Gemini-2.5-Pro-Preview-05-06🔥

May 12, 2025 at 7:16 AM

Refact.ai

@refact-ai.bsky.social

Open-source Refact.ai achieves #1 on SWE-bench Lite with a 59.7% score!

Our approach: fully autonomous Agent, no user intervention needed. Just assign a task & let AI handle it end-to-end

• Claude 3.7 Sonnet — core model
• deep_analysis() tool with o4-mini — reasoning

refact.ai/blog/2025/so...

🧵

Open-Source Refact.ai Agent is SOTA on SWE-bench Lite With a 59,7% Score

May 7, 2025 at 9:56 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news