Lakshya A Agrawal
lakshyaaagrawal.bsky.social
Lakshya A Agrawal
@lakshyaaagrawal.bsky.social
PhD @ucberkeleyofficial.bsky.social | Past: AI4Code Research Fellow @msftresearch.bsky.social | Summer @EPFL Scholar, CS and Applied Maths @IIITDelhi | Hobbyist Saxophonist

https://lakshyaaagrawal.github.io

Maintainer of https://aka.ms/multilspy
Reposted by Lakshya A Agrawal
GEPA (Genetic-Pareto) is a sample-efficient prompt optimization method for compound AI systems that works by reflectively evolving prompts using natural language feedback instead of traditional scalar rewards.
October 21, 2025 at 3:03 PM
Reposted by Lakshya A Agrawal
In each iteration, GEPA samples system rollouts (including reasoning traces, tool outputs, and any diagnostic text), reflects on them via an LLM to identify issues or propose improvements, and updates specific module prompts accordingly based on the feedback.
October 21, 2025 at 3:03 PM
Reposted by Lakshya A Agrawal
To ensure diversity and avoid local optima, GEPA maintains a pool of candidates and uses Pareto-based selection, which keeps all non-dominated strategies discovered so far and stochastically proposes new prompt variants, enabling robust generalization with far fewer rollouts than reinforcement
October 21, 2025 at 3:03 PM
Reposted by Lakshya A Agrawal
Automating Agentic Prompts: A new algorithm called GEPA, developed by researchers at UC Berkeley, Stanford, and other institutions, improves the performance of agentic systems by automatically refining their prompts.
October 23, 2025 at 7:30 PM
Hey, would love to get any feedback on how you'd think about improving the interface
October 17, 2025 at 5:01 PM
Reposted by Lakshya A Agrawal
Just what I was looking for. Thank you for sharing, looking forward to the read.
September 28, 2025 at 12:58 AM
Reposted by Lakshya A Agrawal
propose and test prompt updates, and combine complementary lessons from the Pareto frontier of its own attempts.

arxiv.org/abs/2507.19457
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
Large language models (LLMs) are increasingly adapted to downstream tasks via reinforcement learning (RL) methods like Group Relative Policy Optimization (GRPO), which often require thousands of rollo...
arxiv.org
September 28, 2025 at 12:28 AM
Reposted by Lakshya A Agrawal
..GEPA and prompt optimization explained: https://arxiv.org/abs/2507.19457v1

(7/7)
September 28, 2025 at 11:13 AM
Reposted by Lakshya A Agrawal
..make adapting large models more practical—especially when compute or data is limited. It’s like giving AI a way to learn from its own “thinking out loud,” turning natural language into a powerful tool for self-improvement.

Links:
Paper on arXiv: https://arxiv.org/abs/2507.19457 ..

(6/7)
September 28, 2025 at 11:13 AM
Reposted by Lakshya A Agrawal
..code on the fly.

What’s cool here is the shift from treating AI tuning as a blind search for a higher score to a reflective process that leverages the AI’s native strength: language. By evolving prompts through thoughtful reflections, GEPA unlocks smarter, faster learning that could..

(5/7)
September 28, 2025 at 11:13 AM
Reposted by Lakshya A Agrawal
..fewer attempts than traditional reinforcement learning methods. On several tough tasks like multi-step question answering and instruction following, GEPA consistently outperforms both standard reinforcement learning and previous prompt optimizers. It even shows promise for optimizing..

(4/7)
September 28, 2025 at 11:13 AM
Reposted by Lakshya A Agrawal
..strategies by mixing and matching what works best.

GEPA treats AI prompt tuning like a conversation with itself, iterating through generations of prompts that learn from detailed feedback written in words, not just numbers. This lets it learn much more efficiently—up to 35 times..

(3/7)
September 28, 2025 at 11:13 AM
Reposted by Lakshya A Agrawal
..what went wrong and how to fix it? That’s the idea behind a new approach called GEPA. Instead of relying solely on those sparse reward signals, GEPA has AI inspect its own attempts using natural language reflections. It diagnoses errors, proposes prompt fixes, and evolves smarter..

(2/7)
September 28, 2025 at 11:13 AM