You can now create datasets, run experiments, and attach evaluations to experiments using the Phoenix TS/JS client.
Shoutout to @anthonypowell.me and @mikeldking.bsky.social for the work here!
You can now create datasets, run experiments, and attach evaluations to experiments using the Phoenix TS/JS client.
Shoutout to @anthonypowell.me and @mikeldking.bsky.social for the work here!
This guide is a great primer on common approaches we see towards automated prompt optimization. If you've already read 100 "prompting tips and tricks" blogs but aren't yet a full DSPy contributor, then let this be your bridge!
This guide is a great primer on common approaches we see towards automated prompt optimization. If you've already read 100 "prompting tips and tricks" blogs but aren't yet a full DSPy contributor, then let this be your bridge!
I’ve been really liking some of the eval tools from Pydantic's evals package.
Wanted to see if I could combine these with Phoenix’s tracing so I could run Pydantic evals on traces captured in Phoenix
I’ve been really liking some of the eval tools from Pydantic's evals package.
Wanted to see if I could combine these with Phoenix’s tracing so I could run Pydantic evals on traces captured in Phoenix
It's amazing to see how fast the discourse has moved from "just agents" to now multi-agent flows, optimized evals, and automated improvement strategies.
It's amazing to see how fast the discourse has moved from "just agents" to now multi-agent flows, optimized evals, and automated improvement strategies.
Tag a function with `@ tracer.llm` to automatically capture it as an @opentelemetry.io span.
- Automatically parses input and output messages
- Comes in decorator or context manager flavors
Tag a function with `@ tracer.llm` to automatically capture it as an @opentelemetry.io span.
- Automatically parses input and output messages
- Comes in decorator or context manager flavors
Phoenix has changed a TON since its first iteration.
I'm constantly in awe of the execution speed and quality of this team. Here's to the next 5k and beyond!
Phoenix has changed a TON since its first iteration.
I'm constantly in awe of the execution speed and quality of this team. Here's to the next 5k and beyond!
Aman combined our recent Agent Evaluation course with the latest prompt optimization techniques to automate the improvement process.
Aman combined our recent Agent Evaluation course with the latest prompt optimization techniques to automate the improvement process.
Forget manual prompt engineering - there are better (read: "more automatic") ways to improve your prompts.
This video and notebook break down these techniques.
Featuring:
- DSPy
- @arize-phoenix.bsky.social
Forget manual prompt engineering - there are better (read: "more automatic") ways to improve your prompts.
This video and notebook break down these techniques.
Featuring:
- DSPy
- @arize-phoenix.bsky.social
Too often, teams are stuck using disconnected tools—one for evaluation, another for monitoring, and yet another for debugging.
So, we built a unified approach.
arize.com/blog/why-ai-...
Too often, teams are stuck using disconnected tools—one for evaluation, another for monitoring, and yet another for debugging.
So, we built a unified approach.
arize.com/blog/why-ai-...
Our newest blog post on @hf.co has you covered!
This post shows you how to use @arize-phoenix.bsky.social to trace and evaluate your smolagents.
Credit to @srichavali.bsky.social and @aymeric-roucher.bsky.social
Our newest blog post on @hf.co has you covered!
This post shows you how to use @arize-phoenix.bsky.social to trace and evaluate your smolagents.
Credit to @srichavali.bsky.social and @aymeric-roucher.bsky.social