Lightnews — Scholar-powered news

John Gilhuly

@johngilhuly.bsky.social

🧪 📊 The @arize-phoenix.bsky.social TS/JS client now supports Experiments and Datasets!

You can now create datasets, run experiments, and attach evaluations to experiments using the Phoenix TS/JS client.

Shoutout to @anthonypowell.me and @mikeldking.bsky.social for the work here!

May 21, 2025 at 2:26 PM

John Gilhuly

@johngilhuly.bsky.social

Tired of tweaking prompts yourself? Let the machines do it for you! 🤖

This guide is a great primer on common approaches we see towards automated prompt optimization. If you've already read 100 "prompting tips and tricks" blogs but aren't yet a full DSPy contributor, then let this be your bridge!

May 12, 2025 at 6:20 PM

John Gilhuly

@johngilhuly.bsky.social

@pydantic.dev evals 🤝 @arize-phoenix.bsky.social tracing and UI

I’ve been really liking some of the eval tools from Pydantic's evals package.

Wanted to see if I could combine these with Phoenix’s tracing so I could run Pydantic evals on traces captured in Phoenix

May 2, 2025 at 6:02 PM

John Gilhuly

@johngilhuly.bsky.social

Had a fantastic time talking about Self-Improving AI Agents with @arize-phoenix.bsky.social at the AI Camp NYC meetup this past week!

It's amazing to see how fast the discourse has moved from "just agents" to now multi-agent flows, optimized evals, and automated improvement strategies.

April 19, 2025 at 4:48 PM

John Gilhuly

@johngilhuly.bsky.social

We've added new LLM decorators to @arize-phoenix.bsky.social 's OpenInference library 🎁

Tag a function with `@ tracer.llm` to automatically capture it as an @opentelemetry.io span.
- Automatically parses input and output messages
- Comes in decorator or context manager flavors

April 18, 2025 at 2:21 AM

John Gilhuly

@johngilhuly.bsky.social

In case you missed it, Arize AI Phoenix crossed the 5k GitHub star mark last week! ⭐️

Phoenix has changed a TON since its first iteration.

I'm constantly in awe of the execution speed and quality of this team. Here's to the next 5k and beyond!

March 20, 2025 at 4:07 PM

John Gilhuly

@johngilhuly.bsky.social

Shoutout to Aman Khan and the rest of the @arize.bsky.social team for delivering a top-notch talk at @deeplearningai.bsky.social's inaugural AI Dev 25 conference! 📣

Aman combined our recent Agent Evaluation course with the latest prompt optimization techniques to automate the improvement process.

March 15, 2025 at 3:48 AM

John Gilhuly

@johngilhuly.bsky.social

How can you programmatically improve your prompts? 🤔 🤖

Forget manual prompt engineering - there are better (read: "more automatic") ways to improve your prompts.

This video and notebook break down these techniques.

Featuring:
- DSPy
- @arize-phoenix.bsky.social

March 3, 2025 at 5:01 PM

John Gilhuly

@johngilhuly.bsky.social

Separate AI tools for dev and prod aren't just inefficient—they're actively sabotaging your model performance.

Too often, teams are stuck using disconnected tools—one for evaluation, another for monitoring, and yet another for debugging.

So, we built a unified approach.

arize.com/blog/why-ai-...

March 1, 2025 at 4:16 PM

John Gilhuly

@johngilhuly.bsky.social

🤖 Building agents, but not sure how to measure their performance?

Our newest blog post on @hf.co has you covered!

This post shows you how to use @arize-phoenix.bsky.social to trace and evaluate your smolagents.

Credit to @srichavali.bsky.social and @aymeric-roucher.bsky.social

February 28, 2025 at 5:19 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news