@arize.bsky.social OSS Prompt Playground
@arize-phoenix.bsky.social gets Deepseek support! Now you can compare outputs of all the top tier reasoning models.
Which LLM provider would you like to see next? Let us know on GitHub!
github.com/Arize-ai/pho...
@arize.bsky.social OSS Prompt Playground
@arize-phoenix.bsky.social gets Deepseek support! Now you can compare outputs of all the top tier reasoning models.
Which LLM provider would you like to see next? Let us know on GitHub!
github.com/Arize-ai/pho...
Announcing OpenInference instrumentation for Agno, Mastra, Bedrock Agents, and AutoGen AgentChat!
At @arize.bsky.social we believe observability deserves to be built in the open
s/o @anthonypowell.me and many others
github.com/Arize-ai/ope...
Announcing OpenInference instrumentation for Agno, Mastra, Bedrock Agents, and AutoGen AgentChat!
At @arize.bsky.social we believe observability deserves to be built in the open
s/o @anthonypowell.me and many others
github.com/Arize-ai/ope...
You can now create datasets, run experiments, and attach evaluations to experiments using the Phoenix TS/JS client.
Shoutout to @anthonypowell.me and @mikeldking.bsky.social for the work here!
You can now create datasets, run experiments, and attach evaluations to experiments using the Phoenix TS/JS client.
Shoutout to @anthonypowell.me and @mikeldking.bsky.social for the work here!
@arize-phoenix.bsky.social javascript client gets experiments 🧪
s/o @anthonypowell.me !
- native tracing of ai tasks and evaluators,
- async concurrency queues
- support for any evaluator (e.g. bring your own evals) and more!
@arize-phoenix.bsky.social javascript client gets experiments 🧪
s/o @anthonypowell.me !
- native tracing of ai tasks and evaluators,
- async concurrency queues
- support for any evaluator (e.g. bring your own evals) and more!
A true testament that AI observability should be built in the open 👏
@arize-phoenix.bsky.social
pypi.org/project/open...
A true testament that AI observability should be built in the open 👏
@arize-phoenix.bsky.social
pypi.org/project/open...
Part of the "Look at the Data" initiative, create custom rubrics and forms to annotate your spans.
s/o to @anthonypowell.me here who built out all the rich UI features.
Part of the "Look at the Data" initiative, create custom rubrics and forms to annotate your spans.
s/o to @anthonypowell.me here who built out all the rich UI features.
Project Retention Policies
Customize the data retention of your projects by number of days or by trace count. No more cron jobs or manual deleting of traces needed!
A much requested ask from our on-prem users and phoenix-cloud users alike.
Project Retention Policies
Customize the data retention of your projects by number of days or by trace count. No more cron jobs or manual deleting of traces needed!
A much requested ask from our on-prem users and phoenix-cloud users alike.
arize.com/observe-2025
arize.com/observe-2025
✔️ Trace agent decisions at every step
✔️ Offline and Online Evals using LLM as a Judge
If you're building agents, measuring them is essential.
Full vid and cookbook below
✔️ Trace agent decisions at every step
✔️ Offline and Online Evals using LLM as a Judge
If you're building agents, measuring them is essential.
Full vid and cookbook below
Apply here: docs.google.com/forms/d/e/1F...
Apply here: docs.google.com/forms/d/e/1F...
Phoenix has changed a TON since its first iteration.
I'm constantly in awe of the execution speed and quality of this team. Here's to the next 5k and beyond!
Phoenix has changed a TON since its first iteration.
I'm constantly in awe of the execution speed and quality of this team. Here's to the next 5k and beyond!
We're celebrating Phoenix reaching 5000 stars on GitHub! This milestone underscores the growing demand for robust, open-source tools that tackle the complexities of AI and LLM development
Check it out: github.com/Arize-ai/pho...
www.youtube.com/watch?v=bW5Z...
We're hosting an in-person office hours tomorrow all around LLM and Agent Evals.
Join for the free snacks/drinks, stay for the heated discussions about the validity of Pokemon-based model evaluations ⚡️🐀
We're hosting an in-person office hours tomorrow all around LLM and Agent Evals.
Join for the free snacks/drinks, stay for the heated discussions about the validity of Pokemon-based model evaluations ⚡️🐀
In my latest tutorial, I explore how few-shot prompting boosts accuracy without massive datasets or retraining—using @arize-phoenix.bsky.social prompts and experiments to break it down.
This kicks off my prompting series... more to come!
In my latest tutorial, I explore how few-shot prompting boosts accuracy without massive datasets or retraining—using @arize-phoenix.bsky.social prompts and experiments to break it down.
This kicks off my prompting series... more to come!
openinference-instrumentation-openai-agents, an OpenTelememetry instrumentor that is compatible with any OTel backend like @arize-phoenix.bsky.social. Fully OSS and free to use!
openinference-instrumentation-openai-agents, an OpenTelememetry instrumentor that is compatible with any OTel backend like @arize-phoenix.bsky.social. Fully OSS and free to use!
Forget manual prompt engineering - there are better (read: "more automatic") ways to improve your prompts.
This video and notebook break down these techniques.
Featuring:
- DSPy
- @arize-phoenix.bsky.social
Forget manual prompt engineering - there are better (read: "more automatic") ways to improve your prompts.
This video and notebook break down these techniques.
Featuring:
- DSPy
- @arize-phoenix.bsky.social
With Phoenix 8.0, we built a prompt management system that prioritizes: LLM reproducibility, prompt versioning & tracking, & developer flexibility—no vendor lock-in
arize.com/blog/prompt-...
With Phoenix 8.0, we built a prompt management system that prioritizes: LLM reproducibility, prompt versioning & tracking, & developer flexibility—no vendor lock-in
arize.com/blog/prompt-...
🔗 TypeScript Client: Sync prompts with your JavaScript runtime
🐍 Python Client: Sync templates & apply them directly to AI SDKs
🔄 Native prompt normalization & much more!
youtu.be/qbeohWaRlsM
🔗 TypeScript Client: Sync prompts with your JavaScript runtime
🐍 Python Client: Sync templates & apply them directly to AI SDKs
🔄 Native prompt normalization & much more!
youtu.be/qbeohWaRlsM
Our newest blog post on @hf.co has you covered!
This post shows you how to use @arize-phoenix.bsky.social to trace and evaluate your smolagents.
Credit to @srichavali.bsky.social and @aymeric-roucher.bsky.social
Our newest blog post on @hf.co has you covered!
This post shows you how to use @arize-phoenix.bsky.social to trace and evaluate your smolagents.
Credit to @srichavali.bsky.social and @aymeric-roucher.bsky.social