Learn how to design your eval from scratch -- including what to measure, which model to use, how to prompt effectively, and how to improve your eval.
Learn how to design your eval from scratch -- including what to measure, which model to use, how to prompt effectively, and how to improve your eval.
Aparna has a session on building & evaluating self-improving agents with Arize, Databricks MLFlow, & Mosaic AI.
Info here: www.databricks.com/dataaisummit...
Aparna has a session on building & evaluating self-improving agents with Arize, Databricks MLFlow, & Mosaic AI.
Info here: www.databricks.com/dataaisummit...
@arize.bsky.social OSS Prompt Playground
@arize-phoenix.bsky.social gets Deepseek support! Now you can compare outputs of all the top tier reasoning models.
Which LLM provider would you like to see next? Let us know on GitHub!
github.com/Arize-ai/pho...
@arize.bsky.social OSS Prompt Playground
@arize-phoenix.bsky.social gets Deepseek support! Now you can compare outputs of all the top tier reasoning models.
Which LLM provider would you like to see next? Let us know on GitHub!
github.com/Arize-ai/pho...
Shack15 in SF on June 25. Builders, researchers, and leaders from @anthropic.com @microsoft.com @llamaindex.bsky.social (+ many more).
Get tickets: arize.com/observe-2025
Shack15 in SF on June 25. Builders, researchers, and leaders from @anthropic.com @microsoft.com @llamaindex.bsky.social (+ many more).
Get tickets: arize.com/observe-2025
Join us May 19. Space is limited!
Register: lu.ma/d6mo5zxs
Join us May 19. Space is limited!
Register: lu.ma/d6mo5zxs
arize.com/observe-2025
arize.com/observe-2025
Hear from the people building the next generation of AI systems—it's conference by engineers, for engineers.
Most of our speakers on the site. 👀
Register: arize.com/observe-2025/
Hear from the people building the next generation of AI systems—it's conference by engineers, for engineers.
Most of our speakers on the site. 👀
Register: arize.com/observe-2025/
Apply here: docs.google.com/forms/d/e/1F...
Apply here: docs.google.com/forms/d/e/1F...
We're hosting an in-person office hours tomorrow all around LLM and Agent Evals.
Join for the free snacks/drinks, stay for the heated discussions about the validity of Pokemon-based model evaluations ⚡️🐀
We're hosting an in-person office hours tomorrow all around LLM and Agent Evals.
Join for the free snacks/drinks, stay for the heated discussions about the validity of Pokemon-based model evaluations ⚡️🐀
Our newest blog post on @hf.co has you covered!
This post shows you how to use @arize-phoenix.bsky.social to trace and evaluate your smolagents.
Credit to @srichavali.bsky.social and @aymeric-roucher.bsky.social
Our newest blog post on @hf.co has you covered!
This post shows you how to use @arize-phoenix.bsky.social to trace and evaluate your smolagents.
Credit to @srichavali.bsky.social and @aymeric-roucher.bsky.social
Our latest: arize.com/blog/memory-...
Our latest: arize.com/blog/memory-...
Past speakers have included top builders and researchers driving AI innovation and tackling its most important challenges. Be a part of the conversation shaping AI’s future.
arize.com/observe-2025/
Past speakers have included top builders and researchers driving AI innovation and tackling its most important challenges. Be a part of the conversation shaping AI’s future.
arize.com/observe-2025/
arize.com/blog/quick-g...
arize.com/blog/quick-g...
In this example, we also use @llamaindex.bsky.social to simplify query engine creation for structured and unstructured data.
www.youtube.com/watch?v=1_73...
In this example, we also use @llamaindex.bsky.social to simplify query engine creation for structured and unstructured data.
www.youtube.com/watch?v=1_73...
arize.com/blog/how-to-...
arize.com/blog/how-to-...
For anyone working on chatbots, virtual assistants, or complex decision-making systems.
lu.ma/agent-tracing
For anyone working on chatbots, virtual assistants, or complex decision-making systems.
lu.ma/agent-tracing