The other week at @devrelcon.bsky.social I sat down to chat with Joseph Petty from @appsmith.bsky.social about AI, why you need evaluations, and how @rungalileo.bsky.social can help you.
Oh, and 🌶️ Jim's spicy take on AI 👀
youtu.be/I2vRx5Ieak8?...
@erinmikail.bsky.social's new tutorial shows how to build and track tailored custom metrics using Galileo for reliable AI evaluation.
Read Erin's blog here: galileo.ai/blog/silly-s...
#AI #LLM #AIEvaluation #MLOps #DataQuality #Cohesity #GalileoAI #ChainOfThought #Podcast
- Timeline View: No more guessing where your agent gets stuck. See execution flow & bottlenecks quickly.
- Conversation View: Debug from the user's perspective, not just the system's.
Join @erinmikail.bsky.social at the #databricks #DataAISummit as she breaks down what it really takes to test LLMs in unexpected domains—like generating humor.
Come for the eval benchmarks. Stay for the chaos.
#GenAI #LLMevals #AIUX #LLMops
If you’re building with LLMs, testing agents, or just trying to trust what your models are doing in production, come find us at Booth #120.
"I recommend Galileo as the antivirus equivalent for your AI system - you need these checks & balances. A MacBook is secure by nature, having that additional layer catches things the core system might miss."
I’ll (@JimBobBennett) be there with the Galileo crew—booth, talks, party, and all. I’m giving a talk on “Taming Your AI Agents with Evaluations”, aka how to stop your AI from making up entire book reports (Chicago Sun-Times, we see you 👀).
@poolsideai co-founders @JasoncWarner and @EisoKant believe AI will soon collaborate with teams inside high-consequence environments such as banking, energy, and healthcare-grade software.
On the Chain of Thought podcast with @ConorBronsdon, @Amplitude_HQ's Chief Engineering Officer, @Wade Chambers, explains how systems like Ask Amplitude transform AI from a tool into a team of PhDs embedded in your product.
Wade Chambers, Chief Engineering Officer at Amplitude, explains on the Chain of Thought podcast with @conorbronsdon.bsky.social that we've passed the leap-of-faith phase. The proof:
✅ Better decision-making
✅ Faster research
✅ Real agentic solutions
It’s been amazing connecting with builders, developers, and curious minds. If you’re interested in agents, LLM apps, or just want to see how we’re helping to ship reliable AI apps, stop by for a Galileo demo.
#MSbuild
Delighted to be part of NVIDIA's AI Factory Validated Designs!
While everyone rushed to scale next-token prediction post-ChatGPT, Jason Warner and Eiso Kant (poolside's CEO and CTO) bet on a different path: reinforcement learning as the deeper scaling axis of intelligence itself.
Explore more: buff.ly/uYDp76M
#NVIDIACOMPUTEX #AgenticAI
Find us today at #MSbuild:
- 1:30p - 2:05p at Hub Theater A
- 2:30p - 6:30p Galileo's AI Reliability + Evals platform on Azure
- 3:30p - 5p at the NVIDIA Inception Partner Showcase
AI infra shouldn’t trade privacy for performance.
✅ Trust through transparency
✅ No overreach
We’re here for it.