Lightnews — Scholar-powered news

Ivan Nardini

@ivnardini.bsky.social

Many have asked about agent memory and its differences from RAG.

We just released a whitepaper, "Context Engineering: Sessions and Memory," detailing how memory evolves from raw conversations to curated agent knowledge.

You can find the paper here: lnkd.in/euud4BUB

November 12, 2025 at 10:11 AM

Ivan Nardini

@ivnardini.bsky.social

TIME to upgrade: ADK introduces Visual Agent Builder 🚀

Last week, in the latest ADK release (v1.18.0), the team introduced a low-code Visual Agent Builder, along with new observability and testing features.

Release notes and a blog from Thomas Chong about the visual builder in the 🧵

November 10, 2025 at 7:00 PM

Ivan Nardini

@ivnardini.bsky.social

Vertex AI Agent Engine adds Memory Revisions!

Trusting an agent's memory is tricky. Is chat info verified? Without history, it's a guess. Memory Revisions (preview) helps with version control through snapshots for each change.

Code & doc in 🧵

November 7, 2025 at 4:00 PM

Ivan Nardini

@ivnardini.bsky.social

🚢 The New Vertex AI Agent Builder is OUT!

Vertex AI launched major updates to Vertex AI Agent Builder that make it easier to deploy and scale agents in production.

I'm working with the Agent Engine team to release content on our new features, starting today with Memory Bank. Stay tuned!

November 6, 2025 at 5:51 PM

Ivan Nardini

@ivnardini.bsky.social

Spent some time last week benchmarking LLMs on Vertex AI. I couldn't find a tutorial on using the @vllm_project bench library with Vertex, so I made one.

It's a walkthrough that compares Llama 4 (baseline vs. EAGLE) on 8x H100s and includes the code patch needed to make vLLM work.

Hope it helps! 👇

November 3, 2025 at 4:30 PM

Ivan Nardini

@ivnardini.bsky.social

🔥 Benchmarking a new optimization integrated by the Model Garden team for serving LLMs on Vertex AI.

Can't wait to share!

#VertexAI #LLMs #Benchmarking #Optimization #ModelGarden #LLMServing

October 30, 2025 at 5:41 AM

Ivan Nardini

@ivnardini.bsky.social

Another week, another episode of the Agent Factory podcast!

This time, Amit and I welcomed a special guest: Ravin Kumar from Google DeepMind. He shared insights on building open models with agentic capabilities.

Stay tuned! The episode will soon be on the Google Cloud Tech YouTube channel👇

October 28, 2025 at 9:00 PM

Ivan Nardini

@ivnardini.bsky.social

Since the beginning of the year, I've wanted to dedicate time to some content about LLM inference that would be accessible to everyone. Today I'm excited to launch the 1st learning path on LLM inference, built with NVIDIA!

Check out the course and blog in the 🧵 And stay tuned...more is coming!

October 27, 2025 at 3:33 PM

Ivan Nardini

@ivnardini.bsky.social

ADK just released v1.17.0!

The team rolled out features focused on secure code execution, stateful debugging, and better DevX.

Check out the full release notes in 🧵. And keep an eye on the repo for the next ADK community call!

October 22, 2025 at 9:35 PM

Ivan Nardini

@ivnardini.bsky.social

In the upcoming webinar, together with Alex Notov from @AnthropicAI we're building a complete multi-agent system, exploring the key protocols (MCP & A2A) and how to scale agents using Claude on Vertex AI Agent Engine.

RSVP and tutorial in the 🧵

October 21, 2025 at 4:30 PM

Ivan Nardini

@ivnardini.bsky.social

Vertex AI Agent Engine is in the Cloud Foundation Fabric!

We launched a Terraform module for agent deployment on Vertex AI, which initially required a local Python script and GCS uploads.

The new blueprint automates agent serialization and packaging during the Terraform apply cycle.

Link in the 🧵

October 20, 2025 at 4:19 PM

Ivan Nardini

@ivnardini.bsky.social

This is why I'm excited about Google Cloud's agent builder stack!

Building multi-agent systems using various models and frameworks can be challenging. So today I spent some time on what you can build and deploy with ADK, MCP, A2A, Agent Engine, and Vertex AI.

Full tutorial dropping soon!

October 19, 2025 at 6:57 PM

Ivan Nardini

@ivnardini.bsky.social

🚀 vLLM on TPU just got a massive upgrade!

Google and vLLM announced a new backend that uses tpu-inference for efficient PyTorch and JAX models on TPUs.

Check out the full blog and try it on Vertex AI with the new vLLM TPU container!

October 16, 2025 at 9:00 PM

Ivan Nardini

@ivnardini.bsky.social

This morning, I checked out Vertex AI docs and was impressed by the open-source models as APIs.

Model as a Service (MaaS) gives access to large open models via a managed, serverless API, removing the need for your infrastructure.

Check the new documentation in 🧵 to learn more

October 16, 2025 at 5:30 PM

Ivan Nardini

@ivnardini.bsky.social

Can you use Agent Engine services on GKE or Cloud Run? Yes! You can combine managed services like memory bank with your preferred runtime.

Check out the two new tutorials in 🧵 on building AI agents using the Agent Development Kit (ADK) + Vertex AI Agent Engine for Sessions & Memory.

October 16, 2025 at 3:00 PM

Ivan Nardini

@ivnardini.bsky.social

🚀 Deploying agents on Vertex AI Agent Engine with Terraform!

Vertex AI launched a Terraform resource to deploy agents using custom classes or agentic frameworks like ADK.

Check out the notebook and blog post for the full code and a step-by-step guide in the 🧵

October 15, 2025 at 3:00 PM

Ivan Nardini

@ivnardini.bsky.social

🗓️ SAVE THE DATE: ADK Community Call #1 is coming!

Join us next week for the 1st ADK community call! In this 1-hour session, we'll share the technical roadmap, address technical questions and discuss contributions.

🗓️ Date: October 15, 2025
⏰ Time: 9:30 AM - 10:30 AM PST
🔗 Virtual (links in 🧵)

October 10, 2025 at 5:21 PM

Ivan Nardini

@ivnardini.bsky.social

🚀 Deploying open models with Terraform on Vertex AI!

Vertex AI Model Garden just launched the google_vertex_ai_endpoint_with_model_garden_deployment Terraform resource to manage your open model deployment (Hugging Face or Model Garden) with one unique main.tf.

Docs and code in the 🧵

October 8, 2025 at 3:30 PM

Ivan Nardini

@ivnardini.bsky.social

🚀 Managed PEFT Support for Gemma 3 & Qwen 3 is NOW available on Vertex AI!

To adapt Gemma 3 27B or Qwen 3 32B without managing infrastructure, you can use Vertex AI's expanded OSS managed tuning service, now supporting PEFT for these models.

Docs in the 🧵

October 6, 2025 at 10:00 PM

Ivan Nardini

@ivnardini.bsky.social

Building an MCP (Model Context Protocol) server for Vertex AI's Memory Bank

Inspired by a great request from a Vertex AI user, I've spent some time building an unofficial MCP server for Vertex AI Memory Bank to create context-aware and personalized applications.

Link in the 🧵

October 6, 2025 at 2:58 PM

Ivan Nardini

@ivnardini.bsky.social

NEW TUTORIAL! Build AI agents with multimodal memory

I've mentioned why agents need memory, but extracting facts from images, videos, and audio requires Gemini-powered memory.

Check the new tutorial in 🧵 on building an agent with Vertex AI Memory Bank for personalized multimodal interactions!

September 29, 2025 at 6:59 PM

Ivan Nardini

@ivnardini.bsky.social

ADK v1.15.0 JUST DROPPED with context caching, a new observability feature, and Google Maps integration!

Building AI agents with ADK has been significantly upgraded to reduce latency and costs while grounding with Google Maps data and more.

Release notes:
> github.com/google/adk-...

September 26, 2025 at 6:17 AM

Ivan Nardini

@ivnardini.bsky.social

✅ SUPPORTED: One-Line ADK CLI Deployment to Vertex AI Agent Engine!

Many of you asked, and we've simplified the process: transitioning from an ADK agent to a scalable agent on Vertex AI is now just a single CLI command.

Check out the new docs to try!
👉 Docs: google.github.io/adk-docs/de...

September 18, 2025 at 4:14 PM

Ivan Nardini

@ivnardini.bsky.social

🔥 Scale real-time AI Agents with Vertex AI Agent Engine & Live API!

Stop deploying turn-based agents! Vertex AI Agent Engine just integrated the new Live API, enabling fully-managed, serverless, bidirectional streaming agents (text & audio)!

Code & blog in 🧵

September 18, 2025 at 1:00 PM

Ivan Nardini

@ivnardini.bsky.social

Anthropic's tech report on recent Claude issues

Scaling AI is tough. Three overlapping infra bugs recently hit Claude's quality. Anthropic just dropped a super detailed technical postmortem on what broke.

Check out the full breakdown: www.anthropic.com/engineering...

September 18, 2025 at 5:47 AM
