Fine-tuning and LLMOps
My views
https://www.linkedin.com/in/erwinhuizenga/
🔹 Improved benchmark performance
🔹Enhanced front-end capabilities
🔹Supports JSON output & function calling
We've made sure its available from day one on Vertex AI for deployment. with Nvidia's H200
🔹 Improved benchmark performance
🔹Enhanced front-end capabilities
🔹Supports JSON output & function calling
We've made sure its available from day one on Vertex AI for deployment. with Nvidia's H200
Being part of this program will give startups early access to Google's latest AI models as well as resources, technical expertise and equity funding to accelerate their progress.
Apply here:
labs.google/aifuturesfund
Being part of this program will give startups early access to Google's latest AI models as well as resources, technical expertise and equity funding to accelerate their progress.
Apply here:
labs.google/aifuturesfund
Today, we're announcing:
1️⃣An A2A Python SDK AND an updated A2A protocol spec (v0.2).
2️⃣A new Java SDK for Agent Development Kit (ADK) AND the v1.0.0 stable release of our Python ADK.
Read more in our launch blog 👇
developers.googleblog.com/en/agents-ad...
Today, we're announcing:
1️⃣An A2A Python SDK AND an updated A2A protocol spec (v0.2).
2️⃣A new Java SDK for Agent Development Kit (ADK) AND the v1.0.0 stable release of our Python ADK.
Read more in our launch blog 👇
developers.googleblog.com/en/agents-ad...
I'm excited about our announcements. Watch the livestream to stay ahead of the curve.
Link below 👇
I'm excited about our announcements. Watch the livestream to stay ahead of the curve.
Link below 👇
What I find impressive is that a single system can contribute to numerous open problems in maths and computer science. Which proves that Agents can discover novel and useful algorithms
What I find impressive is that a single system can contribute to numerous open problems in maths and computer science. Which proves that Agents can discover novel and useful algorithms
What it offers:
🛠️ Build with the Agent Development Kit (ADK)
🏆 $50K cash prize pool
🧑🏫 Insights from Google experts
Sign up link below 👇
What it offers:
🛠️ Build with the Agent Development Kit (ADK)
🏆 $50K cash prize pool
🧑🏫 Insights from Google experts
Sign up link below 👇
Ivan Nardini and I documented how you can fine-tune and deploy Gemma 3 on Vertex AI.
Check it out and let us know your thoughts! 👇
www.youtube.com/watch?v=pC2D...
Ivan Nardini and I documented how you can fine-tune and deploy Gemma 3 on Vertex AI.
Check it out and let us know your thoughts! 👇
www.youtube.com/watch?v=pC2D...
The post dives into common patterns developers can adopt. Great for anyone building with agents.
medium.com/google-cloud...
The post dives into common patterns developers can adopt. Great for anyone building with agents.
medium.com/google-cloud...
Sir David Attenborough, thank you for a lifetime dedicated to the natural world, and for sharing its story with wisdom, wonder, and grace.
You've inspired generations to fall in love with nature.
Sir David Attenborough, thank you for a lifetime dedicated to the natural world, and for sharing its story with wisdom, wonder, and grace.
You've inspired generations to fall in love with nature.
Offering better visual quality and more accurate text rendering.
Notebook below to get you started 👇
Offering better visual quality and more accurate text rendering.
Notebook below to get you started 👇
It' got better at coding, with significant gains in front-end web development, editing, and transformation. We also fixed a bunch of function calling issue.
You can access it as gemini-2.5-pro-preview-05-06 on VertexAI and AIStudio
More details in 🧵
It' got better at coding, with significant gains in front-end web development, editing, and transformation. We also fixed a bunch of function calling issue.
You can access it as gemini-2.5-pro-preview-05-06 on VertexAI and AIStudio
More details in 🧵
> Agentic RAG
> Agent Evaluation
> Multi-Agent Evaluation
> Production Architectures
If you're comfortable with agent basics and looking to understand the next layer this is a great read 👇
> Agentic RAG
> Agent Evaluation
> Multi-Agent Evaluation
> Production Architectures
If you're comfortable with agent basics and looking to understand the next layer this is a great read 👇
Query w/ natural language using Gemini & Vertex AI RAG Engine, and chat in Vertex AI Studio.
🎬 Video: youtube.com/watch?v=sgKB...
📖 Article:
medium.com/google-cloud...
💾 Notebook:
github.com/kweinmeister...
@weightsbiases.bsky.social on the Agent2Agent (A2A) protocol.
The blog talks talks about how it works, its core components and how to use it with CrewAI and @langchain.bsky.social.
Blog below 👇
@weightsbiases.bsky.social on the Agent2Agent (A2A) protocol.
The blog talks talks about how it works, its core components and how to use it with CrewAI and @langchain.bsky.social.
Blog below 👇
💻 We've just added a demo showcasing agents using the A2A protocol to interact.
📄 We've refreshed the docs to lower the barrier for you to get started.
Keep sharing your A2A feedback!
💻 We've just added a demo showcasing agents using the A2A protocol to interact.
📄 We've refreshed the docs to lower the barrier for you to get started.
Keep sharing your A2A feedback!
- Up to 3.6x speedup vs HF+FA2
- ~50% memory reduction
- Now includes Sequence Parallelism, Cut-Cross Entropy & Liger kernels
Plus you can fine-tune Gemma-3 27B with just 21.6GB VRAM!
- Up to 3.6x speedup vs HF+FA2
- ~50% memory reduction
- Now includes Sequence Parallelism, Cut-Cross Entropy & Liger kernels
Plus you can fine-tune Gemma-3 27B with just 21.6GB VRAM!
↔️Offers a wide range of sizes, from 0.6B to 235B.
🧠 Includes MoE variants.
🎛️ Hybrid Thinking Control with `enable_thinking=True`.
🌍 Multilingual Support: with support for 119 languages and dialects.
↔️Offers a wide range of sizes, from 0.6B to 235B.
🧠 Includes MoE variants.
🎛️ Hybrid Thinking Control with `enable_thinking=True`.
🌍 Multilingual Support: with support for 119 languages and dialects.
@FeinbergVlad
talk on Gemini pretraining.
This talk covers scaling laws and how scaling approaches might need to be modified in the face of inference constraints.
Link below 👇
@FeinbergVlad
talk on Gemini pretraining.
This talk covers scaling laws and how scaling approaches might need to be modified in the face of inference constraints.
Link below 👇
⚡️High quality for best cost
🧩 Ability to dynamically think on complex tasks
🎛️ Configurable thinking budget (decide how many tokens to allocate to "thinking".
⚡️High quality for best cost
🧩 Ability to dynamically think on complex tasks
🎛️ Configurable thinking budget (decide how many tokens to allocate to "thinking".
Details & steps on the blog ⤵️
Details & steps on the blog ⤵️
On to the next.
On to the next.
Watch here ⤵️
www.youtube.com/watch?v=zgrO...
Watch here ⤵️
www.youtube.com/watch?v=zgrO...
Check out the quickstart below ⬇️
Check out the quickstart below ⬇️
Hereby an updated guide that helps you get started with Gemma 3 and
@ollama
on Colab for a fast start. Based on one of our most popular 2024 posts. ⤵️
Hereby an updated guide that helps you get started with Gemma 3 and
@ollama
on Colab for a fast start. Based on one of our most popular 2024 posts. ⤵️