Dave Davies
banner
onlineinference.com
Dave Davies
@onlineinference.com
I'm an SEO who gets to work with people who teach machines how to think.
♊ Gemini now supports structured outputs 🧩 - letting you define JSON Schemas for predictable, parsable results.
🤖 Huge for agents & data extraction.
Think: cleaner automation, safer outputs, and I'm about to have some SEO fun. 🎉
Full announcement 👇
ai.google.dev/gemini-api/d...
Structured Outputs  |  Gemini API  |  Google AI for Developers
Learn how to generate structured JSON output with the Gemini API.
ai.google.dev
November 10, 2025 at 5:03 PM
🛼 Rolling out in Search Console Insights: Query groups. Google now clusters similar search terms (like misspellings & variants) to show broader intent trends.
🔗 Link to the announcement from Google in the comments.
October 27, 2025 at 3:43 PM
CoreWeave is acquiring Monolith AI 🤝
A serious combo: Monolith brings ML for hardcore physics problems, CoreWeave brings the AI infra. Expect faster 🚀, smarter 🧠 industrial R&D.
Something we desperately need to carry us into the future.
www.coreweave.com/news/corewea...
CoreWeave to acquire Monolith AI, combining AI cloud and ML engineering tools to accelerate innovation in manufacturing, automotive, and aerospace.
www.coreweave.com
October 7, 2025 at 6:58 PM
I am SUPER stoked!
Chrome is finally getting agentic. 🤖
Google's latest update adds Gemini for AI summaries, multi-tab context, and task automation (yes, it’ll actually DO stuff for you).
The browser is officially becoming an assistant. 💁
blog.google/products/chr...
Go behind the browser with Chrome’s new AI features
Google Chrome is getting upgraded with the latest AI to make it safer, smarter and more useful
blog.google
September 22, 2025 at 8:07 PM
91% of SEOs say clients/management now ask about AI search visibility - but only 35% have a strategy.
It's early days, but leadership is watching. 👀
Survey via @aleyda.bsky.social 👇
hub.seofomo.co/surveys/stat...
The State of AI Search Optimization - 2025 Edition
Learn about the state of AI search optimization with the results of the SEOFOMO AI Search Optimization Survey, taken by +200 Senior SEO specialists.
hub.seofomo.co
September 15, 2025 at 1:04 PM
Today, I'm thinking about how to create advanced prompts to monitor AI Mode results that set the stage for what the final question in a back-and-forth might be, where context matters more than the final query.
September 10, 2025 at 4:49 PM
I hate it when someone is essentially right, but you still want them to lose.
wandb.ai/byyoung3/ml-...
X and xAI Sue Apple and OpenAI Over Alleged AI Monopoly
Publish your model insights with interactive plots for performance metrics, predictions, and hyperparameters. Made by Brett Young using Weights & Biases
wandb.ai
August 26, 2025 at 4:41 PM
Google wants you in AI Mode faster ⏩: an “Ask Anything” box is showing in AI Overviews, sending users straight to AI results. 👇
www.seroundtable.com/google-ai-ov...
Google AI Overview Ask Anything Box Leads To AI Mode
Google is now testing adding an "Ask Anything" box within the AI Overviews, and when you type in that box and click search, it takes you into the Google AI Mode results.
www.seroundtable.com
August 25, 2025 at 4:54 PM
LLMs, lost attribution & agentic AI.
🎙️ I talked w/ Search Engine Land ahead of my SMX Advanced keynote on how SEO is evolving & the weirdest SEO gotcha I’ve hit yet. 👇
searchengineland.com/dave-davies-...
Dave Davies on LLM content SEO shortcuts, attribution loss, and agentic AI
SMX Advanced keynote speaker Dave Davies on agentic AI, LLM pitfalls, weird tech gotchas, and why generative engines are the future.
searchengineland.com
June 3, 2025 at 9:46 PM
AI agents that read, summarize, and document a GitHub repo — end-to-end automation using CrewAI + @weightsbiases.bsky.social Weave for observability 👀.

A great demo and tutorial on multi-agent orchestration 👇
wandb.ai/byyoung3/cre...
Building a Github repo summarizer with CrewAI
A hands-on guide to building a fully automated GitHub documentation system using CrewAI for multi-agent coordination and Weave for real-time debugging and observability.
wandb.ai
May 14, 2025 at 2:33 PM
ChatGPT just got way more useful for devs:
🔎Deep Research now connects to GitHub. You can query repos 📚, analyze APIs, and break down code structures.
o4-mini fine-tuning 🎛️ & GPT-4.1 nano access expanded too - w/ verification required.
Details 👇
wandb.ai/byyoung3/ml-...
ChatGPT Deep Research adds GitHub Connector for Code Analysis and o4-Mini fine-tuning
Publish your model insights with interactive plots for performance metrics, predictions, and hyperparameters. Made by Brett Young using Weights & Biases
wandb.ai
May 9, 2025 at 3:27 PM
May 4, 2025 at 3:38 PM
Qwen3 just crushed AIME 2024 with 66.7% - 3x DeepSeek R1. 🤯

Fine-tune it with Unsloth, evaluate with @weightsbiases.bsky.social Weave, and toggle its reasoning mode.

Wildly flexible open-source LLM. Read more 👇
wandb.ai/byyoung3/Gen...
How to fine-tune and evaluate Qwen3 with Unsloth
This article provides a comprehensive guide to fine-tuning, evaluating, and deploying the Qwen3 language model, emphasizing its flexibility, performance, and unique reasoning-toggle feature. .
wandb.ai
May 2, 2025 at 7:19 PM
New papers from @SFResearch & @Tsinghua_Uni suggest RL in LLMs may be overrated.
📊Simple filtering > complex RL
🍪RLVR ≠ new reasoning
These finding are covered on the @weights_biases blog, and may be a game-changer for post-training strategies. 👇
wandb.ai/byyoung3/ml-...
New studies uncover interesting findings for reasoning models
Discover how two recent studies challenge conventional reinforcement learning in LLM reasoning - revealing that simple data filtering can rival complex methods and that RLVR may only optimize known ab...
wandb.ai
April 24, 2025 at 2:37 PM
Most agents today can’t talk to each other. A2A changes that.

Agent2Agent is an open protocol for secure, cross-platform agent collaboration.

Think: LangGraph x CrewAI in one workflow. No glue code.

Full write-up + tutorial over on @weightsbiases.bsky.social blog 👇
wandb.ai/byyoung3/Gen...
How the Agent2Agent (A2A) protocol enables seamless AI agent collaboration
The Agent2Agent (A2A) protocol is an open standard that enables autonomous AI agents to securely discover, communicate, and collaborate across platforms. Learn how it works, its core components, and h...
wandb.ai
April 22, 2025 at 11:23 PM
👏Big congrats to @cohere.com on Embed 4 — a new multimodal embedding engine for enterprise AI.

It handles 📚 128k token docs, 🌏 100+ languages, and real-world mess like legal PDFs & product decks.

Built for 🤖 RAG, agents, and cross-lingual search.
wandb.ai/byyoung3/ml-...
Cohere Releases Embed 4: Multimodal Embedding Engine for Enterprise AI
Publish your model insights with interactive plots for performance metrics, predictions, and hyperparameters. Made by Brett Young using Weights & Biases
wandb.ai
April 16, 2025 at 2:15 PM
SEO in the agentic era isn’t theory - it’s here.
Over on @searchengineland.bsky.social I explore 🔍 the impact agents are having on how (and where) we optimize, and what that means for your strategy.
I even outline an agentic system I'm working on.
Enjoy 👇
searchengineland.com/ai-agents-di...
How AI agents are revolutionizing digital marketing
Explore how AI agents autonomously solve problems, enhance personalization and enable next-gen marketing strategies using agentic frameworks.
searchengineland.com
April 15, 2025 at 2:11 PM
I just published a GPT-4.1 quickstart using the OpenAI API over on the @weights_biases blog, including a Colab to get going fast.

It includes full W&B Weave integration so you can track everything out of the gate. 👀
wandb.ai/onlineinfere...
GPT-4.1 Python quickstart using the OpenAI API
Getting set up and running GPT-4.1 on your machine in Python using the OpenAI API. Made by Dave Davies using Weights & Biases
wandb.ai
April 14, 2025 at 5:56 PM
How long until we start talking about ACO (Agent Card Optimization)? 🤔
google.github.io/A2A/#/docume...
Agent2Agent Protocol
An open protocol enabling communication and interoperability between opaque agentic applications.
google.github.io
April 11, 2025 at 4:25 PM
🛒Retailers are using AI agents 🤖 to do a lot more than recommend products.
This post from @weightsbiases.bsky.social breaks down a smart LLM-powered system for triaging customer emails and building real-time recs via vector search. 👇
wandb.ai/byyoung3/Gen...
AI agents in retail and e-commerce
This article explores how AI agents are transforming retail by automating customer interactions, optimizing decision-making, and enhancing product recommendations using LLM-driven vector search.
wandb.ai
April 9, 2025 at 4:37 PM
LLMs don’t need to think + talk in the same pass. Retrieval Augmented Thinking (RAT) 🐀 splits reasoning from response - boosting transparency + control. DeepSeek, Claude, GPT-4o all in the mix.
Code + Weave traces👇
wandb.ai/byyoung3/Gen...
What is Retrieval Augmented Thinking (RAT) and how does it work?
Retrieval Augmented Thinking (RAT) separates AI reasoning from response generation, improving efficiency, interpretability, and customization by using one model for structured thought and another for ...
wandb.ai
April 8, 2025 at 9:58 PM
Amazon Nova Reel 1.1 🎥 now supports 2-minute, multi-shot videos—with automated and manual modes. Great flexibility, now on AWS Bedrock.
wandb.ai/byyoung3/ml-...
Amazon Nova Reel 1.1 Expands Video Generation Capabilities
Publish your model insights with interactive plots for performance metrics, predictions, and hyperparameters. Made by Brett Young using Weights & Biases
wandb.ai
April 8, 2025 at 4:34 PM
Llama 4 🦙 isn’t just open—it’s competitive.

@weightsbiases.bsky.social tested it head-to-head with GPT-4o on ChartQA using Weave Evaluations. Maverick scored higher on multimodal accuracy and costs a fraction to run.

Here's how they did it + the code 👇
wandb.ai/byyoung3/Gen...
Running inference and evaluating Llama 4 in Python
Deploy Llama 4 locally or via API with Python scripts. We test multimodal performance against GPT-4o on ChartQA and show how to debug and compare results using Weave.
wandb.ai
April 7, 2025 at 7:02 PM
🚶‍♂️ Gemini 2.5 Pro Experimental is a big step up.
🧠 Multimodal input, 1M-token context, native code execution, and better math/code reasoning.
@weightsbiases.bsky.social evaluated it against Flash 2.0 on AIME problems using Weave; in a tutorial you can do yourself.
Results? 👇
wandb.ai/byyoung3/Gen...
Evaluating the new Gemini 2.5 Pro Experimental model
Gemini 2.5 Pro Experimental is Google's most advanced AI model to date, featuring multimodal input support, a massive 1 million-token context window, and the ability to solve complex problems. .
wandb.ai
March 28, 2025 at 10:08 PM