Heiko Hotz
heikohotz.bsky.social
AI Engineer @ Google 👨‍💻 — Educator 👨‍🏫 — Traveller ✈️ — Hobby photographer 📷 — Foodie 🌮 — Film fan 🍿 — Boardgamer 🎲 — Londoner💂‍♂️

Medium: https://heiko-hotz.medium.com/
GitHub: https://github.com/heiko-hotz
LinkedIn: https://www.linkedin.com/in/heikohotz/
This year, the advanced Gemini model operated end-to-end in natural language, producing rigorous mathematical proofs directly from the official problem descriptions – all within the 4.5-hour competition time limit.
July 21, 2025 at 10:23 PM
At IMO 2024, AlphaGeometry and AlphaProof required experts to first translate problems from natural language into domain-specific languages, such as Lean, and vice versa for the proofs.
July 21, 2025 at 10:23 PM
This achievement is a significant advance over last year’s result.
July 21, 2025 at 10:22 PM
An advanced version of Gemini solved 5 out of 6 problems at IMO 2025.
July 21, 2025 at 10:22 PM
The result? It saves tons of time and money, and it builds super reliable voice assistants that have undergone a rigorous evaluation process. No more guesswork! 📈
Full details + code here: towardsdatascience.com/let-ai-tune-...
Let AI Tune Your Voice Assistant | Towards Data Science
A practical guide to automating prompt engineering for voice assistants
July 15, 2025 at 7:06 AM
GOOD NEWS: I built an #AutomatedPromptEngineering (APE) pipeline specifically for voice AI! 🤖✨ My new @towardsdatascience blog post dives deep.
What it does:
✅ Creates diverse audio tests
✅ Automates performance eval
✅ LLM optimizes your prompts! 👇
July 15, 2025 at 7:06 AM
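The three steps in the post above (build tests, score automatically, let a model rewrite the prompt) can be sketched as a small loop. This is only an illustration, not the pipeline from the blog post: `optimize_prompt`, `toy_evaluate`, and `toy_propose` are hypothetical names, the test cases are plain strings standing in for audio clips, and the "LLM" is a stub that appends a fixed hint.

```python
from typing import Callable, List, Tuple

History = List[Tuple[str, float]]  # (prompt, mean score) pairs tried so far


def optimize_prompt(
    seed_prompt: str,
    test_cases: List[str],
    evaluate: Callable[[str, str], float],
    propose: Callable[[History], str],
    rounds: int = 3,
) -> Tuple[str, float]:
    """Return the best-scoring prompt found after `rounds` proposals."""

    def mean_score(prompt: str) -> float:
        # Average the per-test-case scores for one prompt.
        return sum(evaluate(prompt, case) for case in test_cases) / len(test_cases)

    history: History = [(seed_prompt, mean_score(seed_prompt))]
    for _ in range(rounds):
        # The proposer sees every earlier attempt and its score.
        candidate = propose(history)
        history.append((candidate, mean_score(candidate)))
    return max(history, key=lambda item: item[1])


def toy_evaluate(prompt: str, case: str) -> float:
    # Assumption: a stand-in metric. A real evaluation would run the
    # voice assistant on an audio test and grade its response.
    return 1.0 if case in prompt else 0.0


def toy_propose(history: History) -> str:
    # Assumption: a stand-in for an LLM call that rewrites the best prompt.
    best_prompt, _ = max(history, key=lambda item: item[1])
    return best_prompt + " Confirm the booking date."


if __name__ == "__main__":
    prompt, score = optimize_prompt(
        "You are a voice assistant.",
        ["booking", "date"],
        toy_evaluate,
        toy_propose,
        rounds=2,
    )
    print(score, prompt)
```

Passing `evaluate` and `propose` in as callables keeps the loop independent of any particular model or audio tooling, so the stubs can be swapped for real components later.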
Thanks for sharing @towardsdatascience.com 🤗
July 15, 2025 at 7:03 AM
Building a 'V1' GenAI app on an existing platform while overhauling the foundation for a better 'V2' is a common strategy. But explaining this to everyday consumers is challenging. Interviews like these really help communicate that effectively. What did you think?
June 12, 2025 at 3:40 PM
To give an example, right out of the gate she asks, "Last year you announced a smarter AI-driven Siri. Where is she?"
From a developer's point of view, Apple's answers made a lot of sense: a 'V1' worked, but didn't meet their high quality/reliability standards when users went 'off the beaten path'.
June 12, 2025 at 3:40 PM
This year, however, Craig Federighi and Greg Joswiak were interviewed by other outlets, including Tom's Guide, TechRadar, and The Wall Street Journal. I particularly liked Joanna Stern's interview and her style: direct, concise, and challenging.
June 12, 2025 at 3:40 PM
While it's not fair to characterise Gruber as an "Apple fanboy," I consistently found his questions too long-winded and too softball. By the end, it often felt (to me, at least) like just a few folk were a bit too cosy on stage.
June 12, 2025 at 3:39 PM
I definitely hear you on that one 😅 Out of curiosity, what benefits are you looking to gain from an agent framework (in general)?
January 16, 2025 at 3:19 PM
Not perfect by any means, but much better already than "traditional" voice assistants, and we are only at the beginning of this journey.

You can try it yourself with the Developer Guide for Gemini's Multimodal Live API 🤗

github.com/heiko-hotz/g...
GitHub - heiko-hotz/gemini-multimodal-live-dev-guide: A developer guide for Gemini's Multimodal Live API
January 8, 2025 at 7:43 AM
I believe that multimodal AI models have the potential to change that. They let me speak much more freely about what I want them to do, and they often understand and execute it the way I expect.
January 8, 2025 at 7:43 AM
But soon I realised that these voice assistants still require a rigid syntax: I would have to phrase commands in a very specific way for the voice assistant to understand what I meant.
January 8, 2025 at 7:43 AM
To me, it was a magical moment when I got my first Amazon Echo in 2015 and could just shout words into the air and get a response.
January 8, 2025 at 7:43 AM