Medium: https://heiko-hotz.medium.com/
Github: https://github.com/heiko-hotz
LI: https://www.linkedin.com/in/heikohotz/
For years, high-level Apple execs would come to John Gruber's (from Daring Fireball) Talk Show at WWDC. I often found these interviews less than insightful, and sometimes even annoying.
In this quick demo, I’ve integrated a “Slide Creation Agent” into my personal project, Project Pastra. Watch how it effortlessly generates slides based on voice instructions.
youtu.be/0OEDHAjY6LM
After the hard-fought battle of implementing proper audio communication in chapter 5, adding video capabilities to the multimodal live app à la Project Astra was a breeze.
More times than I'm proud to admit did I utter these words into my laptop over the past few days 😅
To get started with the API, I wrote a small Python script (83 lines of code) that demonstrates how to set up real-time, two-way audio communication with a Gemini language model.
Gemini 2.0 Flash Thinking vs GPT-4o vs Claude 3.5 Sonnet. I was honestly surprised by the results.
Would love it if someone could check with o1 (Pro) 🤗
(Credit to the Bluesky community where I saw this puzzle a few days ago)
You thought we were done shipping, am I right? But the Google DeepMind folks had one more ace up their sleeves, and it's a big one!
#google #gemini #gemini2.0 #deepmind
The Gemini Multimodal Live API has taken the developer community by storm and many have already started building with it. Here I show how to talk to your docs.
youtu.be/0ak684rtRvA
#google #gemini #live #multimodal