github.com/pipecat-ai/p...
My favorite thing about this demo is that it's a really nice example of composite function calling.
Here are the function definitions. Gemini figures out solely from the argument descriptions how to find a conversation from "a few minutes ago"!
github.com/pipecat-ai/p...
My favorite thing about this demo is that it's a really nice example of composite function calling.
Here are the function definitions. Gemini figures out solely from the argument descriptions how to find a conversation from "a few minutes ago"!
There are several ways to store (and later, retrieve) conversation state. One of the simplest is just to define a couple of functions and use your local filesystem!
Here, @chadbailey.net shows how to do that, using Gemini 2.0 Flash.
There are several ways to store (and later, retrieve) conversation state. One of the simplest is just to define a couple of functions and use your local filesystem!
Here, @chadbailey.net shows how to do that, using Gemini 2.0 Flash.
Sean is the creator of Pion. Pion is an Open Source WebRTC implementation that is influential and very widely used (including at OpenAI, where Sean works).
Sean is the creator of Pion. Pion is an Open Source WebRTC implementation that is influential and very widely used (including at OpenAI, where Sean works).
Filipi Fuchter added an iOS example to the Pipecat "Simple Chatbot" repo. With the Pipecat iOS SDK, you can build apps that use Gemini Multimodal Live and Gemini Flash with WebRTC, WebSockets, and HTTP networking.
Filipi Fuchter added an iOS example to the Pipecat "Simple Chatbot" repo. With the Pipecat iOS SDK, you can build apps that use Gemini Multimodal Live and Gemini Flash with WebRTC, WebSockets, and HTTP networking.
Gemini 1.5 Flash occupies an interesting place in the capabilities matrix for voice AI. It's fast, very inexpensive, has a long context window, and has native audio input.
I've been experimenting with Gemini a lot. Here's an interesting Pipecat pipeline:
Gemini 1.5 Flash occupies an interesting place in the capabilities matrix for voice AI. It's fast, very inexpensive, has a long context window, and has native audio input.
I've been experimenting with Gemini a lot. Here's an interesting Pipecat pipeline:
There was a four-participant limit on the team size. We have five, but two are robots.
Last night was a very long session with lots of tiny little screws and some heavy ifconfig action.
There was a four-participant limit on the team size. We have five, but two are robots.
Last night was a very long session with lots of tiny little screws and some heavy ifconfig action.