John Kutay
banner
jkxosound.com
John Kutay
@jkxosound.com
Product & Engineering @ Striim (streaming sql, change data capture), creator/host @ What's New in Data pod, music @ jkxo
Pinned
"If I could also get added to the PR before you refactor the schema design that would be great"
Reposted by John Kutay
Fun conversation with @jkxosound.com about where I think things are going in the AI agents space, Strands Agents, and rats 🐀
@clare.dev is one of the leaders making AI engineering simple and scalable at AWS. Had a great time chatting with her as we discussed Strands Agents and patterns like “Retrieval as a Tool”.
July 17, 2025 at 7:48 PM
@clare.dev is one of the leaders making AI engineering simple and scalable at AWS. Had a great time chatting with her as we discussed Strands Agents and patterns like “Retrieval as a Tool”.
July 17, 2025 at 6:54 PM
Really appreciate the depth at which the Hex team broke down their Text-to-SQL implementation. Everyone's trying to teach LLMs SQL like it's a training problem but it's really a graph traversal problem.
July 13, 2025 at 11:17 PM
In my post about #DuckDB I digress into the role of the database buffer cache to discuss how we segregate transactional workloads from analytical. DuckDB turned out to be a natural, lightweight approach to offloading analytical queries ensuring our application upheld performance requirements.
April 2, 2025 at 11:01 PM
Instead of materialized views, we built in-process DuckDB caching in the control plane of Striim Developer — improving query performance 5–10x with zero added infra.

PostgreSQL for OLTP, DuckDB for Operational OLAP. But I won't call it HTAP 🤐

medium.com/striim/beyon...
Beyond Materialized Views: Using DuckDB for In-Process Columnar Caching
In this post we will talk about using DuckDB as the operational analytics store for the control plane of Striim Developer — a serverless…
medium.com
April 2, 2025 at 5:47 PM
@marcbrooker.bsky.social breaks down how they've architected a fully ACID-compliant database service that combines simple, serverless management with high availability and massive scale on AWS Aurora DSQL.

youtube.com/shorts/dScUi...
Distributed PostgreSQL with Aurora DSQL
YouTube video by Striim
youtube.com
March 26, 2025 at 4:48 PM
This was actually my longest podcast ever at over 70 minutes. Not sure I could have made it any shorter because nerding out on databases with Andy Pavlo was too fun.
Was super fun chatting with @andypavlo.bsky.social
to kick off the new season of What's New in Data. We dive into vector databases, text to sql, trends in data infrastructure, and Andy's awesome (and open) database course.

youtube.com/shorts/tjLmx...
Andy Pavlo on Vector Databases
YouTube video by Striim
youtube.com
March 19, 2025 at 5:06 AM
Was super fun chatting with @andypavlo.bsky.social
to kick off the new season of What's New in Data. We dive into vector databases, text to sql, trends in data infrastructure, and Andy's awesome (and open) database course.

youtube.com/shorts/tjLmx...
Andy Pavlo on Vector Databases
YouTube video by Striim
youtube.com
March 18, 2025 at 3:51 PM
A side effect of LLMs: I'm taking on way more than I ever have in my life. I don't know if this is more productive or diluting myself. tbd!
February 26, 2025 at 1:17 AM
Just found out one of the internal b2b CRUD app vendors is more like CRD because it doesn't support updating submissions. AI gonna cook that sector so hard.
February 25, 2025 at 1:28 AM
and that’s why I’m working on a Saturday morning 🫠
February 22, 2025 at 7:21 PM
Your adversaries are taking (not my) Presidents Day off. Time to ship. 🚀
February 17, 2025 at 8:25 PM
I’ll never forget where I was the day I learned oats could be milked.
February 17, 2025 at 1:38 AM
Them: Wait so you're saying I don't need to deploy Kafka?
Me: No
Them: Kinesis?
Me: No
Them: Zookeeper? YARN?
Me: No
Them: Will you write every record to disk and replicate it?
Me: No

Unfortunately the bar of complexity for streaming has been set so high. I'm calling it Streamholm Syndrome.
February 14, 2025 at 7:45 PM
I'm not sure whether to be more amazed at the hate for FiveTran's price increase or the fact that Reddit doesn't know Striim exists and are proposing batch solutions to this persons obvious streaming CDC use case.

www.reddit.com/r/dataengine...
I am trying to escape the Fivetran price increase
www.reddit.com
February 14, 2025 at 7:33 AM
They really gave the smell of rain an epic name: petrichor. They really did that.
February 14, 2025 at 4:02 AM
A single Striim cluster (multi-node for scalability and fault tolernace) can handle 35k, very wide, very active databases that produce millions of DML per hour hour and dozens of DDL per day. The 'intelligence' layer or Striim was able to apply rule based logic on how to handle complex DDL.
February 13, 2025 at 8:58 PM
I will die on this hill but MySQL's 'Alter Table Add Column AFTER' DDL is pointless. It doesn't change the layout on disk. If you care about order of the columns, that's purely a read side construct and you should address it in your query not your DDL!
February 12, 2025 at 4:45 AM
We’ve shifted embedding generation and transformers left into the streaming layer to support near real-time RAG. Take a read if you want to hear the optimizations we made for change data capture and incremental embedding generation.

www.striim.com/blog/real-ti...
Real-Time RAG: Streaming Vector Embeddings and Low-Latency AI Search
Imagine searching for products on an online store by simply typing “best eco-friendly toys for toddlers under $50” and getting instant, accurate results—while the inventory is synchronized seamlessly ...
www.striim.com
February 12, 2025 at 12:35 AM
Remind me: 1,000 days.
February 9, 2025 at 11:05 PM
Driving from SF to LA talking to ChatGPT about Kafka. I think that’s how schizo starts.
February 8, 2025 at 10:47 PM
I always love seeing clovers in the wild! Knowing every transaction is streamed to Snowflake in real time with Striim’s streaming CDC service. We do a lot of work to ensure transactions are bound reliably and replicated with no duplicates while maintaining low latency.
February 8, 2025 at 8:50 PM
youtu.be/9YSU-M0m1Jk?...

When I saw the fire up the hill, I knew we wouldn't have much time to get out before the only escape road would become gridlocked. I texted my friends up on Lachman Ln an hour before they got an alert from the city to leave.
Escape from the Palisades: Split-second decision making, confusing responses
YouTube video by Los Angeles Times
m.youtube.com
February 5, 2025 at 9:55 PM
It's 2025 why am I still explaining federation vs code conversion vs replication.
February 5, 2025 at 6:59 AM
As someone taking inventory of receipts of everything I ever owned, I regret every single purchase I didn't make online because its so hard to find the receipt.
February 5, 2025 at 4:41 AM