rmoff 🏃‍♂️🫖🥓
rmoff.net
@rmoff.net
Shitposting & Memes.

Data & Stuff.

#dataBS #trailrunning

✍️ https://rmoff.net 🔗 https://rmoff.info
Pinned
My most popular blog post is one I wrote about Kafka listener configuration. But the one that I am most proud of, when people say it's been useful for them, is the one I wrote about trying to get the balance right when travel is part of your job: rmoff.net/2019/02/09/t....
Travelling for Work, with Kids at Home
rmoff.net
Protobuf - use it faster, or don't use it at all? A couple of interesting talks from P99 CONF this year:

* www.p99conf.io/session/8x-b...
* www.p99conf.io/session/pars...
November 11, 2025 at 4:05 PM
I missed this release last week - #ApacheFluss now supports writing to both #ApacheIceberg and Lance (as well as the original #ApachePaimon).
fluss.apache.org/blog/release...
Blog | Apache Fluss™ (Incubating)
Blog
fluss.apache.org
November 11, 2025 at 2:20 PM
Three Flink-related talks at P99 CONF this year - here are the recordings and slides:
* www.p99conf.io/session/apac...
* www.p99conf.io/session/buil...
* www.p99conf.io/session/rivi...
November 11, 2025 at 11:10 AM
P99 CONF nails it in terms of both content, *and* UX for attending. Super-simple registration. Free registration. Videos available on-demand afterwards with no gating or games. Clear and usable website. These folk know how to do a developer conference. A+++

🎥 Videos & slides: www.p99conf.io
P99 CONF – All Things Performance On-Demand
P99 CONF is a cross-industry virtual event for engineers and by engineers, centered around low-latency, high-performance design.
www.p99conf.io
November 10, 2025 at 11:15 AM
You Should Write An Agent fly.io/blog/everyon...
You Should Write An Agent
They're like riding a bike: easy, and you don't get it until you try.
fly.io
November 9, 2025 at 4:05 PM
How Datadog built a low-latency, multi-tenant data replication platform with Kafka, Kafka Connect, Debezium, and Temporal www.datadoghq.com/blog/enginee...
Replication redefined: How we built a low-latency, multi-tenant data replication platform | Datadog
Discover how Datadog engineered a scalable Change Data Capture (CDC) platform to replicate data across systems in near real time—reducing search latency by 87%, increasing availability, and powering…
www.datadoghq.com
November 8, 2025 at 4:05 PM
Blogged: How we built the demo for the Current NOLA Day 2 keynote using Kafka, Flink, and AI

🔗 rmoff.net/2025/11/06/h...
How we built the demo for the Current NOLA Day 2 keynote using Flink and AI
rmoff.net
November 7, 2025 at 5:28 PM
How Faire use Cursor as their analytics IDE craft.faire.com/we-moved-ana...
We moved analytics into an IDE — and haven’t looked back
Faire’s journey to AI-native analysis with Cursor
craft.faire.com
November 7, 2025 at 4:05 PM
Good post from Sam Harris about public speaking, including some good lessons for conference speakers specifically samharris.substack.com/p/the-silent...
The Silent Crowd
Overcoming Your Fear of Public Speaking
samharris.substack.com
November 6, 2025 at 4:05 PM
#duckDB question - is there a smart way to expose all the `value` string fields without manually unpacking them by name?

e.g. this is tedious:

SELECT value.animalName.string, value.deviceID.string

Sample record: gist.github.com/rmoff/38aff1...
November 6, 2025 at 11:55 AM
Quick post about the data blips on the latest @thoughtworks.com tech radar:
rmoff.net/2025/11/05/t...
Tech Radar (Nov 2025) - data blips
rmoff.net
November 5, 2025 at 5:16 PM
Reposted by rmoff 🏃‍♂️🫖🥓
Stream-order vs batch-order in Iceberg:
* Flink wants temporal locality.
* Spark wants value locality.

Same table, conflicting physics.

New post: jack-vanlightly.com/blog/2025/11...
How Would You Like Your Iceberg Sir? Stream or Batch Ordered? — Jack Vanlightly
Today I want to talk about stream analytics, batch analytics and Apache Iceberg. Stream and batch analytics work differently but both can be built on top of Iceberg, but due to their differences there...
jack-vanlightly.com
November 5, 2025 at 2:52 PM
Reposted by rmoff 🏃‍♂️🫖🥓
💯 we had an entire "logs" database at WePay and it was a nightmare. It's convenient until it isn't. And backing out is brutal.
📝 Blogged: "'You Don't Need Kafka, Just Use Postgres' Considered Harmful"

In which I'm arguing that both Postgres and Kafka are great tools for their respective purposes. But don't create your custom implementation of one on top of the other.

👉 www.morling.dev/blog/you-don...
November 3, 2025 at 4:06 PM
Check it out - I was on a podcast :) @gamussa.dev was kind enough to spend some time chatting with me about my time working with Oracle, Kafka, and more.

🎧 confluent.buzzsprout.com/186154/episo...
🎥 www.youtube.com/watch?v=U0t5...
November 3, 2025 at 4:05 PM
ICYMI: Interesting Links in the data world for October: rmoff.net/2025/10/31/i...
November 3, 2025 at 10:24 AM
With nary 8.5 hours to spare (GMT) before the end of the month, herewith a whole lotta links about things in the data engineering world that I found interesting this month.

👉 rmoff.net/2025/10/31/i...
Interesting links - October 2025
rmoff.net
October 31, 2025 at 3:35 PM
Reposted by rmoff 🏃‍♂️🫖🥓
If you were affected by recent changes to MinIO, check out the @chainguard.dev MinIO image. We didn't want people to be stuck on an unmaintained image, so we've added our version to the free tier.

Secure and Free MinIO Chainguard Containers
MinIO pulled its free images—but Chainguard has you covered. Get zero-CVE, continuously built MinIO and MinIO Client containers, free and secure from Chainguard.
www.chainguard.dev
October 28, 2025 at 11:05 AM
🐿️ My new favourite laptop sticker 😍😆
October 28, 2025 at 9:45 AM
One of the things I love about @vanlightly.bsky.social's posts is that he is both opinionated *and* highly informed and articulate in his argument. This post is a great example: Why I’m not a fan of zero-copy Apache Kafka-Apache Iceberg jack-vanlightly.com/blog/2025/10...
Why I’m not a fan of zero-copy Apache Kafka-Apache Iceberg — Jack Vanlightly
Over the past few months, I’ve seen a growing number of posts on social media promoting the idea of a “zero-copy” integration between Apache Kafka and Apache Iceberg. The idea is that Kafka topics…
jack-vanlightly.com
October 24, 2025 at 3:08 PM
Excellent analysis from @vanlightly.bsky.social: A Fork in the Road: Deciding #ApacheKafka’s Diskless Future jack-vanlightly.com/blog/2025/10...
October 23, 2025 at 3:07 PM
Ended up going with asciidoc + revealJS.
Love the low friction and focus on writing.

Check out the results: rmoff.net/talk/blog-wr... and the source: raw.githubusercontent.com/rmoff/rmoff-...
October 23, 2025 at 10:53 AM
Reposted by rmoff 🏃‍♂️🫖🥓
#P99CONF is going live in just minutes! Here's a peek at the behind-the-scenes prep. If you haven't registered yet, hurry over to p99conf.io. It's free and virtual, so no excuses. 😅
October 22, 2025 at 2:13 PM
Reposted by rmoff 🏃‍♂️🫖🥓
I used #matterjs to do a bit of data visualization using physical metaphors and I loved it. We should all do more #datavisualization with physics engines, especially for systems like this.
Also, if you want to learn about watermarks, give it a try.
flink-watermarks.wtf
October 20, 2025 at 4:10 PM
Have you ever found yourself thinking…Flink Watermarks…WTF?
🔥 This new tool might be for you :)

It's a hands-on, scrollytelling walkthrough of what watermarks in #ApacheFlink are, why they matter, and how to use them.

Try it out: flink-watermarks.wtf

#dataBS
Flink Watermarks…WTF?
Master Apache Flink watermarks with this interactive simulation. Learn what they are, why you need them, and how to configure them through hands-on examples.
flink-watermarks.wtf
October 20, 2025 at 1:44 PM