Jan Kaul
jankaul.bsky.social
Reposted by Jan Kaul
At long last, @chris.blue and I have submitted the final manuscript of Designing Data-Intensive Applications, second edition, to the publisher. There is always more that could be improved but at some point we just have to call it done. Now it goes into production; probably shipping in ~4 months.
October 20, 2025 at 7:54 PM
So #databs, who wants to try S3 Tables?

I've created a DataFusion distribution with support for S3 Tables that you can run from the command line. This is all you need:

frostbow -u arn:aws:s3tables:us-east-1:123456789:bucket/my-bucket-prefix

Check out the example: github.com/JanKaul/fros...
December 11, 2024 at 10:50 AM
Does anybody have an idea how S3 Tables can achieve 3x performance compared to regular Iceberg tables on S3?

Does it work similarly to the object-store file layout?
iceberg.apache.org/docs/latest/...

#databs
AWS - Apache Iceberg™
December 5, 2024 at 6:55 PM
Join me tomorrow for my talk: "Dashtool - A data build tool to use Iceberg Materialized Views for data transformations" as part of the Open Source Analytics Community Series.

December 3rd, 15:00 UTC

us02web.zoom.us/w/8299438519...
Passcode: 299889
December 2, 2024 at 9:12 PM
Reposted by Jan Kaul
This has been a long time coming. After the concerted efforts of many engineers from across companies and across continents, some great benchmark results have been posted:

datafusion.apache.org/blog/2024/11...
November 22, 2024 at 3:03 PM
There is a discussion on the Top 2025 Data Trends with Matt Housley today. Could be interesting. #databs
www.linkedin.com/events/top20...
Top 2025 Data Trends with Matt Housley | LinkedIn
What are the top data trends entering 2025 and who will win the data and AI wars? This week we're joined by superstar author and Leonardo Dicaprio lookalike Matthew Housley to go deep on what data tre...
November 14, 2024 at 10:13 AM
I recently gave a talk about Iceberg Materialized Views at Chill Data Summit London.

I think Iceberg Materialized Views can really revolutionize the way we approach data transformations.

Watch the recording here:
youtu.be/bDxUeReHyHQ?...
Analytical Data Transformations with Apache Iceberg Materialized Views | Presentation by Jan Kaul
YouTube video by Upsolver
November 5, 2024 at 9:16 PM
Reposted by Jan Kaul
Databricks wants to merge Apache Iceberg and Delta Lake...
October 28, 2024 at 2:15 PM
Bluesky now has over 10 million users, and I was #2,062,888!
September 18, 2024 at 8:25 PM