Delta Lake
banner
deltalakeoss.bsky.social
Delta Lake
@deltalakeoss.bsky.social
Delta Lake is an open-source storage framework that enables building a Lakehouse architecture for Spark, Flink, Trino, Hive, Scala, Java, Rust, Python, & more.
Did you know you can easily enable Variant Type in existing Delta tables? 💡 Just run an ALTER TABLE command to activate the Delta feature: `variantType`.

Watch the full webinar ➡️ www.youtube.com/live/0BfUG2P...

#DeltaLake #OpenSource #OSS
October 16, 2025 at 6:48 PM
Open Lakehouse + AI spotlight: @hannes.muehleisen.org shares how anyone can contribute to DuckDB & DuckLake—both MIT-licensed, open source, & welcoming PRs and community engagement on GitHub.

Check out this clip to hear how easy it is to get involved and start contributing today. 🚀

#openlakehouse
October 15, 2025 at 7:13 PM
One of the most powerful new capabilities in Delta Lake 4.0 is support for collations, giving you much finer control over how text is compared and sorted. ✅

In this clip, Youssef Mrini explains how to enable collations.

🎥 Watch the full webinar: youtube.com/live/0BfUG2PZyNs?si=nzcalgk1gvOEFBrY
October 8, 2025 at 7:25 PM
Delta Connect is a plugin atop Spark Connect, introduced in Spark 3.4, and enables gRPC communication using Protocol Buffers. ✅ This allows client implementations in languages such as Rust and Go to interact with Spark outside the JVM, provided they support the protocol.

#opensource #oss #deltalake
October 3, 2025 at 4:46 PM
At Open Lakehouse + AI Amsterdam last month, Sammy Sidhu—co-creator of the Daft multi-modal query engine and CEO of Eventual—shared how his open source journey began. 🚀

Ready to shape the future of data, open source, and AI together? Join us at our next Open Lakehouse + AI events! ⬇️

#opensource
September 30, 2025 at 2:30 PM
“Open source is giving back, because it means everyone who supports our work can participate and benefit from the software we are making." — @hannes.muehleisen.org (@duckdb.org) at Open Lakehouse + AI Amsterdam 🇳🇱

Ready to shape the future of data, open source, and AI together? ⬇️

#opensource #oss
September 22, 2025 at 2:40 PM
Dropping features in #DeltaLake used to break table compatibility and history. Delta 4.0 fixes this by protecting checkpoints and rewriting recent history—keeping tables safe and stable. 💪

🎥 Watch the full webinar replay: www.youtube.com/watch?v=0BfU...

#opensource #oss
September 19, 2025 at 7:01 PM
Curious about how Delta Lake structures its tables and achieves both scalability and reliability?

In our recent webinar, Scott Haines breaks down the building blocks of a #DeltaLake table.

▶️ Watch the full webinar to learn more: www.youtube.com/live/O8_82Cu...

#opensource #oss #metadata #parquet
July 24, 2025 at 5:10 PM
In this clip, Ion Koutsouris explains how lakeFS garbage collection policy integrates with #DeltaLake to manage unreferenced files and automate cleanups.

By leveraging #Spark jobs, #lakeFS ensures your Delta Lake storage remains lean and organized. ✅

🎥 Learn more: www.youtube.com/live/OH1vH19...
July 18, 2025 at 7:34 PM
“We cut our streaming ingestion costs over 90% by adopting kafka-delta-ingest, which means we can invest those savings in really interesting large language model products or innovative data processes.” — R. Tyler Croy, Principal Engineer at Scribd & Maintainer of delta-rs 💬
July 18, 2025 at 1:36 PM
🤔 How fast can you process a 10M-row CSV into Delta Lake?

Daniel Beach put Daft to the test by reading a 1.1GB CSV (10 million lines) and writing it to Delta Lake—all in just 3 minutes on AWS Lambda. 🚀

🎥 Watch the full webinar: www.youtube.com/watch?v=BR9o...

#opensource #oss #deltalake #daft
July 17, 2025 at 7:18 PM
With Liquid Clustering, you don’t need to worry about cardinality. Whether your data has high or low cardinality, #LiquidClustering dynamically handles it—no need for manual partition tuning. ✅

🎥 Learn more: www.youtube.com/live/l8CEyXg...

(1/2)

#opensource #oss #deltalake #linuxfoundation
July 16, 2025 at 6:11 PM
"With over three petabytes of processed data and more than 1,200 active users, our Lakehouse platform powered by Delta Lake is at the core of how we drive insights at scale." - Satya Mandavilli

👉 Watch the full video: www.youtube.com/live/1Bp8VUW...

#opensource #oss #deltalake #lakehouse
July 15, 2025 at 5:59 PM
In this clip, Youssef Mrini explains how deletion vectors in #DeltaLake help you avoid rewriting #Parquet files for every update or delete. Instead, a bitmap marks which rows are deleted, so files are only rewritten when necessary. ✅

🎥 Watch the full video for more: www.youtube.com/live/O8_82Cu...
July 14, 2025 at 9:26 PM