datafusion.apache.org/blog/2025/10...
datafusion.apache.org/blog/2025/10...
www.streamingdata.tech/p/streaming-...
datafusion.apache.org/blog/2025/09...
datafusion.apache.org/blog/2025/09...
Reduce your Flink compute cost by up to 2x or handle 2x more data with the same infrastructure.
Reduce your Flink compute cost by up to 2x or handle 2x more data with the same infrastructure.
SF1000 (1TB raw, 220GB in @ApacheParquet ) in less than 10 mins (6m45s) on aging laptop
Try it now:
pip install tpchgen-cli
tpchgen-cli --scale-factor 1000 --parts 100 --format=parquet
github.com/clflushopt/t...
SF1000 (1TB raw, 220GB in @ApacheParquet ) in less than 10 mins (6m45s) on aging laptop
Try it now:
pip install tpchgen-cli
tpchgen-cli --scale-factor 1000 --parts 100 --format=parquet
github.com/clflushopt/t...
You get HA Postgres plus seamless replication and DataFusion-based queries. This query turned out 6x faster than PG.
You get HA Postgres plus seamless replication and DataFusion-based queries. This query turned out 6x faster than PG.
datafusion.apache.org/comet/contri...
datafusion.apache.org/comet/contri...
Come work on super interesting problems with world class team. Help us build better Cassandra!
Ping me if you’re interested!
jobs.apple.com/en-us/detail...
Come work on super interesting problems with world class team. Help us build better Cassandra!
Ping me if you’re interested!
jobs.apple.com/en-us/detail...
github.com/apache/dataf...
github.com/apache/dataf...
datafusion.apache.org/blog/2025/03...
#DataFusion #Python #DataFrame #PyData #Apache
datafusion.apache.org/blog/2025/03...
#DataFusion #Python #DataFrame #PyData #Apache
jobs.apple.com/en-us/detail...
jobs.apple.com/en-us/detail...
The repo has been updated with the latest benchmark results. For single executor TPC-H @ 100 GB, we now see a 2.2x increase over Spark (up from 2x in 0.6.0).
github.com/apache/dataf...
The repo has been updated with the latest benchmark results. For single executor TPC-H @ 100 GB, we now see a 2.2x increase over Spark (up from 2x in 0.6.0).
github.com/apache/dataf...
I have replaced the scrolling time with listening to podcasts.
I now stay in touch with family overseas via email and photo sharing, and I use Snapchat for sharing photos with immediate family, privately. Works great.
It is a real shame, though, because it was a good way to stay connected with family.
Is there a viable alternative? What are others using instead?
I have replaced the scrolling time with listening to podcasts.
I now stay in touch with family overseas via email and photo sharing, and I use Snapchat for sharing photos with immediate family, privately. Works great.
cnr.sh/posts/compar...
cnr.sh/posts/compar...
datafusion.apache.org/blog/2025/02...
datafusion.apache.org/blog/2025/02...
datafusion.apache.org/blog/2025/02...
datafusion.apache.org/blog/2025/02...
It is a great overview of how to build a distributed system on top of DataFusion.
www.youtube.com/watch?v=ceTo...
It is a great overview of how to build a distributed system on top of DataFusion.
www.youtube.com/watch?v=ceTo...
It is a real shame, though, because it was a good way to stay connected with family.
Is there a viable alternative? What are others using instead?
It is a real shame, though, because it was a good way to stay connected with family.
Is there a viable alternative? What are others using instead?
Inspired by @andrewlamb1111.bsky.social's weekly updates in DataFusion core, I am going to start doing the same in Comet to help keep the community updated on current events.
github.com/apache/dataf...
Inspired by @andrewlamb1111.bsky.social's weekly updates in DataFusion core, I am going to start doing the same in Comet to help keep the community updated on current events.
github.com/apache/dataf...
datafusion.apache.org/blog/2025/01...
datafusion.apache.org/blog/2025/01...