Andy Grove
banner
andygrove.io
Andy Grove
@andygrove.io
Apache Arrow & DataFusion PMC Member. Original creator of Apache DataFusion.
We have TPC-H benchmarks for single node with a small scale factor in the contributors guide. We only benchmark against Spark though and not against Spark RAPIDS.

datafusion.apache.org/comet/contri...
Apache DataFusion Comet: Benchmarks Derived From TPC-H — Apache DataFusion Comet documentation
datafusion.apache.org
March 21, 2025 at 12:49 AM
I hate to say it, but "it depends". I'd recommend running your own benchmarks for your specific workloads. Performance will also vary greatly by environment (number of CPUs vs GPUs, different GPU types, and so on).
March 19, 2025 at 10:20 PM
Is this using Arrow and/or DataFusion? If so, our Discord is probably a good place to ask.

datafusion.apache.org/contributor-...
Communication — Apache DataFusion documentation
datafusion.apache.org
January 23, 2025 at 10:17 PM