Andi Zimmerer
andizimmerer.bsky.social
Andi Zimmerer
@andizimmerer.bsky.social
PhD at University of Technology Nuremberg, researching on Database Systems. Formerly engineer at Snowflake Inc. on query acceleration; spent some academic time at MIT 🇺🇸, TUM 🇩🇪 and NTU 🇸🇬. 🎯 Berlin
https://www.andi-zimmerer.com
Reposted by Andi Zimmerer
I love older papers.

Lenstra and Kan, 1979 "Computational Complexity of Discrete Optimization Problems" Annals of Discrete Mathematics

#orms
May 6, 2025 at 4:45 PM
"The fastest way of processing data is to not process it."

Our SIGMOD 2025 paper shows how Snowflake skips 99.4% of data with new pruning techniques for LIMIT, top-k, and JOIN queries.

Blog: snowflakepruning.github.io
Paper: arxiv.org/abs/2504.11540

@sigmod2025.bsky.social
Andi Zimmerer | Pruning in Snowflake: Working Smarter, Not Harder
Modern cloud-based data analytics systems must efficiently process petabytes of data residing on cloud storage. A key optimization technique in state-of-the-art systems like Snowflake is partition pru...
snowflakepruning.github.io
May 5, 2025 at 5:09 AM
Camera-ready version of the paper submitted => 115 tabs in Chrome closed.
a woman is sitting at a table with a dart board in the background and says `` i 'm done '' .
ALT: a woman is sitting at a table with a dart board in the background and says `` i 'm done '' .
media.tenor.com
April 8, 2025 at 6:42 AM
Reposted by Andi Zimmerer
We just released Redbench, a new benchmark that contains 30 analytical SQL workloads that can be used to benchmark workload-driven optimizations. Go check it out!

GitHub: github.com/utndatasyste...
GitHub - utndatasystems/redbench: Redbench is a set of 30 analytical SQL workloads that can be used to benchmark workload-driven optimizations.
Redbench is a set of 30 analytical SQL workloads that can be used to benchmark workload-driven optimizations. - utndatasystems/redbench
github.com
March 25, 2025 at 9:42 PM
The first day of the BTW Conference in Bamberg is coming to an end.

Some personal favorites:
- Ismail's talk on Pruning in Snowflake
- @stefan-grafberger.com's talk on what-if analysis in ML pipelines and automatically patching ML pipelines in the background
- Observe Inc's presentation
March 4, 2025 at 7:26 PM
Reposted by Andi Zimmerer
Please help spread the word by reposting!

We've just created the official DEEM Workshop account: @deem-workshop.bsky.social
The Data Management for End-to-End Machine Learning workshop (@deem-workshop.bsky.social) will be back at #SIGMOD2025! ✨

🔗 Check out the CfP: deem-workshop.github.io
📝 Submission deadline: March 21
📢 Notifications: April 25

Join us for the 9th edition in Berlin!

#DEEM2025
DEEM - The 9th Workshop on End-to-End Data Management is also co-located with SIGMOD/PODS 2025. The deadline for papers is March 21st. For more details checkout the website
deem-workshop.github.io
February 7, 2025 at 9:10 PM
My very first paper got accepted to @sigmod2025.bsky.social! Yay! Means I'll be playing a home game in Berlin
February 26, 2025 at 5:13 PM
Reading cacm.acm.org/blogcacm/21s... makes me think that Rust was just a giant research project and valuable findings are now being streamed back into C++, making them usable to a broader audience.
21st Century C++ – Communications of the ACM
cacm.acm.org
February 10, 2025 at 5:31 PM
My professor jokingly threatened me that I would get fired if my VO2 Max is too low. After a run with him it's at 58 now. I guess I can continue my PhD 😋
February 4, 2025 at 7:40 PM
The Nuremberg Data Systems Lab is now on Bluesky 🙌 @utndatasystems.bsky.social
January 29, 2025 at 6:03 PM
In academia, everyone always has a Colleague Working On Exactly This Problem. I still have to find one. Applications open.
January 15, 2025 at 6:20 PM
Reposted by Andi Zimmerer
Exciting News! 🎉
#Tampere will host EDBT/ICDT 2026! ✨
Even before the 2025 edition, the important dates are already out:
📅 Round 1 starts on:
February 5 for EDBT Papers
March 13 for ICDT Papers
edbticdt2026.github.io

We can’t wait to see your great submissions and welcome you to Tampere! 🙌
EDBT/ICDT 2026 Joint Conference - 24th March - 27th March, 2026 - Tampere, Finland
edbticdt2026.github.io
January 14, 2025 at 5:30 PM
Reposted by Andi Zimmerer
The @sigmod2025.bsky.social Programming Contest goes into another round. We (Bo Tang, Tilmann Rabl, and myself) just published the timeline and task overview:
sigmod-contest-2025.github.io/index.html

Thanks to Carlo Curino and @microsoft.com for the continued support.
January 4, 2025 at 10:30 AM
Reposted by Andi Zimmerer
If I’m ever a professor again, I want to give a graduate seminar, topics to include:

- how not to say stupid shit about fields outside your expertise
- what is your expertise, anyway?
- how not to be an insufferable bore
- your PhD doesn’t make you a better person: coping with that

Other ideas?
December 12, 2024 at 8:30 PM
I love how dedicated some students are. They are supposed to create a 5min video about a topic and one of them sends me their slides for review. 15(!) detailed(!) slides(!)
December 12, 2024 at 5:25 PM
I proudly sneaked in an example about nature conservation into a database research paper 🌳
December 1, 2024 at 4:15 PM
The scale of Google Spanner is staggering - 5 billion queries per second! youtu.be/uy3LjRPFoKw?...
Evolution of the Storage Engine for Spanner, an Exabyte-scale Database System
YouTube video by Jignesh Patel
youtu.be
December 1, 2024 at 12:21 AM