Stefan Grafberger
stefan-grafberger.com
@stefan-grafberger.com
PhD Student at BIFOLD & TU Berlin, researching data management for ML. Previously worked with UvA, Microsoft GSL, Amazon Research, Oracle Labs, and others.

https://stefan-grafberger.com
Very excited to share that I've started as a Software Engineer at Snowflake! 🥳

I’m also wrapping up my PhD: this week I’m at VLDB in London to present the last demo paper from my time as a PhD student, and on September 17 I’ll defend my PhD in Amsterdam.

Really looking forward to this next chapter!
September 1, 2025 at 11:46 PM
Reposted by Stefan Grafberger
Join us for discussions and talks on data management aspects for end-to-end ML on 27 June at @deem-workshop.bsky.social in Berlin. Keynotes by @pinartozun.bsky.social and @gaelvaroquaux.bsky.social 🤩

Check the full schedule deem-workshop.github.io#schedule & proceedings dl.acm.org/doi/proceedi...
Proceedings of the Workshop on Data Management for End-to-End Machine Learning | ACM Conferences
dl.acm.org
June 16, 2025 at 7:49 AM
Our demo "mlidea: Interactively Improving ML Data Preparation Code via 'Shadow Pipelines'" was accepted at VLDB! 🥳

We demo suggestions for ML pipelines, similar to IntelliJ code inspections or Grammarly suggestions

youtu.be/ePGm1J6S2qk

Joint work w/ @mersault.bsky.social @p-groth.bsky.social
May 30, 2025 at 7:09 PM
Reposted by Stefan Grafberger
📢 Deadline extension for DEEM 2025 @sigmod2025.bsky.social!

Following requests, we're extending the submission deadline to April 1, 5pm Pacific Time. More info at: deem-workshop.github.io
DEEM: Workshop on Data Management for End-to-End Machine Learning @ ACM SIGMOD 2025
deem-workshop.github.io
March 15, 2025 at 7:15 PM
Our vision "Towards Regaining Control over Messy ML Pipelines" was accepted for the DAIS workshop at ICDE! 🥳

Initial experiments show LLMs are promising for extracting declarative query plans from messy ML code.

Joint work w/ @guangchen811.bsky.social @oovcharenko.bsky.social @mersault.bsky.social
March 7, 2025 at 1:56 PM
Please help spread the word by reposting!

We've just created the official DEEM Workshop account: @deem-workshop.bsky.social
The Data Management for End-to-End Machine Learning workshop (@deem-workshop.bsky.social) will be back at #SIGMOD2025! ✨

🔗 Check out the CfP: deem-workshop.github.io
📝 Submission deadline: March 21
📢 Notifications: April 25

Join us for the 9th edition in Berlin!

#DEEM2025
DEEM - The 9th Workshop on Data Management for End-to-End Machine Learning is also co-located with SIGMOD/PODS 2025. The deadline for papers is March 21st. For more details, check out the website.
deem-workshop.github.io
February 7, 2025 at 9:10 PM
Reposted by Stefan Grafberger
We have a **Postdoc opening** in Berlin on Responsible Data Engineering!

This is a fully-funded position with salary level E14 at the newly founded DEEM Lab, as part of @bifold.berlin .

Details available at deem.berlin#jobs-57624
February 5, 2025 at 8:31 AM
Reposted by Stefan Grafberger
@stefan-grafberger.com, a Ph.D. student in the DEEM Lab at BIFOLD, is among the author team that presented the paper "Towards Query Optimizer as a Service (QOaaS) in a Unified LakeHouse Ecosystem: Can One QO Rule Them All?" at #CIDR2025.

#QOaaS #CIDR

www.bifold.berlin/news-events/...
January 22, 2025 at 2:15 PM
Reposted by Stefan Grafberger
Interested in a *PhD in Data Engineering* in Berlin? Our institute has several openings for PhD positions as part of its graduate school, see the post below!

And check out the following page for details on how to work with the DEEM Lab as part of the graduate school deem.berlin#jobs-189196
January 6, 2025 at 1:49 PM
Our CIDR'25 paper "Towards Query Optimizer as a Service (QOaaS) in a Unified LakeHouse Ecosystem: Can One QO Rule Them All?" is now on arXiv! Excited to have been a part of this project during my internship at Microsoft GSL!

arxiv.org/pdf/2411.13704
arxiv.org
November 22, 2024 at 8:18 PM
Reposted by Stefan Grafberger
Pls repost:

We, the DEEM Lab at TU Berlin, are hiring a postdoctoral researcher in data engineering for machine learning. Details available at:

deem.berlin#jobs-57624

This fully-funded position is part of the Berlin Institute for the Foundations of Learning and Data (BIFOLD).

#databs #datasky
November 15, 2024 at 9:30 AM