Lightnews — Scholar-powered news

Reposted by Mihail Stoian

Disseminate: The Computer Science Research Podcast

@disseminatepodcast.bsky.social

🦆 Episode 3 in Season 2 of the DuckDB in Research series is out now!

🎙️ Parachute: Rethinking Query Execution and Bidirectional Information Flow in DuckDB (@duckdb.org) with Mihail Stoian (@mihailstoian.bsky.social)

🔗 Listen now on Spotify: open.spotify.com/episode/21HQ...

Parachute: Rethinking Query Execution and Bidirectional Information Flow in DuckDB - with Mihail Stoian

open.spotify.com

October 30, 2025 at 8:42 AM

Mihail Stoian

@mihailstoian.bsky.social

Big thanks to Jack Waudby for having me on his @disseminatepodcast.bsky.social! Enjoyed our chat about 🪂 Parachute and how @duckdb.org's ecosystem makes testing database research prototypes smoother than ever.

Highly recommend the podcast for anyone into cutting-edge CS research.

DuckDB @duckdb.org · 18d

📢 A new DuckDB in Research podcast episode is out.

📈 In this week's episode, Jack Waudby interviews Mihail Stoian (@mihailstoian.bsky.social), PhD student at the Data Systems Lab, UT Nuremberg about the Parachute approach for robust query processing.

🎧 Listen at duckdb.org/science/miha...

October 30, 2025 at 9:32 PM

Reposted by Mihail Stoian

Andi Zimmerer

@andizimmerer.bsky.social

"The fastest way of processing data is to not process it."

Our SIGMOD 2025 paper shows how Snowflake skips 99.4% of data with new pruning techniques for LIMIT, top-k, and JOIN queries.

Blog: snowflakepruning.github.io
Paper: arxiv.org/abs/2504.11540

@sigmod2025.bsky.social

Andi Zimmerer | Pruning in Snowflake: Working Smarter, Not Harder

Modern cloud-based data analytics systems must efficiently process petabytes of data residing on cloud storage. A key optimization technique in state-of-the-art systems like Snowflake is partition pru...

snowflakepruning.github.io

May 5, 2025 at 5:09 AM

Reposted by Mihail Stoian

SIGMOD/PODS 2025

@sigmod2025.bsky.social

SIGMOD BEST PAPER Honorable Mentions
🥇 CRDV: Conflict-free Replicated Data Views
Nuno Faria (INESCTEC & U. Minho)*; José Pereira (U. Minho & INESCTEC)
🥇 DPconv: Super-Polynomially Faster Join Ordering
Mihail Stoian (UTN)*; Andreas Kipf (UTN)

April 22, 2025 at 7:29 PM

Mihail Stoian

@mihailstoian.bsky.social

DPconv just won a SIGMOD'25 Honorable Mention! 🥁

I was quite impressed, given this year's high-quality papers. Let's see who won the big prize.

My list of candidates in the thread below 🧵.

Paper: dl.acm.org/doi/10.1145/...
Slides: stoianmihail.github.io/assets/dpcon...

April 10, 2025 at 5:57 PM

Mihail Stoian

@mihailstoian.bsky.social

🔺Redbench is now live: github.com/utndatasyste....

Let's see how workload-aware your system really is.

Andreas Kipf @andreaskipf.bsky.social · Mar 25

We just released Redbench, a new benchmark that contains 30 analytical SQL workloads that can be used to benchmark workload-driven optimizations. Go check it out!

GitHub: github.com/utndatasyste...

GitHub - utndatasystems/redbench: Redbench is a set of 30 analytical SQL workloads that can be used to benchmark workload-driven optimizations.

Redbench is a set of 30 analytical SQL workloads that can be used to benchmark workload-driven optimizations. - utndatasystems/redbench

github.com

April 9, 2025 at 10:11 PM

Reposted by Mihail Stoian

Andreas Kipf

@andreaskipf.bsky.social

Thrilled to share that we've received the Best Demonstration Award 🏆 at EDBT 2025!

Congratulations to my students @mihailstoian.bsky.social and Ping-Lin Kuo for their excellent work and dedication over the past few weeks—well deserved!

Paper: openproceedings.org/2025/conf/ed...

March 28, 2025 at 1:39 PM

Mihail Stoian

@mihailstoian.bsky.social

Umbra's DP optimizer for queries of ~100 relations ran in cubic time.

AWS Redshift's Redset captures a 2,296-relation query.

Our revamped DP enumeration optimizes tree queries like snowflakes of *millions* of relations within 1 sec. 🛸

Joint work w/ Altan Birler & Thomas Neumann.

January 13, 2025 at 7:20 AM

Mihail Stoian

@mihailstoian.bsky.social

Are you a fan of Parquet and at #NeurIPS2024 tomorrow? Let's meet at our poster at @trl-research.bsky.social to see how you can reduce your Parquet file sizes by up to 40%.

Virtual compresses tables via functions while ensuring fast column scans.

⏰ 2.30pm
📍East Meeting Room 11 & 12

December 14, 2024 at 1:06 AM

Reposted by Mihail Stoian

PVLDB

@pvldb.bsky.social

Vol:17 No:12 → DataLoom: Simplifying Data Loading with LLMs
👥 Authors: Alexander Van Renen, Mihail Stoian, Andreas Kipf
📄 PDF: https://www.vldb.org/pvldb/vol17/p4449-renen.pdf

Thumbnail: DataLoom: Simplifying Data Loading with LLMs

December 2, 2024 at 5:00 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news