Lightnews — Scholar-powered news

@xevix.bsky.social

Software Developer interested in data, web, languages. Silicon Valley/Tokyo.

https://medium.com/@xevix
https://github.com/xevix

Posts Replies Media Videos

xevix.bsky.social

@xevix.bsky.social

It’s interesting the tradeoffs if the main goal is no operating cost and decent startup time. Definitely painful to develop on regularly but for a one and done this makes a lot of sense. I wonder if Rust compile times will come down further one day.

October 4, 2025 at 10:21 PM

xevix.bsky.social

@xevix.bsky.social

I didn’t quite make it in time for Hive filtering lazy list to speed up filtering Hive folder with many partitions, but will pick up again before next release w/ luck 🙇‍♂️ github.com/duckdb/duckd...

Push Hive filtering into Glob() by xevix · Pull Request #18518 · duckdb/duckdb

Summary Addresses part of #7620 for local filesystem. Part 1 of the work split off from the original PR #18430. The next part will handle fallback to eager loading in case of Hive issues. Push down...

github.com

September 16, 2025 at 4:25 PM

xevix.bsky.social

@xevix.bsky.social

Yeah, I don’t think MS is interested in 3rd party devs so much.

August 26, 2025 at 4:45 PM

xevix.bsky.social

@xevix.bsky.social

Unfortunately yes, I was already going to get one for something else and this put me over the edge. Maybe I'll also build a gaming rig one day in the distant future haha.

August 25, 2025 at 10:12 PM

xevix.bsky.social

@xevix.bsky.social

Just the 1 day of data above is ~125GiB compressed, ~585GiB uncompressed. One month is about 3.75TiB compressed, or 17.5TiB. It makes sense this dataset is so popular for testing and analysis, wow.

August 15, 2025 at 3:19 AM

xevix.bsky.social

@xevix.bsky.social

Uploaded a simplified query. Had to delete and repost since no edit button on the snippets site, sorry for the spam haha.

August 13, 2025 at 6:08 AM

xevix.bsky.social

@xevix.bsky.social

Haha already posted in case someone benefits there too.

August 13, 2025 at 5:13 AM

xevix.bsky.social

@xevix.bsky.social

Automator using simple shell script to call sqlfluff. Added keyboard shortcut for the service too. Easier than making browser extensions for each browser, although unfortunately not cross-platform.

June 29, 2025 at 7:46 PM

xevix.bsky.social

@xevix.bsky.social

The pieces were already there, but progress is not always linear. Majority of cases now handled by [Mother]Duck[DB|Lake], Spark for extreme cases. Single-node compute 10 years from now is going to be mindblowing, but already exciting what we can do today.

June 26, 2025 at 11:17 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news