xevix.bsky.social
@xevix.bsky.social
Software Developer interested in data, web, languages. Silicon Valley/Tokyo.

https://medium.com/@xevix
https://github.com/xevix
It’s interesting the tradeoffs if the main goal is no operating cost and decent startup time. Definitely painful to develop on regularly but for a one and done this makes a lot of sense. I wonder if Rust compile times will come down further one day.
October 4, 2025 at 10:21 PM
I didn’t quite make it in time for Hive filtering lazy list to speed up filtering Hive folder with many partitions, but will pick up again before next release w/ luck 🙇‍♂️ github.com/duckdb/duckd...
Push Hive filtering into Glob() by xevix · Pull Request #18518 · duckdb/duckdb
Summary Addresses part of #7620 for local filesystem. Part 1 of the work split off from the original PR #18430. The next part will handle fallback to eager loading in case of Hive issues. Push down...
github.com
September 16, 2025 at 4:25 PM
Yeah, I don’t think MS is interested in 3rd party devs so much.
August 26, 2025 at 4:45 PM
Unfortunately yes, I was already going to get one for something else and this put me over the edge. Maybe I'll also build a gaming rig one day in the distant future haha.
August 25, 2025 at 10:12 PM
Just the 1 day of data above is ~125GiB compressed, ~585GiB uncompressed. One month is about 3.75TiB compressed, or 17.5TiB. It makes sense this dataset is so popular for testing and analysis, wow.
August 15, 2025 at 3:19 AM
Uploaded a simplified query. Had to delete and repost since no edit button on the snippets site, sorry for the spam haha.
August 13, 2025 at 6:08 AM
Haha already posted in case someone benefits there too.
August 13, 2025 at 5:13 AM
Automator using simple shell script to call sqlfluff. Added keyboard shortcut for the service too. Easier than making browser extensions for each browser, although unfortunately not cross-platform.
June 29, 2025 at 7:46 PM
The pieces were already there, but progress is not always linear. Majority of cases now handled by [Mother]Duck[DB|Lake], Spark for extreme cases. Single-node compute 10 years from now is going to be mindblowing, but already exciting what we can do today.
June 26, 2025 at 11:17 PM