philpax
banner
philpax.me
philpax
@philpax.me
creative engineer · Rust, gamedev/modding, reverse engineering, writing, AI, XR, etc

a better world is possible · 🇦🇺🇸🇪🏳️‍🌈
Reposted by philpax
Dota 2
November 17, 2025 at 6:01 PM
thinking about how unbelievably fire the umineko ost is again
December 26, 2025 at 1:14 PM
Reposted by philpax
concept: a bluesky LLM bot account called [your username]'s secretary, and any time some asshole gets up in your mentions you flick a switch and said bot goes out of their way to waste their time with nitpicking replies
September 7, 2025 at 4:31 AM
Reposted by philpax
Package managers keep using git as a database, it never works out.

https://nesbitt.io/2025/12/24/package-managers-keep-using-git-as-a-database.html
Package managers keep using git as a database, it never works out
Using git as a database is a seductive idea. You get version history for free. Pull requests give you a review workflow. It’s distributed by design. GitHub will host it for free. Everyone already knows how to use it. Package managers keep falling for this. And it keeps not working out. ## Cargo The crates.io index started as a git repository. Every Cargo client cloned it. This worked fine when the registry was small, but the index kept growing. Users would see progress bars like “Resolving deltas: 74.01%, (64415/95919)” hanging for ages, the visible symptom of Cargo’s libgit2 library grinding through delta resolution on a repository with thousands of historic commits. The problem was worst in CI. Stateless environments would download the full index, use a tiny fraction of it, and throw it away. Every build, every time. RFC 2789 introduced a sparse HTTP protocol. Instead of cloning the whole index, Cargo now fetches files directly over HTTPS, downloading only the metadata for dependencies your project actually uses. (This is the “full index replication vs on-demand queries” tradeoff in action.) By April 2025, 99% of crates.io requests came from Cargo versions where sparse is the default. The git index still exists, still growing by thousands of commits per day, but most users never touch it. ## Homebrew GitHub explicitly asked Homebrew to stop using shallow clones. Updating them was “an extremely expensive operation” due to the tree layout and traffic of homebrew-core and homebrew-cask. Users were downloading 331MB just to unshallow homebrew-core. The .git folder approached 1GB on some machines. Every `brew update` meant waiting for git to grind through delta resolution. Homebrew 4.0.0 in February 2023 switched to JSON downloads for tap updates. The reasoning was blunt: “they are expensive to git fetch and git clone and GitHub would rather we didn’t do that… they are slow to git fetch and git clone and this provides a bad experience to end users.” Auto-updates now run every 24 hours instead of every 5 minutes, and they’re much faster because there’s no git fetch involved. ## CocoaPods CocoaPods is the package manager for iOS and macOS development. It hit the limits hard. The Specs repo grew to hundreds of thousands of podspecs across a deeply nested directory structure. Cloning took minutes. Updating took minutes. CI time vanished into git operations. GitHub imposed CPU rate limits. The culprit was shallow clones, which force GitHub’s servers to compute which objects the client already has. The team tried various band-aids: stopping auto-fetch on `pod install`, converting shallow clones to full clones, sharding the repository. The CocoaPods blog captured it well: “Git was invented at a time when ‘slow network’ and ‘no backups’ were legitimate design concerns. Running endless builds as part of continuous integration wasn’t commonplace.” CocoaPods 1.8 gave up on git entirely for most users. A CDN became the default, serving podspec files directly over HTTP. The migration saved users about a gigabyte of disk space and made `pod install` nearly instant for new setups. ## Go modules Grab’s engineering team went from 18 minutes for `go get` to 12 seconds after deploying a module proxy. That’s not a typo. Eighteen minutes down to twelve seconds. The problem was that `go get` needed to fetch each dependency’s source code just to read its go.mod file and resolve transitive dependencies. Cloning entire repositories to get a single file. Go had security concerns too. The original design wanted to remove version control tools entirely because “these fragment the ecosystem: packages developed using Bazaar or Fossil, for example, are effectively unavailable to users who cannot or choose not to install these tools.” Beyond fragmentation, the Go team worried about security bugs in version control systems becoming security bugs in `go get`. You’re not just importing code; you’re importing the attack surface of every VCS tool on the developer’s machine. GOPROXY became the default in Go 1.13. The proxy serves source archives and go.mod files independently over HTTP. Go also introduced a checksum database (sumdb) that records cryptographic hashes of module contents. This protects against force pushes silently changing tagged releases, and ensures modules remain available even if the original repository is deleted. ## Beyond package managers The same pattern shows up wherever developers try to use git as a database. Git-based wikis like Gollum (used by GitHub and GitLab) become “somewhat too slow to be usable” at scale. Browsing directory structure takes seconds per click. Loading pages takes longer. GitLab plans to move away from Gollum entirely. Git-based CMS platforms like Decap hit GitHub’s API rate limits. A Decap project on GitHub scales to about 10,000 entries if you have a lot of collection relations. A new user with an empty cache makes a request per entry to populate it, burning through the 5,000 request limit quickly. If your site has lots of content or updates frequently, use a database instead. Even GitOps tools that embrace git as a source of truth have to work around its limitations. ArgoCD’s repo server can run out of disk space cloning repositories. A single commit invalidates the cache for all applications in that repo. Large monorepos need special scaling considerations. ## The pattern The hosting problems are symptoms. The underlying issue is that git inherits filesystem limitations, and filesystems make terrible databases. **Directory limits.** Directories with too many files become slow. CocoaPods had 16,000 pod directories in a single Specs folder, requiring huge tree objects and expensive computation. Their fix was hash-based sharding: split directories by the first few characters of a hashed name, so no single directory has too many entries. Git itself does this internally with its objects folder, splitting into 256 subdirectories. You’re reinventing B-trees, badly. **Case sensitivity.** Git is case-sensitive, but macOS and Windows filesystems typically aren’t. Check out a repo containing both `File.txt` and `file.txt` on Windows, and the second overwrites the first. Azure DevOps had to add server-side enforcement to block pushes with case-conflicting paths. **Path length limits.** Windows restricts paths to 260 characters, a constraint dating back to DOS. Git supports longer paths, but Git for Windows inherits the OS limitation. This is painful with deeply nested node_modules directories, where `git status` fails with “Filename too long” errors. **Missing database features.** Databases have CHECK constraints and UNIQUE constraints; git has nothing, so every package manager builds its own validation layer. Databases have locking; git doesn’t. Databases have indexes for queries like “all packages depending on X”; with git you either traverse every file or build your own index. Databases have migrations for schema changes; git has “rewrite history and force everyone to re-clone.” The progression is predictable. Start with a flat directory of files. Hit filesystem limits. Implement sharding. Hit cross-platform issues. Build server-side enforcement. Build custom indexes. Eventually give up and use HTTP or an actual database. You’ve built a worse version of what databases already provide, spread across git hooks, CI pipelines, and bespoke tooling. None of this means git is bad. Git excels at what it was designed for: distributed collaboration on source code, with branching, merging, and offline work. The problem is using it for something else entirely. Package registries need fast point queries for metadata. Git gives you a full-document sync protocol when you need a key-value lookup. If you’re building a package manager and git-as-index seems appealing, look at Cargo, Homebrew, CocoaPods, Go. They all had to build workarounds as they grew, causing pain for users and maintainers. The pull request workflow is nice. The version history is nice. You will hit the same walls they did.
nesbitt.io
December 24, 2025 at 4:49 PM
ngl kinda impressed that the Wikipedia-dump-parsing code that I wrote with a 24-core Threadripper/128GB of RAM in mind seems to be scaling down just fine to my 7-year-old 4-core ThinkPad/16GB of RAM

putting this down as another W for the Rust programming language
December 25, 2025 at 1:42 PM
thinking about buying a new portable gaming device to play VNs and such, noting that this would mostly be for bed/couch/long train/plane use. do I use next year's hardware budget to:

1) sell my Steam Deck and buy a Steam Deck OLED
2) buy a Steam Frame
3) buy an AYN Thor
December 25, 2025 at 12:03 PM
i wonder if anyone has made an anki interface that's optimised for Steam Deck
December 25, 2025 at 10:02 AM
received the Nebula subscription renewal email and figured I'd go to cancel it, given I have no easy way of watching it on my TV, but at 30 USD/year I can forget about it until next year

I really like the idea of Nebula but it's just nowhere near as convenient or as accessible as YouTube
December 25, 2025 at 8:22 AM
Reposted by philpax
Atheists would rather skeet than go to therapy
December 25, 2025 at 6:55 AM
least complicated SciADV route branching guide
December 24, 2025 at 2:51 PM
Reposted by philpax
damn now I need to come up with something for 2026
in 2025 I will probably go on the computer
December 24, 2025 at 9:55 AM
Reposted by philpax
Just published a new #rustlang reqwest release candidate: v0.13.0-rc.1.

This has some breaking changes, the biggest was switching to rustls by default.

I'd appreciate if you gave it a spin 🙏

github.com/seanmonstar/...
Release v0.13.0-rc.1 · seanmonstar/reqwest
👀 Discussion here if you give it try, thanks! Main breaking changes rustls is now default instead of native-tls rustls provider defaults to aws-lc instead of ring (rustls-no-provider exists if you...
github.com
December 23, 2025 at 10:03 PM
Reposted by philpax
While we could all unite against Go instead…
Haskell devs talking shit about rust is exactly as tiring as rust devs talking shit about Haskell.
December 23, 2025 at 2:38 PM
Reposted by philpax
human user speaks fluent Claude, instance shocked
December 22, 2025 at 3:43 AM
Reposted by philpax
Messing around with creating posters out of open model weights. I made a tool you can run yourself on any open model (though I suspect there will be plenty of bugs!)

uvx owart mlx-community/Llama-3.2-1B-Instruct-4bit

Code here: github.com/samwho/owart, readme explains the process at a high level.
December 23, 2025 at 1:10 PM
one day i will `wget` the model instead of the HTML file pointing to the model from Hugging Face, but today is not that day
December 23, 2025 at 11:34 AM
how are people reusing their Claude plans to power their personal agents? are they using Claude Code as the agent harness and gluing their own tech into it, or are they using the internal APIs that CC uses?
December 23, 2025 at 10:55 AM
should write a Claude Code hook that immediately and violently rebukes it when it writes "simplified approach" or "for now" and forces it to do the task properly
December 23, 2025 at 3:04 AM
Reposted by philpax
windbg would surely cause lifelong trauma for a 3 year old
December 22, 2025 at 12:45 PM
realising with some horror that my website has never had a font set for its codeblocks and the only reason I thought it did is because my workstation's browser had my preferred font already set for monospace
December 22, 2025 at 1:11 PM
jyn's writing style has done a fantastic job of making the cunts self-report here news.ycombinator.com/item?id=4628...
I'm just having fun | Hacker News
news.ycombinator.com
December 22, 2025 at 10:24 AM
Reposted by philpax
anna's archive scraped all of spotify, so now we have the coolest graph ever

the most used key is C and the least used one is D# (by a lot !!)
December 21, 2025 at 10:40 AM
December 21, 2025 at 11:24 AM
Reposted by philpax
June 11, 2025 at 9:28 AM
finally finished Chaos;Child; it was certainly much more cohesive than Chaos;Head, with a protagonist that I felt much less inclined to throttle, but I don't think it hit as hard as C;H

C;H is much messier and suffers from being the first in the series, but I think it took much bigger swings
December 20, 2025 at 3:16 PM