Robin Linacre
robinlinacre.bsky.social
Robin Linacre
@robinlinacre.bsky.social
Lead developer of Splink. Data scientist at Ministry of Justice. Trustee, GiveDirectly UK. Pledgee, http://givingwhatwecan.org. All views my own.
No worries - thanks for the report on the repo, we'll take a look
October 2, 2025 at 1:08 PM
(Incidentally, uk_address_matcher should work ok for non-UK addresses, that's just no our focus. See examples here for how to use the package github.com/moj-analytic...)
uk_address_matcher/examples at main · moj-analytical-services/uk_address_matcher
Contribute to moj-analytical-services/uk_address_matcher development by creating an account on GitHub.
github.com
October 2, 2025 at 6:21 AM
Did you try github.com/moj-analytic...?

The trie is WIP, but the idea is that it will be used as an initial step to skim off the easy ones. The remainder will go through to the main matching phase which already exists in uk_address_matcher, but is more computationally intensive
GitHub - moj-analytical-services/uk_address_matcher
Contribute to moj-analytical-services/uk_address_matcher development by creating an account on GitHub.
github.com
October 2, 2025 at 6:21 AM
You select the columns you want, and it handles the joins for you.

It's just a rough sketch for now. I feel like it must have done before, but couldn't find anything. Feedback welcome!
August 18, 2025 at 6:40 AM
Then give output to VS Code copilot in agent mode to implement
July 11, 2025 at 8:33 AM
I think more blocking stage. UK blocking is relatively easy because postcode gets you down to about 50 or fewer addresses. So if your postcodes are accurate, blocking isn't too hard. For addresses outside UK, you might need to lean more heavily on the signature based approaches
July 5, 2025 at 9:01 PM
Visual Fraction Addition
rupertlinacre.com
May 22, 2025 at 10:31 PM
It's pretty easy to set up a markdown-based blog using github pages for free. Custom styling is much easier now we're in the world of ChatGPT!
March 16, 2025 at 8:19 PM