Sebastian Majstorovic
banner
storytracer.com
Sebastian Majstorovic
@storytracer.com
Open Data Consultant for @eleutherai.bsky.social & Digital History Advisor for @eui-history.bsky.social. Co-founder of @datarescueproject.org and @sucho-org.bsky.social. Website: https://www.storytracer.com/
Stanford created a similar tool for the Roman Empire more than a decade ago: orbis.stanford.edu. ORBIS lets you calculate travel times by land, river, and sea, with options for different modes of transport and travel speeds. It's truly an amazing resource and I'm so grateful they keep hosting it.
October 25, 2025 at 4:19 PM
If you still want to give ocrmypdf a shot you can try installing it using homebrew. It might help to avoid dependency hell. In principle ocrmypdf is a pretty good. formulae.brew.sh/formula/ocrm...
ocrmypdf
Homebrew’s package index
formulae.brew.sh
September 24, 2025 at 2:52 AM
It‘s a great project. It can‘t reassemble PDFs yet though, it can only extract the content as Markdown, plain text, etc. I‘ve been meaning to write a plugin for Docling to reassemble PDFs using the methodology from ocrmypdf. Your post is a reminder for me to tackle this sometime soon 😅.
September 24, 2025 at 2:30 AM
Give Docling a try! It can using OCR using a tesseract backend as well as the latest state-of-the-art VLM models. github.com/docling-proj...
GitHub - docling-project/docling: Get your documents ready for gen AI
Get your documents ready for gen AI. Contribute to docling-project/docling development by creating an account on GitHub.
github.com
September 24, 2025 at 2:24 AM
Perhaps Omeka-S with the IIIF presentation module or the IIIF Server module enabled? omeka.org/s/docs/user-... gitlab.com/Daniel-KM/Om...
IIIF Presentation - Omeka S User Manual
omeka.org
September 22, 2025 at 6:43 PM