Neal Caren
haphazardsoc.bsky.social
AOL Keyword: Carolina Sociology

[Coding Snippets](https://nealcaren.github.io/notes/)
[Website](http://nealcaren.org)
His The Awful Truth “Beat the Rich” youtu.be/akEI9ZwtKcM?...
Beat The Rich / The Sodomobile
YouTube video by Cineverse
November 4, 2025 at 12:09 AM
I've tried Tesseract and EasyOCR, and neither performs well on this corpus. ABBYY FineReader probably does better than those, but I need to restart my license.

Preliminary checks on Tesseract+LLM for cleaning are very promising for printed works.
September 15, 2025 at 11:36 AM
Your LLM structured data stuff was 100% the impetus for turning this into a more systematic process.
September 11, 2025 at 9:02 PM
🤗 Datasets?
July 6, 2025 at 8:39 PM
This is awesome!
May 2, 2025 at 1:56 PM