Thomas
@thomashaighton.com
Metadata specialist @ National Library of the Netherlands KB.nl | Data QA | Digital Library Standards | Python | AI | Code | Data | Audio | Music

Dutch/English
Reposted by Thomas
New database for #CorpusLinguistics: Historical Corpus of Dutch, hcd.ivdnt.org/corpus-front... #Linguistics #EarlyModern - "a diachronic, regionally balanced, multigenre corpus of written #Dutch" from C16 to C19.
Historical Corpus of Dutch (HCD) search
Historical Corpus of Dutch (HCD) provided by the Dutch Language Institute in Leiden.
hcd.ivdnt.org
April 26, 2025 at 6:21 AM
Reposted by Thomas
Trump wants a military parade, but there's no way he's going to top this.
www.youtube.com/watch?v=vhQk...
French army band medleys Daft Punk following Bastille Day parade
YouTube video by Guardian News
www.youtube.com
April 9, 2025 at 9:11 AM
Reposted by Thomas
We received a huge number of responses from you to our call on the Dutch government to immediately leave the social media platforms of Big Tech companies, such as X and Instagram.

Want to know more about the call? www.bitsoffreedom.nl/campagnes/op... 📢
January 30, 2025 at 11:35 AM
Reposted by Thomas
While we’re banning books…

Finland is teaching children in school how to recognize fake news and propaganda as part of critical thinking and civic responsibility. Some of this will seem very familiar.

Be. Like. Finland.
January 4, 2025 at 6:01 PM
January 1, 2025 at 5:29 PM
Reposted by Thomas
Don't think there is a better illustration of AI-in-everything-whether-we-want-it-or-not than an AI bro buying an AI toy for his child who played with it for a bit and then seemed singularly unimpressed with the AI and turned it off.
He keeps turning it back on, and she keeps turning it off.
December 27, 2024 at 12:20 PM
Reposted by Thomas
This article highlights why more “crazy motherfuckers” are likely going to kill more extremely wealthy, “heartless motherfuckers.”

Society has reached the tipping point on inequities. Especially those leaving the masses to suffer and die.

www.cnbc.com/2018/04/11/g...
Goldman Sachs asks in biotech research report: 'Is curing patients a sustainable business model?'
Goldman Sachs warns sales from the most successful disease treatments are difficult to maintain.
www.cnbc.com
December 18, 2024 at 7:41 AM
Reposted by Thomas
"One of the great tragedies of mankind is that morality has been hijacked by religion. So now people assume that religion and morality have a necessary connection. But the basis of morality is really very simple and doesn't require religion at all."

-- Arthur C. Clarke, born #OTD 1917
December 16, 2024 at 11:09 AM
Reposted by Thomas
glitched sign + one of those pointless TikTok-style ads = unintentional cyberpunk at the Whole Foods
December 12, 2024 at 4:38 AM
Good material for library staff who use AI or develop AI themselves.
📣 New zine!! "A Librarian Against AI; or, I Think AI Should Leave" is a 40-page zine about why we should think twice about using & supporting generative AI. violetbfox.info/against-ai/ #noAI #zines
December 6, 2024 at 9:02 AM
Reposted by Thomas
This is why I was so disturbed to overhear some older woman in hospital a few years ago saying that we should move to an insurance system. I thought: you have no idea. You would be denied healthcare under that system.
“Denied by AI,” the multi-part STAT News investigation of how #UnitedHealthcare used an opaque algorithmic system to deny care to people who needed it is a #mustread www.statnews.com/2023/03/13/m...
December 6, 2024 at 7:49 AM
Reposted by Thomas
Breaking News: In the first case of its kind in Canada, major news outlets are suing OpenAI, accusing the company of illegally using their content.
Major Canadian News Outlets Sue OpenAI In New Copyright Case
A coalition of some of Canada’s biggest media companies is seeking billions of dollars in compensation for what they say is copyright infringement on their work through ChatGPT.
www.nytimes.com
November 29, 2024 at 7:57 PM
Reposted by Thomas
Former ICC chief prosecutor says she faced threats and ‘thug-style tactics’
www.theguardian.com/law/2024/nov...
Former ICC chief prosecutor says she faced threats and ‘thug-style tactics’
Fatou Bensouda says she and her family were subjected to ‘direct threats’ while working on the most sensitive cases
www.theguardian.com
November 27, 2024 at 11:32 AM
Reposted by Thomas
For the first time anywhere in the world, a hospital has developed a drug against metastatic skin cancer without financial support from Big Pharma. The Antoni van Leeuwenhoek hospital created a treatment that is five times cheaper than comparable treatments. www.ftm.nl/artikelen/ru...
Developing a cancer drug without Big Pharma: this hospital shows it can be done
The Antoni van Leeuwenhoek hospital has a world first: without the help of commercial investors, it developed a treatment for metastatic cancer. The treatment is five times che...
www.ftm.nl
November 27, 2024 at 7:57 AM
Reposted by Thomas
Big Tech is training AI on thousands of TV/film scripts. I spoke to showrunners, IP and business affairs lawyers — as well as the programmer at The Atlantic who examined this data set — to find out what happens next

The latest Series Business @theankler.bsky.social
theankler.com/p/tv-writers...
TV Writers Found 139,000 of Their Scripts Trained AI. Hell Broke Loose
It's 'organized crime,' says one, as scribes from Shonda Rhimes to Robert King face a system where studio policy and law remain murky
theankler.com
November 25, 2024 at 6:46 PM
Reposted by Thomas
Trying out Docling for PDF info extraction and it’s surprisingly good! The chunk feature splits PDFs into metadata (labels like headers, paragraphs, tables etc.) and text (actual content). @tedunderwood.me seems like the PDF problem is close to being solved!
github.com/DS4SD/docling
#NLPSky
GitHub - DS4SD/docling: Get your documents ready for gen AI
Get your documents ready for gen AI. Contribute to DS4SD/docling development by creating an account on GitHub.
github.com
November 26, 2024 at 1:17 AM
A Dutch version might be interesting
My latest labeler is finally working!

What it shows for each politician

- 3 labels for the top corp donators
- 3 labels for the top industries that donate to them

Subscribe to this labeler to be more aware of the interests our politicians represent 🚀

bsky.app/profile/us-g...
November 23, 2024 at 6:50 AM
Reposted by Thomas
A growing body of scientific research shows that the materialist worldview is wrong, says philosopher and computer technologist Bernardo Kastrup. Our consciousness does not arise from our brain. Our brain arises from consciousness.
https://buff.ly/3ZmkyYE
Computers with consciousness? ‘Bullshit’, says philosopher Bernardo Kastrup. ‘They only process data’
A growing body of scientific research shows that the materialist worldview is wrong, says philosopher and computer technologist Bernardo Kastrup. Our consciousness does not arise from our brain.…
buff.ly
November 22, 2024 at 10:35 AM
We need transparency about the datasets used, now. There should probably be laws on using copyrighted material to train AI models. Maybe an official label for datasets that have been examined and found to be ethically composed.
November 22, 2024 at 7:01 AM
Reposted by Thomas
The first version of the 'Human Cell Atlas' is here, a gigantic database with images and descriptions of all the cells in the human body. Its makers hope it will help scientists better understand all kinds of diseases. https://buff.ly/3Z0mDIg
Mapping the landscape of all human cells
Medicine: With the description of all the cells in a human body, scientists will also better understand all kinds of diseases, the makers of the Human Cell Atlas hope.
buff.ly
November 20, 2024 at 4:06 PM
#eurorack first setup for ambient type stuff.
November 19, 2024 at 3:47 PM
#aiiir Den Haag
November 19, 2024 at 3:00 PM
DOMi & JD Beck - Madvillainy Tribute (Madlib - MF DOOM)
YouTube video by DOMi & JD BECK
www.youtube.com
November 19, 2024 at 2:31 PM