Jörg Lehmann
@jrglmn.bsky.social
Digital humanism | machine learning | digital cultural heritage | Berlin State Library |
„Name a bias – we have it!“
A colleague from the UK and I offer our two cents on a highly divisive issue, written from the perspective of European cultural heritage institutions (CHIs) and research libraries.

A Position Paper on AI and Copyrights in Cultural Heritage and Research (EU and UK)

doi.org/10.5334/johd.290

#genAI #commons #openness
A Position Paper on AI and Copyrights in Cultural Heritage and Research (EU and UK) | Journal of Open Humanities Data
April 2, 2025 at 3:35 PM
Sigh. The topic of #openness, intellectual property rights (#IPR), and #genAI is getting really complicated for #GLAM institutions.
I wrote a blog post charting what is happening in the EU and what we currently need:

mmk.sbb.berlin/2024/03/13/o...

Currently, there is no technical solution to implement an opt-out...
Orientation in Turbulent Times – Mensch.Maschine.Kultur
March 14, 2024 at 2:42 PM
New post on the "power hungry magic" of contemporary artificial intelligence published on the blog of the HumanMachineCulture project:
Energy, CO2 intensity and sustainability as mostly overlooked issues in the deployment of GPTs.

mmk.sbb.berlin/2024/01/26/p...

#LLMs #ChatGPT #metaverse
Power Hungry Magic – Mensch.Maschine.Kultur
January 26, 2024 at 3:40 PM
Copyright is but one indicator of the value of digital texts, which have gone through a quality filter called "publishing houses". The same can apply to texts in the public domain, and GLAM institutions should reflect on this. Open-access texts are valuable as well; see
doi.org/10.54900/zg9...
January 18, 2024 at 9:32 PM
Dutch National Library restricts access for commercial AI
Blocking is done via robots.txt, so the listed crawlers are excluded regardless of copyright status. Consequently, public domain material is not accessible to them either. The restriction is selective and names Googlebot-image, dataforseo.com, GPTBot, and ChatGPT-User.
January 14, 2024 at 8:58 AM
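To illustrate how such a selective robots.txt block behaves, here is a minimal Python sketch using the standard library's `urllib.robotparser`. The robots.txt text and the URL are assumptions made up for illustration, not the library's actual file; the agent names are simply copied from the post above and may differ from the tokens the crawlers really announce.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt, written for illustration only. It is NOT the
# Dutch National Library's actual file; agent names are taken from the post.
SAMPLE_ROBOTS_TXT = """\
User-agent: Googlebot-image
User-agent: dataforseo.com
User-agent: GPTBot
User-agent: ChatGPT-User
Disallow: /
"""

# Hypothetical URL standing in for a public domain item.
SAMPLE_URL = "https://library.example/public-domain/item-1"


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # "SomeOtherBot" is a made-up crawler that the sample file does not list.
    for agent in ("GPTBot", "ChatGPT-User", "Googlebot-image", "SomeOtherBot"):
        verdict = "allowed" if is_allowed(SAMPLE_ROBOTS_TXT, agent, SAMPLE_URL) else "blocked"
        print(f"{agent}: {verdict}")
```

In this sketch the rule `Disallow: /` applies to every path, so the listed agents are refused the public domain item as well, while an unlisted crawler is still admitted. Note that robots.txt is only a request: it works for crawlers that choose to honor it.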
New post "Feeding the cuckoo" published on the blog of the MMK project, focusing on privacy issues in large language models, especially Google's Bard (my friend, the poet).

mmk.sbb.berlin/2024/01/12/f...

#LLMs #privacy #ChatGPT #ethics #elsi
Feeding the Cuckoo – Mensch.Maschine.Kultur
January 12, 2024 at 2:23 PM
Power Hungry Processing

Luccioni, Jernite & Strubell, November 2023

"the most efficient text generation model uses as much energy as 16% of a full smartphone charge for 1,000 inferences, whereas the least efficient image generation model uses as much energy as 950 smartphone charges (11.49 kWh)"
Power Hungry Processing: Watts Driving the Cost of AI Deployment?
Recent years have seen a surge in the popularity of commercial AI products based on generative, multi-purpose AI systems promising a unified approach to building machine learning (ML) models into...
doi.org
January 9, 2024 at 3:40 PM
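The figures in the quote above can be put on a common scale with a little arithmetic. The sketch below uses only the three numbers stated in the quote and assumes that both comparisons refer to the same basis of 1,000 inferences; the variable names are illustrative.

```python
# Figures taken from the quoted sentence.
IMAGE_MODEL_KWH = 11.49            # least efficient image generation model
IMAGE_MODEL_CHARGES = 950          # the same energy, in smartphone charges
TEXT_MODEL_CHARGE_FRACTION = 0.16  # most efficient text generation model

# Energy of one smartphone charge implied by "950 charges (11.49 kWh)".
kwh_per_charge = IMAGE_MODEL_KWH / IMAGE_MODEL_CHARGES

# Energy of the most efficient text model, in watt-hours.
text_model_wh = TEXT_MODEL_CHARGE_FRACTION * kwh_per_charge * 1000

# How many times more energy the image model uses than the text model
# (assumption: both figures are per 1,000 inferences).
ratio = IMAGE_MODEL_CHARGES / TEXT_MODEL_CHARGE_FRACTION

print(f"Implied energy per smartphone charge: {kwh_per_charge * 1000:.1f} Wh")
print(f"Most efficient text model: {text_model_wh:.1f} Wh")
print(f"Image model vs. text model: about {ratio:.0f}x")
```

Under these assumptions, one smartphone charge corresponds to about 12 Wh, the most efficient text generation model to about 1.9 Wh, and the gap between the two extremes is a factor of roughly 5,900.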
I wrote a blogpost on LLMs and anthropomorphism for the blog of our project:

mmk.sbb.berlin/2023/12/20/h...

People who are a bit lonely before Christmas may want to read it…
Human-Machine-Cognition – Mensch.Maschine.Kultur
December 23, 2023 at 2:28 PM
Datasheets for Digital Cultural Heritage Datasets:

doi.org/10.5334/johd.124

What are the characteristics of digital cultural heritage datasets? What would dataset documentation look like?
We formulate a series of recommendations and propose a datasheet template, see:
doi.org/10.5281/ZENODO.8375033
December 22, 2023 at 5:02 PM