Laure Thompson
laurejt.bsky.social
Laure Thompson
@laurejt.bsky.social
Research software engineer @princetoncdh.bsky.social.

natural language processing, machine learning, cultural analytics

https://laurejt.github.io/
Reposted by Laure Thompson
In unexpected, kinda terrifying life turns—I’m trying to get a small leather goods store off the ground this holiday season. Please consider checking it out! Thus far: keychains, valet trays, bag charms, a few custom wallets and notebook covers.

goldberry.studio or StudioGoldberry on Etsy
November 11, 2025 at 2:26 AM
Reposted by Laure Thompson
📣 New preprint! We know humans are biased against AI-creativity. But what about LLMs, now often judging creativity in various contexts? Do they replicate, transform, or amplify this bias? We tested it. Turns out: AI is 2.5X more biased against its own work than humans. arxiv.org/pdf/2510.08831 🧵
arxiv.org
October 13, 2025 at 1:56 PM
Reposted by Laure Thompson
We are launching our Graduate School Application Financial Aid Program (www.queerinai.com/grad-app-aid) for 2025-2026. We’ll give up to $750 per person to LGBTQIA+ STEM scholars applying to graduate programs. Apply at openreview.net/group?id=Que.... 1/5
Grad App Aid — Queer in AI
www.queerinai.com
October 9, 2025 at 12:37 AM
Reposted by Laure Thompson
NEH Collaborative research grant program now explicitly prohibits digital projects. Output should be a book or journal issue. FFS
www.neh.gov/grants/resea...
Collaborative Research
Supports groups of two or more scholars engaging in significant and sustained research in the humanities.
www.neh.gov
September 25, 2025 at 5:06 PM
Reposted by Laure Thompson
I'm hiring a mid-level full-stack SWE! Our team at @jstor.bsky.social Labs is looking for yet another product-minded engineer to join our team. We come from all kinds of backgrounds, tech and non-tech alike.

Please apply or send to your awesome friends, and DM me with ?s: grnh.se/19o370345us
Software Engineer (Full-stack) - ITHAKA
ITHAKA’s mission is to expand access to knowledge and education around the world. Our services — Artstor, JSTOR, Portico, and Ithaka S+R — enable people everywhere to learn, to grow, and to overcome b...
grnh.se
September 22, 2025 at 7:48 PM
Reposted by Laure Thompson
Excited to be co-editing a special issue of @dhquarterly.bsky.social on Artificial Intelligence for Digital Humanities: Research problems and critical approaches
dhq.digitalhumanities.org/news/news.html

We're inviting abstracts now - please feel free to reach out with any questions!
DHQ: Digital Humanities Quarterly: News
dhq.digitalhumanities.org
September 9, 2025 at 8:28 PM
Reposted by Laure Thompson
Please share! We're trying to crowd-source a dataset of post-1929 novels with maps.
We are trying to create a list of in-copyright novels that contain maps. If you know of some, drop them in the thread below! 🧵👇
August 28, 2025 at 2:55 PM
Reposted by Laure Thompson
Check out Rebecca Hicke @dmimno.bsky.social piece on LLMs! “Language models have the ability to identify the characteristics of much shorter literary passages than was thought feasible with traditional stylometry. We evaluate authorship and genre detection for a new corpus of literary novels…”
August 19, 2025 at 6:17 PM
Reposted by Laure Thompson
For PW, I wrote about the persistent gender gap in fictional animal characters—a pattern I noticed while analyzing 100s of picture books with @puddingviz.bsky.social.

It's a more interesting (and pervasive) problem than I first thought.

#kidlit #booksky

🔗: www.publishersweekly.com/pw/by-topic/...
August 5, 2025 at 11:29 PM
Reposted by Laure Thompson
The @princetoncdh.bsky.social newsletter is always a treat (and you should subscribe if you dont) but having an exclusive interview with Ted Chiang sort of takes the cake!
mailchi.mp/princeton/au...
Q&A with Ted Chiang 🤖
mailchi.mp
August 14, 2025 at 1:18 PM
Reposted by Laure Thompson
If you’re an incoming Princeton 🐯 this fall, consider joining FRS159 - a unique chance to dive into technology, culture and African Languages. More about the course in the post below.
Thinking about enrolling in @happybuzaaba1.bsky.social's first-year seminar, Teaching Computers to Understand African Languages (FRS 159), this fall? Check out the feature story on it from earlier this year: www.princeton.edu/news/2025/02...

Course: registrar.princeton.edu/course-offer...
August 12, 2025 at 4:30 PM
Reposted by Laure Thompson
Octavia Butler’s Parable of the Sower was published in 1993 and starts in 2024—a 31-year leap. Are creators imagining futures that are closer or further away?

Explore a *new* dataset of 2.5k narrative works set in the future, each tagged with its release year and setting.

doi.org/10.18737/552...
June 25, 2025 at 9:01 AM
Reposted by Laure Thompson
I just pulled out my copy of Newton's Principia, a book that was originally written in Latin, and noticed that the translation I am using was completed with support from the National Endowment for the Humanities, which has basically been destroyed

Science depends on the humanities 🧪
June 17, 2025 at 3:40 PM
Reposted by Laure Thompson
Check out the camera-ready version of our ICML position paper ("Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge") to learn more!!! arxiv.org/abs/2502.00561

(6/6)
Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge
The measurement tasks involved in evaluating generative AI (GenAI) systems lack sufficient scientific rigor, leading to what has been described as "a tangle of sloppy tests [and] apples-to-oranges com...
arxiv.org
June 15, 2025 at 12:20 AM
Reposted by Laure Thompson
Today we released Institutional Books 1.0, a 242B token dataset from Harvard Library's collections, refined for accuracy and usability. 🧵
June 12, 2025 at 9:12 PM
Reposted by Laure Thompson
The NEH has been a singular force for bringing people together. In this moment of hate, fear and isolation, the NEH helped us learn about our past and about each other.

Today we are losing almost all of the NEH public servants. We are all in their debt. A national shame. A profound cultural loss.
June 10, 2025 at 3:41 PM
Reposted by Laure Thompson
My course proposal for Cultural Analytics @berkeleyischool.bsky.social has been approved for Fall 2025! This is the fullest expression of my vision for CA: a radical interdisciplinary experiment for rethinking knowledge production at the intersection of the humanities and machine learning. (1/9)
June 4, 2025 at 5:20 PM
Reposted by Laure Thompson
Many libraries now use automated tools to measure diversity in their collections.

We examined how these tools work and whether library workers find them useful. A complex case study of libraries navigating automation, DEI, & shrinking public funding.

Our new FAccT paper: arxiv.org/abs/2505.14890
May 26, 2025 at 12:10 PM
Reposted by Laure Thompson
Worth a read. Not about AI. Many thoughts come to mind about literacy (and political literacy), grading standards, perceptions of studying literature as "easy" compared to science (it's not easy), how to translate expertise so people can understand.

kittenbeloved.substack.com/p/college-en...
College English majors can't read
They have one job and they can't do it
kittenbeloved.substack.com
May 22, 2025 at 9:06 AM
Reposted by Laure Thompson
Llama 3.1 70B contains copies of nearly the entirety of some books. Harry Potter is just one of them. I don’t know if this means it’s an infringing copy. But the first question to answer is if it’s a copy at all/in the first place. That’s what our new results suggest:

arxiv.org/abs/2505.12546
Extracting memorized pieces of (copyrighted) books from open-weight language models
Plaintiffs and defendants in copyright lawsuits over generative AI often make sweeping, opposing claims about the extent to which large language models (LLMs) have memorized plaintiffs' protected expr...
arxiv.org
May 21, 2025 at 11:20 AM
Reposted by Laure Thompson
I spoke to the person who AI-generated the Chicago Sun-Times reading list. Says he's very embarrassed. This was part of a generic package inserted into newspapers and other publications, so likely to run elsewhere. He didn't know it'd be in Chicago Sun-Times

www.404media.co/chicago-sun-...
Chicago Sun-Times Prints AI-Generated Summer Reading List With Books That Don't Exist
"I can't believe I missed it because it's so obvious. No excuses," the writer said. "I'm completely embarrassed."
www.404media.co
May 20, 2025 at 2:47 PM
Reposted by Laure Thompson
Public Access > Open Access
May 20, 2025 at 1:41 AM
Reposted by Laure Thompson
bad
This was just posted by @tbretc.bsky.social on another platform. The Chicago Sun-Times obviously gets ChatGPT to write a ‘summer reads’ feature almost entirely made up of real authors but completely fake books. What are we coming to?
May 20, 2025 at 11:28 AM