Alexander Hoyle
@alexanderhoyle.bsky.social
Postdoctoral fellow at ETH AI Center, working on Computational Social Science + NLP. Previously a PhD in CS at UMD, advised by Philip Resnik. Internships at MSR, AI2. he/him.
On the job market this cycle!
alexanderhoyle.com
Happy to be at #EMNLP2025! Please say hello and come see our lovely work
November 5, 2025 at 2:23 AM
[corrected link]
LLMs are often used for text annotation in social science. In some cases, this involves placing text items on a scale: e.g., 1 for liberal and 9 for conservative.
There are a few ways to handle this task. Which work best? Our new EMNLP paper has some answers🧵
arxiv.org/abs/2509.03116
October 28, 2025 at 6:23 AM
LLMs are often used for text annotation, especially in social science. In some cases, this involves placing text items on a scale: e.g., 1 for liberal and 9 for conservative.
There are a few ways to accomplish this task. Which work best? Our new EMNLP paper has some answers🧵
arxiv.org/pdf/2507.00828
October 27, 2025 at 2:59 PM
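For readers new to this setup, here is a minimal sketch of one such strategy: directly prompting the model for a numeric score. The model name, prompt wording, and parsing below are placeholders for illustration, and this is just one of the approaches the paper compares, not its recommendation.

```python
# Sketch: direct prompting for a 1-9 ideology score (illustration only).
# Assumes the OpenAI Python client and an OPENAI_API_KEY in the environment;
# the model name and prompt wording are placeholders, not the paper's setup.
import re
from openai import OpenAI

client = OpenAI()

def rate_ideology(text: str) -> int | None:
    prompt = (
        "On a scale from 1 (very liberal) to 9 (very conservative), "
        "rate the political ideology expressed in the following text. "
        "Respond with a single integer.\n\n"
        f"Text: {text}"
    )
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model
        messages=[{"role": "user", "content": prompt}],
        temperature=0,
    )
    match = re.search(r"\d+", resp.choices[0].message.content)
    return int(match.group()) if match else None

print(rate_ideology("We should expand public healthcare to cover everyone."))
```

Other strategies one might try include pairwise comparisons between texts or reading scores off token probabilities; which of these design choices actually holds up is the kind of question the paper investigates.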
Reposted by Alexander Hoyle
Computer Science is no longer just about building systems or proving theorems--it's about observation and experiments.
In my latest blog post, I argue it’s time we had our own "Econometrics," a discipline devoted to empirical rigor.
doomscrollingbabel.manoel.xyz/p/the-missin...
October 5, 2025 at 4:07 PM
Accepted to EMNLP (and more to come 👀)! The camera-ready version is now online---very happy with how this turned out
arxiv.org/abs/2507.01234
New preprint! Have you ever tried to cluster text embeddings from different sources, but the clusters just reproduce the sources? Or attempted to retrieve similar documents across multiple languages, and even multilingual embeddings return items in the same language?
Turns out there's an easy fix🧵
September 24, 2025 at 3:21 PM
this looks terrific, very excited to read
LLMs introduce a huge range of new capabilities for research, but also make it possible for researchers to "hack" their results in new ways by how they choose to use models for annotation
This is a useful pass at quantifying some of the risk, and some mitigation strategies arxiv.org/pdf/2509.08825
September 15, 2025 at 7:53 PM
Reposted by Alexander Hoyle
I am delighted to share our new #PNAS paper, with @grvkamath.bsky.social @msonderegger.bsky.social and @sivareddyg.bsky.social, on whether age matters for the adoption of new meanings. That is, as words change meaning, does the rate of adoption vary across generations? www.pnas.org/doi/epdf/10....
July 29, 2025 at 12:31 PM
At #ACL2025 this week! Please reach out if you want to chat :)
We have two lovely posters:
Tues Session 2, 10:30-11:50 — Large Language Models Struggle to Describe the Haystack without Human Help
Wed Session 4, 11:00-12:30 — ProxAnn: Use-Oriented Evaluations of Topic Models and Document Clustering
Evaluating topic models (and document clustering methods) is hard. In fact, since our paper critiquing standard evaluation practices four years ago, there hasn't been a good replacement metric
That ends today (we hope)! Our new ACL paper introduces an LLM-based evaluation protocol 🧵
July 27, 2025 at 9:30 AM
Reposted by Alexander Hoyle
🗣️ Excited to share our new #ACL2025 Findings paper: “Just Put a Human in the Loop? Investigating LLM-Assisted Annotation for Subjective Tasks” with Jad Kabbara and Deb Roy. Arxiv: arxiv.org/abs/2507.15821
Read about our findings ⤵️
July 22, 2025 at 8:32 AM
Reposted by Alexander Hoyle
The precursor to this paper "The Incoherence of Coherence" had our most-watched paper video ever, so I thought we had to surpass it somehow ... so we decided to do a song parody (of Roxanne, obviously):
youtu.be/87OBxEM8a9E
July 18, 2025 at 6:37 PM
New preprint! Have you ever tried to cluster text embeddings from different sources, but the clusters just reproduce the sources? Or attempted to retrieve similar documents across multiple languages, and even multilingual embeddings return items in the same language?
Turns out there's an easy fix🧵
July 17, 2025 at 10:53 AM
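For context on what an easy fix might look like here: one simple approach in this spirit is per-source mean centering, i.e., subtracting each source's (or language's) mean embedding before clustering or retrieval. The sketch below illustrates that general idea under that assumption; it is not necessarily the preprint's exact method.

```python
# Sketch: per-source mean centering before clustering (an illustration of the
# general idea; not necessarily the fix the preprint proposes).
import numpy as np
from sklearn.cluster import KMeans

def center_by_source(embeddings: np.ndarray, sources: np.ndarray) -> np.ndarray:
    """Subtract each source's mean embedding so clusters stop tracking the source."""
    centered = embeddings.astype(float).copy()
    for s in np.unique(sources):
        mask = sources == s
        centered[mask] -= centered[mask].mean(axis=0)
    return centered

# Toy data: two "sources" whose embeddings differ mainly by a constant offset.
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(200, 32))
embeddings[:100] += 2.0   # source A offset
embeddings[100:] -= 2.0   # source B offset
sources = np.array(["A"] * 100 + ["B"] * 100)

raw_labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(embeddings)
fixed_labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(
    center_by_source(embeddings, sources)
)
# raw_labels will essentially recover the sources; fixed_labels no longer can.
```

After centering, the dominant "which source did this come from" direction is largely removed, so clusters (or nearest neighbors) are likelier to reflect content rather than provenance.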
Evaluating topic models (and document clustering methods) is hard. In fact, since our paper critiquing standard evaluation practices four years ago, there hasn't been a good replacement metric
That ends today (we hope)! Our new ACL paper introduces an LLM-based evaluation protocol 🧵
July 8, 2025 at 12:40 PM
Michael Roth's recent outspokenness has made me proud to be a Wes alum. A decade of NYT op-ed handwringing about "free speech" on campuses has only provided ammunition for bad faith attacks on academia
(Perhaps I should be better at responding to those fundraising emails)
June 26, 2025 at 2:09 PM
I for one am grateful for the opportunity to meditate on the meaning of “scientific artifact” at 2:15am
Do folks really find the ARR checklist valuable enough to justify that a paper submission takes this much effort?
May 20, 2025 at 1:08 PM
They added multi-file search!
May 11, 2025 at 9:55 PM
Heartbreaking and evil. International students have always been treated like an indentured underclass, but we’ve moved from byzantine indifference to deliberate terrorizing. Unforgivable
Are there mutual aid networks for international students? What can we as citizens do here?
A dozen Johns Hopkins students have had their visas revoked, for unspecified reasons. This confirms rumors swirling around campus yesterday. www.thebaltimorebanner.com/education/hi...
April 8, 2025 at 8:48 PM
this holiday season I am thankful that, rather than fixing the literally decade-old problem of multi-file search, Overleaf instead implemented the world's worst writing assistance tool
December 12, 2024 at 10:37 AM
BERTopic users: how do you retrieve the documents most associated with a given topic? I can see some possible options from the documentation, but I'm most interested in standard practice
(NB: please don't take this question as a tacit endorsement of BERTopic, I'm just trying to evaluate it fairly)
December 4, 2024 at 12:44 PM
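For concreteness, two candidate options look roughly like this (assuming a recent BERTopic version; whether either counts as standard practice is exactly the question being asked):

```python
# Two candidate ways to pull the documents most associated with a topic.
# Assumes a recent BERTopic version; not a claim about standard practice.
from bertopic import BERTopic
from sklearn.datasets import fetch_20newsgroups

docs = fetch_20newsgroups(subset="train", remove=("headers", "footers", "quotes")).data

topic_model = BERTopic(calculate_probabilities=True)
topics, probs = topic_model.fit_transform(docs)

topic_id = 5  # hypothetical topic of interest

# Option 1: the handful of representative documents BERTopic stores per topic.
rep_docs = topic_model.get_representative_docs(topic_id)

# Option 2: rank every document by its soft assignment to the topic.
ranked = sorted(zip(docs, probs[:, topic_id]), key=lambda pair: pair[1], reverse=True)
top_docs = [doc for doc, _ in ranked[:10]]
```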
Maria has been a consistent source of excellent insight, advice, and research since I’ve known her—you should apply!
(And now that the bluesky is taking off, you will be connected to NLP's #1 influencer!)
I'm recruiting 1-2 PhD students to work with me at the University of Colorado Boulder! Looking for creative students with interests in #NLP and #CulturalAnalytics.
Boulder is a lovely college town 30 minutes from Denver and 1 hour from Rocky Mountain National Park 😎
Apply by December 15th!
November 19, 2024 at 2:37 PM
Once again thinking about this description of a George Wallace campaign rally (from Gary Wills’ “Nixon Agonistes”)
November 7, 2024 at 5:36 PM
Reposted by Alexander Hoyle
if a computer told you how fucking stupid this is would you believe it
twitter.com/emollick/sta...
April 21, 2024 at 2:03 AM
What is in the water in Amsterdam?? For my dissertation I've been reading these excellent critical papers on measurement and validation and so many authors have a connection to UvA
pubmed.ncbi.nlm.nih.gov/15482073/
www.tandfonline.com/doi/epdf/10....
www.tandfonline.com/doi/full/10....
I am super excited to share that our paper "Undesirable Biases in NLP: Addressing Challenges of Measurement" has been published in JAIR!
doi.org/10.1613/jair...
January 29, 2024 at 5:21 PM
Reposted by Alexander Hoyle
The #DataSittersClub is back with an all-new book on topic modeling! If the LDA buffet explainer didn't do it for you, give this one a try: thanks to Xanda Schofield and her student Sathvika Anand, I now feel like I actually understand how it works. datasittersclub.github.io/site/dsc20.h...
November 30, 2023 at 3:30 PM
Reposted by Alexander Hoyle
Dominik @dominsta.bsky.social talks about how to evaluate topic models, particularly with LLMs. 📄🧾📑🗞️📰📜
Joint work with Alexander @alexanderhoyle.bsky.social, Mrinmaya Sachan, and Elliott @elliottash.bsky.social.
www.youtube.com/watch?v=qIDj...
November 9, 2023 at 2:49 PM
Reposted by Alexander Hoyle
Honored my paper was accepted to Findings of #EMNLP2023! Many psycholinguistics studies use LLMs to estimate the probability of words in context. But LLMs process statistically derived subword tokens, while human processing doesn't. Does this matter? (w/Philip Resnik) 🧵
arxiv.org/abs/2310.17774
November 2, 2023 at 10:20 PM
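For anyone unfamiliar with the setup being questioned: a word's probability in context is usually computed by summing log-probabilities over its subword tokens. A minimal sketch with GPT-2 via Hugging Face transformers (an illustration of the common practice, not the paper's exact procedure):

```python
# Sketch: word-in-context log-probability as a sum over subword token log-probs.
# Illustrates the common practice (GPT-2 here), not the paper's exact procedure.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def word_logprob(context: str, word: str) -> float:
    """Log P(word | context), summed over the word's subword pieces."""
    word_ids = tokenizer.encode(" " + word)   # leading space marks a word boundary in GPT-2's BPE
    context_ids = tokenizer.encode(context)
    input_ids = torch.tensor([context_ids + word_ids])
    with torch.no_grad():
        logits = model(input_ids).logits      # (1, seq_len, vocab_size)
    log_probs = torch.log_softmax(logits, dim=-1)
    total = 0.0
    for i, tok_id in enumerate(word_ids):
        pos = len(context_ids) + i - 1        # position whose logits predict this token
        total += log_probs[0, pos, tok_id].item()
    return total

print(word_logprob("The cat sat on the", "mat"))
```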