Lightnews — Scholar-powered news

Sean Trott

@seantrott.bsky.social

41 followers 16 following 7 posts

Reposted by Sean Trott

Kyle Mahowald

@kmahowald.bsky.social

A confounding thing for the linguistics of LMs: the best way to assess their grammatical ability is string probability. Yet string probability and grammaticality are famously not the same!

Really excited to have this out, where we give a formal account, w/ experiments, of how to make sense of that!

Jennifer Hu @jennhu.bsky.social · 12d

New work to appear @ TACL!

Language models (LMs) are remarkably good at generating novel well-formed sentences, leading to claims that they have mastered grammar.

Yet they often assign higher probability to ungrammatical strings than to grammatical strings.

How can both things be true? 🧵👇

Screenshot of a figure with two panels, labeled (a) and (b). The caption reads: "Figure 1: (a) Illustration of messages (left) and strings (right) in toy domain. Blue = grammatical strings. Red = ungrammatical strings. (b) Surprisal (negative log probability) assigned to toy strings by GPT-2."

November 10, 2025 at 10:23 PM
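The figure caption in the post above defines surprisal as negative log probability. A minimal sketch of that definition follows, assuming made-up per-token probabilities (they do not come from GPT-2 or from the paper) and a hypothetical helper named `surprisal_bits`. It illustrates only one well-known way string probability can come apart from grammaticality, namely that longer strings accumulate more surprisal; it is not a summary of the paper's account.

```python
import math


def surprisal_bits(token_probs):
    """Total surprisal of a string in bits: the negative sum of log2 of
    each token's conditional probability."""
    return -sum(math.log2(p) for p in token_probs)


# Hypothetical per-token probabilities, invented for illustration only.
short_ungrammatical = [0.5, 0.25]           # 2 tokens
long_grammatical = [0.5, 0.5, 0.5, 0.5]     # 4 tokens

s_short = surprisal_bits(short_ungrammatical)
s_long = surprisal_bits(long_grammatical)

# The longer string receives higher surprisal (lower probability) here,
# even though it is the one labeled grammatical in this toy setup.
print(s_short, s_long)
```

In this toy setup the grammatical string ends up less probable simply because it has more tokens, which is why raw string probabilities are hard to read as grammaticality judgments.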

Reposted by Sean Trott

Eran Mukamel

@neurome.bsky.social

Hard to process the news about Harvard and international students. Other universities should stand in solidarity with our colleagues who are being persecuted.

May 23, 2025 at 4:40 AM