Our PhD students also run an application mentoring program for prospective students. Mentoring requests due November 15.
tinyurl.com/2nrn4jf9
Topics of interest include pragmatics, metacognition, reasoning, & interpretability (in humans and AI).
Check out JHU's mentoring program (due 11/15) for help with your SoP 👇
Language models (LMs) are remarkably good at generating novel well-formed sentences, leading to claims that they have mastered grammar.
Yet they often assign higher probability to ungrammatical strings than to grammatical strings.
How can both things be true? 🧵👇
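A minimal sketch of the measurement behind this claim (illustrative assumptions: the Hugging Face transformers library, GPT-2 as a stand-in model, and a subject-verb agreement pair of my own choosing):

```python
# Sketch: total log-probability an LM assigns to a string.
# Model and sentence pair are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def string_logprob(text: str) -> float:
    """Sum of token log-probabilities for `text` under the LM."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        # labels=ids gives the mean cross-entropy over the
        # seq_len - 1 predicted tokens (labels shift internally).
        loss = model(ids, labels=ids).loss
    return -loss.item() * (ids.shape[1] - 1)

print(string_logprob("The keys to the cabinet are on the table."))
print(string_logprob("The keys to the cabinet is on the table."))  # ungrammatical
```

Because this scores whole strings, the result reflects length and word frequency as well as grammar, which is one way an ungrammatical string can outscore an unrelated grammatical one.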
Caveats:
-*-*-*-*
> These are my opinions, based on my experiences; they are not secret tricks or guarantees
> They are general guidelines, not meant to cover a host of idiosyncrasies and special cases
"Non-commitment in mental imagery is distinct from perceptual inattention, and supports hierarchical scene construction"
(by Li, Hammond, & me)
link: doi.org/10.31234/osf...
-- the title's a bit of a mouthful, but the nice thing is that it's a pretty decent summary
"Non-commitment in mental imagery is distinct from perceptual inattention, and supports hierarchical scene construction"
(by Li, Hammond, & me)
link: doi.org/10.31234/osf...
-- the title's a bit of a mouthful, but the nice thing is that it's a pretty decent summary
👉 I'm presenting at two workshops (PragLM, Visions) on Fri
👉 Also check out "Language Models Fail to Introspect About Their Knowledge of Language" (presented by @siyuansong.bsky.social Tue 11-1)
We revisit a recent proposal by Comșa & Shanahan, and provide new experiments + an alternate definition of introspection.
Check out this new work w/ @siyuansong.bsky.social, @harveylederman.bsky.social, & @kmahowald.bsky.social 👇
Submit by *8/27* (midnight AoE)
How can we interpret the algorithms and representations underlying complex behavior in deep learning models?
🌐 coginterp.github.io/neurips2025/
1/4
Looking forward to seeing your submissions!
Find me giving talks on:
💬 The production-comprehension asymmetry in children and LMs (Thu 7/31)
💬 How people make sense of nonsense (Sat 8/2)
📣 Also, I’m recruiting grad students + postdocs for my new lab at Hopkins! 📣
If you’re interested in language / cognition / AI, let’s chat! 😄
How do LLMs engage in pragmatic reasoning, and what core pragmatic capacities remain beyond their reach?
🌐 sites.google.com/berkeley.edu/praglm/
📅 Submit by June 23rd
A dominant approach in AI/cogsci uses *outputs* from AI models (eg logprobs) to predict human behavior.
But how does model *processing* (across layers in a forward pass) relate to human real-time processing? 👇 (1/12)
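A minimal sketch of what "processing" access looks like (assuming the Hugging Face transformers API, with GPT-2 as a stand-in model): a single forward pass exposes one hidden state per layer, not just the final output distribution.

```python
# Sketch: per-layer hidden states from one forward pass.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModel.from_pretrained("gpt2", output_hidden_states=True)
model.eval()

ids = tokenizer("The cat sat on the mat.", return_tensors="pt").input_ids
with torch.no_grad():
    out = model(ids)

# out.hidden_states: embeddings plus one tensor per layer, each of
# shape (batch, seq_len, hidden_dim). These layer-by-layer
# representations are the "processing" signal, as opposed to the
# output logprobs alone.
for layer, h in enumerate(out.hidden_states):
    print(f"layer {layer:2d}: last-token norm = {h[0, -1].norm():.2f}")
```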
TL;DR we find no evidence that LLMs have privileged access to their own knowledge.
Beyond the study of LLM introspection, our findings inform an ongoing debate in linguistics research: prompting (eg grammaticality judgments) =/= prob measurement!
Across models and domains, we did not find evidence that LLMs have privileged access to their own predictions. 🧵(1/8)
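To make the prompting-vs-probability contrast concrete, here is an illustrative sketch (my own example, not the paper's code; assumes Hugging Face transformers and GPT-2). The same model can be prompted for a metalinguistic judgment or scored directly, and the two measurements need not agree:

```python
# Sketch: two ways of asking a model about the same sentence.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

sentence = "The keys to the cabinet is on the table."

# (1) Prompting: ask for a yes/no grammaticality judgment and compare
# the next-token probabilities of " Yes" vs. " No".
prompt = (f'Is this sentence grammatical? Answer Yes or No.\n'
          f'Sentence: "{sentence}"\nAnswer:')
ids = tokenizer(prompt, return_tensors="pt").input_ids
with torch.no_grad():
    logits = model(ids).logits[0, -1]
yes_id = tokenizer(" Yes").input_ids[0]
no_id = tokenizer(" No").input_ids[0]
print("prompted judgment:", "Yes" if logits[yes_id] > logits[no_id] else "No")

# (2) Direct measurement: total log-probability of the string itself.
sent_ids = tokenizer(sentence, return_tensors="pt").input_ids
with torch.no_grad():
    loss = model(sent_ids, labels=sent_ids).loss
print("string log-prob:", -loss.item() * (sent_ids.shape[1] - 1))
```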
"Re-evaluating Theory of Mind evaluation in large language models"
(by Hu* @jennhu.bsky.social, Sosa, and me)
link: arxiv.org/pdf/2502.21098
"Re-evaluating Theory of Mind evaluation in large language models"
(by Hu* @jennhu.bsky.social , Sosa, and me)
link: arxiv.org/pdf/2502.21098
In a new review paper, @noahdgoodman.bsky.social and I discuss how modern AI can be used for cognitive modeling: osf.io/preprints/ps...
(How) do people differentiate between the inconceivable and the merely impossible? Do language models also make similar distinctions?
Check out our new preprint below!
But what about *inconceivable* things?
For your breakfast read, check out the new preprint:
"Shades of Zero: Distinguishing Impossibility from Inconceivability"
(by @jennhu.bsky.social, Sosa, & me)
arxiv: arxiv.org/pdf/2502.20469
It'd be great if you could share this widely with people you think might be interested.
More details on the position & how to apply: bit.ly/cocodev_post...
Official posting here: academicpositions.harvard.edu/postings/14723
careersearch.stanford.edu/jobs/researc...
and
careersearch.stanford.edu/jobs/lab-coo...
We want models that match our values...but could this hurt their diversity of thought?
Preprint: arxiv.org/abs/2411.04427
neurips.cc/virtual/2024/tutorial/99528
Are you an AI researcher interested in comparing models/methods? Then your conclusions rely on well-designed experiments. We'll cover best practices + case studies. 👇
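One example of the kind of practice in question (an illustrative sketch of my own, not the tutorial's materials): evaluate both models on the same items and run a paired test, rather than comparing two independent accuracy numbers.

```python
# Sketch: paired comparison of two models over shared eval items.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n_items = 200
# Stand-in per-item scores (1 = correct); real data would come from
# running both models on the same benchmark items.
model_a = rng.binomial(1, 0.80, n_items)
model_b = rng.binomial(1, 0.74, n_items)

diff = model_a - model_b
# Paired t-test over items (for binary scores, McNemar's test is the
# classical alternative); pairing controls for item difficulty.
t, p = stats.ttest_rel(model_a, model_b)
print(f"mean diff = {diff.mean():.3f}, t = {t:.2f}, p = {p:.3f}")
```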
Paper: arxiv.org/abs/2305.13264
Original thread: twitter.com/_jennhu/stat...