Lightnews — Scholar-powered news

Juan Diego Rodriguez

@juand-r.bsky.social

4.3K followers 2.2K following 600 posts

CS PhD student at UT Austin in #NLP
Interested in language, reasoning, semantics and cognitive science. One day we'll have more efficient, interpretable and robust models!

Other interests: math, philosophy, cinema

https://www.juandiego-rodriguez.com/

Posts Replies Media Videos

Juan Diego Rodriguez

@juand-r.bsky.social

GPT-5 being difficult

November 11, 2025 at 9:07 PM

Juan Diego Rodriguez

@juand-r.bsky.social

Some Halloween reading

October 31, 2025 at 6:16 PM

Juan Diego Rodriguez

@juand-r.bsky.social

Relatedly, I think this may be a better depiction of LLMs than the shoggoth meme

September 24, 2025 at 2:37 PM

Juan Diego Rodriguez

@juand-r.bsky.social

I keep coming back to this bit from John Stuart Mill's
"On Liberty":

August 6, 2025 at 3:47 AM

Juan Diego Rodriguez

@juand-r.bsky.social

These encompass three tiers of interpretive complexity:

1. Stylistic feature identification
2. Context retrieval (e.g., historical or literary context)
3. Multi-hop reasoning between style and boarder contexts.

July 27, 2025 at 7:19 PM

Juan Diego Rodriguez

@juand-r.bsky.social

🎉 New Benchmark Alert: KRISTEVA – Close‑Reading for LLMs📚

I’m excited to announce a new paper accepted to ACL 2025, in collaboration with Patrick Sui, Philippe Laban, and others!

July 27, 2025 at 7:19 PM

Juan Diego Rodriguez

@juand-r.bsky.social

Multi-document summarization paper, pistachio milk cake, and mufawar coffee.

June 7, 2025 at 5:15 PM

Juan Diego Rodriguez

@juand-r.bsky.social

For those attending #NAACL, I’d definitely recommend side trips to Santa Fe and Taos

Now off to Albuquerque!

The Earthship near Taos

https://taos.org/explore/landmarks/taos-earthships/

April 28, 2025 at 1:14 AM

Juan Diego Rodriguez

@juand-r.bsky.social

Our final South by Semantics lecture at UT Austin is happening on Wednesday April 23!

South by Semantics Workshop
Title: "Not-your-mother's connectionism: LLMs as cognitive models"
Speaker: Ellie Pavlick (Brown University)
Date and time: April 23, 2025. 3:30 - 5 PM.
Location: GDC 6.302

April 21, 2025 at 1:39 PM

Juan Diego Rodriguez

@juand-r.bsky.social

🔍 Results? We find RankAlign reduces the Generator-Validator gap by 31.8% (Pearson correlation) on average, significantly outperforming existing methods.

🎯 RankAlign also generalizes well to new, unseen tasks and lexical items, making LLMs more trustworthy!

Detailed performance metrics across tasks and methods for Gemma-2-2B on the Hypernymy and SWORDS datasets.

April 16, 2025 at 6:03 PM

Juan Diego Rodriguez

@juand-r.bsky.social

🛠️ We define the Generator-Validator gap in a new way using the correlation between "generator" and “validator” scores: the likelihood of a completion vs. the likelihood of confirming it.

This extends prior work by reasoning about the entire distribution of possible outputs across all examples.

Distribution of generator and validator log-odds for hypernym prediction.

April 16, 2025 at 6:03 PM

Juan Diego Rodriguez

@juand-r.bsky.social

One of the ways that LLMs can be inconsistent is the "generator-validator gap," where LLMs deem their own answers incorrect.

🎯 We demonstrate that ranking-based discriminator training can significantly reduce this gap, and improvements on one task often generalize to others!

🧵👇

A visualization of the generator-validator gap, where the LM likelihoods of for the generator and discriminator forms of questions are poorly correlated.

Aligning the validator and generator rankings can fix it!

April 16, 2025 at 6:03 PM

Juan Diego Rodriguez

@juand-r.bsky.social

Announcing another South by Semantics talk happening at UT Austin!

I'm looking forward to hearing Cameron Buckner (@cameronbuckner.bsky.social) talk about "Language Models as Models of Human Reasoning"on March 26th.

March 25, 2025 at 4:51 PM

Juan Diego Rodriguez

@juand-r.bsky.social

Gabe Dupre (Assistant Professor of Philosophy at UC Davis) will talk about LLMs and the linguistic system.

gabedupre.weebly.com

Date: Friday March 14, 2025
Time: 3:30 pm
Location: WAG 316

Title: 21st Century Wickelphones: Language Models and the Linguistic System