Juan Diego Rodriguez
juand-r.bsky.social
CS PhD student at UT Austin in #NLP
Interested in language, reasoning, semantics and cognitive science. One day we'll have more efficient, interpretable and robust models!

Other interests: math, philosophy, cinema

https://www.juandiego-rodriguez.com/
GPT-5 being difficult
November 11, 2025 at 9:07 PM
Some Halloween reading
October 31, 2025 at 6:16 PM
Relatedly, I think this may be a better depiction of LLMs than the shoggoth meme
September 24, 2025 at 2:37 PM
I keep coming back to this bit from John Stuart Mill's
"On Liberty":
August 6, 2025 at 3:47 AM
These encompass three tiers of interpretive complexity:

1. Stylistic feature identification
2. Context retrieval (e.g., historical or literary context)
3. Multi-hop reasoning between style and broader contexts
July 27, 2025 at 7:19 PM
🎉 New Benchmark Alert: KRISTEVA – Close‑Reading for LLMs📚

I’m excited to announce a new paper accepted to ACL 2025, in collaboration with Patrick Sui, Philippe Laban, and others!
July 27, 2025 at 7:19 PM
June 7, 2025 at 5:15 PM
For those attending #NAACL, I’d definitely recommend side trips to Santa Fe and Taos

Now off to Albuquerque!
April 28, 2025 at 1:14 AM
Our final South by Semantics lecture at UT Austin is happening on Wednesday April 23!
April 21, 2025 at 1:39 PM
🔍 Results? We find RankAlign reduces the Generator-Validator gap by 31.8% (Pearson correlation) on average, significantly outperforming existing methods.

🎯 RankAlign also generalizes well to new, unseen tasks and lexical items, making LLMs more trustworthy!
April 16, 2025 at 6:03 PM
🛠️ We define the Generator-Validator gap in a new way using the correlation between "generator" and "validator" scores: the likelihood of a completion vs. the likelihood of confirming it.

This extends prior work by reasoning about the entire distribution of possible outputs across all examples.
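
A rough sketch of the correlation idea, not the paper's actual code: the scores below are made-up log-probabilities, and `pearson`, `generator_scores`, and `validator_scores` are hypothetical names used only for illustration. The assumption is that a higher correlation between the two score lists means the model's generator and validator views agree more closely.

```python
import math


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of scores."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 items")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical data: one entry per (prompt, completion) pair.
# generator score: log-likelihood the model assigns to producing the completion
# validator score: log-likelihood the model assigns to confirming the completion
generator_scores = [-0.5, -1.2, -2.3, -3.1, -4.0]
validator_scores = [-0.2, -0.9, -0.4, -2.5, -1.8]

r = pearson(generator_scores, validator_scores)
print(f"generator-validator correlation: {r:.3f}")
```

Because the correlation is computed over every scored pair, it looks at the whole distribution of outputs rather than only the top answer for each example.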
April 16, 2025 at 6:03 PM
One of the ways that LLMs can be inconsistent is the "generator-validator gap," where LLMs deem their own answers incorrect.

🎯 We demonstrate that ranking-based discriminator training can significantly reduce this gap, and improvements on one task often generalize to others!

🧵👇
April 16, 2025 at 6:03 PM
Announcing another South by Semantics talk happening at UT Austin!

I'm looking forward to hearing Cameron Buckner (@cameronbuckner.bsky.social) talk about "Language Models as Models of Human Reasoning" on March 26th.
March 25, 2025 at 4:51 PM
Gabe Dupre (Assistant Professor of Philosophy at UC Davis) will talk about LLMs and the linguistic system.

gabedupre.weebly.com

Date: Friday March 14, 2025
Time: 3:30 pm
Location: WAG 316

Title: 21st Century Wickelphones: Language Models and the Linguistic System
March 14, 2025 at 4:14 AM
February 11, 2025 at 4:43 AM
Finally made a good amaretto sour, following Morgenthaler’s recipe
February 10, 2025 at 3:54 AM
Musk takes control of the federal government.

The NYT fails to report what's happening yet again:
February 3, 2025 at 3:22 PM
February 1, 2025 at 8:19 PM
My new favorite cocktail: the sherry flip

2 oz oloroso sherry, 1/2 oz Demerara syrup, 1 egg, and nutmeg
January 25, 2025 at 8:06 PM
Today I found this illustration of a 17th century thermometer with Stranger Things vibes
January 2, 2025 at 11:47 PM
7/20
December 13, 2024 at 8:53 PM
6/20
December 7, 2024 at 7:25 PM
5/20
December 6, 2024 at 5:28 AM
4/20
November 30, 2024 at 10:02 PM
3/20
November 27, 2024 at 8:36 PM