Tiago Pimentel
@tpimentel.bsky.social
Postdoc at ETH. Formerly a PhD student at the University of Cambridge :)
LLMs are trained to mimic a “true” distribution; their decreasing cross-entropy confirms they get closer to this target during training. But do similar models approach this target distribution in similar ways? 🤔 Not really! Our new paper studies this, finding 4 convergence phases in training 🧵
October 1, 2025 at 6:08 PM
Mechanistic interpretability often relies on *interventions* to study how DNNs work. Are these interventions enough to guarantee that the features we find are not spurious? No! ⚠️ In our new paper, we show that many mech interp methods implicitly rely on the linear representation hypothesis 🧵
July 14, 2025 at 12:15 PM
A string may get 17 times less probability when tokenised as two symbols (e.g., ⟨he, llo⟩) than as one (e.g., ⟨hello⟩), with an LM trained from scratch under each tokenisation! Our new ACL paper proposes an observational method to estimate this causal effect! Longer thread soon!
June 4, 2025 at 10:51 AM
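
For intuition, here is a sketch of the quantity being compared (my notation, not the paper's): assuming each LM deterministically tokenises the string its own way, and writing p₁ and p₂ for the two separately trained LMs,

p₁(hello) = p₁(⟨hello⟩)
p₂(hello) = p₂(⟨he⟩) · p₂(⟨llo⟩ | ⟨he⟩)

The reported effect is then the ratio p₁(hello) / p₂(hello), here roughly 17.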
If you're finishing your camera-ready for ACL or ICML and want to cite co-first authors more fairly, I just made a simple fix for this! Add $^*$ to the authors' names in your BibTeX, and the citations should change :)

github.com/tpimentelms/...
May 29, 2025 at 8:53 AM
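
For illustration, a hypothetical BibTeX entry with the fix applied (the entry key, names, and fields below are made up; the rendering mechanics live in the repo above):

@inproceedings{doe2025example,
  author    = {Doe$^*$, Jane and Roe$^*$, Richard},
  title     = {An Example Paper},
  booktitle = {Proceedings of ACL},
  year      = {2025}
}

In-text citations should then mark the shared first authorship, coming out as something like (Doe* and Roe*, 2025).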
Are you interested in word lengths and natural language’s efficiency? If yes, check out our new #EMNLP2023 paper! It has everything you need: drama, suspense, a new derivation of Zipf’s law, an update to Piantadosi et al.’s classic word length paper, transformers... 😄

arxiv.org/abs/2312.03897
December 8, 2023 at 5:46 PM