Zilei Shao
zoeshao.bsky.social
First-year Ph.D. Student @ StarAI Lab, UCLA
Harvey Mudd College ’24
What happens if we tokenize cat as [ca, t] rather than [cat]?

LLMs are trained on just one tokenization per word, but they still understand alternative tokenizations. We show that this can be exploited to bypass safety filters without changing the text itself.
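
To make the [ca, t] vs. [cat] idea concrete, here is a minimal sketch that lists every way a string can be split into tokens from a vocabulary. The vocabulary `TOY_VOCAB` and the helper `tokenizations` are hypothetical and made up for illustration; they are not from any real tokenizer or from the work described in this post. Every segmentation concatenates back to the same text, which is why the text itself stays unchanged.

```python
def tokenizations(text, vocab):
    """Return every way to split `text` into tokens drawn from `vocab`."""
    if not text:
        return [[]]  # exactly one way to tokenize the empty string
    results = []
    for end in range(1, len(text) + 1):
        prefix = text[:end]
        if prefix in vocab:
            # Keep this prefix as a token and recurse on the remainder.
            for rest in tokenizations(text[end:], vocab):
                results.append([prefix] + rest)
    return results


# Hypothetical toy vocabulary, not taken from any real tokenizer.
TOY_VOCAB = {"c", "a", "t", "ca", "at", "cat"}

segmentations = tokenizations("cat", TOY_VOCAB)
for tokens in segmentations:
    # Each token sequence spells out the same surface string.
    print(tokens, "->", "".join(tokens))
```

A real tokenizer would normally emit only one of these sequences for a given word; the others are the alternative tokenizations the post refers to.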

#AI #LLMs #tokenization #alignment
March 11, 2025 at 11:13 PM