Harvey Mudd College '24
LLMs are trained on just one tokenization per word, but they still understand alternative tokenizations. We show that this can be exploited to bypass safety filters without changing the text itself.
#AI #LLMs #tokenization #alignment
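The key point is that one string admits many token-ID segmentations, even though the tokenizer (and therefore training) only ever produces the canonical one. A minimal sketch of what an "alternative tokenization" looks like, assuming the Hugging Face `transformers` package and the GPT-2 tokenizer (both illustrative choices, not specified in the post):

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")

text = "tokenization"

# Canonical tokenization: the single segmentation the tokenizer produces,
# and the only one the model sees during training.
canonical = tok.encode(text)

# One alternative tokenization: encode the string character by character and
# concatenate the IDs. The ID sequence differs, yet it decodes to the same text.
alternative = [tid for ch in text for tid in tok.encode(ch)]

print(canonical, tok.decode(canonical))
print(alternative, tok.decode(alternative))
assert tok.decode(canonical) == tok.decode(alternative) == text
```

Because both ID sequences decode to the identical surface string, any defense that only inspects the text (or only the canonical tokenization) cannot tell them apart.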