Kaj Bostrom
@bostromk.net
I hate slop and yet I work on generative models
PhD from UT Austin, applied scientist @ AWS
He/him • https://bostromk.net
Reposted by Kaj Bostrom
As our main result, we find that when a token is in a model's vocabulary—i.e., when its characters are tokenised as a single symbol—the model may assign it up to 17x more probability than if it had been split into two tokens instead.
June 5, 2025 at 10:43 AM
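The finding above can be illustrated with a toy calculation (the numbers below are invented for illustration, not taken from the paper): by the chain rule, the probability of a string under a forced two-token split is the product of the pieces' conditional probabilities, which can be far smaller than the probability the model assigns to the same string as a single vocabulary token.

```python
import math

# Hypothetical next-token log-probabilities from a toy language model.
# In a real model these would come from a softmax over the vocabulary;
# the tokens and values here are made up.
logprob_single = {" hello": math.log(0.017)}    # " hello" as one vocab entry
logprob_piece1 = {" hel": math.log(0.010)}      # first piece of a forced split
logprob_piece2 = {"lo": math.log(0.100)}        # second piece, given " hel"

# Probability of the string via the single-token encoding.
p_single = math.exp(logprob_single[" hello"])

# Probability via the two-token split: chain rule over the pieces.
p_split = math.exp(logprob_piece1[" hel"] + logprob_piece2["lo"])

ratio = p_single / p_split
print(f"single-token vs. split probability: {ratio:.1f}x")  # 17.0x
```

With these invented numbers the single-token encoding gets 17x the probability of the split encoding, matching the scale of the effect the post describes.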
Reposted by Kaj Bostrom
Finally found it!
November 22, 2024 at 6:46 PM
power's back, now I can resume flailing ("running ICLR rebuttal experiments")
November 21, 2024 at 5:10 PM
Reposted by Kaj Bostrom
another starter pack, this time for folks (past & current) from Ai2 (@ai2.bsky.social) 😍

go.bsky.app/Qjyc97J
November 21, 2024 at 4:10 PM
EMNLP was so fun, Miami is interesting
November 17, 2024 at 5:06 PM