burkov.bsky.social
@burkov.bsky.social
In doing research for my book, I discovered that byte-pair encoding (BPE), the algorithm used to tokenize data for modern language models, one of the most important algorithms of our times, is described incorrectly in almost all online resources.
December 1, 2024 at 1:48 AM