Ross Taylor
banner
rosstaylor90.bsky.social
Ross Taylor
@rosstaylor90.bsky.social
Building new thing. Same handle at the other place
Really simple principle that would go a long way: assume good faith. Most people are trying to do the right thing.
November 28, 2024 at 6:27 AM
Reposted by Ross Taylor
I personally vote text is self supervised learning (SSL) for the same reason image is SSL.

Text tokenization is a modification but so are patches in vision. If you want you can go all the way down to bits or raw audio samples. We know text bits work as (again modded) it's used in LLM compression.
November 19, 2024 at 9:20 AM