|| Machine Learning Research @ Featurespace || PhD from Uni of Manchester in Compsci / Natural Language Processing || MSc Maths, Stellenbosch ||
Cambridge, UK
Also I got to gush about her Traitor Baru Cormorant episode on @mealofthorns.bsky.social, IMPORTANT
Chants of Sennaar!
Absolute stunner of a game based on language learning, grammar, text and context.
Violets are blue
1. The data it was trained & evaluated on
2. The code
3. Model architecture
4. Model weights.
DeepSeek only gives 3, 4. And I'll see the day that anyone gives us #1 without being forced to do so, because all of them are stealing data.
www.nytimes.com/2025/01/02/o...
Table of contents:
Paging @ruha9.bsky.social & @alondra.bsky.social!!
We present an efficient framework – MAPS – for inferring the functionality of attention heads in LLMs ✨directly from their parameters✨
A new preprint with Amit Elhelo 🧵 (1/10)