sebviteri.bsky.social
Reposted
Let me tell you about some other random shit while we wait for the model to train: Did you know that almost all convolutional architectures have a flaw that makes them suboptimal for segmentation or other 2D tasks?
It's the padding! Let me show you how to fix it!🧵 #mlsky
November 19, 2024 at 12:26 PM
Reposted
okay Microsoft, you've got my attention...
github.com/microsoft/Bi...

will try the Llama version later, right now i have a bug in llama.cpp to describe...
GitHub - microsoft/BitNet: Official inference framework for 1-bit LLMs
October 20, 2024 at 10:47 AM