TLDR: deleting one single weight from a 7B model turns it completely incoherent, destroying it’s ability to generate legible text.
arxiv.org/pdf/2411.07191
TLDR: deleting one single weight from a 7B model turns it completely incoherent, destroying it’s ability to generate legible text.
arxiv.org/pdf/2411.07191
Students do different research, go on the job market, and recruit other students. Ping me and I'll add you!
Students do different research, go on the job market, and recruit other students. Ping me and I'll add you!