For more general ML stuff: https://n1o.github.io/
For more ML focused on Vulnerability Research: https://codebreakers.re/
Github: https://github.com/n1o
takes the knowledge from the linear layers of the dropped transformer block. This is done by fusing (adding) the removed linear weights into the neighbouring linear weights through a low-rank projection matrix.
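To make the fusion step a bit more concrete, here is a minimal sketch in plain Python (no dependencies). The text does not give the exact formula, so the form used here is an assumption: the neighbouring weight becomes `w_neighbor + (a @ b) @ w_removed`, where `a @ b` is a low-rank projection. The names `fuse_removed_weight`, `a`, `b`, `rank`, and the zero initialisation of `b` are all hypothetical and chosen only for illustration.

```python
import random

Matrix = list[list[float]]


def matmul(x: Matrix, y: Matrix) -> Matrix:
    """Matrix product of x (n x k) and y (k x m) using plain lists."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*y)] for row in x]


def add(x: Matrix, y: Matrix) -> Matrix:
    """Element-wise sum of two matrices with the same shape."""
    return [[p + q for p, q in zip(rx, ry)] for rx, ry in zip(x, y)]


def random_matrix(rows: int, cols: int, rng: random.Random, scale: float = 1.0) -> Matrix:
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


def fuse_removed_weight(w_neighbor: Matrix, w_removed: Matrix, a: Matrix, b: Matrix) -> Matrix:
    """Assumed fusion rule: w_neighbor + (a @ b) @ w_removed.

    a is (d_out x rank) and b is (rank x d_out), so a @ b is a
    (d_out x d_out) matrix of rank at most `rank`.
    """
    projection = matmul(a, b)
    return add(w_neighbor, matmul(projection, w_removed))


rng = random.Random(0)
d_out, d_in, rank = 4, 3, 2

w_neighbor = random_matrix(d_out, d_in, rng)  # weight of a kept neighbouring layer
w_removed = random_matrix(d_out, d_in, rng)   # weight of the dropped block's layer

# Assumption: b starts at zero, so fusion initially leaves the neighbour unchanged.
a = random_matrix(d_out, rank, rng, scale=0.1)
b = [[0.0] * d_out for _ in range(rank)]
fused_init = fuse_removed_weight(w_neighbor, w_removed, a, b)

# Once b is non-zero (for example after training), the neighbour absorbs
# a low-rank-projected copy of the removed weight.
b = random_matrix(rank, d_out, rng, scale=0.1)
fused = fuse_removed_weight(w_neighbor, w_removed, a, b)
```

Only `a` and `b` would need to be learned in such a setup, which keeps the number of trainable parameters at `2 * d_out * rank` per fused layer rather than `d_out * d_in`.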