https://andrewilyas.com
My talk: simons.berkeley.edu/talks/andrew...
The paper: arxiv.org/abs/2503.13751
Joint work with @logn.bsky.social, Benjamin Chen, Axel Feldmann, Billy Moses, and @aleksmadry.bsky.social
My talk: simons.berkeley.edu/talks/andrew...
The paper: arxiv.org/abs/2503.13751
Joint work with @logn.bsky.social, Benjamin Chen, Axel Feldmann, Billy Moses, and @aleksmadry.bsky.social
tl;dr: we show how to compute gradients *through* the training process & use them to optimize training. Immediate big gains on data selection, poisoning, attribution & more!
tl;dr: we show how to compute gradients *through* the training process & use them to optimize training. Immediate big gains on data selection, poisoning, attribution & more!