PhD student at Imperial College London.
ML, interpretability, privacy, and stuff
🏳️🌈
https://igorshilov.com/
arxiv: arxiv.org/abs/2411.05743
See you in Seattle!
And thanks to my amazing co-authors: Joseph Pollock, Euodia Dodd and @yvesalexandre.bsky.social
arxiv: arxiv.org/abs/2411.05743
See you in Seattle!
And thanks to my amazing co-authors: Joseph Pollock, Euodia Dodd and @yvesalexandre.bsky.social
✅ Enables iterative privacy risk assessment during model development
✅ Zero additional computational cost
✅ Could inform targeted defenses (selective unlearning, data removal)
✅ Practical for large models where shadow model approaches fail
✅ Enables iterative privacy risk assessment during model development
✅ Zero additional computational cost
✅ Could inform targeted defenses (selective unlearning, data removal)
✅ Practical for large models where shadow model approaches fail
Easy-to-fit outliers: Loss drops late but reaches near zero → most vulnerable
Hard-to-fit outliers: Loss drops slowly, stays relatively high → somewhat vulnerable
Average samples: Loss drops quickly and stays low → least vulnerable
Easy-to-fit outliers: Loss drops late but reaches near zero → most vulnerable
Hard-to-fit outliers: Loss drops slowly, stays relatively high → somewhat vulnerable
Average samples: Loss drops quickly and stays low → least vulnerable
Solution: Loss pattern throughout training tells you a lot about individual's vulnerability.
⬇️
Solution: Loss pattern throughout training tells you a lot about individual's vulnerability.
⬇️
imperial.ac.uk/events/18318...
imperial.ac.uk/events/18318...
If you're interested, please sign up here:
docs.google.com/forms/d/e/1F...
If you're interested, please sign up here:
docs.google.com/forms/d/e/1F...
- Graham Cormode (University of Warwick/Meta AI)
- Lukas Wutschitz (M365 Research, Microsoft)
- Jamie Hayes (Google DeepMind)
- Ilia Shumailov (Google DeepMind)
- Graham Cormode (University of Warwick/Meta AI)
- Lukas Wutschitz (M365 Research, Microsoft)
- Jamie Hayes (Google DeepMind)
- Ilia Shumailov (Google DeepMind)
We will be hosting research talks from our amazing invited speakers, followed by a happy hour.
We will be hosting research talks from our amazing invited speakers, followed by a happy hour.
Почему ты вообще туда писал
Почему ты вообще туда писал