PhD student at Imperial College London.
ML, interpretability, privacy, and stuff
🏳️🌈
https://igorshilov.com/
More conferences with mountain views please!
Ping me if you want to chat about privacy and security of LLMs!
More conferences with mountain views please!
Ping me if you want to chat about privacy and security of LLMs!
Easy-to-fit outliers: Loss drops late but reaches near zero → most vulnerable
Hard-to-fit outliers: Loss drops slowly, stays relatively high → somewhat vulnerable
Average samples: Loss drops quickly and stays low → least vulnerable
Easy-to-fit outliers: Loss drops late but reaches near zero → most vulnerable
Hard-to-fit outliers: Loss drops slowly, stays relatively high → somewhat vulnerable
Average samples: Loss drops quickly and stays low → least vulnerable
- Graham Cormode (University of Warwick/Meta AI)
- Lukas Wutschitz (M365 Research, Microsoft)
- Jamie Hayes (Google DeepMind)
- Ilia Shumailov (Google DeepMind)
- Graham Cormode (University of Warwick/Meta AI)
- Lukas Wutschitz (M365 Research, Microsoft)
- Jamie Hayes (Google DeepMind)
- Ilia Shumailov (Google DeepMind)