Andreas Madsen
@andreasmadsen.bsky.social
Ph.D. in NLP Interpretability from Mila. Previously: independent researcher, freelancer in ML, and Node.js core developer.
Excited to finally announce that I have joined @guidelabs.bsky.social. We are building LLMs from scratch designed to be interpretable. Many have asked what I'm doing after my Ph.D., so it's great to finally get it out. We have a lot of open positions, from engineer to scientist to intern.
February 7, 2025 at 5:01 PM
I’m thrilled to share that I’ve finished my Ph.D. at Mila and Polytechnique Montreal. For the last 4.5 years, I have worked on creating new faithfulness-centric paradigms for NLP Interpretability. Read my vision for the future of interpretability in our new position paper: arxiv.org/abs/2405.05386
Interpretability Needs a New Paradigm
Interpretability is the study of explaining models in understandable terms to humans. At present, interpretability is divided into two paradigms: the intrinsic paradigm, which believes that only model...
arxiv.org
November 28, 2024 at 1:39 PM