Intersecting mechanistic interpretability and health AI 😎
We trained and interpreted sparse autoencoders on MAIRA-2, our radiology MLLM. We found a range of human-interpretable radiology reporting concepts, but also many uninterpretable SAE features.
Intersecting mechanistic interpretability and health AI 😎
We trained and interpreted sparse autoencoders on MAIRA-2, our radiology MLLM. We found a range of human-interpretable radiology reporting concepts, but also many uninterpretable SAE features.
@dccastr0.bsky.social #AI #MedSky #MLSky
@dccastr0.bsky.social #AI #MedSky #MLSky
@dccastr0.bsky.social #AI #MedSky #MLSky
@dccastr0.bsky.social #AI #MedSky #MLSky
📰 Also check out our blog post for why we're so excited about it: www.microsoft.com/en-us/resear...
💾 The dataset can be downloaded here: bimcv.cipf.es/bimcv-projec...
📰 Also check out our blog post for why we're so excited about it: www.microsoft.com/en-us/resear...
💾 The dataset can be downloaded here: bimcv.cipf.es/bimcv-projec...